Journal of Nutrition LabDiet, Your World of Nutritional Answers

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Purchase Article
Right arrow View Shopping Cart
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shi, H.
Right arrow Articles by Kristal, B. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shi, H.
Right arrow Articles by Kristal, B. S.

© 2002 The American Society for Nutritional Sciences J. Nutr. 132:1031-1038, 2002


Nutritional Models

Characterization of Diet-Dependent Metabolic Serotypes: Proof of Principle in Female and Male Rats1 ,2

Honglian Shi*, Karen E. Vigneau-Callahan{dagger}, Alexander I. Shestopalov*, Paul E. Milbury**, Wayne R. Matson{dagger} and Bruce S. Kristal*,{ddagger}3

* Dementia Research Service, Burke Medical Research Institute, White Plains, NY 10605; {dagger} ESA, Incorporated, Chelmsford, MA 01824; ** Antioxidants Research Laboratory, Jean Mayer U.S. Department of Agriculture Human Nutrition Research Center on Aging at Tufts University, Boston, MA 02111; and {ddagger} Departments of Biochemistry and Neuroscience, Cornell University Medical College, New York, NY 10021

3To whom correspondence should be addressed. E-mail: Bkristal{at}burke.org.

Back


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
Our research seeks to identify a serum profile, or serotype, that reflects substantial changes in food intake. Earlier studies demonstrated that a number of low-molecular-weight, redox-active compounds of metabolome were sufficiently stable analytically and biologically to identify biomarkers of dietary restriction (DR, restriction of total food intake) in rats. A second initial requirement is to demonstrate feasibility, i.e., that concentration changes in selected serum metabolites can contain sufficient information to classify rats by diet. The current study distinguished 101 (female) and 112 (male) chromatographically identifiable compounds that differ between ad libitum (AL) consumption and DR 6-mo-old rats. In a cohort of female rats, both hierarchical cluster analysis (HCA) and principal component analyses (PCA) could distinguish dietary groups with 100% efficiency (101 metabolites). Repeating the classification studies using the 63 biologically and analytically most robust metabolites decreased noise without affecting categorical separation. In a cohort of male rats, PCA, but not HCA, distinguished the original dietary groups with 100% accuracy (112 metabolites). A subset of 52 of the 112 metabolites enabled both HCA and PCA to group the male rats with 100% accuracy. These data demonstrate that quantitative analysis of selected serum metabolites can yield sufficient information by which to classify the dietary intake of a group of rats, identify such markers chromatographically and set the stage for validation of these metabolic serotypes in independent datasets.


KEY WORDS: • dietary restriction • serum metabolite • HPLC • multivariate analysis • biomarker • rats


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
Dietary restriction (DR),4 undernutrition without malnutrition, has been explored extensively for >70 y, and substantial evidence has accumulated to prove its beneficial effects on health in several animal models, particularly rodents. DR is usually conducted experimentally by feeding subjects of interest 50–70% of the amount consumed ad libitum (AL) by their littermates, who have unlimited access to food. DR extends longevity in essentially all animals in which it has been tested, including multiple mammalian species (rat, mouse, guinea pig), and is the most potent and reproducible known means of extending life span and reducing morbidity in higher animals (1Citation –11Citation ). DR extends both mean and maximum life span, reduces age-related morbidity and delays or prevents most age-associated physiologic dysfunction. DR also alters many basic physiologic processes, including metabolism, hormonal balance, and the generation of, detoxification of and resistance to reactive oxygen species (12Citation ). The protective effect of low energy diets in animals appears mirrored in the multiple human epidemiology studies providing evidence for a detrimental effect of obesity (13Citation ).

The multiple, robust effects of DR on physiologic and pathologic indices suggest the feasibility of identifying a metabolic serotype characteristic of this nutritional intervention. Based on this hypothesis, we are attempting to identify serum markers for AL and DR feeding in rats (14Citation ). We are pursuing serotypes of DR (1Citation –10Citation ,15Citation –17Citation ) composed of low-molecular-weight molecules that are chemically redox active. Initial studies showed that HPLC separations coupled with coulometric array detectors (18Citation –27Citation ) could detect ~1200 compounds in rat sera, of which ~300 were analytically reliable and ~240 were biologically reliable in young female rats (14Citation ). The previous work demonstrated biological and analytical feasibility of the proposed approach by identifying components of the metabolome that could be used to monitor changes in dietary intake. That previous work represents the first stage in identifying of metabolic serotypes, i.e., method validation.

This report addresses the second stage of identifying metabolic serotypes, i.e., determining whether the metabolome retains sufficient information to allow the initial diet groups to be determined based solely on serotypes. In this report, we are concerned only with proof of the principle that the subset of serum metabolome retains sufficient information to determine the dietary group of origin. The companion paper (28Citation ) addresses the next question, i.e., whether such subsets can determine diet groups in independent cohorts. This report demonstrates proof of principle in both female and male rats.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
Animal husbandry.

Details of the rats and husbandry conditions used in this study were reported previously (14Citation ). Briefly, male and female Fischer 344 x Brown Norway F1 rats were obtained monthly from the National Institute on Aging colony at Harlan (Indianapolis, IN). The basic animal husbandry for the animals used followed NIA/NIH guidelines as implemented by Harlan. Rats were fed NIH-31 (AL rats only) or vitamin/mineral–fortified NIH-31 (DR rats only) [see (14Citation ) for detailed diet compositions]. All rats were individually housed and DR feeding regimens (40% less food than eaten by AL rats) were implemented at 6 wk of age. Rats were imported from Harlan at 5 mo of age and killed ~1 mo later. As noted in the previous report (14Citation ), we found that rats in our colony that consumed food AL ate slightly less food than at Harlan, and restriction was therefore slightly modified. Specifically, once adjusted, food restriction was imposed at a level of ~35% less than ad libitum consumption (14Citation ). Sera were collected after killing by decapitation to avoid known differential effects of anesthesia on variables of interest. Collected blood was allowed to clot on ice for 30 min before centrifugation (1000 x g, 10 min). Female cohorts 1 and 2 had 8 AL and 8 DR rats; male cohort 1 had 5 AL and 8 DR rats; male cohort 2 had 7 AL and 8 DR rats. All animal experiments were performed under institutionally approved protocols and complied with NIH guidelines (29Citation ).

HPLC methodology.

HPLC separations and coulometric array detection was conducted essentially as described previously using an ESA CoulArray system (ESA, inc., Chelmsford, MA) (14Citation ,18Citation ,25Citation ,26Citation ). It was noted during validation studies that setting HPLC quantitation algorithms to quantitate on the basis of all matched channels (as opposed to only using those that matched on ratio) yielded a set of analytically valid peaks (metabolites) that partially, but not entirely, overlapped with the set of 297 metabolites previously identified (14Citation ). To maximize the probability of identifying metabolites of interest (i.e., those that help distinguish AL and DR rats), we have validated an additional set of metabolites using the procedures previously defined (14Citation ). This set of 325 metabolites was used in the initial studies of female rats presented here. Merger of the two datasets (i.e., the 297 and 325 datasets) will be conducted when the male and female datasets are combined (unpublished data). A detailed discussion of the analytical issues involved in these distinct quantitation approaches is in press (30Citation ).

Statistical analysis.

Data were analyzed using the programs CEAS 504 (ESA, Chelmsford, MA), Statview 5.0.1 (SAS Institute, Cary, NC) and Pirouette 2.7/3.0 (Infometrix, Woodinville, WA). Data analysis is described in the text.


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
To establish a general metabolic serotype for DR, as well as specific serotypes based on sexes and ages, our overall, long-term study will utilize both male and female Fischer 344 x Brown Norway F1 rats of several different ages. In this report, we present data designed to demonstrate the feasibility of using sets of serum metabolites to characterize diet. Male and female rats (6 mo old) were used. First, metabolites of interest were initially selected using simple statistical comparison and metabolites that appeared unique to the cohort studied were removed. Second, exploratory analyses, principal component analysis (PCA) and hierarchical cluster analysis (HCA), were carried out to test whether sets of the metabolites of interest were sufficient to allow discrimination of the groups. Third, more detailed studies of the metabolites chosen were conducted, and the resulting information used to pare our working dataset.

Biological variability.

    Males. We have demonstrated that the HPLC-based approach used could consistently identify 297 peaks in a set of pooled samples (14Citation ). This suggests that we have a valid analytical approach to address the problem in determining serotypes. Biological variability was analyzed in a male cohort (male cohort 1). Visual inspection of the entire sera chromatograms generated from the 13 serum samples showed that the majority of the chromatograms were qualitatively well conserved, whereas some peaks of AL rats differed from those of DR rats. Coulometric array software (CEAS 504) was used to identify the 297 analytically valid metabolites in the 13 HPLC chromatograms from the male cohort. As listed in Table 1Citation , >87% of the 297 peaks were found in all 13 samples automatically, whereas 98% of the peaks (291 total) were found in at least 5 of 8 DR samples and 3 of 5 AL samples. This suggests that 291 metabolites exist that are potential targets for automated metabolic serotype analysis in male rats.


View this table:
[in this window]
[in a new window]
 
TABLE 1 Combined analytical reproducibility/biological variability parameters for automated detection across primary dataset in a cohort of 5 ad libitum consumption (AL) and 8 dietary restricted (DR) male rats1

 
    Females. Biological validation carried out in our previous study demonstrated that a pool of ~240 metabolites existed that were legitimate targets for automated metabolic serotype analysis in female rats (14Citation ). As described in the methods, this pool of 240 metabolites was identified from a pool of 325 metabolites chosen using slightly different analytical parameters from those used in males. A detailed discussion of the analytical issues involved in these distinct quantitation approaches is in press (30Citation ).

Selection of metabolites of interest by statistical comparison.

In theory, it is possible for us to measure all (240/291) metabolites in a (female/male) sample of interest, but there are several practical considerations that argue against this approach. One practical limitation is that, at present, we continue to rely on manual confirmation of computer-driven peak identification and quantitation to improve qualitative and quantitative accuracy. A second consideration is that removing noninformative data can help classification algorithms work more efficiently. For this reason, we continually refine the focus of ongoing work to those metabolites that most effectively distinguish AL and DR rats. We begin to pare the metabolites studied by using a strategy that reduces Type II statistical errors at the expense of making an increased number of Type I statistical errors. Specifically, we chose to avoid Type II errors (believing that the serum concentration of a given metabolite does not differ between AL and DR when in reality it does) because this results in a loss of information that may be helpful in distinguishing AL and DR rats. We are willing to accept an increased frequency of Type I errors (believing that the serum concentration of a given analyte does differ between AL and DR when in reality it does not) for three major reasons. First, even moderately differing peaks can impart information to pattern-recognition algorithms through their interactions with other variables. Second, the classification algorithms available today, including HCA, PCA and pattern recognition–driven classification are all relatively efficient at mathematically identifying the "signal" (i.e., informative variables) amid the noise (i.e., noninformative variables). Third, we expect to (and do) eliminate these errors later by additional studies in future cohorts [see companion paper (28Citation )].

The strategy was implemented by conducting unpaired t tests ({alpha} = P <= 0.2) on each of the 240/291 metabolites previously described for young female/male rats to identify metabolites of potential interest. This strategy increases Type I errors (relative to normal statistical conventions for biological research) in two ways. First, no considerations for multiple comparisons were used (e.g., ANOVA, Bonferroni correction). As a result of not adjusting for multiple comparisons, we would predict that ~48 variables were determined to differ statistically between groups based solely on statistical artifacts (e.g., sampling). Second, the chosen value of {alpha} = P <= 0.2 is much higher than that used conventionally ({alpha} = P < 0.05), resulting in a theoretical difference of 15% of the samples being studied. Because we cannot know the true distribution of the set of variables in question in the (statistical) population of AL and DR rats, however, it is impossible to predict accurately the quantity of Type I errors that this manipulation of {alpha} introduces.

Two-tail, unpaired t tests were conducted on each of the 240/291 metabolites. This t test analysis identified 112 and 94 metabolites for male and female samples, respectively, that differed between AL and DR samples with {alpha} value smaller than 0.2 after outlier data were deleted from the dataset by visual inspection (Table 2Citation ). Visual inspection of the dataset of female samples used for the t tests led us to include seven additional metabolites that failed the t test for clearly artifactual reasons (e.g., insufficient automated recognition of the peak in one of the groups). Overall, 101 metabolites for female and 112 for male were regarded as the primary dataset to identify dietary groups.


View this table:
[in this window]
[in a new window]
 
TABLE 2 Two tailed, unpaired t test on the levels of 240/291 analytical and biological valid metabolites as quantitated by automated analysis without manual correction12

 
The analyses carried out were primarily multivariate, but the P-values of the datasets are listed for three reasons: 1) to emphasize that there are metabolites whose levels differ between AL and DR rats on the basis of the more commonly used statistical approaches; 2) to understand intuitively the difference of metabolites in sera between AL and DR rats; and 3) to understand the components of datasets established via more commonly used statistical approaches. Among the 112 male metabolites that had P-values < 0.2, there were 57 metabolites with P < 0.05 (Table 2)Citation ; 62 of the 101 female metabolites had P-values < 0.05. The levels of these metabolites in AL and DR male rats were thus significantly different according to more commonly used statistical approaches.

HCA with the primary subsets of metabolites distinguishes AL and DR rats.

To determine whether the metabolites identified as potentially interesting retained sufficient information to identify their dietary group of origin, we examined these datasets using HCA. HCA is a method of data analysis that emphasizes the natural groupings of the dataset. In contrast to analytical methods that emphasize distinguishing differences between two groups, HCA uses algorithms that reduce complex datasets to establish these groups without preconceived divisions. In dendrograms, such as those shown in Figure 1Citation and described in more detail below, relative similarity within the total study population increases as one moves from right (0.0) to left (1.0, biochemical identity) on the horizontal axis. The smaller the distance is from identity (left side) to the point at which two samples (i.e., individual rats in Fig. 1Citation ) are linked by a vertical line, the greater the relatedness of the two samples (i.e., individual rats in Fig. 1Citation ). Alternatively stated, the closer the split between two samples is to the right of the figure, the greater the disparity between two samples or groups of samples.



View larger version (43K):
[in this window]
[in a new window]
 
Figure 1. Hierarchical cluster analysis (HCA) can distinguish ad libitum consumption (AL) and dietary restricted (DR) serotypes in female rats with 100% accuracy. Dendrogram of analysis of the sera from 16 6-mo-old AL and DR female Fischer 344 x Brown Norway F1 rats based on automated analysis of 80 serum metabolites identified as potential markers. Six independent analyses were conducted as described in the text. The sequence of appearance of the individual rats vertically is determined by the algorithm and their juxtaposition is indicative of their similarity. Relative similarity within the total study population increases as one moves from right (0.0) to left (1.0, biochemical identity) on the horizontal axis. The most distinct groups are linked at "0" similarity by HCA as implemented by Pirouette. Heavy horizontal line added to emphasize that AL and DR clusters are completely separated.

 
For initial feasibility analysis, we chose to work with 80 metabolites that were identified in either 15 or 16 of the chromatograms from female rats in the cohort by automated analysis.5 We then asked whether these metabolites encoded sufficient information to identify group of origin using HCA. For this initial analysis, metabolites not quantitated by the automatic quantitation scheme used were estimated by Pirouette using the mean-fill function. To determine the extent to which the results obtained using HCA were mathematically robust, we asked this question using two distinct preprocessing algorithms (Autoscale, Range scale) and three different grouping algorithms (single, complete, centroid).

In general, preprocessing is used to help (mathematically) reduce the possibility that choices of arbitrary measurement scales and magnitudes can become a major determinant of the effect of a specific variable on classification. Alternatively stated, and of more direct relevance for the studies conducted, preprocessing attempts to ensure that the relative magnitude of one variable to another is not as significant as are the relative concentration differences within a variable (and between groups). We chose two different preprocessing options. In Autoscale preprocessing, data for each metabolite is mean-centered and then variance scaled. Because of its wide applicability and its effectiveness at standardizing metabolites, we expect that the majority of our future studies will focus on Autoscale data. In Range Scale preprocessing, which is commonly used for graphing, the highest value in the dataset is assigned a value of 1 and the lowest value is assigned a value of 0. Remaining values are determined by proportion. This method is highly sensitive to outliers and has limited utility in subsequent cohorts because the overall range in a subsequent cohort may change; it is useful, however, for initial data exploration. More complete mathematical descriptions of the algorithms chosen are presented in the Pirouette manual.

Clusters can be built using either agglomerative or divisive techniques. Pirouette uses the agglomerative approach, in which samples begin as their own cluster, and the clusters are then progressively grouped on the basis of a predefined distance measure. We chose one representative of each basic approach available to build clusters. Single Link, or nearest neighbor analysis, assigns a sample to the cluster of the single sample which is nearest. Complete Link, or farthest neighbor analysis, assigns a sample to the cluster whose farthest neighbor is closest. Centroid Link, or centroidal linkage analysis, assigns a sample to the cluster whose center is nearest. More complete mathematical descriptions of the algorithms chosen are presented in the Pirouette manual.

As shown in Figure 1Citation , cluster analysis was 100% successful at distinguishing the diet group of all 16 rats in the female cohort, regardless of the mathematical parameters used.

For male rats, we initially worked with 103 of the 112 metabolites that were identified in either 12 or 13 chromatograms from the 13 rats in the cohort. The results from HCA analysis with the 103-metabolite dataset showed that 5 AL and 8 DR rats were grouped by their dietary origin with 92% accuracy when single, centroid and incremental were used for grouping algorithms, whereas they could be distinguished with only 62% accuracy when complete was used for grouping algorithm (Fig. 2Citation ). The results suggest that "noise" metabolites that were expected on the basis of statistical and analytical considerations in the dataset obtained by automated analysis existed in the dataset and affected the efficiency of HCA to group the samples. To increase the efficiency of the classification algorithm in grouping the rats with the dataset, "noise" metabolites were identified and removed from the dataset (see below).



View larger version (41K):
[in this window]
[in a new window]
 
Figure 2. Hierarchical cluster analysis (HCA) can distinguish ad libitum consumption (AL) and dietary restricted (DR) serotypes in male rats with 93% accuracy. Dendrograms derived from HCA with 103 serum metabolites identified as potential markers in the serums from 13 6-mo-old AL and DR male Fischer 344 x Brown Norway F1 rats based on automated analysis without manual verification. Autoscale was used as preprocessing algorithm. Four independent analyses were conducted as described in the text. Relative similarity within the total study population increases as one moves from right (0.0) to left (1.0, biochemical identity) on the horizontal axis. Heavy horizontal line added to emphasize that, other than complete, each of the grouping algorithms identifies dietary group of origin with 93% accuracy. Shaded samples are those samples that were incorrectly grouped.

 
PCA of the AL and DR serotypes.

To further evaluate the ability of the metabolites being studied to encode sufficient information to identify their group of origin, we examined this dataset using PCA. PCA, also called eigenvector analysis, is used to determine linear combinations of original metabolites that account for maximal variation. Thus, PCA can be used to reduce the dimensionality of the data by using only some of the eigenvectors. Lower number principal components possess greater ability to explain variation in the dataset, i.e., the ability of principal component 1 to explain variation is greater than that of principal component 2. For our purposes (i.e., classification), the subset of eigenvectors chosen can then be evaluated in terms of their ability to distinguish members of the different groups. In the current report, this is done on the data set from which the original variables were chosen, thereby examining feasibility. In the accompanying report (28Citation ), we test against subsequent, independent cohorts. More important, in the context of our long-range goals, PCA can be used to determine which of the multiple compounds that may differ between AL and DR rats are the most useful for classification purposes.

    PCA analysis with female rats. The first question that we addressed by PCA analysis was, "How many mathematical factors are required to define serotype?" Alternatively stated, "How few mathematical factors, which are themselves linear combinations of the data from the 80 variables (i.e., ‘principal components’ or ‘components’), can replace the majority of the data from the variables?" Figure 3Citation shows that, for Autoscaled data, ~50% of the total variability present can be captured by a single mathematical factor (principal component), and that ~5–6 principal components are sufficient to describe ~80% of the variability present. For Range-scaled data, the capture is even better, describing ~80% with a single mathematical factor. Modeling power analysis (Fig. 4Citation ) reveals that PCA analysis of automated data captures >50% of the variation of over one half the analytes using either Range or Autoscaled data (mean modeling power ± SD = 54 ± 16% [Autoscaled] and 52 ± 16% [Range-scaled]; median modeling power 56% [Autoscale] and 52% [Range-scale]). Overall, the data presented in Figures 3Citation and 4Citation show that we are able to capture a good deal of the overall information content of the data using PCA. One aim is to test whether PCA-based approaches offer the potential to distinguish dietary groups of origin. Figure 5Citation shows that AL and DR samples were completely distinguished by three principal components.



View larger version (11K):
[in this window]
[in a new window]
 
Figure 3. Principal component analysis (PCA) studies reveal that most variability present in the dataset is captured in few mathematical components. PCA of the sera from 16 6-mo-old ad libitum consumption (AL) and dietary restricted (DR) female Fischer 344 x Brown Norway F1 rats based on automated analysis of 80 serum metabolites identified as potential markers. Both Autoscale and Range Scale preprocessing are shown. Percentage refers to the percentage of variability captured by the cognate number of principal components. The thin line shows the variability captured by a single component; the thick line shows the cumulative variability.

 


View larger version (28K):
[in this window]
[in a new window]
 
Figure 4. Analysis of modeling power reveals that even basic principal component analysis (PCA) captures at least one half of the variability of approximately one half of the analytes. PCA of the sera from 16 6-mo-old ad libitum consumption (AL) and dietary restricted (DR) female Fischer 344 x Brown Norway F1 rats based on automated analysis of 80 serum metabolites identified as potential markers. Modeling power, or the ability of the components to accurately describe each metabolite in the dataset, is described as [1 - (metabolite residual variance/total metabolite variance)]. Therefore, as the components accurately capture the variation present in a given metabolite, the second term approaches 0, and modeling power approaches 1. Thus, metabolite variation across the different rats in a cohort is better captured as modeling power approaches 1, and more weakly captured as numbers approach 0. The first 8 components were used in determining modeling power. Thick lines show Autoscale data; thin line show Range-scaled data. Data are plotted both by analyte number to allow comparison of Autoscale and Range scales and by increasing modeling power to allow comparison of overall modeling efficiency.

 


View larger version (12K):
[in this window]
[in a new window]
 
Figure 5. Principal component analysis (PCA) can distinguish ad libitum consumption (AL) and dietary restricted (DR) serotypes in female rats. PCA of the sera from 16 6-mo-old AL and DR female Fischer 344 x Brown Norway F1 rats based on automated analysis of 80 serum metabolites identified as potential markers. The graphic shown is a two-dimensional projection of three principal component axes. The three axes are marked Factor 1, Factor 2 and Factor 3, and refer to Principal Components 1, 2 and 3, respectively. Rotations of the axes were carried out to highlight group separation and should not be used to quantitate degree of separation. Each mark (solid squares, DR; open circles, AL) represents an individual rat. Double dotted line added to emphasize that the regions occupied by AL and DR clusters are completely separated.

 
    PCA analysis with male rats. PCA of data from the 103 metabolite male data set obtained by automated analysis showed no outliers (Fig. 6ACitation ). Figure 6Citation B shows the individual samples present in the dataset as discriminated by the three primary principal components. Note that the 5 AL and 8 DR rats were distinguished by PCA analysis using these three components. These results indicate that PCA-based approaches can be used to identify the group of origin. The first two principal components captured 50% of the total variability present, and 6 components captured 80% of the variability present (Fig. 6Citation C). Modeling power analysis (Fig. 6Citation D) revealed that six components were sufficient to capture ~40% (mean 42 ± 25%; median 45%) of the variation of the 103 analytes.



View larger version (19K):
[in this window]
[in a new window]
 
Figure 6. Principal component analysis (PCA) can distinguish ad libitum consumption (AL) and dietary restricted (DR) serotypes in male rats. PCA with 103 serum metabolites identified as potential markers in the serums from 13 6-mo-old AL and DR male Fischer 344 x Brown Norway F1 rats based on automated analysis. Autoscale was used as preprocessing algorithm. (A) Outlier analysis. Dot, AL samples; triangle, DR samples. Outliers would be above the thick horizontal line or to the right of the thick vertical line. (B) Scores analysis. Factor 1, Factor 2 and Factor 3 refer to Principal Components 1, 2 and 3, respectively. Dot, AL samples; triangle, DR samples. Dotted line added to emphasize the ability of using principal components to distinguish AL and DR serums. (C) Component analysis: % refers to the percentage of variability capture by the principal PCA representation of the data. Single and cumulative percentages are shown. (D) Modeling power, i.e, the ability of the components to accurately describe each metabolite in the dataset, is described as [1- (metabolite residual variance/total metabolite variance)]. Data are plotted by increasing modeling power (See Fig. 4Citation for additional information).

 
In comparing the data set from the male cohort with that from the female cohort, there were more biologically valid metabolites in male rats (291) than in female rats (240) and more metabolites with P < 0.2 in male rats (112) than in female rats (101). Despite this, the dataset established by automated analysis for male rats was weaker for grouping rats with PCA and HCA than that used for female rats. Specifically, HCA was 100% successful at distinguishing the diet group of origin with a 101-metabolite dataset in 16 female rats, regardless of the preprocessing algorithms (autoscale, range scale) or grouping algorithms (single, complete, centroid) used, whereas HCA was unable to group male rats with single, complete, and centroid grouping algorithms with the same efficacy. With 101 metabolites in female rats, the first principal component captured 50% of the total variability presented, whereas 2 components were needed in the male dataset.

PCA and HCA analyses after reducing "noise" metabolites in the dataset by manual inspection.

The studies described above were carried out on metabolite sets that had been evaluated only by automated analysis. To reduce noise in the dataset, we returned to a more detailed study of the female 101 and male 112 peaks initially described above. This secondary analysis addressed two issues: 1) Are the peaks analytically valid? Specifically, are both peak identification and peak quantitation reliable? 2) Is the peak found consistently in other cohorts? This second criterion is important due to the long-range goals of our study to identify serums markers for DR.

The analytical validity of each peak was examined individually. Of the female 101 peaks marked for future study, 18 did not appear sufficiently analytically reliable to warrant continued evaluation. These peaks included, for example, shoulders on other peaks that appeared subject to artifacts in analysis, small peaks in a series of equivalent peaks in which peak identification could not be confirmed, peaks with questionable chromatographic reproducibility and peaks that were misidentified by automated peak identification and analysis software. Each of the 83 peaks that remained in the analysis was then examined qualitatively by inspection in a second cohort. To do this analysis, chromatograms of the 8 female AL rats in the initial cohort were compared with chromatograms of the 8 AL female rats in the next cohort that entered our animal facility. Despite being brought into the colony only 1 mo apart, we found that 20 peaks differed substantially (>=5-fold) in the two female cohorts. Many of these 20 peaks were, in fact, apparently absent in the sera from the second cohort. These 20 peaks were removed from the analysis. Note that this analysis was meant to screen out those analytes that displayed major cohort-cohort instability; other analytes also varied by smaller amounts but were retained in the dataset. Thus, 63 female metabolites were defined for further study. Similarly, 76 metabolites were selected from the 112 metabolites for male samples6 . The P-values for the 63 female and 76 male metabolites are shown in Table 3Citation . In these subsets, 47 of 63 (female) and 44 of 76 (male) metabolites were significantly different between AL and DR rats. HCA and PCA were then conducted on the basis of the new sets of metabolites.


View this table:
[in this window]
[in a new window]
 
TABLE 3 Two-tailed, unpaired t test on 63 (female)/76 (male) analytically and biologically valid metabolites, with HPLC quantitation of each peak confirmed by manual inspection12

 
    Females. To determine whether the dataset retained the properties of interest after the above winnowing, we repeated the multivariate analyses described above on the new dataset. First, quantitation of each of the 63 peaks that remained was individually confirmed by manual inspection. Then, HCA was used to show that the dataset retained sufficient discriminating power to distinguish AL and DR rats (data not shown). Specifically, the 63-peak dataset had an equal, or slightly greater, ability to discern AL and DR rats, suggesting that removing the 38 peaks did not decrease, and might have slightly increased the signal-to-noise ratio. PCA analysis also showed that the smaller data set distinguished all AL and DR female rats (data not shown). Modeling power analysis in PCA showed that the variability present in the smaller dataset was better captured than before (mean 61 ± 17%; median 61%).

    Males. HCA grouped the 13 male samples with grouping algorithm incremental at 92% accuracy, and with single, centroid and complete at 85% accuracy by using the new dataset of 76 metabolites. These results again suggest that the set of metabolites selected for male rats was not as robust as that for female rats in grouping dietary origin. To further improve the metabolite set in male rats, HCA was carried out with metabolite sets that included different numbers of metabolites (from 4 to 76, with an increment of one metabolite at a time) with grouping algorithm single, centroid, complete, and incremental. The metabolites that prevented 100% separation of the dietary groups in males were temporarily excluded from the test set. It was found that 11 metabolites interfered with HCA separations when the incremental grouping algorithm was used. Fifteen metabolites, including these 11, interfered with HCA-based separations using the complete grouping algorithm. Twenty-four metabolites, including the 15 noted above, interfered with HCA separations using the centroid grouping algorithm. The 52 remaining metabolites (76 original metabolites minus 24 interfering metabolites) also enabled HCA to distinguish dietary groups using the single grouping algorithm; 34 of the 52 metabolites were with P < 0.05 between the levels of AL and DR samples (Table 4)Citation .


View this table:
[in this window]
[in a new window]
 
TABLE 4 Two-tailed, unpaired t test on 52 analytically and biologically valid metabolites whose levels were confirmed by manual inspection1

 
With the 52-metabolite dataset, 5 AL and 8 DR male rats were grouped completely by their dietary origin regardless of the linkage method chosen (Fig. 7Citation ). PCA results obtained using the 52-metabolite dataset were also improved compared with the 103-metabolite dataset (compare Figs. 8Citation and 6Citation ). Analysis again showed that there were no outliers in the 13 samples analyzed (Fig. 8Citation A). Figure 8Citation B shows that 5 AL and 8 DR rats could be distinguished by the first three principal components. The first principal component captured 48% of the total variability present compared with 42% in the 112-metabolite dataset, and 5 components were able to capture 80% of the variability present, compared with 6 principal components in the 112-metabolite dataset (Fig. 8Citation C). Analysis of modeling power revealed improvement over the larger dataset (Fig. 8Citation D, mean 49 ± 19%; median 51%).



View larger version (39K):
[in this window]
[in a new window]
 
Figure 7. Hierarchical cluster analysis (HCA) with 52 metabolites distinguishes dietary group with 100% accuracy. Dendrograms of HCA analyses with 52 serum metabolites screened by different grouping algorithms as potential markers in the serums from 13 6-mo-old ad libitum consumption (AL) and dietary restricted (DR) male Fischer 344 x Brown Norway F1 rats. Four independent analyses were conducted. Relative similarity within the total study population increases as one moves from right (0.0) to left (1.0, biochemical identity) on the horizontal axis. Heavy horizontal line added to emphasize ability of the 52 metabolites.

 


View larger version (19K):
[in this window]
[in a new window]
 
Figure 8. Principal component analysis (PCA) can distinguish ad libitum consumption (AL) and dietary restricted (DR) serotypes with 100% accuracy. PCA based on 52 sera metabolites screened by different grouping algorithms as potential markers in the serums from 13 6-mo-old AL and DR male Fischer 344 x Brown Norway F1 rats. See legend to Figure 6Citation for other information.

 
These data of male and female samples analyses showed that we can clean the dataset of biologically or statistically "noisy" metabolites without losing any of the discriminating power of the metabolome.

The data presented in this report provide proof of principle, i.e., these analyses indicate that there exists a group of metabolites in rat sera that retain sufficient information to distinguish dietary groups of origin by classification algorithms. As an initial step in validation, the dataset identified will be tested in an independent cohort [see companion paper (28Citation )]).

Applications

Although it has been known for approximately 70 y that the reduction of food intake promotes longevity and reduces age-related morbidity, our understanding of this model and ability to apply this knowledge has remained frustratingly limited. For example, basic questions such as, "What is the mechanism of DR?" remain unanswered, although many theories exist. Identification of metabolic serotypes may help address this question by giving insight into those components of the metabolome that truly distinguish AL and DR. Another open question concerns the applicability of the DR paradigm to higher animals. Although nonhuman primate studies are in progress (e.g., 31Citation –33Citation ), it is difficult to determine how restricted these animals are compared with the classical rodent model. Once our serotype models are complete, we can address this question directly from the metabolic standpoint.

The serotypes being developed may also contribute to studies in the area of human epidemiology. For example, grouping individuals by body mass index clearly identifies the increasing risk associated with obesity, but is limited by the possibility that two people of equal height and weight may have fundamentally different metabolisms (13Citation ). The serotypes being developed may contribute to a better understanding of the roles of obesity/metabolism in disease by allowing us to categorize individuals biochemically in these studies. Similarly, questionnaire-based epidemiologic studies have other potential problems, including problems with recall and/or fluctuations in intake. Our long-term study aims, again, at allowing metabolic distinctions to be used to reduce the signal-to-noise ratio in such studies. We consider these only a possible list of the applications of these serotypes.

In summary, the major finding of this paper is that specific components of the serum metabolome can retain sufficient information to distinguish AL and DR rats. Note that, in this report, the cohort used to identify the markers of interest was also the cohort that was distinguished by the grouping analysis. Furthermore, because the selection scheme utilized is designed to make type I statistical errors, we know that this dataset includes multiple false positives. Because of this, we have not yet addressed the similarities and differences between the metabolites that enabled us to group male and female rats. In the next report, we begin to validate the datasets by showing that the serotypes developed in this and the previous report (14Citation ) can classify independent cohorts of male and female rats.


    ACKNOWLEDGMENTS
 
We thank Thomas Vogl and Walter Willett for their helpful discussions and contributions to the overall experimental design, and John Blass for his comments on the manuscript.


    FOOTNOTES
 
1 Presented in part in oral presentation form at Experimental Biology 2001, March 31-April 4, Orlando, FL [Kristal, B. S., Vigneau-Callahan, K.E., Shi, H., Matson, W. & Milbury, P.E. (2001) Diet-dependent metabolic serotypes. FASEB J. 15: A65 (abs.)]. Back

2 Supported by National Institutes of Health National Institute on Aging R01-AG15354 (B.S.K.), ESA, Incorporated and the Winifred Masterson Burke Relief Foundation. Back

4 Abbreviations used: AL, ad libitum; DR, dietary restricted; HCA, hierarchical cluster analysis; PCA, principal component analysis. Back

5 This decision was a practical one based on the requirement in Pirouette for all datum cells to have a value. The choice of limiting analysis to those compounds that had been identified in either 15 or 16 chromatograms enabled us to readily use Pirouettes "Fill" function to place a relatively neutral "assigned value" into a single blank cell without creating too much noise. This neutral value can then be replaced later with the true value by manual inspection and manual oversight-based computer quantitation. The choice of using 80 of the 101 initial variables reflected the number of variables in which we had either 15 or 16 values after automated analysis. Back

6 Although a 103-variable dataset was used for the above HCA and PCA analyses, we inspected all 112 variables to avoid losing useful variables. Back

Manuscript received 9 June 2001. Initial review completed 25 July 2001. Revision accepted 11 February 2002.


    LITERATURE CITED
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 

1. Kristal, B. S. & Yu, B. P. (1994) Aging and its modulation by dietary restriction. Yu, B. P. eds. Modulation of Aging Processes by Dietary Restriction 1994:1-36 CRC Press Boca Raton, FL. .

2. Weindruch, R. & Walford, R. (1988) The Retardation of Aging and Disease by Dietary Restriction 1988 Charles C. Thomas St. Louis, MO. .

3. McCay, C. M. (1935) Cellulose in the diets of rats and mice. J. Nutr. :435-447.

4. Maeda, H., Gleister, C. A., Masoro, E. J., Murata, I., McMahan, C. A. & Yu, B. P. (1985) Nutritional influences on aging of Fischer 344 rats: II Pathology. J. Gerontol. 40:671-688.[Abstract/Free Full Text]

5. Carlson, A. J. & Hoelzel, F. (1946) Apparent prolongation of the life span of rats by intermittent fasting. J. Nutr. 31:363-375.

6. Goodrick, C. L., Ingram, D. K., Reynolds, M. A., Freeman, J. R. & Cider, N. (1990) Effects of intermittent feeding upon body weight and lifespan in inbred mice: interaction of genotype and age. Mech. Age. Dev. :55.

7. Ross, M. H. & Bras, G. (1970) Food preference and length of life. Science (Washington, DC) 190:165-167.

8. Weindruch, R. H. & Walford, R. L. (2000) Dietary restriction in mice beginning at one year of age: effects on life-span and spontaneous cancer incidence. Science (Washington, DC) 215:1415-1418.

9. Nolen, G. A. (1972) Effect of various restricted dietary regimens on the growth, health, and longevity of albino rats. J.Nutr. 102:1477-1494.

10. Tannenbaum, A. (1945) The dependence of tumour formation on the composition of the calorie-restricted diet as well as on the degree of restriction. Cancer Res. 5:616-625.[Free Full Text]

11. Stucklikova, E., Juricova-Horakova, E. M. & Deyl, Z. (1975) New aspects of the dietary effect of life prolongation in rodents: what is the role of obesity in aging. Exp. Gerontol. 10:141-144.[Medline]

12. Yu, B. P. (1996) Aging and oxidative stress: modulation by dietary restriction. Free Radic. Biol. Med. 21:651-668.[Medline]

13. Willett, W. C., Dietz, W. H. & Colditz, G. A. (1999) Guidelines for healthy weight. N. Engl. J. Med. 341:427-434.[Free Full Text]

14. Vigneau-Callahan, K. E., Shestopalov, A. I., Milbury, P. E., Matson, W. R. & Kristal, B. S. (2001) Characterization of diet-dependent metabolic serotypes: analytical and biological variability issues in rats. J. Nutr. 131:924S-932S.[Abstract/Free Full Text]

15. Iwasaki, K., Gleister, C. A., Masoro, E. J., McMahan, C. A. & Yu, B. P. (1988) The influence of dietary protein source on longevity and age-related disease processes of Fischer rats. J. Gerontol. 43:B5-B12.[Abstract]

16. Iwasaki, K., Gleister, C. A., Masoro, E. J., McMahan, C. A., Seo, E.-J. & Yu, B. P. (1988) Influence of the restriction of individual dietary components on longevity and age-related disease of Fischer rats: the fat component and the mineral component. J. Gerontol. 43:B13-B21.[Abstract]

17. McCay, C. M., Crowell, M. F. & Maynard, L. A. (1935) The effect of retarded growth upon the length of lifespan and upon the ultimate body size. J. Nutr. 10:63-79.

18. Kristal, B. S., Vigneau-Callahan, K. E. & Matson, W. R. (1998) Simultaneous analysis of the majority of low-molecular-weight, redox-active compounds from mitochondria. Anal. Biochem. 263:18-25.[Medline]

19. Beal, M. F., Matson, W. R., Swartz, K. J., Gamache, P. H. & Bird, E. D. (1990) Kynurenine pathway measurements in Huntington’s disease striatum; evidence for reduced formation of kynurenic acid. J. Neurochem. 55:1327-1339.[Medline]

20. Matson, W. R., Gamache, P. H., Beal, M. F. & Bird, E. D. (1987) EC array sensor concepts and data. Life Sci. 41:905-908.[Medline]

21. Matson, W. R., Bouckoms, A., Svendson, C., Beal, M. F. & Bird, E. D. (1990) Generating and controlling multiparameter databases for biochemical correlates of disorders. Basic, Clinical and Therapeutic Aspects of Alzheimer’s and Parkinson’s Diseases II:513-516 Plenum New York, NY. .

22. LeWitt, P. A., Galloway, M. P., Matson, W. R., Milbury, P. M., McDermott, M. & Srivastava, D. K. (1992) Markers of dopamine metabolism in Parkinson’s disease: The Parkinson’s Study Group. Neurology 42:2111-2117.[Abstract/Free Full Text]

23. Ogawa, T., Matson, W. R., Beal, M. F., Myers, R. H., Bird, E. D., Milbury, P. E. & Saso, S. (1992) Kynurenine pathway abnormalities in Parkinson’s disease. Neurology 42:1702-1706.[Abstract/Free Full Text]

24. Beal, M. F., Matson, W. R., Storey, E., Milbury, P. E., Ryan, E. A., Ogawa, T. & Bird, E. D. (1992) Kynurenic acid concentrations are reduced in Huntington’s disease cerebral cortex. J. Neurol. Sci 108:80-87.[Medline]

25. Matson, W. R., Langials, P., Volicer, L., Gamache, P. H., Bird, E. D. & Mark, K. A. (1984) n-Electrode three dimensional liquid chromatography with electrochemical detection for determination of neurotransmitters. Clin. Chem. 30:1477-1488.[Abstract/Free Full Text]

26. Milbury, P. E. (1997) CEAS generation of large multiparameter databases for determining categorical process involvement of biomolecules. Coulometric Array Detectors for HPLC 1997:125-141 VSP International Science Publication Zeist, The Netherlands. .

27. Acworth, I. N. & Gamache, P. H. (1996) The coulometric electrode array for use in HPLC analysis, part 1: theory. Am. Lab. 5:33-37.

28. Shi, H., Vigneau-Callahan, K. E., Shestopalov, A. I., Milbury, P. E., Matson, W. R. & Kristal, B. S. (2002) Characterization of diet-dependent metabolic serotypes: III. Primary validation of male and female serotypes in independent cohorts of rats. J. Nutr. 132:1039-1041.[Abstract/Free Full Text]

29. National Research Council (1985) Guide to the Care and Use of Laboratory Animals. Publication no. 85–23 (rev.) 1985 National Institutes of Health Bethesda, MD. .

30. Shi, H., Vigneau-Callahan, K., Matson, W. R. & Kristal, B. S. (2002) Attention to relative response across sequential electrodes improves quantitation of coulometric array. Anal. Biochem. 302:239-245.[Medline]

31. Hansen, B. C., Bodkin, N. L. & Ortmeyer, H. K. (1999) Calorie restriction in nonhuman primates; mechanisms of reduced morbidity and mortality. Toxicol. Sci. 52:56-60.[Abstract/Free Full Text]

32. Roth, G. S., Ingram, D. K., Black, A. & Lane, M. A. (2000) Effects of reduced energy intake on the biology of aging: the primate model. Eur. J. Clin. Nutr. 54:S15-S20.

33. Ramsey, J. J., Colman, R. J., Binkley, N. C., Christensen, J. D., Gresl, T. A., Kemnitz, J. W. & Weindruch, R. (2000) Dietary restriction and aging in rhesus monkeys; the University of Wisconsin Study. Exp. Gerontol. 35:1131-1149.[Medline]




This article has been cited by other articles:


Home page
J. Nutr.Home page
J. A. Milner
Incorporating Basic Nutrition Science into Health Interventions for Cancer Prevention
J. Nutr., November 1, 2003; 133(11): 3820S - 3826.
[Abstract] [Full Text] [PDF]


Home page
Sci Aging Knowl EnvironHome page
B. S. Kristal and U. Paolucci
Caloric Restriction in trans
Sci. Aging Knowl. Environ., July 9, 2003; 2003(27): pe19 - 19.
[Abstract] [Full Text]


Home page
J. Nutr.Home page
H. Shi, K. E. Vigneau-Callahan, A. I. Shestopalov, P. E. Milbury, W. R. Matson, and B. S. Kristal
Characterization of Diet-Dependent Metabolic Serotypes: Primary Validation of Male and Female Serotypes in Independent Cohorts of Rats
J. Nutr., May 1, 2002; 132(5): 1039 - 1046.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Purchase Article
Right arrow View Shopping Cart
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shi, H.
Right arrow Articles by Kristal, B. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shi, H.
Right arrow Articles by Kristal, B. S.


Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Copyright © 2002 by American Society for Nutrition