agilent metabolomics discovery

16
Agilent Metabolomics Discovery Workflow Overview Introduction to Metabolomics 2 Required Items 3 What is Metabolomics? 5 The Metabolomics Workflow 7 Working with Mass Profiler Professional 8 For More Information 13

Upload: others

Post on 12-Sep-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery

Workflow Overview

Introduction to Metabolomics 2Required Items 3What is Metabolomics? 5The Metabolomics Workflow 7Working with Mass Profiler Professional 8For More Information 13

Page 2: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Introduction to Metabolomics

Introduction to Metabolomics

Metabolomics is an emerging field of 'omics' research that is concerned with the characterization and identification of metabolites, the end products of cellular metabolism. Metabolomics research leads to complex data sets involving hundreds to thousands of metabolites. Comprehensive analysis of metabolomics data requires an analytical approach and data analysis strategy that are often unique and require specialized data analysis software that enables cheminformatics analysis, bioinfor-matics, and statistics. Agilent products provide you with the necessary data analysis tools.

The Agilent Metabolomics Workflow guides you through the use of Agilent Mass-Hunter Qualitative Analysis and Agilent Mass Profiler Professional to set up your experiment design, collect replicate samples, assess system suitability, improve data quality, and perform comprehensive analysis of metabolomics data.

MassHunter Qualitative Analysis software allows you to automatically find and extract all spectral and chromatographic information from a sample, even when the components are not fully resolved. Powerful data navigation capabilities per-mit you to browse through compound-specific information in a single sample and compare chromatograms and spectra among multiple samples. The software also includes a customizable user interface and the capability to save, export or copy results into other applications, such as Mass Profiler Professional.

Mass Profiler Professional is a powerful data reduction, analysis, and visualiza-tion tool that allows you to identify statistically significant answers to simple questions presented to complex data sets; data sets such as the chemical signa-tures from biological matrices encountered in metabolomics.

You can use the Agilent Metabolomics Workflow as a road map for any analysis that requires the identification of statistically significant answers to simple questions presented to complex data sets. The metabolomics workflow may be used to per-form the following analyses:

• Compare two or more biological groups• Find and identify potential biomarkers• Look for biomarkers of toxicology• Understand biological pathways• Discover new metabolites• Develop data mining and data processing procedures that produce character-

istic markers for a set of samples• Construct statistical models for sample classification.

Small molecule tuning (LC/MS)

Metabolomics involves the analysis of small molecules, molecules nominally with a molecular weight from 50 to 6000 amu. Since the best results for metabolomics studies involve the identification of the exact mass of the molecular ion, LC/MS instrument tunes should be adjusted to (1) improve the sensitivity for intact molecu-lar ions and (2) improve the overall sensitivity for small, low-mass, compounds. A typical automated instrument tune optimizes the instrument sensitivity across the entire mass range and may result in lower sensitivity for small molecules combined with higher fragmentation than desired for metabolomics.

2

Page 3: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Required Items

Required Items The Metabolomics Workflow performs best when using the hardware and software described in the “required” sections below. The required hardware and software is used to perform the data analysis tasks shown in Figure 1.

Figure 1 Agilent hardware and software used in performing metabolomics.

A typical workflow is illustrated in Figure 2 starting with data acquisition through to analysis involving both un-targeted (discovery) LC/MS and targeted (confirmation) LC/MS/MS analyses. Molecular feature extraction (MFE) and Find by Formula (FbF) are two different algorithms used by MassHunter Qualitative Analysis for finding compounds. All results files generated by Agilent analytical platforms can be imported into Mass Profiler Professional for quality control, statistical analysis and visualization, and interpretation.

Figure 2 An Agilent metabolomics workflow from separation to pathway analysis typically involving either or both GC/MS and LC/MS analyses.

3

Page 4: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Required Items

Required hardware • PC running Windows• Minimum: XP SP3 (32-bit) or Windows 7 (32-bit or 64-bit) with 4 GB of RAM• Recommended: Windows 7 (64-bit) with 8 GB or more of RAM

• At least 50 GB of free space on the C Partition of the hard drive• Data from an Agilent GC/MS, LC/MS, CE/MS and/or ICP-MS system or data that

may be imported from another instrument

Required software • Agilent Mass Profiler Professional Software B.02.01 or later• Agilent MassHunter Qualitative Analysis software, Version B.03.01 or B.04.00 or

later• Agilent MassHunter Data Acquisition software, Version B.03.02 or B.04.00 or later

(this will include Agilent MassHunter DA Reprocessor)• Agilent MassHunter Quantitative Analysis software, Version B.03.02 or later

Optional software • Agilent ChemStation software• AMDIS• MassHunter ID Browser B.03.01 or later• PCDL METLIN Personal Metabolite Database• Agilent Fiehn Metabolomics GC/MS RTL Library

4

Page 5: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery What is Metabolomics?

What is Metabolomics? Metabolomics is the process of identifying and quantifying of all metabolites of an organism in a specified biological state. The metabolites of an organism represent a chemical “fingerprint” of the organism at a well defined state where the state is defined by a set of specific circumstances.

Metabolomics involves the study of two types of variables:

Independent variables: When one or more of the attributes of the state of the organism are known to you in advance of sampling those attributes are each referred to as an independent variable.

The known state of the organism or externalities to which the organism was sub-jected may be referred to as a parameter, parameter name, condition, or attribute value during the various steps of the metabolomic data analysis. The known state and externalities represent independent variables in the statistical analyses.

Dependent variables: The biological response to the change in the attributes can be manifest in a change in the metabolic profile. Each metabolite that undergoes a change in expressed concentration is referred to as a dependent variable.

The metabolites in a sample may be individually referred to as a compound, fea-ture, element, or entity during the various steps of the metabolomic data analysis. Metabolites represent dependent variables in the statistical analyses.

When hundreds to thousands of metabolites exist, chemometric data analyses can be employed to reveal accurate and statistically meaningful correlations between the independent variables and the metabolic profile. Meaningful information learned from the metabolite responses can subsequently be used for clinical diagnostics, for understanding the onset and progression of human diseases, and for treatment. This analysis is the metabolomics workflow and is illustrated in Figure 3.

Figure 3 Agilent Metabolomics Workflow Diagram

5

Page 6: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery What is Metabolomics?

The hypothesis The first and most important step in the metabolomic process is to formulate the question of correlation that is answered by the analysis - the hypothesis. This ques-tion is a statement that proposes a possible correlation, for example a cause and effect, between a set of independent variables and the resulting metabolic profile. The metabolomic process is used to prove or disprove the hypothesis.

Natural variability Before the metabolomics workflow is started it is important to understand how any one sample represents the population as a whole. Because of natural variability, the uncertainties associated with both the measurement and associated with the popu-lation, no assurance exists that any single sample from a population represents the mean of the population. Thus, increasing the sample size greatly improves the accu-racy of the sample set in describing the characteristics of the population.

Replicate sampling Sampling the entire population is not typically feasible due to constraints imposed by time, resources, and finances. On the other hand, fewer samples increase the probability of concluding a false positive or false negative correlation. At a minimum, it is recommended that your metabolomics analysis include ten (10) or more repli-cate samples for each attribute value for each condition in your metabolomics study.

System suitability System suitability involves collecting data in such a way to that the data provides you a means to evaluate and compensate for drift and instrumental variations to assure quality results. The techniques of (1) retention time alignment, (2) intensity normalization, (3) chromatographic deconvolution, and (4) baselining employed by the metabolomics workflow produce the highest quality results. However, the pro-cess cannot compensate for excessive drift in the acquisition parameters. The best results are achieved by maintaining your instrument and using good chromatogra-phy.

Improving data quality Improved data quality for metabolomics analysis comes from matching the sampling methodology to the experimental design so that replicate data is collected to span the attribute values for each condition. A larger number of samples appropriate to the population under study results in a better answer to the hypothesis. An under-standing of the methodologies used in sampling and using more than one method of sample collection have a positive impact to the significance of the results.

6

Page 7: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery The Metabolomics Workflow

The Metabolomics Workflow

The Agilent metabolomics workflow is organized into four main components and involves the use of Agilent MassHunter Qualitative Analysis, Agilent MassHunter DA Reprocessor (part of Agilent MassHunter Data Acquisition), and Agilent Mass Profiler Professional:

1. Find Compounds by Molecular Feature (MassHunter Qualitative Analysis and MassHunter DA Reprocessor). Molecular features are extracted from the data based on their mass spectral and chromatographic characteristics. The pro-cess is referred to as Molecular Feature Extraction (MFE). MassHunter Quali-tative Analysis is used to create the MFE method and evaluate the suitability of the method on single samples representative of the experiment. MassHunter DA Reprocessor is then used to automate MFE using the method created by MassHunter Qualitative Analysis on all of the samples in a single batch operation.

The molecular features for each sample data file are exported as a CEF file for importing and filtering by Mass Profiler Professional. These molecular features are referred to as untargeted molecular features because they are selected using data post-processing software algorithms rather than using prior knowl-edge regarding each feature.

2. Feature Selection (Mass Profiler Professional). The untargeted molecular fea-tures are imported into Mass Profiler Professional and filtered to retain only the most statistically significant features. This process is aided by the experi-ment setup and guided workflow provided by Mass Profiler Professional.

The most significant molecular features for each sample data file are exported as a CEF file for recursive finding by MassHunter Qualitative Analy-sis.

3. Find Compounds by Formula (MassHunter Qualitative Analysis). Importing the most statistically significant features back into MassHunter Qualitative Analy-sis as targeted features results is a significant improvement in finding of the features from the samples. This repeated feature analysis is referred to as recursion. Improved reliability in finding leads to a concomitant improvement in the accuracy of the analysis.

The recursively found, most significant molecular features for each sample data file are exported as a CEF file for importing, filtering, and analysis by Mass Profiler Professional.

4. Analysis (Mass Profiler Professional). Recursive finding of the most statisti-cally significant features are processed by Mass Profiler Professional into a final statistical analysis and interpretation. The results from the final interpre-tation may be used to prove or disprove the hypothesis and may be used to create models that may be applied to new data.

7

Page 8: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Working with Mass Profiler Professional

Working with Mass Profiler Professional

Mass Profiler Professional imports CEF files created by MassHunter Qualitative Analysis. Mass Profiler Professional exploits the high information content of chro-matography/mass spectrometry data and can easily import, analyze and visualize GC/MS, LC/MS, CE/MS and ICP-MS data from large sample sets and complex MS data sets.

Mass Profiler Professional provides you with a step-by-step set of guides for (1) importing data and creating your experiment setup - project setup - and (2) perform-ing an initial statistical and graphical analyses - guided workflow. The Guided Work-flow guides you through an initial set of filters and models for differential analysis. It is not recommended for you to skip the Guided Workflow. Regardless of your per-sonal expertise, the Guided Workflow provides you with a quality control to your analysis that improves your results. The Guided Workflow ends with the advanced workflow interface (Figure 5 on page 12) where you may proceed to customize the entire differential analysis.

Note: Help and detail information regarding the various fields and statistical treat-ments are available at anytime by pressing the F1 key on the keyboard or referring to the Mass Profiler Professional User Manual.

Project setup There are three main steps to performing the project setup: staring Mass Profiler Professional, setting up project information, and creating the experiment(s) that are associated with the sample data that are imported.

1. Start Mass Profiler Professional

Double-click the Mass Profiler Professional icon ( ) located on the desk-top, or Click Start > Programs > Agilent > MassProfilerPro > MassProfilerPro

2. Set up a project

A project is a container for a collection of experiments. A project can have multiple experiments on different sample types and organisms. Four steps are involved in the project setup:

Startup: Select creation of a new project.

Create New Project: Type descriptive information about the project.

Experiment Selection Dialog: Select either a new experiment or to open an existing experiment as part of the project.

New Experiment: Type and select information that guides the experiment cre-ation.

3. Perform initial analysis using Guided Import - Experiment Creation

An experiment contains samples, interpretations, and associated entity lists. Several experiments may be within a project. Up to ten steps are involved in the experiment creation. Importing CEF files created with MassHunter Qualita-tive Analysis involves steps 1, 2, 6, 7, 8, 9, and 10:

8

Page 9: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Working with Mass Profiler Professional

1. Select Data Source: Depending upon the type of experiment and supported file format, this displays the options of data source.

2. Select Data to Import: This allows you to select the samples.

3. Input Validation: Validates different types of samples. This step appears only if different types of files are selected and prompts you to select same type of files to proceed. This step applies to ChemStation and AMDIS data.

4. Signal Chooser: Allows you to select signal options specific to Mass-Hunter Quantitation experiments.

5. Sample Summary: This provides an option to select the samples. In Generic Experiments one file can have multiple samples.

6. Filtering: Filters the data features by abundance, mass range, number of ions per feature, and by charge state.

7. Alignment: This aligns the features across the samples based on retention time and mass to the input tolerances.

8. Sample Summary: Displays Mass versus RT plot, spreadsheet, and legend for the distribution of aligned and unaligned entities in the sample.

9. Normalization: Changes the sample feature intensity values (referred to as abundance units) from that based on the absolute value of the extracted ion chromatogram signal measured at the detector to a relative intensity based on the signal provided by either an (1) internal standard, (2) an exter-nal scalar, or (3) both an internal standard and an external scalar.

10. Baselining Options: Changes the signal intensities of the features found in each sample data set from a value that is based on the original instru-mental parameters to a new “abundance unit” that relates the feature strength with respect to the feature strength among all of the sample data. Baselining may be performed with or without normalization.

Guided Workflow Guided Workflow helps you through the basic data analysis of an experiment. Default parameters in conjunction with your experimental design guides you to an initial differential expression that identifies the most statistically significant features from among all of the features previously found using molecular feature extraction. All parameters, including the default parameters used during the Guided Workflow, can be edited at the conclusion of the Guided Workflow by using the Advanced Workflow options in the Workflow Browser.

The Guided Workflow wizard helps you create an initial differential expression by displaying the sequence of steps on the left hand side navigator with the current step highlighted. The workflow allows you to proceed through each step by clicking the Next button. The steps followed are predetermined and based on the experiment type, experimental grouping, and conditions. Some of the steps may be skipped for certain experiments.

Eight steps are involved in the Guided Workflow - Initial Quality Control. To skip later steps in the Guided Workflow, click Finish at any step. When you click Finish, the entity list is saved, and you may commence analysis using the Advanced Workflow.

9

Page 10: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Working with Mass Profiler Professional

Note: The Guided Workflow does not start if Advanced Workflow was selected as the Workflow type in the New Experiment dialog.

1. Summary Report: Displays the summary view of the created experiment based on the parameters provided during the guided import. A Profile Plot with the samples on the X-axis and the Log Normalized abundance values on the Y-axis is displayed. If the number of samples is more than 30, they are represented in a spreadsheet view instead of Profile Plot.

2. Experiment Grouping: Independent variables and the attribute values of the independent variables must be specified to define grouping of the samples. An independent variable is referred to as a parameter name. The attribute values within an independent variable are referred to as parameter values. Samples with same parameter values within a parameter name are treated as repli-cates.

3. Filter Flags: The compounds created during the experiment creation are now referred to as entities. The entities are filtered (removed) from further analysis based on their presence across samples and parameter values (now referred to as a condition). See “Definitions” on page 157 for definitions and relation-ships of the terms used by the metabolomics workflow.

4. Filter by Frequency: Entities are further filtered based on their frequency of presence in specified samples and conditions. This filter removes irreproduc-ible entities.

5. Quality Control on Samples: The samples are presented by grouping and the current Principal Component Analysis (PCA). PCA calculates all the possible principal components and visually represents them in a 3D scatter plot. The scores shown by the axes scales are used to check data quality. The scatter plot shows one point per sample colored-coded by the experiment grouping. Replicates within a group should cluster together and be separated from sam-ples in other groups

6. Significance Analysis: The entities are filtered based on their p-values calcu-lated from a statistical analysis. The statistical analysis performed depends on the samples and experiment grouping.

Steps 7 and 8 are typically skipped during the creation of an entity list that will be used for recursive feature finding in MassHunter Qualitative Analysis.

7. Fold Change: Compounds are further filtered based on their abundance ratios or differences between a treatment and a control that are greater than a spec-ified cut-off or threshold value.

8. ID Browser Identification: The final entity list may be exported in a CEF file and directly imported into IDBrowser for identification.

Advanced workflow The Advanced Workflow in Mass Profiler Professional provides the tools necessary for analyzing features from your mass spectrometry data depending upon the need and aim of the analysis, the experimental design and the focus of the study. This helps you to create different interpretations to carry out the analysis based on the different filtering, normalization, and standard statistical methods.

10

Page 11: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Working with Mass Profiler Professional

It is recommended that you choose and use the Guided Workflow to develop in the beginning of experiment creation by choosing Guided Workflow as the Workflow Type in the New Experiment drop-down list. When the Guided Workflow is com-pleted you are automatically entered in the Advanced Workflow mode and have access to all features available in Mass Profiler Professional. However, at any step in the Guided Workflow you can stop the guided workflow and immediately enter the Advanced Workflow by clicking on the Cancel button.

More information regarding the specific details of the Advanced Workflow may be found in the Mass Profiler Professional User Manual. The main functional areas of the Mass Profiler Professional screen are illustrated in Figure 5. A larger, landscape oriented image of the screen is in Figure 5 on page 12.

Figure 4 The main functional areas of Mass Profiler Professional

11

Page 12: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery Working with Mass Profiler Professional

Figure 5 The main functional areas of Mass Profiler Professional

12

Page 13: Agilent Metabolomics Discovery

Agilent Metabolomics Discovery For More Information

For More Information For details about these procedures, see the Agilent Metabolomics Discovery Work-flow Guide (p/n 5990-7067EN). Visit our Web site at www.agilent.com/chem.

The metabolomics discovery workflow is part of the collection of Agilent manuals, help, application notes, and training videos. The current collection of manuals and help are valuable to users who understand the metabolomics workflow and who may require familiarization with the Agilent software tools. The collection of training videos provide a step-by-step guide to using the software tools to reduce example CG/MS and LC/MS data but may require a significant time investment and ability to extrapolate the example processes. The Agilent Metabolomics Workflow provides a step-by-step overview of performing metabolomics data analysis using Agilent MassHunter Qualitative Analysis and Agilent Mass Profiler Professional.

The following selection of publications provides materials related to metabolomics and Agilent Mass Profiler Professional:

• Application: An LC/MS Metabolomics Discovery Workflow for Malaria-Infected Red Blood Cells Using Mass Profiler Professional Software and LC-Triple Quadrupole MRM Confirmation (Agilent publication 5990-6790EN, November 19, 2010)

• Brochure: Emerging Insights: Metabolomics at Agilent (Agilent publication 5990-6048EN, September 10, 2010)

• Brochure: Integrated Biology from Agilent: The Future is Emerging (Agilent publication 5990-6047EN, September 1, 2010)

• Primer: Proteomics: Biomarker Discovery and Validation (Agilent publication 5990-5357EN, February 11, 2010)

• Brochure: Agilent Mass Profiler Professional Software (Agilent publication 5990-4164EN, December 22, 2009)

• Primer: Metabolomics: Approaches Using Mass Spectrometry (Agilent publication 5990-4314EN, October 27, 2009)

• Application: Profiling Approach for Biomarker Discovery using an Agilent HPLC-Chip Coupled with an Accurate-Mass Q-TOF LC/MS (Agilent publication 5990-4404EN, October 20, 2009)

A complete list of references may be found in “References” in the Agilent Metabolo-mics Workflow.

13

Page 14: Agilent Metabolomics Discovery
Page 15: Agilent Metabolomics Discovery
Page 16: Agilent Metabolomics Discovery

© Agilent Technologies, Inc. 2011Printed in USA, August 2011

*5990-7068EN*

5990-7068EN

www.agilent.com