TY - JOUR
T1 - Experimental Peptide Identification Repository (EPIR): an integrated peptide-centric platform for validation and mining of tandem mass spectrometry data
AU - Kristensen, Dan Bach
AU - Brønd, Jan Christian
AU - Nielsen, Peter Aagaard
AU - Andersen, Jens Roswald
AU - Sørensen, Ole Tang
AU - Jørgensen, Vibeke
AU - Budin, Kenneth
AU - Matthiesen, Jesper
AU - Venø, Peter
AU - Jespersen, Hans Mikael
AU - Ahrens, Christian H
AU - Schandorff, Soeren
AU - Ruhoff, Peder Thusgaard
AU - Wisniewski, Jacek R
AU - Bennett, Keiryn L
AU - Podtelejnikov, Alexandre V
PY - 2004/10/1
Y1 - 2004/10/1
N2 - LC MS/MS has become an established technology in proteomic studies, and with the maturation of the technology the bottleneck has shifted from data generation to data validation and mining. To address this bottleneck we developed Experimental Peptide Identification Repository (EPIR), which is an integrated software platform for storage, validation, and mining of LC MS/MS-derived peptide evidence. EPIR is a cumulative data repository where precursor ions are linked to peptide assignments and protein associations returned by a search engine (e.g. Mascot, Sequest, or PepSea). Any number of datasets can be parsed into EPIR and subsequently validated and mined using a set of software modules that overlay the database. These include a peptide validation module, a protein grouping module, a generic module for extracting quantitative data, a comparative module, and additional modules for extracting statistical information. In the present study, the utility of EPIR and associated software tools is demonstrated on LC MS/MS data derived from a set of model proteins and complex protein mixtures derived from MCF-7 breast cancer cells. Emphasis is placed on the key strengths of EPIR, including the ability to validate and mine multiple combined datasets, and presentation of protein-level evidence in concise, nonredundant protein groups that are based on shared peptide evidence.
AB - LC MS/MS has become an established technology in proteomic studies, and with the maturation of the technology the bottleneck has shifted from data generation to data validation and mining. To address this bottleneck we developed Experimental Peptide Identification Repository (EPIR), which is an integrated software platform for storage, validation, and mining of LC MS/MS-derived peptide evidence. EPIR is a cumulative data repository where precursor ions are linked to peptide assignments and protein associations returned by a search engine (e.g. Mascot, Sequest, or PepSea). Any number of datasets can be parsed into EPIR and subsequently validated and mined using a set of software modules that overlay the database. These include a peptide validation module, a protein grouping module, a generic module for extracting quantitative data, a comparative module, and additional modules for extracting statistical information. In the present study, the utility of EPIR and associated software tools is demonstrated on LC MS/MS data derived from a set of model proteins and complex protein mixtures derived from MCF-7 breast cancer cells. Emphasis is placed on the key strengths of EPIR, including the ability to validate and mine multiple combined datasets, and presentation of protein-level evidence in concise, nonredundant protein groups that are based on shared peptide evidence.
KW - Breast Neoplasms
KW - Chromatography, Liquid
KW - Computer Graphics
KW - Database Management Systems
KW - Databases, Factual
KW - Female
KW - Humans
KW - Information Storage and Retrieval
KW - Mass Spectrometry
KW - Peptides
KW - Reproducibility of Results
KW - Research Design
KW - Software
KW - Tumor Cells, Cultured
U2 - 10.1074/mcp.T400004-MCP200
DO - 10.1074/mcp.T400004-MCP200
M3 - Journal article
C2 - 15284337
VL - 3
SP - 1023
EP - 1038
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
SN - 1535-9476
IS - 10
ER -