Sangam: A Confluence of Knowledge Streams

iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae

Show simple item record

dc.contributor Biological Systems Engineering
dc.creator Sundararaman, Niveda
dc.creator Ash, Christine
dc.creator Guo, Weihua
dc.creator Button, Rebecca
dc.creator Singh, Jugroop
dc.creator Feng, Xueyang
dc.date 2015-12-12T07:02:15Z
dc.date 2015-12-12T07:02:15Z
dc.date 2015-12-12
dc.date 2015-12-12T07:02:15Z
dc.date.accessioned 2023-03-01T18:52:23Z
dc.date.available 2023-03-01T18:52:23Z
dc.identifier BMC Research Notes. 2015 Dec 12;8(1):771
dc.identifier http://hdl.handle.net/10919/64314
dc.identifier https://doi.org/10.1186/s13104-015-1759-7
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/281614
dc.description Background Organisms are subject to various stress conditions, which affect both the organism’s gene expression and phenotype. It is critical to understand microbial responses to stress conditions and uncover the underlying molecular mechanisms. To this end, it is necessary to build a database that collects transcriptomics and phenotypic data of microbes growing under various stress factors for in-depth systems biology analysis. Despite of numerous databases that collect gene expression profiles, to our best knowledge, there are few, if any, databases that collect both transcriptomics and phenotype data simultaneously. In light of this, we have developed an open source, web-based database, namely integrated transcriptomics and phenotype (iTAP) database, that records and links the transcriptomics and phenotype data for two model microorganisms, Escherichia coli and Saccharomyces cerevisiae in response to exposure of various stress conditions. Results To collect the data, we chose relevant research papers from the PubMed database containing all the necessary information for data curation including experimental conditions, transcriptomics data, and phenotype data. The transcriptomics data, including the p value and fold change, were obtained through the comparison of test strains against control strains using Gene Expression Omnibus’s GEO2R analyzer. The phenotype data, including the cell growth rate and the productivity, volumetric rate, and mass-based yield of byproducts, were calculated independently from charts or graphs within the reference papers. Since the phenotype data was never reported in a standardized format, the curation of correlated transcriptomics–phenotype datasets became extremely tedious and time-consuming. Despite the challenges, till now, we successfully correlated 57 and 143 datasets of transcriptomics and phenotype for E. coli and S. cerevisiae, respectively, and applied a regression model within the iTAP database to accurately predict over 93 and 73 % of the growth rates of E. coli and S. cerevisiae, respectively, directly from the transcriptomics data. Conclusion This is the first time that transcriptomics and phenotype data are categorized and correlated in an open-source database. This allows biologists to access the database and utilize it to predict the phenotype of microorganisms from their transcriptomics data. The iTAP database is freely available at https://sites.google.com/a/vt.edu/biomolecular-engineering-lab/software .
dc.description Published version
dc.format application/pdf
dc.format application/pdf
dc.language en_US
dc.rights Creative Commons Attribution 4.0 International
dc.rights http://creativecommons.org/licenses/by/4.0/
dc.rights Sundararaman et al.
dc.title iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae
dc.title BMC Research Notes
dc.type Article - Refereed
dc.type Text


Files in this item

Files Size Format View
13104_2015_Article_1759.pdf 3.620Mb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse