The value of society's investment in science depends strongly on the ability of future scientists to build on previous results. As the pace of scientific productivity has accelerated through genomics and other Big Data technologies, the ability of scientists to master the literature is increasingly dependent on computational tools. One important aspect of making data more reusable is the association of data with annotations that can be used in computer-based data mining and analyses.
My group works on the development and use of biological ontologies, which are sets of standardized controlled vocabularies for annotation. Our current focus is on the Ontology for Microbial Phenotypes, which is being constructed to facilitate the reuse and analysis of data from the awesome power of microbial genetics. We also work with the Gene Ontology Consortium on the annotation of gene functions, and have developed systems for integrating annotation with education in the Community Assessment of Community Annotation with Ontologies (CACAO).
We have also worked on developing systems for building model organism databases for community annotation, including EcoliWiki, which reuses and modifies the open source software built for Wikipedia to provide more specialized scientific data resources.
- Knapp, G. S., & Hu, J. C. (2010). Specificity of the E. coli LysR-Type Transcriptional Regulators. PLoS ONE. 5(12), e15189-e15189.
- Rajagopala, S. V., Yamamoto, N., Zweifel, A. E., Nakamichi, T., Huang, H., Mendez-Rios, J. D., ... Uetz, P. (2010). The Escherichia coli K-12 ORFeome: a resource for comparative molecular microbiology. BMC GENOMICS. 11(1), 470-470.
- Gaudet, P., Chisholm, R., Berardini, T., Dimmer, E., Engel, S. R., Fey, P., ... Consortium, G. O. (2009). The Gene Ontology's Reference Genome Project: A Unified Framework for Functional Annotation across Species. PLoS Comput Biol. 5(7), e1000431-e1000431.
- Knapp, G. S., & Hu, J. C. (2009). The oligomerization of CynR in Escherichia coli. Protein Sci. 18(11), 2307-2315.
- Knapp, G. S., Tsai, J. W., & Hu, J. C. (2009). The oligomerization of OxyR in Escherichia coli. Protein Sci. 18(1), 101-107.
- (2017). Primer on the Gene Ontology. Methods in molecular biology (Clifton, N.J.). Methods in Molecular Biology. (pp. 25-37). Springer New York.
- Chibucos, M. C., Siegele, D. A., Hu, J. C., & Giglio, M. (2017). The Evidence and Conclusion Ontology (ECO): Supporting GO Annotations. Methods in molecular biology (Clifton, N.J.). Methods in Molecular Biology. (pp. 245-259). Springer New York.
- Siegele, D. A., Campbell, L., & Hu, J. C. (2000). Green fluorescent protein as a reporter of transcriptional activity in a prokaryotic system. Methods in Enzymology. (pp. 499-513). Academic Press.
- Chibucos, M., Zweifel, A., Siegele, D., Uetz, P., Giglio, M., & Hu, J. (2011). The ontology of microbial phenotypes (OMP): A precomposed ontology based on cross products from multiple external ontologies that is used for guiding microbial phenotype annotation. CEUR Workshop Proceedings. 833, 237-239.
- Champion, M. M., Campbell, C. S., Siegele, D. A., Russell, D. H., & Hu, J. C. (2002). Proteome analysis of Escherichia coli K-12 by two-dimensional native-state chromatography and MALDI-MS. 59-60.
- Bonde, Aniket Sanjiv (2018-12). Identifying Expert Reviews in the Crowd: Linking Curated and Noisy Domains. (Master's Thesis)
- Huo, Zepeng (2017-12). Link Prediction with Personalized Social Influence. (Master's Thesis)
- Zweifel, Adrienne Elizabeth (2010-05). Phenotypic Characterization of Self-Assembling Protein Fragments Using Negative Dominance. (Doctoral Dissertation)
- Venkatraman, Anand (2008-12). Validation of a novel expressed sequence tag (EST) clustering method and development of a phylogenetic annotation pipeline for livestock gene families. (Doctoral Dissertation)
- Diaz Vazquez, Arnaldo Joel (2008-05). Solid-supported phospholipid bilayers: separation matrix for proteomics applications. (Doctoral Dissertation)