A guide to the integrated application of on-line data mining tools for the inference of gene functions at the systems level

Stuart Meier, Chris Gehring*

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

25 Scopus citations

Abstract

Genes function in networks to achieve a common biological response. Thus, inferences into the biological role of individual genes can be gained by analyzing their association with other genes with more precisely defined functions. Here, we present a guide, using the well-characterized Arabidopsis thaliana pathogenesis-related protein 2 gene (PR-2) as an example, to document how the sequential use of web-based tools can be applied to integrate information from different databases and associate the function of an individual gene with a network of genes and additionally identify specific biological processes in which they collectively function. The analysis begins by performing a global expression correlation analysis to build a functionally associated gene network. The network is subsequently analyzed for Gene Ontology enrichment, stimuli and mutant-specific transcriptional responses and enriched putative promoter regulatory elements that may be responsible for their correlated relationships. The results for the PR-2 gene are entirely consistent with the published literature documenting the accuracy of this type of analysis. Furthermore, this type of analysis can also be performed on other organisms with the appropriate data available and will greatly assist in understanding individual gene functions in a systems context.

Original languageEnglish (US)
Pages (from-to)1375-1387
Number of pages13
JournalBiotechnology Journal
Volume3
Issue number11
DOIs
StatePublished - Nov 2008
Externally publishedYes

Keywords

  • Annotation
  • Systems biology
  • Transcription

ASJC Scopus subject areas

  • Applied Microbiology and Biotechnology
  • Molecular Medicine

Fingerprint

Dive into the research topics of 'A guide to the integrated application of on-line data mining tools for the inference of gene functions at the systems level'. Together they form a unique fingerprint.

Cite this