Research: Difference between revisions
| Line 67: | Line 67: | ||
| Link to our large-scale gene networks for yeast, worms, mouse, ''Arabidopsis'': http://www.functionalnet.org. An illustration of our ''Arabidopsis'' gene network just won Honorable Mention in the 2010 [http://www.marcottelab.org/paper-pdfs/848.full.pdf ''Science'' Visualization Challenge] | Link to our large-scale gene networks for yeast, worms, mouse, ''Arabidopsis'': http://www.functionalnet.org. An illustration of our ''Arabidopsis'' gene network just won Honorable Mention in the 2010 [http://www.marcottelab.org/paper-pdfs/848.full.pdf ''Science'' Visualization Challenge] & was featured by the [http://www.nytimes.com/slideshow/2011/02/17/science/20110217-visualize-6.html ''New York Times''] | ||
| Link to some of our public bioinformatics resources: http://bioinformatics.icmb.utexas.edu | Link to some of our public bioinformatics resources: http://bioinformatics.icmb.utexas.edu | ||
Revision as of 22:26, 24 February 2011
Our group studies the large-scale organization of proteins, essentially trying to reconstruct the 'wiring diagrams' of cells by learning how all of the proteins encoded by a genome are associated into functional pathways, systems, and networks. We are interested both in discovering the functions of the proteins as well as in learning the underlying organizational principles of the networks. The work is evenly split between computational and experimental approaches, with the latter tending to be high-throughput functional genomics and proteomics approaches for studying thousands of genes/proteins in parallel.
Bioinformatics of protein function and interactions
We've discovered a number of features of genomes that allow us to predict functions for proteins that have never been experimentally characterized. Using these techniques and information from over 30 fully sequenced genomes, we were able to calculate some of the first genome-wide predictions of protein function, finding very preliminary function for over half the 2,500 uncharacterized genes of yeast. Now, with hundreds of genomes in hand, we're extending these techniques, as well as asking fundamental questions about the evolution of protein interactions and the evolution of genomes.
Some of our recent papers on gene networks and the systematic discovery of gene function include:
Lee, Lehner et al., A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans, Nature Genetics, 40(2):181-8 (2008) PubMed Link  
Peña-Castillo et al., A critical assessment of Mus musculus gene function prediction using integrated genomic evidence, Genome Biology, 9 Suppl 1:S2 (2008) PubMed Link
Hart et al., A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality, BMC Bioinformatics, 8:236. (2007) PubMed Link
Lee et al., A probabilistic functional network of yeast genes, Science, 306(5701):1555-8. (2004) PubMed Link
Fraser, Marcotte, A probabilistic view of gene function, Nature Genetics, 36(6):559-64 (2004) PubMed Link
Link to our large-scale gene networks for yeast, worms, mouse, Arabidopsis: http://www.functionalnet.org. An illustration of our Arabidopsis gene network just won Honorable Mention in the 2010 Science Visualization Challenge & was featured by the New York Times
Link to some of our public bioinformatics resources: http://bioinformatics.icmb.utexas.edu
Rational identification of genes affecting traits and diseases
Using the gene networks and other computational tools, we've now gained some ability to rationally predict the consequences to an organism of mutating or interrupting a specific gene. This means that by using these tools, we can often select a small set of candidate genes to be implicated in a particular disease or trait. We've now experimentally validated >100 such candidate genes for diverse traits in a wide range of organisms, including yeast, worms, Arabidopsis, C. elegans, frogs, mice, and humans. For example, in yeast we've used network models to discover a large number of new ribosome biogenesis genes (collaborating with Arlen Johnson), as well as genes controlling such features as cell size. In animals, e.g. using our worm gene network models developed with collaborators Ben Lehner and Andy Fraser, we could successfully identify new genes controlling longevity, as well as genes capable of suppressing the loss of the Retinoblastoma tumor suppressor, thus 'curing' worms of model tumors. In Arabidopsis, with now ex-postdoc Insuk Lee and collaborator Sue Rhee, we could rationally identify new genes regulating root growth, drought resistance, and seedling pigmentation. In vertebrates, working with the Wallingford and Finnell labs, we've been able to use gene network models to help assign functions to a birth defect gene, as well as to identify entirely new birth defect genes, confirming their roles in vivo.
Some of our recent papers on the rational association of genes with traits and diseases:
McGary, Park et al., Systematic discovery of nonobvious human disease models through orthologous phenotypes, Proc Natl Acad Sci U S A, in press: (2010) PubMed Link
Lee et al., Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana, Nature Biotechnology, 28(2):149-156 (2010) PubMed Link
Li et al., Rational extension of the ribosome biogenesis pathway using network-guided genetics, PLoS Biology, 7(10):e1000213 (2009) PubMed Link
Gray et al., The planar cell polarity effector protein Fuzzy is essential for targeted membrane trafficking, ciliogenesis, and mouse embryonic development, Nature Cell Biology, 11(10):1225-32 (2009) PubMed Link
Lee, Lehner et al., A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans, Nature Genetics, 40(2):181-8 (2008) PubMed Link
White et al., Bud23 methylates G1575 of 18S rRNA and is required for efficient nuclear export of pre-40S subunits, Mol Cell Biol, 28(10):3151-61 (2008) PubMed Link
McGary et al., Broad network-based predictability of Saccharomyces cerevisiae gene loss-of-function phenotypes, Genome Biology, 8(12):R258. (2007) PubMed Link
Use our phenolog method to link genes to traits: http://www.phenologs.org
Read more about some of our computational approaches to developmental biology & the UT Developmental and Regenerative Biology Initiative
Proteomics: High-throughput protein expression and interaction profiling
From our work and others, it is apparent that proteins in the cell participate in extended protein interaction networks involving thousands of proteins. By defining these networks, we can not only discover the functions of specific proteins based on their connections, but also use these networks as tools to predict the outcome of perturbing the cell. As part of our research efforts in this area, we have been developing high-throughput methods to measure protein abundances in complex biological samples (e.g., by quantitative shotgun proteomics mass spectrometry) and protein localization with cells (e.g., by high-throughput automated fluorescence microcopy, such as of cell microarrays). These sorts of data help us build a catalog of protein, mRNA and metabolite expression from cells grown under many different conditions, forming a quantitative picture of these molecular events inside cells. We expect that data of these sorts will put us on the road to developing predictive, rather than merely descriptive, theories of biology.
Recent papers in this area include:
Narayanaswamy et al., Widespread reorganization of metabolic enzymes into reversible assemblies upon nutrient starvation, Proc Natl Acad Sci U S A, 106(25):10147-52 (2009) PubMed Link  
Vogel, Marcotte, Calculating absolute and relative protein abundance from mass spectrometry-based protein expression data, Nature Protocols, 3(9):1444-51. (2008) PubMed Link Protocol website
Ramani et al., A map of human protein interactions derived from co-expression of human mRNAs and their orthologs, Molecular Systems Biology, 4:180 (2008) PubMed Link
Lu, Vogel, Wong et al., Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation, Nature Biotechnology, 25(1):117-24 (2007) PubMed Link
Link to our MS/MS data repository: http://www.marcottelab.org/MSdata/
Link to the Open Proteomics Database: http://bioinformatics.icmb.utexas.edu/OPD/
Link to the APEX Protocol website: http://marcottelab.org/APEX_Protocol/
Link to the APEX software tool: http://pfgrc.jcvi.org/index.php/bioinformatics/apex.html
Link to the MSpresso website: http://www.marcottelab.org/MSpresso/
Recent research news
Read about our Texas Xenopus Genome Project, a collaboration with the Wallingford lab and the UT Genomic Sequencing and Analysis Facility, funded by the Texas Institute for Drug and Diagnostic Development

