QB3 Webinar: Anshul Kundaje, Stanford. "Deep Learning for Genomic Discovery"

The human genome contains the fundamental code that defines the identity and function of all the cell types and tissues in the human body. Genes are functional sequence units that encode for proteins. But they account for just about 2% of the 3 billion long human genome sequence. What does the rest of the genome encode? How is gene activity controlled in each cell type? Where do the regulatory control elements lie and what is their sequence composition? How do variants and mutations in the genome sequence affect cellular function and disease?

These are fundamental questions that remain largely unanswered. The regulatory code that controls gene activity is encoded in the DNA sequence of millions of cell type specific regulatory DNA elements in the form of functional DNA words and grammars. This regulatory code has remained largely elusive despite tremendous developments in experimental techniques to profile molecular properties of regulatory DNA.

Deep learning has revolutionized our understanding of natural language, speech and vision. We strongly believe it has the potential to revolutionize our understanding of the regulatory language of the genome. We have developed deep learning frameworks to learn how genomic sequence encodes millions of experimentally measured regulatory genomic events across 100s of cell types and tissues. We have developed novel methods to interpret our models and extract local and global predictive patterns revealing many insights into the syntax and grammar of the regulatory code.

Our models also serve as in-silico oracles to predict the effects of natural and disease-associated genetic variation i.e. how differences in DNA sequence across healthy and diseased individuals are likely to affect molecular mechanisms associated with common and rare diseases. These models enable optimized design of genome perturbation approaches to decipher functional properties of DNA and variants and serve as a powerful lens for genomic discovery.

Where & When

Zoom Webinar
Tuesday, April 5, 2022, 1:00 to 2:00 PM PT

About the Speaker

Anshul Kundaje is an Assistant Professor of Genetics and Computer Science at Stanford University. His primary research area is large-scale computational regulatory genomics. The Kundaje lab specializes in developing statistical and machine learning methods for large-scale integrative analysis of heterogeneous, high-throughput functional genomic and genetic data to decipher regulatory elements and long-range regulatory interactions, learn predictive regulatory network models across individuals, cell-types and species and improve detection and interpretation of natural and disease-associated genetic variation. Previously as a postdoc at Stanford and Research Scientist at MIT, Anshul was the lead computational analyst of the ENCODE Project and the Roadmap Epigenomics Project. Anshul is also a recipient of the 2016 NIH Director's New Innovator Award and the 2014 Alfred Sloan Fellowship.