Naive Predictions from Informed Experiments

Finding meaning in next generation sequencing data.

A Computational Framework for Predicting Regulatory Networks

Next-generation sequencing is easier, cheaper, and more widely available than ever before. But finding meaning in the data is a challenge.

I am developing a computational framework to find meaning in strategically designed sequencing experiments. Specifically, I have established a computational pipeline for predicting transcriptional networks involved in determining cell identity that uses next-generation sequencing of RNA transcripts (RNAseq) and accessible chromatin (ATACseq).

My current model is cortical development: The cortex consists of two broadly defined populations of excitatory projection neurons--"upper layer" neurons and "deep layer" neurons--that arise from the same pool of neural stem cells. My research seeks to discover the networks of genes and regulatory elements, including enhancers and repressors, that determine whether a neural stem cell will become "upper layer" or "deep layer."

Genetic variants associated with disorders of cognitive development, including Autism Spectrum Disorder (ASD), schizophrenia, and intellectual disability, are many. Too many to test individually, and most go undefined, because they fall in regions of the genome that do not code for protein and are therefore unannotated. Screening for variants that fall in or near regulatory elements that I predict to be functional will help narrow down lists of genetic variants to something more tenable in a laboratory and clinical setting.

Example paper:

Notwell, J.H., Heavner, W., Darbandi, S.F., Katzman, S., McKenna, W.L., Ortiz-Londono, C.F., Tastad, D., Eckler, M.J., Rubenstein, J.L.R., McConnell, S.K., Chen, B., Bejerano, G. (2016) Tbr1 regulates autism risk genes in the developing neocortex. Genome Research 26:1013-1022

Github repository