Yusuf Roohani

Yusuf Roohani


About:

I design new machine learning approaches for modeling biological systems. In particular, I'm interested in how artificial intelligence can inform better experiment design for biological discovery. My recent work has focused on computationally guiding the engineering of cells using genetic perturbations.

Currently, I lead a machine learning group at the Arc Institute, where we are working on building a virtual cell by combining large AI models with strategically scaled data generation. Previously, I completed my PhD at Stanford University under the guidance of Jure Leskovec and Stephen Quake. Prior to that, I worked for four years in early stage drug discovery at GSK.

firstname.lastname@arcinstitute.org

Curriculum Vitae:

[CV]

Arc Machine Learning Group Members

Past Members

Featured Publications:

For complete list: Google Scholar

Journal Publications

Cell [2026] Predicting cellular responses to perturbation across diverse contexts with STATE
Adduri, A., Gautam, D., Bevilacqua B., ... and Roohani, Y..
     [Media coverage] Century of Biology, GEN
     [GitHub 500+ Stars]

Nature [2026] [Code] Universal Cell Embeddings: A Foundation Model for Cell Biology
Rosen Y.*, Roohani Y.*, Agrawal A., Samotorcan L., Quake S., Leskovec J..
     [Media coverage] New York Times

Cell [2025] Virtual Cell Challenge: Toward a Turing test for the virtual cell
Roohani, Y., Hua, T., ... Goodarzi, H., Burke D..
     [Media coverage] GEN, Nature

Cell [2024] How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities
Bunne, C.*, Roohani, Y.*, Rosen, Y.*, ... Regev, A., Lundberg, E., Lekovec, J., Quake S..
     [Media coverage] The Atlantic

Nature Biotechnology [2023] [Code] GEARS: Predicting transcriptional outcomes of novel multi-gene perturbations
Roohani, Y., Huang, K., Leskovec J.
     [Best Poster] Intelligent Systems For Molecular Biology (ISMB 2022)
     [Innovation Award] Society for Lab Automation and Screening (SLAS 2023)
     [GitHub 350+ Stars]

Nature Methods [2023] Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
Rosen Y.*, Brbic M.*, Roohani, Y.*, Swanson K., Li Z., Leskovec, J..

(* = equal contribution)

Preprints

bioRxiv [2026] Stack: In-Context Learning of Single-Cell Biology
Dong M., Adduri A., Gautam D., ... and Roohani, Y..

bioRxiv [2025] scBaseCount: An AI agent-curated, uniformly processed, and continually expanding single cell data repository
Youngblut, N., Carpenter, C., ... , Goodarzi, H.* and Roohani, Y.*.

bioRxiv [2024] [Website] PreciCE: Precision engineering of cell fates via data-driven multi-gene control of transcriptional networks
Magnusson, J.*, Roohani, Y.*, Stauber, D., ... Sandberg, R., Lekovec, J., Lei, S. Qi.

Conference Papers

ICLR [2025] BioDiscoveryAgent: An AI agent for designing genetic perturbation experiments
Roohani Y.*, Vora J.*, Huang Q.*, Steinhart Z., Marson A., Liang P., Leskovec J..
     [Best Poster] ICLR 2024 MLGenX Workshop

NeurIPS [2023] Zero-shot causal learning
Nilforoshan H.*, Moor M.*, Roohani Y., Chen Y., Surina A., Yasunaga M., Oblak S., Leskovec J..
     [Spotlight Presentation]