B.S. with Honors and M.S. in Computer Science, 2017, Stanford University, specializing in Artificial Intelligence and Theory. My interests include making AI accessible across application fields and improving the interpretability and robustness of machine learning models, with a focus on how theory from statistics, optimization, algorithms, and other areas of mathematics can be applied to machine learning. As an undergraduate, I conducted research under Prof. Stefano Ermon in the Stanford Artificial Intelligence Laboratory, working on learning and inference in spatiotemporal machine learning models for computational sustainability, especially poverty reduction. I also helped start the Sustainability and AI (SUSTAIN) lab at Stanford. At Lecida Inc., I work on time-series prediction problems for sensor data from large industrial machines such as wind turbines, aiming to predict failures before they happen and reduce downtime.
Publicly available satellite imagery is used to accurately estimate local-level economic outcomes in five African countries.
We propose a novel machine learning approach to extract large-scale socioeconomic indicators from high-resolution satellite imagery. We utilize a fully convolutional neural network with transfer learning from a data-rich proxy task to learn features from satellite images of Africa to predict village-level poverty, approaching the performance of features from survey data.
Honors Thesis for the Bachelor of Science with Honors program at Stanford, detailing the transfer learning method for poverty mapping from satellite imagery. The thesis extends the method to multiple resolutions of satellite imagery, leverages Gaussian processes and deep kernel learning for semi-supervised learning to exploit abundant unlabeled data, and describes a deployment pipeline that produces spatial maps from general data sources and spatial models. This scales the approaches in the previous papers, allowing us to provide global-scale poverty maps as satellite images are updated.
Deep learning techniques have led to massive improvements in recent years, but large amounts of labeled data are typically required to learn these complex models. We present a semi-supervised approach for training deep models that combines the feature learning capabilities of neural networks with the probabilistic modeling of Gaussian processes and demonstrate that unlabeled data can significantly improve performance on real-world datasets.
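The combination described above (neural network feature learning feeding a Gaussian process) can be illustrated with a minimal sketch. This is not the paper's implementation: the tiny network below uses fixed random weights rather than weights learned jointly with the GP, and the data is a toy sine curve.

```python
import numpy as np

rng = np.random.default_rng(0)

def nn_features(x, W1, W2):
    # Tiny feedforward net used as a feature extractor (in deep kernel
    # learning these weights are trained jointly with the GP; here they
    # are fixed random weights purely for illustration).
    h = np.tanh(x @ W1)
    return np.tanh(h @ W2)

def rbf_kernel(A, B, lengthscale=1.0):
    # Squared-exponential kernel evaluated in the learned feature space.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

# Toy labeled data: 1-D inputs with noisy sine targets.
X = rng.uniform(-3, 3, size=(30, 1))
y = np.sin(X[:, 0]) + 0.1 * rng.standard_normal(30)

W1 = rng.standard_normal((1, 16)) / 4
W2 = rng.standard_normal((16, 4)) / 4

Phi = nn_features(X, W1, W2)
K = rbf_kernel(Phi, Phi) + 1e-2 * np.eye(30)   # observation-noise term

# GP posterior mean at test inputs: k_* K^{-1} y
X_test = np.linspace(-3, 3, 5)[:, None]
Phi_test = nn_features(X_test, W1, W2)
mean = rbf_kernel(Phi_test, Phi) @ np.linalg.solve(K, y)
print(mean.shape)  # (5,)
```

In the semi-supervised setting of the paper, abundant unlabeled inputs additionally inform the learned feature space; this sketch only shows the supervised GP machinery on top of network features.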
Presentation accompanying the AAAI-16 paper "Transfer Learning from Deep Features for Remote Sensing and Poverty Mapping".
By leveraging multiple resolutions of satellite imagery, we can cheaply take advantage of both the detail of high-resolution images and the broader context captured by low-resolution imagery, which covers more land area. Both are important for predicting poverty measures, and our multiple-resolution method improves significantly on the state of the art across 5 African countries.
Fall 2015 - Use of a supervised pre-training step leveraging expert trajectories to augment the learning of a reinforcement learning agent through self-play. This idea was shown to be very successful in the Nature 2016 paper describing AlphaGo.
Generalization of the convolutional neural network encoder for the Encoder-Decoder paradigm in Neural Machine Translation proposed by Cho et al. Cited by Facebook AI papers.
Deep learning applied to classifying images of plankton as part of the National Data Science Bowl competition. As part of the project, I developed three methods to sparsify the probability distributions output by neural networks, using sampling techniques and convex optimization formulations.
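As a flavor of what sparsifying a network's output distribution means, here is a minimal sketch of one simple approach (top-k truncation with renormalization). The project's actual three methods used sampling and convex optimization and are not reproduced here.

```python
import numpy as np

def sparsify_topk(p, k):
    # Keep the k largest probabilities, zero out the rest, and
    # renormalize so the result is still a valid distribution.
    # Illustrative only; not one of the project's three methods.
    p = np.asarray(p, dtype=float)
    smallest = np.argsort(p)[:-k]   # indices of all but the k largest
    q = p.copy()
    q[smallest] = 0.0
    return q / q.sum()

p = np.array([0.05, 0.4, 0.1, 0.3, 0.15])
q = sparsify_topk(p, k=2)
print(q)  # mass concentrated on the two largest classes
```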
Motion gesture recognition using Machine Learning on smart devices and smart watches.
We developed an algorithm for dynamically truncating an accelerometer data stream to the important parts of a gesture.
Our classifier achieved 96% accuracy with only one training example per gesture, better than the HMM and Dynamic Time Warping approaches seen in our literature review!* This has exciting applications for new, intuitive human input methods and automatic bio-tracking of life behaviors and patterns.
*uWave from Rice University, which used a DTW-based approach, achieved 98% accuracy with one training example, using simple gestures such as moving left, moving right, circles, etc. Our gestures, which involve drawing the letters O, W, X, Z, and V in the air, are arguably more complicated.
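The DTW baseline referenced above can be sketched as a one-nearest-neighbor classifier over stored templates. This is a generic illustration, not our system or uWave's: the sequences are hypothetical 1-D stand-ins for real 3-axis accelerometer traces.

```python
import numpy as np

def dtw_distance(a, b):
    # Classic O(len(a) * len(b)) dynamic-time-warping distance
    # between two 1-D sequences of possibly different lengths.
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]

def classify(query, templates):
    # 1-nearest-neighbor under DTW: one stored template per gesture.
    return min(templates, key=lambda g: dtw_distance(query, templates[g]))

# Hypothetical templates standing in for recorded accelerometer traces.
templates = {
    "O": np.sin(np.linspace(0, 2 * np.pi, 50)),
    "V": np.abs(np.linspace(-1, 1, 50)),
}
query = np.sin(np.linspace(0, 2 * np.pi, 60)) + 0.05  # noisy, different length
print(classify(query, templates))  # prints O
```

DTW's alignment step is what lets a single template match a query of a different length and speed, which is why it performs well with one training example per gesture.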
We built an extraction-based machine text summarizer that takes a piece of text and identifies its important sentences. This is a challenging NLP problem; a machine learning approach was used to learn "importance" from a corpus of ~2000 documents.
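To show what extraction-based summarization looks like, here is a minimal sketch that scores sentences by word frequency and keeps the top ones in original order. The actual project learned importance from labeled data; this frequency heuristic is just an illustration of the extractive selection step.

```python
import re
from collections import Counter

def summarize(text, n=2):
    # Minimal extractive summarizer: score each sentence by the summed
    # document-level frequency of its words, keep the top-n sentences
    # in their original order. (Illustration only, not a learned model.)
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    freq = Counter(re.findall(r"[a-z']+", text.lower()))
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: -sum(freq[w] for w in
                           re.findall(r"[a-z']+", sentences[i].lower())),
    )
    keep = sorted(ranked[:n])
    return " ".join(sentences[i] for i in keep)

doc = ("Satellite imagery helps estimate poverty. "
       "The weather was pleasant. "
       "Poverty estimates from satellite imagery guide aid programs.")
print(summarize(doc, n=2))  # drops the off-topic weather sentence
```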
My first Android app: an application for background music listening on YouTube, with over 10,000 downloads over its lifetime. YouTube has since taken the app down for providing a service that it now offers through YouTube Red.
Rachmaninoff Piano Concerto No.1, 3rd Mvt
Chopin Sonata No.2, Mvt. 1
Chopin Sonata No.2, Mvt. 2
Rachmaninoff Piano Concerto No.1, 1st Mvt