Yijia Shao

I am a second-year PhD student at Stanford NLP, advised by Diyi Yang. I had the pleasure of working with Monica S. Lam and Michael Bernstein during the rotation program. Previously, I was an undergraduate student at Yuanpei College, Peking University, where I got into ML and NLP research by working with Bing Liu. In summer 2022, I had a research internship at UCLA hosted by Nanyun Peng. Before that, I worked as a research intern at Microsoft Research Asia (blog spotlight in Chinese) and as an engineering intern on the TensorFlow Lite team at Google, Beijing.
My research interests lie in ML and NLP. Nowadays, I'm interested in positioning NLP models (e.g., LLMs) within larger systems. Here are some core problems I'm thinking about:
- How can AI models bridge humans and systems, or systems and systems?
- How can AI-empowered systems collaborate with users effectively?
- How can we continually improve these systems through interaction with humans and external systems?
Many kind people have helped me a lot on my journey. If you want to talk about research or seek advice that I might be able to offer, feel free to book a chat here.
News
- (Mar, 2025) We're exploring how AI agents can be integrated into professional workflows and would love to hear your insights! As a thank-you, participants will receive a gift card. Take the survey here!
- (Feb, 2025) The code of Collaborative Gym is now released. Plan your trip or analyze your tabular data together with an AI agent through our interactive platform!
- (Feb, 2025) Will give a performance with Stanford Dancesport at the 47th Annual Stanford Viennese Ball on February 21 in SF! 💃
- (Jan, 2025) Check out my new blog post "Hands-on Experience with Devin: Reflections from a Person Building and Evaluating Agentic Systems".
- (Jan, 2025) Invited talk on "From Automation to Human-Agent Collaboration: Challenges and Opportunities" at the Center for Decoding Universe Quarterly Forum.
Selected Projects

Human-Agent Collaboration
While most agent research focuses on full automation, we are developing agents as teammates that collaborate with users in the same work environment.
Highlights:
- Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration, Preprint (🕹️ Interactive Platform)
We introduce Collaborative Gym, a framework for enabling and evaluating human-agent collaboration. Unlike fully autonomous agents that operate independently, Co-Gym’s AI agents engage in a dynamic collaboration process, allowing human users to intervene and guide decisions in real time.
LM-Empowered System for Knowledge Curation
We study the development of knowledge agents for writing long, organized, and well-grounded articles, and how humans can collaborate with these agents.
Highlights:
- Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models, In NAACL 2024 (📰 Wikimedia Coverage)
We introduce STORM, a system that generates full-length, Wikipedia-style articles by conducting perspective-driven literature research and organizing information into structured outlines.
- Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations, In EMNLP 2024
We introduce Collaborative-STORM, a human-in-the-loop multi-agent system designed to uncover an individual's "unknown unknowns" within their topics of interest by scaffolding human participation in multi-agent conversations.

Continual Learning in NLP
We study (1) continual pre-training/post-training of language models (LMs) and (2) enabling LMs to continually learn new tasks after deployment.
Highlights:
- Continual Pre-training of Language Models, In ICLR 2023
We propose a post-training algorithm with an adaptive soft-masking mechanism that selectively updates LM parameters based on the post-training corpus to minimize catastrophic forgetting and enhance knowledge transfer.
- Class-Incremental Learning based on Label Generation, In ACL 2023
We investigate continual learning with classification and generation objectives by examining representation collapse in pretrained models throughout the learning process.
Other Related Works:
Domain Adaptive Pre-training (EMNLP’22), Few-shot Continual Learning (EMNLP’22), Investigating Continual Learning in Computer Vision (ICLR’24)
Recent Preprints & Publications
(*: Equal Contribution)
Selected Awards
- School of Engineering Fellowship, Stanford, 2023
- SenseTime Scholarship, 2022 (awarded to 30 students in China)
- May 4th Scholarship, 2021 (the highest honor for students in PKU)
- National Scholarship, 2020, 2022
- First prize in 12th Chinese Mathematics Competition Final, 2020
- Merit Student Pacesetter, 2020, 2021, 2022
- First Class Scholarship for Freshmen of Peking University, 2019