![]() |
Principal Research Manager, Knowledge and Language Team, Microsoft Cognitive Services Research Group Ph.D., Computer Science Department, Stanford University M.S., Statistics Department, Stanford University B. Eng., Computer Science & Technology Department, Tsinghua University Email: A [at] B, where A=chezhu and B is microsoft.com [ My Microsoft Homepage | Google Scholar | Books | Publications | Patents | Mentored Interns ] |
I am a Principal Research Manager in Microsoft Cognitive Service Research Group, leading the Knowledge and Language Team. Our team's research directions are 1) Knowledge-enhanced Language Model, 2) Summarization, 3) Few-shot and Prompt Learning, and 4) Multimodal Learning. We develop state-of-the-art deep learning technologies for both research and business applications. Our team has achieved first places in multiple NLP competitions, including achieving human parity in CommonsenseQA, HellaSwag and CoQA, 1st places in CommonGen, FEVER, ARC and SQuAD v1.0. Before Microsoft, I got my PhD in Computer Science Department, Stanford University and Bachelor in Computer Science from Yao Class, Tsinghua University.
I organize the Distinguished Talk Series in Microsoft Cognitive Services Research Group. If you're interested in giving a talk, please contact me.
Feb. 3, 2022: Prof. Diyi Yang from Stanford University and I will give a tutorial on Dialogue Summarization at EACL 2023.
Jan. 20, 2022: 1 paper accepted at ICLR 2023.
Nov. 18, 2022: 1 paper accepted at AAAI 2023.
Nov. 5, 2022: Our tutorial on Knowledge-Augmented Methods for Natural Language Processing has been accepted at WSDM 2023.
Oct. 12, 2022: I am on the thesis committee of Donghan Yu from Carnegie Mellon University and Vin Sachidananda from Stanford University.
Oct. 6, 2022: 11 papers accepted at EMNLP 2022.
Sep. 16, 2022: Chenguang Zhu gave a virtual talk "How We Achieved Human Parity in CommonsenseQA – Fusing Knowledge into Language Models" at ISI Seminar of USC. [ Slides | Video ]
Sep. 15, 2022: We will organize The Workshop on Knowledge Augmented Methods for NLP (KnowledgeNLP-AAAI’23) at AAAI 2023. Welcome to submit!
Sep. 14, 2022: 2 papers accepted at NeurIPS 2022.
Jul. 15, 2022: I am one of the panelists at Deep Learning on Graphs Workshop for Natural Language Processing at NAACL 2022. [ Website | Recording (The panel starts at 2:00:30) ]
May 22, 2022: Tutorial on Knowledge-Augmented Methods for Natural Language Processing at ACL 2022. [ Website | Video ]
May 11, 2022: My team has achieved human parity on HellaSwag leaderboard.
Apr. 26, 2022: My paper on Impossible Triangle in Pre-trained Language Model has been reported in multiple media. [ 机器之心 | AI科技评论 ]
My recent work covers:
Finegrained paraphrasing for NLG evaluation and data augmentation [ Paper ]
The "Impossible Triangle" of pre-trained language models [ Paper ]
How to leverage knowledge in the training data to improve NLP models [ Paper ]
Integrative multimodal learning framework i-Code [ Paper ]
![]() |
Machine Reading Comprehension: Algorithm and Practice (Chinese Edition) 《机器阅读理解:算法与实践》 Chenguang Zhu China Machine Press (机械工业出版社) , 2020.03 Top 5 Favorite IT Books (Artificial Intelligence) in 2020 by 51CTO.com [ Link ] [ Amazon.com | China-pub | jd.com | dangdang.com | tmall.com | Amazon.cn ] [ GitHub Code ] |
![]() |
Machine Reading Comprehension: Algorithm and Practice Chenguang Zhu Elsevier, 2021.04 [ Amazon.com | Google Books | Barnes & Noble ] [ GitHub Code ] |
Generate rather than Retrieve: Large Language Models are Strong Context Generators
Wenhao Yu, Dan Iter, Shuohang Wang, Yichong Xu, Mingxuan Ju, Soumya Sanyal, Chenguang Zhu, Michael Zeng, Meng Jiang
International Conference on Learning Representations (ICLR), Kigali, Rwanda, 2023.
[ arXiv ]
i-Code: An Integrative and Composable Multimodal Learning Framework
Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Yuan Lu, Takuya Yoshioka, Michael Zeng, Xuedong Huang
The 37th AAAI Conference on Artificial Intelligence (AAAI), Washington DC, USA, 2023.
[ arXiv | AI科技评论 | Synced Review | MarkTechPost ]
Towards A Unified Multi-Dimensional Evaluator For Text Generation
Ming Zhong, Yang Liu, Da Yin, Yuning Mao, Yizhu Jiao, Pengfei Liu, Chenguang Zhu, Heng Ji and Jiawei Han
Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv ]
A Unified Encoder-Decoder Framework with Entity Memory
Zhihan Zhang, Wenhao Yu, Chenguang Zhu, Meng Jiang
Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv ]
Retrieval Augmentation for Commonsense Reasoning: A Unified Approach
Wenhao Yu, Chenguang Zhu, Zhihan Zhang, Shuohang Wang, Zhuosheng Zhang, Yuwei Fang and Meng Jiang
Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv ]
Empowering Language Models with Knowledge Graph Reasoning for Open-Domain Question Answering
Ziniu Hu, Yichong Xu, Wenhao Yu, Shuohang Wang, Ziyi Yang, Chenguang Zhu, Kai-Wei Chang and Yizhou Sun
Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
Best Paper Award at SoCalNLP 2022 Symposium
[ arXiv ]
ParaTag: A Dataset of Paraphrase Tagging for Fine-Grained Labels, NLG Evaluation, and Data Augmentation
Shuohang Wang, Ruochen Xu, Yang Liu, Chenguang Zhu, Michael Zeng
Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
Leveraging Locality in Abstractive Text Summarization
Yixin Liu, Ansong Ni, Linyong Nan, Budhaditya Deb, Chenguang Zhu, Ahmed H. Awadallah, Dragomir Radev
Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv ]
Automatic Rule Induction for Efficient Semi-Supervised Learning
Reid Pryzant, Ziyi Yang, Yichong Xu, Chenguang Zhu, Michael Zeng
Findings of Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv ]
Narrate Dialogues for Better Summarization
Ruochen Xu, Chenguang Zhu, Michael Zeng
Findings of Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
AdaPrompt: Adaptive Model Training for Prompt-based NLP
Yulong Chen, Yang Liu, Li Dong, Shuohang Wang, Chenguang Zhu, Michael Zeng, Yue Zhang
Findings of Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv ]
Unsupervised Summarization with Customized Granularities
Ming Zhong, Yang Liu, Suyu Ge, Yuning Mao, Yizhu Jiao, Xingxing Zhang, Yichong Xu, Chenguang Zhu, Michael Zeng, Jiawei Han
Findings of Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv ]
Task Compass: Scaling Multi-task Pre-training with Task Prefix
Zhuosheng Zhang, Shuohang Wang, Yichong Xu, Yuwei Fang, Wenhao Yu, Yang Liu, Hai Zhao, Chenguang Zhu, Michael Zeng
Findings of Empirical Methods in Natural Language Processing (EMNLP), Abu Dhabi, the United Arab Emirates, 2022.
[ arXiv | Code ]
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Zhenhailong Wang, Manling Li, Ruochen Xu, Luowei Zhou, Jie Lei, Xudong Lin, Shuohang Wang, Ziyi Yang, Chenguang Zhu, Derek Hoiem, Shih-Fu Chang, Mohit Bansal, Heng Ji
Conference on Neural Information Processing Systems (NeurIPS), New Orleans, Louisiana, USA, 2022.
[ arXiv ]
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan
Conference on Neural Information Processing Systems (NeurIPS), New Orleans, Louisiana, USA, 2022.
[ arXiv ]
APOLLO: A Simple Approach for Adaptive Pre-training of Language Models for Logical Reasoning
Soumya Sanyal, Yichong Xu, Shuohang Wang, Ziyi Yang, Reid Pryzant, Wenhao Yu, Chenguang Zhu, Xiang Ren
Conference on Neural Information Processing Systems (NeurIPS), Workshop on Distribution Shifts: Connecting Methods and Applications, New Orleans, Louisiana, USA, 2022.
[ arXiv ]
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Yichong Xu, Chenguang Zhu, Shuohang Wang, Siqi Sun, Hao Cheng, Xiaodong Liu, Jianfeng Gao, Pengcheng He, Michael Zeng, Xuedong Huang
The 31st International Joint Conference on Artificial Intelligence (IJCAI), Vienna, Austria, 2022.
[ arXiv | Code | Blog ]
[ Human parity in CommonsenseQA leaderboard, 2021.11.12 ]
Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training Data
Shuohang Wang, Yichong Xu, Yuwei Fang, Yang Liu, Siqi Sun, Ruochen Xu, Chenguang Zhu, Michael Zeng
Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv | CSDN ]
KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering
Donghan Yu, Chenguang Zhu, Yuwei Fang, Wenhao Yu, Shuohang Wang, Yichong Xu, Xiang Ren, Yiming Yang, Michael Zeng
Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv ]
Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-Modal Knowledge Transfer
Woojeong Jin, Dong-Ho Lee, Chenguang Zhu, Jay Pujara, Xiang Ren
Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization
Ziming Mao, Chen Henry Wu, Ansong Ni, Yusen Zhang, Rui Zhang, Tao Yu, Budhaditya Deb, Chenguang Zhu, Ahmed H. Awadallah, Dragomir Radev
Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv ]
Summ^N: A Multi-Stage Summarization Framework for Long Input Dialogues and Documents
Yusen Zhang, Ansong Ni, Ziming Mao, Chen Henry Wu, Chenguang Zhu, Budhaditya Deb, Ahmed H. Awadallah, Dragomir Radev, Rui Zhang
Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv ]
Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts
Wenhao Yu, Chenguang Zhu, Lianhui Qin, Zhihan Zhang, Tong Zhao, Meng Jiang
Findings of Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv ]
Leveraging Knowledge in Multilingual Commonsense Reasoning
Yuwei Fang, Shuohang Wang, Yichong Xu, Ruochen Xu, Siqi Sun, Chenguang Zhu, Michael Zeng
Findings of Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv ]
Dict-BERT: Enhancing Language Model Pre-training with Dictionary
Wenhao Yu, Chenguang Zhu, Yuwei Fang, Donghan Yu, Shuohang Wang, Yichong Xu, Michael Zeng, Meng Jiang
Findings of Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv ]
End-to-End Segmentation-based News Summarization
Yang Liu, Chenguang Zhu, Michael Zeng
Findings of Association for Computational Linguistics (ACL), Dublin, Ireland, 2022.
[ arXiv ]
CLIP-Event: Connecting Text and Images with Event Structures
Manling Li, Ruochen Xu, Shuohang Wang, Luowei Zhou, Xudong Lin, Chenguang Zhu, Michael Zeng, Heng Ji, Shih-Fu Chang
Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, Louisiana, USA, 2022. (Oral)
[ arXiv ]
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Nanyun (Violet) Peng, Zicheng Liu, Michael Zeng
Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, Louisiana, USA, 2022.
[ arXiv ]
A Survey of Knowledge-Enhanced Text Generation
Wenhao Yu, Chenguang Zhu, Zaitang Li, Zhiting Hu, Qingyun Wang, Heng Ji, Meng Jiang
ACM Computing Surveys (Impact factor: 10.282)
[ arXiv ]
JAKET: Joint Pre-training of Knowledge Graph and Language Understanding
Donghan Yu*, Chenguang Zhu*, Yiming Yang, Michael Zeng
(*: Equal contribution)
36th AAAI Conference on Artificial Intelligence (AAAI), 2022.
[ arXiv ]
DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization
Ming Zhong, Yang Liu, Yichong Xu, Chenguang Zhu, Michael Zeng
36th AAAI Conference on Artificial Intelligence (AAAI), 2022.
[ arXiv ]
Unifying Vision, Text, and Layout for Universal Document Processing
Zineng Tang, Ziyi Yang, Guoxin Wang, Yuwei Fang, Yang Liu, Chenguang Zhu, Michael Zeng, Cha Zhang, Mohit Bansal
arXiv preprint arXiv: 2212.02623, 2022.
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Shuquan Ye, Yujia Xie, Dongdong Chen, Yichong Xu, Lu Yuan, Chenguang Zhu, Jing Liao
arXiv preprint arXiv: 2211.16504, 2022.
UniSumm: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning
Yulong Chen, Yang Liu, Ruochen Xu, Ziyi Yang, Chenguang Zhu, Michael Zeng, Yue Zhang
arXiv preprint arXiv: 2211.09783, 2022.
[ Code ]
MACSum: Controllable Summarization with Mixed Attributes
Yusen Zhang, Yang Liu, Ziyi Yang, Yuwei Fang, Yulong Chen, Dragomir Radev, Chenguang Zhu, Michael Zeng, Rui Zhang
arXiv preprint arXiv: 2211.05041, 2022.
Tail Batch Sampling: Approximating Global Contrastive Losses as Optimization over Batch Assignments
Vin Sachidananda, Ziyi Yang, Chenguang Zhu
arXiv preprint arXiv: 2210.12874, 2022.
FAST: Improving Controllability for Text Generation with Feedback Aware Self-Training
Junyi Chai, Reid Pryzant, Victor Ye Dong, Konstantin Golobokov, Chenguang Zhu, Yi Liu
arXiv preprint arXiv: 2210.03167, 2022.
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
Pengcheng He, Baolin Peng, Liyang Lu, Song Wang, Jie Mei, Yang Liu, Ruochen Xu, Hany Hassan Awadalla, Yu Shi, Chenguang Zhu, Wayne Xiong, Michael Zeng, Jianfeng Gao, Xuedong Huang
arXiv preprint arXiv: 2208.09770, 2022.
Impossible Triangle: What’s Next for Pre-trained Language Models?
Chenguang Zhu, Michael Zeng
arXiv preprint arXiv: 2204.06130, 2022.
[ 机器之心 | AI科技评论 ]
Want To Reduce Labeling Cost? GPT-3 Can Help
Shuohang Wang, Yang Liu, Yichong Xu, Chenguang Zhu and Michael Zeng
Findings of Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic, 2021.
[ arXiv ]
Sentence-Permuted Paragraph Generation
Wenhao Yu, Chenguang Zhu, Tong Zhao, Zhichun Guo, Meng Jiang
Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic, 2021.
[ arXiv ]
Injecting Entity Types into Entity-Guided News Generation
Xiangyu Dong*, Wenhao Yu*, Chenguang Zhu and Meng Jiang
(*: Equal contribution)
Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic, 2021.
[ arXiv ]
An Exploratory Study on Long Dialogue Summarization: What Works and What's Next
Yusen Zhang*, Ansong Ni*, Tao Yu, Rui Zhang, Chenguang Zhu, Budhaditya Deb, Asli Celikyilmaz, Ahmed Hassan Awadallah and Dragomir Radev
(*: Equal contribution)
Findings of Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic, 2021.
[ arXiv ]
Modeling Entity Knowledge for Fact Verification
Yang Liu, Cenguang Zhu, Michael Zeng
Fact Extraction and VERification Workshop (FEVER) in Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic, 2021.
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
[ Leaderboard | arXiv ]
Baolin Peng, Chunyuan Li, Zhu Zhang, Chenguang Zhu, Jinchao Li, Jianfeng Gao
Association for Computational Linguistics (ACL), Bangkok, Thailand, 2021.
Fusing Context Into Knowledge Graph for Commonsense Reasoning
Yichong Xu*, Chenguang Zhu*, Ruochen Xu, Yang Liu, Michael Zeng, Xuedong Huang
(*: Equal contribution)
Findings of Association for Computational Linguistics (ACL), Bangkok, Thailand, 2021.
[ arXiv ]
Retrieval Enhanced Model for Commonsense Generation
Han Wang, Yang Liu, Chenguang Zhu, Linjun Shou, Ming Gong, Yichong Xu, Michael Zeng
Findings of Association for Computational Linguistics (ACL), Bangkok, Thailand, 2021.
[ arXiv | 1st place on CommonGen leaderboard, 2021.01.13]
Leveraging Lead Bias for Zero-shot Abstractive News Summarization
Chenguang Zhu, Ziyi Yang, Robert Gmyr, Michael Zeng, Xuedong Huang
The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Montreal, Canada, 2021.
[NIPS 2020 Self-Supervised Learning Workshop version | NIPS Poster | arXiv | Talk ]
Enhancing Factual Consistency of Abstractive Summarization
Chenguang Zhu, William Hinthorn, Ruochen Xu, Qingkai Zeng, Michael Zeng, Xuedong Huang, Meng Jiang
North American Chapter of the Association for Computational Linguistics (NAACL), Mexico City, Mexico, 2021.
[ arXiv | Predictions | Talk ]
MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization
Chenguang Zhu*, Yang Liu*, Jie Mei and Michael Zeng
(*: Equal contribution)
North American Chapter of the Association for Computational Linguistics (NAACL), Mexico City, Mexico, 2021.
[ arXiv | Hugging Face | GitHub | Talk ]
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding
Yu-An Chung*, Chenguang Zhu*, Michael Zeng
(*: Equal contribution)
North American Chapter of the Association for Computational Linguistics (NAACL), Mexico City, Mexico, 2021.
[ arXiv ]
Filtered Inner Product Projection for Multilingual Embedding Alignment
Vin Sachidananda, Ziyi Yang, Chenguang Zhu
International Conference on Learning Representations (ICLR), Vienna, Austria, 2021.
[ arXiv | Code ]
Data Augmentation for Spoken Language Understanding via Pretrained Models
Baolin Peng∗, Chenguang Zhu∗, Michael Zeng, Jianfeng Gao
INTERSPEECH, Brno, Czechia, 2021.
(*: Equal contribution)
[ arXiv ]
MLP Architectures for Vision-and-Language Modeling: An Empirical Study
Yixin Nie*, Linjie Li*, Zhe Gan, Shuohang Wang, Chenguang Zhu, Michael Zeng, Zicheng Liu, Mohit Bansal, Lijuan Wang (*: Equal contribution)
arXiv preprint arXiv: 2112.04453, 2021.
Does Knowledge Help General NLU? An Empirical Study
Ruochen Xu*, Yuwei Fang*, Chenguang Zhu, Michael Zeng
(*: Equal contribution)
arXiv preprint arXiv: 2109.00563, 2021.
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising
[ arXiv ]
Ziyi Yang*, Chenguang Zhu*, Robert Gmyr, Michael Zeng, Xuedong Huang, Eric Darve
(*: Equal contribution)
Findings of Empirical Methods in Natural Language Processing (EMNLP), 2020.
A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining
[ arXiv | Talk | Code ]
Chenguang Zhu*, Ruochen Xu*, Michael Zeng, Xuedong Huang
(*: Equal contribution)
Findings of Empirical Methods in Natural Language Processing (EMNLP), 2020.
Few-shot Natural Language Generation for Task-Oriented Dialog
[ arXiv | Code & Demo ]
Baolin Peng, Chenguang Zhu, Chunyuan Li, Xiujun Li, Jinchao Li, Michael Zeng, Jianfeng Gao
Findings of Empirical Methods in Natural Language Processing (EMNLP), 2020.
Mixed-Lingual Pre-training for Cross-lingual Summarization
[ arXiv ]
Ruochen Xu*, Chenguang Zhu*, Yu Shi, Michael Zeng, Xuedong Huang
(*: Equal contribution)
Asia-Pacific Chapter of the Association for Computational Linguistics (AACL), Suzhou, China, 2020.
Boosting Naturalness of Language in Task-oriented Dialogues via Adversarial Training
[ Talk ]
Chenguang Zhu
Special Interest Group on Discourse and Dialogue (SIGdial), Boise, Idaho, 2020.
Text Summarization
文本摘要:浓缩的才是精华
Chenguang Zhu, Nanshan Zeng
Communications of the China Computer Federation (CCCF), 《中国计算机学会通讯》,Jul. 2020.
Accelerating Real-Time Question Answering via Question Generation
Yuwei Fang, Shuohang Wang, Zhe Gan, Siqi Sun, Jingjing Liu, Chenguang Zhu
arXiv preprint arXiv: 2009.05167, 2020.
Meta Dialogue Policy Learning
Yumo Xu, Chenguang Zhu, Baolin Peng, Michael Zeng
arXiv preprint arXiv: 2006.02588, 2020.
Multi-task Learning for Natural Language Generation in Task-Oriented Dialogue
[ Poster ]
Chenguang Zhu, Michael Zeng, Xuedong Huang
Empirical Methods in Natural Language Processing (EMNLP), Hong Kong, China, 2019.
Parameter-free Sentence Embedding via Orthogonal Basis
[ Code | Slides | Talk ]
Ziyi Yang, Chenguang Zhu, Weizhu Chen
Empirical Methods in Natural Language Processing (EMNLP), Hong Kong, China, 2019.
Embedding Imputation with Grounded Language Information
[ Poster | Code ]
Ziyi Yang, Chenguang Zhu, Vin Sachidananda, Eric Darve
Association for Computational Linguistics (ACL), Florence, Italy, 2019.
Learning to Attend On Essential Terms: An Enhanced Retriever-Reader Model for Open-domain Question Answering
[ Code | Poster ]
Jianmo Ni, Chenguang Zhu, Weizhu Chen, Julian McAuley.
North American Chapter of the Association for Computational Linguistics (NAACL), Minneapolis, USA, 2019.
SIM: A Slot-Independent Neural Model for Dialogue State Tracking
[ Talk at Stanford HAI OVAL | Poster ]
Chenguang Zhu, Michael Zeng, Xuedong Huang
Special Interest Group on Discourse and Dialogue (SIGdial), Stockholm, Sweden, 2019.
Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization
[ arXiv | Poster ]
Beliz Gunel, Chenguang Zhu, Michael Zeng, Xuedong Huang
Conference on Neural Information Processing Systems (NeurIPS), Knowledge Representation & Reasoning Meets Machine Learning (KR2ML workshop), Vancouver, Canada, 2019.
Machine Reading Comprehension: How to Make Computer Understand Articles
机器阅读理解:如何让计算机读懂文章
Chenguang Zhu
Communications of the China Computer Federation (CCCF), 《中国计算机学会通讯》,Feb. 2019.
FusionNet: Fusing via Fully-Aware Attention with Application to Machine Comprehension
[ arXiv | Code | Poster ]
Hsin-Yuan Huang, Chenguang Zhu, Yelong Shen, Weizhu Chen.
International Conference on Learning Representations (ICLR), Vancouver, Canada, 2018.
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering
[ Code ]
Chenguang Zhu, Michael Zeng, Xuedong Huang.
arXiv preprint arXiv: 1812.03593, 2018.
Measuring the Pulse of a City Via Taxi Operation: A Case Study
Chenguang Zhu, Balaji Prabhakar.
Transportation Research Board 96th Annual Meeting (TRB), Washington, D.C., 2017.
Reducing Inefficiencies in Taxi Systems
Chenguang Zhu, Balaji Prabhakar.
56th IEEE Conference on Decision and Control (CDC), Melbourne, Australia, 2017.
Reducing Road Congestion Through Incentives: A Case Study
Chenguang Zhu, Jia Shuo Yue, Chinmoy V. Mandayam, Deepak Merugu, Hossein Karkeh Abadi, Balaji Prabhakar.
Transportation Research Board 94th Annual Meeting (TRB), Washington, D.C., 2015.
Featured on The New York Times, The Wall Street Journal, International Business Times, Ars Technica and Stanford News
Polling One's Friends: A Graph Theoretic View
Chenguang Zhu, Hossein Karkeh Abadi, Balaji Prabhakar.
53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), 2015.
Information Diffusion and External Influence in Networks
Seth A. Myers, Chenguang Zhu, Jure Leskovec.
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2012
A Novel Click Model and Its Applications to Online Advertising
Zeyuan Allen Zhu, Weizhu Chen, Tom Minka, Chenguang Zhu, Zheng Chen.
ACM International Conference on Web Search and Data Mining (WSDM), 2010
A General Magnitude-Preserving Boosting Algorithm for Search Ranking
Chenguang Zhu, Weizhu Chen, Zeyuan Allen Zhu, Gang Wang, Dong Wang, Zheng Chen.
ACM Conference on Information and Knowledge Management (CIKM), 2009
Inverse Time Dependency in Convex Regularized Learning
Zeyuan Allen Zhu, Weizhu Chen, Chenguang Zhu, Gang Wang, Haixun Wang, Zheng Chen.
IEEE International Conference of Data Mining (ICDM), 2009 Best Student Paper Award Runner-Up
P-packSVM: Parallel Primal Gradient Descent Kernel SVM
Zeyuan Allen Zhu, Weizhu Chen, Gang Wang, Chenguang Zhu, Zheng Chen.
IEEE International Conference of Data Mining (ICDM), 2009
Synthetic data generation for training of natural language understanding models (US11508360B2)
Baolin Peng, Chenguang Zhu, Chunyuan Li, Xiujun Li, Jinchao Li, Nanshan Zeng, Jianfeng Gao
Generation Of Optimized Knowledge-based Language Model Through Knowledge Graph Multi-alignment (US 20220230625 A1)
Chenguang Zhu, Nanshan Zeng.
Generation Of Optimized Spoken Language Understanding Model Through Joint Training With Integrated Knowledge Language Module (US 20220230628 A1)
Chenguang Zhu, Nanshan Zeng.
Generation Of Optimized Spoken Language Understanding Model Through Joint Training With Integrated Acoustic Knowledge-speech Module (US 20220230629 A1)
Chenguang Zhu, Nanshan Zeng.
Automated meeting minutes generator (WO2021242399A1)
Chenguang Zhu, Yu Shi, William Isaac Hinthorn, Nanshan Zeng, Ruochen Xu, Liyang Lu, Xuedong Huang.
Using machine comprehension to answer a question (US 20190156220)
Chenguang Zhu, Hsin-Yuan Huang, Pengcheng He, Weizhu Chen, Yelong Shen, Zheng Chen.
Conversational Virtual Assistant (US 20180232376)
Chenguang Zhu, Weizhu Chen, Jianwen Zhang, Xuedong Huang, Zheng Chen.
Caching Content Addressable Data Chunks for Storage Virtualization (US 20140280664)
Sudipta Sengupta, Chenguang Zhu, Chun Ho Cheung, Jin Li, Abhishek Gupta.
I am very fortunate to have mentored and worked with talented students.
Vin Sachidananda from Stanford University. I am the co-advisor of EE PhD student Vin and our work on embedding alignment and imputation have been published in ICLR 2021 and ACL 2019.
Wenhao Yu (2021 summer), University of Notre Dame. Our three papers on language generation were published in EMNLP 2021 and ACL 2022. Our paper on dictionary-boosted language model was published in ACL 2022.
Han Wang (2020 winter), New York University. Our paper on common sense language generation was published in ACL 2021.
Donghan Yu (2021 summer and 2020 summer), Carnegie Mellon University. Our two papers on language models boosted by knowledge graph were published in AAAI 2022 and ACL 2022.
Yu-An Chung (2020 summer), MIT. Our paper on speech-text co-pretraining was published in NAACL 2021.
Yumo Xu (2020 spring), University of Edinburgh.
Ziyi Yang (2018 summer and 2019 summer), Stanford University. Our paper on word embedding was published in ACL 2019. Our paper on sentence embeddings was published in EMNLP 2019. Our paper on unsupervised text summarization was published in EMNLP 2020. Our paper on zero-shot news summarization was published in SIGIR 2021. Our paper on multilingual embedding alignment was published in ICLR 2021.
Beliz Gunel (2019 summer), Stanford University. Our paper on knowledge-boosted text summarization was published in NeurIPS 2019 workshop of KR2ML.
Jianmo Ni (2018 summer), UC San Diego. Our paper on machine reading comprehension was published in NAACL 2019.
Hsin-Yuan Huang (2017 summer), Caltech. Our paper on machine reading comprehension was published on ICLR 2018.
Panelist at Deep Learning on Graphs Workshop for Natural Language Processing (DLG4NLP at NAACL), Seattle, 2022.7. [ Recording (The panel starts at 2:00:30) ]
Tutorial on Knowledge-Augmented Methods for Natural Language Processing at ACL 2022, 2022.5. [ Website | Video ]
How We Achieved Human Parity in CommonsenseQA – Fusing Knowledge into Language Models" at NLP seminar at Stanford University, 2022.3. [ Slides ]
How We Achieved Human Parity in CommonsenseQA – Fusing Knowledge into Language Models" at NLP and AI seminar at Georgia Tech, 2022.2. [ Slides ]
How We Achieved Human Parity in CommonsenseQA – Fusing Knowledge into Language Models. BLISS seminar (Berkeley Laboratory for Information and System Sciences) at UC Berkeley, 2022.2. [ Slides ]
Fusing Knowledge into Language Model. Machine Learning / Duolingo Seminar at School of Computer Science, Carnegie Mellon University, 2021.10. [ Slides ]
Panelist at SIGDial 2021 Special Session on Summarization of Dialogues and Multi-Party Meetings (SummDial 2021), Virtual, 2021.7
Knowledge Graph and Its Applications in NLP. Seminar at Department of Computer Science and Engineering, University of Notre Dame, 2020.09
Machine Reading Comprehension
Future of Computing Forum, Tsinghua University, 2020.05 [Video]
AI Express, Microsoft Online, 2020.05 [Video]
Research Progress in Task-Oriented Dialogue. First Open Virtual Assistant Workshop, Stanford University, 2019.10 [Video]
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering. Stanford NLP Seminar, Stanford University, 2019.01
FusionNet: Fusing with Fully Aware Attention in Machine Reading Comprehension
Stanford Platform Lab Seminar, Stanford University, 2018.02
Guest lecture at EE392K, Stanford University, 2018.02
ACM International Collegiate Programming Contest (ICPC), World Finals 2012: 13th place (Representing Stanford University), UPE First Solution Award [ Photo at Award Ceremony ]
Winner of Stanford Local Programming Contest, 2010, 2011
Best Student Paper Award Runner-Up at IEEE International Conference of Data Mining (ICDM), 2009
National Champion in US National Table Tennis Championships U2000 Division D, Dec. 2015
Second Runner-up in men's singles table tennis match of Tsinghua University