Recent Preprints
-
Show, Don't Tell: Aligning Language Models with Demonstrated Feedback
arXiv:2406.00888, 2024 [pdf]
-
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
arXiv:2409.04109, 2024 [pdf]
-
Social Skill Training with Large Language Models
arXiv:2404.04204, 2024 [pdf]
-
Design2Code: How Far Are We From Automating Front-End Engineering
arXiv:2403.03163, 2024 [pdf]
Publications [Google Scholar]
-
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
NeurIPS, 2024 [pdf]
-
PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
NeurIPS D\&B, 2024 [pdf]
-
Demystifying Verbatim Memorization in Large Language Models
EMNLP, 2024 [pdf]
-
CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies
EMNLP Findings, 2024 [pdf]
-
Unintended Impacts of LLM Alignment on Global Representation
ACL, 2024 [pdf]
-
Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors
ACL, 2024 [pdf]
-
Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future
ACL Findings, 2024 [pdf]
-
Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization
COLM, 2024. [pdf]
-
Rehearsal: Simulating Conflict to Teach Conflict Resolution
CHI, 2024. [pdf]
-
Grounding or Guesswork? Large Language Models Are Presumptive Grounders
NAACL, 2024. [pdf]
-
DyVal: Graph-informed Dynamic Evaluation of Large Language Models
ICLR, 2024. [pdf]
-
Training socially aligned language models on simulated social interactions
ICLR, 2024. [pdf]
-
Using Large Language Models in Psychology
Nature Reviews Psychology, 2023. [pdf]
-
Can Large Language Models Transform Computational Social Science?
Computational Linguistics, 2023. [pdf]
-
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
EMNLP, 2023. [pdf]
-
CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
EMNLP, 2023. [pdf]
-
DADA: Dialect Adaptation via Dynamic Aggregation of Linguistic Rules
EMNLP, 2023. [pdf]
-
Is ChatGPT a General-Purpose Natural Language Processing Task Solver?
EMNLP, 2023. [pdf]
-
CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation
EMNLP, 2023. [pdf]
-
Impressions: Visual Semiotics and Aesthetic Impact Understanding
EMNLP, 2023. [pdf]
-
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency
EMNLP, 2023. [pdf]
-
Understanding Black Content Creator Experiences on TikTok
CSCW, 2023. [pdf]
Recognition for Contribution to Diversity and Inclusion Award
-
Multi-VALUE: A Framework for Cross-Dialectal English NLP
-
NormBank: A Knowledge Bank of Situational Social Norms
ACL, 2023. [pdf]
-
On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
ACL, 2023. [pdf]
-
Forgotten Knowledge: Examining the Citational Amnesia in NLP
ACL, 2023. [pdf]
Best Paper Honorable Mention
-
Human-in-the-loop Abstractive Dialogue Summarization
ACL (Findings), 2023. [pdf]
-
Task Agnostic Dialect Adapters for English
ACL (Findings), 2023. [pdf]
-
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet
ACL (Findings), 2023. [pdf]
-
Parameter-Efficient Fine-Tuning Design Spaces
ICLR, 2023. [pdf]
-
Shapley Head Pruning: Identifying and Removing Interference in Multilingual Transformers
EACL, 2023. [pdf]
-
Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
EACL (Findings), 2023. [pdf]
-
Metrics for Peer Counseling: Triangulating Success Outcomes for Online Therapy Platforms
SIGCHI, 2023. [pdf]
-
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP
TACL. [pdf]
-
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
TACL. [pdf]
-
Geographic Citation Gaps in NLP Research
EMNLP, 2022. [pdf]
-
When FLUE Meets FLANG: Benchmarks and Large Pretrained Language Model for Financial Domain
EMNLP, 2022. [pdf]
-
Robustness of Demonstration-based Learning Under Limited Data Scenario
EMNLP, 2022. [pdf]
-
A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch
ECCV, 2022. [pdf]
-
Modeling Motivational Interviewing Strategies On Online Peer-to-Peer Counseling Platforms
CSCW, 2022. [pdf]
-
SUBS: Subtree Substitution for Compositional Semantic Parsing
NAACL, 2022. [pdf]
-
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
NAACL, 2022. [pdf]
-
Explaining Toxic Text via Knowledge Enhanced Text Generation
NAACL, 2022. [pdf]
-
Measure and Improve Robustness in NLP Models: A Survey
NAACL, 2022. [pdf]
-
Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models
NAACL (Findings), 2022. [pdf]
-
Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models
NAACL (Findings), 2022. [pdf]
-
Exploring the Role of Grammar and Word Choice in Bias Toward AAVE in Hate Speech Classification
FAccT, 2022. [pdf]
-
VALUE: Understanding Dialect Disparity in NLU
-
Inducing Positive Perspectives with Text Reframing
ACL, 2022. [pdf]
Outstanding Paper Award
-
Continual Sequence Generation with Adaptive Compositional Modules
ACL, 2022. [pdf]
-
Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems
ACL, 2022. [pdf]
-
Focus on the Action: Learning to Highlight and Summarize Jointly for Email To-Do Items Summarization
ACL (Findings), 2022. [pdf]
-
FairytaleQA: An Authentic Dataset for Narrative Comprehension
ACL, 2022. [pdf]
-
GNN is a Counter? Revisiting GNN for Question Answering
ICLR, 2022. [pdf]
-
Will AI Console Me when I Lose my Pet? Understanding Perceptions of AI-Mediated Email Writing
CHI, 2022. [pdf]
-
Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages
CHI, 2022. [pdf]
Best Paper Honorable Mention
-
Linguistic Characterization of Divisive Topics Online: Case Studies on Contentiousness in Abortion, Climate Change, and Gun Control
ICWSM 2022. [pdf]
-
Simple Conversational Data Augmentation for Semi-supervised Abstractive Dialogue Summarization
EMNLP, 2021. [pdf]
-
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech
EMNLP, 2021. [pdf]
-
To Protect and To Serve? Analyzing Entity-Centric Framing of Police Violence
EMNLP (Findings), 2021. [pdf]
-
RECAST: Enabling User Recourse and Interpretability of Toxicity Detection Models with Interactive Visualization
CSCW, 2021. [pdf]
-
Evaluating the Effectiveness of Deplatforming as a Moderation Strategy on Twitter
CSCW, 2021. [pdf]
Best Paper Honorable Mention
-
Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework
ACM conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2021. [pdf]
Best Student Paper Award
-
Understanding the Usage of Online Media for Parenting from Infancy to Preschool At Scale
SIGCHI, 2021. [pdf]
Best Paper Honorable Mention
-
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability
ACL, 2021. [pdf]
-
A Dataset for Understanding Disfluencies in Question Answering
ACL (Findings), 2021. [pdf]
-
Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs
NAACL, 2021. [pdf]
-
Continual Learning for Text Classification with Information Disentanglement Based Regularization
NAACL, 2021. [pdf]
-
Personalized Response Generation via Generative Split Memory Network
NAACL, 2021. [pdf]
-
The Importance of Modeling Social Factors of Language: Theory and Practice
NAACL, 2021. [pdf]
-
Weakly-Supervised Hierarchical Models for Predicting Persuasive Strategies in Good-faith Textual Requests
AAAI, 2021. [pdf]
-
Tuiteamos o pongamos un tuit? Investigating the Social Constraints of Loanword Integration in Spanish Social Media
The Society for Computation in Linguistics (SCiL), 2021. [pdf]
-
Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization
-
Local Additivity Based Data Augmentation for Semi-supervised NER
EMNLP, 2020. [pdf]
-
Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection
EMNLP, 2020. [pdf]
-
ToTTo: A Controlled Table-To-Text Generation Dataset
EMNLP, 2020. [pdf]
-
Examining the Ordering of Rhetorical Strategies in Persuasive Requests
-
MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification
ACL, 2020, [pdf]
-
Characterizing Collective Attention via Descriptor Context: A Case Study of Public Discussions of Crisis Events
ICWSM, 2020, [pdf]
-
Automatically Neutralizing Subjective Bias in Text
AAAI, 2020, oral [pdf]
-
Successful Online Socialization: Lessons from the Wikipedia Education Program
CSCW, 2020 [pdf]
Best Paper Honorable Mention
-
Multi-level Modeling of Social Roles in Online Micro-lending Platforms
CSCW, 2019 [pdf]
-
Modeling Persuasive Strategies via Semi-Supervised Neural Nets on Crowdfunding Platforms
NAACL, 2019, oral [pdf]
-
Seekers, Providers, Welcomers, and Storytellers: Modeling Social Roles in Online Health Communities
CHI, 2019 [pdf] [supplement]
Best Paper Honorable Mention
-
The Channel Matters: Self-disclosure, Reciprocity and Social Support in Online Cancer Support Groups
CHI, 2019 [pdf]
Best Paper Honorable Mention
-
Persuading Teammates to Give: Systematic versus Heuristic Cues for Soliciting Loans
CSCW, 2018. [pdf]
-
Identifying Semantic Edit Intentions from Revisions in Wikipedia
-
Commitment of Newcomers and Old-timers to Online Health Support Communities
CHI, 2017. [pdf]
-
Who does What: Editor Role Identification in Wikipedia
ICWSM, 2016. [pdf]
Best Paper Honorable Mention
In the news: [CMU LTI], [Wikimedia Newsletter]
-
Hierarchical Attention Networks for Document Classification
NAACL, 2016. [pdf]
-
Exploring the Effect of Student Confusion in Massive Open Online Courses
Journal of Educational Data Mining (JEDM). [pdf]
-
Humor Recognition and Humor Anchor Extraction
EMNLP 2015, oral. [pdf]
-
Weakly Supervised Role Identification in Teamwork Interactions
ACL 2015, oral. [pdf]
-
That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets
Notable Data Set Award
-
Incorporating Word Correlation Knowledge into Topic Modeling
NAACL, 2015. [pdf]
-
Local Implicit Feedback Mining for Music Recommendation
RecSys 2012. [pdf]
Workshops and Posters
-
Personalized Response Generation with Tensor Factorization
The GEM workshop at ACL 2021. [pdf]
-
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
The GEM workshop at ACL 2021. [pdf]
-
Putting Humans in the Natural Language Processing Loop: A Survey
HCI+NLP workshop at EACL 2021. [pdf]
-
This is a Problem, Don’t You Agree? Framing and Bias in Human Evaluation for Natural Language Generation
Workshop on Evaluating NLG Evaluation at INLG 2020. [pdf]