- Certified defenses for data poisoning attacks.
*arXiv preprint arXiv:1706.03691,* 2017. [bib] [paper]

- Understanding black-box predictions via influence functions.
*International Conference on Machine Learning (ICML),* 2017. [bib] [paper] [CodaLab]

- Convexified convolutional neural networks.
*International Conference on Machine Learning (ICML),* 2017. [bib] [paper]

- Developing bug-free machine learning systems with formal mathematics.
*International Conference on Machine Learning (ICML),* 2017. [bib] [code]

- World of bits: an open-domain platform for web-based agents.
*International Conference on Machine Learning (ICML),* 2017. [bib]

- A hitting time analysis of stochastic gradient Langevin dynamics.
*Conference on Learning Theory (COLT),* 2017. **Best paper award**. [bib] [paper]

- Prediction with a short memory.
*arXiv preprint arXiv:1612.02526,* 2017. [bib] [paper]

- Synthesizing program input grammars.
*Programming Language Design and Implementation (PLDI),* 2017. [bib] [paper]

- Naturalizing a programming language via interactive learning.
*Association for Computational Linguistics (ACL),* 2017. [bib] [paper] [project] [CodaLab]

- Learning symmetric collaborative dialogue agents with dynamic knowledge graph embeddings.
*Association for Computational Linguistics (ACL),* 2017. [bib] [paper] [CodaLab]

- From language to programs: bridging reinforcement learning and maximum marginal likelihood.
*Association for Computational Linguistics (ACL),* 2017. [bib] [paper] [CodaLab]

- Unsupervised risk estimation using only conditional independence structure.
*Advances in Neural Information Processing Systems (NIPS),* 2016. [bib] [paper]

- SQuAD: 100,000+ questions for machine comprehension of text.
*Empirical Methods in Natural Language Processing (EMNLP),* 2016. **Best resource paper award**. [bib] [paper] [CodaLab]

- Learning language games through interaction.
*Association for Computational Linguistics (ACL),* 2016. **Outstanding paper award**. [bib] [paper] [CodaLab]

- Data recombination for neural semantic parsing.
*Association for Computational Linguistics (ACL),* 2016. [bib] [paper] [CodaLab]

- Simpler context-dependent logical forms via model projections.
*Association for Computational Linguistics (ACL),* 2016. [bib] [paper] [CodaLab]

- Inferring logical forms from denotations.
*Association for Computational Linguistics (ACL),* 2016. [bib] [paper] [CodaLab]

- Unanimous prediction for 100% precision with application to learning semantic mappings.
*Association for Computational Linguistics (ACL),* 2016. [bib] [paper] [CodaLab]

- How much is 131 million dollars? Putting numbers in perspective with compositional descriptions.
*Association for Computational Linguistics (ACL),* 2016. [bib] [paper] [CodaLab]

- Estimation from indirect supervision with linear moments.
*International Conference on Machine Learning (ICML),* 2016. [bib] [paper] [CodaLab]

- Data augmentation via Lévy processes.
*Perturbations, Optimization and Statistics,* 2016. [bib] [paper] [code]

- Learning executable semantic parsers for natural language understanding.
*Communications of the ACM,* 2016. [bib] [paper]

- Imitation learning of agenda-based semantic parsers.
*Transactions of the Association for Computational Linguistics (TACL),* 2015. [bib] [paper] [project] [CodaLab]

- Learning with relaxed supervision.
*Advances in Neural Information Processing Systems (NIPS),* 2015. [bib] [paper] [CodaLab]

- Estimating mixture models via mixture of polynomials.
*Advances in Neural Information Processing Systems (NIPS),* 2015. [bib] [paper] [CodaLab]

- On-the-job learning with Bayesian decision theory.
*Advances in Neural Information Processing Systems (NIPS),* 2015. [bib] [paper] [CodaLab]

- Calibrated structured prediction.
*Advances in Neural Information Processing Systems (NIPS),* 2015. [bib] [paper] [CodaLab]

- Traversing knowledge graphs in vector space.
*Empirical Methods in Natural Language Processing (EMNLP),* 2015. **Best paper honorable mention**. [bib] [paper] [CodaLab]

- Compositional semantic parsing on semi-structured tables.
*Association for Computational Linguistics (ACL),* 2015. [bib] [paper] [CodaLab]

- Environment-driven lexicon induction for high-level instructions.
*Association for Computational Linguistics (ACL),* 2015. [bib] [paper] [CodaLab]

- Reified context models.
*International Conference on Machine Learning (ICML),* 2015. [bib] [paper] [CodaLab]

- Learning fast-mixing models for structured prediction.
*International Conference on Machine Learning (ICML),* 2015. [bib] [paper] [CodaLab]

- Learning where to sample in structured prediction.
*Artificial Intelligence and Statistics (AISTATS),* 2015. [bib] [paper] [CodaLab]

- Tensor factorization via matrix factorization.
*Artificial Intelligence and Statistics (AISTATS),* 2015. [bib] [paper] [CodaLab]

- Bringing machine learning and compositional semantics together.
*Annual Reviews of Linguistics,* 2015. [bib] [paper]

- Building a semantic parser overnight.
*Association for Computational Linguistics (ACL),* 2015. [bib] [paper] [project] [CodaLab]

- Talking to computers in natural language.
*XRDS: Crossroads, The ACM Magazine for Students,* 2014. [bib] [paper]

- Semantic parsing via paraphrasing.
*Association for Computational Linguistics (ACL),* 2014. **Best long paper honorable mention**. [bib] [paper] [project]

- Zero-shot entity extraction from web pages.
*Association for Computational Linguistics (ACL),* 2014. [bib] [paper] [slides] [project]

- Estimating latent-variable graphical models using moments and likelihoods.
*International Conference on Machine Learning (ICML),* 2014. [bib] [paper] [slides]

- Adaptivity and optimism: an improved exponentiated gradient algorithm.
*International Conference on Machine Learning (ICML),* 2014. [bib] [paper]

- Filtering with abstract particles.
*International Conference on Machine Learning (ICML),* 2014. [bib] [paper] [supplemental material]

- Altitude training: strong bounds for single-layer dropout.
*Advances in Neural Information Processing Systems (NIPS),* 2014. [bib] [paper]

- Simple MAP inference via low-rank relaxations.
*Advances in Neural Information Processing Systems (NIPS),* 2014. [bib] [paper] [CodaLab]

- Relaxations for inference in restricted Boltzmann machines.
*International Conference on Learning Representations Workshop (ICLR),* 2014. [bib] [paper]

- Linking people with "their" names using coreference resolution.
*European Conference on Computer Vision (ECCV),* 2014. [bib] [paper] [supplemental material]

- The statistics of streaming sparse regression.
*arXiv preprint arXiv:1412.4182,* 2014. [bib] [paper]

- Semantic parsing on Freebase from question-answer pairs.
*Empirical Methods in Natural Language Processing (EMNLP),* 2013. [bib] [paper] [supplemental material] [slides] [project]

- Feature noising for log-linear structured prediction.
*Empirical Methods in Natural Language Processing (EMNLP),* 2013. [bib] [paper]

- Dropout training as adaptive regularization.
*Advances in Neural Information Processing Systems (NIPS),* 2013. [bib] [paper] [poster]

- Spectral experts for estimating mixtures of linear regressions.
*International Conference on Machine Learning (ICML),* 2013. [bib] [paper]

- Video event understanding using natural language descriptions.
*International Conference on Computer Vision (ICCV),* 2013. [bib] [paper]

- A data driven approach for algebraic loop invariants.
*European Symposium on Programming (ESOP),* 2013. [bib] [paper]

- Lambda dependency-based compositional semantics.
*arXiv preprint arXiv:1309.4408,* 2013. [bib] [paper]

- Identifiability and unmixing of latent parse trees.
*Advances in Neural Information Processing Systems (NIPS),* 2012. [abstract] [bib] [paper] [techreport] [poster]

- Learning dependency-based compositional semantics.
*Association for Computational Linguistics (ACL),* 2011. [abstract] [brief] [bib] [paper] [thesis] [journal] [slides] [code]

- Scaling up abstraction refinement via pruning.
*Programming Language Design and Implementation (PLDI),* 2011. [abstract] [brief] [bib] [paper] [slides]

- Learning minimal abstractions.
*Principles of Programming Languages (POPL),* 2011. [abstract] [brief] [bib] [paper] [slides]

- A game-theoretic approach to generating spatial descriptions.
*Empirical Methods in Natural Language Processing (EMNLP),* 2010. [abstract] [brief] [bib] [paper] [slides]

- A simple domain-independent probabilistic approach to generation.
*Empirical Methods in Natural Language Processing (EMNLP),* 2010. [abstract] [brief] [bib] [paper] [slides]

- A dynamic evaluation of static heap abstractions.
*Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA),* 2010. [abstract] [brief] [bib] [paper] [slides]

- Learning programs: a hierarchical Bayesian approach.
*International Conference on Machine Learning (ICML),* 2010. [abstract] [brief] [bib] [paper] [slides] [code]

- On the interaction between norm and dimensionality: multiple regimes in learning.
*International Conference on Machine Learning (ICML),* 2010. [abstract] [brief] [bib] [paper] [slides]

- Type-based MCMC.
*North American Association for Computational Linguistics (NAACL),* 2010. [abstract] [brief] [bib] [paper] [slides] [code]

- Asymptotically optimal regularization in smooth parametric models.
*Advances in Neural Information Processing Systems (NIPS),* 2009. [abstract] [brief] [bib] [paper] [techreport] [poster]

- Probabilistic grammars and hierarchical Dirichlet processes.
*The Oxford Handbook of Applied Bayesian Analysis,* 2009. [abstract] [brief] [bib] [paper]

- Learning semantic correspondences with less supervision.
*Association for Computational Linguistics and International Joint Conference on Natural Language Processing (ACL-IJCNLP),* 2009. [abstract] [brief] [bib] [paper] [slides] [code] [data]

- Learning from measurements in exponential families.
*International Conference on Machine Learning (ICML),* 2009. [abstract] [brief] [bib] [paper] [slides]

- Online EM for unsupervised models.
*North American Association for Computational Linguistics (NAACL),* 2009. [abstract] [brief] [bib] [paper] [slides] [code]

- An asymptotic analysis of generative, discriminative, and pseudolikelihood estimators.
*International Conference on Machine Learning (ICML),* 2008. **Best student paper award**. [abstract] [brief] [bib] [paper] [slides]

- Structure compilation: trading structure for features.
*International Conference on Machine Learning (ICML),* 2008. [abstract] [brief] [bib] [paper] [slides]

- Analyzing the errors of unsupervised learning.
*Human Language Technology and Association for Computational Linguistics (HLT/ACL),* 2008. [abstract] [brief] [bib] [paper] [slides]

- Learning bilingual lexicons from monolingual corpora.
*Human Language Technology and Association for Computational Linguistics (HLT/ACL),* 2008. [abstract] [brief] [bib] [paper] [code]

- Agreement-based learning.
*Advances in Neural Information Processing Systems (NIPS),* 2008. [abstract] [brief] [bib] [paper] [poster]

- A probabilistic approach to language change.
*Advances in Neural Information Processing Systems (NIPS),* 2008. [abstract] [brief] [bib] [paper] [poster]

- Structured Bayesian nonparametric models with variational inference (tutorial).
*Association for Computational Linguistics (ACL),* 2007. [bib] [paper] [slides]

- A permutation-augmented sampler for Dirichlet process mixture models.
*International Conference on Machine Learning (ICML),* 2007. [abstract] [brief] [bib] [paper] [slides]

- The infinite PCFG using hierarchical Dirichlet processes.
*Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP/CoNLL),* 2007. [abstract] [brief] [bib] [paper] [slides]

- A probabilistic approach to diachronic phonology.
*Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP/CoNLL),* 2007. [abstract] [brief] [bib] [paper]

- An end-to-end discriminative approach to machine translation.
*International Conference on Computational Linguistics and Association for Computational Linguistics (COLING/ACL),* 2006. [abstract] [brief] [bib] [paper] [slides]

- Alignment by agreement.
*North American Association for Computational Linguistics (NAACL),* 2006. [abstract] [brief] [bib] [paper] [slides] [code]

- Semi-supervised learning for natural language.
*Massachusetts Institute of Technology,* 2005. [brief] [bib] [paper] [errata]

- A data structure for maintaining acyclicity in hypergraphs.
*Massachusetts Institute of Technology Technical Report,* 2005. [brief] [bib] [paper] [code]

- Linear programming in bounded tree-width Markov networks.
*Mathematical Programming for Data Mining and Machine Learning Workshop at McMaster University,* 2005. [bib] [slides] [code]

- Efficient geometric algorithms for parsing in two dimensions.
*International Conference on Document Analysis and Recognition (ICDAR),* 2005. [brief] [bib] [paper]

- Methods and experiments with bounded tree-width Markov networks.
*Massachusetts Institute of Technology Technical Report,* 2004. [brief] [bib] [paper] [code]

- How much of a hypertree can be captured by windmills?
*Massachusetts Institute of Technology Technical Report,* 2003. [brief] [bib] [paper] [code]