David Jurgens

It's me, David Jurgens!

I enable computers to understand people through their writings, decisions, and actions. My research combines linguistic analysis and network science together to insightfully characterize human populations from how they act and communicate social media, to create better interpretations of what people mean in their writing, and to design more fun and rewarding ways for humans to contribute their understanding of text.

Publications

2016
Citation Classification for Behavioral Analysis of a Scientific Field.
David Jurgens, Srijan Kumar, Raine Hoover, Dan McFarland, Dan Jurafsky.
in submission
pdf
User Migration in Online Social Networks: A Case Study on Reddit During A Period of Community Unrest.
Edward Newell*, David Jurgens*, Hardik Vala, Jad Sassine, Caitrin Armstrong, Derek Ruths and Haji Mohammad Saleem.
Proceedings of the 10th International AAAI Conference on Web and Social Media (ICWSM). 2016
pdf
Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel.
Hardik Vala, Stefan Dimitrov, David Jurgens, Andrew Piper and Derek Ruths.
Proceedings of the 10th edition of the Language Resources and Evaluation Conference (LREC). 2016.
pdf
Semi-supervised Learning with Induced Word Senses for State of the Art Word Sense Disambiguation.
Osman Baskaya and David Jurgens.
Journal of Artificial Intelligence Research (JAIR). 55(1) pp. 1025-1058.
pdf
SemEval-2016 Task 14: Semantic Taxonomy Enrichment.
David Jurgens and Mohammad Taher Pilehvar.
Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval). 2016.
pdf  ·  website
2015
Mr. Bennet, his coachman, and the Archbishop walk into a bar but only one of them gets recognized: On The Difficulty of Detecting Characters in Literary Texts.
Hardik Vala, David Jurgens, Andrew Piper, and Derek Ruths.
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). 2015.
pdf  ·  data
Evaluating learning language representations.
J. Karlgren, J. Callin, K. Collins-Thompson, A.C. Gyllensten, A. Ekgren, D. Jurgens, A. Korhonen, F. Olsson, M. Sahlgren, and H. Schütze.
Proceedings of Conference and Labs of Evaluation Forum (CLEF). 2015.
pdf
Reading Between the Lines: Overcoming Data Sparsity for Accurate Classification of Lexical Relationships.
Silvia Necsulescu, Sara Mendes, David Jurgens, Núria Bel, and Roberto Navigli.
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics (*SEM). 2015.
pdf
Everyone's Invited: A New Paradigm For Evaluation on Non-transferable Datasets.
David Jurgens, Tyler Finethy, Caitrin Armstrong, and Derek Ruths.
Proceedings of the ICWSM Workshop on Standards and Practices in Large-Scale Social Media Research. 2015.
pdf  ·  code  ·  FREESR code  ·  FREESR website  ·  project website
Geolocation Prediction in Twitter Using Social Networks: A Critical Analysis and Review of Current Practice.
David Jurgens, Tyler Finethy, James McCorriston, Yi Tian Xu, and Derek Ruths.
Proceedings of the 9th International AAAI Conference on Web and Social Media (ICWSM). 2015
pdf  ·  poster  ·  code  ·  website
An Analysis of Exercising Behavior in Online Populations.
David Jurgens, James McCorriston, and Derek Ruths.
Proceedings of the 9th International AAAI Conference on Web and Social Media (ICWSM). 2015
pdf  ·  poster  ·  website
Organizations are Users Too: Characterizing and Detecting the Presence of Organizations on Twitter.
James McCorriston, David Jurgens, and Derek Ruths.
Proceedings of the 9th International AAAI Conference on Web and Social Media (ICWSM). 2015
pdf  ·  website  ·  code
Cross Level Semantic Similarity: An Evaluation Framework for Universal Measures of Similarity.
David Jurgens, Mohammad Taher Pilehvar, and Roberto Navilgi.
Journal of Language Resources and Evaluation. 50(1) pp. 5-30.
pdf (preprint)
Reserating the awesometastic: An automatic extension of the WordNet taxonomy for novel terms.
David Jurgens and Mohammad Taher Pilehvar.
Proceeding of the Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL-HLT). 2015.
pdf  ·  poster  ·  download  ·  website
2014
It's All Fun and Games until Someone Annotates: Video Games with a Purpose for Linguistic Annotation.
David Jurgens and Roberto Navigli.
Transactions of the Association for Computational Linguistics (TACL) 2014.
pdf  ·  slides: pdf, pptx  ·  games!
Geotagging One Hundred Million Twitter Accounts with Total Variation Minimization.
Ryan Compton, David Jurgens, and David Allan.
Proceedings of the IEEE International Conference on Big Data. 2014.
pdf
Press: Forbes, MIT Technology Review, Business Insider, Daily Caller, Schneier on Security
Twitter users #CodeSwitch hashtags! #MoltoImportante #wow #헐.
David Jurgens, Stefan Dimitrov, and Derek Ruths.
Proceedings of The First Workshop on Computational Approaches to Code Switching. 2014.
pdf  ·  blog post
SemEval-2014 Task 3: Cross-Level Semantic Similarity.
David Jurgens, Mohammad Taher Pilehvar, and Roberto Navigli.
Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval) 2014.
pdf  ·  slides  ·  website
Validating and Extending Semantic Knowledge Bases using Video Games with a Purpose.
Daniele Vannella, David Jurgens, Daniele Scarfini, Domenico Toscani, and Roberto Navigli.
Proceedings of the Annual Meeting for the Association for Computational Linguistics (ACL) 2014.
pdf  ·  poster  ·  games!
An analysis of ambiguity in word sense annotations.
David Jurgens.
Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC) 2014.
pdf
2013
Align, Disambiguate and Walk: A Unified Approach for Measuring Semantic Similarity.
(Best paper nominee)
Mohammad T. Pilehvar, David Jurgens, and Roberto Navigli.
Proceedings of the Annual Meeting for the Association for Computational Linguistics (ACL) 2013.
pdf  ·  slides  ·  code
That's what friends are for: Inferring location in online communities based on social relationships.
David Jurgens.
Proceedings of the 7th International AAAI Conference on Weblogs and Social Media (ICWSM) 2013.
pdf  ·  slides  ·  video  ·  Press: Follow the Crowd, MIT Technology Review
Embracing Ambiguity: A Comparison of Annotation Methodologies for Crowdsourcing Word Sense Labels.
David Jurgens.
Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL) 2013.
pdf  ·  poster
Characterizing Online Discussions in Microblogs Using Network Analysis.
Veronika Strnadova, David Jurgens, and Tsai-Ching Lu.
Proceedings of the AAAI Spring Symposium on Analyzing Microtext, 2013.
pdf
SemEval-2013 Task 13: Word Sense Induction for Graded and Non-Graded Senses.
David Jurgens and Ioannis Klapaftis.
Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval) 2013.
pdf  ·  errata  ·  slides  ·  website
SemEval-2013 Task 12: Multilingual Word Sense Disambiguation.
Roberto Navigli, David Jurgens, and Daniele Vanilla.
Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval) 2013.
pdf  ·  website
2012
Temporal Motifs Reveal the Dynamics of Editor Interactions in Wikipedia.
David Jurgens and Tsai-Ching Lu.
Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (ICWSM) 2012.
pdf  ·  video
Semeval-2012 task 2: Measuring degrees of relational similarity.
David Jurgens, Saif M Mohammad, Peter D Turney, and Keith J Holyoak.
Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval), 2012.
pdf  ·  slides
An Evaluation of Graded Sense Disambiguation using Word Sense Induction.
David Jurgens.
Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM), 2012.
pdf  ·  slides
Friends, Enemies, and Lovers: Detecting Communities in Networks Where Relationships Matter.
David Jurgens and Tsai-Ching Lu.
Proceedings of Web Science, 2012.
pdf
2011
Word sense induction by community detection.
David Jurgens.
Proceedings of the Workshop on Graph-based Methods for Natural Language Processing (TextGraphs), 2011.
pdf
Measuring the impact of sense similarity on word sense induction.
David Jurgens and Keith Stevens.
Proceedings of the First Workshop on Unsupervised Learning in NLP, 2011.
pdf
2010
The S-Space Package: An Open Source Package for Word Space Models.
David Jurgens and Keith Stevens.
Proceedings of the ACL 2010 System Demonstrations, 2010.
pdf  ·  website  ·  Mailing Lists: Users, Developers
Capturing nonlinear structure in word spaces through dimensionality reduction.
David Jurgens and Keith Stevens.
Proceedings of the ACL Workshop on GEometrical Models of Natural Language (GEMS), 2010.
pdf
HERMIT: Flexible clustering for the SemEval-2 WSI task.
David Jurgens and Keith Stevens.
Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval), 2010.
pdf
2009
Event detection in blogs using temporal random indexing.
David Jurgens and Keith Stevens.
Proceedings of the Workshop on Events in Emerging Text Types, 2009.
pdf
2004
Road extraction from motion cues in aerial video.
Robert Pless and David Jurgens.
Proceedings of the 12th annual ACM international workshop on Geographic information systems, 2004.
pdf

Biographical Sketch

David Jurgens is a postdoctoral scholar, jointly in the the Stanford NLP and SNAP Groups under Dan Jurafsky, Jure Leskovec and Dan McFarland. His research interests span Natural Language Processing, Network Science and Computational Social Science.

Before joining the NLP Group, he was a postdoctoral scholar at McGill University in the Network Dynamics group with Derek Ruths. Prior, he was a research scientist at the Linguistics Computing Laboratory at Sapienza University under Roberto Navigli and before that, was a visiting researcher at the Information and Systems Science Lab at HRL Laboratories. He received his PhD in Computer Science from the University of California, Los Angeles under Michael Dyer. He received his BA in Philosophy and Political Science and an MS in Computer Science on Computer Vision under Robert Pless from Washington University in St. Louis.

Activities

Current

  • Data Chair for ICWSM-2017
  • Co-chair of the International Workshop on NLP and Computational Social Science at ACL-2017 with Dirk Hovy, David Bamman, A. Seza Dogruoz, Brendan O'Connor, Oren Tsur, and Svitlana Volkova
  • Co-chair of the International Workshops on NLP and Computational Social Science at EMNLP-2016 and at WebSci-2016 with Dirk Hovy, David Bamman, A. Seza Dogruoz, Jacob Eisenstein, Brendan O'Connor, Alice Oh, Oren Tsur, and Svitlana Volkova
  • Reviewing for 2016:
    • Senior Program Committee member for ICWSM.
    • Program Committee member in 2016 for WWW, NAACL, ACL, EMNLP, CSCW
    • External Reviewer for CHI.
    • Program committee for the Workshop on Multilingual and Crosslingual Methods in NLP, Workshop on Evaluating Vector-Space Representations for NLP, and Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages

Past

Research

My research enables computers to understand individuals and groups of people through language. Broadly, I conduct research in the areas of natural language processing (NLP), computational social science, and human computer interaction, drawing upon linguistic and graph-based techniques to help machines connect language and meaning. My research focuses on three central problems spanning these fields:

  • Who are the people communicating? Humans create nearly all of the text we see, yet often who is communicating is overlooked. I design new methods for identifying the demographics of communicators in order to understand large social systems.
  • What does a piece of text mean? Language is often ambiguous, especially when dealing with social media. I work to represent the semantics of all kinds of text to enable natural language understanding.
  • How to effectively create new annotated data? Many of the machine learning techniques used in my fields rely on annotation, yet the process for creating data is often difficult and laborious. I create new enjoyable and engaging annotation methods based on video games to more effectively incorporate humans in order to understand linguistic and social phenomena.

My long-term research goal combines human and language technologies to create social understanding that reflects both the content and the people involved in communication.


Stanford University

Postdoctoral Scholar
NLP and SNAP Groups
Department of Computer Science
Stanford University

jurgens@stanford.edu

Gates 405
Stanford CA 94305-9035


Curriculum Vitae
Google Scholar
Twitter


Software

CROWN
Geoinference
FREESR
S-Space Package
ADW
BabelNet
Knowledge Forge (Video Games with a Purpose)