Matei Zaharia

Associate Professor, Computer Science
matei@cs.stanford.edu
Google Scholar | LinkedIn | Twitter
Office: Gates 428

I’m an associate professor at Stanford CS, where I work on computer systems and machine learning as part of Stanford DAWN. I’m also co-founder and Chief Technologist of Databricks, a data and AI platform startup.

Interests: I’m interested in computer systems for emerging large-scale workloads such as machine learning, big data analytics and cloud computing. In DAWN, we’re working on infrastructure for usable machine learning to make it dramatically easier to bring ML applications to production: these issues are often much larger obstacles than ML algorithms in practice. My work includes software runtimes, quality assurance tools and systems optimizations for ML. I am also interested in data privacy as the flipside to big data, and have worked on systems that can provide scalable privacy for communication, Internet queries and SaaS applications.

Impact: Our group works closely with the open source community to test and publish our ideas. During my PhD, I started the Apache Spark project, which is now one of the most widely used frameworks for distributed data processing, and co-started other widely used datacenter software such as Apache Mesos, Alluxio, and Spark Streaming. At Stanford, we developed DAWNBench, a machine learning performance competition that drew submissions from the top industry groups and influenced the industry-standard MLPerf, and we are continuing to develop open source software such as Weld, NoScope, FlexFlow, and ColBERT.

Some of our work has been featured in Wired (1/2/3), Fortune, TechCrunch, The Wall Street Journal, The Register, Ars Technica, Motherboard, ZDNet, The Economist, and Forbes.

Teaching

PhD Students

Past PhD Students and Postdocs

Publications

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

Full Publication List

Recent Preprints

Awards

Service

Open Source

Almost all of my work is open source:

More recent projects are available on the Weld and FutureData websites.

Adapted from a template by Andreas Viklund. Photo by Hector Garcia-Molina.