- kach@cs.********.edu
*(hint: the mascot is 🌲)* - CV / Google Scholar / Github / Twitter / Mastodon

I’m a PhD student at MIT CSAIL studying visual computing systems with Jonathan Ragan-Kelley and Josh Tenenbaum , because the strangeness of sensory perception affirms our shared humanity in this fractured world. Previously, I was an undergrad at Stanford University studying computer science, physics, and English literature — islands I hold to be the lost children of one great continent.

In the past I’ve worked at Facebook, NVIDIA, UW Seattle, and Berkeley. More recently, I spend my time writing (I love words), playing music, juggling with friends, and marveling at the world.

My research is generously supported by a Hertz Foundation Fellowship, the Paul & Daisy Soros Fellowship for New Americans, and the National Science Foundation’s GRFP [press]. If you are currently applying to graduate school, I encourage you to consider these wonderful programs. Also see the MIT EECS Graduate Application Assistance Program, which I lead.

(Click the boxes to open them and see some research highlights. For the rest, see my CV.)

Who optimizes a hyperparameter optimizer's hyper-hyperparameters?
What color is a shadow?
How would Wordle work in languages with different kinds of scripts? [press]
Are there other illusions like "The Dress" out there?
If neuron activations are indeed like toilet flushes (a common analogy), can we implement deep neural networks using plumbing components? [SIGTBD ’22]
Where does the factor of "2" in the stochastic coefficient of Langevin sampling come from?
Can we get better semantic HTML and CSS by using Generative Adversarial Web Development?
What shape is a snowflake?
Is it possible to find iterated function systems that produce fractals that look the way we want?
How can we prevent vector coordinate-system-mismatch bugs in 3D graphics code?
Can literature save the American tech industry?
Why won't my tensors flow?
Can you tune a guitar just by watching the strings vibrate?
Can one hear the fate of a coin?
What happens if we throw Sol LeWitt's sculptures into a tank of soapy water?
How can we build programs that require both forward- and reverse-mode automatic differentiation?
How can we make PyTorch tensors easier to use?
What does GPT-3 tell us about the nature of literature from a statistical or information-theoretic perspective? Following a Russian literary theory, can we use Shannon entropy to think about this quantitatively?
Can we create controlled moiré patterns by carefully perturbing misaligned grids?
Do many hands make light work?
Can we atavise Game-of-Life configurations by gradient descent? What's the connection to pufferfish?
What can we do with the hundreds of images of Hoover Tower you can find online?
Why are PCA-projections of trajectories loopy?
Can we optimize axis-aligned kD-trees by gradient descent?
Do eucalyptus trees have a “handedness”? If so, why?
How much should you wager in a *Jeopardy!* Daily Double?
Does AD-powered gradient descent on Lagrangian action create energy-conserving physics engines?
Does gradient descent eliminate the need for solvers in bidirectional programming environments?
What is the pattern behind the irregularly-spaced vanishing points of rows of trees in orchards?
What does “un-unravelable” really mean, and how?
Can we dream ambigrams by gradient descent?
Can we learn to walk by gradient descent?
What causes the 22° halo around the sun, and how can we efficiently render physically-accurate images of it?
Are there more "Laurel/Yanny" illusions out there?
How big a buffer do we need to be able to pair up an infinite number of socks coming in over a stream?
By how much do windmills slow down the wind?
What shapes emerge if we insist that a certain casino is the nearest to *everyone* in Los Angeles?
What is the optimal alphabet layout for the historic 1888 Crown Typewriter?
Which directed acyclic graphs can be represented by towers of stacked blocks?
Would triangular tesselations make better jigsaw puzzles?
Where are the best bathrooms at Stanford?
What is the analogue of the Curry-Howard correspondence for narrative theory?
What would it mean for a computer to synthesize a story?
What is the relationship between monadic I/O and the dataflow/patch programming paradigm?
Can we approximate Euler's constant using historic earthquake data?
How well do SAT solvers handle the constraints of Western classical music theory?
Is there a *symmetric* cryptographic algorithm for collusion with plausible deniability?
Can JavaScript be dynamically scoped... from within JavaScript?
How few words do you *really* need to communicate — and what's the minimal set?
Why are there 12 notes in a musical octave, and not, say, 19?
Are epicycles just Fourier transforms?
How well do voxel approximations to complex surfaces raytrace?
Which dimensional analysis problems are solvable?
Can poetry be a *subtractive*, rather than an additive, act?
Why is 2^10=1024 so near 10^3=1000, and are there better approximations?
How do automated theorem provers work, and what can they prove?
Can some sort of temporal RSS proxy help you catch up on old webcomics?
How fast can Arthur C. Clarke's famous spacecraft "Rama" accelerate?
Why are bubble tea straws pointy, and why is this absolutely fascinating?
If you played Scrabble with periodic table element tiles, what's the longest word you could make?
Is xkcd's random number generator *actually* random?
Could you play Tetris with true rigid-body physics?
Why do Scratch lists have odd performance behavior?
What shape is the shadow of a lampshade?