About me

I am a staff research scientist at Google working on machine learning and computer vision. I am particularly interested in core scene and video understanding (e.g. object detection, instance segmentation, tracking), weak/self supervised learning and generative models. I live in Seattle and work from Google's Fremont office.

Prior to Google, I was a postdoctoral fellow working in the Computer Science Department at Stanford University and was supported by an NSF/CRA CI (Computing Innovations) fellowship. At Stanford I was a member of the Geometric Computation Group which is headed by Leonidas Guibas. I was also part of the Lytics Lab, a multidisciplinary group focused on Learning Analytics.

I received a Ph.D. in Robotics from the School of Computer Science at Carnegie Mellon University in 2011, where I worked with Carlos Guestrin. During graduate school, I was fortunate enough to spend two happy summers interning in Seattle, first with Intel Research working with Ali Rahimi, then at Microsoft Research working with Ashish Kapoor.

Before coming to CMU, I studied math (also) at Stanford University. And before Stanford, I attended Oakton High School in Vienna, Virginia, and for a time, also Lynbrook High School in San Jose, California.

Here is an "official" bio and photo.

Research Interests

My research interests in wordle form. The right wordle is generated from my most recent publications on online education and the left wordle is generated from my work on probabilistic inference and learning with combinatorially structured data. Note as of May 2018: these wordles are outdated and are not the best reflection of my research activities. And given that wordles themselves are quite outdated, I probably won't be refreshing these. Just keep an eye on my recent papers to get a sense of my recent work :)

I am interested in theoretical and applied problems in machine learning. My main interests lie in designing computationally efficient probabilistic reasoning and learning algorithms which allow computers to deal with the uncertainty and complexity inherent in real world data. My work has focused on tackling applications whose mathematical abstractions involve probabilistic reasoning with combinatorially structured objects such as matchings, rankings, and trees. These problems are challenging both statistically and computationally due to structural constraints (like mutual exclusivity) which cause interactions between objects that traditional techniques in machine learning have been ill-equipped to handle. Portions of my work thus address:

Compact, probabilistic formulations for reasoning jointly with large collections of structured data,
Efficient algorithms for reasoning and learning that exploit problem structure,
Theoretical analyses of computational and statistical complexity as well as approximation quality.

While being dedicated to pushing on core research problems, I am also committed to problems with real world applications and impact. My past work has contributed solutions to a variety of applications such as predicting preference over webpages and political elections, tracking with camera networks, and reconstructing temporal orderings of events (such as the onset of symptoms in neurodegenerative diseases) from noisy and incomplete data.

I now focus most of my energies on applications with educational impact. The recent surge in popularity of massive open online courses (MOOCs), with platforms such as Coursera and EdX, has made it possible for almost anyone to take free university courses. However while new technologies allow for scalable content delivery, we remain limited in our ability to scalably evaluate and give feedback for open-ended assignments. I approach these challenges fundamentally as machine learning (ML) problems, in which we can leverage the massive datasets now collected by online learning platforms. My work has thus focused on ML-driven education and has contributed algorithms for giving feedback in MOOCs via crowdsourcing or semi-automated methods.

Publications

The Auto Arborist Dataset: A Large-Scale Benchmark for Multiview Urban Forest Monitoring Under Domain Shift,
Sara Beery, Guanhang Wu, Trevor Edwards, Filip Pavetic, Bo Majewski, Shreyasee Mukherjee, Stanley Chan, John Morgan, Vivek Rathod, Jonathan Huang.
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022),
pp. 21294--21307, 2022.

pdf dataset blogpost abstract bibtex

Local Metrics for Multi-Object Tracking,
Jack Valmadre, Alex Bewley, Jonathan Huang, Chen Sun, Cristian Sminchisescu, Cordelia Schmid.
In arXiv preprint arXiv:2104.02631,
2021.

pdf code abstract bibtex

Perf-net: Pose empowered rgb-flow net,
Li, Yinxiao, Lu, Zhichao, Xiong, Xuehan, Huang, Jonathan.
In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2022),
pp. 513--522, 2022.

pdf abstract bibtex

The surprising impact of mask-head architecture on novel class segmentation,
Vighnesh Birodkar, Zhichao Lu, Siyang Li, Vivek Rathod, Jonathan Huang.
In International Conference on Computer Vision (ICCV 2021),
2021.

pdf code blogpost webpage demo video abstract bibtex

Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection,
Sara Beery, Guanhang Wu, Vivek Rathod, Ronny Votel, Jonathan Huang.
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020),
2020.

pdf code blogpost abstract bibtex

RetinaTrack: Online Single Stage Joint Detection and Tracking,
Zhichao Lu, Vivek Rathod, Ronny Votel, Jonathan Huang.
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020),
2020.

pdf abstract bibtex

Diverse Generation for Multi-Agent Sports Games,
Raymond Yeh, Alexander Schwing, Jonathan Huang, Kevin Murphy.
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019),
2019.

pdf webpage abstract bibtex

Learning to Segment via Cut-and-Paste,
Tal Remez, Jonathan Huang, Matthew Brown.
In European Conference on Computer Vision (ECCV 2018),
2018.

pdf abstract bibtex

Rethinking Spatiotemporal Feature Learning For Video Understanding,
Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, Kevin Murphy.
In European Conference on Computer Vision (ECCV 2018),
2018.

pdf abstract bibtex

Progressive neural architecture search,
Chenxi Liu, Barret Zoph, Jonathon Shlens, Wei Hua, Li-Jia Li, Fei-Fei Li, Alan Yuille, Jonathan Huang, Kevin Murphy.
In European Conference on Computer Vision (ECCV 2018),
2018.

pdf code blogpost abstract bibtex

Generative Models of Visually Grounded Imagination,
Ramakrishna Vedantam, Ian Fischer, Jonathan Huang, Kevin Murphy.
In International Conference on Learning Representations (ICLR 2018),
2018.

pdf code abstract bibtex

Spatially Adaptive Computation Time for Residual Networks,
Michael Figurnov, Maxwell Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov.
In The 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR),
2017.

pdf code abstract bibtex

Speed/accuracy trade-offs for modern convolutional object detectors,
Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy.
In The 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR),
2017.

pdf spotlight code abstract bibtex

Efficient inference in occlusion-aware generative models of images,
Jonathan Huang, Kevin Murphy.
In The 29th IEEE Conference on Computer Vision and Pattern Recognition (CVPR),
Las Vegas, Nevada, June, 2016.

pdf abstract bibtex

Generation and Comprehension of Unambiguous Object Descriptions,
Junhua Mao, Jonathan Huang, Alexander Toshev, Oana Camburu, Alan Yuille, Kevin Murphy.
In The 29th IEEE Conference on Computer Vision and Pattern Recognition (CVPR),
Las Vegas, Nevada, June, 2016.

pdf dataset slides abstract bibtex

Detecting events and key actors in multi-person videos,
Vignesh Ramanathan, Jonathan Huang, Sami Abu-El-Haija, Alexander Gorban, Kevin Murphy, Li Fei-Fei.
In The 29th IEEE Conference on Computer Vision and Pattern Recognition (CVPR),
Las Vegas, Nevada, June, 2016.

pdf dataset abstract bibtex

Im2Calories: towards an automated mobile vision food diary,
Austin Myers, Nick Johnston, Vivek Rathod, Anoop Korattikara, Alex Gorban, Nathan Silberman, Sergio Guadarrama, George Papandreou, Jonathan Huang, Kevin Murphy.
In International Conference on Computer Vision (ICCV),
Santiago, Chile, December, 2015.

pdf dataset abstract bibtex

Deep Knowledge Tracing,
Chris Piech, Jonathan Spencer, Jonathan Huang, Surya Ganguli, Mehran Sahami, Leonidas Guibas, Jascha Sohl-Dickstein.
In Neural Information Processing Systems (NIPS),
Montreal, Canada, December, 2015.

pdf code abstract bibtex

Learning Program Embeddings to Propagate Feedback,
Chris Piech, Jonathan Huang, Andy Nguyen, Mike Phulsuksombati, Mehran Sahami, Leonidas Guibas.
In International Conference on Machine Learning (ICML 2015),
Lille, France, July, 2015.

pdf dataset abstract bibtex

What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision,
Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nick Johnston, Andrew Rabinovich, Kevin Murphy.
In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL),
Denver, Colorado, 2015.

pdf dataset abstract bibtex

Multiple Orderings of Events in Disease Progression,
Alexandra Young, Neil Oxtoby, Jonathan Huang, Razvan Marinescu, Pankag Daga, David Cash, Nick Fox, Sebastien Ourselin, Daniel Alexander.
In Information Processing in Medical Imaging (IPMI),
Isle of Skye, Scotland, 2015.

pdf abstract bibtex

Autonomously Generating Hints by Inferring Problem Solving Policies,
Chris Piech, Mehran Sahami, Jonathan Huang, Leonidas Guibas.
In ACM Conference on Learning at Scale (LAS'15),
Vancouver, Canada, 2015.

pdf abstract bibtex

Codewebs: Scalable Homework Search for Massive Open Online Programming Courses,
Andy Nguyen, Christopher Piech, Jonathan Huang, Leonidas Guibas.
In The 23rd International World Wide Web Conference (WWW'14),
Seoul, Korea, 2014.

pdf slides abstract bibtex

Superposter behavior in MOOC forums,
Jonathan Huang, Anirban Dasgupta, Arpita Ghosh, Jane Manning, Marc Sanders.
In ACM Conference on Learning at Scale (LAS'14),
Atlanta, Georgia, 2014.

pdf slides abstract bibtex

Tuned Models of Peer Assessment in MOOCs,
Chris Piech, Jonathan Huang, Zhenghao Chen, Chuong Do, Andrew Ng, Daphne Koller.
In Proceedings of the 6th International Conference on Educational Data Mining (EDM 2013),
Memphis, TN, July, 2013.

pdf appendix slides abstract bibtex

Syntactic and Functional Variability of a Million Code Submissions in a Machine Learning MOOC,
Jonathan Huang, Chris Piech, Andy Nguyen, Leonidas Guibas.
In Artificial Intelligence in Education (AIED) Workshop on MOOCs (MOOCshop),
Memphis, Tennessee, July, 2013.

pdf abstract bibtex

Probabilistic Event Cascades for Alzheimer's disease,
Jonathan Huang, Daniel Alexander.
In Neural Information Processing Systems (NIPS),
South Lake Tahoe, CA, December, 2012.

pdf poster abstract bibtex

Riffled Independence for Efficient Inference with Partial Ranking,
Jonathan Huang, Ashish Kapoor, Carlos Guestrin.
In Journal of Artificial Intelligence,
pp. 491-532, 2012.

pdf abstract bibtex

Uncovering the Riffled Independence Structure of Rankings,
Jonathan Huang, Carlos Guestrin.
In Electronic Journal of Statistics (EJS),
pp. 199-230, 2012.

pdf arXiv abstract bibtex

Probabilistic Reasoning and Learning on Permutations: Exploiting Structural Decompositions of the Symmetric Group,
Jonathan Huang.
Doctoral dissertation,
Carnegie Mellon University, 2011.

pdf slides abstract bibtex

Efficient Probabilistic Inference with Partial Ranking Queries,
Jonathan Huang, Ashish Kapoor, Carlos Guestrin.
In Conference on Uncertainty in Artificial Intelligence,
Barcelona, Spain, July, 2011.

pdf proofs abstract bibtex

Fourier-Information Duality in the Identity Management Problem,
Xiaoye Jiang, Jonathan Huang, Leonidas Guibas.
In The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML 2011),
Athens, Greece, September, 2011.

pdf abstract bibtex

Learning Hierarchical Riffle Independent Groupings from Rankings,
Jonathan Huang, Carlos Guestrin.
In International Conference on Machine Learning (ICML 2010),
Haifa, Israel, June, 2010.

pdf proofs slides poster abstract bibtex

Hilbert Space Embeddings of Conditional Distributions with Applications to Dynamical Systems,
Le Song, Jonathan Huang, Alex Smola, Kenji Fukumizu.
In International Conference on Machine Learning (ICML 2009),
Montreal, Canada, June, 2009.

pdf proofs slides poster abstract bibtex

Fourier Theoretic Probabilistic Inference over Permutations,
Jonathan Huang, Carlos Guestrin, Leonidas Guibas.
In Journal of Machine Learning Research (JMLR),
pp. 997-1070, May, 2009.

pdf abstract bibtex

Riffled Independence for Ranked Data,
Jonathan Huang, Carlos Guestrin.
In Advances in Neural Information Processing Systems (NIPS),
Vancouver, Canada, December, 2009.

pdf proofs spotlight audio slides poster abstract bibtex

Exploiting Probabilistic Independence for Permutations,
Jonathan Huang, Carlos Guestrin, Xiaoye Jiang, Leonidas Guibas.
In Artificial Intelligence and Statistics (AISTATS),
Clearwater Beach, Florida, April, 2009.

pdf proofs slides abstract bibtex

Efficient Inference for Distributions on Permutations,
Jonathan Huang, Carlos Guestrin, Leonidas Guibas.
In Advances in Neural Information Processing Systems (NIPS),
Vancouver, Canada, December, 2007.

pdf spotlight talk slides poster abstract bibtex

A database of vocal tract resonance trajectories for research in speech processing,
Li Deng, Xiaodong Cui, Robert Pruvenok, Jonathan Huang, Safiyy Momen, Yanyi Chen, Abeer Alwan.
In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006),
pp. 60--63, Toulouse, France, 2006.

pdf abstract bibtex

Code

Tensorflow Object Detection API
Jonathan Huang, Vivek Rathod, Derek Chow, Chen Sun, Menglong Zhu, Matthew Tang, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Jasper Uijlings, Viacheslav Kovalevskyi, and Kevin Murphy.
Open source framework built on top of TensorFlow that makes it easy to construct, train and deploy object detection models.

Github

PyMallows
Jonathan Huang
Python routines for fitting and simulating from a generalized Mallows model. Learning algorithms implemented for both full and partial rankings

download readme

PROPS: Probabilistic Reasoning on Permutations toolbox
Jonathan Leonard Long, Jonathan Huang,
C++/Python library for reasoning/learning with distributions on permutations.

download documentation webpage

Littlewood-Richardson rule
Jonathan Huang
A matlab implementation of the Littlewood-Richardson rule.

enumeratelrtabs.m lrtabvis.m

(LDA) Latent Dirichlet Allocation
Jonathan Huang and Tomasz Malisiewicz,
An implementation of the mean field inference/learning algorithms from Blei et al. (2003)

ldainference.m lda_param_est.m trainLDA.m

Sample output on 20 Newsgroups dataset: [Link]

Note: Every now and then, Tomasz and I get emails from people about this code. While we're always happy to help out, I would like to point out that we wrote this code many years ago. Nowadays it is much more popular (and effective) to use collapsed samplers or online algorithms over the mean field + variational EM algorithm that was proposed in the first LDA paper.

Matlab function for visualizing a 2d Dirichlet distribution,
Jonathan Huang

HLNfit,
Jonathan Huang and Tomasz Malisiewicz,
Code for fitting a Hierarchical Logistic Normal distribution.
There is also a Romanian translation by Maxim Petrenko - Software blogger

Talk videos

Deep learning for structured image understanding

Taller Deep Learning
CIMAT, Guanajuato, Mexico

Adaptive Fourier-Domain Inference on the Symmetric Group

Algebraic Methods in Machine Learning Workshop, NIPS '08
Whistler, Canada

Probability Distributions on Permutations: Compact Representations and Inference

Machine Learning Lunch Seminar, 2008
Carnegie Mellon University

Exploiting Independence and Its Generalizations for Reasoning about Permutation Data

Machine Learning Lunch Seminar, 2010
Carnegie Mellon University

Politics, Preferences and Permutations: Probabilistic Reasoning with Rankings

Seminar, 2011
Microsoft Research Cambridge (UK)

Data Driven Student Feedback for Programming Intensive MOOCs

MSR Latin American Faculty Summit, 2014
Viña del Mar, Chile

talk slides

Miscellaneous writeups, presentations, art

Codewebs
Check out our visualization of 40,000 Octave/Matlab implementations of linear regression! This is part of the Codewebs project for analyzing and providing detailed feedback to students in a programming based MOOC with Chris Piech, Andy Nguyen, and Leo Guibas. Data from Andrew Ng's course on Machine Learning offered through Coursera.
Also check out Ben Lorica's blog post, and Hal Hodson's article at New Scientist about our work!

ICML 2015 Workshop on Machine Learning for Education
I co-organized the ICML 2015 workshop on Machine Learning for Education with Richard Baraniuk, Emma Brunskill, Mihaela van der Schaar, Mike Mozer, Christoph Studer, Andrew Lan.

Those Chatty Seniors!
Read my post at Stanford Online's Signal blog (with Jane Manning and Marc Sanders) on: Those Chatty Seniors! in which we analyze and discuss the demographics of MOOC forum posters. (the tl;dr is that older people talk more). The Stanford Daily also covered our work in this article.

NIPS 2013 Workshop on Data Driven Education
I co-organized a NIPS 2013 workshop on Data Driven Education with Sumit Basu and Kalyan Veeramachaneni.

NIPS 2009 Learning with Orderings workshop
I co-organized a NIPS 2009 workshop on Learning with Orderings with Tiberio Caetano, Carlos Guestrin, Risi Kondor, Guy Lebanon, and Marina Meila.

"Bag of words" art installation at Gates-Hillman complex.
(joint work with Khalid El-Arini, Sue Ann Hong, Joseph Gonzalez)

photos abstract

Learning and Inference in Vision: from Features to Scene Understanding
Tomasz Malisiewicz and I gave a tutorial on vision at the MLD Student Research Symposium on November 13 (2011).

slides

Probabilistic Reasoning with Permutations: A Fourier-Theoretic Approach ,
Jonathan Huang,
My thesis proposal document.

pdf slides abstract

Hierarchical Logistic Normal parameter estimation,
Jonathan Huang, Tomasz Malisiewicz,
A project for Alyosha Efros's class on Learning based methods in computer vision. See our application to object recognition.

pdf

Maximum Likelihood Estimation of Dirichlet Distributions ,
Jonathan Huang,
Notes on several ways to numerically find the MLE of a Dirichlet Distribution. This was done for a Math Fundamentals for Robotics course taught by Mike Erdmann.

pdf

Sperner's Lemma ,
Jonathan Huang,
Some theorems/corollaries of Sperner's Lemma that I collected for a combinatorics class. Sperner is an easy combinatorial fact about labelings on a simplicial complex, but it has several surprising applications in topology and analysis. The famous Brouwer fixed point theorem, and the fundamental theorem of algebra are two of the examples that I discuss.

pdf errata

Notes on the Kalman Filter ,
Jonathan Huang,
A derivation of the Kalman filter updates. The notes try to mostly follow the development given by Drew Bagnell in the Statistical Techniques for Robotics class at CMU.

pdf

Cup Products in Computational Topology ,
Jonathan Huang,
Senior Honors Thesis (advisor: Gunnar Carlsson). We show an application of topological persistence to computing invariants related to the cohomology (cup product structure) of a finite simplicial complex.

pdf

Jonathan Chung-Kuan Huang

About me

Research Interests

Publications

Code

Talk videos

Deep learning for structured image understanding

Adaptive Fourier-Domain Inference on the Symmetric Group

Probability Distributions on Permutations: Compact Representations and Inference

Exploiting Independence and Its Generalizations for Reasoning about Permutation Data

Politics, Preferences and Permutations: Probabilistic Reasoning with Rankings

Data Driven Student Feedback for Programming Intensive MOOCs

Miscellaneous writeups, presentations, art