Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Conference on Neural Information Processing Systems (NeurIPS) - 2018

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue	Year	Papers	Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.	Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.	% Empirical Percentage of papers that are empirical research vs theoretical research.	% Industry Percentage of empirical research papers with at least one author from Industry.	Website
NeurIPS	2018	1009	0.45	3.13	90.19%	40.99%

Search Papers

	Pseudocode	Open Source Code	Open Datasets	Dataset Splits	Hardware Specification	Software Dependencies	Experiment Setup
$\ell_1$-regression with Heavy-tailed Distributions	❌	❌	❌	❌	❌	❌	❌	0
(Probably) Concave Graph Matching	✅	❌	✅	❌	✅	❌	✅	4
3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data	❌	✅	✅	✅	❌	❌	✅	4
3D-Aware Scene Manipulation via Inverse Graphics	❌	✅	✅	❌	❌	❌	✅	3
A Bandit Approach to Sequential Experimental Design with False Discovery Control	✅	❌	❌	❌	❌	❌	✅	2
A Bayes-Sard Cubature Method	❌	❌	❌	❌	❌	❌	✅	1
A Bayesian Approach to Generative Adversarial Imitation Learning	✅	❌	✅	❌	❌	❌	✅	3
A Bayesian Nonparametric View on Count-Min Sketch	❌	❌	✅	❌	❌	❌	✅	2
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization	✅	❌	✅	❌	❌	❌	❌	2
A Bridging Framework for Model Optimization and Deep Propagation	❌	❌	✅	❌	✅	❌	❌	2
A Convex Duality Framework for GANs	❌	❌	✅	❌	❌	❌	✅	2
A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents	✅	❌	❌	❌	❌	❌	❌	1
A Dual Framework for Low-rank Tensor Completion	✅	✅	✅	✅	❌	❌	✅	5
A Game-Theoretic Approach to Recommendation Systems with Strategic Content Providers	✅	❌	❌	❌	❌	❌	❌	1
A General Method for Amortizing Variational Filtering	✅	✅	✅	✅	❌	❌	✅	5
A Likelihood-Free Inference Framework for Population Genetic Data using Exchangeable Neural Networks	❌	✅	❌	✅	✅	❌	✅	4
A Linear Speedup Analysis of Distributed Deep Learning with Sparse and Quantized Communication	✅	❌	✅	❌	✅	✅	✅	5
A Lyapunov-based Approach to Safe Reinforcement Learning	✅	❌	❌	❌	❌	❌	✅	2
A Mathematical Model For Optimal Decisions In A Representative Democracy	❌	❌	❌	❌	❌	❌	❌	0
A Model for Learned Bloom Filters and Optimizing by Sandwiching	❌	❌	❌	❌	❌	❌	❌	0
A Neural Compositional Paradigm for Image Captioning	❌	❌	✅	✅	❌	❌	✅	3
A Practical Algorithm for Distributed Clustering and Outlier Detection	✅	❌	✅	❌	✅	❌	✅	4
A Probabilistic U-Net for Segmentation of Ambiguous Images	❌	✅	✅	✅	✅	❌	✅	5
A Reduction for Efficient LDA Topic Reconstruction	✅	✅	✅	❌	✅	❌	✅	5
A Retrieve-and-Edit Framework for Predicting Structured Outputs	❌	✅	✅	✅	❌	❌	✅	4
A Simple Cache Model for Image Recognition	❌	❌	✅	✅	❌	❌	✅	3
A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization	✅	❌	✅	❌	❌	❌	✅	3
A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks	✅	✅	✅	✅	❌	❌	❌	4
A Smoothed Analysis of the Greedy Algorithm for the Linear Contextual Bandit Problem	❌	❌	❌	❌	❌	❌	❌	0
A Smoother Way to Train Structured Prediction Models	✅	✅	✅	❌	❌	❌	✅	4
A Spectral View of Adversarially Robust Features	❌	❌	✅	❌	❌	❌	✅	2
A Statistical Recurrent Model on the Manifold of Symmetric Positive Definite Matrices	❌	✅	✅	✅	✅	❌	✅	5
A Stein variational Newton method	✅	✅	❌	❌	❌	❌	✅	3
A Structured Prediction Approach for Label Ranking	❌	✅	✅	✅	❌	❌	❌	3
A Theory-Based Evaluation of Nearest Neighbor Models Put Into Practice	✅	✅	✅	❌	✅	❌	✅	5
A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation	❌	✅	✅	❌	❌	❌	❌	2
A Unified Framework for Extensive-Form Game Abstraction with Bounds	❌	❌	❌	❌	❌	❌	❌	0
A Unified View of Piecewise Linear Neural Network Verification	✅	✅	✅	❌	✅	❌	✅	5
A convex program for bilinear inversion of sparse vectors	❌	❌	❌	❌	❌	❌	✅	1
A flexible model for training action localization with varying levels of supervision	❌	✅	✅	✅	✅	❌	✅	5
A loss framework for calibrated anomaly detection	❌	❌	❌	❌	❌	❌	❌	0
A no-regret generalization of hierarchical softmax to extreme multi-label classification	✅	✅	✅	❌	❌	❌	✅	4
A probabilistic population code based on neural samples	❌	❌	❌	❌	❌	❌	❌	0
A theory on the absence of spurious solutions for nonconvex and nonsmooth optimization	❌	❌	✅	❌	❌	❌	✅	2
ATOMO: Communication-efficient Learning via Atomic Sparsification	✅	✅	✅	❌	✅	❌	✅	5
A^2-Nets: Double Attention Networks	❌	❌	✅	✅	✅	❌	✅	4
Accelerated Stochastic Matrix Inversion: General Theory and Speeding up BFGS Rules for Faster Second-Order Optimization	✅	❌	✅	❌	❌	❌	✅	3
Acceleration through Optimistic No-Regret Dynamics	✅	❌	❌	❌	❌	❌	❌	1
Active Learning for Non-Parametric Regression Using Purely Random Trees	✅	✅	✅	❌	❌	❌	✅	4
Active Matting	❌	❌	✅	❌	✅	❌	✅	3
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments	✅	❌	✅	❌	❌	❌	✅	3
Adaptation to Easy Data in Prediction with Limited Advice	✅	❌	❌	❌	❌	❌	❌	1
Adapted Deep Embeddings: A Synthesis of Methods for k-Shot Inductive Transfer Learning	❌	✅	✅	✅	❌	❌	✅	4
Adaptive Learning with Unknown Information Flows	✅	❌	❌	❌	❌	❌	❌	1
Adaptive Methods for Nonconvex Optimization	✅	❌	✅	✅	✅	❌	✅	5
Adaptive Negative Curvature Descent with Applications in Non-convex Optimization	✅	❌	✅	❌	❌	❌	✅	3
Adaptive Online Learning in Dynamic Environments	✅	❌	❌	❌	❌	❌	❌	1
Adaptive Path-Integral Autoencoders: Representation Learning and Planning for Dynamical Systems	✅	✅	✅	❌	❌	❌	✅	4
Adaptive Sampling Towards Fast Graph Representation Learning	❌	❌	✅	❌	❌	❌	✅	2
Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models	✅	✅	❌	✅	❌	❌	✅	4
Adding One Neuron Can Eliminate All Bad Local Minima	❌	❌	❌	❌	❌	❌	❌	0
Adversarial Attacks on Stochastic Bandits	✅	❌	❌	❌	❌	❌	✅	2
Adversarial Examples that Fool both Computer Vision and Time-Limited Humans	❌	❌	✅	❌	❌	❌	✅	2
Adversarial Multiple Source Domain Adaptation	❌	❌	✅	❌	❌	❌	❌	1
Adversarial Regularizers in Inverse Problems	✅	✅	✅	❌	❌	❌	❌	3
Adversarial Risk and Robustness: General Definitions and Implications for the Uniform Distribution	❌	❌	❌	❌	❌	❌	✅	1
Adversarial Scene Editing: Automatic Object Removal from Weak Supervision	❌	❌	✅	❌	❌	❌	❌	1
Adversarial Text Generation via Feature-Mover's Distance	✅	❌	✅	❌	❌	❌	✅	3
Adversarial vulnerability for any classifier	❌	❌	✅	❌	❌	❌	✅	2
Adversarially Robust Generalization Requires More Data	❌	❌	✅	❌	❌	❌	✅	2
Adversarially Robust Optimization with Gaussian Processes	✅	❌	✅	❌	❌	❌	✅	3
Algebraic tests of general Gaussian latent tree models	✅	❌	❌	❌	❌	❌	✅	2
Algorithmic Assurance: An Active Approach to Algorithmic Testing using Bayesian Optimisation	✅	✅	✅	❌	❌	❌	✅	4
Algorithmic Linearly Constrained Gaussian Processes	❌	❌	❌	❌	❌	✅	✅	2
Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced	❌	❌	❌	❌	❌	❌	✅	1
Algorithms and Theory for Multiple-Source Adaptation	❌	❌	✅	❌	❌	❌	❌	1
Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs	✅	❌	❌	❌	✅	❌	✅	3
Alternating optimization of decision trees, with application to learning sparse oblique trees	✅	❌	✅	✅	✅	❌	✅	5
Amortized Inference Regularization	❌	❌	✅	❌	❌	❌	❌	1
An Efficient Pruning Algorithm for Robust Isotonic Regression	✅	❌	❌	❌	✅	❌	✅	3
An Improved Analysis of Alternating Minimization for Structured Multi-Response Regression	✅	❌	❌	❌	❌	❌	✅	2
An Information-Theoretic Analysis for Thompson Sampling with Many Actions	❌	❌	❌	❌	❌	❌	❌	0
An Off-policy Policy Gradient Theorem Using Emphatic Weightings	✅	❌	❌	❌	❌	❌	✅	2
An intriguing failing of convolutional neural networks and the CoordConv solution	❌	✅	✅	✅	❌	❌	❌	3
Analysis of Krylov Subspace Solutions of Regularized Non-Convex Quadratic Problems	❌	❌	❌	❌	❌	❌	❌	0
Analytic solution and stationary phase approximation for the Bayesian lasso and elastic net	❌	✅	✅	✅	❌	❌	✅	4
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog	✅	❌	✅	❌	❌	❌	✅	3
Approximate Knowledge Compilation by Online Collapsed Importance Sampling	✅	✅	✅	❌	❌	❌	✅	4
Approximating Real-Time Recurrent Learning with Random Kronecker Factors	✅	❌	✅	✅	❌	❌	✅	4
Approximation algorithms for stochastic clustering	✅	❌	❌	❌	❌	❌	❌	1
Are GANs Created Equal? A Large-Scale Study	❌	✅	✅	✅	✅	❌	✅	5
Are ResNets Provably Better than Linear Predictors?	❌	❌	❌	❌	❌	❌	❌	0
Assessing Generative Models via Precision and Recall	❌	✅	✅	✅	❌	❌	✅	4
Assessing the Scalability of Biologically-Motivated Deep Learning Algorithms and Architectures	✅	❌	✅	❌	✅	❌	✅	4
Asymptotic optimality of adaptive importance sampling	✅	✅	❌	❌	❌	❌	✅	3
Attacks Meet Interpretability: Attribute-steered Detection of Adversarial Samples	❌	✅	✅	✅	❌	✅	✅	5
Attention in Convolutional LSTM for Gesture Recognition	❌	✅	✅	❌	❌	❌	✅	3
Autoconj: Recognizing and Exploiting Conjugacy Without a Domain-Specific Language	❌	✅	❌	❌	❌	❌	✅	2
Automatic Program Synthesis of Long Programs with a Learned Garbage Collector	❌	✅	❌	❌	✅	❌	✅	3
Automatic differentiation in ML: Where we are and where we should be going	❌	✅	❌	❌	❌	❌	❌	1
Automating Bayesian optimization with Bayesian optimization	✅	✅	✅	❌	❌	❌	✅	4
BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training	✅	❌	✅	❌	✅	❌	✅	4
BRITS: Bidirectional Recurrent Imputation for Time Series	❌	✅	✅	✅	✅	❌	✅	5
BRUNO: A Deep Recurrent Model for Exchangeable Data	❌	✅	✅	✅	✅	❌	✅	5
Backpropagation with Callbacks: Foundations for Efficient and Expressive Differentiable Programming	✅	✅	✅	❌	✅	❌	✅	5
Balanced Policy Evaluation and Learning	❌	❌	✅	❌	❌	✅	✅	3
Banach Wasserstein GAN	❌	✅	✅	❌	❌	❌	✅	3
Bandit Learning in Concave N-Person Games	✅	❌	❌	❌	❌	❌	❌	1
Bandit Learning with Implicit Feedback	✅	✅	✅	❌	❌	❌	✅	4
Bandit Learning with Positive Externalities	✅	❌	❌	❌	❌	❌	✅	2
Batch-Instance Normalization for Adaptively Style-Invariant Neural Networks	❌	✅	✅	✅	❌	❌	✅	4
Bayesian Adversarial Learning	✅	❌	✅	✅	❌	❌	✅	4
Bayesian Alignments of Warped Multi-Output Gaussian Processes	❌	❌	❌	❌	❌	❌	❌	0
Bayesian Control of Large MDPs with Unknown Dynamics in Data-Poor Environments	✅	❌	❌	❌	❌	❌	✅	2
Bayesian Distributed Stochastic Gradient Descent	❌	❌	✅	✅	✅	❌	❌	3
Bayesian Inference of Temporal Task Specifications from Demonstrations	✅	❌	❌	❌	✅	❌	✅	3
Bayesian Model Selection Approach to Boundary Detection with Non-Local Priors	✅	✅	❌	❌	❌	❌	✅	3
Bayesian Model-Agnostic Meta-Learning	✅	❌	✅	✅	❌	❌	✅	4
Bayesian Nonparametric Spectral Estimation	❌	✅	✅	❌	❌	❌	✅	3
Bayesian Pose Graph Optimization via Bingham Distributions and Tempered Geodesic MCMC	❌	❌	✅	❌	❌	❌	✅	2
Bayesian Semi-supervised Learning with Graph Gaussian Processes	❌	❌	✅	✅	❌	❌	✅	3
Bayesian Structure Learning by Recursive Bootstrap	✅	❌	✅	❌	❌	❌	✅	3
Bayesian multi-domain learning for cancer subtype discovery from next-generation sequencing count data	❌	❌	✅	✅	❌	❌	✅	3
Beauty-in-averageness and its contextual modulations: A Bayesian statistical account	❌	❌	✅	❌	❌	❌	✅	2
Benefits of over-parameterization with EM	❌	❌	❌	❌	❌	❌	✅	1
Beyond Grids: Learning Graph Representations for Visual Recognition	❌	❌	✅	✅	❌	❌	✅	3
Beyond Log-concavity: Provable Guarantees for Sampling Multi-modal Distributions using Simulated Tempering Langevin Monte Carlo	✅	❌	❌	❌	❌	❌	❌	1
Bias and Generalization in Deep Generative Models: An Empirical Study	❌	✅	✅	❌	❌	❌	❌	2
Bilevel Distance Metric Learning for Robust Image Recognition	✅	❌	✅	✅	❌	❌	✅	4
Bilevel learning of the Group Lasso structure	✅	❌	✅	✅	❌	❌	✅	4
Bilinear Attention Networks	❌	❌	✅	❌	❌	❌	✅	2
BinGAN: Learning Compact Binary Descriptors with a Regularized GAN	❌	✅	✅	✅	❌	❌	✅	4
Binary Classification from Positive-Confidence Data	❌	✅	✅	❌	❌	❌	✅	3
Binary Rating Estimation with Graph Side Information	❌	❌	✅	❌	❌	❌	❌	1
Bipartite Stochastic Block Models with Tiny Clusters	✅	✅	✅	❌	✅	❌	✅	5
Blind Deconvolutional Phase Retrieval via Convex Programming	❌	❌	❌	❌	❌	❌	✅	1
Blockwise Parallel Decoding for Deep Autoregressive Models	✅	✅	✅	✅	✅	❌	✅	6
Boolean Decision Rules via Column Generation	❌	❌	✅	✅	❌	✅	✅	4
Boosted Sparse and Low-Rank Tensor Regression	✅	✅	✅	✅	✅	❌	✅	6
Boosting Black Box Variational Inference	✅	✅	✅	❌	❌	❌	✅	4
Bounded-Loss Private Prediction Markets	❌	❌	❌	❌	❌	❌	❌	0
BourGAN: Generative Networks with Metric Embeddings	❌	❌	✅	❌	❌	❌	✅	2
Breaking the Activation Function Bottleneck through Adaptive Parameterization	❌	✅	✅	✅	❌	❌	✅	4
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation	❌	❌	✅	❌	❌	❌	✅	2
Breaking the Span Assumption Yields Fast Finite-Sum Minimization	✅	❌	❌	❌	❌	❌	✅	2
But How Does It Work in Theory? Linear SVM with Random Features	✅	✅	❌	❌	❌	❌	✅	3
Byzantine Stochastic Gradient Descent	✅	❌	❌	❌	❌	❌	❌	1
COLA: Decentralized Linear Learning	✅	✅	✅	❌	✅	❌	❌	4
Can We Gain More from Orthogonality Regularizations in Training Deep Networks?	❌	✅	✅	✅	❌	❌	✅	4
CapProNet: Deep Feature Learning via Orthogonal Projections onto Capsule Subspaces	❌	✅	✅	✅	❌	❌	❌	3
CatBoost: unbiased boosting with categorical features	✅	✅	✅	❌	❌	❌	❌	3
Causal Discovery from Discrete Data using Hidden Compact Representation	✅	✅	✅	❌	❌	❌	✅	4
Causal Inference and Mechanism Clustering of A Mixture of Additive Noise Models	✅	✅	✅	❌	❌	❌	✅	4
Causal Inference via Kernel Deviance Measures	❌	❌	✅	❌	❌	❌	✅	2
Causal Inference with Noisy and Missing Covariates via Matrix Factorization	❌	❌	✅	✅	❌	❌	✅	3
Chain of Reasoning for Visual Question Answering	❌	❌	✅	✅	❌	❌	✅	3
Chaining Mutual Information and Tightening Generalization Bounds	❌	❌	❌	❌	❌	❌	❌	0
ChannelNets: Compact and Efficient Convolutional Neural Networks via Channel-Wise Convolutions	❌	❌	✅	✅	✅	❌	✅	4
Clebsch–Gordan Nets: a Fully Fourier Space Spherical Convolutional Neural Network	✅	❌	✅	❌	❌	❌	❌	2
Cluster Variational Approximations for Structure Learning of Continuous-Time Bayesian Networks from Incomplete Data	✅	✅	✅	❌	❌	❌	✅	4
Clustering Redemption–Beyond the Impossibility of Kleinberg’s Axioms	❌	❌	❌	❌	❌	❌	❌	0
Co-regularized Alignment for Unsupervised Domain Adaptation	❌	❌	✅	✅	❌	❌	✅	3
Co-teaching: Robust training of deep neural networks with extremely noisy labels	✅	✅	✅	❌	✅	❌	✅	5
Collaborative Learning for Deep Neural Networks	❌	❌	✅	✅	❌	❌	✅	3
Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search	❌	❌	✅	✅	✅	❌	✅	4
Communication Compression for Decentralized Training	✅	❌	✅	❌	✅	❌	✅	4
Communication Efficient Parallel Algorithms for Optimization on Manifolds	✅	✅	❌	❌	❌	❌	✅	3
Community Exploration: From Offline Optimization to Online Learning	✅	❌	❌	❌	❌	❌	❌	1
Compact Generalized Non-local Network	❌	✅	✅	✅	❌	❌	✅	4
Compact Representation of Uncertainty in Clustering	✅	❌	✅	❌	❌	❌	✅	3
Completing State Representations using Spectral Learning	✅	❌	✅	✅	❌	❌	✅	4
Complex Gated Recurrent Neural Networks	❌	✅	✅	✅	✅	❌	✅	5
Computationally and statistically efficient learning of causal Bayes nets using path queries	✅	❌	✅	❌	❌	❌	❌	2
Computing Higher Order Derivatives of Matrix and Tensor Expressions	❌	✅	❌	❌	✅	✅	✅	4
Computing Kantorovich-Wasserstein Distances on $d$-dimensional histograms using $(d+1)$-partite graphs	❌	✅	✅	❌	✅	✅	✅	5
Conditional Adversarial Domain Adaptation	❌	✅	✅	✅	❌	❌	✅	4
Confounding-Robust Policy Improvement	✅	❌	✅	❌	❌	❌	❌	2
Connecting Optimization and Regularization Paths	❌	❌	❌	❌	❌	❌	✅	1
Connectionist Temporal Classification with Maximum Entropy Regularization	❌	✅	✅	✅	❌	❌	✅	4
Constant Regret, Generalized Mixability, and Mirror Descent	✅	❌	✅	❌	❌	❌	✅	3
Constrained Cross-Entropy Method for Safe Reinforcement Learning	✅	❌	❌	❌	❌	❌	✅	2
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders	❌	❌	✅	❌	❌	❌	❌	1
Constrained Graph Variational Autoencoders for Molecule Design	❌	✅	✅	❌	❌	❌	✅	3
Constructing Deep Neural Networks by Bayesian Network Structure Learning	✅	❌	✅	❌	❌	❌	❌	2
Constructing Fast Network through Deconstruction of Convolution	❌	✅	✅	❌	✅	❌	✅	4
Constructing Unrestricted Adversarial Examples with Generative Models	✅	❌	✅	❌	❌	❌	✅	3
Contamination Attacks and Mitigation in Multi-Party Machine Learning	✅	❌	✅	✅	❌	❌	✅	4
Content preserving text generation with attribute controls	❌	❌	✅	✅	❌	❌	✅	3
Context-aware Synthesis and Placement of Object Instances	❌	✅	✅	❌	❌	❌	✅	3
Context-dependent upper-confidence bounds for directed exploration	✅	❌	✅	❌	❌	❌	✅	3
Contextual Combinatorial Multi-armed Bandits with Volatile Arms and Submodular Reward	✅	❌	✅	❌	❌	❌	❌	2
Contextual Pricing for Lipschitz Buyers	✅	❌	❌	❌	❌	❌	❌	1
Contextual Stochastic Block Models	❌	❌	❌	❌	❌	❌	✅	1
Contextual bandits with surrogate losses: Margin bounds and efficient algorithms	✅	❌	❌	❌	❌	❌	❌	1
Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces	✅	❌	✅	❌	❌	❌	✅	3
Contour location via entropy reduction leveraging multiple information sources	❌	✅	❌	❌	❌	✅	✅	3
Contrastive Learning from Pairwise Measurements	❌	❌	❌	❌	❌	❌	✅	1
Convergence of Cubic Regularization for Nonconvex Optimization under KL Property	❌	❌	❌	❌	❌	❌	❌	0
Convex Elicitation of Continuous Properties	❌	❌	❌	❌	❌	❌	❌	0
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation	❌	❌	✅	❌	✅	❌	✅	3
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization	❌	❌	✅	❌	❌	❌	✅	2
Cooperative neural networks (CoNN): Exploiting prior independence structure for improved classification	✅	❌	✅	✅	✅	❌	✅	5
Coordinate Descent with Bandit Sampling	✅	❌	✅	❌	❌	❌	✅	3
Coupled Variational Bayes via Optimization Embedding	✅	✅	✅	❌	✅	❌	✅	5
Credit Assignment For Collective Multiagent RL With Global Rewards	❌	❌	❌	❌	❌	❌	❌	0
Critical initialisation for deep signal propagation in noisy rectifier neural networks	❌	✅	✅	❌	✅	❌	✅	4
DAGs with NO TEARS: Continuous Optimization for Structure Learning	✅	✅	✅	❌	❌	❌	✅	4
DVAE#: Discrete Variational Autoencoders with Relaxed Boltzmann Priors	❌	✅	✅	❌	❌	❌	✅	3
Data Amplification: A Unified and Competitive Approach to Property Estimation	❌	❌	✅	❌	❌	❌	✅	2
Data center cooling using model-predictive control	❌	❌	❌	✅	❌	❌	✅	2
Data-Driven Clustering via Parameterized Lloyd's Families	✅	❌	✅	✅	❌	❌	✅	4
Data-Efficient Hierarchical Reinforcement Learning	❌	✅	✅	❌	❌	❌	✅	3
Data-dependent PAC-Bayes priors via differential privacy	❌	❌	✅	❌	❌	❌	✅	2
Decentralize and Randomize: Faster Algorithm for Wasserstein Barycenters	✅	❌	✅	❌	❌	❌	✅	3
Deep Anomaly Detection Using Geometric Transformations	❌	✅	✅	✅	❌	❌	✅	4
Deep Attentive Tracking via Reciprocative Learning	❌	❌	✅	❌	✅	❌	✅	3
Deep Defense: Training DNNs with Improved Adversarial Robustness	❌	✅	✅	✅	✅	❌	✅	5
Deep Dynamical Modeling and Control of Unsteady Fluid Flows	❌	✅	❌	❌	✅	❌	✅	3
Deep Functional Dictionaries: Learning Consistent Semantic Structures on 3D Models from Functions	✅	✅	✅	✅	❌	❌	✅	5
Deep Generative Markov State Models	❌	✅	✅	✅	❌	❌	❌	3
Deep Generative Models for Distribution-Preserving Lossy Compression	❌	✅	✅	❌	❌	❌	✅	3
Deep Generative Models with Learnable Knowledge Constraints	✅	❌	✅	❌	❌	❌	❌	2
Deep Homogeneous Mixture Models: Representation, Separation, and Approximation	❌	❌	✅	❌	❌	❌	✅	2
Deep Network for the Integrated 3D Sensing of Multiple People in Natural Images	❌	❌	✅	✅	❌	❌	✅	3
Deep Neural Nets with Interpolating Function as Output Activation	✅	✅	✅	❌	✅	❌	✅	5
Deep Neural Networks with Box Convolutions	❌	✅	✅	✅	✅	✅	✅	6
Deep Non-Blind Deconvolution via Generalized Low-Rank Approximation	❌	❌	✅	❌	✅	❌	✅	3
Deep Poisson gamma dynamical systems	✅	❌	✅	✅	❌	❌	✅	4
Deep Predictive Coding Network with Local Recurrent Processing for Object Recognition	✅	❌	✅	✅	❌	❌	✅	4
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models	✅	✅	✅	❌	❌	❌	✅	4
Deep Reinforcement Learning of Marked Temporal Point Processes	✅	✅	✅	❌	❌	❌	✅	4
Deep State Space Models for Time Series Forecasting	❌	❌	✅	❌	❌	❌	✅	2
Deep State Space Models for Unconditional Word Generation	✅	❌	✅	✅	❌	❌	✅	4
Deep Structured Prediction with Nonlinear Output Transformations	✅	✅	✅	✅	❌	❌	✅	5
Deep, complex, invertible networks for inversion of transmission effects in multimode optical fibres	❌	✅	✅	❌	❌	❌	✅	3
DeepExposure: Learning to Expose Photos with Asynchronously Reinforced Adversarial Learning	✅	❌	✅	❌	✅	❌	✅	4
DeepPINK: reproducible feature selection in deep neural networks	❌	✅	✅	❌	❌	❌	✅	3
DeepProbLog: Neural Probabilistic Logic Programming	❌	✅	✅	❌	❌	❌	✅	3
Deepcode: Feedback Codes via Deep Learning	❌	✅	❌	❌	❌	❌	✅	2
Delta-encoder: an effective sample synthesis method for few-shot object recognition	❌	✅	✅	✅	✅	❌	✅	5
Demystifying excessively volatile human learning: A Bayesian persistent prior and a neural approximation	❌	❌	✅	✅	❌	❌	✅	3
Dendritic cortical microcircuits approximate the backpropagation algorithm	❌	❌	✅	✅	❌	❌	✅	3
Densely Connected Attention Propagation for Reading Comprehension	❌	❌	✅	✅	❌	❌	✅	3
Depth-Limited Solving for Imperfect-Information Games	❌	❌	✅	❌	✅	❌	✅	3
Derivative Estimation in Random Design	❌	❌	❌	❌	❌	✅	✅	2
Designing by Training: Acceleration Neural Network for Fast High-Dimensional Convolution	❌	❌	❌	❌	❌	❌	✅	1
Dialog-based Interactive Image Retrieval	❌	❌	✅	❌	❌	❌	❌	1
Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base	❌	❌	✅	✅	❌	❌	✅	3
DifNet: Semantic Segmentation by Diffusion Networks	❌	❌	✅	✅	✅	❌	✅	4
Differentiable MPC for End-to-end Planning and Control	✅	✅	❌	❌	❌	❌	✅	3
Differential Privacy for Growing Databases	✅	❌	❌	❌	❌	❌	❌	1
Differential Properties of Sinkhorn Approximation for Learning with Wasserstein Distance	✅	✅	✅	✅	✅	❌	✅	6
Differentially Private Bayesian Inference for Exponential Families	✅	❌	❌	❌	❌	❌	✅	2
Differentially Private Change-Point Detection	✅	❌	❌	❌	❌	❌	✅	2
Differentially Private Contextual Linear Bandits	✅	❌	❌	❌	❌	❌	❌	1
Differentially Private Robust Low-Rank Approximation	✅	❌	❌	❌	❌	❌	❌	1
Differentially Private Testing of Identity and Closeness of Discrete Distributions	✅	❌	❌	❌	❌	❌	❌	1
Differentially Private Uniformly Most Powerful Tests for Binomial Data	✅	❌	❌	❌	❌	❌	✅	2
Differentially Private k-Means with Constant Multiplicative Error	✅	❌	❌	❌	❌	❌	❌	1
Diffusion Maps for Textual Network Embedding	❌	❌	✅	❌	✅	❌	✅	3
Dimensionality Reduction for Stationary Time Series via Stochastic Nonconvex Optimization	✅	❌	✅	❌	❌	❌	✅	3
Dimensionality Reduction has Quantifiable Imperfections: Two Geometric Bounds	❌	❌	❌	❌	❌	❌	✅	1
Dimensionally Tight Bounds for Second-Order Hamiltonian Monte Carlo	✅	✅	❌	❌	❌	❌	✅	3
Diminishing Returns Shape Constraints for Interpretability and Regularization	❌	✅	✅	✅	❌	❌	✅	4
Direct Estimation of Differences in Causal Graphs	✅	✅	✅	✅	❌	❌	✅	5
Direct Runge-Kutta Discretization Achieves Acceleration	✅	❌	❌	❌	❌	❌	✅	2
Dirichlet belief networks for topic structure learning	✅	✅	✅	✅	❌	❌	✅	5
Dirichlet-based Gaussian Processes for Large-scale Calibrated Classification	❌	✅	✅	✅	✅	❌	✅	5
Disconnected Manifold Learning for Generative Adversarial Networks	✅	❌	✅	❌	❌	❌	✅	3
Discovery of Latent 3D Keypoints via End-to-end Geometric Reasoning	❌	❌	✅	❌	❌	❌	✅	2
Discretely Relaxing Continuous Variables for tractable Variational Inference	❌	✅	✅	✅	✅	❌	✅	5
Discrimination-aware Channel Pruning for Deep Neural Networks	✅	✅	✅	❌	❌	❌	✅	4
Distilled Wasserstein Learning for Word Embedding and Topic Modeling	✅	❌	✅	✅	❌	❌	✅	4
Distributed $k$-Clustering for Data with Heavy Noise	✅	❌	❌	❌	❌	❌	❌	1
Distributed Learning without Distress: Privacy-Preserving Empirical Risk Minimization	❌	✅	✅	❌	❌	❌	✅	3
Distributed Multi-Player Bandits - a Game of Thrones Approach	✅	❌	❌	❌	❌	❌	✅	2
Distributed Multitask Reinforcement Learning with Quadratic Convergence	❌	❌	✅	❌	❌	❌	✅	2
Distributed Stochastic Optimization via Adaptive SGD	✅	❌	✅	✅	❌	✅	✅	5
Distributed Weight Consolidation: A Brain Segmentation Case Study	❌	❌	✅	❌	❌	❌	✅	2
Distributionally Robust Graphical Models	❌	❌	✅	✅	❌	❌	❌	2
Diverse Ensemble Evolution: Curriculum Data-Model Marriage	✅	❌	✅	❌	❌	❌	✅	3
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning	❌	❌	✅	❌	❌	❌	❌	1
Do Less, Get More: Streaming Submodular Maximization with Subsampling	✅	❌	✅	❌	✅	❌	✅	4
Does mitigating ML's impact disparity require treatment disparity?	❌	❌	✅	❌	❌	❌	❌	1
Domain Adaptation by Using Causal Inference to Predict Invariant Conditional Distributions	❌	✅	✅	❌	❌	✅	✅	4
Domain-Invariant Projection Learning for Zero-Shot Recognition	✅	❌	✅	✅	❌	❌	✅	4
Doubly Robust Bayesian Inference for Non-Stationary Streaming Data with $\beta$-Divergences	✅	✅	✅	❌	✅	❌	✅	5
DropBlock: A regularization method for convolutional networks	✅	✅	✅	✅	✅	❌	✅	6
DropMax: Adaptive Variational Softmax	❌	✅	✅	✅	❌	❌	✅	4
Dropping Symmetry for Fast Symmetric Nonnegative Matrix Factorization	✅	❌	✅	✅	❌	❌	❌	3
Dual Policy Iteration	✅	❌	✅	❌	❌	❌	✅	3
Dual Principal Component Pursuit: Improved Analysis and Efficient Algorithms	✅	❌	✅	❌	✅	❌	✅	4
Dual Swap Disentangling	✅	❌	✅	✅	❌	❌	✅	4
Dynamic Network Model from Partial Observations	✅	❌	✅	✅	❌	❌	✅	4
Early Stopping for Nonparametric Testing	❌	❌	❌	❌	❌	❌	✅	1
Efficient Algorithms for Non-convex Isotonic Regression through Submodular Optimization	✅	❌	❌	❌	❌	❌	✅	2
Efficient Anomaly Detection via Matrix Sketching	✅	❌	✅	❌	✅	❌	✅	4
Efficient Convex Completion of Coupled Tensors using Coupled Nuclear Norms	✅	✅	✅	✅	❌	❌	✅	5
Efficient Formal Safety Analysis of Neural Networks	❌	❌	✅	❌	❌	✅	✅	3
Efficient Gradient Computation for Structured Output Learning with Rational and Tropical Losses	✅	❌	✅	✅	❌	❌	❌	3
Efficient High Dimensional Bayesian Optimization with Additivity and Quadrature Fourier Features	✅	❌	✅	❌	❌	❌	✅	3
Efficient Loss-Based Decoding on Graphs for Extreme Classification	❌	✅	✅	❌	❌	❌	❌	2
Efficient Neural Network Robustness Certification with General Activation Functions	❌	✅	✅	❌	✅	❌	❌	3
Efficient Online Portfolio with Logarithmic Regret	✅	❌	❌	❌	❌	❌	❌	1
Efficient Projection onto the Perfect Phylogeny Model	✅	✅	❌	❌	✅	❌	❌	3
Efficient Stochastic Gradient Hard Thresholding	✅	❌	✅	❌	❌	❌	✅	3
Efficient inference for time-varying behavior during learning	✅	✅	✅	✅	❌	❌	✅	5
Efficient nonmyopic batch active search	❌	✅	✅	❌	❌	❌	✅	3
Efficient online algorithms for fast-rate regret bounds under sparsity	✅	❌	❌	❌	❌	❌	❌	1
Embedding Logical Queries on Knowledge Graphs	✅	✅	✅	✅	❌	❌	✅	5
Empirical Risk Minimization Under Fairness Constraints	❌	✅	✅	✅	❌	❌	✅	4
Empirical Risk Minimization in Non-interactive Local Differential Privacy Revisited	✅	❌	❌	❌	❌	❌	❌	1
End-to-End Differentiable Physics for Learning and Control	❌	✅	❌	✅	❌	❌	✅	3
End-to-end Symmetry Preserving Inter-atomic Potential Energy Model for Finite and Extended Systems	❌	✅	✅	✅	❌	❌	❌	3
Enhancing the Accuracy and Fairness of Human Decision Making	❌	✅	✅	❌	❌	❌	✅	3
Entropy Rate Estimation for Markov Chains with Large State Space	❌	❌	✅	❌	❌	❌	❌	1
Entropy and mutual information in models of deep neural networks	❌	✅	❌	❌	✅	❌	❌	2
Equality of Opportunity in Classification: A Causal Approach	✅	❌	✅	❌	❌	❌	❌	2
Escaping Saddle Points in Constrained Optimization	✅	❌	❌	❌	❌	❌	❌	1
Estimating Learnability in the Sublinear Data Regime	✅	✅	✅	❌	✅	✅	✅	6
Estimators for Multivariate Information Measures in General Probability Spaces	✅	❌	❌	❌	❌	❌	❌	1
Evidential Deep Learning to Quantify Classification Uncertainty	❌	✅	✅	✅	❌	❌	✅	4
Evolution-Guided Policy Gradient in Reinforcement Learning	✅	✅	✅	❌	❌	❌	✅	4
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks	✅	❌	✅	✅	❌	❌	✅	4
Evolved Policy Gradients	✅	✅	✅	❌	❌	❌	✅	4
Ex ante coordination and collusion in zero-sum multi-player extensive-form games	✅	❌	✅	❌	❌	✅	✅	4
Exact natural gradient in deep linear networks and its application to the nonlinear case	❌	❌	✅	❌	❌	❌	✅	2
Expanding Holographic Embeddings for Knowledge Completion	❌	❌	✅	✅	❌	❌	✅	3
Experimental Design for Cost-Aware Learning of Causal Graphs	✅	❌	❌	❌	❌	✅	✅	3
Explaining Deep Learning Models -- A Bayesian Non-parametric Approach	❌	❌	✅	❌	❌	❌	❌	1
Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives	✅	✅	✅	✅	❌	❌	✅	5
Exploiting Numerical Sparsity for Efficient Learning : Faster Eigenvector Computation and Regression	✅	❌	❌	❌	❌	❌	❌	1
Exploration in Structured Reinforcement Learning	✅	❌	❌	❌	❌	❌	❌	1
Exponentially Weighted Imitation Learning for Batched Historical Data	✅	❌	❌	❌	❌	❌	✅	2
Exponentiated Strongly Rayleigh Distributions	✅	❌	✅	✅	❌	❌	✅	4
Extracting Relationships by Multi-Domain Matching	✅	❌	✅	✅	❌	❌	✅	4
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification	❌	✅	✅	✅	❌	❌	✅	4
FRAGE: Frequency-Agnostic Word Representation	✅	✅	✅	✅	❌	❌	✅	5
Factored Bandits	✅	❌	❌	❌	❌	❌	✅	2
Fairness Behind a Veil of Ignorance: A Welfare Analysis for Automated Decision Making	❌	❌	✅	✅	❌	❌	✅	3
Fairness Through Computationally-Bounded Awareness	✅	❌	❌	❌	❌	❌	❌	1
Faithful Inversion of Generative Models for Effective Amortized Inference	✅	✅	✅	✅	❌	❌	✅	5
Fast Approximate Natural Gradient Descent in a Kronecker Factored Eigenbasis	✅	❌	✅	❌	❌	❌	✅	3
Fast Estimation of Causal Interactions using Wold Processes	✅	✅	✅	❌	✅	✅	✅	6
Fast Greedy MAP Inference for Determinantal Point Process to Improve Recommendation Diversity	✅	❌	✅	❌	✅	❌	❌	3
Fast Rates of ERM and Stochastic Approximation: Adaptive to Error Bound Conditions	✅	❌	✅	✅	❌	❌	❌	3
Fast Similarity Search via Optimal Sparse Lifting	✅	❌	✅	❌	❌	❌	✅	3
Fast and Effective Robustness Certification	❌	✅	✅	❌	✅	❌	✅	4
Fast deep reinforcement learning using online adjustments from the past	✅	❌	✅	❌	❌	❌	✅	3
Fast greedy algorithms for dictionary selection with generalized sparsity constraints	✅	❌	✅	❌	✅	❌	✅	4
FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network	❌	✅	✅	✅	✅	❌	✅	5
Faster Neural Networks Straight from JPEG	❌	✅	✅	❌	✅	❌	✅	4
Faster Online Learning of Optimal Threshold for Consistent F-measure Optimization	✅	❌	✅	✅	❌	❌	✅	4
Fighting Boredom in Recommender Systems with Linear Reinforcement Learning	✅	❌	✅	❌	❌	❌	✅	3
First-order Stochastic Algorithms for Escaping From Saddle Points in Almost Linear Time	✅	❌	✅	❌	❌	❌	✅	3
FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction	❌	✅	✅	✅	❌	❌	✅	4
Flexible and accurate inference and learning for deep generative models	✅	❌	✅	❌	❌	❌	✅	3
Flexible neural representation for physics prediction	✅	✅	❌	❌	❌	❌	❌	2
Forecasting Treatment Responses Over Time Using Recurrent Marginal Structural Networks	❌	✅	❌	✅	❌	❌	✅	3
Foreground Clustering for Joint Segmentation and Localization in Videos and Images	❌	✅	✅	❌	❌	❌	✅	3
Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger	❌	✅	✅	✅	❌	❌	❌	3
Found Graph Data and Planted Vertex Covers	✅	✅	✅	✅	❌	❌	✅	5
Frequency-Domain Dynamic Pruning for Convolutional Neural Networks	❌	❌	✅	✅	❌	❌	✅	3
From Stochastic Planning to Marginal MAP	✅	❌	✅	❌	❌	❌	✅	3
Fully Neural Network Based Speech Recognition on Mobile and Embedded Devices	❌	❌	✅	✅	✅	❌	✅	4
Fully Understanding The Hashing Trick	❌	❌	✅	❌	❌	❌	✅	2
GIANT: Globally Improved Approximate Newton Method for Distributed Optimization	❌	✅	✅	❌	✅	❌	✅	4
GILBO: One Metric to Measure Them All	❌	✅	✅	❌	❌	❌	✅	3
GLoMo: Unsupervised Learning of Transferable Relational Graphs	❌	❌	✅	✅	❌	❌	✅	3
GPyTorch: Blackbox Matrix-Matrix Gaussian Process Inference with GPU Acceleration	❌	✅	✅	❌	✅	❌	✅	4
Gamma-Poisson Dynamic Matrix Factorization Embedded with Metadata Influence	✅	❌	✅	✅	❌	❌	✅	4
Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks	❌	✅	✅	✅	❌	❌	✅	4
Gaussian Process Conditional Density Estimation	❌	❌	✅	✅	❌	❌	✅	3
Gaussian Process Prior Variational Autoencoders	❌	✅	✅	✅	❌	❌	✅	4
Gen-Oja: Simple & Efficient Algorithm for Streaming Generalized Eigenvector Computation	✅	❌	❌	❌	❌	❌	✅	2
Generalisation in humans and deep neural networks	❌	✅	✅	❌	❌	✅	✅	4
Generalisation of structural knowledge in the hippocampal-entorhinal system	❌	❌	✅	❌	❌	❌	❌	1
Generalization Bounds for Uniformly Stable Algorithms	❌	❌	❌	❌	❌	❌	❌	0
Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels	✅	❌	✅	✅	❌	❌	✅	4
Generalized Inverse Optimization through Online Learning	✅	❌	❌	❌	✅	❌	✅	3
Generalized Zero-Shot Learning with Deep Calibration Network	❌	❌	✅	✅	❌	❌	✅	3
Generalizing Graph Matching beyond Quadratic Assignment Model	✅	❌	✅	❌	❌	❌	✅	3
Generalizing Point Embeddings using the Wasserstein Space of Elliptical Distributions	✅	❌	✅	❌	❌	❌	✅	3
Generalizing Tree Probability Estimation via Bayesian Networks	✅	✅	✅	❌	❌	❌	✅	4
Generalizing to Unseen Domains via Adversarial Data Augmentation	✅	✅	✅	❌	❌	❌	✅	4
Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization	❌	❌	❌	✅	✅	❌	✅	3
Generative Neural Machine Translation	✅	❌	✅	✅	❌	❌	✅	4
Generative Probabilistic Novelty Detection with Adversarial Autoencoders	❌	✅	✅	✅	✅	❌	✅	5
Generative modeling for protein structures	❌	❌	✅	❌	❌	✅	❌	2
Genetic-Gated Networks for Deep Reinforcement Learning	✅	❌	✅	❌	✅	❌	✅	4
Geometrically Coupled Monte Carlo Sampling	✅	❌	✅	❌	❌	❌	✅	3
Geometry Based Data Generation	✅	✅	✅	✅	❌	❌	❌	4
Geometry-Aware Recurrent Neural Networks for Active Visual Recognition	❌	✅	✅	✅	❌	❌	✅	4
Global Convergence of Langevin Dynamics Based Algorithms for Nonconvex Optimization	✅	❌	❌	❌	❌	❌	❌	1
Global Gated Mixture of Second-order Pooling for Improving Deep Convolutional Neural Networks	❌	✅	✅	✅	✅	❌	✅	5
Global Geometry of Multichannel Sparse Blind Deconvolution on the Sphere	❌	❌	✅	❌	❌	❌	✅	2
Global Non-convex Optimization with Discretized Diffusions	❌	❌	❌	❌	❌	❌	✅	1
Glow: Generative Flow with Invertible 1x1 Convolutions	❌	✅	✅	✅	✅	❌	✅	5
GradiVeQ: Vector Quantization for Bandwidth-Efficient Gradient Aggregation in Distributed CNN Training	✅	❌	✅	❌	✅	❌	✅	4
Gradient Descent Meets Shift-and-Invert Preconditioning for Eigenvector Computation	✅	✅	✅	❌	❌	❌	✅	4
Gradient Descent for Spiking Neural Networks	❌	❌	❌	❌	❌	❌	✅	1
Gradient Sparsification for Communication-Efficient Distributed Optimization	✅	❌	✅	❌	❌	❌	✅	3
Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation	❌	✅	✅	❌	❌	❌	✅	3
Graph Oracle Models, Lower Bounds, and Gaps for Parallel Stochastic Optimization	✅	❌	❌	❌	❌	❌	❌	1
Graphical Generative Adversarial Networks	✅	✅	✅	❌	❌	❌	❌	3
Graphical model inference: Sequential Monte Carlo meets deterministic approximations	✅	❌	✅	❌	❌	❌	✅	3
Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN	✅	✅	✅	❌	❌	❌	✅	4
Group Equivariant Capsule Networks	✅	✅	✅	❌	❌	❌	✅	4
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking	✅	❌	✅	❌	❌	❌	✅	3
GumBolt: Extending Gumbel trick to Boltzmann priors	❌	❌	✅	✅	❌	❌	✅	3
HOGWILD!-Gibbs can be PanAccurate	✅	❌	❌	❌	✅	❌	✅	3
HOUDINI: Lifelong Learning as Program Synthesis	❌	✅	✅	❌	✅	❌	❌	3
Hamiltonian Variational Auto-Encoder	✅	✅	✅	✅	❌	❌	✅	5
Hardware Conditioned Policies for Multi-Robot Transfer Learning	✅	❌	❌	✅	❌	❌	✅	3
Hessian-based Analysis of Large Batch Training and Robustness to Adversaries	❌	❌	✅	✅	❌	❌	✅	3
Heterogeneous Bitwidth Binarization in Convolutional Neural Networks	✅	❌	✅	❌	✅	❌	✅	4
Heterogeneous Multi-output Gaussian Process Prediction	❌	✅	✅	✅	❌	❌	✅	4
Hierarchical Graph Representation Learning with Differentiable Pooling	❌	❌	✅	✅	❌	❌	✅	3
Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies	✅	❌	❌	❌	❌	❌	✅	2
High Dimensional Linear Regression using Lattice Basis Reduction	✅	❌	❌	❌	❌	❌	✅	2
HitNet: Hybrid Ternary Recurrent Neural Network	❌	❌	✅	❌	❌	❌	✅	2
Horizon-Independent Minimax Linear Regression	❌	❌	❌	❌	❌	❌	❌	0
How Does Batch Normalization Help Optimization?	❌	❌	✅	❌	❌	❌	✅	2
How Many Samples are Needed to Estimate a Convolutional Neural Network?	❌	❌	❌	❌	❌	❌	❌	0
How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?	❌	❌	❌	❌	❌	❌	✅	1
How SGD Selects the Global Minima in Over-parameterized Learning: A Dynamical Stability Perspective	❌	✅	✅	❌	❌	❌	✅	3
How To Make the Gradients Small Stochastically: Even Faster Convex and Nonconvex SGD	✅	❌	❌	❌	❌	❌	❌	1
How to Start Training: The Effect of Initialization and Architecture	❌	❌	✅	❌	❌	❌	✅	2
How to tell when a clustering is (approximately) correct using convex relaxations	❌	❌	✅	❌	❌	❌	✅	2
Human-in-the-Loop Interpretability Prior	❌	❌	✅	✅	❌	❌	❌	2
Hunting for Discriminatory Proxies in Linear Regression Models	❌	❌	✅	❌	❌	❌	✅	2
Hybrid Knowledge Routed Modules for Large-scale Object Detection	❌	✅	✅	✅	❌	❌	✅	4
Hybrid Macro/Micro Level Backpropagation for Training Deep Spiking Neural Networks	❌	✅	✅	✅	❌	❌	✅	4
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation	❌	❌	✅	✅	✅	❌	✅	4
Hybrid-MST: A Hybrid Active Sampling Strategy for Pairwise Preference Aggregation	✅	✅	✅	❌	✅	✅	❌	5
Hyperbolic Neural Networks	❌	✅	✅	✅	❌	❌	✅	4
Identification and Estimation of Causal Effects from Dependent Data	❌	❌	❌	❌	❌	❌	❌	0
Image Inpainting via Generative Multi-column Convolutional Neural Networks	❌	❌	✅	✅	✅	✅	✅	5
Image-to-image translation for cross-domain disentanglement	❌	✅	✅	❌	❌	❌	❌	2
Implicit Bias of Gradient Descent on Linear Convolutional Networks	❌	❌	❌	❌	❌	❌	❌	0
Implicit Probabilistic Integrators for ODEs	✅	❌	❌	❌	❌	❌	✅	2
Implicit Reparameterization Gradients	✅	✅	✅	❌	❌	❌	✅	4
Importance Weighting and Variational Inference	✅	❌	✅	❌	❌	✅	✅	4
Improved Algorithms for Collaborative PAC Learning	✅	❌	❌	❌	❌	❌	❌	1
Improved Expressivity Through Dendritic Neural Networks	❌	✅	✅	✅	❌	❌	✅	4
Improved Network Robustness with Adversary Critic	✅	❌	✅	❌	✅	❌	✅	4
Improving Explorability in Variational Inference with Annealed Variational Objectives	❌	❌	✅	❌	❌	❌	✅	2
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents	✅	✅	✅	❌	❌	❌	✅	4
Improving Neural Program Synthesis with Inferred Execution Traces	❌	❌	✅	✅	❌	❌	✅	3
Improving Online Algorithms via ML Predictions	✅	❌	❌	❌	❌	❌	✅	2
Improving Simple Models with Confidence Profiles	✅	❌	✅	✅	❌	❌	✅	4
Incorporating Context into Language Encoding Models for fMRI	❌	❌	✅	❌	❌	❌	✅	2
Inequity aversion improves cooperation in intertemporal social dilemmas	❌	❌	❌	❌	❌	❌	✅	1
Inexact trust-region algorithms on Riemannian manifolds	✅	✅	✅	❌	✅	❌	✅	5
Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing	✅	❌	✅	❌	❌	❌	✅	3
Inference in Deep Gaussian Processes using Stochastic Gradient Hamiltonian Monte Carlo	✅	✅	✅	❌	❌	❌	✅	4
Inferring Latent Velocities from Weather Radar Data using Gaussian Processes	✅	❌	❌	❌	❌	❌	❌	1
Inferring Networks From Random Walk-Based Node Similarities	❌	✅	✅	❌	✅	❌	✅	4
Infinite-Horizon Gaussian Processes	✅	✅	✅	❌	✅	✅	✅	6
Information Constraints on Auto-Encoding Variational Bayes	❌	❌	✅	❌	❌	❌	❌	1
Information-based Adaptive Stimulus Selection to Optimize Communication Efficiency in Brain-Computer Interfaces	✅	❌	❌	❌	❌	❌	✅	2
Information-theoretic Limits for Community Detection in Network Models	✅	❌	❌	❌	❌	❌	❌	1
Informative Features for Model Comparison	❌	✅	✅	✅	❌	❌	✅	4
Insights on representational similarity in neural networks with canonical correlation	❌	❌	✅	❌	❌	❌	❌	1
Integrated accounts of behavioral and neuroimaging data using flexible recurrent neural network models	❌	❌	✅	❌	❌	❌	❌	1
Interactive Structure Learning with Structural Query-by-Committee	✅	❌	✅	❌	❌	❌	❌	2
Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections	✅	❌	✅	✅	✅	✅	✅	6
IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis	✅	❌	✅	✅	❌	❌	✅	4
Invariant Representations without Adversarial Training	❌	❌	✅	✅	❌	❌	✅	3
Invertibility of Convolutional Generative Networks from Partial Measurements	❌	✅	✅	❌	❌	❌	✅	3
Is Q-Learning Provably Efficient?	✅	❌	❌	❌	❌	❌	❌	1
Isolating Sources of Disentanglement in Variational Autoencoders	❌	✅	✅	❌	❌	❌	✅	3
Iterative Value-Aware Model Learning	✅	❌	❌	❌	❌	❌	❌	1
Joint Active Feature Acquisition and Classification with Variable-Size Set Encoding	✅	✅	✅	✅	✅	❌	✅	6
Joint Autoregressive and Hierarchical Priors for Learned Image Compression	❌	❌	✅	❌	❌	❌	✅	2
Joint Sub-bands Learning with Clique Structures for Wavelet Domain Super-Resolution	❌	❌	✅	✅	❌	❌	✅	3
KDGAN: Knowledge Distillation with Generative Adversarial Networks	✅	✅	✅	❌	❌	❌	✅	4
KONG: Kernels for ordered-neighborhood graphs	✅	✅	✅	✅	✅	❌	✅	6
Kalman Normalization: Normalizing Internal Representations Across Network Layers	❌	❌	✅	✅	✅	❌	✅	4
Knowledge Distillation by On-the-Fly Native Ensemble	✅	❌	✅	✅	❌	❌	✅	4
L4: Practical loss-based stepsize adaptation for deep learning	✅	❌	✅	❌	❌	❌	✅	3
LAG: Lazily Aggregated Gradient for Communication-Efficient Distributed Learning	✅	❌	✅	❌	✅	❌	✅	4
LF-Net: Learning Local Features from Images	❌	✅	✅	✅	✅	❌	✅	5
Large Margin Deep Networks for Classification	❌	✅	✅	✅	✅	❌	✅	5
Large Scale computation of Means and Clusters for Persistence Diagrams using Optimal Transport	✅	❌	✅	❌	✅	❌	✅	4
Large-Scale Stochastic Sampling from the Probability Simplex	✅	✅	✅	❌	❌	❌	✅	4
Latent Alignment and Variational Attention	✅	✅	✅	✅	❌	❌	✅	5
Latent Gaussian Activity Propagation: Using Smoothness and Structure to Separate and Localize Sounds in Large Noisy Environments	❌	❌	✅	❌	❌	❌	✅	2
Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation	❌	❌	✅	✅	✅	❌	✅	4
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning	✅	✅	✅	❌	❌	❌	✅	4
Learning Abstract Options	❌	❌	✅	❌	❌	❌	✅	2
Learning Attentional Communication for Multi-Agent Cooperation	❌	❌	✅	❌	❌	❌	✅	2
Learning Attractor Dynamics for Generative Memory	✅	✅	✅	❌	❌	❌	✅	4
Learning Beam Search Policies via Imitation Learning	✅	❌	❌	❌	❌	❌	❌	1
Learning Bounds for Greedy Approximation with Explicit Feature Maps from Multiple Kernels	✅	❌	✅	❌	❌	❌	✅	3
Learning Compressed Transforms with Low Displacement Rank	❌	✅	✅	❌	❌	❌	❌	2
Learning Concave Conditional Likelihood Models for Improved Analysis of Tandem Mass Spectra	❌	❌	✅	❌	✅	✅	✅	4
Learning Conditioned Graph Structures for Interpretable Visual Question Answering	❌	✅	✅	✅	❌	❌	✅	4
Learning Confidence Sets using Support Vector Machines	❌	❌	✅	✅	❌	❌	✅	3
Learning Deep Disentangled Embeddings With the F-Statistic Loss	❌	✅	✅	✅	❌	❌	✅	4
Learning Disentangled Joint Continuous and Discrete Representations	❌	✅	✅	❌	❌	❌	✅	3
Learning Gaussian Processes by Minimizing PAC-Bayesian Generalization Bounds	❌	✅	✅	❌	❌	❌	✅	3
Learning Hierarchical Semantic Image Manipulation through Structured Representations	❌	❌	✅	✅	❌	❌	✅	3
Learning Invariances using the Marginal Likelihood	❌	❌	✅	❌	❌	❌	❌	1
Learning Latent Subspaces in Variational Autoencoders	❌	❌	✅	✅	❌	❌	✅	3
Learning Libraries of Subroutines for Neurally–Guided Bayesian Program Induction	✅	❌	✅	✅	❌	❌	✅	4
Learning Loop Invariants for Program Verification	❌	✅	✅	❌	✅	❌	❌	3
Learning Optimal Reserve Price against Non-myopic Bidders	✅	❌	❌	❌	❌	❌	❌	1
Learning Others' Intentional Models in Multi-Agent Settings Using Interactive POMDPs	✅	❌	❌	❌	❌	✅	❌	2
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data	❌	❌	✅	❌	❌	❌	✅	2
Learning Pipelines with Limited Data and Domain Knowledge: A Study in Parsing Physics Problems	❌	❌	✅	❌	❌	❌	✅	2
Learning Plannable Representations with Causal InfoGAN	❌	✅	✅	❌	❌	❌	❌	2
Learning SMaLL Predictors	✅	❌	✅	✅	❌	❌	✅	4
Learning Safe Policies with Expert Guidance	✅	❌	✅	❌	❌	❌	✅	3
Learning Signed Determinantal Point Processes through the Principal Minor Assignment Problem	✅	❌	❌	❌	❌	❌	❌	1
Learning Task Specifications from Demonstrations	✅	❌	❌	❌	❌	❌	❌	1
Learning Temporal Point Processes via Reinforcement Learning	✅	❌	✅	❌	❌	❌	✅	3
Learning To Learn Around A Common Mean	✅	✅	✅	✅	❌	❌	✅	5
Learning Versatile Filters for Efficient Convolutional Neural Networks	❌	✅	✅	✅	❌	❌	❌	3
Learning a High Fidelity Pose Invariant Model for High-resolution Face Frontalization	✅	❌	✅	✅	✅	❌	✅	5
Learning a Warping Distance from Unlabeled Time Series Using Sequence Autoencoders	✅	❌	✅	❌	❌	❌	✅	3
Learning a latent manifold of odor representations from neural responses in piriform cortex	❌	❌	❌	✅	❌	❌	✅	2
Learning and Inference in Hilbert Space with Quantum Graphical Models	✅	❌	✅	❌	❌	❌	❌	2
Learning and Testing Causal Models with Interventions	✅	❌	❌	❌	❌	❌	❌	1
Learning convex bounds for linear quadratic control policy synthesis	✅	❌	❌	❌	❌	❌	✅	2
Learning convex polytopes with margin	❌	❌	❌	❌	❌	❌	❌	0
Learning filter widths of spectral decompositions with wavelets	❌	✅	✅	✅	❌	❌	✅	4
Learning from Group Comparisons: Exploiting Higher Order Interactions	❌	❌	✅	❌	❌	❌	✅	2
Learning from discriminative feature feedback	✅	❌	✅	❌	❌	❌	❌	2
Learning in Games with Lossy Feedback	✅	❌	❌	❌	❌	❌	❌	1
Learning latent variable structured prediction models with Gaussian perturbations	✅	❌	✅	❌	❌	❌	✅	3
Learning long-range spatial dependencies with horizontal gated recurrent units	❌	❌	✅	❌	❌	❌	✅	2
Learning semantic similarity in a continuous space	❌	✅	✅	❌	✅	✅	✅	5
Learning sparse neural networks via sensitivity-driven regularization	❌	❌	✅	❌	❌	❌	✅	2
Learning to Decompose and Disentangle Representations for Video Prediction	❌	✅	✅	❌	❌	❌	❌	2
Learning to Exploit Stability for 3D Scene Parsing	❌	❌	✅	✅	❌	❌	✅	3
Learning to Infer Graphics Programs from Hand-Drawn Images	❌	✅	✅	❌	✅	❌	❌	3
Learning to Multitask	❌	❌	✅	✅	❌	❌	✅	3
Learning to Navigate in Cities Without a Map	❌	✅	✅	❌	❌	❌	✅	3
Learning to Optimize Tensor Programs	✅	✅	✅	❌	✅	✅	✅	6
Learning to Play With Intrinsically-Motivated, Self-Aware Agents	❌	❌	❌	✅	❌	❌	✅	2
Learning to Reason with Third Order Tensor Products	❌	✅	✅	✅	✅	❌	✅	5
Learning to Reconstruct Shapes from Unseen Classes	❌	❌	✅	❌	❌	❌	❌	1
Learning to Repair Software Vulnerabilities with Generative Adversarial Networks	❌	❌	✅	✅	❌	❌	✅	3
Learning to Share and Hide Intentions using Information Regularization	✅	✅	❌	❌	❌	❌	✅	3
Learning to Solve SMT Formulas	✅	✅	✅	✅	❌	❌	✅	5
Learning to Specialize with Knowledge Distillation for Visual Question Answering	❌	❌	✅	✅	❌	❌	✅	3
Learning to Teach with Dynamic Loss Functions	✅	❌	✅	❌	❌	❌	✅	3
Learning towards Minimum Hyperspherical Energy	❌	❌	✅	❌	✅	❌	✅	3
Learning with SGD and Random Features	❌	❌	✅	❌	✅	❌	✅	3
Learning without the Phase: Regularized PhaseMax Achieves Optimal Sample Complexity	❌	❌	❌	❌	❌	❌	✅	1
Legendre Decomposition for Tensors	✅	✅	✅	❌	✅	✅	✅	6
Leveraged volume sampling for linear regression	✅	❌	✅	❌	❌	❌	❌	2
Leveraging the Exact Likelihood of Deep Latent Variable Models	✅	❌	✅	❌	✅	❌	✅	4
Life-Long Disentangled Representation Learning with Cross-Domain Latent Homologies	❌	❌	✅	❌	❌	❌	❌	1
Lifelong Inverse Reinforcement Learning	✅	❌	❌	❌	❌	✅	✅	3
Lifted Weighted Mini-Bucket	✅	❌	❌	❌	❌	❌	✅	2
Limited Memory Kelley's Method Converges for Composite Convex and Submodular Objectives	✅	❌	❌	❌	❌	❌	✅	2
Link Prediction Based on Graph Neural Networks	❌	✅	✅	✅	❌	❌	✅	4
LinkNet: Relational Embedding for Scene Graph	❌	❌	✅	❌	❌	❌	✅	2
Lipschitz regularity of deep neural networks: analysis and efficient estimation	✅	✅	✅	❌	❌	❌	❌	3
Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks	✅	✅	✅	✅	❌	❌	✅	5
Local Differential Privacy for Evolving Data	✅	❌	❌	❌	❌	❌	❌	1
Long short-term memory and Learning-to-learn in networks of spiking neurons	❌	❌	✅	❌	✅	❌	✅	3
Loss Functions for Multiset Prediction	❌	❌	✅	✅	❌	❌	❌	2
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs	✅	✅	✅	✅	❌	❌	✅	5
Low-Rank Tucker Decomposition of Large Tensors Using TensorSketch	✅	✅	❌	❌	✅	✅	✅	5
Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames	✅	❌	✅	❌	❌	❌	❌	2
Low-shot Learning via Covariance-Preserving Adversarial Augmentation Networks	❌	❌	✅	✅	❌	❌	✅	3
M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search	❌	✅	✅	✅	❌	❌	❌	3
MULAN: A Blind and Off-Grid Method for Multichannel Echo Retrieval	✅	✅	✅	❌	❌	❌	✅	4
MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models	❌	❌	✅	✅	❌	❌	✅	3
Mallows Models for Top-k Lists	❌	❌	✅	❌	❌	❌	✅	2
Manifold Structured Prediction	❌	❌	✅	✅	✅	❌	✅	4
Manifold-tiling Localized Receptive Fields are Optimal in Similarity-preserving Neural Networks	❌	✅	✅	❌	❌	❌	❌	2
Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction	❌	✅	✅	✅	❌	❌	✅	4
Masking: A New Perspective of Noisy Supervision	❌	✅	✅	❌	✅	❌	✅	4
Maximizing Induced Cardinality Under a Determinantal Point Process	❌	❌	❌	❌	❌	❌	❌	0
Maximizing acquisition functions for Bayesian optimization	✅	❌	❌	❌	❌	❌	✅	2
Maximum Causal Tsallis Entropy Imitation Learning	✅	❌	✅	❌	❌	❌	✅	3
Maximum-Entropy Fine Grained Classification	❌	❌	✅	❌	✅	❌	❌	2
Mean Field for the Stochastic Blockmodel: Optimization Landscape and Convergence Issues	❌	❌	❌	❌	❌	❌	✅	1
Mean-field theory of graph neural networks in graph partitioning	❌	❌	❌	✅	❌	✅	✅	3
Measures of distortion for machine learning	❌	❌	❌	❌	❌	❌	✅	1
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing	✅	✅	✅	✅	❌	❌	✅	5
Memory Replay GANs: Learning to Generate New Categories without Forgetting	❌	✅	✅	❌	❌	❌	✅	3
Mental Sampling in Multimodal Representations	✅	✅	❌	❌	❌	❌	✅	3
Mesh-TensorFlow: Deep Learning for Supercomputers	✅	✅	✅	✅	✅	❌	✅	6
Meta-Gradient Reinforcement Learning	✅	❌	✅	✅	❌	❌	✅	4
Meta-Learning MCMC Proposals	✅	❌	✅	❌	❌	❌	✅	3
Meta-Reinforcement Learning of Structured Exploration Strategies	✅	❌	❌	❌	❌	❌	❌	1
MetaAnchor: Learning to Detect Objects with Customized Anchors	❌	❌	✅	✅	❌	❌	✅	3
MetaGAN: An Adversarial Approach to Few-Shot Learning	✅	❌	✅	✅	❌	❌	✅	4
MetaReg: Towards Domain Generalization using Meta-Regularization	✅	❌	✅	❌	❌	❌	✅	3
Metric on Nonlinear Dynamical Systems with Perron-Frobenius Operators	❌	✅	✅	✅	❌	❌	✅	4
MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare	❌	✅	❌	✅	❌	❌	✅	3
Middle-Out Decoding	❌	❌	✅	✅	❌	❌	✅	3
Minimax Estimation of Neural Net Distance	❌	❌	❌	❌	❌	❌	❌	0
Minimax Statistical Learning with Wasserstein distances	❌	❌	❌	❌	❌	❌	❌	0
Mirrored Langevin Dynamics	✅	❌	✅	❌	❌	❌	✅	3
MixLasso: Generalized Mixed Regression via Convex Atomic-Norm Regularization	✅	❌	❌	❌	❌	❌	✅	2
Mixture Matrix Completion	✅	❌	✅	❌	❌	❌	✅	3
Model Agnostic Supervised Local Explanations	❌	✅	✅	✅	❌	❌	✅	4
Model-Agnostic Private Learning	✅	❌	❌	❌	❌	❌	❌	1
Model-based targeted dimensionality reduction for neuronal population data	✅	✅	❌	❌	❌	❌	✅	3
Modeling Dynamic Missingness of Implicit Feedback for Recommendation	❌	❌	✅	✅	✅	❌	✅	4
Modelling and unsupervised learning of symmetric deformable object categories	❌	❌	✅	✅	❌	❌	✅	3
Modelling sparsity, heterogeneity, reciprocity and community structure in temporal interaction data	❌	✅	✅	❌	❌	❌	✅	3
Modern Neural Networks Generalize on Small Data Sets	❌	❌	✅	❌	❌	❌	✅	2
Modular Networks: Learning to Decompose Neural Computation	✅	✅	✅	❌	❌	❌	✅	4
Monte-Carlo Tree Search for Constrained POMDPs	✅	❌	✅	❌	❌	❌	✅	3
Moonshine: Distilling with Cheap Convolutions	❌	✅	✅	❌	❌	❌	✅	3
Multi-Agent Generative Adversarial Imitation Learning	✅	✅	✅	❌	❌	❌	✅	4
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization	✅	❌	✅	❌	❌	❌	✅	3
Multi-Class Learning: From Theory to Algorithm	✅	❌	✅	✅	❌	✅	✅	5
Multi-Layered Gradient Boosting Decision Trees	✅	❌	✅	✅	❌	❌	✅	4
Multi-Task Learning as Multi-Objective Optimization	✅	❌	✅	✅	✅	❌	❌	4
Multi-Task Zipping via Layer-wise Neuron Sharing	✅	❌	✅	❌	✅	❌	✅	4
Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation	❌	✅	✅	✅	✅	❌	❌	4
Multi-armed Bandits with Compensation	✅	❌	❌	❌	❌	❌	✅	2
Multi-domain Causal Structure Learning in Linear Systems	✅	❌	✅	❌	❌	❌	✅	3
Multi-objective Maximization of Monotone Submodular Functions with Cardinality Constraint	✅	❌	✅	❌	❌	❌	✅	3
Multi-value Rule Sets for Interpretable Classification with Feature-Efficient Representations	❌	✅	✅	✅	❌	❌	✅	4
Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages	❌	✅	✅	✅	✅	❌	✅	5
Multimodal Generative Models for Scalable Weakly-Supervised Learning	❌	❌	✅	❌	❌	❌	✅	2
Multiple Instance Learning for Efficient Sequential Data Classification on Resource-constrained Devices	✅	✅	✅	❌	✅	❌	✅	5
Multiple-Step Greedy Policies in Approximate and Online Reinforcement Learning	✅	❌	❌	❌	❌	❌	❌	1
Multiplicative Weights Updates with Constant Step-Size in Graphical Constant-Sum Games	❌	❌	❌	❌	❌	❌	❌	0
Multitask Boosting for Survival Analysis with Competing Risks	✅	❌	✅	✅	❌	❌	✅	4
Multivariate Convolutional Sparse Coding for Electromagnetic Brain Signals	✅	✅	✅	❌	❌	❌	✅	4
Multivariate Time Series Imputation with Generative Adversarial Networks	❌	❌	✅	❌	❌	❌	✅	2
NAIS-Net: Stable Deep Networks from Non-Autonomous Differential Equations	✅	❌	✅	✅	❌	❌	✅	4
NEON2: Finding Local Minima via First-Order Oracles	✅	❌	❌	❌	❌	❌	❌	1
Natasha 2: Faster Non-Convex Optimization Than SGD	✅	❌	❌	❌	❌	❌	❌	1
Navigating with Graph Representations for Fast and Scalable Decoding of Neural Language Models	✅	❌	✅	❌	✅	❌	✅	4
Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes	✅	❌	✅	❌	❌	❌	✅	3
Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models	✅	❌	❌	❌	❌	❌	✅	2
Near-Optimal Time and Sample Complexities for Solving Markov Decision Processes with a Generative Model	✅	❌	❌	❌	❌	❌	❌	1
Nearly tight sample complexity bounds for learning mixtures of Gaussians via sample compression schemes	❌	❌	❌	❌	❌	❌	❌	0
Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making	❌	❌	✅	❌	❌	❌	✅	2
Neighbourhood Consensus Networks	❌	✅	✅	✅	❌	❌	✅	4
Neural Architecture Optimization	✅	✅	✅	✅	✅	❌	✅	6
Neural Architecture Search with Bayesian Optimisation and Optimal Transport	❌	✅	✅	✅	✅	❌	✅	5
Neural Arithmetic Logic Units	❌	❌	✅	✅	❌	❌	✅	3
Neural Code Comprehension: A Learnable Representation of Code Semantics	❌	✅	✅	✅	✅	❌	✅	5
Neural Edit Operations for Biological Sequences	✅	❌	✅	❌	❌	✅	✅	4
Neural Guided Constraint Logic Programming for Program Synthesis	❌	✅	❌	❌	✅	❌	✅	3
Neural Interaction Transparency (NIT): Disentangling Learned Interactions for Improved Interpretability	❌	❌	✅	✅	❌	❌	✅	3
Neural Nearest Neighbors Networks	❌	✅	✅	✅	❌	❌	✅	4
Neural Networks Trained to Solve Differential Equations Learn General Representations	❌	❌	✅	❌	❌	❌	✅	2
Neural Ordinary Differential Equations	✅	❌	✅	❌	❌	❌	✅	3
Neural Proximal Gradient Descent for Compressive Imaging	❌	✅	✅	❌	✅	❌	✅	4
Neural Tangent Kernel: Convergence and Generalization in Neural Networks	❌	❌	✅	❌	❌	❌	✅	2
Neural Voice Cloning with a Few Samples	❌	❌	✅	✅	❌	❌	✅	3
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding	❌	✅	✅	✅	❌	❌	✅	4
New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity	✅	❌	✅	❌	❌	❌	❌	2
Non-Adversarial Mapping with VAEs	❌	❌	✅	❌	✅	❌	✅	3
Non-Ergodic Alternating Proximal Augmented Lagrangian Algorithms with Optimal Rates	✅	❌	❌	❌	✅	✅	✅	4
Non-Local Recurrent Network for Image Restoration	❌	✅	✅	✅	✅	❌	✅	5
Non-delusional Q-learning and value-iteration	✅	❌	❌	❌	❌	❌	✅	2
Non-metric Similarity Graphs for Maximum Inner Product Search	✅	✅	✅	❌	✅	❌	✅	5
Non-monotone Submodular Maximization in Exponentially Fewer Iterations	✅	❌	✅	❌	❌	❌	✅	3
Nonlocal Neural Networks, Nonlocal Diffusion and Nonlocal Modeling	❌	❌	✅	✅	✅	❌	✅	4
Nonparametric Bayesian Lomax delegate racing for survival analysis with competing risks	❌	✅	✅	❌	❌	✅	❌	3
Nonparametric Density Estimation under Adversarial Losses	❌	❌	❌	❌	❌	❌	❌	0
Nonparametric learning from Bayesian models with randomized objective functions	✅	❌	✅	❌	✅	❌	✅	4
Norm matters: efficient and accurate normalization schemes in deep networks	❌	✅	✅	❌	✅	❌	❌	3
Norm-Ranging LSH for Maximum Inner Product Search	✅	✅	✅	❌	❌	❌	✅	4
Object-Oriented Dynamics Predictor	❌	✅	✅	❌	❌	❌	❌	2
Objective and efficient inference for couplings in neuronal networks	❌	❌	✅	❌	❌	❌	✅	2
Occam's razor is insufficient to infer the preferences of irrational agents	❌	❌	❌	❌	❌	❌	❌	0
On Binary Classification in Extreme Regions	✅	❌	✅	❌	❌	❌	✅	3
On Controllable Sparse Alternatives to Softmax	❌	❌	✅	❌	❌	❌	✅	2
On Coresets for Logistic Regression	❌	❌	✅	❌	✅	❌	✅	3
On Fast Leverage Score Sampling and Optimal Learning	✅	❌	✅	❌	✅	❌	✅	4
On GANs and GMMs	❌	✅	✅	❌	❌	❌	❌	2
On Learning Intrinsic Rewards for Policy Gradient Methods	✅	✅	✅	❌	❌	❌	✅	4
On Learning Markov Chains	❌	❌	❌	❌	❌	❌	✅	1
On Markov Chain Gradient Descent	❌	❌	❌	❌	❌	❌	✅	1
On Misinformation Containment in Online Social Networks	✅	❌	✅	❌	✅	❌	✅	4
On Neuronal Capacity	❌	❌	❌	❌	❌	❌	❌	0
On Oracle-Efficient PAC RL with Rich Observations	✅	❌	❌	❌	❌	❌	❌	1
On gradient regularizers for MMD GANs	❌	✅	✅	❌	❌	❌	✅	3
On preserving non-discrimination when combining expert advice	❌	❌	❌	❌	❌	❌	❌	0
On the Convergence and Robustness of Training GANs with Regularized Optimal Transport	✅	❌	✅	❌	❌	❌	✅	3
On the Dimensionality of Word Embedding	❌	✅	✅	❌	❌	❌	❌	2
On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport	❌	❌	❌	❌	❌	❌	❌	0
On the Local Hessian in Back-propagation	✅	❌	✅	✅	❌	❌	✅	4
On the Local Minima of the Empirical Risk	✅	❌	❌	❌	❌	❌	❌	1
One-Shot Unsupervised Cross Domain Translation	❌	✅	✅	❌	❌	❌	✅	3
Online Adaptive Methods, Universality and Acceleration	✅	❌	❌	❌	❌	❌	✅	2
Online Improper Learning with an Approximation Oracle	✅	❌	❌	❌	❌	❌	❌	1
Online Learning of Quantum States	✅	❌	❌	❌	❌	❌	❌	1
Online Learning with an Unknown Fairness Metric	❌	❌	❌	❌	❌	❌	❌	0
Online Reciprocal Recommendation with Theoretical Performance Guarantees	✅	❌	✅	❌	❌	❌	✅	3
Online Robust Policy Learning in the Presence of Unknown Adversaries	✅	✅	✅	❌	❌	❌	❌	3
Online Structure Learning for Feed-Forward and Recurrent Sum-Product Networks	✅	❌	✅	✅	✅	❌	✅	5
Online Structured Laplace Approximations for Overcoming Catastrophic Forgetting	❌	✅	✅	❌	❌	❌	✅	3
Online convex optimization for cumulative constraints	✅	❌	✅	❌	❌	❌	✅	3
Optimal Algorithms for Continuous Non-monotone Submodular and DR-Submodular Maximization	✅	❌	❌	❌	❌	❌	❌	1
Optimal Algorithms for Non-Smooth Distributed Optimization in Networks	✅	❌	❌	❌	❌	❌	❌	1
Optimal Subsampling with Influence Functions	✅	❌	✅	❌	❌	❌	❌	2
Optimistic optimization of a Brownian	✅	❌	❌	❌	❌	❌	❌	1
Optimization for Approximate Submodularity	✅	❌	❌	❌	❌	❌	❌	1
Optimization of Smooth Functions with Noisy Observations: Local Minimax Rates	❌	❌	❌	❌	❌	❌	❌	0
Optimization over Continuous and Multi-dimensional Decisions with Observational Data	❌	❌	✅	❌	❌	❌	❌	1
Orthogonally Decoupled Variational Gaussian Processes	❌	✅	✅	❌	✅	❌	✅	4
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering	❌	❌	✅	❌	❌	❌	✅	2
Out-of-Distribution Detection using Multiple Semantic Label Representations	❌	✅	✅	❌	❌	❌	✅	3
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization	❌	❌	✅	✅	✅	❌	✅	4
Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate	❌	❌	❌	❌	❌	❌	❌	0
Overlapping Clustering Models, and One (class) SVM to Bind Them All	✅	❌	✅	❌	❌	✅	✅	4
PAC-Bayes Tree: weighted subtrees with guarantees	✅	❌	✅	✅	❌	❌	❌	3
PAC-Bayes bounds for stable algorithms with instance-dependent priors	❌	❌	✅	✅	❌	❌	✅	3
PAC-learning in the presence of adversaries	❌	❌	❌	❌	❌	❌	❌	0
PCA of high dimensional random walks with comparison to neural network training	❌	❌	✅	❌	❌	❌	✅	2
PG-TS: Improved Thompson Sampling for Logistic Contextual Bandits	✅	❌	✅	❌	❌	❌	✅	3
PacGAN: The power of two samples in generative adversarial networks	❌	✅	✅	❌	❌	❌	❌	2
Parameters as interacting particles: long time convergence and asymptotic error scaling of neural networks	❌	❌	❌	❌	❌	❌	✅	1
Paraphrasing Complex Network: Network Compression via Factor Transfer	❌	❌	✅	✅	❌	❌	✅	3
Parsimonious Bayesian deep networks	✅	✅	✅	❌	✅	❌	✅	5
Parsimonious Quantile Regression of Financial Asset Tail Dynamics via Sequential Learning	❌	❌	❌	✅	❌	❌	✅	2
Partially-Supervised Image Captioning	✅	✅	✅	✅	✅	❌	✅	6
Pelee: A Real-Time Object Detection System on Mobile Devices	❌	✅	✅	✅	✅	❌	✅	5
Persistence Fisher Kernel: A Riemannian Manifold Kernel for Persistence Diagrams	✅	✅	✅	❌	❌	❌	✅	4
Phase Retrieval Under a Generative Prior	✅	❌	✅	❌	❌	❌	❌	2
Pipe-SGD: A Decentralized Pipelined SGD Framework for Distributed Deep Net Training	✅	❌	✅	✅	✅	✅	✅	6
Playing hard exploration games by watching YouTube	❌	❌	❌	❌	❌	❌	✅	1
Plug-in Estimation in High-Dimensional Linear Inverse Problems: A Rigorous Analysis	✅	❌	❌	❌	❌	❌	✅	2
Point process latent variable models of larval zebrafish behavior	❌	❌	❌	✅	❌	❌	✅	2
PointCNN: Convolution On X-Transformed Points	✅	✅	✅	❌	✅	❌	✅	5
Poison Frogs! Targeted Clean-Label Poisoning Attacks on Neural Networks	✅	✅	✅	❌	❌	❌	✅	4
Policy Optimization via Importance Sampling	✅	✅	✅	❌	✅	❌	✅	5
Policy Regret in Repeated Games	❌	❌	❌	❌	❌	❌	❌	0
Policy-Conditioned Uncertainty Sets for Robust Markov Decision Processes	✅	❌	❌	❌	❌	❌	✅	2
Porcupine Neural Networks: Approximating Neural Network Landscapes	❌	✅	✅	❌	❌	❌	✅	3
Post: Device Placement with Cross-Entropy Minimization and Proximal Policy Optimization	✅	❌	✅	❌	✅	❌	✅	4
Posterior Concentration for Sparse Deep Learning	❌	❌	❌	❌	❌	❌	❌	0
Power-law efficient neural codes provide general link between perceptual bias and discriminability	❌	❌	❌	❌	❌	❌	✅	1
Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching	❌	❌	✅	✅	✅	❌	✅	4
Practical Methods for Graph Two-Sample Testing	❌	✅	✅	❌	❌	❌	✅	3
Practical exact algorithm for trembling-hand equilibrium refinements in games	❌	❌	❌	❌	❌	✅	✅	2
Precision and Recall for Time Series	✅	❌	✅	❌	✅	❌	✅	4
Predict Responsibly: Improving Fairness and Accuracy by Learning to Defer	❌	✅	✅	❌	❌	❌	✅	3
Predictive Approximate Bayesian Computation via Saddle Points	✅	❌	❌	❌	❌	❌	✅	2
Predictive Uncertainty Estimation via Prior Networks	❌	✅	✅	❌	❌	❌	❌	2
Preference Based Adaptation for Learning Objectives	✅	❌	✅	✅	❌	❌	✅	4
Privacy Amplification by Subsampling: Tight Analyses via Couplings and Divergences	❌	❌	❌	❌	❌	❌	❌	0
Probabilistic Matrix Factorization for Automated Machine Learning	❌	✅	✅	✅	❌	❌	✅	4
Probabilistic Model-Agnostic Meta-Learning	✅	✅	✅	❌	❌	❌	❌	3
Probabilistic Neural Programmed Networks for Scene Generation	❌	✅	✅	❌	❌	❌	✅	3
Processing of missing data by neural networks	❌	✅	✅	✅	❌	❌	✅	4
Provable Gaussian Embedding with One Observation	❌	❌	❌	✅	❌	❌	✅	2
Provable Variational Inference for Constrained Log-Submodular Models	❌	❌	✅	❌	❌	❌	❌	1
Provably Correct Automatic Sub-Differentiation for Qualified Programs	✅	❌	❌	❌	❌	❌	❌	1
Proximal Graphical Event Models	✅	❌	✅	❌	❌	❌	✅	3
Proximal SCOPE for Distributed Sparse Learning	✅	❌	✅	❌	✅	❌	✅	4
Q-learning with Nearest Neighbors	✅	❌	❌	❌	❌	❌	❌	1
Quadratic Decomposable Submodular Function Minimization	✅	✅	✅	❌	✅	❌	✅	5
Quadrature-based features for kernel approximation	❌	✅	✅	❌	❌	❌	✅	3
Quantifying Learning Guarantees for Convex but Inconsistent Surrogates	❌	❌	❌	❌	❌	❌	❌	0
Query Complexity of Bayesian Private Learning	✅	❌	❌	❌	❌	❌	❌	1
Query K-means Clustering and the Double Dixie Cup Problem	✅	❌	✅	❌	❌	❌	✅	3
REFUEL: Exploring Sparse Features in Deep Reinforcement Learning for Fast Disease Diagnosis	✅	❌	❌	✅	❌	❌	✅	3
Random Feature Stein Discrepancies	❌	✅	❌	❌	❌	❌	✅	2
Randomized Prior Functions for Deep Reinforcement Learning	✅	❌	✅	❌	❌	❌	✅	3
Re-evaluating evaluation	✅	✅	✅	❌	❌	❌	❌	3
Realistic Evaluation of Deep Semi-Supervised Learning Algorithms	❌	✅	✅	✅	❌	❌	✅	4
Rectangular Bounding Process	❌	❌	✅	❌	❌	❌	✅	2
Recurrent Relational Networks	❌	✅	✅	✅	✅	❌	✅	5
Recurrent Transformer Networks for Semantic Correspondence	❌	❌	✅	✅	❌	❌	❌	2
Recurrent World Models Facilitate Policy Evolution	✅	❌	❌	❌	❌	❌	✅	2
Recurrently Controlled Recurrent Networks	❌	✅	✅	❌	✅	❌	✅	4
Reducing Network Agnostophobia	❌	✅	✅	❌	❌	❌	❌	2
Regret Bounds for Online Portfolio Selection with a Cardinality Constraint	✅	❌	✅	❌	❌	❌	✅	3
Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator	✅	❌	❌	❌	❌	❌	✅	2
Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior	✅	✅	❌	❌	❌	❌	✅	3
Regularization Learning Networks: Deep Learning for Tabular Datasets	❌	✅	✅	❌	❌	❌	❌	2
Regularizing by the Variance of the Activations' Sample-Variances	❌	❌	✅	✅	✅	❌	✅	4
Reinforced Continual Learning	✅	❌	✅	❌	✅	❌	❌	3
Reinforcement Learning for Solving the Vehicle Routing Problem	❌	❌	❌	❌	✅	❌	✅	2
Reinforcement Learning of Theorem Proving	❌	❌	✅	❌	✅	❌	✅	3
Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach	✅	❌	✅	❌	❌	❌	✅	3
Relating Leverage Scores and Density using Regularized Christoffel Functions	❌	❌	❌	❌	❌	❌	✅	1
Relational recurrent neural networks	❌	✅	✅	❌	❌	❌	✅	3
Removing Hidden Confounding by Experimental Grounding	✅	❌	✅	✅	❌	❌	✅	4
Removing the Feature Correlation Effect of Multiplicative Noise	❌	✅	✅	❌	❌	❌	❌	2
RenderNet: A deep convolutional network for differentiable rendering from 3D shapes	❌	✅	✅	❌	❌	❌	✅	3
Reparameterization Gradient for Non-differentiable Models	❌	✅	✅	❌	❌	❌	✅	3
Representation Balancing MDPs for Off-policy Policy Evaluation	❌	❌	✅	❌	❌	❌	✅	2
Representation Learning for Treatment Effect Estimation from Observational Data	❌	✅	✅	❌	✅	❌	❌	3
Representation Learning of Compositional Data	❌	✅	✅	❌	❌	❌	✅	3
Representer Point Selection for Explaining Deep Neural Networks	❌	✅	✅	❌	❌	❌	✅	3
ResNet with one-neuron hidden layers is a Universal Approximator	❌	❌	❌	❌	❌	❌	✅	1
Rest-Katyusha: Exploiting the Solution's Structure via Scheduled Restart Schemes	✅	❌	✅	❌	❌	❌	✅	3
RetGK: Graph Kernels based on Return Probabilities of Random Walks	❌	❌	✅	✅	✅	❌	✅	4
Reversible Recurrent Neural Networks	✅	✅	✅	✅	❌	❌	✅	5
Revisiting $(\epsilon, \gamma, \tau)$-similarity learning for domain adaptation	❌	❌	❌	❌	❌	❌	✅	1
Revisiting Decomposable Submodular Function Minimization with Incidence Relations	✅	✅	✅	❌	❌	❌	✅	4
Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection	❌	❌	✅	✅	❌	❌	✅	3
Reward learning from human preferences and demonstrations in Atari	✅	❌	✅	❌	❌	❌	✅	3
Ridge Regression and Provable Deterministic Ridge Leverage Score Sampling	✅	✅	✅	❌	❌	❌	✅	4
Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias	❌	❌	❌	❌	✅	❌	✅	2
Robust Detection of Adversarial Attacks by Modeling the Intrinsic Properties of Deep Neural Networks	❌	❌	✅	❌	❌	❌	✅	2
Robust Hypothesis Testing Using Wasserstein Uncertainty Sets	❌	❌	✅	❌	❌	❌	✅	2
Robust Learning of Fixed-Structure Bayesian Networks	✅	❌	✅	❌	✅	❌	✅	4
Robust Subspace Approximation in a Stream	✅	❌	✅	❌	❌	❌	❌	2
Robustness of conditional GANs to noisy labels	❌	✅	✅	❌	✅	❌	❌	3
SEGA: Variance Reduction via Gradient Sketching	✅	❌	❌	❌	❌	❌	✅	2
SING: Symbol-to-Instrument Neural Generator	❌	✅	✅	❌	✅	❌	✅	4
SLANG: Fast Structured Covariance Approximations for Bayesian Deep Learning with Natural Gradient	✅	✅	✅	✅	❌	❌	✅	5
SLAYER: Spike Layer Error Reassignment in Time	❌	✅	✅	❌	❌	❌	✅	3
SNIPER: Efficient Multi-Scale Training	❌	✅	✅	✅	✅	❌	✅	5
SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path-Integrated Differential Estimator	✅	❌	❌	❌	❌	❌	❌	1
Safe Active Learning for Time-Series Modeling with Gaussian Processes	✅	❌	❌	❌	❌	❌	✅	2
Sample Efficient Stochastic Gradient Iterative Hard Thresholding Method for Stochastic Sparse Linear Regression with Limited Attribute Observation	✅	❌	✅	❌	❌	❌	❌	2
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion	❌	✅	✅	❌	✅	❌	❌	3
Sanity Checks for Saliency Maps	❌	✅	✅	❌	❌	❌	❌	2
Scalable Coordinated Exploration in Concurrent Reinforcement Learning	❌	❌	❌	❌	❌	❌	✅	1
Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation	✅	❌	✅	❌	❌	✅	✅	4
Scalable Hyperparameter Transfer Learning	❌	❌	✅	❌	✅	❌	✅	3
Scalable Laplacian K-modes	✅	✅	✅	✅	✅	❌	✅	6
Scalable Robust Matrix Factorization with Nonconvex Loss	✅	❌	✅	✅	✅	❌	✅	5
Scalable methods for 8-bit training of neural networks	❌	✅	✅	❌	✅	❌	✅	4
Scalar Posterior Sampling with Applications	✅	❌	✅	❌	❌	❌	✅	3
Scaling Gaussian Process Regression with Derivatives	✅	✅	✅	❌	❌	❌	❌	3
Scaling provable adversarial defenses	✅	✅	✅	❌	✅	❌	✅	5
Scaling the Poisson GLM to massive neural datasets through polynomial approximations	❌	✅	✅	✅	❌	❌	✅	4
Searching for Efficient Multi-Scale Architectures for Dense Image Prediction	❌	✅	✅	✅	✅	❌	✅	5
See and Think: Disentangling Semantic Scene Completion	❌	✅	✅	✅	✅	❌	✅	5
Self-Erasing Network for Integral Object Attention	✅	❌	✅	✅	❌	❌	✅	4
Self-Supervised Generation of Spatial Audio for 360° Video	❌	✅	✅	❌	✅	❌	✅	4
Semi-Supervised Learning with Declaratively Specified Entropy Constraints	❌	❌	✅	❌	❌	❌	✅	2
Semi-crowdsourced Clustering with Deep Generative Models	✅	✅	✅	❌	❌	❌	✅	4
Semi-supervised Deep Kernel Learning: Regression with Unlabeled Data by Minimizing Predictive Variance	❌	✅	✅	✅	❌	❌	✅	4
Semidefinite relaxations for certifying robustness to adversarial examples	❌	✅	✅	❌	✅	❌	✅	4
Sequence-to-Segment Networks for Segment Detection	❌	✅	✅	✅	✅	❌	✅	5
Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects	✅	✅	✅	✅	❌	❌	✅	5
Sequential Context Encoding for Duplicate Removal	❌	✅	✅	✅	✅	❌	✅	5
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling	❌	❌	❌	❌	❌	❌	✅	1
Sharp Bounds for Generalized Uniformity Testing	✅	❌	❌	❌	❌	❌	❌	1
Sigsoftmax: Reanalysis of the Softmax Bottleneck	❌	❌	✅	✅	❌	❌	✅	3
SimplE Embedding for Link Prediction in Knowledge Graphs	❌	✅	✅	✅	❌	❌	✅	4
Simple random search of static linear policies is competitive for reinforcement learning	✅	❌	✅	❌	❌	✅	✅	4
Simple, Distributed, and Accelerated Probabilistic Programming	✅	✅	✅	❌	✅	✅	✅	6
Single-Agent Policy Tree Search With Guarantees	✅	❌	✅	❌	❌	❌	✅	3
Size-Noise Tradeoffs in Generative Networks	❌	❌	✅	❌	❌	❌	❌	1
Sketching Method for Large Scale Combinatorial Inference	✅	❌	✅	✅	❌	❌	✅	4
Smoothed Analysis of Discrete Tensor Decomposition and Assemblies of Neurons	❌	❌	❌	❌	❌	❌	❌	0
Smoothed analysis of the low-rank approach for smooth semidefinite programs	❌	❌	❌	❌	❌	❌	✅	1
Snap ML: A Hierarchical Framework for Machine Learning	❌	❌	✅	❌	✅	❌	✅	3
Soft-Gated Warping-GAN for Pose-Guided Person Image Synthesis	❌	❌	✅	❌	❌	❌	✅	2
Solving Large Sequential Games with the Excessive Gap Technique	✅	❌	❌	❌	✅	❌	✅	3
Solving Non-smooth Constrained Programs with Lower Complexity than $\mathcal{O}(1/\varepsilon)$: A Primal-Dual Homotopy Smoothing Approach	✅	❌	❌	❌	❌	❌	✅	2
Sparse Attentive Backtracking: Temporal Credit Assignment Through Reminding	✅	❌	✅	✅	❌	❌	✅	4
Sparse DNNs with Improved Adversarial Robustness	❌	❌	✅	❌	❌	❌	✅	2
Sparse PCA from Sparse Linear Regression	✅	❌	❌	❌	❌	❌	✅	2
Sparsified SGD with Memory	✅	✅	✅	❌	✅	❌	✅	5
Speaker-Follower Models for Vision-and-Language Navigation	✅	✅	✅	✅	❌	❌	✅	5
Spectral Filtering for General Linear Dynamical Systems	✅	❌	❌	❌	❌	❌	✅	2
Spectral Signatures in Backdoor Attacks	✅	❌	✅	❌	❌	❌	❌	2
SplineNets: Continuous Neural Decision Graphs	❌	❌	✅	❌	❌	❌	✅	2
Stacked Semantics-Guided Attention Model for Fine-Grained Zero-Shot Learning	❌	✅	✅	✅	❌	❌	✅	4
Statistical Optimality of Stochastic Gradient Descent on Hard Learning Problems through Multiple Passes	✅	❌	✅	❌	❌	❌	✅	3
Statistical and Computational Trade-Offs in Kernel K-Means	✅	❌	✅	✅	✅	❌	✅	5
Statistical mechanics of low-rank tensor decomposition	❌	✅	❌	❌	❌	❌	✅	2
Stein Variational Gradient Descent as Moment Matching	❌	❌	❌	❌	❌	❌	❌	0
Step Size Matters in Deep Learning	❌	✅	❌	❌	❌	❌	✅	2
Stimulus domain transfer in recurrent models for large scale cortical population prediction on video	❌	✅	✅	✅	✅	❌	✅	5
Stochastic Chebyshev Gradient Descent for Spectral Optimization	✅	❌	✅	❌	❌	❌	❌	2
Stochastic Composite Mirror Descent: Optimal Bounds with High Probabilities	❌	❌	✅	❌	❌	❌	✅	2
Stochastic Cubic Regularization for Fast Nonconvex Optimization	✅	❌	✅	❌	❌	❌	✅	3
Stochastic Expectation Maximization with Variance Reduction	✅	❌	✅	❌	✅	❌	✅	4
Stochastic Nested Variance Reduction for Nonconvex Optimization	✅	❌	✅	❌	✅	✅	✅	5
Stochastic Nonparametric Event-Tensor Decomposition	❌	❌	✅	✅	❌	❌	✅	3
Stochastic Primal-Dual Method for Empirical Risk Minimization with O(1) Per-Iteration Complexity	✅	❌	✅	❌	❌	❌	✅	3
Stochastic Spectral and Conjugate Descent Methods	✅	❌	❌	❌	❌	❌	✅	2
Streaming Kernel PCA with $\tilde{O}(\sqrt{n})$ Random Features	✅	❌	✅	✅	❌	❌	✅	4
Streamlining Variational Inference for Constraint Satisfaction Problems	✅	✅	❌	✅	❌	❌	✅	4
Structural Causal Bandits: Where to Intervene?	✅	✅	❌	❌	❌	❌	✅	3
Structure-Aware Convolutional Neural Networks	❌	✅	✅	✅	❌	❌	✅	4
Structured Local Minima in Sparse Blind Deconvolution	✅	❌	❌	❌	❌	❌	❌	1
Sublinear Time Low-Rank Approximation of Distance Matrices	✅	❌	✅	❌	✅	❌	✅	4
Submodular Field Grammars: Representation, Inference, and Application to Image Parsing	✅	❌	✅	❌	✅	❌	✅	4
Submodular Maximization via Gradient Ascent: The Case of Deep Submodular Functions	✅	❌	❌	❌	❌	❌	❌	1
Supervised autoencoders: Improving generalization performance with unsupervised regularizers	❌	❌	✅	✅	❌	❌	✅	3
Supervising Unsupervised Learning	❌	❌	✅	❌	❌	❌	✅	2
Support Recovery for Orthogonal Matching Pursuit: Upper and Lower bounds	✅	❌	❌	❌	❌	❌	✅	2
Symbolic Graph Reasoning Meets Convolutions	❌	❌	✅	✅	✅	❌	✅	4
Synaptic Strength For Convolutional Neural Network	❌	❌	✅	✅	❌	❌	✅	3
Synthesized Policies for Transfer and Adaptation across Tasks and Environments	❌	❌	✅	❌	❌	❌	✅	2
TADAM: Task dependent adaptive metric for improved few-shot learning	❌	❌	✅	✅	❌	❌	✅	3
TETRIS: TilE-matching the TRemendous Irregular Sparsity	✅	❌	✅	❌	✅	❌	✅	4
Tangent: Automatic differentiation using source-code transformation for dynamically typed array programming	❌	✅	❌	❌	✅	✅	✅	4
Task-Driven Convolutional Recurrent Models of the Visual System	❌	✅	✅	✅	✅	❌	✅	5
Teaching Inverse Reinforcement Learners via Features and Demonstrations	✅	❌	❌	❌	❌	❌	✅	2
Temporal Regularization for Markov Decision Process	✅	✅	✅	❌	❌	❌	✅	4
Temporal alignment and latent Gaussian process factor inference in population spike trains	❌	❌	✅	✅	❌	❌	✅	3
Testing for Families of Distributions via the Fourier Transform	✅	❌	❌	❌	❌	❌	❌	1
Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language	❌	❌	✅	❌	❌	❌	✅	2
The Cluster Description Problem - Complexity Results, Formulations and Approximations	✅	✅	❌	❌	✅	❌	✅	4
The Convergence of Sparsified Gradient Methods	✅	❌	✅	❌	❌	❌	❌	2
The Description Length of Deep Learning models	✅	✅	✅	✅	❌	❌	✅	5
The Effect of Network Width on the Performance of Large-batch Training	❌	❌	✅	❌	✅	❌	✅	3
The Everlasting Database: Statistical Validity at a Fair Price	✅	❌	❌	❌	❌	❌	❌	1
The Global Anchor Method for Quantifying Linguistic Shifts and Domain Adaptation	✅	✅	✅	❌	❌	❌	❌	3
The Importance of Sampling inMeta-Reinforcement Learning	✅	✅	❌	❌	❌	❌	✅	3
The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization	❌	❌	❌	❌	❌	❌	❌	0
The Limits of Post-Selection Generalization	✅	❌	❌	❌	❌	❌	❌	1
The Lingering of Gradients: How to Reuse Gradients Over Time	✅	❌	✅	❌	❌	❌	✅	3
The Nearest Neighbor Information Estimator is Adaptively Near Minimax Rate-Optimal	❌	❌	❌	❌	❌	❌	❌	0
The Pessimistic Limits and Possibilities of Margin-based Losses in Semi-supervised Learning	❌	❌	❌	❌	❌	❌	❌	0
The Physical Systems Behind Optimization Algorithms	❌	❌	❌	❌	❌	❌	✅	1
The Price of Fair PCA: One Extra dimension	✅	❌	✅	❌	❌	❌	❌	2
The Price of Privacy for Low-rank Factorization	✅	❌	❌	❌	❌	❌	❌	1
The Sample Complexity of Semi-Supervised Learning with Nonparametric Mixture Models	❌	❌	✅	❌	❌	❌	✅	2
The Sparse Manifold Transform	❌	❌	❌	❌	❌	❌	✅	1
The Spectrum of the Fisher Information Matrix of a Single-Hidden-Layer Neural Network	❌	❌	❌	❌	❌	❌	✅	1
The challenge of realistic music generation: modelling raw audio at scale	❌	❌	❌	❌	❌	❌	❌	0
The committee machine: Computational to statistical gaps in learning a two-layers neural network	✅	✅	❌	❌	✅	❌	❌	3
The emergence of multiple retinal cell types through efficient coding of natural movies	❌	✅	❌	❌	❌	❌	✅	2
The promises and pitfalls of Stochastic Gradient Langevin Dynamics	❌	❌	✅	❌	❌	❌	✅	2
The streaming rollout of deep networks - towards fully model-parallel execution	❌	✅	✅	❌	❌	❌	✅	3
Theoretical Linear Convergence of Unfolded ISTA and Its Practical Weights and Thresholds	❌	✅	✅	✅	❌	❌	✅	4
Theoretical guarantees for EM under misspecified Gaussian mixture models	❌	❌	❌	❌	❌	❌	✅	1
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning	✅	❌	✅	✅	❌	❌	✅	4
Third-order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima	✅	❌	✅	❌	✅	✅	✅	5
Thwarting Adversarial Examples: An $L_0$-Robust Sparse Fourier Transform	✅	❌	✅	❌	❌	❌	✅	3
Tight Bounds for Collaborative PAC Learning via Multiplicative Weights	✅	❌	✅	❌	❌	❌	✅	3
To Trust Or Not To Trust A Classifier	✅	✅	✅	✅	❌	❌	✅	5
Toddler-Inspired Visual Object Learning	❌	❌	❌	❌	❌	❌	✅	1
TopRank: A practical algorithm for online stochastic ranking	✅	❌	✅	❌	❌	❌	✅	3
Topkapi: Parallel and Fast Sketches for Finding Top-K Frequent Elements	✅	✅	✅	❌	✅	❌	✅	5
Total stochastic gradient algorithms and applications in reinforcement learning	✅	❌	❌	❌	❌	❌	✅	2
Towards Deep Conversational Recommendations	❌	❌	✅	✅	❌	❌	❌	2
Towards Robust Detection of Adversarial Examples	❌	❌	✅	❌	❌	❌	✅	2
Towards Robust Interpretability with Self-Explaining Neural Networks	❌	❌	✅	❌	❌	❌	❌	1
Towards Text Generation with Adversarially Learned Neural Outlines	❌	❌	✅	❌	✅	❌	✅	3
Towards Understanding Acceleration Tradeoff between Momentum and Asynchrony in Nonconvex Stochastic Optimization	❌	❌	✅	✅	✅	❌	✅	4
Towards Understanding Learning Representations: To What Extent Do Different Neural Networks Learn the Same Representation	✅	❌	✅	✅	❌	❌	❌	3
Trading robust representations for sample complexity through self-supervised visual experience	❌	❌	✅	✅	✅	❌	✅	4
Training DNNs with Hybrid Block Floating Point	❌	❌	✅	❌	✅	❌	❌	2
Training Deep Models Faster with Robust, Approximate Importance Sampling	✅	❌	✅	❌	❌	❌	✅	3
Training Deep Neural Networks with 8-bit Floating Point Numbers	✅	❌	✅	❌	❌	❌	✅	3
Training Neural Networks Using Features Replay	✅	❌	✅	❌	✅	❌	✅	4
Training deep learning based denoisers without ground truth data	❌	✅	✅	✅	✅	❌	✅	5
Trajectory Convolution for Action Recognition	❌	❌	✅	✅	✅	❌	✅	4
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis	❌	❌	✅	✅	❌	❌	✅	3
Transfer Learning with Neural AutoML	❌	❌	✅	✅	❌	❌	✅	3
Transfer of Deep Reactive Policies for MDP Planning	❌	✅	✅	❌	❌	❌	✅	3
Transfer of Value Functions via Variational Methods	✅	❌	❌	✅	✅	❌	✅	4
Tree-to-tree Neural Networks for Program Translation	❌	❌	❌	✅	❌	❌	❌	1
Turbo Learning for CaptionBot and DrawingBot	❌	❌	✅	✅	❌	❌	✅	3
Uncertainty Sampling is Preconditioned Stochastic Gradient Descent on Zero-One Loss	✅	✅	✅	❌	❌	❌	✅	4
Uncertainty-Aware Attention for Reliable Interpretation and Prediction	❌	✅	✅	✅	❌	❌	❌	3
Understanding Batch Normalization	❌	❌	✅	❌	❌	❌	✅	2
Understanding Regularized Spectral Clustering via Graph Conductance	❌	❌	✅	❌	❌	✅	✅	3
Understanding Weight Normalized Deep Neural Networks with Rectified Linear Units	❌	❌	❌	❌	❌	❌	❌	0
Understanding the Role of Adaptivity in Machine Teaching: The Case of Version Space Learners	✅	❌	❌	❌	❌	❌	✅	2
Uniform Convergence of Gradients for Non-Convex Learning and Optimization	❌	❌	❌	❌	❌	❌	❌	0
Universal Growth in Production Economies	❌	❌	❌	❌	❌	❌	❌	0
Unorganized Malicious Attacks Detection	✅	❌	✅	❌	❌	❌	✅	3
Unsupervised Adversarial Invariance	❌	❌	✅	✅	❌	❌	✅	3
Unsupervised Attention-guided Image-to-Image Translation	✅	✅	✅	❌	❌	❌	✅	4
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces	❌	❌	✅	❌	❌	❌	✅	2
Unsupervised Depth Estimation, 3D Face Rotation and Replacement	❌	❌	✅	✅	❌	❌	❌	2
Unsupervised Image-to-Image Translation Using Domain-Specific Variational Information Bound	❌	❌	✅	❌	❌	❌	✅	2
Unsupervised Learning of Artistic Styles with Archetypal Style Analysis	❌	✅	✅	❌	❌	❌	❌	2
Unsupervised Learning of Object Landmarks through Conditional Image Generation	❌	✅	✅	✅	❌	❌	✅	4
Unsupervised Learning of Shape and Pose with Differentiable Point Clouds	❌	✅	✅	✅	❌	❌	✅	4
Unsupervised Learning of View-invariant Action Representations	❌	❌	✅	✅	❌	❌	✅	3
Unsupervised Text Style Transfer using Language Models as Discriminators	❌	❌	✅	✅	❌	❌	✅	3
Unsupervised Video Object Segmentation for Deep Reinforcement Learning	❌	✅	✅	❌	❌	❌	✅	3
Uplift Modeling from Separate Labels	❌	❌	✅	✅	❌	❌	✅	3
Using Large Ensembles of Control Variates for Variational Inference	❌	❌	✅	❌	❌	❌	✅	2
Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise	✅	✅	✅	✅	❌	❌	✅	5
Variance-Reduced Stochastic Gradient Descent on Streaming Data	✅	❌	✅	❌	❌	❌	✅	3
Variational Bayesian Monte Carlo	✅	✅	✅	❌	❌	❌	✅	4
Variational Inference with Tail-adaptive f-Divergence	✅	✅	✅	❌	❌	❌	✅	4
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition	✅	❌	❌	❌	❌	❌	✅	2
Variational Learning on Aggregate Outputs with Gaussian Processes	❌	✅	✅	✅	❌	❌	✅	4
Variational Memory Encoder-Decoder	✅	✅	✅	✅	❌	❌	❌	4
Variational PDEs for Acceleration on Manifolds and Application to Diffeomorphisms	❌	❌	❌	❌	❌	❌	✅	1
Verifiable Reinforcement Learning via Policy Extraction	✅	❌	✅	❌	✅	❌	✅	4
Video Prediction via Selective Sampling	❌	❌	✅	❌	❌	❌	✅	2
Video-to-Video Synthesis	❌	✅	✅	✅	✅	❌	✅	5
VideoCapsuleNet: A Simplified Network for Action Detection	❌	❌	✅	✅	✅	❌	✅	4
Virtual Class Enhanced Discriminative Embedding Learning	❌	❌	✅	❌	✅	❌	✅	3
Visual Memory for Robust Path Following	❌	✅	✅	✅	❌	❌	✅	4
Visual Object Networks: Image Generation with Disentangled 3D Representations	❌	✅	✅	❌	❌	❌	✅	3
Visual Reinforcement Learning with Imagined Goals	✅	❌	❌	❌	❌	❌	✅	2
Visualizing the Loss Landscape of Neural Nets	❌	✅	✅	❌	❌	❌	✅	3
Wasserstein Distributionally Robust Kalman Filtering	✅	✅	❌	❌	✅	❌	✅	4
Wasserstein Variational Inference	✅	❌	✅	✅	❌	❌	✅	4
Watch Your Step: Learning Node Embeddings via Graph Attention	❌	✅	✅	✅	❌	❌	✅	4
Wavelet regression and additive models for irregularly spaced data	❌	❌	✅	✅	❌	✅	✅	4
Weakly Supervised Dense Event Captioning in Videos	❌	✅	✅	✅	❌	✅	✅	5
When do random forests fail?	✅	❌	❌	❌	❌	❌	❌	1
Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior	❌	✅	✅	❌	❌	❌	❌	2
Which Neural Net Architectures Give Rise to Exploding and Vanishing Gradients?	❌	❌	✅	❌	❌	❌	❌	1
Why Is My Classifier Discriminatory?	❌	❌	✅	✅	❌	❌	❌	2
Why so gloomy? A Bayesian explanation of human pessimism bias in the multi-armed bandit task	❌	❌	❌	✅	❌	❌	✅	2
With Friends Like These, Who Needs Adversaries?	✅	✅	✅	❌	❌	❌	❌	3
Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning	✅	❌	❌	❌	❌	❌	✅	2
Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization	✅	✅	✅	❌	❌	❌	✅	4
Zeroth-order (Non)-Convex Stochastic Optimization via Conditional Gradient and Gradient Updates	✅	❌	❌	❌	❌	❌	❌	1
cpSGD: Communication-efficient and differentially-private distributed SGD	❌	❌	✅	❌	❌	❌	✅	2
e-SNLI: Natural Language Inference with Natural Language Explanations	❌	✅	✅	✅	❌	❌	✅	4
rho-POMDPs have Lipschitz-Continuous epsilon-Optimal Value Functions	✅	✅	✅	❌	✅	❌	✅	5