Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

International Conference on Machine Learning (ICML) - 2016

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
ICML 2016 322 0.36 3.07 93.17% 33.0%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
A Box-Constrained Approach for Hard Permutation Problems βœ… ❌ βœ… ❌ ❌ βœ… βœ… 4
A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
A Convolutional Attention Network for Extreme Summarization of Source Code βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
A Deep Learning Approach to Unsupervised Ensemble Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
A Distributed Variational Inference Framework for Unifying Parallel Sparse Gaussian Process Regression Models ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
A Kernel Test of Goodness of Fit ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
A Kernelized Stein Discrepancy for Goodness-of-fit Tests βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
A Kronecker-factored approximate Fisher matrix for convolution layers ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
A Neural Autoregressive Approach to Collaborative Filtering ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
A New PAC-Bayesian Perspective on Domain Adaptation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
A Random Matrix Approach to Echo-State Neural Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A Self-Correcting Variable-Metric Algorithm for Stochastic Optimization βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
A Simple and Provable Algorithm for Sparse Diagonal CCA βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
A Simple and Strongly-Local Flow-Based Method for Cut Improvement βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
A Subspace Learning Approach for High Dimensional Matrix Decomposition with Efficient Column/Row Sampling βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
A Superlinearly-Convergent Proximal Newton-type Method for the Optimization of Finite Sums βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Theory of Generative ConvNet βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Variational Analysis of Stochastic Gradient Algorithms ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A ranking approach to global optimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
ADIOS: Architectures Deep In Output Space βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Accurate Robust and Efficient Error Estimation for Decision Trees ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Actively Learning Hemimetrics with Applications to Eliciting User Preferences βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adaptive Algorithms for Online Convex Optimization with Long-term Constraints βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adaptive Sampling for SGD by Exploiting Side Information βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Additive Approximations in High Dimensional Nonparametric Regression via the SALSA ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Algorithms for Optimizing the Ratio of Submodular Functions βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
An optimal algorithm for the Thresholding Bandit Problem βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Analysis of Deep Neural Networks with Extended Data Jacobian Matrix ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Analysis of Variational Bayesian Factorizations for Sparse and Low-Rank Estimation ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Anytime Exploration for Multi-armed Bandits using Confidence Information βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Anytime optimal algorithms in stochastic multi-armed bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Ask Me Anything: Dynamic Memory Networks for Natural Language Processing ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Associative Long Short-Term Memory ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Asymmetric Multi-task Learning Based on Task Relatedness and Loss βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
Asynchronous Methods for Deep Reinforcement Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Augmenting Supervised Neural Networks with Unsupervised Objectives for Large-scale Image Classification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Autoencoding beyond pixels using a learned similarity metric βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Automatic Construction of Nonparametric Relational Regression Models for Multiple Time Series βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Auxiliary Deep Generative Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
BASC: Applying Bayesian Optimization to the Search for Global Minima on Potential Energy Surfaces ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Barron and Cover’s Theory in Supervised Learning and its Application to Lasso ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Bayesian Poisson Tucker Decomposition for Learning the Structure of International Relations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Benchmarking Deep Reinforcement Learning for Continuous Control ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Beyond CCA: Moment Matching for Multi-View Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Beyond Parity Constraints: Fourier Analysis of Hash Functions for Inference βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Bidirectional Helmholtz Machines βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Binary embeddings with structured hashed projections ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Black-Box Alpha Divergence Minimization ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Black-box Optimization with a Politician βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Boolean Matrix Factorization and Noisy Completion via Message Passing βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bounded Off-Policy Evaluation with Missing Data for Course Recommendation and Curriculum Design βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Clustering High Dimensional Categorical Data via Topographical Features βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Collapsed Variational Inference for Sum-Product Networks βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Community Recovery in Graphs with Locality βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Complex Embeddings for Simple Link Prediction ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Compressive Spectral Clustering βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Computationally Efficient NystrΓΆm Approximation using Fast Transforms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Conditional Bernoulli Mixtures for Multi-label Classification βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Conditional Dependence via Shannon Capacity: Axioms, Estimators and Applications ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Conservative Bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Contextual Combinatorial Cascading Bandits βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Continuous Deep Q-Learning with Model-based Acceleration βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Control of Memory, Active Perception, and Action in Minecraft ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Controlling the distance to a Kemeny consensus without computing it βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Convergence of Stochastic Gradient Descent for PCA βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Convolutional Rectifier Networks as Generalized Tensor Decompositions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Copeland Dueling Bandit Problem: Regret Lower Bound, Optimal Algorithm, and Computationally Efficient Algorithm βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Correcting Forecasts with Multifactor Neural Attention ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Correlation Clustering and Biclustering with Locally Bounded Errors βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Cross-Graph Learning of Multi-Relational Associations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
CryptoNets: Applying Neural Networks to Encrypted Data with High Throughput and Accuracy ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
DCM Bandits: Learning to Rank with Multiple Clicks βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
DR-ABC: Approximate Bayesian Computation with Kernel-Based Distribution Regression βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Data-driven Rank Breaking for Efficient Rank Aggregation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Dealbreaker: A Nonlinear Latent Variable Model for Educational Data ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Deconstructing the Ladder Network Architecture ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Deep Gaussian Processes for Regression using Approximate Expectation Propagation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Deep Structured Energy Based Models for Anomaly Detection ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Dictionary Learning for Massive Matrix Factorization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Differential Geometric Regularization for Supervised Learning of Classifiers βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Differentially Private Chi-Squared Hypothesis Testing: Goodness of Fit and Independence Testing βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Differentially Private Policy Evaluation βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Dirichlet Process Mixture Model for Correcting Technical Variation in Single-Cell Gene Expression Data ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Discrete Deep Feature Extraction: A Theory and New Architectures ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Discrete Distribution Estimation under Local Privacy ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Discriminative Embeddings of Latent Variable Models for Structured Data βœ… βœ… βœ… βœ… βœ… ❌ ❌ 5
Distributed Clustering of Linear Bandits in Peer to Peer Networks βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Diversity-Promoting Bayesian Learning of Latent Variable Models ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Domain Adaptation with Conditional Transferable Components ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Doubly Decomposing Nonparametric Tensor Regression βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Dropout distillation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Dueling Network Architectures for Deep Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Dynamic Capacity Networks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Dynamic Memory Networks for Visual and Textual Question Answering βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Early and Reliable Event Detection Using Proximity Space Representation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Efficient Algorithms for Adversarial Contextual Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Efficient Algorithms for Large-scale Generalized Eigenvector Computation and Canonical Correlation Analysis βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Efficient Learning with a Family of Nonconvex Regularizers by Redistributing Nonconvexity βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Efficient Multi-Instance Learning for Activity Recognition from Time Series Data Using an Auto-Regressive Hidden Markov Model ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Efficient Private Empirical Risk Minimization for High-dimensional Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Energetic Natural Gradient Descent ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Epigraph projections for fast general convex programming βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Estimating Accuracy from Unlabeled Data: A Bayesian Approach ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Estimating Cosmological Parameters from the Dark Matter Distribution ❌ ❌ ❌ βœ… βœ… ❌ βœ… 3
Estimating Maximum Expected Value through Gaussian Approximation βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Estimating Structured Vector Autoregressive Models ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Estimation from Indirect Supervision with Linear Moments ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Evasion and Hardening of Tree Ensemble Classifiers βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Exact Exponent in Optimal Rates for Crowdsourcing ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Experimental Design on a Budget for Sparse Linear Models and Applications βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Exploiting Cyclic Symmetry in Convolutional Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Expressiveness of Rectifier Networks ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Extended and Unscented Kitchen Sinks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Extreme F-measure Maximization using Sparse Probability Estimates βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Factored Temporal Sigmoid Belief Networks for Sequence Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
False Discovery Rate Control and Statistical Quality Assessment of Annotators in Crowdsourced Ranking βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Fast Algorithms for Segmented Regression βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
Fast Constrained Submodular Maximization: Personalized Data Summarization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Fast DPP Sampling for Nystrom with Application to Kernel Methods βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Fast Parameter Inference in Nonlinear Dynamical Systems using Iterative Gradient Matching ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Fast Rate Analysis of Some Stochastic Optimization Algorithms ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Fast Stochastic Algorithms for SVD and PCA: Convergence Properties and Convexity βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Fast k-Nearest Neighbour Search via Dynamic Continuous Indexing βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Fast k-means with accurate bounds ❌ βœ… βœ… ❌ βœ… ❌ ❌ 3
Fast methods for estimating the Numerical rank of large matrices βœ… βœ… βœ… ❌ βœ… ❌ ❌ 4
Faster Convex Optimization: Simulated Annealing with an Efficient Universal Barrier βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Faster Eigenvector Computation via Shift-and-Invert Preconditioning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Fixed Point Quantization of Deep Convolutional Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
ForecastICU: A Prognostic Decision Support System for Timely Prediction of Intensive Care Unit Admission ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Gaussian process nonparametric tensor estimator and its minimax optimality ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Gaussian quadrature for matrix inverse forms with applications βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Generalization Properties and Implicit Regularization for Multiple Passes SGM βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Generalization and Exploration via Randomized Value Functions βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Generalized Direct Change Estimation in Ising Model Structure βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Generative Adversarial Text to Image Synthesis βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Geometric Mean Metric Learning βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Gossip Dual Averaging for Decentralized Optimization of Pairwise Functions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Graying the black box: Understanding DQNs ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Greedy Column Subset Selection: New Bounds and Distributed Algorithms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Gromov-Wasserstein Averaging of Kernel and Distance Matrices βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Group Equivariant Convolutional Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Hawkes Processes with Stochastic Excitations βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Heteroscedastic Sequences: Beyond Gaussianity βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Hierarchical Compound Poisson Factorization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Hierarchical Decision Making In Electricity Grid Management βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Hierarchical Span-Based Conditional Random Fields for Labeling and Segmenting Events in Wearable Sensor Data Streams βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Hierarchical Variational Models βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Horizontally Scalable Submodular Maximization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
How to Fake Multiply by a Gaussian Matrix βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Hyperparameter optimization with approximate gradient βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Importance Sampling Tree for Large-scale Empirical Expectation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Improved SVRG for Non-Strongly-Convex or Sum-of-Non-Convex Objectives βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Inference Networks for Sequential Monte Carlo in Graphical Models ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Interacting Particle Markov Chain Monte Carlo βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Interactive Bayesian Hierarchical Clustering ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Isotonic Hawkes Processes βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
K-Means Clustering with Distributed Dimensions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
L1-regularized Neural Networks are Improperly Learnable in Polynomial Time βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Large-Margin Softmax Loss for Convolutional Neural Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Learning Convolutional Neural Networks for Graphs βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Learning End-to-end Video Classification with Rank-Pooling ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Learning Granger Causality for Hawkes Processes βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning Mixtures of Plackett-Luce Models βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
Learning Physical Intuition of Block Towers by Example ❌ βœ… ❌ βœ… ❌ ❌ βœ… 3
Learning Population-Level Diffusions with Generative RNNs ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Representations for Counterfactual Inference βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning Simple Algorithms from Examples ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Learning Sparse Combinatorial Representations via Two-stage Submodular Maximization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning and Inference via Maximum Inner Product Search βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning from Multiway Data: Simple and Efficient Tensor Regression βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning privately from multiparty data βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning to Filter with Predictive State Inference Machines βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Learning to Generate with Memory ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Linking losses for density ratio and class-probability estimation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Loss factorization, weakly supervised learning and label noise robustness βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Low-Rank Matrix Approximation with Stability βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Low-rank Solutions of Linear Matrix Equations via Procrustes Flow βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Low-rank tensor completion: a Riemannian manifold preconditioning approach ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Markov Latent Feature Models βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Markov-modulated Marked Poisson Processes for Check-in Data ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Matrix Eigen-decomposition via Doubly Stochastic Riemannian Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Meta-Learning with Memory-Augmented Neural Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Metadata-conscious anonymous messaging βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Meta–Gradient Boosted Decision Tree Model for Weight and Target Learning βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Minding the Gaps for Block Frank-Wolfe Optimization of Structured SVMs βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Minimizing the Maximal Loss: How and Why βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Minimum Regret Search for Single- and Multi-Task Optimization ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Mixing Rates for the Alternating Gibbs Sampler over Restricted Boltzmann Machines and Friends ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Mixture Proportion Estimation via Kernel Embeddings of Distributions βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Model-Free Imitation Learning with Policy Optimization βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Model-Free Trajectory Optimization for Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Multi-Bias Non-linear Activation in Deep Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Multi-Player Bandits – a Musical Chairs Approach βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Near Optimal Behavior via Approximate State Abstraction ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Network Morphism βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Neural Variational Inference for Text Processing ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
No Oops, You Won’t Do It Again: Mechanisms for Self-correction in Crowdsourcing βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
No penalty no tears: Least squares in high-dimensional linear models βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
No-Regret Algorithms for Heavy-Tailed Linear Bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Noisy Activation Functions βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Non-negative Matrix Factorization under Heavy Noise βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Nonlinear Statistical Learning with Truncated Gaussian Graphical Models βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Nonparametric Canonical Correlation Analysis βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Normalization Propagation: A Parametric Technique for Removing Internal Covariate Shift in Deep Networks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
On Graduated Optimization for Stochastic Non-Convex Problems βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On collapsed representation of hierarchical Completely Random Measures ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On the Consistency of Feature Selection With Lasso for Non-linear Targets ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
On the Iteration Complexity of Oblivious First-Order Optimization Algorithms βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
On the Power and Limits of Distance-Based Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Quality of the Initial Basin in Overspecified Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Statistical Limits of Convex Relaxations ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
One-Shot Generalization in Deep Generative Models ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Online Learning with Feedback Graphs Without the Graphs βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Online Low-Rank Subspace Clustering by Basis Dictionary Pursuit βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Online Stochastic Linear Optimization under One-bit Feedback βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Opponent Modeling in Deep Reinforcement Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Optimal Classification with Multivariate Losses βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Optimality of Belief Propagation for Crowdsourced Classification ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
PAC Lower Bounds and Efficient Algorithms for The Max K-Armed Bandit Problem βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
PAC learning of Probabilistic Automaton based on the Method of Moments βœ… ❌ βœ… ❌ ❌ βœ… βœ… 4
PD-Sparse : A Primal and Dual Sparse Approach to Extreme Multiclass and Multilabel Classification βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
PHOG: Probabilistic Model for Code ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Parallel and Distributed Block-Coordinate Frank-Wolfe Algorithms βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Parameter Estimation for Generalized Thurstone Choice Models ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Pareto Frontier Learning with Expensive Correlated Objectives ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Partition Functions from Rao-Blackwellized Tempered Sampling βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Persistence weighted Gaussian kernel for topological data analysis ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Persistent RNNs: Stashing Recurrent Weights On-Chip ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Pixel Recurrent Neural Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Pliable Rejection Sampling βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Polynomial Networks and Factorization Machines: New Insights and Efficient Training Algorithms βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Power of Ordered Hypothesis Testing ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Preconditioning Kernel Matrices βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Predictive Entropy Search for Multi-objective Bayesian Optimization ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Pricing a Low-regret Seller βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Primal-Dual Rates and Certificates βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Principal Component Projection Without Principal Component Analysis βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Provable Algorithms for Inference in Topic Models βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Provable Non-convex Phase Retrieval with Outliers: Median TruncatedWirtinger Flow βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Quadratic Optimization with Orthogonality Constraints: Explicit Lojasiewicz Exponent and Linear Convergence of Line-Search Methods βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Recommendations as Treatments: Debiasing Learning and Evaluation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Recovery guarantee of weighted low-rank approximation via alternating minimization βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Recurrent Orthogonal Networks and Long-Memory Tasks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Recycling Randomness with Structure for Sublinear time Kernel Expansions ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Representational Similarity Learning with Application to Brain Networks ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Revisiting Semi-Supervised Learning with Graph Embeddings βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Rich Component Analysis βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robust Monte Carlo Sampling using Riemannian NosΓ©-PoincarΓ© Hamiltonian Dynamics βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Robust Principal Component Analysis with Side Information βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robust Random Cut Forest Based Anomaly Detection on Streams βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
SDCA without Duality, Regularization, and Individual Convexity βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Scalable Discrete Sampling as a Multi-Armed Bandit Problem βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Sequence to Sequence Training of CTC-RNNs with Partial Windowing βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Shifting Regret, Mirror Descent, and Matrices βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Simultaneous Safe Screening of Features and Samples in Doubly Sparse Modeling ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Slice Sampling on Hamiltonian Trajectories βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Smooth Imitation Learning for Online Sequence Prediction βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Softened Approximate Policy Iteration for Markov Games βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Solving Ridge Regression using Sketched Preconditioned SVRG βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Sparse Parameter Recovery from Aggregated Data ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Speeding up k-means by approximating Euclidean distances via block vectors βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Square Root Graphical Models: Multivariate Generalizations of Univariate Exponential Families that Permit Positive Dependencies ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Stability of Controllers for Gaussian Process Forward Models βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Starting Small - Learning with Adaptive Sample Sizes βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Stochastic Block BFGS: Squeezing More Curvature out of Data βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Stochastic Discrete Clenshaw-Curtis Quadrature βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Stochastic Optimization for Multiview Representation Learning using Partial Least Squares βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Stochastic Quasi-Newton Langevin Monte Carlo βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Stochastic Variance Reduced Optimization for Nonconvex Sparse Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Variance Reduction for Nonconvex Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Stratified Sampling Meets Machine Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Strongly-Typed Recurrent Neural Networks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Structure Learning of Partitioned Markov Networks ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Structured Prediction Energy Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Tensor Decomposition via Joint Matrix Schur Decomposition ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Texture Networks: Feed-forward Synthesis of Textures and Stylized Images ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The Arrow of Time in Multivariate Time Series βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
The Information Sieve ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
The Information-Theoretic Requirements of Subspace Clustering with Missing Data βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
The Knowledge Gradient for Sequential Decision Making with Stochastic Binary Feedbacks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
The Label Complexity of Mixed-Initiative Classifier Training βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
The Segmented iHMM: A Simple, Efficient Hierarchical Infinite HMM ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
The Sum-Product Theorem: A Foundation for Learning Tractable Models βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
The Teaching Dimension of Linear Learners ❌ ❌ ❌ ❌ ❌ βœ… βœ… 2
The Variational Nystrom method for large-scale spectral problems ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The knockoff filter for FDR control in group-sparse and multitask regression ❌ ❌ βœ… ❌ ❌ βœ… βœ… 3
Towards Faster Rates and Oracle Property for Low-Rank Matrix Estimation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Tracking Slowly Moving Clairvoyant: Optimal Dynamic Regret of Online Learning with True and Noisy Gradient βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Train and Test Tightness of LP Relaxations in Structured Prediction ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Train faster, generalize better: Stability of stochastic gradient descent ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Training Deep Neural Networks via Direct Loss Minimization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Training Neural Networks Without Gradients: A Scalable ADMM Approach βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Truthful Univariate Estimators ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Unitary Evolution Recurrent Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Unsupervised Deep Embedding for Clustering Analysis ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Uprooting and Rerooting Graphical Models ❌ ❌ ❌ ❌ ❌ βœ… βœ… 2
Variable Elimination in the Fourier Domain ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Variance Reduction for Faster Non-Convex Optimization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Variance-Reduced and Projection-Free Stochastic Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Variational Inference for Monte Carlo Objectives βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
Why Most Decisions Are Easy in Tetrisβ€”And Perhaps in Other Sequential Decision Problems, As Well ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Why Regularized Auto-Encoders learn Sparse Representation? ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
k-variates++: more pluses in the k-means++ βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2