Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

International Conference on Machine Learning (ICML) - 2017

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
ICML 2017 434 0.39 3.15 92.17% 41.25%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
A Birth-Death Process for Feature Allocation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
A Closer Look at Memorization in Deep Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Distributional Perspective on Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Divergence Bound for Hybrids of MCMC and Variational Inference and an Application to Langevin Dynamics and SGVI ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A Laplacian Framework for Option Discovery in Reinforcement Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
A Richer Theory of Convex Constrained Optimization with Reduced Projections and Improved Rates βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Semismooth Newton Method for Fast, Generic Convex Programming βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
A Simple Multi-Class Boosting Framework with Theoretical Guarantees and Empirical Proficiency ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
A Simulated Annealing Based Inexact Oracle for Wasserstein Loss Minimization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
A Unified Maximum Likelihood Approach for Estimating Symmetric Properties of Discrete Distributions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A Unified Variance Reduction-Based Framework for Nonconvex Low-Rank Matrix Recovery βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Unified View of Multi-Label Performance Measures βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Accelerating Eulerian Fluid Simulation With Convolutional Networks βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Active Heteroscedastic Regression βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Active Learning for Accurate Estimation of Linear Models βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Active Learning for Cost-Sensitive Classification βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Active Learning for Top-$K$ Rank Aggregation from Noisy Comparisons βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
AdaNet: Adaptive Structural Learning of Artificial Neural Networks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Adapting Kernel Representations Online Using Submodular Maximization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adaptive Consensus ADMM for Distributed Optimization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Adaptive Feature Selection: Computationally Efficient Online Sparse Linear Regression under RIP βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Adaptive Multiple-Arm Identification βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Adaptive Neural Networks for Efficient Inference βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Adaptive Sampling Probabilities for Non-Smooth Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adversarial Feature Matching for Text Generation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Algebraic Variety Models for High-Rank Matrix Completion βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Algorithmic Stability and Hypothesis Complexity ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Algorithms for $\ell_p$ Low-Rank Approximation βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
An Adaptive Test of Independence with Analytic Kernel Embeddings ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
An Alternative Softmax Operator for Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
An Efficient, Sparsity-Preserving, Online Algorithm for Low-Rank Approximation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
An Infinite Hidden Markov Model With Similarity-Biased Transitions ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Analogical Inference for Multi-relational Embeddings ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Analysis and Optimization of Graph Decompositions by Lifted Multicuts ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Analytical Guarantees on Numerical Precision of Deep Neural Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Approximate Newton Methods and Their Local Convergence βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Approximate Steepest Coordinate Descent βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Asymmetric Tri-training for Unsupervised Domain Adaptation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Asynchronous Distributed Variational Gaussian Process for Regression βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Asynchronous Stochastic Gradient Descent with Delay Compensation βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Attentive Recurrent Comparators ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Automated Curriculum Learning for Neural Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Automatic Discovery of the Statistical Types of Variables in a Dataset βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Axiomatic Attribution for Deep Networks ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Batched High-dimensional Bayesian Optimization via Structural Kernel Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Bayesian Boolean Matrix Factorisation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bayesian Models of Data Streams with Hierarchical Power Priors ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Bayesian Optimization with Tree-structured Dependencies ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Bayesian inference on random simple graphs with power law degree distributions ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Being Robust (in High Dimensions) Can Be Practical βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Beyond Filters: Compact Feature Map for Portable Deep Model βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Bidirectional Learning for Time-series Models with Hidden Units βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Boosted Fitted Q-Iteration βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Bottleneck Conditional Density Estimation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Breaking Locality Accelerates Block Gauss-Seidel βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Canopy Fast Sampling with Cover Trees ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Capacity Releasing Diffusion for Speed and Locality βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
ChoiceRank: Identifying Preferences from Node Traffic in Networks βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Clustering High Dimensional Dynamic Data Streams βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Clustering by Sum of Norms: Stochastic Incremental Algorithm, Convergence and Cluster Recovery βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Co-clustering through Optimal Transport βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Coherence Pursuit: Fast, Simple, and Robust Subspace Recovery βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Coherent Probabilistic Forecasts for Hierarchical Time Series βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Collect at Once, Use Effectively: Making Non-interactive Locally Private Learning Possible βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Combined Group and Exclusive Sparsity for Deep Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Communication-efficient Algorithms for Distributed Stochastic Principal Component Analysis βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Composing Tree Graphical Models with Persistent Homology Features for Clustering Mixed-Type Data βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Compressed Sensing using Generative Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Conditional Accelerated Lazy Stochastic Gradient Descent βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Conditional Image Synthesis with Auxiliary Classifier GANs ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Confident Multiple Choice Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Connected Subgraph Detection with Mirror Descent on SDPs βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Consistency Analysis for Binary Classification Revisited βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
Consistent On-Line Off-Policy Evaluation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Consistent k-Clustering βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Constrained Policy Optimization βœ… βœ… ❌ ❌ ❌ ❌ ❌ 2
Contextual Decision Processes with low Bellman rank are PAC-Learnable βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Continual Learning Through Synaptic Intelligence ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Convergence Analysis of Proximal Gradient with Momentum for Nonconvex Optimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Convex Phase Retrieval without Lifting via PhaseMax ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Convexified Convolutional Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Convolutional Sequence to Sequence Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Coordinated Multi-Agent Imitation Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Coresets for Vector Summarization with Applications to Network Graphs βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Cost-Optimal Learning of Causal Graphs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Count-Based Exploration with Neural Density Models ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Counterfactual Data-Fusion for Online Reinforcement Learners ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Coupling Distributed and Symbolic Execution for Natural Language Queries ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Curiosity-driven Exploration by Self-supervised Prediction ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Dance Dance Convolution ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Data-Efficient Policy Evaluation Through Behavior Policy Search βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Deciding How to Decide: Dynamic Routing in Artificial Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Decoupled Neural Interfaces using Synthetic Gradients ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Deep Bayesian Active Learning with Image Data ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Deep Generative Models for Relational Data with Side Information ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Deep IV: A Flexible Approach for Counterfactual Prediction ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Deep Spectral Clustering Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Deep Tensor Convolution on Multicores βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Deep Transfer Learning with Joint Adaptation Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Deep Voice: Real-time Neural Text-to-Speech ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
DeepBach: a Steerable Model for Bach Chorales Generation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Deletion-Robust Submodular Maximization: Data Summarization with β€œthe Right to be Forgotten” βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Delta Networks for Optimized Recurrent Network Computation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Density Level Set Estimation on Manifolds with DBSCAN βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Depth-Width Tradeoffs in Approximating Natural Functions with Neural Networks ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Deriving Neural Architectures from Sequence and Graph Kernels ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Developing Bug-Free Machine Learning Systems With Formal Mathematics βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Device Placement Optimization with Reinforcement Learning ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Diameter-Based Active Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Dictionary Learning Based on Sparse Distribution Tomography βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Differentiable Programs with Neural Libraries βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Differentially Private Chi-squared Test by Unit Circle Mechanism βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Differentially Private Clustering in High-Dimensional Euclidean Spaces βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Differentially Private Learning of Undirected Graphical Models Using Collective Graphical Models βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Differentially Private Ordinary Least Squares βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Differentially Private Submodular Maximization: Data Summarization in Disguise βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Discovering Discrete Latent Topics with Neural Variational Inference βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Dissipativity Theory for Nesterov’s Accelerated Method ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Distributed Batch Gaussian Process Optimization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Distributed Mean Estimation with Limited Communication ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Distributed and Provably Good Seedings for k-Means in Constant Rounds βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Doubly Accelerated Methods for Faster CCA and Generalized Eigendecomposition βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Dropout Inference in Bayesian Neural Networks with Alpha-divergences βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Dual Supervised Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Dueling Bandits with Weak Regret βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Dynamic Word Embeddings ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Efficient Distributed Learning with Sparsity βœ… ❌ ❌ βœ… ❌ ❌ ❌ 2
Efficient Nonmyopic Active Search ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Efficient Orthogonal Parametrisation of Recurrent Neural Networks Using Householder Reflections βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Efficient Regret Minimization in Non-Convex Games βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Efficient softmax approximation for GPUs ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Emulating the Expert: Inverse Optimization through Online Learning βœ… ❌ ❌ ❌ βœ… βœ… ❌ 3
End-to-End Differentiable Adversarial Imitation Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
End-to-End Learning for Structured Prediction Energy Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Enumerating Distinct Decision Trees βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Equivariance Through Parameter-Sharing ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Estimating individual treatment effect: generalization bounds and algorithms βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Estimating the unseen from multiple populations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Evaluating Bayesian Models with Posterior Dispersion Indices βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Evaluating the Variance of Likelihood-Ratio Gradient Estimators βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Exact Inference for Integer Latent-Variable Models βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Exact MAP Inference by Avoiding Fractional Vertices βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Exploiting Strong Convexity from Data with Primal-Dual First-Order Algorithms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Failures of Gradient-Based Deep Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Fairness in Reinforcement Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Fake News Mitigation via Point Process Based Intervention βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Fast Bayesian Intensity Estimation for the Permanental Process ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Fast k-Nearest Neighbour Search via Prioritized DCI βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Faster Greedy MAP Inference for Determinantal Point Processes βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Faster Principal Component Regression and Stable Matrix Chebyshev Approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
FeUdal Networks for Hierarchical Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Follow the Compressed Leader: Faster Online Learning of Eigenvectors and Faster MMWU βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Follow the Moving Leader in Deep Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Forest-type Regression with General Losses and Robust Forest βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Forward and Reverse Gradient-Based Hyperparameter Optimization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for Markov Chain Monte Carlo ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Frame-based Data Factorizations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
From Patches to Images: A Nonparametric Generative Model ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
GSOS: Gauss-Seidel Operator Splitting Algorithm for Multi-Term Nonsmooth Convex Composite Optimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Generalization and Equilibrium in Generative Adversarial Nets (GANs) ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Geometry of Neural Network Loss Surfaces via Random Matrix Theory ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Global optimization of Lipschitz functions βœ… ❌ βœ… βœ… ❌ βœ… βœ… 5
Globally Induced Forest: A Prepruning Compression Scheme βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Gradient Boosted Decision Trees for High Dimensional Sparse Output βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Gradient Coding: Avoiding Stragglers in Distributed Learning βœ… ❌ βœ… ❌ βœ… ❌ ❌ 3
Gradient Projection Iterative Sketch for Large-Scale Constrained Least-Squares βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Grammar Variational Autoencoder βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Graph-based Isometry Invariant Representation Learning ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Guarantees for Greedy Maximization of Non-submodular Functions with Applications βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Hierarchy Through Composition with Multitask LMDPs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
High Dimensional Bayesian Optimization with Elastic Gaussian Process βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
High-Dimensional Structured Quantile Regression ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
High-dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
How Close Are the Eigenvectors of the Sample and Actual Covariance Matrices? ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
How to Escape Saddle Points Efficiently βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Hyperplane Clustering via Dual Principal Component Pursuit ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Identification and Model Testing in Linear Structural Equation Models using Auxiliary Variables βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Identify the Nash Equilibrium in Static Games with Random Payoffs βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Identifying Best Interventions through Online Importance Sampling βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Image-to-Markup Generation with Coarse-to-Fine Attention ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Improved Variational Autoencoders for Text Modeling using Dilated Convolutions ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Improving Gibbs Sampler Scan Quality with DoGS βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Improving Viterbi is Hard: Better Runtimes Imply Faster Clique Algorithms βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Innovation Pursuit: A New Approach to the Subspace Clustering Problem βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Input Convex Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Input Switched Affine Networks: An RNN Architecture Designed for Interpretability ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Interactive Learning from Policy-Dependent Human Feedback βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Iterative Machine Teaching βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Joint Dimensionality Reduction and Metric Learning: A Geometric Take ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Just Sort It! A Simple and Effective Approach to Active Preference Learning βœ… βœ… βœ… ❌ βœ… ❌ ❌ 4
Kernelized Support Tensor Machines βœ… ❌ βœ… βœ… ❌ βœ… βœ… 5
Know-Evolve: Deep Temporal Reasoning for Dynamic Knowledge Graphs βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Language Modeling with Gated Convolutional Networks ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Large-Scale Evolution of Image Classifiers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Latent Feature Lasso βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Latent Intention Dialogue Models ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Latent LSTM Allocation: Joint Clustering and Non-Linear Dynamic Modeling of Sequence Data βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Lazifying Conditional Gradient Algorithms βœ… ❌ ❌ ❌ ❌ βœ… ❌ 2
Learned Optimizers that Scale and Generalize ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Learning Algorithms for Active Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning Continuous Semantic Representations of Symbolic Expressions βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning Deep Architectures via Generalized Whitened Neural Networks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning Deep Latent Gaussian Models with Markov Chain Monte Carlo βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Learning Determinantal Point Processes with Moments and Cycles βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Discrete Representations via Information Maximizing Self-Augmented Training ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Gradient Descent: Better Generalization and Longer Horizons ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning Hawkes Processes from Short Doubly-Censored Event Sequences βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning Hierarchical Features from Deep Generative Models ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Learning Important Features Through Propagating Activation Differences ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Infinite Layer Networks Without the Kernel Trick βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Latent Space Models with Angular Constraints ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Learning Sleep Stages from Radio Signals: A Conditional Adversarial Architecture βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
Learning Stable Stochastic Nonlinear Dynamical Systems ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Learning Texture Manifolds with the Periodic Spatial GAN ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Learning from Clinical Judgments: Semi-Markov-Modulated Marked Hawkes Processes for Risk Prognosis βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning in POMDPs with Monte Carlo Tree Search βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning the Structure of Generative Models without Labeled Data βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning to Aggregate Ordinal Labels by Maximizing Separating Width βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Learning to Align the Source Code to the Compiled Object Code ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier βœ… βœ… ❌ βœ… βœ… ❌ βœ… 5
Learning to Discover Cross-Domain Relations with Generative Adversarial Networks ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Learning to Discover Sparse Graphical Models βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning to Generate Long-term Future via Hierarchical Prediction βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Learning to Learn without Gradient Descent by Gradient Descent ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Leveraging Node Attributes for Incomplete Relational Data ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Leveraging Union of Subspace Structure to Improve Constrained Clustering βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Local Bayesian Optimization of Motor Skills βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Local-to-Global Bayesian Network Structure Learning βœ… ❌ βœ… ❌ βœ… ❌ ❌ 3
Logarithmic Time One-Against-Some βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Lost Relatives of the Gumbel Trick βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
MEC: Memory-efficient Convolution for Deep Neural Network βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Magnetic Hamiltonian Monte Carlo βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Max-value Entropy Search for Efficient Bayesian Optimization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Maximum Selection and Ranking under Noisy Comparisons βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
McGan: Mean and Covariance Feature Matching GAN βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Measuring Sample Quality with Kernels ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Meritocratic Fairness for Cross-Population Selection βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Meta Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Minimax Regret Bounds for Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Minimizing Trust Leaks for Robust Sybil Detection ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Model-Independent Online Learning for Influence Maximization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Modular Multitask Reinforcement Learning with Policy Sketches βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Multi-Class Optimal Margin Distribution Machine βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Multi-fidelity Bayesian Optimisation with Continuous Approximations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Multi-objective Bandits: Optimizing the Generalized Gini Index βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Multi-task Learning with Labeled and Unlabeled Tasks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Multichannel End-to-end Speech Recognition ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Multilabel Classification with Group Testing and Codes βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Multilevel Clustering via Wasserstein Means βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Multiple Clustering Views from Multiple Uncertain Experts ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Multiplicative Normalizing Flows for Variational Bayesian Neural Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Natasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Near-Optimal Design of Experiments via Regret Minimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Nearly Optimal Robust Matrix Completion βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Neural Episodic Control βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Neural Message Passing for Quantum Chemistry ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Neural Networks and Rational Functions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Neural Optimizer Search with Reinforcement Learning ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Neural Taylor Approximations: Convergence and Exploration in Rectifier Networks ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
No Spurious Local Minima in Nonconvex Low Rank Problems: A Unified Geometric Analysis ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Nonnegative Matrix Factorization for Time Series Recovery From a Few Temporal Aggregates βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Nonparanormal Information Estimation ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
NystrΓΆm Method with Kernel K-means++ Samples as Landmarks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Approximation Guarantees for Greedy Low Rank Optimization βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
On Calibration of Modern Neural Networks ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
On Context-Dependent Clustering of Bandits βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
On Kernelized Multi-armed Bandits βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Mixed Memberships and Symmetric Nonnegative Matrix Factorizations βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Relaxing Determinism in Arithmetic Circuits ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On The Projection Operator to A Three-view Cardinality Constrained Set βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
On orthogonality and learning recurrent networks with long term dependencies ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
On the Expressive Power of Deep Neural Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On the Iteration Complexity of Support Recovery via Hard Thresholding Pursuit ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
On the Sampling Problem for Kernel Quadrature βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Online Learning to Rank in Stochastic Click Models βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Online Learning with Local Permutations and Delayed Feedback βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Online Partial Least Square Optimization: Dropping Convexity for Better Efficiency and Scalability ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Online and Linear-Time Attention by Enforcing Monotonic Alignments βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
OptNet: Differentiable Optimization as a Layer in Neural Networks ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Optimal Algorithms for Smooth and Strongly Convex Distributed Optimization in Networks βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Optimal Densification for Fast and Accurate Minwise Hashing βœ… βœ… βœ… ❌ βœ… ❌ ❌ 4
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Oracle Complexity of Second-Order Methods for Finite-Sum Problems ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Ordinal Graphical Models: A Tale of Two Approaches ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Orthogonalized ALS: A Theoretically Principled Tensor Decomposition Algorithm for Practical Use βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Pain-Free Random Differential Privacy with Sensitivity Sampling βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Parallel Multiscale Autoregressive Density Estimation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Parseval Networks: Improving Robustness to Adversarial Examples βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Partitioned Tensor Factorizations for Learning Mixed Membership Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
PixelCNN Models with Auxiliary Variables for Natural Image Modeling ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Post-Inference Prior Swapping βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Practical Gauss-Newton Optimisation for Deep Learning ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Prediction and Control with Temporal Segment Models ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Prediction under Uncertainty in Sparse Spectrum Gaussian Processes with Applications to Filtering and Control βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Preferential Bayesian Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Priv’IT: Private and Sample Efficient Identity Testing βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Probabilistic Path Hamiltonian Monte Carlo βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Probabilistic Submodular Maximization in Sub-Linear Time βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Programming with a Differentiable Forth Interpreter βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Projection-free Distributed Online Learning in Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
ProtoNN: Compressed and Accurate kNN for Resource-scarce Devices βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Provable Alternating Gradient Descent for Non-negative Matrix Factorization with Strong Correlations βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Provably Optimal Algorithms for Generalized Linear Contextual Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Prox-PDA: The Proximal Primal-Dual Algorithm for Fast Distributed Nonconvex Optimization and Learning Over Networks βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Random Feature Expansions for Deep Gaussian Processes ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Re-revisiting Learning on Hypergraphs: Confidence Interval and Subgradient Method βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Real-Time Adaptive Image Compression ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Recovery Guarantees for One-hidden-layer Neural Networks βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Recurrent Highway Networks ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Recursive Partitioning for Personalization using Observational Data βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Reduced Space and Faster Convergence in Imperfect-Information Games via Pruning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Regret Minimization in Behaviorally-Constrained Zero-Sum Games βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Regularising Non-linear Models Using Feature Side-information ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Reinforcement Learning with Deep Energy-Based Policies βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Relative Fisher Information and Natural Gradient for Learning Large Modular Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Resource-efficient Machine Learning in 2 KB RAM for the Internet of Things βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Risk Bounds for Transferring Representations With and Without Fine-Tuning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Robust Adversarial Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robust Budget Allocation via Continuous Submodular Functions βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Robust Gaussian Graphical Model Estimation with Arbitrary Corruption ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Robust Guarantees of Stochastic Greedy Algorithms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robust Probabilistic Modeling with Bayesian Data Reweighting ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Robust Structured Estimation with Single-Index Models ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Robust Submodular Maximization: A Non-Uniform Partitioning Approach βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
RobustFill: Neural Program Learning under Noisy I/O ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Rule-Enhanced Penalized Regression by Column Generation using Rectangular Maximum Agreement βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
SPLICE: Fully Tractable Hierarchical Extension of ICA with Pooling ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Safety-Aware Algorithms for Adversarial Contextual Bandit βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Scalable Bayesian Rule Lists βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Scalable Generative Models for Multi-label Learning with Missing Labels ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Scalable Multi-Class Gaussian Process Classification using Expectation Propagation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Second-Order Kernel Online Convex Optimization with Adaptive Sketching βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Selective Inference for Sparse High-Order Interaction Models ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Self-Paced Co-training βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Sequence Modeling via Segmentations βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Sequence to Better Sequence: Continuous Revision of Combinatorial Structures βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Sharp Minima Can Generalize For Deep Nets ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Sliced Wasserstein Kernel for Persistence Diagrams βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Soft-DTW: a Differentiable Loss Function for Time-Series βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Source-Target Similarity Modelings for Multi-Source Transfer Gaussian Process Regression ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Sparse + Group-Sparse Dirty Models: Statistical Guarantees without Unreasonable Conditions and a Case for Non-Convexity ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Spectral Learning from a Single Trajectory under Finite-State Policies βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Spherical Structured Feature Maps for Kernel Approximation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
State-Frequency Memory Recurrent Neural Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
StingyCD: Safely Avoiding Wasteful Updates in Coordinate Descent βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Adaptive Quasi-Newton Methods for Minimizing Expected Values βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
Stochastic Bouncy Particle Sampler βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic DCA for the Large-sum of Non-convex Functions Problem and its Application to Group Variable Selection in Classification βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Stochastic Generative Hashing βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Stochastic Gradient MCMC Methods for Hidden Markov Models βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Gradient Monomial Gamma Sampler βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Stochastic Modified Equations and Adaptive Stochastic Gradient Algorithms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Variance Reduction Methods for Policy Evaluation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Strong NP-Hardness for Sparse Optimization with Concave Penalty Functions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Strongly-Typed Agents are Guaranteed to Interact Safely ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Sub-sampled Cubic Regularization for Non-convex Optimization βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Tensor Balancing on Statistical Manifold ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Tensor Belief Propagation βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Tensor Decomposition via Simultaneous Power Iteration βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Tensor Decomposition with Smoothness ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Tensor-Train Recurrent Neural Networks for Video Classification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
The Loss Surface of Deep and Wide Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Predictron: End-To-End Learning and Planning ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Price of Differential Privacy for Online Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
The Sample Complexity of Online One-Class Collaborative Filtering βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
The Shattered Gradients Problem: If resnets are the answer, then what is the question? ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The Statistical Recurrent Unit ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Tight Bounds for Approximate CarathΓ©odory and Beyond βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Toward Controlled Generation of Text βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Toward Efficient and Accurate Covariance Matrix Estimation on Compressed Data βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Towards K-means-friendly Spaces: Simultaneous Deep Learning and Clustering βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Uncertainty Assessment and False Discovery Rate Control in High-Dimensional Granger Causal Inference ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Uncorrelation and Evenness: a New Diversity-Promoting Regularizer ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Uncovering Causality from Multivariate Hawkes Integrated Cumulants βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Understanding Black-box Predictions via Influence Functions ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Understanding Synthetic Gradients and Decoupled Neural Interfaces ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Uniform Convergence Rates for Kernel Density Estimation ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Uniform Deviation Bounds for k-Means Clustering ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Unifying Task Specification in Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Unimodal Probability Distributions for Deep Ordinal Classification ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Unsupervised Learning by Predicting Noise βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Variants of RMSProp and Adagrad with Logarithmic Regret Bounds βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Variational Boosting: Iteratively Refining Posterior Approximations βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Variational Dropout Sparsifies Deep Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Variational Inference for Sparse and Undirected Models βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Variational Policy for Guiding Point Processes βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Video Pixel Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Warped Convolutions: Efficient Invariance to Spatial Transformations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Wasserstein Generative Adversarial Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
When can Multi-Site Datasets be Pooled for Regression? Hypothesis Tests, $\ell_2$-consistency and Neuroscience Applications ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Why is Posterior Sampling Better than Optimism for Reinforcement Learning? βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
World of Bits: An Open-Domain Platform for Web-Based Agents ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Zero-Inflated Exponential Family Embeddings ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Zonotope Hit-and-run for Efficient Sampling from Projection DPPs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
iSurvive: An Interpretable, Event-time Prediction Model for mHealth βœ… βœ… ❌ βœ… ❌ ❌ ❌ 3
meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
β€œConvex Until Proven Guilty”: Dimension-Free Acceleration of Gradient Descent on Non-Convex Functions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3