Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

International Conference on Machine Learning (ICML) - 2017

Website:

Venue: ICML
Year: 2017
Papers: 434
Reproducibility Score: 0.39 (based on Gundersen et al. (2025))
Documentation Score: 3.15 (global mean: the average score over the seven reproducibility variables for empirical research papers)
% Empirical: 92.17% (percentage of papers that are empirical rather than theoretical research)
% Industry: 41.25% (percentage of empirical research papers with at least one author from industry)
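The Documentation Score is described above as an average over the seven reproducibility variables. The following is a minimal sketch of that arithmetic, assuming each paper's score is simply the number of variables it satisfies, which is consistent with the per-paper rows in this listing. The function names `paper_score` and `mean_score` are hypothetical, and the three sample rows are copied from the listing purely for illustration. The sketch does not filter for empirical papers, so it is not expected to reproduce the venue-level figure.

```python
from statistics import mean

# The seven reproducibility variables, in the column order used by the listing.
VARIABLES = (
    "Pseudocode",
    "Open Source Code",
    "Open Datasets",
    "Dataset Splits",
    "Hardware Specification",
    "Software Dependencies",
    "Experiment Setup",
)


def paper_score(flags):
    """Count how many of the seven variables a paper satisfies (assumed scoring rule)."""
    if len(flags) != len(VARIABLES):
        raise ValueError("expected one flag per reproducibility variable")
    return sum(bool(flag) for flag in flags)


def mean_score(rows):
    """Average the per-paper scores over a mapping of title -> flags."""
    return mean(paper_score(flags) for flags in rows.values())


# Illustrative rows taken from the listing (True = variable documented).
sample = {
    "Adaptive Neural Networks for Efficient Inference":
        (True, False, True, True, True, True, True),
    "A Birth-Death Process for Feature Allocation":
        (False, False, True, True, False, False, True),
    "Algorithmic Stability and Hypothesis Complexity":
        (False, False, False, False, False, False, False),
}

if __name__ == "__main__":
    for title, flags in sample.items():
        print(title, paper_score(flags))
    print("sample mean:", mean_score(sample))
```

Representing each row as a tuple of booleans keeps the sketch independent of how the check marks are rendered on the page.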
Columns, in order: Paper, Pseudocode, Open Source Code, Open Datasets, Dataset Splits, Hardware Specification, Software Dependencies, Experiment Setup, Score
A Birth-Death Process for Feature Allocation ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
A Closer Look at Memorization in Deep Networks ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
A Distributional Perspective on Reinforcement Learning ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
A Divergence Bound for Hybrids of MCMC and Variational Inference and an Application to Langevin Dynamics and SGVI ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
A Laplacian Framework for Option Discovery in Reinforcement Learning ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
A Richer Theory of Convex Constrained Optimization with Reduced Projections and Improved Rates ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
A Semismooth Newton Method for Fast, Generic Convex Programming ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
A Simple Multi-Class Boosting Framework with Theoretical Guarantees and Empirical Proficiency ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
A Simulated Annealing Based Inexact Oracle for Wasserstein Loss Minimization ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
A Unified Maximum Likelihood Approach for Estimating Symmetric Properties of Discrete Distributions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A Unified Variance Reduction-Based Framework for Nonconvex Low-Rank Matrix Recovery ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
A Unified View of Multi-Label Performance Measures ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Accelerating Eulerian Fluid Simulation With Convolutional Networks ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
Active Heteroscedastic Regression ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Active Learning for Accurate Estimation of Linear Models ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Active Learning for Cost-Sensitive Classification ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Active Learning for Top-$K$ Rank Aggregation from Noisy Comparisons ✅ ✅ ❌ ❌ ❌ ❌ ✅ 3
AdaNet: Adaptive Structural Learning of Artificial Neural Networks ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Adapting Kernel Representations Online Using Submodular Maximization ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Adaptive Consensus ADMM for Distributed Optimization ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Adaptive Feature Selection: Computationally Efficient Online Sparse Linear Regression under RIP ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Adaptive Multiple-Arm Identification ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Adaptive Neural Networks for Efficient Inference ✅ ❌ ✅ ✅ ✅ ✅ ✅ 6
Adaptive Sampling Probabilities for Non-Smooth Optimization ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Adversarial Feature Matching for Text Generation ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Algebraic Variety Models for High-Rank Matrix Completion ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Algorithmic Stability and Hypothesis Complexity ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Algorithms for $\ell_p$ Low-Rank Approximation ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
An Adaptive Test of Independence with Analytic Kernel Embeddings ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
An Alternative Softmax Operator for Reinforcement Learning ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis ❌ ✅ ✅ ❌ ❌ ❌ ❌ 2
An Efficient, Sparsity-Preserving, Online Algorithm for Low-Rank Approximation ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
An Infinite Hidden Markov Model With Similarity-Biased Transitions ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Analogical Inference for Multi-relational Embeddings ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Analysis and Optimization of Graph Decompositions by Lifted Multicuts ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Analytical Guarantees on Numerical Precision of Deep Neural Networks ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Approximate Newton Methods and Their Local Convergence ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Approximate Steepest Coordinate Descent ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Asymmetric Tri-training for Unsupervised Domain Adaptation ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Asynchronous Distributed Variational Gaussian Process for Regression ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Asynchronous Stochastic Gradient Descent with Delay Compensation ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Attentive Recurrent Comparators ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Automated Curriculum Learning for Neural Networks ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Automatic Discovery of the Statistical Types of Variables in a Dataset ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Axiomatic Attribution for Deep Networks ❌ ✅ ✅ ❌ ❌ ❌ ❌ 2
Batched High-dimensional Bayesian Optimization via Structural Kernel Learning ❌ ✅ ❌ ❌ ❌ ❌ ✅ 2
Bayesian Boolean Matrix Factorisation ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Bayesian Models of Data Streams with Hierarchical Power Priors ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Bayesian Optimization with Tree-structured Dependencies ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Bayesian inference on random simple graphs with power law degree distributions ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Being Robust (in High Dimensions) Can Be Practical ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Beyond Filters: Compact Feature Map for Portable Deep Model ✅ ✅ ✅ ✅ ✅ ❌ ✅ 6
Bidirectional Learning for Time-series Models with Hidden Units ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Boosted Fitted Q-Iteration ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Bottleneck Conditional Density Estimation ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Breaking Locality Accelerates Block Gauss-Seidel ✅ ❌ ✅ ❌ ✅ ✅ ✅ 5
Canopy Fast Sampling with Cover Trees ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Capacity Releasing Diffusion for Speed and Locality ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
ChoiceRank: Identifying Preferences from Node Traffic in Networks ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
Clustering High Dimensional Dynamic Data Streams ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Clustering by Sum of Norms: Stochastic Incremental Algorithm, Convergence and Cluster Recovery ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Co-clustering through Optimal Transport ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Coherence Pursuit: Fast, Simple, and Robust Subspace Recovery ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Coherent Probabilistic Forecasts for Hierarchical Time Series ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Collect at Once, Use Effectively: Making Non-interactive Locally Private Learning Possible ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Combined Group and Exclusive Sparsity for Deep Neural Networks ✅ ✅ ✅ ✅ ❌ ❌ ❌ 4
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
Communication-efficient Algorithms for Distributed Stochastic Principal Component Analysis ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Composing Tree Graphical Models with Persistent Homology Features for Clustering Mixed-Type Data ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Compressed Sensing using Generative Models ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Conditional Accelerated Lazy Stochastic Gradient Descent ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Conditional Image Synthesis with Auxiliary Classifier GANs ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Confident Multiple Choice Learning ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Connected Subgraph Detection with Mirror Descent on SDPs ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Consistency Analysis for Binary Classification Revisited ✅ ❌ ✅ ✅ ❌ ❌ ❌ 3
Consistent On-Line Off-Policy Evaluation ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Consistent k-Clustering ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Constrained Policy Optimization ✅ ✅ ❌ ❌ ❌ ❌ ❌ 2
Contextual Decision Processes with low Bellman rank are PAC-Learnable ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Continual Learning Through Synaptic Intelligence ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Convergence Analysis of Proximal Gradient with Momentum for Nonconvex Optimization ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Convex Phase Retrieval without Lifting via PhaseMax ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Convexified Convolutional Neural Networks ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Convolutional Sequence to Sequence Learning ❌ ✅ ✅ ✅ ✅ ❌ ✅ 5
Coordinated Multi-Agent Imitation Learning ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Coresets for Vector Summarization with Applications to Network Graphs ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Cost-Optimal Learning of Causal Graphs ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Count-Based Exploration with Neural Density Models ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Counterfactual Data-Fusion for Online Reinforcement Learners ❌ ✅ ❌ ❌ ❌ ❌ ✅ 2
Coupling Distributed and Symbolic Execution for Natural Language Queries ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Curiosity-driven Exploration by Self-supervised Prediction ❌ ❌ ✅ ❌ ❌ ❌ ❌ 1
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Dance Dance Convolution ❌ ✅ ✅ ✅ ✅ ❌ ✅ 5
Data-Efficient Policy Evaluation Through Behavior Policy Search ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Deciding How to Decide: Dynamic Routing in Artificial Neural Networks ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Decoupled Neural Interfaces using Synthetic Gradients ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Deep Bayesian Active Learning with Image Data ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Deep Generative Models for Relational Data with Side Information ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Deep IV: A Flexible Approach for Counterfactual Prediction ❌ ✅ ✅ ❌ ❌ ❌ ❌ 2
Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Deep Spectral Clustering Learning ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Deep Tensor Convolution on Multicores ✅ ❌ ✅ ❌ ✅ ✅ ✅ 5
Deep Transfer Learning with Joint Adaptation Networks ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Deep Voice: Real-time Neural Text-to-Speech ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
DeepBach: a Steerable Model for Bach Chorales Generation ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Deletion-Robust Submodular Maximization: Data Summarization with “the Right to be Forgotten” ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Delta Networks for Optimized Recurrent Network Computation ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Density Level Set Estimation on Manifolds with DBSCAN ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Depth-Width Tradeoffs in Approximating Natural Functions with Neural Networks ❌ ❌ ❌ ✅ ❌ ❌ ✅ 2
Deriving Neural Architectures from Sequence and Graph Kernels ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Developing Bug-Free Machine Learning Systems With Formal Mathematics ✅ ✅ ✅ ❌ ❌ ❌ ❌ 3
Device Placement Optimization with Reinforcement Learning ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Diameter-Based Active Learning ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Dictionary Learning Based on Sparse Distribution Tomography ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Differentiable Programs with Neural Libraries ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Differentially Private Chi-squared Test by Unit Circle Mechanism ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Differentially Private Clustering in High-Dimensional Euclidean Spaces ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Differentially Private Learning of Undirected Graphical Models Using Collective Graphical Models ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Differentially Private Ordinary Least Squares ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Differentially Private Submodular Maximization: Data Summarization in Disguise ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Discovering Discrete Latent Topics with Neural Variational Inference ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Dissipativity Theory for Nesterov’s Accelerated Method ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Distributed Batch Gaussian Process Optimization ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Distributed Mean Estimation with Limited Communication ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Distributed and Provably Good Seedings for k-Means in Constant Rounds ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Doubly Accelerated Methods for Faster CCA and Generalized Eigendecomposition ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Dropout Inference in Bayesian Neural Networks with Alpha-divergences ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Dual Supervised Learning ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Dueling Bandits with Weak Regret ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Dynamic Word Embeddings ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Efficient Distributed Learning with Sparsity ✅ ❌ ❌ ✅ ❌ ❌ ❌ 2
Efficient Nonmyopic Active Search ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Efficient Orthogonal Parametrisation of Recurrent Neural Networks Using Householder Reflections ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Efficient Regret Minimization in Non-Convex Games ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Efficient softmax approximation for GPUs ❌ ✅ ✅ ❌ ✅ ❌ ✅ 4
Emulating the Expert: Inverse Optimization through Online Learning ✅ ❌ ❌ ❌ ✅ ✅ ❌ 3
End-to-End Differentiable Adversarial Imitation Learning ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
End-to-End Learning for Structured Prediction Energy Networks ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Enumerating Distinct Decision Trees ✅ ✅ ✅ ✅ ✅ ❌ ✅ 6
Equivariance Through Parameter-Sharing ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Estimating individual treatment effect: generalization bounds and algorithms ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Estimating the unseen from multiple populations ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Evaluating Bayesian Models with Posterior Dispersion Indices ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Evaluating the Variance of Likelihood-Ratio Gradient Estimators ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Exact Inference for Integer Latent-Variable Models ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Exact MAP Inference by Avoiding Fractional Vertices ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Exploiting Strong Convexity from Data with Primal-Dual First-Order Algorithms ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Failures of Gradient-Based Deep Learning ❌ ✅ ❌ ❌ ❌ ❌ ✅ 2
Fairness in Reinforcement Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Fake News Mitigation via Point Process Based Intervention ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Fast Bayesian Intensity Estimation for the Permanental Process ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Fast k-Nearest Neighbour Search via Prioritized DCI ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Faster Greedy MAP Inference for Determinantal Point Processes ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
Faster Principal Component Regression and Stable Matrix Chebyshev Approximation ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
FeUdal Networks for Hierarchical Reinforcement Learning ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Follow the Compressed Leader: Faster Online Learning of Eigenvectors and Faster MMWU ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Follow the Moving Leader in Deep Learning ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Forest-type Regression with General Losses and Robust Forest ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Forward and Reverse Gradient-Based Hyperparameter Optimization ✅ ✅ ✅ ✅ ✅ ❌ ✅ 6
Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for Markov Chain Monte Carlo ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Frame-based Data Factorizations ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
From Patches to Images: A Nonparametric Generative Model ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
GSOS: Gauss-Seidel Operator Splitting Algorithm for Multi-Term Nonsmooth Convex Composite Optimization ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Generalization and Equilibrium in Generative Adversarial Nets (GANs) ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Geometry of Neural Network Loss Surfaces via Random Matrix Theory ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Global optimization of Lipschitz functions ✅ ❌ ✅ ✅ ❌ ✅ ✅ 5
Globally Induced Forest: A Prepruning Compression Scheme ✅ ✅ ✅ ✅ ❌ ✅ ✅ 6
Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Gradient Boosted Decision Trees for High Dimensional Sparse Output ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Gradient Coding: Avoiding Stragglers in Distributed Learning ✅ ❌ ✅ ❌ ✅ ❌ ❌ 3
Gradient Projection Iterative Sketch for Large-Scale Constrained Least-Squares ✅ ❌ ✅ ❌ ✅ ✅ ✅ 5
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Grammar Variational Autoencoder ✅ ✅ ✅ ❌ ❌ ❌ ❌ 3
Graph-based Isometry Invariant Representation Learning ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Guarantees for Greedy Maximization of Non-submodular Functions with Applications ✅ ✅ ✅ ❌ ❌ ✅ ✅ 5
Hierarchy Through Composition with Multitask LMDPs ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
High Dimensional Bayesian Optimization with Elastic Gaussian Process ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
High-Dimensional Structured Quantile Regression ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
High-dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
How Close Are the Eigenvectors of the Sample and Actual Covariance Matrices? ❌ ❌ ✅ ❌ ❌ ❌ ❌ 1
How to Escape Saddle Points Efficiently ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Hyperplane Clustering via Dual Principal Component Pursuit ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Identification and Model Testing in Linear Structural Equation Models using Auxiliary Variables ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Identify the Nash Equilibrium in Static Games with Random Payoffs ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Identifying Best Interventions through Online Importance Sampling ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Image-to-Markup Generation with Coarse-to-Fine Attention ❌ ✅ ✅ ✅ ✅ ❌ ✅ 5
Improved Variational Autoencoders for Text Modeling using Dilated Convolutions ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Improving Gibbs Sampler Scan Quality with DoGS ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Improving Viterbi is Hard: Better Runtimes Imply Faster Clique Algorithms ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Innovation Pursuit: A New Approach to the Subspace Clustering Problem ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Input Convex Neural Networks ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Input Switched Affine Networks: An RNN Architecture Designed for Interpretability ❌ ❌ ✅ ✅ ❌ ❌ ❌ 2
Interactive Learning from Policy-Dependent Human Feedback ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Iterative Machine Teaching ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Joint Dimensionality Reduction and Metric Learning: A Geometric Take ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Just Sort It! A Simple and Effective Approach to Active Preference Learning ✅ ✅ ✅ ❌ ✅ ❌ ❌ 4
Kernelized Support Tensor Machines ✅ ❌ ✅ ✅ ❌ ✅ ✅ 5
Know-Evolve: Deep Temporal Reasoning for Dynamic Knowledge Graphs ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Language Modeling with Gated Convolutional Networks ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Large-Scale Evolution of Image Classifiers ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Latent Feature Lasso ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Latent Intention Dialogue Models ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Latent LSTM Allocation: Joint Clustering and Non-Linear Dynamic Modeling of Sequence Data ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Lazifying Conditional Gradient Algorithms ✅ ❌ ❌ ❌ ❌ ✅ ❌ 2
Learned Optimizers that Scale and Generalize ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Learning Algorithms for Active Learning ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Learning Continuous Semantic Representations of Symbolic Expressions ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Learning Deep Architectures via Generalized Whitened Neural Networks ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Learning Deep Latent Gaussian Models with Markov Chain Monte Carlo ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Learning Determinantal Point Processes with Moments and Cycles ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Discrete Representations via Information Maximizing Self-Augmented Training ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Learning Gradient Descent: Better Generalization and Longer Horizons ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Learning Hawkes Processes from Short Doubly-Censored Event Sequences ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Learning Hierarchical Features from Deep Generative Models ❌ ✅ ✅ ❌ ❌ ❌ ❌ 2
Learning Important Features Through Propagating Activation Differences ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Learning Infinite Layer Networks Without the Kernel Trick ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Latent Space Models with Angular Constraints ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Learning Sleep Stages from Radio Signals: A Conditional Adversarial Architecture ✅ ❌ ✅ ✅ ❌ ❌ ❌ 3
Learning Stable Stochastic Nonlinear Dynamical Systems ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Learning Texture Manifolds with the Periodic Spatial GAN ❌ ✅ ✅ ❌ ✅ ❌ ✅ 4
Learning from Clinical Judgments: Semi-Markov-Modulated Marked Hawkes Processes for Risk Prognosis ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Learning in POMDPs with Monte Carlo Tree Search ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Learning the Structure of Generative Models without Labeled Data ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Learning to Aggregate Ordinal Labels by Maximizing Separating Width ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
Learning to Align the Source Code to the Compiled Object Code ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier ✅ ✅ ❌ ✅ ✅ ❌ ✅ 5
Learning to Discover Cross-Domain Relations with Generative Adversarial Networks ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Learning to Discover Sparse Graphical Models ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Learning to Generate Long-term Future via Hierarchical Prediction ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Learning to Learn without Gradient Descent by Gradient Descent ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Leveraging Node Attributes for Incomplete Relational Data ❌ ✅ ✅ ❌ ✅ ❌ ✅ 4
Leveraging Union of Subspace Structure to Improve Constrained Clustering ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Local Bayesian Optimization of Motor Skills ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Local-to-Global Bayesian Network Structure Learning ✅ ❌ ✅ ❌ ✅ ❌ ❌ 3
Logarithmic Time One-Against-Some ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Lost Relatives of the Gumbel Trick ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
MEC: Memory-efficient Convolution for Deep Neural Network ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Magnetic Hamiltonian Monte Carlo ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Max-value Entropy Search for Efficient Bayesian Optimization ✅ ✅ ✅ ✅ ✅ ❌ ✅ 6
Maximum Selection and Ranking under Noisy Comparisons ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
McGan: Mean and Covariance Feature Matching GAN ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Measuring Sample Quality with Kernels ❌ ✅ ✅ ❌ ✅ ❌ ✅ 4
Meritocratic Fairness for Cross-Population Selection ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Meta Networks ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Minimax Regret Bounds for Reinforcement Learning ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Minimizing Trust Leaks for Robust Sybil Detection ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Model-Independent Online Learning for Influence Maximization ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Modular Multitask Reinforcement Learning with Policy Sketches ✅ ✅ ❌ ❌ ❌ ❌ ✅ 3
Multi-Class Optimal Margin Distribution Machine ✅ ❌ ✅ ✅ ✅ ✅ ✅ 6
Multi-fidelity Bayesian Optimisation with Continuous Approximations ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Multi-objective Bandits: Optimizing the Generalized Gini Index ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Multi-task Learning with Labeled and Unlabeled Tasks ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Multichannel End-to-end Speech Recognition ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Multilabel Classification with Group Testing and Codes ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Multilevel Clustering via Wasserstein Means ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
Multiple Clustering Views from Multiple Uncertain Experts ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Multiplicative Normalizing Flows for Variational Bayesian Neural Networks ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Natasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Near-Optimal Design of Experiments via Regret Minimization ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Nearly Optimal Robust Matrix Completion ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Neural Episodic Control ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Neural Message Passing for Quantum Chemistry ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Neural Networks and Rational Functions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Neural Optimizer Search with Reinforcement Learning ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Neural Taylor Approximations: Convergence and Exploration in Rectifier Networks ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
No Spurious Local Minima in Nonconvex Low Rank Problems: A Unified Geometric Analysis ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Nonnegative Matrix Factorization for Time Series Recovery From a Few Temporal Aggregates ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Nonparanormal Information Estimation ❌ ✅ ❌ ❌ ❌ ❌ ✅ 2
Nyström Method with Kernel K-means++ Samples as Landmarks ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
On Approximation Guarantees for Greedy Low Rank Optimization ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
On Calibration of Modern Neural Networks ❌ ❌ ✅ ✅ ❌ ❌ ❌ 2
On Context-Dependent Clustering of Bandits ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
On Kernelized Multi-armed Bandits ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
On Mixed Memberships and Symmetric Nonnegative Matrix Factorizations ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
On Relaxing Determinism in Arithmetic Circuits ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On The Projection Operator to A Three-view Cardinality Constrained Set ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
On orthogonality and learning recurrent networks with long term dependencies ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
On the Expressive Power of Deep Neural Networks ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
On the Iteration Complexity of Support Recovery via Hard Thresholding Pursuit ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
On the Sampling Problem for Kernel Quadrature ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Online Learning to Rank in Stochastic Click Models ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Online Learning with Local Permutations and Delayed Feedback ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Online Partial Least Square Optimization: Dropping Convexity for Better Efficiency and Scalability ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Online and Linear-Time Attention by Enforcing Monotonic Alignments ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
OptNet: Differentiable Optimization as a Layer in Neural Networks ❌ ✅ ❌ ❌ ✅ ❌ ✅ 3
Optimal Algorithms for Smooth and Strongly Convex Distributed Optimization in Networks ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Optimal Densification for Fast and Accurate Minwise Hashing ✅ ✅ ✅ ❌ ✅ ❌ ❌ 4
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Oracle Complexity of Second-Order Methods for Finite-Sum Problems ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Ordinal Graphical Models: A Tale of Two Approaches ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Orthogonalized ALS: A Theoretically Principled Tensor Decomposition Algorithm for Practical Use ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Pain-Free Random Differential Privacy with Sensitivity Sampling ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Parallel Multiscale Autoregressive Density Estimation ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Parseval Networks: Improving Robustness to Adversarial Examples ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Partitioned Tensor Factorizations for Learning Mixed Membership Models ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
PixelCNN Models with Auxiliary Variables for Natural Image Modeling ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Post-Inference Prior Swapping ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Practical Gauss-Newton Optimisation for Deep Learning ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
Prediction and Control with Temporal Segment Models ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Prediction under Uncertainty in Sparse Spectrum Gaussian Processes with Applications to Filtering and Control ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Preferential Bayesian Optimization ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Priv’IT: Private and Sample Efficient Identity Testing ✅ ❌ ❌ ❌ ✅ ❌ ✅ 3
Probabilistic Path Hamiltonian Monte Carlo ✅ ✅ ✅ ❌ ❌ ✅ ✅ 5
Probabilistic Submodular Maximization in Sub-Linear Time ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Programming with a Differentiable Forth Interpreter ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Projection-free Distributed Online Learning in Networks ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
ProtoNN: Compressed and Accurate kNN for Resource-scarce Devices ✅ ✅ ✅ ✅ ✅ ❌ ✅ 6
Provable Alternating Gradient Descent for Non-negative Matrix Factorization with Strong Correlations ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Provably Optimal Algorithms for Generalized Linear Contextual Bandits ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Prox-PDA: The Proximal Primal-Dual Algorithm for Fast Distributed Nonconvex Optimization and Learning Over Networks ✅ ❌ ❌ ❌ ✅ ❌ ✅ 3
Random Feature Expansions for Deep Gaussian Processes ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Re-revisiting Learning on Hypergraphs: Confidence Interval and Subgradient Method ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Real-Time Adaptive Image Compression ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Recovery Guarantees for One-hidden-layer Neural Networks ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Recurrent Highway Networks ❌ ❌ ✅ ❌ ❌ ❌ ❌ 1
Recursive Partitioning for Personalization using Observational Data ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Reduced Space and Faster Convergence in Imperfect-Information Games via Pruning ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Regret Minimization in Behaviorally-Constrained Zero-Sum Games ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Regularising Non-linear Models Using Feature Side-information ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Reinforcement Learning with Deep Energy-Based Policies ✅ ✅ ❌ ❌ ❌ ❌ ✅ 3
Relative Fisher Information and Natural Gradient for Learning Large Modular Models ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Resource-efficient Machine Learning in 2 KB RAM for the Internet of Things ✅ ✅ ✅ ✅ ✅ ❌ ✅ 6
Risk Bounds for Transferring Representations With and Without Fine-Tuning ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Robust Adversarial Reinforcement Learning ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Robust Budget Allocation via Continuous Submodular Functions ✅ ✅ ✅ ❌ ❌ ✅ ✅ 5
Robust Gaussian Graphical Model Estimation with Arbitrary Corruption ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Robust Guarantees of Stochastic Greedy Algorithms ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Robust Probabilistic Modeling with Bayesian Data Reweighting ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Robust Structured Estimation with Single-Index Models ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
Robust Submodular Maximization: A Non-Uniform Partitioning Approach ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
RobustFill: Neural Program Learning under Noisy I/O ❌ ❌ ✅ ❌ ✅ ❌ ✅ 3
Rule-Enhanced Penalized Regression by Column Generation using Rectangular Maximum Agreement ✅ ❌ ✅ ❌ ✅ ✅ ✅ 5
SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
SPLICE: Fully Tractable Hierarchical Extension of ICA with Pooling ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Safety-Aware Algorithms for Adversarial Contextual Bandit ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Scalable Bayesian Rule Lists βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Scalable Generative Models for Multi-label Learning with Missing Labels ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Scalable Multi-Class Gaussian Process Classification using Expectation Propagation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Second-Order Kernel Online Convex Optimization with Adaptive Sketching βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Selective Inference for Sparse High-Order Interaction Models ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Self-Paced Co-training βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Sequence Modeling via Segmentations βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Sequence to Better Sequence: Continuous Revision of Combinatorial Structures βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Sharp Minima Can Generalize For Deep Nets ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
Sliced Wasserstein Kernel for Persistence Diagrams ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Soft-DTW: a Differentiable Loss Function for Time-Series ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Source-Target Similarity Modelings for Multi-Source Transfer Gaussian Process Regression ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Sparse + Group-Sparse Dirty Models: Statistical Guarantees without Unreasonable Conditions and a Case for Non-Convexity ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Spectral Learning from a Single Trajectory under Finite-State Policies ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Spherical Structured Feature Maps for Kernel Approximation ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization ✅ ✅ ✅ ✅ ✅ ❌ ✅ 6
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
State-Frequency Memory Recurrent Neural Networks ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening ❌ ❌ ✅ ❌ ❌ ❌ ❌ 1
StingyCD: Safely Avoiding Wasteful Updates in Coordinate Descent ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Stochastic Adaptive Quasi-Newton Methods for Minimizing Expected Values ✅ ❌ ❌ ❌ ✅ ✅ ✅ 4
Stochastic Bouncy Particle Sampler ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Stochastic DCA for the Large-sum of Non-convex Functions Problem and its Application to Group Variable Selection in Classification ✅ ❌ ✅ ✅ ✅ ❌ ✅ 5
Stochastic Generative Hashing ✅ ✅ ✅ ❌ ✅ ❌ ✅ 5
Stochastic Gradient MCMC Methods for Hidden Markov Models ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Stochastic Gradient Monomial Gamma Sampler ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Stochastic Modified Equations and Adaptive Stochastic Gradient Algorithms ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Stochastic Variance Reduction Methods for Policy Evaluation ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Strong NP-Hardness for Sparse Optimization with Concave Penalty Functions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Strongly-Typed Agents are Guaranteed to Interact Safely ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Sub-sampled Cubic Regularization for Non-convex Optimization ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
Tensor Balancing on Statistical Manifold ❌ ✅ ✅ ❌ ✅ ✅ ✅ 5
Tensor Belief Propagation ✅ ❌ ✅ ❌ ✅ ✅ ✅ 5
Tensor Decomposition via Simultaneous Power Iteration ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Tensor Decomposition with Smoothness ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Tensor-Train Recurrent Neural Networks for Video Classification ❌ ✅ ✅ ✅ ✅ ❌ ✅ 5
The Loss Surface of Deep and Wide Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Predictron: End-To-End Learning and Planning ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
The Price of Differential Privacy for Online Learning ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
The Sample Complexity of Online One-Class Collaborative Filtering ✅ ❌ ✅ ❌ ❌ ❌ ❌ 2
The Shattered Gradients Problem: If resnets are the answer, then what is the question? ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
The Statistical Recurrent Unit ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Tight Bounds for Approximate Carathéodory and Beyond ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Toward Controlled Generation of Text ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Toward Efficient and Accurate Covariance Matrix Estimation on Compressed Data ✅ ❌ ✅ ❌ ✅ ❌ ✅ 4
Towards K-means-friendly Spaces: Simultaneous Deep Learning and Clustering ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs ✅ ✅ ✅ ✅ ❌ ❌ ✅ 5
Uncertainty Assessment and False Discovery Rate Control in High-Dimensional Granger Causal Inference ❌ ❌ ✅ ❌ ❌ ❌ ❌ 1
Uncorrelation and Evenness: a New Diversity-Promoting Regularizer ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Uncovering Causality from Multivariate Hawkes Integrated Cumulants ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Understanding Black-box Predictions via Influence Functions ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Understanding Synthetic Gradients and Decoupled Neural Interfaces ❌ ❌ ✅ ❌ ❌ ❌ ✅ 2
Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Uniform Convergence Rates for Kernel Density Estimation ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Uniform Deviation Bounds for k-Means Clustering ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Unifying Task Specification in Reinforcement Learning ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
Unimodal Probability Distributions for Deep Ordinal Classification ❌ ✅ ✅ ✅ ❌ ❌ ✅ 4
Unsupervised Learning by Predicting Noise ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Variants of RMSProp and Adagrad with Logarithmic Regret Bounds ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Variational Boosting: Iteratively Refining Posterior Approximations ✅ ✅ ✅ ❌ ❌ ❌ ✅ 4
Variational Dropout Sparsifies Deep Neural Networks ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Variational Inference for Sparse and Undirected Models ✅ ❌ ❌ ❌ ❌ ❌ ❌ 1
Variational Policy for Guiding Point Processes ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
Video Pixel Networks ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Warped Convolutions: Efficient Invariance to Spatial Transformations ✅ ❌ ✅ ✅ ❌ ❌ ✅ 4
Wasserstein Generative Adversarial Networks ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
When can Multi-Site Datasets be Pooled for Regression? Hypothesis Tests, $\ell_2$-consistency and Neuroscience Applications ❌ ✅ ✅ ❌ ❌ ❌ ✅ 3
Why is Posterior Sampling Better than Optimism for Reinforcement Learning? ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
World of Bits: An Open-Domain Platform for Web-Based Agents ❌ ❌ ❌ ❌ ❌ ❌ ✅ 1
Zero-Inflated Exponential Family Embeddings ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning ✅ ❌ ❌ ❌ ❌ ❌ ✅ 2
ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning ❌ ❌ ✅ ✅ ❌ ❌ ✅ 3
Zonotope Hit-and-run for Efficient Sampling from Projection DPPs ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3
iSurvive: An Interpretable, Event-time Prediction Model for mHealth ✅ ✅ ❌ ✅ ❌ ❌ ❌ 3
meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting ❌ ❌ ✅ ✅ ✅ ❌ ✅ 4
“Convex Until Proven Guilty”: Dimension-Free Acceleration of Gradient Descent on Non-Convex Functions ✅ ❌ ✅ ❌ ❌ ❌ ✅ 3