Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Conference on Neural Information Processing Systems (NeurIPS) - 2017

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
NeurIPS 2017 679 0.39 2.95 90.13% 35.62%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning 2
A Bayesian Data Augmentation Approach for Learning Deep Models 3
A Decomposition of Forecast Error in Prediction Markets 1
A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering 2
A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning 3
A General Framework for Robust Interactive Learning 1
A Greedy Approach for Budgeted Maximum Inner Product Search 2
A KL-LUCB algorithm for Large-Scale Crowdsourcing 3
A Learning Error Analysis for Structured Prediction with Approximate Inference 5
A Linear-Time Kernel Goodness-of-Fit Test 4
A Meta-Learning Perspective on Cold-Start Recommendations for Items 1
A Minimax Optimal Algorithm for Crowdsourcing 2
A New Alternating Direction Method for Linear Programming 2
A New Theory for Matrix Completion 1
A PAC-Bayesian Analysis of Randomized Learning with Application to Stochastic Gradient Descent 3
A Probabilistic Framework for Nonlinearities in Stochastic Neural Networks 2
A Regularized Framework for Sparse and Structured Neural Attention 2
A Sample Complexity Measure with Applications to Learning Optimal Auctions 0
A Scale Free Algorithm for Stochastic Bandits with Bounded Kurtosis 0
A Screening Rule for l1-Regularized Ising Model Estimation 5
A Sharp Error Analysis for the Fused Lasso, with Application to Approximate Changepoint Screening 0
A Unified Approach to Interpreting Model Predictions 3
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning 2
A Universal Analysis of Large-Scale Regularized Least Squares Solutions 1
A framework for Multi-A(rmed)/B(andit) Testing with Online FDR Control 5
A graph-theoretic approach to multitasking 0
A multi-agent reinforcement learning model of common-pool resource appropriation 0
A simple model of recognition and recall memory 2
A simple neural network module for relational reasoning 3
A-NICE-MC: Adversarial Training for MCMC 3
ADMM without a Fixed Penalty Parameter: Faster Convergence with New Adaptive Penalization 3
AIDE: An algorithm for measuring the accuracy of probabilistic inference algorithms 2
ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching 2
Accelerated First-order Methods for Geodesically Convex Optimization on Riemannian Manifolds 3
Accelerated Stochastic Greedy Coordinate Descent by Soft Thresholding Projection onto Simplex 3
Accelerated consensus via Min-Sum Splitting 1
Acceleration and Averaging in Stochastic Descent Dynamics 0
Accuracy First: Selecting a Differential Privacy Level for Accuracy Constrained ERM 3
Action Centered Contextual Bandits 2
Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples 2
Active Exploration for Learning Symbolic Representations 2
Active Learning from Peers 4
AdaGAN: Boosting Generative Models 3
Adaptive Accelerated Gradient Converging Method under H\"{o}lderian Error Bound Condition 3
Adaptive Active Hypothesis Testing under Limited Information 1
Adaptive Batch Size for Safe Policy Gradients 2
Adaptive Bayesian Sampling with Monte Carlo EM 3
Adaptive Classification for Prediction Under a Budget 3
Adaptive Clustering through Semidefinite Programming 2
Adaptive SVRG Methods under Error Bound Conditions with Unknown Growth Parameter 3
Adaptive stimulus selection for optimizing neural population responses 4
Adversarial Ranking for Language Generation 3
Adversarial Surrogate Losses for Ordinal Regression 3
Adversarial Symmetric Variational Autoencoder 3
Affine-Invariant Online Optimization and the Low-rank Experts Problem 1
Affinity Clustering: Hierarchical Clustering at Scale 3
Aggressive Sampling for Multi-class to Binary Reduction with Applications to Text Classification 5
Alternating Estimation for Structured High-Dimensional Multi-Response Models 2
Alternating minimization for dictionary learning with random initialization 1
An Empirical Bayes Approach to Optimizing Machine Learning Algorithms 4
An Empirical Study on The Properties of Random Bases for Kernel Methods 3
An Error Detection and Correction Framework for Connectomics 4
An inner-loop free solution to inverse problems using deep neural networks 3
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems 4
Approximate Supermodularity Bounds for Experimental Design 2
Approximation Algorithms for $\ell_0$-Low Rank Approximation 1
Approximation Bounds for Hierarchical Clustering: Average Linkage, Bisecting K-means, and Local Search 1
Approximation and Convergence Properties of Generative Adversarial Learning 0
Associative Embedding: End-to-End Learning for Joint Detection and Grouping 3
Asynchronous Coordinate Descent under More Realistic Assumptions 3
Asynchronous Parallel Coordinate Minimization for MAP Inference 3
Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin 3
Attention is All you Need 5
Attentional Pooling for Action Recognition 2
Avoiding Discrimination through Causal Reasoning 0
Balancing information exposure in social networks 4
Bandits Dueling on Partially Ordered Sets 3
Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models 4
Bayesian Compression for Deep Learning 5
Bayesian Dyadic Trees and Histograms for Regression 0
Bayesian GAN 6
Bayesian Inference of Individualized Treatment Effects using Multi-task Gaussian Processes 4
Bayesian Optimization with Gradients 5
Best Response Regression 5
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model 3
Beyond Parity: Fairness Objectives for Collaborative Filtering 2
Beyond Worst-case: A Probabilistic Analysis of Affine Policies in Dynamic Optimization 3
Beyond normality: Learning sparse probabilistic graphical models in the non-Gaussian setting 3
Boltzmann Exploration Done Right 1
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization 4
Bregman Divergence for Stochastic Variance Reduction: Saddle-Point and Adversarial Prediction 3
Bridging the Gap Between Value and Policy Based Reinforcement Learning 2
Can Decentralized Algorithms Outperform Centralized Algorithms? A Case Study for Decentralized Parallel Stochastic Gradient Descent 5
Causal Effect Inference with Deep Latent-Variable Models 3
Certified Defenses for Data Poisoning Attacks 4
Clone MCMC: Parallel High-Dimensional Gaussian Gibbs Sampling 2
Clustering Billions of Reads for DNA Data Storage 4
Clustering Stable Instances of Euclidean k-means. 2
Clustering with Noisy Queries 1
Coded Distributed Computing for Inverse Problems 3
Cold-Start Reinforcement Learning with Softmax Policy Gradient 5
Collaborative Deep Learning in Fixed Topology Networks 3
Collaborative PAC Learning 1
Collapsed variational Bayes for Markov jump processes 3
Collecting Telemetry Data Privately 1
Communication-Efficient Distributed Learning of Discrete Distributions 0
Compatible Reward Inverse Reinforcement Learning 3
Compression-aware Training of Deep Networks 4
Concentration of Multilinear Functions of the Ising Model with Applications to Network Data 2
Concrete Dropout 4
Conic Scan-and-Cover algorithms for nonparametric topic modeling 3
Conservative Contextual Linear Bandits 2
Consistent Multitask Learning with Nonlinear Output Relations 2
Consistent Robust Regression 3
Context Selection for Embedding Models 5
Continual Learning with Deep Generative Replay 1
Continuous DR-submodular Maximization: Structure and Algorithms 4
Contrastive Learning for Image Captioning 3
Controllable Invariance through Adversarial Feature Learning 4
Convergence Analysis of Two-layer Neural Networks with ReLU Activation 3
Convergence of Gradient EM on Multi-component Mixture of Gaussians 1
Convergence rates of a partition based Bayesian multivariate density estimation method 0
Convergent Block Coordinate Descent for Training Tikhonov Regularized Deep Neural Networks 3
Convolutional Gaussian Processes 3
Convolutional Phase Retrieval 1
Cortical microcircuits as gated-recurrent neural networks 3
Cost efficient gradient boosting 4
Counterfactual Fairness 3
Countering Feedback Delays in Multi-Agent Learning 1
Cross-Spectral Factor Analysis 2
DPSCREEN: Dynamic Personalized Screening 3
Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs 2
Deanonymization in the Bitcoin P2P Network 3
Decoding with Value Networks for Neural Machine Translation 5
Decomposable Submodular Function Minimization: Discrete and Continuous 3
Decomposition-Invariant Conditional Gradient for General Polytopes with Line Search 3
Deconvolutional Paragraph Representation Learning 4
Decoupling "when to update" from "how to update" 4
Deep Dynamic Poisson Factorization Model 2
Deep Hyperalignment 6
Deep Hyperspherical Learning 3
Deep Lattice Networks and Partial Monotonic Functions 4
Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model 2
Deep Learning with Topological Signatures 3
Deep Mean-Shift Priors for Image Restoration 5
Deep Multi-task Gaussian Processes for Survival Analysis with Competing Risks 3
Deep Recurrent Neural Network-Based Identification of Precursor microRNAs 5
Deep Reinforcement Learning from Human Preferences 3
Deep Sets 3
Deep Subspace Clustering Networks 4
Deep Supervised Discrete Hashing 3
Deep Voice 2: Multi-Speaker Neural Text-to-Speech 2
Deliberation Networks: Sequence Generation Beyond One-Pass Decoding 5
Detrended Partial Cross Correlation for Brain Connectivity Analysis 5
Differentiable Learning of Logical Rules for Knowledge Base Reasoning 4
Differentiable Learning of Submodular Models 3
Differentially Private Empirical Risk Minimization Revisited: Faster and More General 1
Differentially private Bayesian learning on distributed data 5
Diffusion Approximations for Online Principal Component Estimation and Global Convergence 0
Dilated Recurrent Neural Networks 5
Discovering Potential Correlations via Hypercontractivity 2
Discriminative State Space Models 1
Distral: Robust multitask reinforcement learning 1
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space 3
Diving into the shallows: a computational perspective on large-scale shallow learning 5
Do Deep Neural Networks Suffer from Crowding? 3
Doubly Accelerated Stochastic Variance Reduced Dual Averaging Method for Regularized Empirical Risk Minimization 3
Doubly Stochastic Variational Inference for Deep Gaussian Processes 4
DropoutNet: Addressing Cold Start in Recommender Systems 5
Dual Discriminator Generative Adversarial Nets 4
Dual Path Networks 3
Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis 4
Dualing GANs 3
Dykstra's Algorithm, ADMM, and Coordinate Descent: Connections, Insights, and Extensions 0
Dynamic Importance Sampling for Anytime Bounds of the Partition Function 3
Dynamic Revenue Sharing 2
Dynamic Routing Between Capsules 4
Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning 0
Dynamic-Depth Context Tree Weighting 4
EEG-GRAPH: A Factor-Graph-Based Model for Capturing Spatial, Temporal, and Observational Relationships in Electroencephalograms 1
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games 5
EX2: Exploration with Exemplar Models for Deep Reinforcement Learning 3
Early stopping for kernel boosting algorithms: A general analysis with localized complexities 1
Effective Parallelisation for Machine Learning 5
Efficient Approximation Algorithms for Strings Kernel Based Sequence Classification 6
Efficient Modeling of Latent Information in Supervised Learning using Gaussian Processes 3
Efficient Online Linear Optimization with Approximation Algorithms 1
Efficient Optimization for Linear Dynamical Systems with Applications to Clustering and Sparse Coding 3
Efficient Second-Order Online Kernel Learning with Adaptive Embedding 4
Efficient Sublinear-Regret Algorithms for Online Sparse Linear Regression with Limited Observation 3
Efficient Use of Limited-Memory Accelerators for Linear Learning on Heterogeneous Systems 4
Efficient and Flexible Inference for Stochastic Systems 1
Eigen-Distortions of Hierarchical Representations 1
Eigenvalue Decay Implies Polynomial-Time Learnability for Neural Networks 1
Elementary Symmetric Polynomials for Optimal Experimental Design 2
Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols 3
End-to-end Differentiable Proving 3
Ensemble Sampling 2
Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach 4
Estimating High-dimensional Non-Gaussian Multiple Index Models via Stein’s Lemma 1
Estimating Mutual Information for Discrete-Continuous Mixtures 2
Estimation of the covariance structure of heavy-tailed distributions 0
Excess Risk Bounds for the Bayes Risk using Variational Inference in Latent Gaussian Models 0
Expectation Propagation for t-Exponential Family Using q-Algebra 2
Expectation Propagation with Stochastic Kinetic Model in Complex Interaction Systems 1
Experimental Design for Learning Causal Graphs with Latent Variables 1
Exploring Generalization in Deep Learning 1
Extracting low-dimensional dynamics from multiple large-scale neural population recordings by learning to predict correlations 2
ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events 4
FALKON: An Optimal Large Scale Kernel Method 6
Fader Networks:Manipulating Images by Sliding Attributes 3
Fair Clustering Through Fairlets 1
Fast Black-box Variational Inference through Stochastic Trust-Region Optimization 3
Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe 1
Fast amortized inference of neural activity from calcium imaging data with variational autoencoders 4
Fast, Sample-Efficient Algorithms for Structured Phase Retrieval 4
Fast-Slow Recurrent Neural Networks 4
Faster and Non-ergodic O(1/K) Stochastic Alternating Direction Method of Multipliers 4
Federated Multi-Task Learning 5
Few-Shot Adversarial Domain Adaptation 3
Few-Shot Learning Through an Information Retrieval Lens 4
Filtering Variational Objectives 4
Finite Sample Analysis of the GTD Policy Evaluation Algorithms in Markov Setting 2
First-Order Adaptive Sample Size Methods to Reduce Complexity of Empirical Risk Minimization 3
Fisher GAN 5
Fitting Low-Rank Tensors in Constant Time 4
Fixed-Rank Approximation of a Positive-Semidefinite Matrix from Streaming Data 4
Flexible statistical inference for mechanistic models of neural dynamics 2
Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks 5
Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation 3
From Bayesian Sparsity to Gated Recurrent Nets 0
From Parity to Preference-based Notions of Fairness in Classification 2
From which world is your graph 1
Fully Decentralized Policies for Multi-Agent Systems: An Information Theoretic Approach 1
GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium 2
GP CaKe: Effective brain connectivity with causal kernels 1
Gated Recurrent Convolution Neural Network for OCR 4
Gauging Variational Inference 2
Gaussian Quadrature for Kernel Features 3
Gaussian process based nonlinear latent structure discovery in multivariate spike train data 3
Generalization Properties of Learning with Random Features 1
Generalized Linear Model Regression under Distance-to-set Penalties 2
Generalizing GANs: A Turing Perspective 1
Generating steganographic images via adversarial training 3
Generative Local Metric Learning for Kernel Regression 3
Geometric Descent Method for Convex Composite Minimization 4
Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks 5
GibbsNet: Iterative Adversarial Inference for Deep Graphical Models 1
Good Semi-supervised Learning That Requires a Bad GAN 4
Gradient Descent Can Take Exponential Time to Escape Saddle Points 2
Gradient Episodic Memory for Continual Learning 4
Gradient Methods for Submodular Maximization 3
Gradient descent GAN optimization is locally stable 2
Gradients of Generative Models for Improved Discriminative Analysis of Tandem Mass Spectra 3
Graph Matching via Multiplicative Update Algorithm 2
Greedy Algorithms for Cone Constrained Optimization with Convergence Guarantees 2
Group Additive Structure Identification for Kernel Nonparametric Regression 4
Group Sparse Additive Machine 3
Hash Embeddings for Efficient Word Representations 4
Hiding Images in Plain Sight: Deep Steganography 1
Hierarchical Attentive Recurrent Tracking 4
Hierarchical Clustering Beyond the Worst-Case 3
Hierarchical Implicit Models and Likelihood-Free Variational Inference 3
Hierarchical Methods of Moments 5
High-Order Attention Models for Visual Question Answering 4
Higher-Order Total Variation Classes on Grids: Minimax Theory and Trend Filtering Methods 0
Hindsight Experience Replay 3
Houdini: Fooling Deep Structured Visual and Speech Recognition Models with Adversarial Examples 2
How regularization affects the critical points in linear networks 1
Hunt For The Unique, Stable, Sparse And Fast Feature Learning On Graphs 6
Hybrid Reward Architecture for Reinforcement Learning 2
Hypothesis Transfer Learning via Transformation Functions 4
Identification of Gaussian Process State Space Models 2
Identifying Outlier Arms in Multi-Armed Bandit 2
Imagination-Augmented Agents for Deep Reinforcement Learning 1
Implicit Regularization in Matrix Factorization 2
Improved Dynamic Regret for Non-degenerate Functions 1
Improved Graph Laplacian via Geometric Self-Consistency 4
Improved Training of Wasserstein GANs 5
Improving Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms and Its Applications 1
Improving the Expected Improvement Algorithm 1
Incorporating Side Information by Adaptive Convolution 3
Independence clustering (without a matrix) 1
Inductive Representation Learning on Large Graphs 5
Inference in Graphical Models via Semidefinite Programming Hierarchies 2
Inferring Generative Model Structure with Static Analysis 2
Influence Maximization with $\varepsilon$-Almost Submodular Threshold Functions 4
InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations 2
Information Theoretic Properties of Markov Random Fields, and their Algorithmic Applications 1
Information-theoretic analysis of generalization capability of learning algorithms 0
Inhomogeneous Hypergraph Clustering with Applications 2
Integration Methods and Optimization Algorithms 0
Interactive Submodular Bandit 2
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning 3
Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts 4
Introspective Classification with Convolutional Nets 4
Invariance and Stability of Deep Convolutional Representations 0
Inverse Filtering for Hidden Markov Models 3
Inverse Reward Design 0
Is Input Sparsity Time Possible for Kernel Low-Rank Approximation? 0
Is the Bellman residual a bad proxy? 1
Joint distribution optimal transportation for domain adaptation 5
K-Medoids For K-Means Seeding 4
Kernel Feature Selection via Conditional Covariance Minimization 4
Kernel functions based on triplet comparisons 4
Label Distribution Learning Forests 4
Label Efficient Learning of Transferable Representations acrosss Domains and Tasks 3
Langevin Dynamics with Continuous Tempering for Training Deep Neural Networks 4
Language Modeling with Recurrent Highway Hypernetworks 5
Large-Scale Quadratically Constrained Quadratic Program via Low-Discrepancy Sequences 2
Learned D-AMP: Principled Neural Network based Compressive Image Recovery 5
Learned in Translation: Contextualized Word Vectors 4
Learning A Structured Optimal Bipartite Graph for Co-Clustering 3
Learning Active Learning from Data 3
Learning Affinity via Spatial Propagation Networks 3
Learning Causal Structures Using Regression Invariance 3
Learning Chordal Markov Networks via Branch and Bound 5
Learning Combinatorial Optimization Algorithms over Graphs 7
Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction 6
Learning Disentangled Representations with Semi-Supervised Deep Generative Models 3
Learning Efficient Object Detection Models with Knowledge Distillation 3
Learning Graph Representations with Embedding Propagation 4
Learning Hierarchical Information Flow with Recurrent Neural Modules 3
Learning Identifiable Gaussian Bayesian Networks in Polynomial Time and Sample Complexity 3
Learning Koopman Invariant Subspaces for Dynamic Mode Decomposition 2
Learning Linear Dynamical Systems via Spectral Filtering 2
Learning Low-Dimensional Metrics 1
Learning Mixture of Gaussians with Streaming Data 1
Learning Multiple Tasks with Multilinear Relationship Networks 3
Learning Neural Representations of Human Cognition across Many fMRI Studies 4
Learning Overcomplete HMMs 2
Learning Populations of Parameters 1
Learning ReLUs via Gradient Descent 1
Learning Spherical Convolution for Fast Features from 360° Imagery 2
Learning Unknown Markov Decision Processes: A Thompson Sampling Approach 3
Learning a Multi-View Stereo Machine 4
Learning from Complementary Labels 3
Learning from uncertain curves: The 2-Wasserstein metric for Gaussian processes 3
Learning multiple visual domains with residual adapters 3
Learning spatiotemporal piecewise-geodesic trajectories from longitudinal manifold-valued data 1
Learning the Morphology of Brain Signals Using Alpha-Stable Convolutional Sparse Coding 4
Learning to Compose Domain-Specific Transformations for Data Augmentation 4
Learning to Inpaint for Image Compression 2
Learning to Model the Tail 3
Learning to Pivot with Adversarial Networks 4
Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon 4
Learning to See Physics via Visual De-animation 2
Learning with Average Top-k Loss 4
Learning with Bandit Feedback in Potential Games 1
Learning with Feature Evolvable Streams 3
LightGBM: A Highly Efficient Gradient Boosting Decision Tree 5
Limitations on Variance-Reduction and Acceleration Schemes for Finite Sums Optimization 1
Linear Convergence of a Frank-Wolfe Type Algorithm over Trace-Norm Balls 3
Linear Time Computation of Moments in Sum-Product Networks 1
Linear regression without correspondence 1
Linearly constrained Gaussian processes 2
Local Aggregative Games 2
Log-normality and Skewness of Estimated State/Action Values in Reinforcement Learning 1
Lookahead Bayesian Optimization with Inequality Constraints 2
Lower bounds on the robustness to adversarial perturbations 2
MMD GAN: Towards Deeper Understanding of Moment Matching Network 6
Machine Learning with Adversaries: Byzantine Tolerant Gradient Descent 2
Mapping distinct timescales of functional interactions among brain networks 4
MarrNet: 3D Shape Reconstruction via 2.5D Sketches 3
MaskRNN: Instance Level Video Object Segmentation 3
Masked Autoregressive Flow for Density Estimation 4
Matching neural paths: transfer from recognition to correspondence search 4
Matching on Balanced Nonlinear Representations for Treatment Effects Estimation 3
Matrix Norm Estimation from a Few Entries 3
Max-Margin Invariant Features from Transformed Unlabelled Data 1
Maximizing Subset Accuracy with Recurrent Neural Networks in Multi-label Classification 4
Maximum Margin Interval Trees 4
Maxing and Ranking with Few Assumptions 2
Mean Field Residual Networks: On the Edge of Chaos 2
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results 2
Min-Max Propagation 1
Minimal Exploration in Structured Stochastic Bandits 2
Minimax Estimation of Bandable Precision Matrices 2
Minimizing a Submodular Function from Samples 1
Mixture-Rank Matrix Approximation for Collaborative Filtering 2
Model evidence from nonequilibrium simulations 4
Model-Powered Conditional Independence Test 5
Model-based Bayesian inference of neural activity and connectivity from all-optical interrogation of a neural circuit 1
Modulating early visual processing by language 5
Monte-Carlo Tree Search by Best Arm Identification 4
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments 3
Multi-Armed Bandits with Metric Movement Costs 1
Multi-Information Source Optimization 5
Multi-Modal Imitation Learning from Unstructured Demonstrations using Generative Adversarial Nets 1
Multi-Objective Non-parametric Sequential Prediction 1
Multi-Task Learning for Contextual Bandits 4
Multi-View Decision Processes: The Helper-AI Problem 3
Multi-output Polynomial Networks and Factorization Machines 3
Multi-view Matrix Factorization for Linear Dynamical System Estimation 2
Multi-way Interacting Regression via Factorization Machines 3
Multimodal Learning and Reasoning for Visual Question Answering 3
Multiplicative Weights Update with Constant Step-Size in Congestion Games: Convergence, Limit Cycles and Chaos 1
Multiresolution Kernel Approximation for Gaussian Process Regression 5
Multiscale Quantization for Fast Similarity Search 2
Multiscale Semi-Markov Dynamics for Intracortical Brain-Computer Interfaces 2
Multitask Spectral Learning of Weighted Automata 4
Natural Value Approximators: Learning when to Trust Past Estimates 2
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem 0
Near Optimal Sketching of Low-Rank Tensor Regression 3
Near-Optimal Edge Evaluation in Explicit Generalized Binomial Graphs 2
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration 2
Nearest-Neighbor Sample Compression: Efficiency, Consistency, Infinite Dimensions 1
Net-Trim: Convex Pruning of Deep Neural Networks with Performance Guarantee 3
Neural Discrete Representation Learning 2
Neural Expectation Maximization 5
Neural Networks for Efficient Bayesian Decoding of Natural Images from Retinal Neurons 4
Neural Program Meta-Induction 1
Neural Variational Inference and Learning in Undirected Graphical Models 2
Neural system identification for large populations separating “what” and “where” 4
NeuralFDR: Learning Discovery Thresholds from Hypothesis Features 5
Noise-Tolerant Interactive Learning Using Pairwise Comparisons 1
Non-Stationary Spectral Kernels 3
Non-convex Finite-Sum Optimization Via SCSG Methods 6
Non-parametric Structured Output Networks 2
Nonbacktracking Bounds on the Influence in Independent Cascade Models 3
Nonlinear Acceleration of Stochastic Algorithms 3
Nonlinear random matrix theory for deep learning 0
Nonparametric Online Regression while Learning the Metric 1
Off-policy evaluation for slate recommendation 4
On Blackbox Backpropagation and Jacobian Sensing 2
On Fairness and Calibration 2
On Frank-Wolfe and Equilibrium Computation 1
On Optimal Generalizability in Parametric Learning 4
On Quadratic Convergence of DC Proximal Newton Algorithm in Nonconvex Sparse Learning 4
On Separability of Loss Functions, and Revisiting Discriminative Vs Generative Models 1
On Structured Prediction Theory with Calibrated Convex Surrogate Losses 0
On Tensor Train Rank Minimization : Statistical Efficiency and Scalable Algorithm 2
On clustering network-valued data 3
On the Complexity of Learning Neural Networks 1
On the Consistency of Quick Shift 1
On the Fine-Grained Complexity of Empirical Risk Minimization: Kernel Methods and Neural Networks 0
On the Model Shrinkage Effect of Gamma Process Edge Partition Models 4
On the Optimization Landscape of Tensor Decompositions 0
On the Power of Truncated SVD for General High-rank Matrix Estimation Problems 0
On-the-fly Operation Batching in Dynamic Computation Graphs 5
OnACID: Online Analysis of Calcium Imaging Data in Real Time 3
One-Shot Imitation Learning 1
One-Sided Unsupervised Domain Mapping 3
Online Convex Optimization with Stochastic Constraints 2
Online Dynamic Programming 0
Online Influence Maximization under Independent Cascade Model with Semi-Bandit Feedback 3
Online Learning for Multivariate Hawkes Processes 4
Online Learning of Optimal Bidding Strategy in Repeated Multi-Commodity Auctions 3
Online Learning with Transductive Regret 1
Online Learning with a Hint 1
Online Prediction with Selfish Experts 1
Online Reinforcement Learning in Stochastic Games 1
Online control of the false discovery rate with decaying memory 0
Online multiclass boosting 3
Online to Offline Conversions, Universality and Adaptive Minibatch Sizes 1
Optimal Sample Complexity of M-wise Data for Top-K Ranking 2
Optimal Shrinkage of Singular Values Under Random Data Contamination 2
Optimistic posterior sampling for reinforcement learning: worst-case regret bounds 1
Optimized Pre-Processing for Discrimination Prevention 3
Overcoming Catastrophic Forgetting by Incremental Moment Matching 3
PASS-GLM: polynomial approximate sufficient statistics for scalable Bayesian GLM inference 2
PRUNE: Preserving Proximity and Global Ranking for Network Embedding 3
Parallel Streaming Wasserstein Barycenters 4
Parameter-Free Online Learning via Model Selection 1
Parametric Simplex Method for Sparse Learning 2
Partial Hard Thresholding: Towards A Principled Analysis of Support Recovery 1
Permutation-based Causal Inference Algorithms with Interventions 4
Perturbative Black Box Variational Inference 4
Phase Transitions in the Pooled Data Problem 0
PixelGAN Autoencoders 3
Pixels to Graphs by Associative Embedding 2
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models 4
Poincaré Embeddings for Learning Hierarchical Representations 3
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space 2
Policy Gradient With Value Function Approximation For Collective Multiagent Planning 1
Polynomial Codes: an Optimal Design for High-Dimensional Coded Matrix Multiplication 2
Polynomial time algorithms for dual volume sampling 3
Population Matching Discrepancy and Applications in Deep Learning 6
Pose Guided Person Image Generation 3
Position-based Multiple-play Bandit Problem with Unknown Position Bias 3
Positive-Unlabeled Learning with Non-Negative Risk Estimator 4
Practical Bayesian Optimization for Model Fitting with Bayesian Adaptive Direct Search 5
Practical Data-Dependent Metric Compression with Provable Guarantees 2
Practical Hash Functions for Similarity Estimation and Dimensionality Reduction 3
Practical Locally Private Heavy Hitters 2
PredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs 3
Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network 4
Predicting Scene Parsing and Motion Dynamics in the Future 4
Predicting User Activity Level In Point Processes With Mass Transport Equation 4
Predictive State Recurrent Neural Networks 2
Predictive-State Decoders: Encoding the Future into Recurrent Networks 2
Premise Selection for Theorem Proving by Deep Graph Embedding 3
Preventing Gradient Explosions in Gated Recurrent Units 4
Principles of Riemannian Geometry in Neural Networks 2
Probabilistic Models for Integration Error in the Assessment of Functional Cardiac Models 1
Probabilistic Rule Realization and Selection 0
Process-constrained batch Bayesian optimisation 3
Protein Interface Prediction using Graph Convolutional Networks 6
Prototypical Networks for Few-shot Learning 4
Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes 3
QMDP-Net: Deep Learning for Planning under Partial Observability 2
QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding 5
Quantifying how much sensory information in a neural code is relevant for behavior 1
Query Complexity of Clustering with Side Information 1
Question Asking as Program Generation 3
REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models 3
Random Permutation Online Isotonic Regression 0
Random Projection Filter Bank for Time Series Data 3
Ranking Data with Continuous Labels through Oriented Recursive Partitions 1
Real Time Image Saliency for Black Box Classifiers 3
Real-Time Bidding with Side Information 1
Reconstruct & Crush Network 3
Reconstructing perceived faces from brain activations with deep adversarial neural decoding 4
Recurrent Ladder Networks 2
Recursive Sampling for the Nystrom Method 3
Recycling Privileged Learning and Distribution Matching for Fairness 3
Reducing Reparameterization Gradient Variance 4
Regret Analysis for Continuous Dueling Bandit 1
Regret Minimization in MDPs with Options without Prior Knowledge 1
Regularized Modal Regression with Applications in Cognitive Impairment Prediction 3
Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization 3
Reinforcement Learning under Model Mismatch 3
Reliable Decision Support using Counterfactual Models 3
Renyi Differential Privacy Mechanisms for Posterior Sampling 3
Repeated Inverse Reinforcement Learning 1
Resurrecting the sigmoid in deep learning through dynamical isometry: theory and practice 3
Revenue Optimization with Approximate Bid Predictions 3
Revisit Fuzzy Neural Network: Demystifying Batch Normalization and ReLU with Generalized Hamming Network 2
Revisiting Perceptron: Efficient and Label-Optimal Learning of Halfspaces 1
Riemannian approach to batch normalization 4
Rigorous Dynamics and Consistent Estimation in Arbitrarily Conditioned Linear Systems 1
Robust Conditional Probabilities 2
Robust Estimation of Neural Signals in Calcium Imaging 3
Robust Hypothesis Test for Nonlinear Effect with Gaussian Processes 3
Robust Imitation of Diverse Behaviors 3
Robust Optimization for Non-Convex Objectives 5
Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes 4
Rotting Bandits 2
Runtime Neural Pruning 5
SGD Learns the Conjugate Kernel Class of the Network 2
SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability 1
SVD-Softmax: Fast Softmax Approximation on Large Vocabulary Neural Networks 4
Safe Adaptive Importance Sampling 2
Safe Model-based Reinforcement Learning with Stability Guarantees 3
Safe and Nested Subgame Solving for Imperfect-Information Games 0
SafetyNets: Verifiable Execution of Deep Neural Networks on an Untrusted Cloud 4
Saliency-based Sequential Image Attention with Multiset Prediction 3
Sample and Computationally Efficient Learning Algorithms under S-Concave Distributions 1
Scalable Demand-Aware Recommendation 2
Scalable Generalized Linear Bandits: Online Computation and Hashing 2
Scalable Levy Process Priors for Spectral Kernel Learning 4
Scalable Log Determinants for Gaussian Process Kernel Learning 3
Scalable Model Selection for Belief Networks 4
Scalable Planning with Tensorflow for Hybrid Nonlinear Domains 3
Scalable Variational Inference for Dynamical Systems 3
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation 3
SchNet: A continuous-filter convolutional neural network for modeling quantum interactions 3
Selective Classification for Deep Neural Networks 4
Self-Normalizing Neural Networks 3
Self-Supervised Intrinsic Image Decomposition 2
Self-supervised Learning of Motion Capture 4
Semi-Supervised Learning for Optical Flow with Generative Adversarial Networks 3
Semi-supervised Learning with GANs: Manifold Invariance with Improved Inference 2
Semisupervised Clustering, AND-Queries and Locally Encodable Source Coding 3
Shallow Updates for Deep Reinforcement Learning 4
Shape and Material from Sound 1
Sharpness, Restart and Acceleration 3
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles 3
Simple strategies for recovering inner products from coarsely quantized random projections 2
Smooth Primal-Dual Coordinate Descent Algorithms for Nonsmooth Convex Optimization 2
Sobolev Training for Neural Networks 2
Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations 4
Solid Harmonic Wavelet Scattering: Predicting Quantum Molecular Energy from Invariant Descriptors of 3D Electronic Densities 4
Solving Most Systems of Random Quadratic Equations 4
Sparse Approximate Conic Hulls 4
Sparse Embedded $k$-Means Clustering 2
Sparse convolutional coding for neuronal assembly detection 3
Spectral Mixture Kernels for Multi-Output Gaussian Processes 3
Spectrally-normalized margin bounds for neural networks 2
Speeding Up Latent Variable Gaussian Graphical Model Estimation via Nonconvex Optimization 6
Spherical convolutions and their application in molecular modelling 5
Stabilizing Training of Generative Adversarial Networks through Regularization 4
State Aware Imitation Learning 3
Statistical Cost Sharing 0
Stein Variational Gradient Descent as Gradient Flow 1
Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference 4
Stochastic Approximation for Canonical Correlation Analysis 4
Stochastic Mirror Descent in Variationally Coherent Optimization Problems 2
Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite Sum Structure 4
Stochastic Submodular Maximization: The Case of Coverage Functions 3
Stochastic and Adversarial Online Learning without Hyperparameters 1
Straggler Mitigation in Distributed Optimization Through Data Encoding 4
Streaming Robust Submodular Maximization: A Partitioned Thresholding Approach 3
Streaming Sparse Gaussian Process Approximations 3
Streaming Weak Submodularity: Interpreting Neural Networks on the Fly 4
Structured Bayesian Pruning via Log-Normal Multiplicative Noise 4
Structured Embedding Models for Grouped Data 3
Structured Generative Adversarial Networks 4
Style Transfer from Non-Parallel Text by Cross-Alignment 4
Submultiplicative Glivenko-Cantelli and Uniform Convergence of Revenues 0
Subset Selection and Summarization in Sequential Data 3
Subset Selection under Noise 3
Subspace Clustering via Tangent Cones 1
Successor Features for Transfer in Reinforcement Learning 1
Targeting EEG/LFP Synchrony with Neural Nets 4
Task-based End-to-end Model Learning in Stochastic Optimization 3
Teaching Machines to Describe Images with Natural Language Feedback 4
Temporal Coherency based Criteria for Predicting Video Frames using Deep Multi-stage Generative Adversarial Networks 3
Tensor Biclustering 4
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning 5
Testing and Learning on Distributions with Symmetric Noise Invariance 4
The Expressive Power of Neural Networks: A View from the Width 1
The Expxorcist: Nonparametric Graphical Models Via Conditional Exponential Densities 3
The Importance of Communities for Learning to Influence 2
The Marginal Value of Adaptive Gradient Methods in Machine Learning 3
The Neural Hawkes Process: A Neurally Self-Modulating Multivariate Point Process 5
The Numerics of GANs 4
The Reversible Residual Network: Backpropagation Without Storing Activations 6
The Scaling Limit of High-Dimensional Online Independent Component Analysis 1
The Unreasonable Effectiveness of Structured Random Orthogonal Embeddings 0
The power of absolute discounting: all-dimensional distribution estimation 3
Thinking Fast and Slow with Deep Learning and Tree Search 3
Thy Friend is My Friend: Iterative Collaborative Filtering for Sparse Matrix Estimation 0
Time-dependent spatially varying graphical models, with application to brain fMRI data analysis 2
Tomography of the London Underground: a Scalable Model for Origin-Destination Data 2
Toward Goal-Driven Neural Network Models for the Rodent Whisker-Trigeminal System 3
Toward Multimodal Image-to-Image Translation 3
Toward Robustness against Label Noise in Training Deep Discriminative Neural Networks 4
Towards Accurate Binary Convolutional Neural Network 2
Towards Generalization and Simplicity in Continuous Control 3
Tractability in Structured Probability Spaces 1
Train longer, generalize better: closing the generalization gap in large batch training of neural networks 4
Training Deep Networks without Learning Rates Through Coin Betting 6
Training Quantized Nets: A Deeper Understanding 3
Training recurrent networks to generate hypotheses about how the brain solves hard navigation problems 2
Translation Synchronization via Truncated Least Squares 3
Triangle Generative Adversarial Networks 2
Trimmed Density Ratio Estimation 3
Triple Generative Adversarial Nets 5
Unbiased estimates for linear regression via volume sampling 1
Unbounded cache model for online language modeling with open vocabulary 2
Unified representation of tractography and diffusion-weighted MRI data using sparse multidimensional arrays 3
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning 2
Union of Intersections (UoI) for Interpretable Data Driven Discovery and Prediction 4
Universal Style Transfer via Feature Transforms 4
Universal consistency and minimax rates for online Mondrian Forests 2
Unsupervised Image-to-Image Translation Networks 4
Unsupervised Learning of Disentangled Representations from Video 3
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data 4
Unsupervised Sequence Classification using Sequential Output Statistics 2
Unsupervised Transformation Learning via Convex Relaxations 3
Unsupervised learning of object frames by dense equivariant image labelling 2
Uprooting and Rerooting Higher-Order Graphical Models 0
Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation 3
VAE Learning via Stein Variational Gradient Descent 3
VAIN: Attentional Multi-agent Predictive Modeling 3
VEEGAN: Reducing Mode Collapse in GANs using Implicit Variational Learning 4
Value Prediction Network 4
Variable Importance Using Decision Trees 2
Variance-based Regularization with Convex Objectives 4
Variational Inference for Gaussian Process Models with Linear Complexity 3
Variational Inference via $\chi$ Upper Bound Minimization 4
Variational Laws of Visual Attention for Dynamic Scenes 4
Variational Memory Addressing in Generative Models 3
Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net 4
Visual Interaction Networks: Learning a Physics Simulator from Video 3
Visual Reference Resolution using Attention Memory for Visual Dialog 3
Wasserstein Learning of Deep Generative Point Process Models 4
Welfare Guarantees from Data 1
What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? 4
When Cyclic Coordinate Descent Outperforms Randomized Coordinate Descent 1
When Worlds Collide: Integrating Different Counterfactual Assumptions in Fairness 2
Wider and Deeper, Cheaper and Faster: Tensorized LSTMs for Sequence Learning 3
Working hard to know your neighbor's margins: Local descriptor learning loss 4
YASS: Yet Another Spike Sorter 5
Z-Forcing: Training Stochastic Recurrent Networks 3
Zap Q-Learning 2
f-GANs in an Information Geometric Nutshell 2
k-Support and Ordered Weighted Sparsity for Overlapping Groups: Hardness and Algorithms 1