Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Conference on Neural Information Processing Systems (NeurIPS) - 2019

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
NeurIPS 2019 1428 0.52 3.31 89.64% 43.05%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
(Nearly) Efficient Algorithms for the Graph Matching Problem on Correlated Random Graphs 1
A Bayesian Theory of Conformity in Collective Decision Making 3
A Benchmark for Interpretability Methods in Deep Neural Networks 2
A Communication Efficient Stochastic Multi-Block Alternating Direction Method of Multipliers 4
A Composable Specification Language for Reinforcement Learning Tasks 3
A Condition Number for Joint Optimization of Cycle-Consistent Networks 2
A Convex Relaxation Barrier to Tight Robustness Verification of Neural Networks 4
A Debiased MDI Feature Importance Measure for Random Forests 4
A Direct tilde{O}(1/epsilon) Iteration Parallel Algorithm for Optimal Transport 3
A Domain Agnostic Measure for Monitoring and Evaluating GANs 3
A Family of Robust Stochastic Operators for Reinforcement Learning 1
A First-Order Algorithmic Framework for Distributionally Robust Logistic Regression 7
A Flexible Generative Framework for Graph-based Semi-supervised Learning 3
A Fourier Perspective on Model Robustness in Computer Vision 3
A Game Theoretic Approach to Class-wise Selective Rationalization 3
A General Framework for Symmetric Property Estimation 4
A General Theory of Equivariant CNNs on Homogeneous Spaces 0
A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation 4
A Generic Acceleration Framework for Stochastic Composite Optimization 4
A Geometric Perspective on Optimal Representations for Reinforcement Learning 3
A Graph Theoretic Additive Approximation of Optimal Transport 4
A Graph Theoretic Framework of Recomputation Algorithms for Memory-Efficient Backpropagation 4
A Kernel Loss for Solving the Bellman Equation 1
A Latent Variational Framework for Stochastic Optimization 0
A Linearly Convergent Method for Non-Smooth Non-Convex Optimization on the Grassmannian with Applications to Robust Subspace and Dictionary Learning 2
A Linearly Convergent Proximal Gradient Algorithm for Decentralized Optimization 1
A Little Is Enough: Circumventing Defenses For Distributed Learning 2
A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off 3
A Meta-Analysis of Overfitting in Machine Learning 2
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning 4
A Model to Search for Synthesizable Molecules 4
A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation 4
A Necessary and Sufficient Stability Notion for Adaptive Generalization 0
A New Defense Against Adversarial Images: Turning a Weakness into a Strength 4
A New Distribution on the Simplex with Auto-Encoding Applications 5
A New Perspective on Pool-Based Active Classification and False-Discovery Control 1
A Nonconvex Approach for Exact and Efficient Multichannel Sparse Blind Deconvolution 1
A Normative Theory for Causal Inference and Bayes Factor Computation in Neural Circuits 1
A Polynomial Time Algorithm for Log-Concave Maximum Likelihood via Locally Exponential Families 1
A Primal Dual Formulation For Deep Learning With Constraints 4
A Primal-Dual link between GANs and Autoencoders 0
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models 3
A Refined Margin Distribution Analysis for Forest Representation Learning 4
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning 3
A Robust Non-Clairvoyant Dynamic Mechanism for Contextual Auctions 1
A Self Validation Network for Object-Level Human Attention Estimation 3
A Similarity-preserving Network Trained on Transformed Images Recapitulates Salient Features of the Fly Motion Detection Circuit 0
A Simple Baseline for Bayesian Uncertainty in Deep Learning 5
A Solvable High-Dimensional Model of GAN 1
A Step Toward Quantifying Independently Reproducible Machine Learning Research 3
A Stochastic Composite Gradient Method with Incremental Variance Reduction 3
A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning 1
A Tensorized Transformer for Language Modeling 3
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment 2
A Unified Framework for Data Poisoning Attack to Graph-based Semi-supervised Learning 4
A Unifying Framework for Spectrum-Preserving Graph Sparsification and Coarsening 4
A Universally Optimal Multistage Accelerated Stochastic Gradient Method 3
A Zero-Positive Learning Approach for Diagnosing Software Performance Regressions 3
A coupled autoencoder approach for multi-modal analysis of cell types 4
A neurally plausible model for online recognition and postdiction in a dynamical environment 3
A neurally plausible model learns successor representations in partially observable environments 1
A state-space model for inferring effective connectivity of latent neural dynamics from simultaneous EEG/fMRI 2
A unified theory for the origin of grid cells through the lens of pattern formation 1
A unified variance-reduced accelerated gradient method for convex optimization 3
ADDIS: an adaptive discarding algorithm for online FDR control with conservative nulls 3
AGEM: Solving Linear Inverse Problems via Deep Priors and Sampling 5
ANODEV2: A Coupled Neural ODE Framework 2
Abstract Reasoning with Distracting Features 2
Abstraction based Output Range Analysis for Neural Networks 2
Accelerating Rescaled Gradient Descent: Fast Optimization of Smooth Functions 2
Acceleration via Symplectic Discretization of High-Resolution Differential Equations 0
Accurate Layerwise Interpretable Competence Estimation 3
Accurate Uncertainty Estimation and Decomposition in Ensemble Learning 0
Accurate, reliable and fast robustness evaluation 4
Adapting Neural Networks for the Estimation of Treatment Effects 4
Adaptive Auxiliary Task Weighting for Reinforcement Learning 3
Adaptive Cross-Modal Few-shot Learning 4
Adaptive Density Estimation for Generative Models 3
Adaptive GNN for Image Analysis and Editing 1
Adaptive Gradient-Based Meta-Learning Methods 3
Adaptive Influence Maximization with Myopic Feedback 1
Adaptive Sequence Submodularity 4
Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates 3
Adaptively Aligned Image Captioning via Adaptive Attention Time 4
Addressing Failure Prediction by Learning Model Confidence 4
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs 1
Adversarial Examples Are Not Bugs, They Are Features 2
Adversarial Fisher Vectors for Unsupervised Representation Learning 3
Adversarial Music: Real world Audio Adversary against Wake-word Detection System 4
Adversarial Robustness through Local Linearization 4
Adversarial Self-Defense for Cycle-Consistent GANs 1
Adversarial Training and Robustness for Multiple Perturbations 4
Adversarial training for free! 6
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks 0
Algorithmic Analysis and Statistical Estimation of SLOPE via Approximate Message Passing 2
Algorithmic Guarantees for Inverse Imaging with Untrained Network Priors 3
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations 4
Alleviating Label Switching with Optimal Transport 2
Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model 1
Amortized Bethe Free Energy Minimization for Learning MRFs 6
An Inexact Augmented Lagrangian Framework for Nonconvex Optimization with Nonlinear Constraints 3
An Accelerated Decentralized Stochastic Proximal Algorithm for Finite Sums 4
An Adaptive Empirical Bayesian Method for Sparse Deep Learning 3
An Algorithm to Learn Polytree Networks with Hidden Nodes 1
An Algorithmic Framework For Differentially Private Data Analysis on Trusted Processors 1
An Embedding Framework for Consistent Polyhedral Surrogates 0
An Improved Analysis of Training Over-parameterized Deep Neural Networks 1
An adaptive Mirror-Prox method for variational inequalities with singular operators 2
An adaptive nearest neighbor rule for classification 3
Anti-efficient encoding in emergent communication 2
Approximate Bayesian Inference for a Mechanistic Model of Vesicle Release at a Ribbon Synapse 2
Approximate Feature Collisions in Neural Nets 1
Approximate Inference Turns Deep Networks into Gaussian Processes 3
Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems 3
Approximating the Permanent by Sampling from Adaptive Partitions 2
Approximation Ratios of Graph Neural Networks for Combinatorial Problems 1
Arbicon-Net: Arbitrary Continuous Geometric Transformation Networks for Image Registration 4
Are Anchor Points Really Indispensable in Label-Noise Learning? 5
Are Disentangled Representations Helpful for Abstract Visual Reasoning? 5
Are Labels Required for Improving Adversarial Robustness? 5
Are Sixteen Heads Really Better than One? 4
Are deep ResNets provably better than linear predictors? 1
Are sample means in multi-armed bandits positively or negatively biased? 1
Ask not what AI can do, but what AI should do: Towards a framework of task delegability 1
Assessing Disparate Impact of Personalized Interventions: Identifiability and Bounds 2
Assessing Social and Intersectional Biases in Contextualized Word Representations 2
Asymmetric Valleys: Beyond Sharp and Flat Local Minima 2
Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance 3
Asymptotics for Sketching in Least Squares Regression 1
AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification 5
Attentive State-Space Modeling of Disease Progression 4
Attribution-Based Confidence Metric For Deep Neural Networks 3
Augmented Neural ODEs 2
AutoAssist: A Framework to Accelerate Training of Deep Neural Networks 5
AutoPrune: Automatic Network Pruning by Regularizing Auxiliary Parameters 5
Average Case Column Subset Selection for Entrywise $\ell_1$-Norm Loss 3
Average Individual Fairness: Algorithms, Generalization and Experiments 3
Average-Case Averages: Private Algorithms for Smooth Sensitivity and Mean Estimation 2
BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling 3
Backprop with Approximate Activations for Memory-efficient Network Training 5
Backpropagation-Friendly Eigendecomposition 5
Balancing Efficiency and Fairness in On-Demand Ridesourcing 3
Band-Limited Gaussian Processes: The Sinc Kernel 3
Bandits with Feedback Graphs and Switching Costs 1
Bat-G net: Bat-inspired High-Resolution 3D Image Reconstruction using Ultrasonic Echoes 3
BatchBALD: Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning 5
Batched Multi-armed Bandits Problem 3
Bayesian Batch Active Learning as Sparse Subset Approximation 4
Bayesian Joint Estimation of Multiple Graphical Models 3
Bayesian Layers: A Module for Neural Network Uncertainty 4
Bayesian Learning of Sum-Product Networks 3
Bayesian Optimization under Heavy-tailed Payoffs 3
Bayesian Optimization with Unknown Search Space 6
Beating SGD Saturation with Tail-Averaging and Minibatching 1
BehaveNet: nonlinear embedding and Bayesian neural decoding of behavioral videos 4
Better Exploration with Optimistic Actor Critic 3
Better Transfer Learning with Inferred Successor Maps 2
Beyond Alternating Updates for Matrix Factorization with Inertial Bregman Proximal Gradient Algorithms 4
Beyond Confidence Regions: Tight Bayesian Ambiguity Sets for Robust MDPs 3
Beyond Online Balanced Descent: An Optimal Algorithm for Smoothed Online Optimization 1
Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs 3
Beyond temperature scaling: Obtaining well-calibrated multi-class probabilities with Dirichlet calibration 4
Beyond the Single Neuron Convex Barrier for Neural Network Certification 4
Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting 2
Biases for Emergent Communication in Multi-agent Reinforcement Learning 2
Bipartite expander Hopfield networks as self-decoding high-capacity error correcting codes 3
Blended Matching Pursuit 3
Blind Super-Resolution Kernel Estimation using an Internal-GAN 4
Block Coordinate Regularization by Denoising 5
Blocking Bandits 3
Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion 5
Bootstrapping Upper Confidence Bound 2
Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs 6
Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks 5
Breaking the Glass Ceiling for Embedding-Based Classifiers for Large Output Spaces 3
Bridging Machine Learning and Logical Reasoning by Abductive Learning 3
Budgeted Reinforcement Learning in Continuous State Space 3
CNN^{2}: Viewpoint Generalization via a Binocular Vision 3
CPM-Nets: Cross Partial Multi-View Networks 4
CXPlain: Causal Explanations for Model Interpretation under Uncertainty 4
Calculating Optimistic Likelihoods Using (Geodesically) Convex Optimization 5
Calibration tests in multi-class classification: A unifying framework 3
Can SGD Learn Recurrent Neural Networks with Provable Generalization? 0
Can Unconditional Language Models Recover Arbitrary Sentences? 4
Can you trust your model's uncertainty? Evaluating predictive uncertainty under dataset shift 3
Capacity Bounded Differential Privacy 0
Cascade RPN: Delving into High-Quality Region Proposal Network with Adaptive Convolution 6
Cascaded Dilated Dense Network with Two-step Data Consistency for MRI Reconstruction 5
Catastrophic Forgetting Meets Negative Transfer: Batch Spectral Shrinkage for Safe Transfer Learning 3
Categorized Bandits 2
Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation 6
Causal Confusion in Imitation Learning 2
Causal Regularization 4
Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback 2
Certainty Equivalence is Efficient for Linear Quadratic Control 0
Certifiable Robustness to Graph Perturbations 5
Certified Adversarial Robustness with Additive Noise 4
Certifying Geometric Robustness of Neural Networks 5
Channel Gating Neural Networks 3
Characterization and Learning of Causal Graphs with Latent Variables from Soft Interventions 1
Characterizing Bias in Classifiers using Generative Models 3
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory 0
Chasing Ghosts: Instruction Following as Bayesian State Tracking 3
Chirality Nets for Human Pose Regression 4
Classification Accuracy Score for Conditional Generative Models 5
Classification-by-Components: Probabilistic Modeling of Reasoning over a Set of Components 5
Co-Generation with GANs using AIS based HMC 4
Coda: An End-to-End Neural Program Decompiler 2
Code Generation as a Dual Task of Code Summarization 4
Cold Case: The Lost MNIST Digits 4
Combinatorial Bandits with Relative Feedback 1
Combinatorial Bayesian Optimization using the Graph Cartesian Product 5
Combinatorial Inference against Label Noise 3
Combining Generative and Discriminative Models for Hybrid Inference 4
Communication trade-offs for Local-SGD with large step size 1
Communication-Efficient Distributed Blockwise Momentum SGD with Error-Feedback 4
Communication-Efficient Distributed Learning via Lazily Aggregated Quantized Gradients 3
Communication-efficient Distributed SGD with Sketching 4
Compacting, Picking and Growing for Unforgetting Continual Learning 3
Comparing Unsupervised Word Translation Methods Step by Step 3
Comparing distributions: $\ell_1$ geometry improves kernel two-sample testing 3
Comparison Against Task Driven Artificial Neural Networks Reveals Functional Properties in Mouse Visual Cortex 2
Competitive Gradient Descent 2
Compiler Auto-Vectorization with Imitation Learning 4
Complexity of Highly Parallel Non-Smooth Convex Optimization 1
Compositional De-Attention Networks 3
Compositional Plan Vectors 3
Compositional generalization through meta sequence-to-sequence learning 4
Compression with Flows via Local Bits-Back Coding 5
Computational Mirrors: Blind Inverse Light Transport by Deep Matrix Factorization 2
Computational Separations between Sampling and Optimization 0
Computing Full Conformal Prediction Set with Approximate Homotopy 5
Computing Linear Restrictions of Neural Networks 2
Concentration of risk measures: A Wasserstein distance approach 0
CondConv: Conditionally Parameterized Convolutions for Efficient Inference 4
Conditional Independence Testing using Generative Adversarial Networks 4
Conditional Structure Generation through Graph Variational Generative Adversarial Nets 2
Conformal Prediction Under Covariate Shift 4
Conformalized Quantile Regression 5
Connections Between Mirror Descent, Thompson Sampling and the Information Ratio 2
Connective Cognition Network for Directional Visual Commonsense Reasoning 4
Consistency-based Semi-supervised Learning for Object detection 3
Constrained Reinforcement Learning Has Zero Duality Gap 1
Constrained deep neural network architecture search for IoT devices accounting for hardware calibration 4
Constraint-based Causal Structure Learning with Consistent Separating Sets 3
Contextual Bandits with Cross-Learning 3
Continual Unsupervised Representation Learning 2
Continuous Hierarchical Representations with Poincaré Variational Auto-Encoders 4
Continuous-time Models for Stochastic Optimization Algorithms 1
Control Batch Size and Learning Rate to Generalize Well: Theoretical and Empirical Evidence 3
Control What You Can: Intrinsically Motivated Task-Planning Agent 3
Controllable Text-to-Image Generation 4
Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation 6
Controlling Neural Level Sets 2
Convergence Guarantees for Adaptive Bayesian Quadrature Methods 0
Convergence of Adversarial Training in Overparametrized Neural Networks 0
Convergence-Rate-Matching Discretization of Accelerated Optimization Flows Through Opportunistic State-Triggered Control 2
Convergent Policy Optimization for Safe Reinforcement Learning 3
Convolution with even-sized kernels and symmetric padding 4
Coordinated hippocampal-entorhinal replay as structural inference 1
Copula Multi-label Learning 3
Copula-like Variational Inference 4
Copulas as High-Dimensional Generative Models: Vine Copula Autoencoders 6
Coresets for Archetypal Analysis 5
Coresets for Clustering with Fairness Constraints 6
Cormorant: Covariant Molecular Neural Networks 3
Correlated Uncertainty for Learning Dense Correspondences from Noisy Labels 3
Correlation Clustering with Adaptive Similarity Queries 3
Correlation Priors for Reinforcement Learning 2
Correlation clustering with local objectives 1
Correlation in Extensive-Form Games: Saddle-Point Formulation and Benchmarks 3
Cost Effective Active Search 3
Counting the Optimal Solutions in Graphical Models 3
Covariate-Powered Empirical Bayes Estimation 4
Cross Attention Network for Few-shot Classification 5
Cross-Domain Transferability of Adversarial Perturbations 5
Cross-Modal Learning with Adversarial Samples 6
Cross-channel Communication Networks 3
Cross-lingual Language Model Pretraining 4
Cross-sectional Learning of Extremal Dependence among Financial Assets 2
Crowdsourcing via Pairwise Co-occurrences: Identifiability and Algorithms 3
Curriculum-guided Hindsight Experience Replay 4
Curvilinear Distance Metric Learning 3
D-VAE: A Variational Autoencoder for Directed Acyclic Graphs 2
DAC: The Double Actor-Critic Architecture for Learning Options 4
DATA: Differentiable ArchiTecture Approximation 4
DETOX: A Redundancy-based Framework for Faster and More Robust Gradient Aggregation 5
DFNets: Spectral CNNs for Graphs with Feedback-Looped Filters 4
DINGO: Distributed Newton-Type Method for Gradient-Norm Optimization 4
DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction 4
DM2C: Deep Mixed-Modal Clustering 5
DRUM: End-To-End Differentiable Rule Mining On Knowledge Graphs 5
DTWNet: a Dynamic Time Warping Network 4
Dancing to Music 3
Data Cleansing for Models Trained with SGD 7
Data Parameters: A New Family of Parameters for Learning a Differentiable Curriculum 4
Data-Dependence of Plateau Phenomenon in Learning with Neural Network --- Statistical Mechanical Analysis 2
Data-dependent Sample Complexity of Deep Neural Networks via Lipschitz Augmentation 2
Data-driven Estimation of Sinusoid Frequencies 3
Debiased Bayesian inference for average treatment effects 3
Decentralized Cooperative Stochastic Bandits 2
Decentralized sketching of low rank matrices 2
Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask 2
Deep Active Learning with a Neural Architecture Search 5
Deep Equilibrium Models 3
Deep Gamblers: Learning to Abstain with Portfolio Theory 3
Deep Generalized Method of Moments for Instrumental Variable Analysis 4
Deep Generative Video Compression 2
Deep Leakage from Gradients 3
Deep Learning without Weight Transport 3
Deep Model Transferability from Attribution Maps 3
Deep Multi-State Dynamic Recurrent Neural Networks Operating on Wavelet Based Neural Features for Robust Brain Machine Interfaces 3
Deep Multimodal Multilinear Fusion with High-order Polynomial Pooling 3
Deep RGB-D Canonical Correlation Analysis For Sparse Depth Completion 3
Deep Random Splines for Point Process Intensity Estimation of Neural Population Data 3
Deep ReLU Networks Have Surprisingly Few Activation Patterns 2
Deep Scale-spaces: Equivariance Over Scale 3
Deep Set Prediction Networks 5
Deep Signature Transforms 2
Deep Structured Prediction for Facial Landmark Detection 3
Deep Supervised Summarization: Algorithm and Application to Learning Instructions 4
Deep imitation learning for molecular inverse problems 3
DeepUSPS: Deep Robust Unsupervised Saliency Prediction via Self-supervision 5
DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging 5
Defending Against Neural Fake News 4
Defending Neural Backdoors via Generative Distribution Modeling 5
Defense Against Adversarial Attacks Using Feature Scattering-based Adversarial Training 4
Deliberative Explanations: visualizing network insecurities 3
Demystifying Black-box Models with Symbolic Metamodels 4
Depth-First Proof-Number Search with Heuristic Edge Cost and Application to Chemical Synthesis Planning 4
DetNAS: Backbone Search for Object Detection 6
Detecting Overfitting via Adversarial Examples 2
Devign: Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks 4
Dichotomize and Generalize: PAC-Bayesian Binary Activated Deep Neural Networks 4
Diffeomorphic Temporal Alignment Nets 5
Differentiable Cloth Simulation for Inverse Problems 4
Differentiable Convex Optimization Layers 4
Differentiable Ranking and Sorting using Optimal Transport 3
Differential Privacy Has Disparate Impact on Model Accuracy 4
Differentially Private Algorithms for Learning Mixtures of Separated Gaussians 1
Differentially Private Anonymized Histograms 1
Differentially Private Bagging: Improved utility and cheaper privacy than subsample-and-aggregate 5
Differentially Private Bayesian Linear Regression 3
Differentially Private Covariance Estimation 3
Differentially Private Distributed Data Summarization under Covariate Shift 4
Differentially Private Markov Chain Monte Carlo 3
Diffusion Improves Graph Learning 4
Dimension-Free Bounds for Low-Precision Training 3
Dimensionality reduction: theoretical perspective on practical measures 1
Direct Estimation of Differential Functional Graphical Models 5
Direct Optimization through $\arg \max$ for Discrete Variational Auto-Encoder 4
Discovering Neural Wirings 5
Discovery of Useful Questions as Auxiliary Tasks 3
Discrete Flows: Invertible Generative Models of Discrete Data 4
Discrete Object Generation with Reversible Inductive Construction 3
Discrimination in Online Markets: Effects of Social Bias on Learning from Reviews and Policy Design 1
Discriminative Topic Modeling with Logistic LDA 5
Discriminator optimal transport 4
Disentangled behavioural representations 2
Disentangling Influence: Using disentangled representations to audit model predictions 4
DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node 4
Distinguishing Distributions When Samples Are Strategically Transformed 0
Distributed Low-rank Matrix Factorization With Exact Consensus 0
Distributed estimation of the inverse Hessian by determinantal averaging 1
Distribution Learning of a Random Spatial Field with a Location-Unaware Mobile Sensor 2
Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards 3
Distribution-Independent PAC Learning of Halfspaces with Massart Noise 1
Distributional Policy Optimization: An Alternative Approach for Continuous Control 4
Distributional Reward Decomposition for Reinforcement Learning 3
Distributionally Robust Optimization and Generalization in Kernel Methods 0
Divergence-Augmented Policy Optimization 3
Divide and Couple: Using Monte Carlo Variational Objectives for Posterior Approximation 3
Domain Generalization via Model-Agnostic Learning of Semantic Features 6
Domes to Drones: Self-Supervised Active Triangulation for 3D Human Pose Reconstruction 3
Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse 3
Don't take it lightly: Phasing optical random projections with unknown operators 4
Double Quantization for Communication-Efficient Distributed Optimization 4
Doubly-Robust Lasso Bandit 3
DppNet: Approximating Determinantal Point Processes with Deep Networks 3
Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries 4
Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning 4
Dual Variational Generation for Low Shot Heterogeneous Face Recognition 2
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections 2
Dying Experts: Efficient Algorithms with Optimal Regret Bounds 1
Dynamic Ensemble Modeling Approach to Nonstationary Neural Decoding in Brain-Computer Interfaces 3
Dynamic Incentive-Aware Learning: Robust Pricing in Contextual Auctions 1
Dynamic Local Regret for Non-convex Online Forecasting 4
Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup 3
E2-Train: Training State-of-the-art CNNs with Over 80% Energy Savings 3
ETNet: Error Transition Network for Arbitrary Style Transfer 2
Ease-of-Teaching and Language Structure from Emergent Communication 1
Effective End-to-end Unsupervised Outlier Detection via Inlier Priority of Discriminative Network 3
Efficient Algorithms for Smooth Minimax Optimization 3
Efficient Approximation of Deep ReLU Networks for Functions on Low Dimensional Manifolds 0
Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control 4
Efficient Convex Relaxations for Streaming PCA 3
Efficient Deep Approximation of GMMs 0
Efficient Forward Architecture Search 5
Efficient Graph Generation with Graph Recurrent Attention Networks 5
Efficient Identification in Linear Structural Causal Models with Instrumental Cutsets 2
Efficient Meta Learning via Minibatch Proximal Update 5
Efficient Near-Optimal Testing of Community Changes in Balanced Stochastic Block Models 3
Efficient Neural Architecture Transformation Search in Channel-Level for Object Detection 4
Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model 3
Efficient Pure Exploration in Adaptive Round model 2
Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium 3
Efficient Rematerialization for Deep Networks 3
Efficient Smooth Non-Convex Stochastic Compositional Optimization via Stochastic Recursive Gradient Descent 4
Efficient Symmetric Norm Regression via Linear Sketching 1
Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks 3
Efficient and Thrifty Voting by Any Means Necessary 1
Efficient characterization of electrically evoked responses for neural interfaces 2
Efficient online learning with kernels for adversarial large scale problems 4
Efficiently Estimating Erdos-Renyi Graphs with Node Differential Privacy 1
Efficiently Learning Fourier Sparse Set Functions 2
Efficiently avoiding saddle points with zero order methods: No gradients required 2
Efficiently escaping saddle points on manifolds 1
Elliptical Perturbations for Differential Privacy 0
Embedding Symbolic Knowledge into Deep Networks 3
Emergence of Object Segmentation in Perturbed Generative Models 2
Empirically Measuring Concentration: Fundamental Limits on Intrinsic Robustness 4
Enabling hyperparameter optimization in sequential autoencoders for spiking neural data 3
End to end learning and optimization on graphs 4
End-to-End Learning on 3D Protein Structure for Interface Prediction 5
Energy-Inspired Models: Learning with Sampler-Induced Distributions 3
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting 3
Envy-Free Classification 3
Episodic Memory in Lifelong Language Learning 5
Epsilon-Best-Arm Identification in Pay-Per-Reward Multi-Armed Bandits 1
Equal Opportunity in Online Classification with Partial Feedback 1
Equipping Experts/Bandits with Long-term Memory 1
Equitable Stable Matchings in Quadratic Time 5
Error Correcting Output Codes Improve Probability Estimation and Adversarial Robustness of Deep Neural Networks 2
Escaping from saddle points on Riemannian manifolds 2
Estimating Convergence of Markov chains with L-Lag Couplings 5
Estimating Entropy of Distributions in Constant Space 1
Evaluating Protein Transfer Learning with TAPE 5
Exact Combinatorial Optimization with Graph Convolutional Neural Networks 4
Exact Gaussian Processes on a Million Data Points 5
Exact Rate-Distortion in Autoencoders via Echo Noise 2
Exact inference in structured prediction 0
Exact sampling of determinantal point processes with sublinear time preprocessing 5
Experience Replay for Continual Learning 1
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets 3
Explanations can be manipulated and geometry is to blame 4
Explicit Disentanglement of Appearance and Perspective in Generative Models 3
Explicit Explore-Exploit Algorithms in Continuous State Spaces 4
Explicit Planning for Efficient Exploration in Reinforcement Learning 1
Explicitly disentangling image content from translation and rotation with spatial-VAE 3
Exploiting Local and Global Structure for Point Cloud Semantic Segmentation with Contextual Point Representations 4
Exploration Bonus for Regret Minimization in Discrete and Continuous Average Reward MDPs 3
Exploration via Hindsight Goal Generation 5
Exploring Algorithmic Fairness in Robust Graph Covering Problems 3
Exploring Unexplored Tensor Network Decompositions for Convolutional Neural Networks 3
Exponential Family Estimation via Adversarial Dynamics Embedding 4
Exponentially convergent stochastic k-PCA without variance reduction 4
Expressive power of tensor-network factorizations for probabilistic modeling 2
Extending Stein's unbiased risk estimator to train deep denoisers with correlated pairs of noisy images 4
Extreme Classification in Log Memory using Count-Min Sketch: A Case Study of Amazon Search with 50M Products 5
Face Reconstruction from Voice using Generative Adversarial Networks 5
Facility Location Problem in Differential Privacy Model Revisited 1
Factor Group-Sparse Regularization for Efficient Low-Rank Matrix Recovery 3
Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift 4
Fair Algorithms for Clustering 5
Fast AutoAugment 6
Fast Convergence of Belief Propagation to Global Optima: Beyond Correlation Decay 0
Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks 0
Fast Decomposable Submodular Function Minimization using Constrained Total Variation 2
Fast Efficient Hyperparameter Tuning for Policy Gradient Methods 4
Fast Low-rank Metric Learning for Large-scale and High-dimensional Data 6
Fast Parallel Algorithms for Statistical Subset Selection Problems 2
Fast Sparse Group Lasso 5
Fast Structured Decoding for Sequence Models 5
Fast and Accurate Least-Mean-Squares Solvers 6
Fast and Accurate Stochastic Gradient Estimation 3
Fast and Flexible Multi-Task Classification using Conditional Neural Adaptive Processes 6
Fast and Furious Learning in Zero-Sum Games: Vanishing Regret with Non-Vanishing Step Sizes 2
Fast and Provable ADMM for Learning with Generative Priors 3
Fast structure learning with modular regularization 6
Fast, Provably convergent IRLS Algorithm for p-norm Linear Regression 6
Fast-rate PAC-Bayes Generalization Bounds via Shifted Rademacher Processes 0
FastSpeech: Fast, Robust and Controllable Text to Speech 4
Faster Boosting with Smaller Memory 5
Faster width-dependent algorithm for mixed packing and covering LPs 1
Few-shot Video-to-Video Synthesis 5
Finding Friend and Foe in Multi-Agent Games 3
Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias 3
Fine-grained Optimization of Deep Neural Networks 1
Finite-Sample Analysis for SARSA with Linear Function Approximation 1
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning 3
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator 2
First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise 3
First Order Motion Model for Image Animation 3
First order expansion of convex regularized estimators 0
First-order methods almost always avoid saddle points: The case of vanishing step-sizes 0
Fisher Efficient Inference of Intractable Models 3
Fixing Implicit Derivatives: Trust-Region Based Learning of Continuous Energy Functions 4
Fixing the train-test resolution discrepancy 5
Flattening a Hierarchical Clustering through Active Learning 3
Flexible Modeling of Diversity with Strongly Log-Concave Distributions 2
Flexible information routing in neural populations through stochastic comodulation 1
Flow-based Image-to-Image Translation with Feature Disentanglement 6
Focused Quantization for Sparse CNNs 4
Fooling Neural Network Interpretations via Adversarial Model Manipulation 4
Foundations of Comparison-Based Hierarchical Clustering 4
FreeAnchor: Learning to Match Anchors for Visual Object Detection 6
From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization 3
From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction 1
From voxels to pixels and back: Self-supervision in natural-image reconstruction from fMRI 4
Full-Gradient Representation for Neural Network Visualization 2
Fully Dynamic Consistent Facility Location 4
Fully Neural Network based Model for General Temporal Point Processes 4
Fully Parameterized Quantile Function for Distributional Reinforcement Learning 4
Function-Space Distributions over Kernels 5
Functional Adversarial Attacks 4
G2SAT: Learning to Generate SAT Formulas 4
GENO -- GENeric Optimization for Classical Machine Learning 6
GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs 4
GNNExplainer: Generating Explanations for Graph Neural Networks 3
GOT: An Optimal Transport framework for Graph comparison 4
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism 4
GRU-ODE-Bayes: Continuous Modeling of Sporadically-Observed Time Series 6
Game Design for Eliciting Distinguishable Behavior 2
Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks 4
Gaussian-Based Pooling for Convolutional Neural Networks 5
General E(2)-Equivariant Steerable CNNs 2
General Proximal Incremental Aggregated Gradient Algorithms: Better and Novel Results under General Scheme 2
Generalization Bounds for Neural Networks via Approximate Description Length 0
Generalization Bounds in the Predict-then-Optimize Framework 0
Generalization Bounds of Stochastic Gradient Descent for Wide and Deep Neural Networks 1
Generalization Error Analysis of Quantized Compressive Learning 2
Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection 2
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck 3
Generalization in multitask deep neural classifiers: a statistical physics approach 1
Generalization of Reinforcement Learners with Working and Episodic Memory 2
Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer 2
Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs 3
Generalized Off-Policy Actor-Critic 3
Generalized Sliced Wasserstein Distances 2
Generating Diverse High-Fidelity Images with VQ-VAE-2 4
Generative Modeling by Estimating Gradients of the Data Distribution 3
Generative Models for Graph-Based Protein Design 6
Generative Well-intentioned Networks 5
Geometry-Aware Neural Rendering 3
Global Convergence of Gradient Descent for Deep Linear Residual Networks 2
Global Convergence of Least Squares EM for Demixing Two Log-Concave Densities 0
Global Guarantees for Blind Demodulation with Generative Priors 3
Global Sparse Momentum SGD for Pruning Very Deep Neural Networks 4
Globally Convergent Newton Methods for Ill-conditioned Generalized Self-concordant Losses 6
Globally Optimal Learning for Structured Elliptical Losses 3
Globally optimal score-based learning of directed acyclic graphs in high-dimensions 0
Glyce: Glyph-vectors for Chinese Character Representations 3
Goal-conditioned Imitation Learning 3
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning 5
Gradient Dynamics of Shallow Univariate ReLU Networks 1
Gradient Information for Representation and Modeling 2
Gradient based sample selection for online continual learning 4
Gradient-based Adaptive Markov Chain Monte Carlo 4
Graph Agreement Models for Semi-Supervised Learning 5
Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels 4
Graph Normalizing Flows 2
Graph Structured Prediction Energy Networks 5
Graph Transformer Networks 4
Graph-Based Semi-Supervised Learning with Non-ignorable Non-response 4
Graph-based Discriminators: Sample Complexity and Expressiveness 0
Greedy Sampling for Approximate Clustering in the Presence of Outliers 3
Grid Saliency for Context Explanations of Semantic Segmentation 2
Group Retention when Using Machine Learning in Sequential Decision Making: the Interplay between User Dynamics and Fairness 1
Guided Meta-Policy Search 3
Guided Similarity Separation for Image Retrieval 4
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models 2
Hamiltonian Neural Networks 3
Hamiltonian descent for composite objectives 1
Handling correlated and repeated measurements with the smoothed multivariate square-root Lasso 4
Heterogeneous Graph Learning for Visual Commonsense Reasoning 5
Hierarchical Decision Making by Generating and Following Natural Language Instructions 3
Hierarchical Optimal Transport for Document Representation 6
Hierarchical Optimal Transport for Multimodal Distribution Alignment 3
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards 3
High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks 3
High-Dimensional Optimization in Adaptive Random Subspaces 3
High-Quality Self-Supervised Deep Image Denoising 1
High-dimensional multivariate forecasting with low-rank Gaussian Copula Processes 4
Hindsight Credit Assignment 2
How degenerate is the parametrization of neural networks with the ReLU activation function? 0
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets 4
Hybrid 8-bit Floating Point (HFP8) Training and Inference for Deep Neural Networks 2
Hyper-Graph-Network Decoders for Block Codes 2
HyperGCN: A New Method For Training Graph Convolutional Networks on Hypergraphs 5
Hyperbolic Graph Convolutional Neural Networks 4
Hyperbolic Graph Neural Networks 4
Hyperparameter Learning via Distributional Transfer 4
Hyperspherical Prototype Networks 3
Hypothesis Set Stability and Generalization 0
Icebreaker: Element-wise Efficient Information Acquisition with a Bayesian Deep Latent Gaussian Model 3
Identification of Conditional Causal Effects under Markov Equivalence 1
Identifying Causal Effects via Context-specific Independence Relations 4
Image Captioning: Transforming Objects into Words 5
Image Synthesis with a Single (Robust) Classifier 2
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement 2
Imitation-Projected Programmatic Reinforcement Learning 4
Implicit Generation and Modeling with Energy Based Models 4
Implicit Posterior Variational Inference for Deep Gaussian Processes 4
Implicit Regularization for Optimal Sparse Recovery 3
Implicit Regularization in Deep Matrix Factorization 2
Implicit Regularization of Accelerated Methods in Hilbert Spaces 3
Implicit Regularization of Discrete Gradient Dynamics in Linear Neural Networks 2
Implicit Semantic Data Augmentation for Deep Networks 5
Implicitly learning to reason in first-order logic 1
Importance Resampling for Off-policy Prediction 2
Importance Weighted Hierarchical Variational Inference 4
Improved Precision and Recall Metric for Assessing Generative Models 4
Improved Regret Bounds for Bandit Combinatorial Optimization 1
Improving Black-box Adversarial Attacks with a Transfer-based Prior 5
Improving Textual Network Learning with Variational Homophilic Embeddings 2
In-Place Zero-Space Memory Protection for CNN 4
Incremental Few-Shot Learning with Attention Attractor Networks 5
Incremental Scene Synthesis 3
Individual Regret in Cooperative Nonstochastic Multi-Armed Bandits 1
Inducing brain-relevant bias in natural language processing models 4
Information Competing Process for Learning Diversified Representations 5
Information-Theoretic Confidence Bounds for Reinforcement Learning 1
Information-Theoretic Generalization Bounds for SGLD via Data-Dependent Estimates 2
Infra-slow brain dynamics as a marker for cognitive function and decline 3
Inherent Tradeoffs in Learning Fair Representations 2
Inherent Weight Normalization in Stochastic Neural Networks 3
Initialization of ReLUs for Dynamical Isometry 3
Input Similarity from the Neural Network Perspective 2
Input-Cell Attention Reduces Vanishing Saliency of Recurrent Neural Networks 2
Input-Output Equivalence of Unitary and Contractive RNNs 1
Integer Discrete Flows and Lossless Compression 2
Integrating Bayesian and Discriminative Sparse Kernel Machines for Multi-class Active Learning 4
Integrating Markov processes with structural causal modeling enables counterfactual inference in complex systems 3
Interaction Hard Thresholding: Consistent Sparse Quadratic Regression in Sub-quadratic Time and Space 2
Interior-Point Methods Strike Back: Solving the Wasserstein Barycenter Problem 5
Interlaced Greedy Algorithm for Maximization of Submodular Functions in Nearly Linear Time 4
Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain) 4
Interval timing in deep reinforcement learning agents 2
Intrinsic dimension of data representations in deep neural networks 4
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning 3
Invariance and identifiability issues for word embeddings 1
Invariance-inducing regularization using worst-case transformations suffices to boost accuracy and spatial robustness 3
Invert to Learn to Invert 4
Invertible Convolutional Flow 2
Inverting Deep Generative models, One layer at a time 4
Is Deeper Better only when Shallow is Good? 2
Iterative Least Trimmed Squares for Mixed Linear Regression 1
Joint Optimization of Tree-based Index and Deep Model for Recommender Systems 4
Joint-task Self-supervised Learning for Temporal Correspondence 4
KNG: The K-Norm Gradient Mechanism 2
Kalman Filter, Sensor Fusion, and Constrained Regression: Equivalences and Insights 4
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards 2
KerGM: Kernelized Graph Matching 4
Kernel Instrumental Variable Regression 2
Kernel Stein Tests for Multiple Model Comparison 4
Kernel Truncated Randomized Ridge Regression: Optimal Rates and Low Noise Acceleration 2
Kernel quadrature with DPPs 1
Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods 5
Kernelized Bayesian Softmax for Text Generation 4
Knowledge Extraction with No Observable Data 4
LCA: Loss Change Allocation for Neural Network Training 3
LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning 5
L_DMI: A Novel Information-theoretic Loss Function for Training Deep Nets Robust to Label Noise 5
Landmark Ordinal Embedding 3
Language as an Abstraction for Hierarchical Deep Reinforcement Learning 5
Large Memory Layers with Product Keys 5
Large Scale Adversarial Representation Learning 4
Large Scale Markov Decision Processes with Changing Rewards 1
Large Scale Structure of Neural Network Loss Landscapes 2
Large-scale optimal transport map estimation using projection pursuit 5
Latent Ordinary Differential Equations for Irregularly-Sampled Time Series 3
Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization 5
Latent distance estimation for random geometric graphs 2
Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks 5
Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models 6
Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge 3
Learnable Tree Filter for Structure-preserving Feature Transform 5
Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints 2
Learning Auctions with Robust Incentive Guarantees 1
Learning Bayesian Networks with Low Rank Conditional Probability Tables 1
Learning Compositional Neural Programs with Recursive Tree Search and Planning 3
Learning Conditional Deformable Templates with Convolutional Networks 5
Learning Data Manipulation for Augmentation and Weighting 6
Learning Deep Bilinear Transformation for Fine-grained Image Representation 4
Learning Deterministic Weighted Automata with Queries and Counterexamples 3
Learning Disentangled Representation for Robust Person Re-identification 4
Learning Disentangled Representations for Recommendation 3
Learning Distributions Generated by One-Layer ReLU Networks 3
Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning 4
Learning Erdos-Renyi Random Graphs via Edge Detecting Queries 3
Learning Fairness in Multi-Agent Systems 3
Learning GANs and Ensembles Using Discrepancy 3
Learning Generalizable Device Placement Algorithms for Distributed Machine Learning 4
Learning Hawkes Processes from a handful of events 4
Learning Hierarchical Priors in VAEs 3
Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss 4
Learning Latent Process from High-Dimensional Event Sequences via Efficient Sampling 4
Learning Local Search Heuristics for Boolean Satisfiability 2
Learning Macroscopic Brain Connectomes via Group-Sparse Factorization 4
Learning Mean-Field Games 2
Learning Mixtures of Plackett-Luce Models from Structured Partial Orders 4
Learning Multiple Markov Chains via Adaptive Allocation 1
Learning Nearest Neighbor Graphs from Noisy Distance Samples 4
Learning Neural Networks with Adaptive Regularization 4
Learning New Tricks From Old Dogs: Multi-Source Transfer Learning From Pre-Trained Networks 4
Learning Non-Convergent Non-Persistent Short-Run MCMC Toward Energy-Based Model 4
Learning Nonsymmetric Determinantal Point Processes 5
Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds 6
Learning Perceptual Inference by Contrasting 3
Learning Positive Functions with Pseudo Mirror Descent 3
Learning Representations by Maximizing Mutual Information Across Views 5
Learning Representations for Time Series Clustering 4
Learning Reward Machines for Partially Observable Reinforcement Learning 4
Learning Robust Global Representations by Penalizing Local Predictive Power 4
Learning Robust Options by Conditional Value at Risk Optimization 4
Learning Sample-Specific Models with Low-Rank Personalized Regression 5
Learning Sparse Distributions using Iterative Hard Thresholding 3
Learning Stable Deep Dynamics Models 1
Learning Temporal Pose Estimation from Sparsely-Labeled Videos 5
Learning Transferable Graph Exploration 3
Learning about an exponential amount of conditional distributions 3
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers 2
Learning by Abstraction: The Neural State Machine 2
Learning dynamic polynomial proofs 1
Learning elementary structures for 3D shape generation and matching 5
Learning from Bad Data via Generation 4
Learning from Label Proportions with Generative Adversarial Networks 5
Learning from Trajectories via Subgoal Discovery 3
Learning from brains how to regularize machines 4
Learning in Generalized Linear Contextual Bandits with Stochastic Delays 1
Learning low-dimensional state embeddings and metastable clusters from time series data 2
Learning metrics for persistence-based summaries and applications for graph classification 3
Learning nonlinear level sets for dimensionality reduction in function approximation 5
Learning search spaces for Bayesian optimization: Another view of hyperparameter transfer learning 4
Learning step sizes for unfolded sparse coding 3
Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder 3
Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity 3
Learning to Correlate in Multi-Player General-Sum Sequential Games 3
Learning to Infer Implicit Surfaces without 3D Supervision 4
Learning to Learn By Self-Critique 4
Learning to Optimize in Swarms 3
Learning to Perform Local Rewriting for Combinatorial Optimization 4
Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer 4
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis 5
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction 3
Learning to Propagate for Graph Meta-Learning 6
Learning to Screen 0
Learning to Self-Train for Semi-Supervised Few-Shot Classification 4
Learning-Based Low-Rank Approximations 2
Learning-In-The-Loop Optimization: End-To-End Control And Co-Design Of Soft Robots Through Learned Deep Latent Representations 3
Legendre Memory Units: Continuous-Time Representation in Recurrent Neural Networks 4
Levenshtein Transformer 4
Leveraging Labeled and Unlabeled Data for Consistent Fair Binary Classification 4
Likelihood Ratios for Out-of-Distribution Detection 5
Likelihood-Free Overcomplete ICA and Applications In Causal Discovery 3
Limitations of Lazy Training of Two-layers Neural Network 1
Limitations of the empirical Fisher approximation for natural gradient descent 3
Limiting Extrapolation in Linear Approximate Value Iteration 2
Limits of Private Learning with Access to Public Data 1
Linear Stochastic Bandits Under Safety Constraints 1
List-decodable Linear Regression 0
LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition 4
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning 4
Local SGD with Periodic Averaging: Tighter Analysis and Adaptive Synchronization 5
Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond 4
Localized Structured Prediction 4
Locally Private Gaussian Estimation 1
Locally Private Learning without Interaction Requires Separation 0
Logarithmic Regret for Online Control 1
Lookahead Optimizer: k steps forward, 1 step back 6
Low-Complexity Nonparametric Bayesian Online Prediction with Universal Guarantees 4
Low-Rank Bandit Methods for High-Dimensional Dynamic Pricing 3
Lower Bounds on Adversarial Robustness from Optimal Transport 3
MAVEN: Multi-Agent Variational Exploration 3
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies 3
MaCow: Masked Convolutional Generative Flow 4
Machine Learning Estimation of Heterogeneous Treatment Effects with Instruments 6
Machine Teaching of Active Sequential Learners 4
Making AI Forget You: Data Deletion in Machine Learning 3
Making the Cut: A Bandit-based Approach to Tiered Interviewing 3
Manifold denoising by Nonlinear Robust Principal Component Analysis 3
Manifold-regression to predict from MEG/EEG brain signals without source modeling 4
Manipulating a Learning Defender and Ways to Counteract 2
Mapping State Space using Landmarks for Universal Goal Reaching 3
Margin-Based Generalization Lower Bounds for Boosted Classifiers 1
MarginGAN: Adversarial Training in Semi-Supervised Learning 2
Markov Random Fields for Collaborative Filtering 5
Massively scalable Sinkhorn distances via the Nyström method 3
Max-value Entropy Search for Multi-Objective Bayesian Optimization 5
MaxGap Bandit: Adaptive Algorithms for Approximate Ranking 4
Maximum Entropy Monte-Carlo Planning 2
Maximum Expected Hitting Cost of a Markov Decision Process and Informativeness of Rewards 0
Maximum Mean Discrepancy Gradient Flow 2
McDiarmid-Type Inequalities for Graph-Dependent Variables and Stability Bounds 0
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis 4
Memory Efficient Adaptive Optimization 5
Memory-oriented Decoder for Light Field Salient Object Detection 4
Meta Architecture Search 5
Meta Learning with Relational Information for Short Sequences 3
Meta-Curvature 5
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables 2
Meta-Learning Representations for Continual Learning 5
Meta-Learning with Implicit Gradients 5
Meta-Reinforced Synthetic Data for One-Shot Fine-Grained Visual Recognition 4
Meta-Surrogate Benchmarking for Hyperparameter Optimization 4
Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting 5
MetaInit: Initializing learning by learning to initialize 4
MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization 5
Metalearned Neural Memory 3
Metamers of neural networks reveal divergence from human perceptual systems 4
Metric Learning for Adversarial Robustness 6
Minimal Variance Sampling in Stochastic Gradient Boosting 5
Minimax Optimal Estimation of Approximate Differential Privacy on Neighboring Databases 3
Minimizers of the Empirical Risk and Risk Monotonicity 0
Minimum Stein Discrepancy Estimators 1
Mining GOLD Samples for Conditional GANs 3
MintNet: Building Invertible Neural Networks with Masked Convolutions 3
Missing Not at Random in Matrix Completion: The Effectiveness of Estimating Missingness Probabilities Under a Low Nuclear Norm Assumption 4
MixMatch: A Holistic Approach to Semi-Supervised Learning 5
Mixtape: Breaking the Softmax Bottleneck Efficiently 2
Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation 3
Model Compression with Adversarial Robustness: A Unified Optimization Framework 4
Model Selection for Contextual Bandits 1
Model Similarity Mitigates Test Set Overuse 2
Modeling Conceptual Understanding in Image Reference Games 4
Modeling Dynamic Functional Connectivity with Latent Factor Gaussian Processes 4
Modeling Expectation Violation in Intuitive Physics with Coarse Probabilistic Object Representations 2
Modeling Tabular data using Conditional GAN 4
Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections 3
Modelling heterogeneous distributions with an Uncountable Mixture of Asymmetric Laplacians 6
Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach 2
Modular Universal Reparameterization: Deep Multi-task Learning Across Diverse Domains 4
Momentum-Based Variance Reduction in Non-Convex SGD 4
MonoForest framework for tree ensemble analysis 3
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation 5
Multi-Agent Common Knowledge Reinforcement Learning 3
Multi-Criteria Dimensionality Reduction with Applications to Fairness 3
Multi-Resolution Weak Supervision for Sequential Data 4
Multi-View Reinforcement Learning 2
Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition 5
Multi-mapping Image-to-Image Translation via Learning Disentanglement 2
Multi-marginal Wasserstein GAN 5
Multi-objective Bayesian optimisation with preferences over objectives 3
Multi-objects Generation with Amortized Structural Regularization 3
Multi-relational Poincaré Graph Embeddings 3
Multi-resolution Multi-task Gaussian Processes 3
Multi-source Domain Adaptation for Semantic Segmentation 4
Multi-task Learning for Aggregated Data using Gaussian Processes 4
Multiagent Evaluation under Incomplete Information 3
Multiclass Learning from Contradictions 4
Multiclass Performance Metric Elicitation 3
Multilabel reductions: what is my loss optimising? 0
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation 4
Multiple Futures Prediction 4
Multivariate Distributionally Robust Convex Regression under Absolute Error Loss 2
Multivariate Sparse Coding of Nonstationary Covariances with Gaussian Processes 3
Multivariate Triangular Quantile Maps for Novelty Detection 3
Multiview Aggregation for Learning Category-Specific Shape Reconstruction 2
Multiway clustering via tensor block models 4
Mutually Regressive Point Processes 4
Möbius Transformation for Fast Inner Product Search on Graph 3
N-Gram Graph: Simple Unsupervised Representation for Graphs, with Applications to Molecules 5
NAOMI: Non-Autoregressive Multiresolution Sequence Imputation 4
NAT: Neural Architecture Transformer for Accurate and Compact Architectures 5
Near Neighbor: Who is the Fairest of Them All? 2
Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes 3
Nearly Tight Bounds for Robust Proper Learning of Halfspaces with a Margin 1
Necessary and Sufficient Geometries for Gradient Methods 0
Network Pruning via Transformable Architecture Search 6
NeurVPS: Neural Vanishing Point Scanning via Conic Convolution 5
Neural Attribution for Semantic Bug-Localization in Student Programs 5
Neural Diffusion Distance for Image Segmentation 4
Neural Jump Stochastic Differential Equations 6
Neural Lyapunov Control 3
Neural Machine Translation with Soft Prototype 5
Neural Multisensory Scene Inference 2
Neural Networks with Cheap Differential Operators 2
Neural Relational Inference with Fast Modular Meta-learning 3
Neural Shuffle-Exchange Networks - Sequence Processing in O(n log n) Time 4
Neural Similarity Learning 3
Neural Spline Flows 6
Neural Taskonomy: Inferring the Similarity of Task-Derived Representations from Brain Activity 4
Neural Temporal-Difference Learning Converges to Global Optima 1
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy 1
Neural networks grown and self-organized by noise 1
Neuropathic Pain Diagnosis Simulator for Causal Discovery Algorithm Evaluation 2
No Pressure! Addressing the Problem of Local Minima in Manifold Learning Algorithms 3
No-Press Diplomacy: Modeling Multi-Agent Gameplay 3
No-Regret Learning in Unknown Games with Correlated Payoffs 3
Noise-tolerant fair classification 3
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs 1
Non-Asymptotic Pure Exploration by Solving Games 2
Non-Cooperative Inverse Reinforcement Learning 2
Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning 3
Non-asymptotic Analysis of Stochastic Methods for Non-Smooth Non-Convex Regularized Problems 3
Non-normal Recurrent Neural Network (nnRNN): learning long time dependencies while improving expressivity with transient dynamics 2
Nonconvex Low-Rank Tensor Completion from Noisy Data 2
Nonlinear scaling of resource allocation in sensory bottlenecks 1
Nonparametric Contextual Bandits in Metric Spaces with Unknown Metric 2
Nonparametric Density Estimation & Convergence Rates for GANs under Besov IPM Losses 0
Nonparametric Regressive Point Processes Based on Conditional Gaussian Processes 2
Nonstochastic Multiarmed Bandits with Unrestricted Delays 1
Nonzero-sum Adversarial Hypothesis Testing Games 2
Normalization Helps Training of Quantized LSTM 3
Novel positional encodings to enable tree-based transformers 3
Numerically Accurate Hyperbolic Embeddings Using Tiling-Based Models 4
ODE2VAE: Deep generative second order ODEs with Bayesian neural networks 4
Object landmark discovery through unsupervised adaptation 4
ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models 1
Oblivious Sampling Algorithms for Private Data Analysis 4
Off-Policy Evaluation via Off-Policy Classification 3
Offline Contextual Bandits with High Probability Fairness Guarantees 2
Offline Contextual Bayesian Optimization 5
On Adversarial Mixup Resynthesis 5
On Differentially Private Graph Sparsification and Applications 1
On Distributed Averaging for Stochastic k-PCA 2
On Exact Computation with an Infinitely Wide Neural Net 1
On Fenchel Mini-Max Learning 5
On Human-Aligned Risk Minimization 2
On Lazy Training in Differentiable Programming 3
On Learning Over-parameterized Neural Networks: A Functional Approximation Perspective 1
On Making Stochastic Classifiers Deterministic 4
On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks 2
On Relating Explanations and Adversarial Examples 6
On Robustness of Principal Component Regression 0
On Robustness to Adversarial Examples and Polynomial Optimization 3
On Sample Complexity Upper and Lower Bounds for Exact Ranking from Noisy Comparisons 2
On Single Source Robustness in Deep Fusion Models 6
On Testing for Biases in Peer Review 1
On The Classification-Distortion-Perception Tradeoff 3
On Tractable Computation of Expected Predictions 4
On the (In)fidelity and Sensitivity of Explanations 2
On the Accuracy of Influence Functions for Measuring Group Effects 3
On the Calibration of Multiclass Classification with Rejection 2
On the Convergence Rate of Training Recurrent Neural Networks 0
On the Correctness and Sample Complexity of Inverse Reinforcement Learning 1
On the Curved Geometry of Accelerated Optimization 0
On the Downstream Performance of Compressed Word Embeddings 3
On the Expressive Power of Deep Polynomial Neural Networks 1
On the Fairness of Disentangled Representations 1
On the Global Convergence of (Fast) Incremental Expectation Maximization Methods 3
On the Hardness of Robust Classification 0
On the Inductive Bias of Neural Tangent Kernels 2
On the Ineffectiveness of Variance Reduced Optimization for Deep Learning 3
On the Optimality of Perturbations in Stochastic and Adversarial Multi-armed Bandit Problems 3
On the Power and Limitations of Random Features for Understanding Neural Networks 0
On the Transfer of Inductive Bias from Simulation to the Real World: a New Disentanglement Dataset 3
On the Utility of Learning about Humans for Human-AI Coordination 2
On the Value of Target Data in Transfer Learning 1
On the convergence of single-call stochastic extra-gradient methods 2
On the equivalence between graph isomorphism testing and function approximation with GNNs 5
On the number of variables to use in principal component regression 0
On two ways to use determinantal point processes for Monte Carlo integration 2
One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizers 2
One-Shot Object Detection with Co-Attention and Co-Excitation 5
Online Continual Learning with Maximal Interfered Retrieval 5
Online Continuous Submodular Maximization: From Full-Information to Bandit Feedback 1
Online Convex Matrix Factorization with Representative Regions 4
Online EXP3 Learning in Adversarial Bandits with Delayed Feedback 1
Online Forecasting of Total-Variation-bounded Sequences 2
Online Learning via the Differential Privacy Lens 1
Online Markov Decoding: Lower Bounds and Near-Optimal Approximation Algorithms 2
Online Normalization for Training Neural Networks 5
Online Optimal Control with Linear Dynamics and Predictions: Algorithms and Regret Analysis 2
Online Prediction of Switching Graph Labelings with Cluster Specialists 4
Online Stochastic Shortest Path with Bandit Feedback and Unknown Transition Function 1
Online sampling from log-concave distributions 3
Online-Within-Online Meta-Learning 5
Optimal Analysis of Subset-Selection Based L_p Low-Rank Approximation 1
Optimal Best Markovian Arm Identification with Fixed Confidence 0
Optimal Decision Tree with Noisy Outcomes 3
Optimal Pricing in Repeated Posted-Price Auctions with Different Patience of the Seller and the Buyer 2
Optimal Sampling and Clustering in the Stochastic Block Model 2
Optimal Sketching for Kronecker Product Regression and Low Rank Approximation 2
Optimal Sparse Decision Trees 5
Optimal Sparsity-Sensitive Bounds for Distributed Mean Estimation 3
Optimal Statistical Rates for Decentralised Non-Parametric Regression with Linear Speed-Up 0
Optimal Stochastic and Online Learning with Individual Iterates 4
Optimistic Distributionally Robust Optimization for Nonparametric Likelihood Approximation 4
Optimistic Regret Minimization for Extensive-Form Games via Dilated Distance-Generating Functions 2
Optimizing Generalized PageRank Methods for Seed-Expansion Community Detection 2
Optimizing Generalized Rate Metrics with Three Players 3
Oracle-Efficient Algorithms for Online Linear Optimization with Bandit Feedback 1
Order Optimal One-Shot Distributed Learning 1
Ordered Memory 5
Ouroboros: On Accelerating Training of Transformer-Based Language Models 5
Outlier Detection and Robust PCA Using a Convex Measure of Innovation 3
Outlier-Robust High-Dimensional Sparse Estimation via Iterative Filtering 3
Outlier-robust estimation of a sparse linear model using $\ell_1$-penalized Huber's $M$-estimator 1
PAC-Bayes Un-Expected Bernstein Inequality 3
PAC-Bayes under potentially heavy tails 1
PC-Fairness: A Unified Framework for Measuring Causality-based Fairness 3
PHYRE: A New Benchmark for Physical Reasoning 4
PIDForest: Anomaly Detection via Partial Identification 4
PRNet: Self-Supervised Learning for Partial-to-Partial Registration 3
Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates 4
Paradoxes in Fair Machine Learning 2
Parameter elimination in particle Gibbs sampling 3
Paraphrase Generation with Latent Bag of Words 3
Pareto Multi-Task Learning 3
Park: An Open Platform for Learning-Augmented Computer Systems 4
Partially Encrypted Deep Learning using Functional Encryption 6
Partitioning Structure Learning for Segmented Linear Regression Trees 5
PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph 5
Perceiving the arrow of time in autoregressive motion 2
Personalizing Many Decisions with High-Dimensional Covariates 2
PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points 3
PerspectiveNet: A Scene-consistent Image Generator for New View Synthesis in Real Indoor Environments 2
Phase Transitions and Cyclic Phenomena in Bandits with Switching Constraints 1
Piecewise Strong Convexity of Neural Networks 2
Planning in entropy-regularized Markov decision processes and games 1
Planning with Goal-Conditioned Policies 3
Poincaré Recurrence, Cycles and Spurious Equilibria in Gradient-Descent-Ascent for Non-Convex Non-Concave Zero-Sum Games 0
Point-Voxel CNN for Efficient 3D Deep Learning 4
PointDAN: A Multi-Scale 3D Domain Adaption Network for Point Cloud Representation 5
Poisson-Minibatching for Gibbs Sampling with Convergence Rate Guarantees 3
Poisson-Randomized Gamma Dynamical Systems 4
Policy Continuation with Hindsight Inverse Dynamics 4
Policy Evaluation with Latent Confounders via Optimal Balance 4
Policy Learning for Fairness in Ranking 2
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games 0
Policy Poisoning in Batch Reinforcement Learning and Control 2
Polynomial Cost of Adaptation for X-Armed Bandits 1
Positional Normalization 4
Positive-Unlabeled Compression on the Cloud 3
Post training 4-bit quantization of convolutional networks for rapid-deployment 4
Power analysis of knockoff filters for correlated designs 1
PowerSGD: Practical Low-Rank Gradient Compression for Distributed Optimization 4
Powerset Convolutional Neural Networks 4
Practical Deep Learning with Bayesian Principles 6
Practical Differentially Private Top-k Selection with Pay-what-you-get Composition 1
Practical Two-Step Lookahead Bayesian Optimization 4
Practical and Consistent Estimation of f-Divergences 3
Precision-Recall Balanced Topic Modelling 3
Predicting the Politics of an Image Using Webly Supervised Data 3
Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees 3
Preference-Based Batch and Sequential Teaching: Towards a Unified View of Models 1
Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks 4
Primal-Dual Block Generalized Frank-Wolfe 5
Principal Component Projection and Regression in Nearly Linear Time through Asymmetric SVRG 2
Prior-Free Dynamic Auctions with Low Regret Buyers 1
Privacy Amplification by Mixing and Diffusion Mechanisms 1
Privacy-Preserving Classification of Personal Text Messages with Secure Multi-Party Computation 6
Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces 3
Private Hypothesis Selection 1
Private Learning Implies Online Learning: An Efficient Reduction 1
Private Stochastic Convex Optimization with Optimal Rates 1
Private Testing of Distributions via Sample Permutations 1
Probabilistic Logic Neural Networks for Reasoning 2
Probabilistic Watershed: Sampling all spanning forests for seeded segmentation and semi-supervised learning 2
Procrastinating with Confidence: Near-Optimal, Anytime, Adaptive Algorithm Configuration 3
Program Synthesis and Semantic Parsing with Learned Code Idioms 4
Progressive Augmentation of GANs 3
Projected Stein Variational Newton: A Fast and Scalable Bayesian Inference Method in High Dimensions 2
Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters 3
Provable Certificates for Adversarial Examples: Fitting a Ball in the Union of Polytopes 3
Provable Gradient Variance Guarantees for Black-Box Variational Inference 2
Provable Non-linear Inductive Matrix Completion 3
Provably Efficient Q-Learning with Low Switching Cost 1
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle 1
Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost 2
Provably Powerful Graph Networks 3
Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers 4
Provably robust boosted decision stumps and trees against adversarial attacks 5
Pseudo-Extended Markov chain Monte Carlo 4
Pure Exploration with Multiple Correct Answers 1
Push-pull Feedback Implements Hierarchical Information Retrieval Efficiently 2
Putting An End to End-to-End: Gradient-Isolated Learning of Representations 4
PyTorch: An Imperative Style, High-Performance Deep Learning Library 7
Qsparse-local-SGD: Distributed SGD with Quantization, Sparsification and Local Computations 5
Quadratic Video Interpolation 3
Quality Aware Generative Adversarial Networks 2
Quantum Embedding of Knowledge for Reasoning 3
Quantum Entropy Scoring for Fast Robust Mean Estimation and Improved Outlier Detection 5
Quantum Wasserstein Generative Adversarial Networks 2
Quaternion Knowledge Graph Embeddings 3
R2D2: Reliable and Repeatable Detector and Descriptor 4
REM: From Structural Entropy to Community Structure Deception 4
RSN: Randomized Subspace Newton 3
RUBi: Reducing Unimodal Biases for Visual Question Answering 3
RUDDER: Return Decomposition for Delayed Rewards 2
Random Path Selection for Continual Learning 4
Random Projections and Sampling Algorithms for Clustering of High-Dimensional Polygonal Curves 3
Random Projections with Asymmetric Quantization 1
Random Quadratic Forms with Dependence: Applications to Restricted Isometry and Beyond 0
Random Tessellation Forests 4
Random deep neural networks are biased towards simple functions 2
Rapid Convergence of the Unadjusted Langevin Algorithm: Isoperimetry Suffices 0
Rates of Convergence for Large-scale Nearest Neighbor Classification 4
Re-examination of the Role of Latent Variables in Sequence Modeling 2
Re-randomized Densification for One Permutation Hashing and Bin-wise Consistent Weighted Sampling 2
Real-Time Reinforcement Learning 3
Reconciling meta-learning and continual learning with online mixtures of tasks 2
Reconciling λ-Returns with Experience Replay 3
Recovering Bandits 2
Recurrent Kernel Networks 4
Recurrent Registration Neural Networks for Deformable Image Registration 3
Recurrent Space-time Graph Neural Networks 6
Reducing Noise in GAN Training with Variance Reduced Extragradient 3
Reducing the variance in online optimization by transporting past gradients 4
Reflection Separation using a Pair of Unpolarized and Polarized Images 3
Region Mutual Information Loss for Semantic Segmentation 4
Region-specific Diffeomorphic Metric Mapping 4
Regression Planning Networks 1
Regret Bounds for Learning State Representations in Reinforcement Learning 1
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems 3
Regret Minimization for Reinforcement Learning by Evaluating the Optimal Bias Function 1
Regret Minimization for Reinforcement Learning with Vectorial Feedback and Complex Objectives 2
Regularization Matters: Generalization and Optimization of Neural Nets v.s. their Induced Kernel 1
Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning 4
Regularized Gradient Boosting 4
Regularized Weighted Low Rank Approximation 4
Regularizing Trajectory Optimization with Denoising Autoencoders 3
Reinforcement Learning with Convex Constraints 2
Reliable training and estimation of variance networks 3
ResNets Ensemble via the Feynman-Kac Formalism to Improve Natural and Robust Accuracies 5
Residual Flows for Invertible Generative Modeling 2
Rethinking Deep Neural Network Ownership Verification: Embedding Passports to Defeat Ambiguity Attacks 4
Rethinking Generative Mode Coverage: A Pointwise Guaranteed Approach 2
Rethinking Kernel Methods for Node Representation Learning on Graphs 4
Rethinking the CSC Model for Natural Images 3
Retrosynthesis Prediction with Conditional Graph Logic Network 6
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness 3
Reverse engineering recurrent networks for sentiment classification reveals line attractor dynamics 2
Revisiting the Bethe-Hessian: Improved Community Detection in Sparse Heterogeneous Graphs 3
Riemannian batch normalization for SPD neural networks 5
Robust Attribution Regularization 3
Robust Bi-Tempered Logistic Loss Based on Bregman Divergences 6
Robust Multi-agent Counterfactual Prediction 2
Robust Principal Component Analysis with Adaptive Neighbors 5
Robust and Communication-Efficient Collaborative Learning 3
Robust exploration in linear quadratic reinforcement learning 3
Robustness Verification of Tree-based Models 5
Robustness to Adversarial Perturbations in Learning from Incomplete Data 2
Root Mean Square Layer Normalization 4
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models 6
SGD on Neural Networks Learns Functions of Increasing Complexity 1
SHE: A Fast and Accurate Deep Neural Network for Encrypted Data 4
SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits 1
SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies 5
SPoC: Search-based Pseudocode to Code 5
SSRGD: Simple Stochastic Recursive Gradient Descent for Escaping Saddle Points 1
STAR-Caps: Capsule Networks with Straight-Through Attentive Routing 2
STREETS: A Novel Camera Network Dataset for Traffic Flow 4
Saccader: Improving Accuracy of Hard Attention Models for Vision 4
Safe Exploration for Interactive Machine Learning 3
Same-Cluster Querying for Overlapping Clusters 3
Sample Adaptive MCMC 6
Sample Complexity of Learning Mixture of Sparse Linear Regressions 1
Sample Efficient Active Learning of Causal Trees 2
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update 5
Sampled Softmax with Random Fourier Features 2
Sampling Networks and Aggregate Simulation for Online POMDP Planning 2
Sampling Sketches for Concave Sublinear Functions of Frequencies 3
Scalable Bayesian dynamic covariance modeling with variational Wishart and inverse Wishart processes 5
Scalable Bayesian inference of dendritic voltage via spatiotemporal recurrent state space models 3
Scalable Deep Generative Relational Model with High-Order Node Dependence 3
Scalable Global Optimization via Local Bayesian Optimization 4
Scalable Gromov-Wasserstein Learning for Graph Partitioning and Matching 5
Scalable Spike Source Localization in Extracellular Recordings using Amortized Variational Inference 5
Scalable Structure Learning of Continuous-Time Bayesian Networks from Incomplete Data 4
Scalable inference of topic evolution via models for latent geometric structures 4
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations 4
Screening Sinkhorn Algorithm for Regularized Optimal Transport 3
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning 3
Search-Guided, Lightly-Supervised Training of Structured Prediction Energy Networks 4
Secretary Ranking with Minimal Inversions 1
Seeing the Wind: Visual Wind Speed Prediction with a Coupled Convolutional and Recurrent Neural Network 3
Selecting Optimal Decisions via Distributionally Robust Nearest-Neighbor Regression 3
Selecting causal brain features with a single conditional independence test per feature 3
Selecting the independent coordinates of manifolds with large aspect ratios 4
Selective Sampling-based Scalable Sparse Subspace Clustering 5
Self-Critical Reasoning for Robust Visual Question Answering 4
Self-Routing Capsule Networks 2
Self-Supervised Deep Learning on Point Clouds by Reconstructing Space 4
Self-Supervised Generalisation with Meta Auxiliary Learning 4
Self-attention with Functional Time Representation Learning 3
Self-supervised GAN: Analysis and Improvement with Multi-class Minimax Game 3
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos 4
Semantic-Guided Multi-Attention Localization for Zero-Shot Learning 4
Semi-Implicit Graph Variational Auto-Encoders 4
Semi-Parametric Dynamic Contextual Pricing 1
Semi-Parametric Efficient Policy Learning with Continuous Actions 3
Semi-flat minima and saddle points by embedding neural networks to overparameterization 2
Semi-supervisedly Co-embedding Attributed Networks 4
Sequence Modeling with Unconstrained Generation Order 4
Sequential Experimental Design for Transductive Linear Bandits 3
Sequential Neural Processes 1
Shadowing Properties of Optimization Algorithms 2
Shallow RNN: Accurate Time-series Classification on Resource Constrained Devices 4
Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models 4
Shaping Belief States with Generative Environment Models for RL 3
Sim2real transfer learning for 3D human pose estimation: motion to the rescue 2
Single-Model Uncertainties for Deep Learning 4
Singleshot : a scalable Tucker tensor decomposition 4
Sinkhorn Barycenters with Free Support via Frank-Wolfe Algorithm 4
Slice-based Learning: A Programming Model for Residual Learning in Critical Data Slices 3
Sliced Gromov-Wasserstein 3
Small ReLU networks are powerful memorizers: a tight analysis of memorization capacity 0
Smoothing Structured Decomposable Circuits 4
Sobolev Independence Criterion 5
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks 1
Solving Interpretable Kernel Dimensionality Reduction 6
Solving a Class of Non-Convex Min-Max Games Using Iterative First Order Methods 4
Solving graph compression via optimal transport 4
SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers 3
Space and Time Efficient Kernel Density Estimation in High Dimensions 4
Sparse High-Dimensional Isotonic Regression 4
Sparse Logistic Regression Learns All Discrete Pairwise Graphical Models 3
Sparse Variational Inference: Bayesian Coresets from Scratch 5
Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization 4
Spatially Aggregated Gaussian Processes with Multivariate Areal Outputs 4
Specific and Shared Causal Relation Modeling and Mechanism-Based Clustering 2
Spectral Modification of Graphs for Improved Spectral Clustering 5
Spherical Text Embedding 3
SpiderBoost and Momentum: Faster Variance Reduction Algorithms 3
Spike-Train Level Backpropagation for Training Deep Recurrent Spiking Neural Networks 4
Splitting Steepest Descent for Growing Neural Architectures 2
Stability of Graph Scattering Transforms 3
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction 2
Stacked Capsule Autoencoders 4
Stagewise Training Accelerates Convergence of Testing Error Over SGD 3
Stand-Alone Self-Attention in Vision Models 4
State Aggregation Learning from Markov Transition Data 2
Statistical Analysis of Nearest Neighbor Methods for Anomaly Detection 3
Statistical Model Aggregation via Parameter Matching 4
Statistical bounds for entropic optimal transport: sample complexity and the central limit theorem 0
Statistical-Computational Tradeoff in Single Index Models 0
Staying up to Date with Online Content Changes Using Reinforcement Learning for Scheduling 4
Stein Variational Gradient Descent With Matrix-Valued Kernels 5
Stochastic Bandits with Context Distributions 3
Stochastic Continuous Greedy ++: When Upper and Lower Bounds Match 1
Stochastic Frank-Wolfe for Composite Convex Minimization 7
Stochastic Gradient Hamiltonian Monte Carlo Methods with Recursive Variance Reduction 3
Stochastic Proximal Langevin Algorithm: Potential Splitting and Nonasymptotic Rates 4
Stochastic Runge-Kutta Accelerates Langevin Monte Carlo and Beyond 1
Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers 4
Stochastic Variance Reduced Primal Dual Algorithms for Empirical Composition Optimization 2
Strategizing against No-regret Learners 0
Streaming Bayesian Inference for Crowdsourced Classification 3
Structure Learning with Side Information: Sample Complexity 1
Structured Graph Learning Via Laplacian Spectral Constraints 4
Structured Prediction with Projection Oracles 3
Structured Variational Inference in Continuous Cox Process Models 3
Structured and Deep Similarity Matching via Structured and Deep Hebbian Networks 2
Submodular Function Minimization with Noisy Evaluation Oracle 1
Subquadratic High-Dimensional Hierarchical Clustering 4
Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks 6
Subspace Detours: Building Transport Plans that are Optimal on Subspace Projections 3
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning 4
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems 5
Superposition of many models into one 2
Superset Technique for Approximate Recovery in One-Bit Compressed Sensing 1
Surfing: Iterative Optimization Over Incrementally Trained Deep Networks 3
Surrogate Objectives for Batch Policy Optimization in One-step Decision Making 4
Surround Modulation: A Bio-inspired Connectivity Structure for Convolutional Neural Networks 3
SySCD: A System-Aware Parallel Coordinate Descent Algorithm 5
Symmetry-Based Disentangled Representation Learning requires Interaction with Environments 4
Symmetry-adapted generation of 3d point sets for the targeted discovery of molecules 4
TAB-VCR: Tags and Attributes based VCR Baselines 4
Teaching Multiple Concepts to a Forgetful Learner 2
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations. 4
Tensor Monte Carlo: Particle Methods for the GPU era 3
Text-Based Interactive Recommendation via Constraint-Augmented Reinforcement Learning 4
The Broad Optimality of Profile Maximum Likelihood 1
The Case for Evaluating Causal Models Using Interventional Measures and Empirical Data 1
The Cells Out of Sample (COOS) dataset and benchmarks for measuring out-of-sample generalization of image classifiers 3
The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies 2
The Fairness of Risk Scores Beyond Classification: Bipartite Ranking and the XAUC Metric 1
The Functional Neural Process 3
The Geometry of Deep Networks: Power Diagram Subdivision 1
The Impact of Regularization on High-dimensional Logistic Regression 0
The Implicit Bias of AdaGrad on Separable Data 1
The Implicit Metropolis-Hastings Algorithm 4
The Label Complexity of Active Learning from Observational Data 2
The Landscape of Non-convex Empirical Risk with Degenerate Population Risk 1
The Normalization Method for Alleviating Pathological Sharpness in Wide Neural Networks 1
The Option Keyboard: Combining Skills in Reinforcement Learning 1
The Parameterized Complexity of Cascading Portfolio Scheduling 0
The Point Where Reality Meets Fantasy: Mixed Adversarial Generators for Image Splice Detection 4
The Randomized Midpoint Method for Log-Concave Sampling 3
The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure For Least Squares 4
The Synthesis of XNOR Recurrent Neural Networks with Stochastic Logic 3
The Thermodynamic Variational Objective 4
The continuous Bernoulli: fixing a pervasive error in variational autoencoders 3
The spiked matrix model with generative priors 5
Theoretical Analysis of Adversarial Learning: A Minimax Approach 0
Theoretical Limits of Pipeline Parallel Optimization and Application to Distributed Deep Learning 3
Theoretical evidence for adversarial robustness through randomization 3
Think Globally, Act Locally: A Deep Neural Network Approach to High-Dimensional Time Series Forecasting 6
Think out of the "Box": Generically-Constrained Asynchronous Composite Optimization and Hedging 1
Thinning for Accelerating the Learning of Point Processes 4
Third-Person Visual Imitation Learning via Decoupled Hierarchical Controller 4
This Looks Like That: Deep Learning for Interpretable Image Recognition 4
Thompson Sampling and Approximate Inference 1
Thompson Sampling for Multinomial Logit Contextual Bandits 2
Thompson Sampling with Information Relaxation Penalties 2
Thresholding Bandit with Optimal Aggregate Regret 2
Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers 5
Tight Dimension Independent Lower Bound on the Expected Convergence Rate for Diminishing Step Sizes in SGD 1
Tight Dimensionality Reduction for Sketching Low Degree Polynomial Kernels 2
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies 2
Tight Sample Complexity of Learning One-hidden-layer Convolutional Neural Networks 2
Time Matters in Regularizing Deep Networks: Weight Decay and Data Augmentation Affect Early Learning Dynamics, Matter Little Near Convergence 2
Time-series Generative Adversarial Networks 3
Time/Accuracy Tradeoffs for Learning a ReLU with respect to Gaussian Marginals 2
Topology-Preserving Deep Image Segmentation 3
Total Least Squares Regression in Input Sparsity Time 6
Toward a Characterization of Loss Functions for Distribution Learning 0
Towards Automatic Concept-based Explanations 4
Towards Explaining the Regularization Effect of Initial Large Learning Rate in Training Neural Networks 2
Towards Hardware-Aware Tractable Learning of Probabilistic Models 5
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents 2
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling 1
Towards Practical Alternating Least-Squares for CCA 4
Towards Understanding the Importance of Shortcut Connections in Residual Networks 1
Towards a Zero-One Law for Column Subset Selection 3
Towards closing the gap between the theory and practice of SVRG 3
Towards modular and programmable architecture search 6
Training Image Estimators without Image Ground Truth 4
Training Language GANs from Scratch 4
Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration 4
Transductive Zero-Shot Learning with Visual Structure Constraint 4
Transfer Anomaly Detection by Inferring Latent Domain Representations 3
Transfer Learning via Minimizing the Performance Gap Between Domains 4
Transferable Normalization: Towards Improving Transferability of Deep Neural Networks 3
Transfusion: Understanding Transfer Learning for Medical Imaging 1
Tree-Sliced Variants of Wasserstein Distances 6
Triad Constraints for Learning Causal Structure of Latent Variables 2
Trivializations for Gradient-Based Optimization on Manifolds 3
Trust Region-Guided Proximal Policy Optimization 5
Turbo Autoencoder: Deep learning based channel codes for point-to-point communication channels 2
Twin Auxilary Classifiers GAN 4
Two Generator Game: Learning to Sample via Linear Goodness-of-Fit Test 3
Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples 2
U-Time: A Fully Convolutional Network for Time Series Segmentation Applied to Sleep Staging 5
Ultra Fast Medoid Identification via Correlated Sequential Halving 3
Ultrametric Fitting by Gradient Descent 4
Uncertainty on Asynchronous Time Event Prediction 4
Uncertainty-based Continual Learning with Adaptive Regularization 3
Unconstrained Monotonic Neural Networks 4
Uncoupled Regression from Pairwise Comparison Data 3
Understanding Attention and Generalization in Graph Neural Networks 4
Understanding Sparse JL for Feature Hashing 2
Understanding and Improving Layer Normalization 4
Understanding the Representation Power of Graph Neural Networks in Learning Graph Topology 1
Understanding the Role of Momentum in Stochastic Gradient Methods 3
UniXGrad: A Universal, Adaptive Algorithm with Optimal Guarantees for Constrained Optimization 3
Unified Language Model Pre-training for Natural Language Understanding and Generation 5
Unified Sample-Optimal Property Estimation in Near-Linear Time 0
Uniform Error Bounds for Gaussian Process Regression with Application to Safe Control 2
Uniform convergence may be unable to explain generalization in deep learning 2
Universal Approximation of Input-Output Maps by Temporal Convolutional Nets 0
Universal Boosting Variational Inference 5
Universal Invariant and Equivariant Graph Neural Networks 2
Universality and individuality in neural dynamics across large populations of recurrent networks 1
Universality in Learning from Linear Measurements 1
Unlabeled Data Improves Adversarial Robustness 4
Unlocking Fairness: a Trade-off Revisited 2
Unsupervised Co-Learning on $G$-Manifolds Across Irreducible Representations 4
Unsupervised Curricula for Visual Meta-Reinforcement Learning 3
Unsupervised Discovery of Temporal Structure in Noisy Data with Dynamical Components Analysis 4
Unsupervised Emergence of Egocentric Spatial Structure from Sensorimotor Prediction 1
Unsupervised Keypoint Learning for Guiding Class-Conditional Video Prediction 3
Unsupervised Learning of Object Keypoints for Perception and Control 2
Unsupervised Meta-Learning for Few-Shot Image Classification 4
Unsupervised Object Segmentation by Redrawing 6
Unsupervised Scalable Representation Learning for Multivariate Time Series 4
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video 4
Unsupervised State Representation Learning in Atari 4
Unsupervised learning of object structure and dynamics from videos 1
Untangling in Invariant Speech Recognition 2
Updates of Equilibrium Prop Match Gradients of Backprop Through Time in an RNN with Static Input 3
User-Specified Local Differential Privacy in Unconstrained Adaptive Online Learning 1
Using Embeddings to Correct for Unobserved Confounding in Networks 4
Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty 3
Using Statistics to Automate Stochastic Optimization 4
Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning 3
VIREL: A Variational Inference Framework for Reinforcement Learning 3
Value Function in Frequency Domain and the Characteristic Value Iteration Algorithm 0
Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning 2
Variance Reduced Policy Evaluation with Smooth Function Approximation 3
Variance Reduction for Matrix Games 1
Variance Reduction in Bipartite Experiments through Correlation Clustering 2
Variational Bayes under Model Misspecification 1
Variational Bayesian Decision-making for Continuous Utilities 3
Variational Bayesian Optimal Experimental Design 1
Variational Denoising Network: Toward Blind Noise Modeling and Removal 4
Variational Graph Recurrent Neural Networks 3
Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models 3
Variational Structured Semantic Inference for Diverse Image Captioning 6
Variational Temporal Abstraction 2
Verified Uncertainty Calibration 4
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks 4
Visual Concept-Metaconcept Learning 4
Visual Sequence Learning in Hierarchical Prediction Networks and Primate Visual Cortex 5
Visualizing and Measuring the Geometry of BERT 3
Visualizing the PHATE of Neural Networks 3
Volumetric Correspondence Networks for Optical Flow 4
Wasserstein Dependency Measure for Representation Learning 1
Wasserstein Weisfeiler-Lehman Graph Kernels 6
Weakly Supervised Instance Segmentation using the Bounding Box Tightness Prior 4
Weight Agnostic Neural Networks 3
Weighted Linear Bandits for Non-Stationary Environments 3
What Can ResNet Learn Efficiently, Going Beyond Kernels? 1
What the Vec? Towards Probabilistically Grounded Embeddings 2
When does label smoothing help? 3
When to Trust Your Model: Model-Based Policy Optimization 3
When to use parametric models in reinforcement learning? 3
Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model 3
Who is Afraid of Big Bad Minima? Analysis of gradient-flow in spiked matrix-tensor models 1
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition 5
Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes 4
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent 3
Worst-Case Regret Bounds for Exploration via Randomized Value Functions 1
Write, Execute, Assess: Program Synthesis with a REPL 0
XLNet: Generalized Autoregressive Pretraining for Language Understanding 5
XNAS: Neural Architecture Search with Expert Advice 6
You Only Propagate Once: Accelerating Adversarial Training via Maximal Principle 4
ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization 3
Zero-Shot Semantic Segmentation 3
Zero-shot Knowledge Transfer via Adversarial Belief Matching 4
Zero-shot Learning via Simultaneous Generating and Learning 4
iSplit LBI: Individualized Partial Ranking with Ties via Split LBI 1
k-Means Clustering of Lines for Big Data 3
muSSP: Efficient Min-cost Flow Algorithm for Multi-object Tracking 3
q-means: A quantum algorithm for unsupervised machine learning 2
vGraph: A Generative Model for Joint Community Detection and Node Representation Learning 3