Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

International Conference on Machine Learning (ICML) - 2019

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
ICML 2019 773 0.49 3.23 92.24% 46.28%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs 1
A Better k-means++ Algorithm via Local Search 3
A Block Coordinate Descent Proximal Method for Simultaneous Filtering and Parameter Estimation 2
A Composite Randomized Incremental Gradient Method 2
A Conditional-Gradient-Based Augmented Lagrangian Framework 2
A Contrastive Divergence for Combining Variational Inference and MCMC 4
A Convergence Theory for Deep Learning via Over-Parameterization 2
A Deep Reinforcement Learning Perspective on Internet Congestion Control 2
A Dynamical Systems Perspective on Nesterov Acceleration 1
A Framework for Bayesian Optimization in Embedded Subspaces 3
A Gradual, Semi-Discrete Approach to Generative Network Training via Explicit Wasserstein Minimization 3
A Kernel Perspective for Regularizing Deep Neural Networks 4
A Kernel Theory of Modern Data Augmentation 3
A Large-Scale Study on Regularization and Normalization in GANs 3
A Multitask Multiple Kernel Learning Algorithm for Survival Analysis with Application to Cancer Biology 5
A Persistent Weisfeiler-Lehman Procedure for Graph Classification 5
A Personalized Affective Memory Model for Improving Emotion Recognition 3
A Polynomial Time MCMC Method for Sampling from Continuous Determinantal Point Processes 2
A Quantitative Analysis of the Effect of Batch Normalization on Gradient Descent 2
A Recurrent Neural Cascade-based Model for Continuous-Time Diffusion 4
A Statistical Investigation of Long Memory in Language and Music 2
A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks 3
A Theoretical Analysis of Contrastive Unsupervised Representation Learning 2
A Theory of Regularized Markov Decision Processes 0
A Tree-Based Method for Fast Repeated Sampling of Determinantal Point Processes 3
A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning 4
A fully differentiable beam search decoder 3
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables 5
AReS and MaRS Adversarial and MMD-Minimizing Regression for SDEs 4
AUCμ: A Performance Metric for Multi-Class Machine Learning Models 0
Accelerated Flow for Probability Distributions 2
Accelerated Linear Convergence of Stochastic Momentum Methods in Wasserstein Distances 0
Acceleration of SVRG and Katyusha X by Inexact Preconditioning 6
Action Robust Reinforcement Learning and Applications in Continuous Control 4
Active Embedding Search via Noisy Paired Comparisons 4
Active Learning for Decision-Making from Imbalanced Observational Data 4
Active Learning for Probabilistic Structured Prediction of Cuts and Matchings 2
Active Learning with Disagreement Graphs 4
Active Manifolds: A non-linear analogue to Active Subspaces 3
Actor-Attention-Critic for Multi-Agent Reinforcement Learning 3
AdaGrad Stepsizes: Sharp Convergence Over Nonconvex Landscapes 4
Adaptive Antithetic Sampling for Variance Reduction 1
Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits 5
Adaptive Neural Trees 4
Adaptive Regret of Convex and Smooth Functions 1
Adaptive Scale-Invariant Online Algorithms for Learning Linear Models 3
Adaptive Sensor Placement for Continuous Spaces 2
Adaptive Stochastic Natural Gradient Method for One-Shot Neural Architecture Search 7
Adaptive and Safe Bayesian Optimization in High Dimensions via One-Dimensional Subspaces 2
Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment 4
Adjustment Criteria for Generalizing Experimental Findings 1
Adversarial Attacks on Node Embeddings via Graph Poisoning 3
Adversarial Examples Are a Natural Consequence of Test Error in Noise 2
Adversarial Generation of Time-Frequency Features with application in audio synthesis 4
Adversarial Online Learning with noise 1
Adversarial camera stickers: A physical camera-based attack on deep learning systems 2
Adversarial examples from computational constraints 0
Adversarially Learned Representations for Information Obfuscation and Inference 3
Agnostic Federated Learning 4
Almost Unsupervised Text to Speech and Automatic Speech Recognition 4
Almost surely constrained convex optimization 4
Alternating Minimizations Converge to Second-Order Optimal Solutions 1
Amortized Monte Carlo Integration 1
An Instability in Variational Inference for Topic Models 1
An Investigation into Neural Net Optimization via Hessian Eigenvalue Density 3
An Investigation of Model-Free Planning 3
An Optimal Private Stochastic-MAB Algorithm based on Optimal Private Stopping Rule 3
Analogies Explained: Towards Understanding Word Embeddings 1
Analyzing Federated Learning through an Adversarial Lens 3
Analyzing and Improving Representations with the Soft Nearest Neighbor Loss 4
Anomaly Detection With Multiple-Hypotheses Predictions 3
Anytime Online-to-Batch, Optimism and Acceleration 1
Approximated Oracle Filter Pruning for Destructive CNN Width Optimization 6
Approximating Orthogonal Matrices with Effective Givens Factorization 4
Approximation and non-parametric estimation of ResNet-type convolutional neural networks 0
Are Generative Classifiers More Robust to Adversarial Attacks? 3
Area Attention 6
Asynchronous Batch Bayesian Optimisation with Improved Local Penalisation 4
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss 2
Automated Model Selection with Bayesian Quadrature 2
Automatic Classifiers as Scientific Instruments: One Step Further Away from Ground-Truth 2
Automatic Posterior Transformation for Likelihood-Free Inference 3
Autoregressive Energy Machines 4
BERT and PALs: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning 3
Band-limited Training and Inference for Convolutional Neural Networks 3
Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case 2
Batch Policy Learning under Constraints 2
BayesNAS: A Bayesian Approach for Neural Architecture Search 5
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning 3
Bayesian Counterfactual Risk Minimization 3
Bayesian Deconditional Kernel Mean Embeddings 1
Bayesian Generative Active Deep Learning 4
Bayesian Joint Spike-and-Slab Graphical Lasso 3
Bayesian Nonparametric Federated Learning of Neural Networks 4
Bayesian Optimization Meets Bayesian Optimal Stopping 3
Bayesian Optimization of Composite Functions 3
Bayesian leave-one-out cross-validation for large data 5
Beating Stochastic and Adversarial Semi-bandits Optimally and Simultaneously 2
Benefits and Pitfalls of the Exponential Mechanism with Applications to Hilbert Spaces and Functional PCA 3
Better generalization with less data using robust gradient descent 3
Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio 3
Beyond Backprop: Online Alternating Minimization with Auxiliary Variables 4
Beyond the Chinese Restaurant and Pitman-Yor processes: Statistical Models with double power-law behavior 3
Bias Also Matters: Bias Attribution for Deep Neural Network Explanation 3
Bilinear Bandits with Low-rank Structure 2
Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables 4
Blended Conditonal Gradients 1
Boosted Density Estimation Remastered 3
Bounding User Contributions: A Bias-Variance Trade-off in Differential Privacy 3
Breaking Inter-Layer Co-Adaptation by Classifier Anonymization 3
Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities 4
Breaking the gridlock in Mixture-of-Experts: Consistent and Efficient Algorithms 4
Bridging Theory and Algorithm for Domain Adaptation 4
CAB: Continuous Adaptive Blending for Policy Evaluation and Learning 3
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network 1
COMIC: Multi-view Clustering Without Parameter Selection 5
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning 2
Calibrated Approximate Bayesian Inference 4
Calibrated Model-Based Deep Reinforcement Learning 5
CapsAndRuns: An Improved Method for Approximately Optimal Algorithm Configuration 3
Categorical Feature Compression via Submodular Optimization 3
Causal Discovery and Forecasting in Nonstationary Environments with State-Space Models 3
Causal Identification under Markov Equivalence: Completeness Results 1
Cautious Regret Minimization: Online Optimization with Long-Term Budget Constraints 2
Certified Adversarial Robustness via Randomized Smoothing 5
Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations 3
Characterization of Convex Objective Functions and Optimal Expected Convergence Rates for SGD 3
Characterizing Well-Behaved vs. Pathological Deep Neural Networks 2
Cheap Orthogonal Constraints in Neural Networks: A Simple Parametrization of the Orthogonal and Unitary Group 4
Circuit-GNN: Graph Neural Networks for Distributed Circuit Design 3
Classification from Positive, Unlabeled and Biased Negative Data 4
Classifying Treatment Responders Under Causal Effect Monotonicity 3
Co-Representation Network for Generalized Zero-Shot Learning 3
Co-manifold learning with missing data 3
CoT: Cooperative Training for Generative Modeling of Discrete Data 4
Cognitive model priors for predicting human decisions 3
Collaborative Channel Pruning for Deep Networks 5
Collaborative Evolutionary Reinforcement Learning 4
Collective Model Fusion for Multiple Black-Box Experts 1
Combating Label Noise in Deep Learning using Abstention 3
Combining parametric and nonparametric models for off-policy evaluation 3
Communication Complexity in Locally Private Distribution Estimation and Heavy Hitters 2
Communication-Constrained Inference and the Role of Shared Randomness 0
CompILE: Compositional Imitation Learning and Execution 2
Competing Against Nash Equilibria in Adversarially Changing Zero-Sum Games 1
Complementary-Label Learning for Arbitrary Losses and Models 5
Complexity of Linear Regions in Deep Networks 2
Composable Core-sets for Determinant Maximization: A Simple Near-Optimal Algorithm 3
Composing Entropic Policies using Divergence Correction 1
Composing Value Functions in Reinforcement Learning 2
Compositional Fairness Constraints for Graph Embeddings 2
Compressed Factorization: Fast and Accurate Low-Rank Factorization of Compressively-Sensed Data 3
Compressing Gradient Optimizers via Count-Sketches 5
Concentration Inequalities for Conditional Value at Risk 1
Concrete Autoencoders: Differentiable Feature Selection and Reconstruction 5
Conditional Gradient Methods via Stochastic Path-Integrated Differential Estimator 1
Conditional Independence in Testing Bayesian Networks 3
Conditioning by adaptive sampling for robust design 3
Connectivity-Optimized Representation Learning via Persistent Homology 5
Context-Aware Zero-Shot Learning for Object Recognition 3
Contextual Memory Trees 5
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model 3
Control Regularization for Reduced Variance Reinforcement Learning 4
Convolutional Poisson Gamma Belief Network 5
Coresets for Ordered Weighted Clustering 3
Correlated Variational Auto-Encoders 5
Correlated bandits or: How to minimize mean-squared error online 1
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models 2
Counterfactual Visual Explanations 4
Cross-Domain 3D Equivariant Image Embeddings 3
Curiosity-Bottleneck: Exploration By Distilling Task-Specific Novelty 4
Curvature-Exploiting Acceleration of Elastic Net Computations 3
DAG-GNN: DAG Structure Learning with Graph Neural Networks 3
DBSCAN++: Towards fast and scalable density clustering 3
DL2: Training and Querying Neural Networks with Logic 5
DP-GP-LVM: A Bayesian Non-Parametric Model for Learning Multivariate Dependency Structures 2
Data Poisoning Attacks on Stochastic Bandits 2
Data Shapley: Equitable Valuation of Data for Machine Learning 3
Dead-ends and Secure Exploration in Reinforcement Learning 4
Decentralized Exploration in Multi-Armed Bandits 2
Decentralized Stochastic Optimization and Gossip Algorithms with Compressed Communication 3
Decomposing feature-level variation with Covariate Gaussian Process Latent Variable Models 3
Deep Compressed Sensing 4
Deep Counterfactual Regret Minimization 2
Deep Factors for Forecasting 4
Deep Gaussian Processes with Importance-Weighted Variational Inference 4
Deep Generative Learning via Variational Gradient Flow 2
Deep Residual Output Layers for Neural Language Generation 4
DeepMDP: Learning Continuous Latent Space Models for Representation Learning 3
DeepNose: Using artificial neural networks to represent the space of odorants 3
Defending Against Saddle Point Attack in Byzantine-Robust Distributed Learning 1
Demystifying Dropout 3
Detecting Overlapping and Correlated Communities without Pure Nodes: Identifiability and Algorithm 3
Diagnosing Bottlenecks in Deep Q-learning Algorithms 3
Differentiable Dynamic Normalization for Learning Deep Representation 3
Differentiable Linearized ADMM 3
Differential Inclusions for Modeling Nonsmooth ADMM Variants: A Continuous Limit Theory 1
Differentially Private Empirical Risk Minimization with Non-convex Loss Functions 1
Differentially Private Fair Learning 3
Differentially Private Learning of Geometric Concepts 1
Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning 4
Dimensionality Reduction for Tukey Regression 3
Direct Uncertainty Prediction for Medical Second Opinions 3
Dirichlet Simplex Nest and Geometric Inference 3
Discovering Conditionally Salient Features with Statistical Guarantees 1
Discovering Context Effects from Raw Choice Data 2
Discovering Latent Covariance Structures for Multiple Time Series 4
Discovering Options for Exploration by Minimizing Cover Time 3
Discriminative Regularization for Latent Variable Models with Applications to Electrocardiography 3
Disentangled Graph Convolutional Networks 4
Disentangling Disentanglement in Variational Autoencoders 2
Distributed Learning over Unreliable Networks 5
Distributed Learning with Sublinear Communication 1
Distributed Weighted Matching via Randomized Composable Coresets 1
Distributed, Egocentric Representations of Graphs for Detecting Critical Structures 5
Distribution calibration for regression 2
Distributional Multivariate Policy Evaluation and Exploration with the Bellman GAN 4
Distributional Reinforcement Learning for Efficient Exploration 3
Do ImageNet Classifiers Generalize to ImageNet? 3
Does Data Augmentation Lead to Positive Margin? 0
Domain Adaptation with Asymmetrically-Relaxed Distribution Alignment 1
Domain Agnostic Learning with Disentangled Representations 3
DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-pass Error-Compensated Compression 4
Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random 4
Doubly-Competitive Distribution Estimation 0
Dropout as a Structured Shrinkage Prior 2
Dual Entangled Polynomial Code: Three-Dimensional Coding for Distributed Matrix Multiplication 2
Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem 2
Dynamic Measurement Scheduling for Event Forecasting using Deep RL 4
Dynamic Weights in Multi-Objective Deep Reinforcement Learning 4
EDDI: Efficient Dynamic Discovery of High-Value Information with Partial VAE 3
ELF OpenGo: an analysis and open reimplementation of AlphaZero 4
EMI: Exploration with Mutual Information 4
Efficient Amortised Bayesian Inference for Hierarchical and Nonlinear Dynamical Systems 2
Efficient Dictionary Learning with Gradient Descent 0
Efficient Full-Matrix Adaptive Regularization 4
Efficient Nonconvex Regularized Tensor Completion with Structure-aware Proximal Iterations 5
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables 4
Efficient On-Device Models using Neural Projections 3
Efficient Training of BERT by Progressively Stacking 6
Efficient learning of smooth probability functions from Bernoulli tests with guarantees 2
Efficient optimization of loops and limits with randomized telescoping sums 5
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks 4
EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis 4
Emerging Convolutions for Generative Normalizing Flows 2
Empirical Analysis of Beam Search Performance Degradation in Neural Sequence Models 1
End-to-End Probabilistic Inference for Nonstationary Audio Analysis 3
Entropic GANs meet VAEs: A Statistical Approach to Compute Sample Likelihoods in GANs 4
Equivariant Transformer Networks 4
Error Feedback Fixes SignSGD and other Gradient Compression Schemes 4
Escaping Saddle Points with Adaptive Gradient Methods 3
Estimate Sequences for Variance-Reduced Stochastic Composite Optimization 3
Estimating Information Flow in Deep Neural Networks 2
Explaining Deep Neural Networks with a Polynomial Time Algorithm for Shapley Value Approximation 3
Exploiting Worker Correlation for Label Aggregation in Crowdsourcing 3
Exploiting structure of uncertainty for efficient matroid semi-bandits 2
Exploration Conscious Reinforcement Learning Revisited 4
Exploring interpretable LSTM neural networks over multi-variable data 3
Exploring the Landscape of Spatial Robustness 2
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations 5
Fair Regression: Quantitative Definitions and Reduction-Based Algorithms 3
Fair k-Center Clustering for Data Summarization 5
Fairness risk measures 3
Fairness without Harm: Decoupled Classifiers with Preference Guarantees 4
Fairness-Aware Learning for Continuous Attributes and Treatments 4
Fairwashing: the risk of rationalization 5
Fast Algorithm for Generalized Multinomial Models with Ranking Data 3
Fast Context Adaptation via Meta-Learning 5
Fast Direct Search in an Optimally Compressed Continuous Target Space for Efficient Multi-Label Active Learning 3
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications 6
Fast Rates for a kNN Classifier Robust to Unknown Asymmetric Label Noise 1
Fast and Flexible Inference of Joint Distributions from their Marginals 4
Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations 3
Fast and Stable Maximum Likelihood Estimation for Incomplete Multinomial Models 2
Faster Algorithms for Binary Matrix Factorization 2
Faster Attend-Infer-Repeat with Tractable Probabilistic Models 3
Faster Stochastic Alternating Direction Method of Multipliers for Nonconvex Optimization 4
Fault Tolerance in Iterative-Convergent Machine Learning 3
Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data 6
Feature-Critic Networks for Heterogeneous Domain Generalization 5
Finding Mixed Nash Equilibria of Generative Adversarial Networks 3
Finding Options that Minimize Planning Time 0
Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks 2
Fingerprint Policy Optimisation for Robust Reinforcement Learning 4
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation on Multi-Agent Reinforcement Learning 1
First-Order Adversarial Vulnerability of Neural Networks and Input Dimension 3
First-Order Algorithms Converge Faster than $O(1/k)$ on Convex Problems 0
Flat Metric Minimization with Applications in Generative Modeling 3
Flexibly Fair Representation Learning by Disentanglement 1
FloWaveNet : A Generative Flow for Raw Audio 4
Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design 4
Formal Privacy for Functional Data with Gaussian Perturbations 4
Functional Transparency for Structured Data: a Game-Theoretic Approach 2
GDPP: Learning Diverse Generations using Determinantal Point Processes 3
GEOMetrics: Exploiting Geometric Structure for Graph-Encoded Objects 4
GMNN: Graph Markov Neural Networks 4
GOODE: A Gaussian Off-The-Shelf Ordinary Differential Equation Solver 3
Gaining Free or Low-Cost Interpretability with Interpretable Partial Substitute 5
Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function 1
Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits 3
Gauge Equivariant Convolutional Networks and the Icosahedral CNN 3
Generalized Approximate Survey Propagation for High-Dimensional Estimation 2
Generalized Linear Rule Models 4
Generalized Majorization-Minimization 4
Generalized No Free Lunch Theorem for Adversarial Robustness 2
Generative Adversarial User Model for Reinforcement Learning Based Recommendation System 3
Generative Modeling of Infinite Occluded Objects for Compositional Scene Representation 3
Geometric Losses for Distributional Learning 4
Geometric Scattering for Graph Data Analysis 3
Geometry Aware Convolutional Filters for Omnidirectional Images Representation 3
Geometry and Symmetry in Short-and-Sparse Deconvolution 2
Global Convergence of Block Coordinate Descent in Deep Learning 3
Good Initializations of Variational Bayes for Deep Models 4
Gradient Descent Finds Global Minima of Deep Neural Networks 0
Graph Convolutional Gaussian Processes 2
Graph Element Networks: adaptive, structured computation and memory 3
Graph Matching Networks for Learning the Similarity of Graph Structured Objects 3
Graph Neural Network for Music Score Data and Modeling Expressive Piano Performance 3
Graph Resistance and Learning from Pairwise Comparisons 0
Graph U-Nets 3
Graphical-model based estimation and inference for differential privacy 2
Graphite: Iterative Generative Modeling of Graphs 2
Greedy Layerwise Learning Can Scale To ImageNet 4
Greedy Orthogonal Pivoting Algorithm for Non-Negative Matrix Factorization 4
Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI 1
Gromov-Wasserstein Learning for Graph Matching and Node Embedding 5
Guarantees for Spectral Clustering with Fairness Constraints 4
Guided evolutionary strategies: augmenting random search with surrogate gradients 3
HOList: An Environment for Machine Learning of Higher Order Logic Theorem Proving 4
Hessian Aided Policy Gradient 4
Heterogeneous Model Reuse via Optimizing Multiparty Multiclass Margin 2
HexaGAN: Generative Adversarial Nets for Real World Classification 4
Hierarchical Decompositional Mixtures of Variational Autoencoders 5
Hierarchical Importance Weighted Autoencoders 2
Hierarchically Structured Meta-learning 3
High-Fidelity Image Generation With Fewer Labels 5
Hiring Under Uncertainty 2
Homomorphic Sensing 3
How does Disagreement Help Generalization against Label Corruption? 4
Humor in Word Embeddings: Cockamamie Gobbledegook for Nincompoops 4
Hybrid Models with Deep and Invertible Features 2
HyperGAN: A Generative Model for Diverse, Performant Neural Networks 3
Hyperbolic Disk Embeddings for Directed Acyclic Graphs 2
IMEXnet A Forward Stable Deep Neural Network 5
Imitating Latent Policies from Observation 3
Imitation Learning from Imperfect Demonstration 3
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition 2
Importance Sampling Policy Evaluation with an Estimated Behavior Policy 1
Improved Convergence for $\ell_1$ and $\ell_∞$ Regression via Iteratively Reweighted Least Squares 1
Improved Dynamic Graph Learning through Fault-Tolerant Sparsification 3
Improved Parallel Algorithms for Density-Based Network Clustering 2
Improved Zeroth-Order Variance Reduced Algorithms and Analysis for Nonconvex Optimization 3
Improving Adversarial Robustness via Promoting Ensemble Diversity 5
Improving Model Selection by Employing the Test Data 2
Improving Neural Language Modeling via Adversarial Training 5
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting 5
Imputing Missing Events in Continuous-Time Event Streams 5
Incorporating Grouping Information into Bayesian Decision Tree Ensembles 3
Incremental Randomized Sketching for Online Kernel Learning 5
Inference and Sampling of $K_33$-free Ising Models 1
Inferring Heterogeneous Causal Effects in Presence of Spatial Confounding 3
Infinite Mixture Prototypes for Few-shot Learning 5
Information-Theoretic Considerations in Batch Reinforcement Learning 0
Insertion Transformer: Flexible Sequence Generation via Insertion Operations 4
Interpreting Adversarially Trained Convolutional Neural Networks 3
Invariant-Equivariant Representation Learning for Multi-Class Data 3
Invertible Residual Networks 5
Iterative Linearized Control: Stable Algorithms and Complexity Guarantees 3
Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks 3
Jumpout : Improved Dropout for Deep Neural Networks with ReLUs 4
Katalyst: Boosting Convex Katayusha for Non-Convex Problems with a Large Condition Number 3
Kernel Mean Matching for Content Addressability of GANs 3
Kernel Normalized Cut: a Theoretical Revisit 2
Kernel-Based Reinforcement Learning in Robust Markov Decision Processes 3
LGM-Net: Learning to Generate Matching Networks for Few-Shot Learning 5
LIT: Learned Intermediate Representation Training for Model Compression 4
LR-GLM: High-Dimensional Bayesian Inference Using Low-Rank Data Approximations 3
Ladder Capsule Network 4
Large-Scale Sparse Kernel Canonical Correlation Analysis 5
Latent Normalizing Flows for Discrete Sequences 3
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition 4
Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting 1
Learning Action Representations for Reinforcement Learning 2
Learning Classifiers for Target Domain with Limited or No Labels 3
Learning Context-dependent Label Permutations for Multi-label Classification 4
Learning Dependency Structures for Weak Supervision Models 2
Learning Discrete Structures for Graph Neural Networks 5
Learning Discrete and Continuous Factors of Data via Alternating Disentanglement 5
Learning Distance for Sequences by Learning a Ground Metric 4
Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations 3
Learning Generative Models across Incomparable Spaces 3
Learning Hawkes Processes Under Synchronization Noise 4
Learning Latent Dynamics for Planning from Pixels 5
Learning Linear-Quadratic Regulators Efficiently with only $\sqrtT$ Regret 1
Learning Models from Data with Measurement Error: Tackling Underreporting 1
Learning Neurosymbolic Generative Models via Program Synthesis 2
Learning Novel Policies For Tasks 2
Learning Optimal Fair Policies 1
Learning Optimal Linear Regularizers 4
Learning Structured Decision Problems with Unawareness 2
Learning What and Where to Transfer 3
Learning a Compressed Sensing Measurement Matrix via Gradient Unrolling 5
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning 3
Learning and Data Selection in Big Datasets 4
Learning deep kernels for exponential family densities 5
Learning from Delayed Outcomes via Proxies with Applications to Recommender Systems 3
Learning from a Learner 3
Learning interpretable continuous-time models of latent stochastic dynamical systems 1
Learning to Clear the Market 2
Learning to Collaborate in Markov Decision Processes 1
Learning to Convolve: A Generalized Weight-Tying Approach 3
Learning to Exploit Long-term Relational Dependencies in Knowledge Graphs 4
Learning to Generalize from Sparse and Underspecified Rewards 4
Learning to Groove with Inverse Sequence Transformations 3
Learning to Infer Program Sketches 4
Learning to Optimize Multigrid PDE Solvers 2
Learning to Prove Theorems via Interacting with Proof Assistants 5
Learning to Route in Similarity Graphs 6
Learning to bid in revenue-maximizing auctions 3
Learning to select for a predefined ranking 4
Learning with Bad Training Data via Iterative Trimmed Loss Minimization 3
Learning-to-Learn Stochastic Gradient Descent with Biased Regularization 4
LegoNet: Efficient Convolutional Neural Networks with Lego Filters 4
Leveraging Low-Rank Relations Between Surrogate Tasks in Structured Prediction 5
Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models 0
Linear-Complexity Data-Parallel Earth Mover’s Distance Approximations 4
Lipschitz Generative Adversarial Nets 3
Locally Private Bayesian Inference for Count Models 2
Look Ma, No Latent Variables: Accurate Cutset Networks via Compilation 4
Lorentzian Distance Learning for Hyperbolic Representations 3
Loss Landscapes of Regularized Linear Autoencoders 4
Lossless or Quantized Boosting with Integer Arithmetic 5
Low Latency Privacy Preserving Inference 5
Lower Bounds for Smooth Nonconvex Finite-Sum Optimization 0
MASS: Masked Sequence to Sequence Pre-training for Language Generation 5
ME-Net: Towards Effective Adversarial Robustness with Matrix Estimation 3
MIWAE: Deep Generative Modelling and Imputation of Incomplete Data Sets 3
MONK Outlier-Robust Mean Embedding Estimation by Median-of-Means 4
Making Convolutional Networks Shift-Invariant Again 3
Making Decisions that Reduce Discriminatory Impacts 3
Making Deep Q-learning methods robust to time discretization 3
Mallows ranking models: maximum likelihood estimate and regeneration 3
Manifold Mixup: Better Representations by Interpolating Hidden States 2
Matrix-Free Preconditioning in Online Learning 3
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning 3
Maximum Likelihood Estimation for Learning Populations of Parameters 2
MeanSum: A Neural Model for Unsupervised Multi-Document Abstractive Summarization 4
Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians 3
Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Applications 4
Meta-Learning Neural Bloom Filters 4
Metric-Optimized Example Weights 5
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement 4
Metropolis-Hastings Generative Adversarial Networks 5
Minimal Achievable Sufficient Statistic Learning 3
MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing 5
Mixture Models for Diverse Machine Translation: Tricks of the Trade 5
Model Comparison for Semantic Grouping 2
Model Function Based Conditional Gradient Method with Armijo-like Line Search 2
Model-Based Active Exploration 4
Molecular Hypergraph Grammar with Its Application to Molecular Optimization 4
Moment-Based Variational Inference for Markov Jump Processes 1
Monge blunts Bayes: Hardness Results for Adversarial Training 2
More Efficient Off-Policy Evaluation through Regularized Targeted Learning 2
Multi-Agent Adversarial Inverse Reinforcement Learning 3
Multi-Frequency Phase Synchronization 2
Multi-Frequency Vector Diffusion Maps 2
Multi-Object Representation Learning with Iterative Variational Inference 3
Multi-objective training of Generative Adversarial Networks with multiple discriminators 3
Multiplicative Weights Updates as a distributed constrained optimization algorithm: Convergence to second-order stationary points almost always 0
Multivariate Submodular Optimization 0
Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching 2
Myopic Posterior Sampling for Adaptive Goal Oriented Design of Experiments 3
NAS-Bench-101: Towards Reproducible Neural Architecture Search 5
NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks 4
Natural Analysts in Adaptive Data Analysis 0
Near optimal finite time identification of arbitrary linear dynamical systems 0
Nearest Neighbor and Kernel Survival Analysis: Nonasymptotic Error Bounds and Strong Consistency Rates 3
Neural Collaborative Subspace Clustering 5
Neural Inverse Knitting: From Images to Manufacturing Instructions 5
Neural Joint Source-Channel Coding 3
Neural Logic Reinforcement Learning 2
Neural Network Attributions: A Causal Perspective 3
Neural Separation of Observed and Unobserved Distributions 3
Neurally-Guided Structure Inference 5
Neuron birth-death dynamics accelerates gradient descent and converges asymptotically 2
New results on information theoretic clustering 4
Noise2Self: Blind Denoising by Self-Supervision 3
Noisy Dual Principal Component Pursuit 4
Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization 0
Non-Monotonic Sequential Text Generation 4
Non-Parametric Priors For Generative Adversarial Networks 2
Non-monotone Submodular Maximization with Nearly Optimal Adaptivity and Query Complexity 3
Nonconvex Variance Reduced Optimization with Arbitrary Sampling 3
Nonlinear Distributional Gradient Temporal-Difference Learning 3
Nonlinear Stein Variational Gradient Descent for Learning Diversified Mixture Models 5
Nonparametric Bayesian Deep Networks with Local Competition 3
Obtaining Fairness using Optimal Transport Theory 2
Off-Policy Deep Reinforcement Learning without Exploration 4
On Certifying Non-Uniform Bounds against Adversarial Attacks 5
On Connected Sublevel Sets in Deep Learning 0
On Dropout and Nuclear Norm Regularization 2
On Efficient Optimal Transport: An Analysis of Greedy and Accelerated Mirror Descent Algorithms 3
On Learning Invariant Representations for Domain Adaptation 2
On Medians of (Randomized) Pairwise Means 1
On Scalable and Efficient Computation of Large Scale Optimal Transport 4
On Sparse Linear Regression in the Local Differential Privacy Model 2
On Symmetric Losses for Learning from Corrupted Labels 3
On The Power of Curriculum Learning in Training Deep Networks 5
On Variational Bounds of Mutual Information 2
On discriminative learning of prediction uncertainty 3
On the Complexity of Approximating Wasserstein Barycenters 2
On the Computation and Communication Complexity of Parallel SGD with Dynamic Batch Sizes for Stochastic Non-Convex Optimization 3
On the Connection Between Adversarial Robustness and Saliency Map Interpretability 4
On the Convergence and Robustness of Adversarial Training 3
On the Design of Estimators for Bandit Off-Policy Evaluation 2
On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference 3
On the Generalization Gap in Reparameterizable Reinforcement Learning 2
On the Impact of the Activation function on Deep Neural Networks Training 3
On the Limitations of Representing Functions on Sets 1
On the Linear Speedup Analysis of Communication Efficient Momentum SGD for Distributed Non-Convex Optimization 5
On the Long-term Impact of Algorithmic Decision Policies: Effort Unfairness and Feature Segregation through Social Learning 2
On the Spectral Bias of Neural Networks 3
On the Universality of Invariant Networks 0
On the statistical rate of nonlinear recovery in generative models with heavy-tailed data 0
Online Adaptive Principal Component Analysis and Its extensions 4
Online Algorithms for Rent-Or-Buy with Expert Advice 2
Online Control with Adversarial Disturbances 1
Online Convex Optimization in Adversarial Markov Decision Processes 1
Online Learning to Rank with Features 4
Online Learning with Sleeping Experts and Feedback Graphs 2
Online Meta-Learning 3
Online Variance Reduction with Mixtures 5
Online learning with kernel losses 1
Open Vocabulary Learning on Source Code with a Graph-Structured Cache 3
Open-ended learning in symmetric zero-sum games 3
Optimal Algorithms for Lipschitz Bandits with Heavy-tailed Rewards 2
Optimal Auctions through Deep Learning 4
Optimal Continuous DR-Submodular Maximization and Applications to Provable Mean Field Inference 4
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning 5
Optimal Mini-Batch and Step Sizes for SAGA 4
Optimal Minimal Margin Maximization with Boosting 4
Optimal Transport for structured data with application on graphs 5
Optimality Implies Kernel Sum Classifiers are Statistically Efficient 1
Optimistic Policy Optimization via Multiple Importance Sampling 4
Orthogonal Random Forest for Causal Inference 3
Overcoming Mean-Field Approximations in Recurrent Gaussian Process Models 3
Overcoming Multi-model Forgetting 4
Overparameterized Nonlinear Learning: Gradient Descent Takes the Shortest Path? 2
PA-GD: On the Convergence of Perturbed Alternating Gradient Descent to Second-Order Stationary Points for Structured Nonconvex Optimization 2
PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits 2
PAC Learnability of Node Functions in Networked Dynamical Systems 2
POLITEX: Regret Bounds for Policy Iteration using Expert Prediction 3
POPQORN: Quantifying Robustness of Recurrent Neural Networks 5
PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach 4
Parameter efficient training of deep convolutional neural networks by dynamic sparse reparameterization 5
Parameter-Efficient Transfer Learning for NLP 4
Pareto Optimal Streaming Unsupervised Classification 2
Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization 5
Partially Exchangeable Networks and Architectures for Learning Summary Statistics in Approximate Bayesian Computation 2
Partially Linear Additive Gaussian Graphical Models 2
Particle Flow Bayes’ Rule 3
Passed & Spurious: Descent Algorithms and Local Minima in Spiked Matrix-Tensor Models 2
Per-Decision Option Discounting 2
Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size! 3
Phaseless PCA: Low-Rank Matrix Recovery from Column-wise Phaseless Measurements 2
Plug-and-Play Methods Provably Converge with Properly Trained Denoisers 4
Poission Subsampled Rényi Differential Privacy 2
Policy Certificates: Towards Accountable Reinforcement Learning 2
Policy Consolidation for Continual Reinforcement Learning 2
Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules 6
Position-aware Graph Neural Networks 4
Power k-Means Clustering 3
Predicate Exchange: Inference with Declarative Knowledge 4
Predictor-Corrector Policy Optimization 4
Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering 4
Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning 1
Processing Megapixel Images with Deep Attention-Sampling Models 3
Projection onto Minkowski Sums with Application to Constrained Learning 3
Projections for Approximate Policy Iteration Algorithms 5
Proportionally Fair Clustering 3
Provable Guarantees for Gradient-Based Meta-Learning 3
Provably Efficient Imitation Learning from Observation Alone 4
Provably Efficient Maximum Entropy Exploration 4
Provably efficient RL with Rich Observations via Latent State Decoding 3
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning 2
Quantifying Generalization in Reinforcement Learning 2
Quantile Stein Variational Gradient Descent for Batch Bayesian Optimization 3
RaFM: Rank-Aware Factorization Machines 5
Rademacher Complexity for Adversarially Robust Generalization 2
Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation 3
Random Function Priors for Correlation Modeling 3
Random Matrix Improved Covariance Estimation for a Large Class of Metrics 4
Random Shuffling Beats SGD after Finite Epochs 0
Random Walks on Hypergraphs with Edge-Dependent Vertex Weights 3
Rao-Blackwellized Stochastic Gradients for Discrete Distributions 4
Rate Distortion For Model Compression:From Theory To Practice 2
Rates of Convergence for Sparse Variational Gaussian Process Regression 1
Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces 2
Recursive Sketches for Modular Deep Learning 1
Refined Complexity of PCA with Outliers 0
Regret Circuits: Composability of Regret Minimizers 0
Regularization in directable environments with application to Tetris 3
Rehashing Kernel Evaluation in High Dimensions 4
Reinforcement Learning in Configurable Continuous Environments 2
Relational Pooling for Graph Representations 3
Remember and Forget for Experience Replay 3
Repairing without Retraining: Avoiding Disparate Impact with Counterfactual Distributions 5
Replica Conditional Sequential Monte Carlo 4
Rethinking Lossy Compression: The Rate-Distortion-Perception Tradeoff 1
Revisiting precision recall definition for generative modeling 3
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective 3
Riemannian adaptive stochastic gradient algorithms on matrix manifolds 5
Robust Decision Trees Against Adversarial Examples 5
Robust Estimation of Tree Structured Gaussian Graphical Models 0
Robust Inference via Generative Classifiers for Handling Noisy Labels 4
Robust Influence Maximization for Hyperparametric Models 2
Robust Learning from Untrusted Sources 5
Robustly Disentangled Causal Mechanisms: Validating Deep Representations for Interventional Robustness 3
Rotation Invariant Householder Parameterization for Bayesian PCA 3
SAGA with Arbitrary Sampling 3
SATNet: Bridging deep learning and logical reasoning using a differentiable satisfiability solver 4
SELFIE: Refurbishing Unclean Samples for Robust Deep Learning 6
SGD without Replacement: Sharper Rates for General Smooth Convex Functions 1
SGD: General Analysis and Improved Rates 2
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning 4
SWALP : Stochastic Weight Averaging in Low Precision Training 4
Safe Grid Search with Optimal Complexity 5
Safe Policy Improvement with Baseline Bootstrapping 3
Same, Same But Different: Recovering Neural Network Quantization Error Through Weight Factorization 5
Sample-Optimal Parametric Q-Learning Using Linearly Additive Features 1
Scalable Fair Clustering 4
Scalable Learning in Reproducing Kernel Krein Spaces 3
Scalable Metropolis-Hastings for Exact Bayesian Inference with Large Datasets 3
Scalable Nonparametric Sampling from Multimodal Posteriors with the Posterior Bootstrap 6
Scalable Training of Inference Networks for Gaussian-Process Models 5
Scale-free adaptive planning for deterministic dynamics & discounted rewards 2
Scaling Up Ordinal Embedding: A Landmark Approach 5
Screening rules for Lasso with non-convex Sparse Regularizers 4
SelectiveNet: A Deep Neural Network with an Integrated Reject Option 3
Self-Attention Generative Adversarial Networks 3
Self-Attention Graph Pooling 5
Self-Supervised Exploration via Disagreement 4
Self-similar Epochs: Value in arrangement 3
Semi-Cyclic Stochastic Gradient Descent 3
Sensitivity Analysis of Linear Structural Causal Models 2
Separating value functions across time-scales 3
Sequential Facility Location: Approximate Submodularity and Greedy Algorithm 3
Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks 3
Sever: A Robust Meta-Algorithm for Stochastic Optimization 5
Shallow-Deep Networks: Understanding and Mitigating Network Overthinking 4
Shape Constraints for Set Functions 2
Similarity of Neural Network Representations Revisited 2
Simple Black-box Adversarial Attacks 5
Simple Stochastic Gradient Methods for Non-Smooth Non-Convex Regularized Optimization 4
Simplifying Graph Convolutional Networks 5
Sliced-Wasserstein Flows: Nonparametric Generative Modeling via Optimal Transport and Diffusions 3
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning 2
Sorting Out Lipschitz Function Approximation 3
Sparse Extreme Multi-label Learning with Oracle Property 4
Sparse Multi-Channel Variational Autoencoder for the Joint Analysis of Heterogeneous Data 3
Spectral Approximate Inference 3
Spectral Clustering of Signed Graphs via Matrix Power Means 3
Stable and Fair Classification 3
Stable-Predictive Optimistic Counterfactual Regret Minimization 2
State-Regularized Recurrent Neural Networks 6
State-Reification Networks: Improving Generalization by Modeling the Distribution of Hidden Representations 2
Static Automatic Batching In TensorFlow 3
Statistical Foundations of Virtual Democracy 2
Statistics and Samples in Distributional Reinforcement Learning 3
Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging 2
Stein Point Markov Chain Monte Carlo 2
Stochastic Beams and Where To Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement 4
Stochastic Blockmodels meet Graph Neural Networks 3
Stochastic Deep Networks 3
Stochastic Gradient Push for Distributed Deep Learning 6
Stochastic Iterative Hard Thresholding for Graph-structured Sparsity Optimization 6
Stochastic Optimization for DC Functions and Non-smooth Non-convex Regularizers with Non-asymptotic Convergence 3
Structured agents for physical construction 1
Sublinear Space Private Algorithms Under the Sliding Window Model 1
Sublinear Time Nearest Neighbor Search over Generalized Weighted Space 2
Sublinear quantum algorithms for training linear and kernel-based classifiers 1
Submodular Cost Submodular Cover with an Approximate Oracle 2
Submodular Maximization beyond Non-negativity: Guarantees, Fast Algorithms, and Applications 5
Submodular Observation Selection and Information Gathering for Quadratic Models 2
Submodular Streaming in All Its Glory: Tight Approximation, Minimum Memory and Low Adaptive Complexity 2
Subspace Robust Wasserstein Distances 2
Sum-of-Squares Polynomial Flow 3
Supervised Hierarchical Clustering with Exponential Linkage 5
Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization 3
Switching Linear Dynamics for Variational Bayes Filtering 0
Taming MAML: Efficient unbiased meta-reinforcement learning 1
TapNet: Neural Network Augmented with Task-Adaptive Projection for Few-Shot Learning 5
TarMAC: Targeted Multi-Agent Communication 2
Target Tracking for Contextual Bandits: Application to Demand Side Management 3
Target-Based Temporal-Difference Learning 2
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning 3
Teaching a black-box learner 3
Temporal Gaussian Mixture Layer for Videos 4
Tensor Variable Elimination for Plated Factor Graphs 4
TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing 5
The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects 2
The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study 2
The Evolved Transformer 5
The Implicit Fairness Criterion of Unconstrained Learning 3
The Kernel Interaction Trick: Fast Bayesian Discovery of Pairwise Interactions in High Dimensions 3
The Natural Language of Actions 1
The Odds are Odd: A Statistical Test for Detecting Adversarial Examples 3
The Value Function Polytope in Reinforcement Learning 1
The Variational Predictive Natural Gradient 4
The Wasserstein Transform 2
The advantages of multiple classes for reducing overfitting from test set reuse 2
The information-theoretic value of unlabeled data in semi-supervised learning 0
Theoretically Principled Trade-off between Robustness and Accuracy 4
TibGM: A Transferable and Information-Based Graphical Model Approach for Reinforcement Learning 3
Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel $k$-means Clustering 0
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds 1
Topological Data Analysis of Decision Boundaries with Application to Model Selection 3
Toward Controlling Discrimination in Online Ad Auctions 3
Toward Understanding the Importance of Noise in Training Neural Networks 2
Towards Accurate Model Selection in Deep Unsupervised Domain Adaptation 5
Towards Understanding Knowledge Distillation 2
Towards a Deep and Unified Understanding of Deep Neural Models in NLP 3
Towards a Unified Analysis of Random Fourier Features 3
Trading Redundancy for Communication: Speeding up Distributed SGD for Non-convex Optimization 4
Traditional and Heavy Tailed Self Regularization in Neural Network Models 3
Trainable Decoding of Sets of Sequences for Neural Sequence Models 5
Training CNNs with Selective Allocation of Channels 4
Training Neural Networks with Local Error Signals 3
Training Well-Generalizing Classifiers for Fairness Metrics and Other Data-Dependent Constraints 4
Trajectory-Based Off-Policy Deep Reinforcement Learning 4
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation 4
Transfer of Samples in Policy Search via Multiple Importance Sampling 1
Transferability vs. Discriminability: Batch Spectral Penalization for Adversarial Domain Adaptation 4
Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers 5
Transferable Clean-Label Poisoning Attacks on Deep Neural Nets 4
Trimming the $\ell_1$ Regularizer: Statistical Analysis, Optimization, and Applications to Deep Learning 4
Understanding Geometry of Encoder-Decoder CNNs 0
Understanding Impacts of High-Order Loss Approximations and Features in Deep Learning Interpretation 5
Understanding MCMC Dynamics as Flows on the Wasserstein Space 3
Understanding Priors in Bayesian Neural Networks at the Unit Level 1
Understanding and Accelerating Particle-Based Variational Inference 4
Understanding and Controlling Memory in Recurrent Neural Networks 4
Understanding and Utilizing Deep Neural Networks Trained with Noisy Labels 5
Understanding and correcting pathologies in the training of learned optimizers 4
Understanding the Impact of Entropy on Policy Optimization 1
Understanding the Origins of Bias in Word Embeddings 4
Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension 0
Unifying Orthogonal Monte Carlo Methods 2
Universal Multi-Party Poisoning Attacks 0
Unreproducible Research is Reproducible 3
Unsupervised Deep Learning by Neighbourhood Discovery 4
Unsupervised Label Noise Modeling and Loss Correction 4
Using Pre-Training Can Improve Model Robustness and Uncertainty 3
Validating Causal Inference Models via Influence Functions 3
Variational Annealing of GANs: A Langevin Perspective 4
Variational Implicit Processes 4
Variational Inference for sparse network reconstruction from count data 3
Variational Laplace Autoencoders 5
Variational Russian Roulette for Deep Bayesian Nonparametrics 4
Voronoi Boundary Classification: A High-Dimensional Geometric Approach via Weighted Monte Carlo Integration 6
Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback 4
Wasserstein Adversarial Examples via Projected Sinkhorn Iterations 5
Wasserstein of Wasserstein Loss for Learning Generative Models 3
Weak Detection of Signal in the Spiked Wigner Model 1
Weakly-Supervised Temporal Localization via Occurrence Count Learning 3
What is the Effect of Importance Weighting in Deep Learning? 2
When Samples Are Strategically Selected 0
White-box vs Black-box: Bayes Optimal Strategies for Membership Inference 3
Why do Larger Models Generalize Better? A Theoretical Perspective via the XOR Problem 2
Width Provably Matters in Optimization for Deep Linear Neural Networks 0
Zeno: Distributed Stochastic Gradient Descent with Suspicion-based Fault-tolerance 4
Zero-Shot Knowledge Distillation in Deep Networks 3
kernelPSI: a Post-Selection Inference Framework for Nonlinear Variable Selection 2