Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

International Conference on Machine Learning (ICML) - 2020

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
ICML 2020 1084 0.52 3.43 90.68% 44.05%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
(Locally) Differentially Private Combinatorial Semi-Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Chance-Constrained Generative Framework for Sequence Optimization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
A Distributional Framework For Data Valuation βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Flexible Framework for Nonparametric Graphical Modeling that Accommodates Machine Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A Flexible Latent Space Model for Multilayer Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Free-Energy Principle for Representation Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A Game Theoretic Framework for Model Based Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Generative Model for Molecular Distance Geometry ❌ βœ… βœ… βœ… βœ… ❌ ❌ 4
A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
A Geometric Approach to Archetypal Analysis via Sparse Projections βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
A Graph to Graphs Framework for Retrosynthesis Prediction ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
A Markov Decision Process Model for Socio-Economic Systems Impacted by Climate Change βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Mean Field Analysis Of Deep ResNet And Beyond: Towards Provably Optimization Via Overparameterization From Depth βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Natural Lottery Ticket Winner: Reinforcement Learning with Ordinary Neural Circuits βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
A Pairwise Fair and Community-preserving Approach to k-Center Clustering βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Quantile-based Approach for Hyperparameter Transfer Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
A Sample Complexity Separation between Non-Convex and Convex Meta-Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
A Simple Framework for Contrastive Learning of Visual Representations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
A Swiss Army Knife for Minimax Optimal Transport βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Tree-Structured Decoder for Image-to-Markup Generation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
A Unified Theory of Decentralized SGD with Changing Topology and Local Updates βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
A distributional view on multi-objective policy optimization βœ… βœ… ❌ ❌ ❌ βœ… βœ… 4
A general recurrent state space framework for modeling neural dynamics during decision-making ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
A new regret analysis for Adam-type algorithms βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A simpler approach to accelerated optimization: iterative averaging meets optimism βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
ACFlow: Flow Models for Arbitrary Conditional Likelihoods ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Abstraction Mechanisms Predict Generalization in Deep Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Accelerated Message Passing for Entropy-Regularized MAP Inference βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Accelerated Stochastic Gradient-free and Projection-free Methods βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Accelerating Large-Scale Inference with Anisotropic Vector Quantization ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Accelerating the diffusion-based ensemble sampling by non-reversible dynamics ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Acceleration for Compressed Gradient Descent in Distributed and Federated Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Acceleration through spectral density estimation βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Accountable Off-Policy Evaluation With Kernel Bellman Statistics βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Active Learning on Attributed Graphs via Graph Cognizant Logistic Regression and Preemptive Query Generation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Active World Model Learning with Progress Curiosity βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
AdaScale SGD: A User-Friendly Algorithm for Distributed Training βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Adaptive Adversarial Multi-task Representation Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Adaptive Checkpoint Adjoint Method for Gradient Estimation in Neural ODE βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adaptive Droplet Routing in Digital Microfluidic Biochips Using Deep Reinforcement Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Adaptive Estimator Selection for Off-Policy Evaluation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Adaptive Gradient Descent without Descent βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Adaptive Region-Based Active Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Adaptive Reward-Poisoning Attacks against Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Adaptive Sampling for Estimating Probability Distributions βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Adaptive Sketching for Fast and Convergent Canonical Polyadic Decomposition βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Adding seemingly uninformative labels helps in low data regimes ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Adversarial Attacks on Copyright Detection Systems ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Adversarial Attacks on Probabilistic Autoregressive Forecasting Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Adversarial Filters of Dataset Biases βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Adversarial Learning Guarantees for Linear Hypotheses and Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Adversarial Mutual Information for Text Generation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Adversarial Neural Pruning with Latent Vulnerability Suppression βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Adversarial Nonnegative Matrix Factorization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adversarial Risk via Optimal Transport and Optimal Couplings ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Adversarial Robustness Against the Union of Multiple Perturbation Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adversarial Robustness for Code βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adversarial Robustness via Runtime Masking and Cleansing βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Agent57: Outperforming the Atari Human Benchmark ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Aggregation of Multiple Knockoffs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Aligned Cross Entropy for Non-Autoregressive Machine Translation βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
All in the Exponential Family: Bregman Duality in Thermodynamic Variational Inference ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Alleviating Privacy Attacks via Causal Learning ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
Almost Tune-Free Variance Reduction βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Amortised Learning by Wake-Sleep βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Amortized Finite Element Analysis for Fast PDE-Constrained Optimization ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Amortized Population Gibbs Samplers with Neural Sufficient Statistics βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
An Accelerated DFO Algorithm for Finite-sum Convex Functions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
An EM Approach to Non-autoregressive Conditional Sequence Generation βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
An Explicitly Relational Neural Network Architecture ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
An Imitation Learning Approach for Cache Replacement βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
An Investigation of Why Overparameterization Exacerbates Spurious Correlations ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
An Optimistic Perspective on Offline Reinforcement Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
An end-to-end approach for the verification problem: learning the right distance βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Analytic Marching: An Analytic Meshing Solution from Deep Implicit Surface Networks ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Anderson Acceleration of Proximal Gradient Methods ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Angular Visual Hardness ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Approximating Stacked and Bidirectional Recurrent Architectures with the Delayed Recurrent Neural Network ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Approximation Capabilities of Neural ODEs and Invertible Residual Networks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Approximation Guarantees of Local Search Algorithms via Localizability of Set Functions βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Associative Memory in Iterated Overparameterized Sigmoid Autoencoders ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Asynchronous Coagent Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Attacks Which Do Not Kill Training Make Adversarial Learning Stronger βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Attentive Group Equivariant Convolutional Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
AutoGAN-Distiller: Searching to Compress Generative Adversarial Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
AutoML-Zero: Evolving Machine Learning Algorithms From Scratch βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Automated Synthetic-to-Real Generalization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Automatic Reparameterisation of Probabilistic Programs βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Automatic Shortcut Removal for Self-Supervised Representation Learning ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
BINOCULARS for efficient, nonmyopic sequential experimental design βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Balancing Competing Objectives with Noisy Data: Score-Based Classifiers for Welfare-Aware Machine Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Bandits for BMO Functions βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Bandits with Adversarial Scaling βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Batch Reinforcement Learning with Hyperparameter Gradients ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Batch Stationary Distribution Estimation βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Bayesian Differential Privacy for Machine Learning ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Bayesian Experimental Design for Implicit Models by Mutual Information Neural Estimation ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Bayesian Graph Neural Networks with Adaptive Connection Sampling ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Bayesian Learning from Sequential Data using Gaussian Processes with Signature Covariances βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Bayesian Optimisation over Multiple Continuous and Categorical Inputs βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Bayesian Sparsification of Deep C-valued Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Being Bayesian about Categorical Probability ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Better depth-width trade-offs for neural networks through the lens of dynamical systems ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Beyond Signal Propagation: Is Feature Diversity Necessary in Deep Neural Network Initialization? ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Bidirectional Model-based Policy Optimization βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Bio-Inspired Hashing for Unsupervised Similarity Search ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Bisection-Based Pricing for Repeated Contextual Auctions against Strategic Buyer βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Black-Box Methods for Restoring Monotonicity ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Black-Box Variational Inference as a Parametric Approximation to Langevin Dynamics ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Black-box Certification and Learning under Adversarial Perturbations ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
BoXHED: Boosted eXact Hazard Estimator with Dynamic covariates βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Boosted Histogram Transform for Regression βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Boosting Deep Neural Network Efficiency with Dual-Module Inference βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Boosting Frank-Wolfe by Chasing Gradients βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Boosting for Control of Dynamical Systems βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Born-Again Tree Ensembles βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Bounding the fairness and accuracy of classifiers from population statistics βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Bridging the Gap Between f-GANs and Wasserstein GANs βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Budgeted Online Influence Maximization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
CAUSE: Learning Granger Causality from Event Sequences using Attribution Methods βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
CURL: Contrastive Unsupervised Representations for Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Calibration, Entropy Rates, and Memory in Language Models βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts? βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Can Stochastic Zeroth-Order Frank-Wolfe Method Converge Faster for Non-Convex Problems? βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Causal Effect Estimation and Optimal Dose Suggestions in Mobile Health ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Causal Effect Identifiability under Partial-Observability βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Causal Inference using Gaussian Processes with Structured Latent Confounders βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Causal Modeling for Fairness In Dynamical Systems ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
Causal Strategic Linear Regression βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Causal Structure Discovery from Distributions Arising from Mixtures of DAGs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Certified Data Removal from Machine Learning Models βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Certified Robustness to Label-Flipping Attacks via Randomized Smoothing βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Channel Equilibrium Networks for Learning Deep Representation ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Characterizing Distribution Equivalence and Structure Learning for Cyclic and Acyclic Directed Graphs ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Choice Set Optimization Under Discrete Choice Models of Group Decisions βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Circuit-Based Intrinsic Methods to Detect Overfitting ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Class-Weighted Classification: Trade-offs and Robust Approaches ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Closing the convergence gap of SGD without replacement ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
CoMic: Complementary Task Learning & Mimicry for Reusable Skills ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Collaborative Machine Learning with Incentive-Aware Model Rewards ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Collapsed Amortized Variational Inference for Switching Nonlinear Dynamical Systems ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Combinatorial Pure Exploration for Dueling Bandit βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Combining Differentiable PDE Solvers and Graph Neural Networks for Fluid Flow Prediction ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Communication-Efficient Distributed PCA by Riemannian Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Communication-Efficient Distributed Stochastic AUC Maximization with Deep Neural Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Complexity of Finding Stationary Points of Nonconvex Nonsmooth Functions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Composable Sketches for Functions of Frequencies: Beyond the Worst Case ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Compressive sensing with un-trained neural networks: Gradient descent finds a smooth approximation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Computational and Statistical Tradeoffs in Inferring Combinatorial Structures of Ising Model ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
ConQUR: Mitigating Delusional Bias in Deep Q-Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Concept Bottleneck Models ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Concise Explanations of Neural Networks using Adversarial Training ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Conditional gradient methods for stochastically constrained convex minimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Confidence Sets and Hypothesis Testing in a Likelihood-Free Inference Setting βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Confidence-Aware Learning for Deep Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Confidence-Calibrated Adversarial Training: Generalizing to Unseen Attacks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Consistent Estimators for Learning to Defer to an Expert ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Consistent Structured Prediction with Max-Min Margin Markov Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Constant Curvature Graph Convolutional Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Constrained Markov Decision Processes via Backward Value Functions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Constructive Universal High-Dimensional Distribution Generation through Deep ReLU Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Context Aware Local Differential Privacy ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Continuous Graph Neural Networks ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Continuous Time Bayesian Networks with Clocks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Continuous-time Lower Bounds for Gradient-based Algorithms ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Continuously Indexed Domain Adaptation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Contrastive Multi-View Representation Learning on Graphs βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
ControlVAE: Controllable Variational Autoencoder βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Convergence Rates of Variational Inference in Sparse Deep Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Convergence of a Stochastic Gradient Method with Momentum for Non-Smooth Non-Convex Optimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Convex Calibrated Surrogates for the Multi-Label F-Measure βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Convex Representation Learning for Generalized Invariance in Semi-Inner-Product Space ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Convolutional Kernel Networks for Graph-Structured Data βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Convolutional dictionary learning based auto-encoders for natural exponential-family distributions βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Cooperative Multi-Agent Bandits with Heavy Tails βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Coresets for Clustering in Graphs of Bounded Treewidth βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Coresets for Data-efficient Training of Machine Learning Models βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Correlation Clustering with Asymmetric Classification Errors βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Cost-Effective Interactive Attention Learning with Neural Attention Processes βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Cost-effectively Identifying Causal Effects When Only Response Variable is Observable βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Countering Language Drift with Seeded Iterated Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Curse of Dimensionality on Randomized Smoothing for Certifiable Robustness ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Curvature-corrected learning dynamics in deep neural networks ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Customizing ML Predictions for Online Algorithms βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
DINO: Distributed Newton-Type Optimization Method βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
DROCC: Deep Robust One-Class Classification βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
DRWR: A Differentiable Renderer without Rendering for Unsupervised 3D Structure Learning from Silhouette Images ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Data Amplification: Instance-Optimal Property Estimation ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Data Valuation using Reinforcement Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Data preprocessing to mitigate bias: A maximum entropy based approach βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Data-Dependent Differentially Private Parameter Learning for Directed Graphical Models βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Data-Efficient Image Recognition with Contrastive Predictive Coding ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
DeBayes: a Bayesian Method for Debiasing Network Embeddings ❌ ❌ βœ… βœ… ❌ βœ… βœ… 4
Debiased Sinkhorn barycenters βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Decentralised Learning with Random Features and Distributed Gradient Descent ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Decision Trees for Decision-Making under the Predict-then-Optimize Framework ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Decoupled Greedy Learning of CNNs βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Deep Coordination Graphs βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Deep Divergence Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Deep Gaussian Markov Random Fields βœ… βœ… βœ… ❌ βœ… ❌ ❌ 4
Deep Graph Random Process for Relational-Thinking-Based Speech Recognition ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Deep Isometric Learning for Visual Recognition ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Deep Molecular Programming: A Natural Implementation of Binary-Weight ReLU Neural Networks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Deep Reasoning Networks for Unsupervised Pattern De-mixing with Constraint Reasoning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Deep Reinforcement Learning with Robust and Smooth Policy βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Deep Streaming Label Learning ❌ ❌ βœ… βœ… βœ… ❌ ❌ 3
Deep k-NN for Noisy Labels βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
DeepCoDA: personalized interpretability for compositional health data ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
DeepMatch: Balancing Deep Covariate Representations for Causal Inference Using Adversarial Training βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Defense Through Diverse Directions ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
DeltaGrad: Rapid retraining of machine learning models βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Description Based Text Classification with Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Designing Optimal Dynamic Treatment Regimes: A Causal Reinforcement Learning Approach βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Detecting Out-of-Distribution Examples with Gram Matrices βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Differentiable Likelihoods for Fast Inversion of ’Likelihood-Free’ Dynamical Systems βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Differentiable Product Quantization for End-to-End Embedding Compression βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Differentially Private Set Union βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Differentiating through the FrΓ©chet Mean βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Discount Factor as a Regularizer in Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Discriminative Adversarial Search for Abstractive Summarization βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Discriminative Jackknife: Quantifying Uncertainty in Deep Learning via Higher-Order Influence Functions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Disentangling Trainability and Generalization in Deep Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Dispersed Exponential Family Mixture VAEs for Interpretable Text Generation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Dissecting Non-Vacuous Generalization Bounds based on the Mean-Field Approximation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Distance Metric Learning with Joint Representation Diversification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Distinguishing Cause from Effect Using Quantiles: Bivariate Quantile Causal Discovery ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
Distributed Online Optimization over a Heterogeneous Network with Any-Batch Mirror Descent ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Distribution Augmentation for Generative Modeling ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Divide, Conquer, and Combine: a New Inference Strategy for Probabilistic Programs with Stochastic Support βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Do GANs always have Nash equilibria? βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Do RNN and LSTM have Long Memory? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Do We Need Zero Training Loss After Achieving Zero Training Error? βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Does label smoothing mitigate label noise? ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Domain Adaptive Imitation Learning βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Domain Aggregation Networks for Multi-Source Domain Adaptation βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Don’t Waste Your Bits! Squeeze Activations and Gradients for Deep Neural Networks via TinyScript βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Double-Loop Unadjusted Langevin Algorithm βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Doubly Stochastic Variational Inference for Neural Processes with Hierarchical Latent Variables βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Doubly robust off-policy evaluation with shrinkage ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
DropNet: Reducing Neural Network Complexity via Iterative Pruning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Dual Mirror Descent for Online Allocation Problems βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Dual-Path Distillation: A Unified Framework to Improve Black-Box Attacks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Duality in RKHSs with Infinite Dimensional Outputs: Application to Robust Losses βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Dynamics of Deep Neural Networks and Neural Tangent Hierarchy ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
ECLIPSE: An Extreme-Scale Linear Program Solver for Web-Applications βœ… ❌ ❌ ❌ ❌ βœ… ❌ 2
Educating Text Autoencoders: Latent Representation Guidance via Denoising ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Efficient Continuous Pareto Exploration in Multi-Task Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Efficient Domain Generalization via Common-Specific Low-Rank Decomposition βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Efficient Identification in Linear Structural Causal Models with Auxiliary Cutsets βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Efficient Intervention Design for Causal Discovery with Latents βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Efficient Non-conjugate Gaussian Process Factor Models for Spike Count Data using Polynomial Approximations ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Efficient Policy Learning from Surrogate-Loss Classification Reductions ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Efficient Proximal Mapping of the 1-path-norm of Shallow Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Efficient Robustness Certificates for Discrete Data: Sparsity-Aware Randomized Smoothing for Graphs, Images and More βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Efficient and Scalable Bayesian Neural Nets with Rank-1 Factors ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Efficient nonparametric statistical inference on population feature importance using Shapley values βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Efficiently Learning Adversarially Robust Halfspaces with Noise ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Efficiently Solving MDPs with Stochastic Mirror Descent βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Efficiently sampling functions from Gaussian process posteriors ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Eliminating the Invariance on the Loss Landscape of Linear Autoencoders ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Emergence of Separable Manifolds in Deep Language Representations ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
Empirical Study of the Benefits of Overparameterization in Learning Latent Variable Models ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Encoding Musical Style with Transformer Autoencoders ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Energy-Based Processes for Exchangeable Data βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Enhanced POET: Open-ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Enhancing Simple Models by Exploiting What They Already Know βœ… ❌ βœ… βœ… ❌ βœ… βœ… 5
Entropy Minimization In Emergent Languages ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Equivariant Flows: Exact Likelihood Generative Learning for Symmetric Densities ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Equivariant Neural Rendering ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Error Estimation for Sketched SVD via the Bootstrap βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Error-Bounded Correction of Noisy Labels βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Estimating Generalization under Distribution Shifts via Domain-Invariant Representations βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Estimating Model Uncertainty of Neural Networks in Sparse Information Form βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Estimating Q(s,s’) with Deep Deterministic Dynamics Gradients βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Estimating the Error of Randomized Newton Methods: A Bootstrap Approach βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Estimating the Number and Effect Sizes of Non-null Hypotheses ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Estimation of Bounds on Potential Outcomes For Decision Making βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Evaluating Lossy Compression Rates of Deep Generative Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Evaluating Machine Accuracy on ImageNet ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Evaluating the Performance of Reinforcement Learning Algorithms βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Evolutionary Topology Search for Tensor Network Decomposition βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Expert Learning through Generalized Inverse Multiobjective Optimization: Models, Insights, and Algorithms βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Explainable and Discourse Topic-aware Neural Language Understanding βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Explainable k-Means and k-Medians Clustering βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Explaining Groups of Points in Low-Dimensional Representations βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Explicit Gradient Learning for Black-Box Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
Extra-gradient with player sampling for faster convergence in n-player games βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Extrapolation for Large-batch Training in Deep Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Extreme Multi-label Classification from Aggregated Labels βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
FACT: A Diagnostic for Group Fairness Trade-offs ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
FR-Train: A Mutual Information-Based Approach to Fair and Robust Training ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Fair Generative Modeling via Weak Supervision βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Fair Learning with Private Demographic Data βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Fair k-Centers via Maximum Matching βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Fairwashing explanations with off-manifold detergent ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Familywise Error Rate Control by Interactive Unmasking βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Fast Adaptation to New Environments via Policy-Dynamics Value Functions ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Fast Deterministic CUR Matrix Decomposition with Accuracy Assurance βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Fast Differentiable Sorting and Ranking ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Fast OSCAR and OWL Regression via Safe Screening Rules βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Fast and Consistent Learning of Hidden Markov Models by Incorporating Non-Consecutive Correlations ❌ ❌ ❌ ❌ ❌ βœ… ❌ 1
Fast and Private Submodular and $k$-Submodular Functions Maximization with Matroid Constraints βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Fast computation of Nash Equilibria in Imperfect Information Games ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Faster Graph Embeddings via Coarsening βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Feature Noise Induces Loss Discrepancy Across Groups ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Feature Quantization Improves GAN Training βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Feature Selection using Stochastic Gates βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Feature-map-level Online Adversarial Knowledge Distillation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
FedBoost: A Communication-Efficient Algorithm for Federated Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Federated Learning with Only Positive Labels βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
FetchSGD: Communication-Efficient Federated Learning with Sketching βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Few-shot Domain Adaptation by Causal Mechanism Transfer βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Few-shot Relation Extraction via Bayesian Meta-learning on Relation Graphs βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Fiduciary Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Fiedler Regularization: Learning Neural Networks with Graph Sparsity βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Finding trainable sparse networks through Neural Tangent Transfer ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Fine-Grained Analysis of Stability and Generalization for Stochastic Gradient Descent ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Finite-Time Convergence in Continuous-Time Optimization ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Flexible and Efficient Long-Range Planning Through Curious Exploration βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Forecasting Sequential Data Using Consistent Koopman Autoencoders ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Fractal Gaussian Networks: A sparse random graph model based on Gaussian Multiplicative Chaos ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Fractional Underdamped Langevin Dynamics: Retargeting SGD with Momentum under Heavy-Tailed Gradient Noise ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Frequency Bias in Neural Networks for Input of Non-Uniform Density ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
From Chaos to Order: Symmetry and Conservation Laws in Game Dynamics ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
From ImageNet to Image Classification: Contextualizing Progress on Benchmarks ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
From Importance Sampling to Doubly Robust Policy Gradient ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
From Local SGD to Local Fixed-Point Methods for Federated Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
From PAC to Instance-Optimal Sample Complexity in the Plackett-Luce Model βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
From Sets to Multisets: Provable Variational Inference for Probabilistic Integer Submodular Models βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Frustratingly Simple Few-Shot Object Detection ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Full Law Identification in Graphical Models of Missing Data: Completeness Results ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Fully Parallel Hyperparameter Search: Reshaped Space-Filling ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Fundamental Tradeoffs between Invariance and Sensitivity to Adversarial Perturbations βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Gamification of Pure Exploration for Linear Bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Generalisation error in learning with random features and the hidden manifold model ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Generalization Error of Generalized Linear Models in High Dimensions βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Generalization Guarantees for Sparse Kernel Approximation with Entropic Optimal Features βœ… ❌ βœ… βœ… βœ… ❌ ❌ 4
Generalization and Representational Limits of Graph Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Generalization to New Actions in Reinforcement Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Generalized and Scalable Optimal Sparse Decision Trees βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Generating Programmatic Referring Expressions via Program Synthesis βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Generative Flows with Matrix Exponential βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Generative Pretraining From Pixels ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Global Concavity and Optimization in a Class of Dynamic Discrete Choice Models βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Go Wide, Then Narrow: Efficient Training of Deep Thin Networks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Goal-Aware Prediction: Learning to Model What Matters βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Goodness-of-Fit Tests for Inhomogeneous Random Graphs βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Gradient Temporal-Difference Learning with Regularized Corrections ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Gradient-free Online Learning in Continuous Games with Delayed Rewards βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Graph Convolutional Network for Recommendation with Low-pass Collaborative Filters ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Graph Filtration Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Graph Homomorphism Convolution βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Graph Optimal Transport for Cross-Domain Alignment βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Graph Random Neural Features for Distance-Preserving Graph Representations ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Graph Structure of Neural Networks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Graph-based Nearest Neighbor Search: From Practice to Theory ❌ βœ… βœ… ❌ βœ… ❌ ❌ 3
Graph-based, Self-Supervised Program Repair from Diagnostic Feedback ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
GraphOpt: Learning Optimization Models of Graph Formation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Graphical Models Meet Bandits: A Variational Thompson Sampling Approach βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Growing Action Spaces ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Growing Adaptive Multi-hyperplane Machines βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Guided Learning of Nonconvex Models through Successive Functional Gradient Optimization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Haar Graph Pooling βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Hallucinative Topological Memory for Zero-Shot Visual Planning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Handling the Positive-Definite Constraint in the Bayesian Learning Rule βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Harmonic Decompositions of Convolutional Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Healing Products of Gaussian Process Experts ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Hierarchical Generation of Molecular Graphs using Structural Motifs ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
Hierarchical Verification for Adversarial Robustness βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Hierarchically Decoupled Imitation For Morphological Transfer βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
High-dimensional Robust Mean Estimation via Gradient Descent βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
How Good is the Bayes Posterior in Deep Neural Networks Really? βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
How recurrent networks implement contextual processing in sentiment analysis ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
How to Solve Fair k-Center in Massive Data Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
How to Train Your Neural ODE: the World of Jacobian and Kinetic Regularization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Hybrid Stochastic-Deterministic Minibatch Proximal Gradient: Less-Than-Single-Pass Optimization with Nearly Optimal Generalization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Hypernetwork approach to generating point clouds ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
IPBoost – Non-Convex Boosting via Integer Programming βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Identifying Statistical Bias in Dataset Replication βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Implicit Class-Conditioned Domain Alignment for Unsupervised Domain Adaptation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Implicit Euler Skip Connections: Enhancing Adversarial Robustness via Numerical Stability βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Implicit Generative Modeling for Efficient Exploration βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Implicit Geometric Regularization for Learning Shapes ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Implicit Learning Dynamics in Stackelberg Games: Equilibria Characterization, Convergence Analysis, and Empirical Study βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Implicit Regularization of Random Feature Models ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Implicit competitive regularization in GANs ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Implicit differentiation of Lasso-type models for hyperparameter optimization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Improved Communication Cost in Distributed PageRank Computation – A Theoretical Study βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Improved Optimistic Algorithms for Logistic Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Improving Generative Imagination in Object-Centric World Models ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Improving Molecular Design by Stochastic Iterative Target Augmentation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Improving Robustness of Deep-Learning-Based Image Reconstruction βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Improving Transformer Optimization Through Better Initialization ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Improving generalization by controlling label-noise information in neural network weights βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Improving the Gating Mechanism of Recurrent Neural Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: Joint Gradient Estimation and Tracking βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Imputer: Sequence Modelling via Imputation and Dynamic Programming ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
In Defense of Uniform Convergence: Generalization via Derandomization with an Application to Interpolating Predictors ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Incremental Sampling Without Replacement for Sequence Models βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Individual Calibration with Randomized Forecasting ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Individual Fairness for k-Clustering βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Inducing and Exploiting Activation Sparsity for Fast Inference on Deep Neural Networks βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Inductive Relation Prediction by Subgraph Reasoning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Inductive-bias-driven Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters ❌ ❌ βœ… ❌ βœ… ❌ ❌ 2
Inertial Block Proximal Methods for Non-Convex Non-Smooth Optimization βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Inexact Tensor Methods with Dynamic Accuracies βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Inferring DQN structure for high-dimensional continuous control βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Infinite attention: NNGP and NTK for deep attention networks ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
Influenza Forecasting Framework based on Gaussian Processes βœ… ❌ βœ… βœ… ❌ βœ… βœ… 5
InfoGAN-CR and ModelCentrality: Self-supervised Model Training and Selection for Disentangling GANs βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continuous Domains βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Information-Theoretic Local Minima Characterization and Regularization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Informative Dropout for Robust Representation Learning: A Shape-bias Perspective βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Input-Sparsity Low Rank Approximation in Schatten Norm βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
InstaHide: Instance-hiding Schemes for Private Distributed Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Inter-domain Deep Gaussian Processes ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Interference and Generalization in Temporal Difference Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Interferometric Graph Transform: a Deep Unsupervised Graph Representation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Interpolation between Residual and Non-Residual Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Interpretable, Multidimensional, Multimodal Anomaly Detection with Negative Sampling for Detection of Device Failure ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Interpretations are Useful: Penalizing Explanations to Align Neural Networks with Prior Knowledge ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Interpreting Robust Optimization via Adversarial Influence Functions ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Intrinsic Reward Driven Imitation Learning via Generative Model βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Invariant Causal Prediction for Block MDPs βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Invariant Rationalization ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Invariant Risk Minimization Games βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Inverse Active Sensing: Modeling and Understanding Timely Decision-Making βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Invertible generative models for inverse problems: mitigating representation error and dataset bias ❌ ❌ βœ… ❌ βœ… ❌ ❌ 2
Involutive MCMC: a Unifying Framework βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Is Local SGD Better than Minibatch SGD? ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
It’s Not What Machines Can Learn, It’s What We Cannot Teach ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
Kernel Methods for Cooperative Multi-Agent Contextual Bandits βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Kernel interpolation with continuous volume sampling ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Kernelized Stein Discrepancy Tests of Goodness-of-fit for Time-to-Event Data ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning βœ… βœ… ❌ ❌ ❌ ❌ ❌ 2
Knowing The What But Not The Where in Bayesian Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
LEEP: A New Measure to Evaluate Transferability of Learned Representations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
LP-SparseMAP: Differentiable Relaxed Optimization for Sparse Structured Prediction βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
LTF: A Label Transformation Framework for Correcting Label Shift ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Label-Noise Robust Domain Adaptation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Landscape Connectivity and Dropout Stability of SGD Solutions for Over-parameterized Neural Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Laplacian Regularized Few-Shot Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Latent Bernoulli Autoencoder βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Latent Space Factorisation and Manipulation via Matrix Subspace Projection ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Latent Variable Modelling with Hyperbolic Normalizing Flows ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Layered Sampling for Robust Optimization Problems βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
LazyIter: A Fast Algorithm for Counting Markov Equivalent DAGs and Designing Experiments βœ… βœ… ❌ ❌ ❌ ❌ ❌ 2
Learnable Group Transform For Time-Series ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Algebraic Multigrid Using Graph Neural Networks βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Learning Autoencoders with Relational Regularization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning Calibratable Policies using Programmatic Style-Consistency βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Learning Compound Tasks without Task-specific Knowledge via Imitation and Self-supervised Learning ❌ ❌ βœ… ❌ βœ… ❌ ❌ 2
Learning De-biased Representations with Biased Representations ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning Deep Kernels for Non-Parametric Two-Sample Tests βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Learning Discrete Structured Representations by Adversarially Maximizing Mutual Information βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning Efficient Multi-agent Communication: An Information Bottleneck Approach βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Factorized Weight Matrix for Joint Filtering ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards ❌ ❌ ❌ ❌ βœ… ❌ ❌ 1
Learning Flat Latent Manifolds with VAEs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning Human Objectives by Evaluating Hypothetical Behavior βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Learning Mixtures of Graphs from Epidemic Cascades βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning Near Optimal Policies with Low Inherent Bellman Error βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Opinions in Social Networks βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Optimal Tree Models under Beam Search βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Learning Portable Representations for High-Level Planning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Learning Quadratic Games on Networks βœ… ❌ βœ… ❌ ❌ βœ… βœ… 4
Learning Reasoning Strategies in End-to-End Differentiable Proving βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning Representations that Support Extrapolation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning Robot Skills with Temporal Variational Inference βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Selection Strategies in Buchberger’s Algorithm βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Learning Similarity Metrics for Numerical Simulations ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Structured Latent Factors from Dependent Data:A Generative Model Framework from Information-Theoretic Perspective βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning Task-Agnostic Embedding of Multiple Black-Box Experts for Multi-Task Model Fusion βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Learning To Stop While Learning To Predict βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning What to Defer for Maximum Independent Sets ❌ ❌ βœ… ❌ βœ… ❌ ❌ 2
Learning and Evaluating Contextual Embedding of Source Code ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning and Sampling of Atomic Interventions from Observations βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning disconnected manifolds: a no GAN’s land ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning for Dose Allocation in Adaptive Clinical Trials with Safety Constraints βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning from Irregularly-Sampled Time Series: A Missing Data Perspective ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning the Valuations of a $k$-demand Agent βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning the piece-wise constant graph structure of a varying Ising model ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning to Branch for Multi-Task Learning ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over Modules βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning to Encode Position for Transformer with Continuous Dynamical Model ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Learning to Learn Kernels with Variational Random Features ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning to Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Learning to Rank Learning Curves βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning to Score Behaviors for Guided Policy Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning to Simulate Complex Physics with Graph Networks ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
Learning to Simulate and Design for Structural Engineering ❌ ❌ ❌ βœ… βœ… ❌ βœ… 3
Learning with Bounded Instance and Label-dependent Label Noise βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning with Feature and Distribution Evolvable Streams ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning with Good Feature Representations in Bandits and in RL with a Generative Model βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning with Multiple Complementary Labels ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Let’s Agree to Agree: Neural Networks Share Classification Order on Real Datasets ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Leveraging Frequency Analysis for Deep Fake Image Recognition ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Leveraging Procedural Generation to Benchmark Reinforcement Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Lifted Disjoint Paths with Application in Multiple Object Tracking βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Likelihood-free MCMC with Amortized Approximate Ratio Estimators βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Linear Convergence of Randomized Primal-Dual Coordinate Method for Large-scale Linear Constrained Convex Programming βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Linear Lower Bounds and Conditioning of Differentiable Games ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Linear Mode Connectivity and the Lottery Ticket Hypothesis βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Linear bandits with Stochastic Delayed Feedback βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Logarithmic Regret for Adversarial Online Control βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Logistic Regression for Massive Data with Rare Events ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Lookahead-Bounded Q-learning βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Lorentz Group Equivariant Neural Network for Particle Physics ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Loss Function Search for Face Recognition βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Low Bias Low Variance Gradient Estimates for Boolean Stochastic Networks βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Low-Rank Bottleneck in Multi-head Attention Models ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Low-Variance and Zero-Variance Baselines for Extensive-Form Games βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Low-loss connection of weight vectors: distribution-based approaches ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
LowFER: Low-rank Bilinear Pooling for Link Prediction ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Lower Complexity Bounds for Finite-Sum Convex-Concave Minimax Optimization Problems ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Manifold Identification for Ultimately Communication-Efficient Distributed Optimization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Mapping natural-language problems to formal-language solutions using structured neural representations ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Margin-aware Adversarial Domain Adaptation with Optimal Transport ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Maximum Likelihood with Bias-Corrected Calibration is Hard-To-Beat at Label Shift Adaptation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Maximum-and-Concatenation Networks ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Measuring Non-Expert Comprehension of Machine Learning Fairness Metrics ❌ βœ… ❌ ❌ ❌ βœ… ❌ 2
Median Matrix Completion: from Embarrassment to Optimality βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Message Passing Least Squares Framework and its Application to Rotation Synchronization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Meta Variance Transfer: Learning to Augment from the Others βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Meta-Learning with Shared Amortized Variational Inference ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Meta-learning for Mixed Linear Regression βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Meta-learning with Stochastic Linear Bandits βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
MetaFun: Meta-Learning with Iterative Functional Updates ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Minimax Pareto Fairness: A Multi Objective Perspective βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Minimax Rate for Learning From Pairwise Comparisons in the BTL Model βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Minimax Weight and Q-Function Learning for Off-Policy Evaluation ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Missing Data Imputation using Optimal Transport βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Mix-n-Match : Ensemble and Compositional Methods for Uncertainty Calibration in Deep Learning ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Model Fusion with Kullback-Leibler Divergence ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Model-Based Reinforcement Learning with Value-Targeted Regression βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Modulating Surrogates for Bayesian Optimization ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Momentum Improves Normalized SGD βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Momentum-Based Policy Gradient Methods βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Moniqua: Modulo Quantized Communication in Decentralized SGD βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Monte-Carlo Tree Search as Regularized Policy Optimization ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
More Data Can Expand The Generalization Gap Between Adversarially Robust and Standard Models ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
More Information Supervised Probabilistic Deep Face Embedding Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Multi-Agent Determinantal Q-Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Multi-Agent Routing Value Iteration Network ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Multi-Objective Molecule Generation using Interpretable Substructures βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Multi-Precision Policy Enforced Training (MuPPET) : A Precision-Switching Strategy for Quantised Fixed-Point Training of CNNs ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Multi-Task Learning with User Preferences: Gradient Descent with Controlled Ascent in Pareto Optimization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Multi-fidelity Bayesian Optimization with Max-value Entropy Search and its Parallelization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Multi-objective Bayesian Optimization using Pareto-frontier Entropy ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Multi-step Greedy Reinforcement Learning Algorithms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Multiclass Neural Network Minimization via Tropical Newton Polytope Approximation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Multidimensional Shape Constraints ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
Multigrid Neural Memory ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations ❌ ❌ βœ… βœ… ❌ βœ… βœ… 4
Multinomial Logit Bandit with Low Switching Cost βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Mutual Transfer Learning for Massive Data ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
NADS: Neural Architecture Distribution Search for Uncertainty Awareness ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
NGBoost: Natural Gradient Boosting for Probabilistic Prediction βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Naive Exploration is Optimal for Online LQR βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Near Input Sparsity Time Kernel Embeddings via Adaptive Sampling βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Near-Tight Margin-Based Generalization Bounds for Support Vector Machines ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Near-linear time Gaussian process optimization with adaptive batching and resparsification βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Near-optimal Regret Bounds for Stochastic Shortest Path βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Near-optimal sample complexity bounds for learning Latent $k-$polytopes and applications to Ad-Mixtures ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Nearly Linear Row Sampling Algorithm for Quantile Regression βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Negative Sampling in Semi-Supervised learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Nested Subspace Arrangement for Representation of Relational Data βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
NetGAN without GAN: From Random Walks to Low-Rank Approximations βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Neural Architecture Search in A Proxy Validation Loss Landscape βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Neural Clustering Processes βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Neural Contextual Bandits with UCB-based Exploration βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Neural Datalog Through Time: Informed Temporal Modeling via Logical Specification ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Neural Kernels Without Tangents βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Neural Network Control Policy Verification With Persistent Adversarial Perturbation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-layer Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Neural Topic Modeling with Continual Lifelong Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning" βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
New Oracle-Efficient Algorithms for Private Synthetic Data Release βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
No-Regret Exploration in Goal-Oriented Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
No-Regret and Incentive-Compatible Online Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Non-Autoregressive Neural Text-to-Speech ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
Non-Stationary Delayed Bandits with Intermediate Observations βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Non-autoregressive Machine Translation with Disentangled Context Transformer βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Non-convex Learning via Replica Exchange Stochastic Gradient MCMC βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Non-separable Non-stationary random fields ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Nonparametric Score Estimators βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks Using PAC-Bayesian Analysis ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Normalized Loss Functions for Deep Learning with Noisy Labels ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Normalizing Flows on Tori and Spheres ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Obtaining Adjustable Regularization for Free via Iterate Averaging ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Off-Policy Actor-Critic with Shared Experience Replay ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Approximate Thompson Sampling with Langevin Algorithms βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
On Breaking Deep Generative Model-based Defenses and Beyond βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
On Conditional Versus Marginal Bias in Multi-Armed Bandits ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
On Contrastive Learning for Likelihood-free Inference βœ… βœ… ❌ βœ… ❌ ❌ βœ… 4
On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Coresets for Regularized Regression ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
On Differentially Private Stochastic Convex Optimization with Heavy-tailed Data βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
On Efficient Constructions of Checkpoints βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
On Efficient Low Distortion Ultrametric Embedding ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Implicit Regularization in $Ξ²$-VAEs ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
On Layer Normalization in the Transformer Architecture ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
On Learning Language-Invariant Representations for Universal Machine Translation ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On Learning Sets of Symmetric Elements ❌ ❌ βœ… ❌ βœ… ❌ ❌ 2
On Leveraging Pretrained GANs for Generation with Limited Data ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
On Lp-norm Robustness of Ensemble Decision Stumps and Trees ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
On Relativistic f-Divergences ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
On Second-Order Group Influence Functions for Black-Box Predictions ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Semi-parametric Inference for BART ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On Unbalanced Optimal Transport: An Analysis of Sinkhorn Algorithm βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Validation and Planning of An Optimal Decision Rule with Application in Healthcare Studies ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
On Variational Learning of Controllable Representations for Text without Supervision ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
On a projective ensemble approach to two sample test for equality of distributions ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
On hyperparameter tuning in general clustering problemsm βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
On the (In)tractability of Computing Normalizing Constants for the Product of Determinantal Point Processes ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Convergence of Nesterov’s Accelerated Gradient Method in Stochastic Settings ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
On the Expressivity of Neural Networks for Deep Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
On the Generalization Benefit of Noise in Stochastic Gradient Descent ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On the Generalization Effects of Linear Transformations in Data Augmentation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
On the Global Convergence Rates of Softmax Policy Gradient Methods βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
On the Global Optimality of Model-Agnostic Meta-Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
On the Iteration Complexity of Hypergradient Computation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
On the Noisy Gradient Descent that Generalizes as SGD βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
On the Number of Linear Regions of Convolutional Neural Networks ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
On the Power of Compressed Sensing with Generative Models ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Relation between Quality-Diversity Evaluation and Distribution-Fitting Goal in Text Generation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On the Sample Complexity of Adversarial Multi-Source PAC Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
On the Theoretical Properties of the Network Jackknife ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On the Unreasonable Effectiveness of the Greedy Algorithm: Greedy Adapts to Sharpness βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On the consistency of top-k surrogate losses ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
One Size Fits All: Can We Train One Denoiser for All Noise Levels? ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
One-shot Distributed Ridge Regression in High Dimensions βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Online Bayesian Moment Matching based SAT Solver Heuristics ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Online Continual Learning from Imbalanced Data βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Online Control of the False Coverage Rate and False Sign Rate βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Online Convex Optimization in the Random Order Model βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Online Dense Subgraph Discovery via Blurred-Graph Feedback βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Online Learned Continual Compression with Adaptive Quantization Modules βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
Online Learning for Active Cache Synchronization βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Online Learning with Dependent Stochastic Feedback Graphs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Online Learning with Imperfect Hints βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Online Multi-Kernel Learning with Graph-Structured Feedback βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Online Pricing with Offline Data: Phase Transition and Inverse Square Law βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Online metric algorithms with untrusted predictions βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Online mirror descent and dual averaging: keeping pace in the dynamic case βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Operation-Aware Soft Channel Pruning using Differentiable Masks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Optimal Bounds between f-Divergences and Integral Probability Metrics ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Optimal Continual Learning has Perfect Memory and is NP-hard ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Optimal Differential Privacy Composition for Exponential Mechanisms ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Optimal Estimator for Unlabeled Linear Regression βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Optimal Non-parametric Learning in Repeated Contextual Auctions with Strategic Buyer βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Optimal Randomized First-Order Methods for Least-Squares Problems βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Optimal Robust Learning of Discrete Distributions from Batches βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Optimal Sequential Maximization: One Interview is Enough! βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Optimal approximation for unconstrained non-submodular minimization ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Optimal transport mapping via input convex neural networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Optimistic Bounds for Multi-output Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Optimistic Policy Optimization with Bandit Feedback βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Optimization Theory for ReLU Neural Networks Trained with Normalization Layers ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Optimization and Analysis of the pAp@k Metric for Recommender Systems βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Optimization from Structured Samples for Coverage Functions βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Optimizer Benchmarking Needs to Account for Hyperparameter Tuning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Optimizing Black-box Metrics with Adaptive Surrogates βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Optimizing Data Usage via Differentiable Rewards βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Optimizing Dynamic Structures with Bayesian Generative Search βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Optimizing for the Future in Non-Stationary MDPs βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Option Discovery in the Absence of Rewards with Manifold Analysis βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Oracle Efficient Private Non-Convex Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Ordinal Non-negative Matrix Factorization for Recommendation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Orthogonalized SGD and Nested Architectures for Anytime Neural Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Overfitting in adversarially robust deep learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
PDO-eConvs: Partial Differential Operator Based Equivariant Convolutions ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
PENNI: Pruned Kernel Sharing for Efficient CNN Inference ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
PackIt: A Virtual Environment for Geometric Planning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Parallel Algorithm for Non-Monotone DR-Submodular Maximization βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Parameter-free, Dynamic, and Strongly-Adaptive Online Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Parameterized Rate-Distortion Stochastic Encoder βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Parametric Gaussian Process Regressors βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Partial Trace Regression and Low-Rank Kraus Decomposition ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Perceptual Generative Autoencoders ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Performative Prediction βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Piecewise Linear Regression via a Difference of Convex Functions βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Planning to Explore via Self-Supervised World Models βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
PoWER-BERT: Accelerating BERT Inference via Progressive Word-vector Elimination ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Poisson Learning: Graph Based Semi-Supervised Learning At Very Low Label Rates βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
PolyGen: An Autoregressive Generative Model of 3D Meshes ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Polynomial Tensor Sketch for Element-wise Function of Low-Rank Matrix βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Population-Based Black-Box Optimization for Biological Sequence Design βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
PowerNorm: Rethinking Batch Normalization in Transformers βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Predicting Choice with Set-Dependent Aggregation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Predicting deliberative outcomes ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Predictive Coding for Locally-Linear Control ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Predictive Multiplicity in Classification βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Predictive Sampling with Forecasting Autoregressive Models βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Preference Modeling with Context-Dependent Salient Features ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Preselection Bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Pretrained Generalized Autoregressive Model with Adaptive Probabilistic Label Clusters for Extreme Multi-label Text Classification ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Principled learning method for Wasserstein distributionally robust optimization with local perturbations ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Private Counting from Anonymous Messages: Near-Optimal Accuracy with Vanishing Communication Overhead βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Private Outsourced Bayesian Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Private Query Release Assisted by Public Data βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Private Reinforcement Learning with PAC and Regret Guarantees βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Privately Learning Markov Random Fields βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Privately detecting changes in unknown distributions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Probing Emergent Semantics in Predictive Agents via Question Answering ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Problems with Shapley-value-based explanations as feature importance measures ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Progressive Graph Learning for Open-Set Domain Adaptation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Progressive Identification of True Labels for Partial-Label Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Projection-free Distributed Online Convex Optimization with $O(\sqrtT)$ Communication Complexity βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Projective Preferential Bayesian Optimization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Proper Network Interpretability Helps Adversarial Robustness in Classification ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Provable Representation Learning for Imitation Learning via Bi-level Optimization ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
Provable Self-Play Algorithms for Competitive Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Provable Smoothness Guarantees for Black-Box Variational Inference ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Provable guarantees for decision tree induction: the agnostic setting βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Provably Efficient Exploration in Policy Optimization βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Provably Efficient Model-based Policy Adaptation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Proving the Lottery Ticket Hypothesis: Pruning is All You Need ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Quadratically Regularized Subgradient Methods for Weakly Convex Optimization with Weakly Convex Constraints βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Quantized Decentralized Stochastic Learning over Directed Graphs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Quantum Boosting βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Quantum Expectation-Maximization for Gaussian mixture models βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
RIFLE: Backpropagation in Depth for Deep Transfer Learning through Re-Initializing the Fully-connected LayEr βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Radioactive data: tracing through training ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Random Matrix Theory Proves that Deep Learning Representations of GAN-data Behave as Gaussian Mixtures ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Random extrapolation for primal-dual coordinate descent βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Randomization matters How to defend against strong adversarial attacks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Randomized Block-Diagonal Preconditioning for Parallel Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Randomized Smoothing of All Shapes and Sizes βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Randomly Projected Additive Gaussian Processes for Regression ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Rank Aggregation from Pairwise Comparisons in the Presence of Adversarial Corruptions βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Rate-distortion optimization guided autoencoder for isometric embedding in Euclidean latent space ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Ready Policy One: World Building Through Active Learning βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Real-Time Optimisation for Online Learning in Auctions βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Recht-Re Noncommutative Arithmetic-Geometric Mean Conjecture is False ❌ βœ… ❌ ❌ βœ… βœ… βœ… 4
Recovery of Sparse Signals from a Mixture of Linear Samples βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Recurrent Hierarchical Topic-Guided RNN for Language Generation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Reducing Sampling Error in Batch Temporal Difference Learning βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Refined bounds for algorithm configuration: The knife-edge of dual class approximability βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Regularized Optimal Transport is Ground Cost Adversarial βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Reinforcement Learning for Integer Programming: Learning to Cut βœ… ❌ ❌ ❌ ❌ βœ… βœ… 3
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics ❌ βœ… βœ… ❌ βœ… βœ… ❌ 4
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Relaxing Bijectivity Constraints with Continuously Indexed Normalising Flows βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Reliable Fidelity and Diversity Metrics for Generative Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Representation Learning via Adversarially-Contrastive Optimal Transport ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Representations for Stable Off-Policy Reinforcement Learning ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Representing Unordered Data Using Complex-Weighted Multiset Automata ❌ βœ… ❌ βœ… ❌ ❌ βœ… 3
Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Restarted Bayesian Online Change-point Detector achieves Optimal Detection Delay βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Rethinking Bias-Variance Trade-off for Generalization of Neural Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Retrieval Augmented Language Model Pre-Training ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Reverse-engineering deep ReLU networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Revisiting Fundamentals of Experience Replay ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Revisiting Spatial Invariance with Low-Rank Local Connectivity βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Revisiting Training Strategies and Generalization Performance in Deep Metric Learning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Reward-Free Exploration for Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Rigging the Lottery: Making All Tickets Winners βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Robust Bayesian Classification Using An Optimistic Score Ratio βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Robust Graph Representation Learning via Neural Sparsification βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Robust Learning with the Hilbert-Schmidt Independence Criterion βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Robust One-Bit Recovery via ReLU Generative Networks: Near-Optimal Statistical Rate and Global Landscape Analysis ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Robust Outlier Arm Identification βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Robust Pricing in Dynamic Mechanism Design ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Robust and Stable Black Box Explanations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Robustifying Sequential Neural Processes βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robustness to Programmable String Transformations via Augmented Abstract Training βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Robustness to Spurious Correlations via Human Annotations ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
SCAFFOLD: Stochastic Controlled Averaging for Federated Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
SGD Learns One-Layer Networks in WGANs βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
SIGUA: Forgetting May Make Learning with Noisy Labels More Robust βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Safe Deep Semi-Supervised Learning for Unseen-Class Unlabeled Data βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Safe Reinforcement Learning in Constrained Markov Decision Processes βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Safe screening rules for L0-regression from Perspective Relaxations ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Sample Amplification: Increasing Dataset Size even when Learning is Impossible βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Sample Complexity Bounds for 1-bit Compressive Sensing and Binary Stable Embeddings with Generative Priors ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Scalable Deep Generative Modeling for Sparse Graphs βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Scalable Differentiable Physics for Learning and Control ❌ βœ… ❌ ❌ βœ… βœ… βœ… 4
Scalable Differential Privacy with Certified Robustness in Adversarial Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Scalable Exact Inference in Multi-Output Gaussian Processes βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Scalable Gaussian Process Separation for Kernels with a Non-Stationary Phase ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Scalable Nearest Neighbor Search for Optimal Transport ❌ βœ… βœ… ❌ βœ… ❌ ❌ 3
Scalable and Efficient Comparison-based Search without Features βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Scaling up Hybrid Probabilistic Inference with Logical and Arithmetic Constraints via Message Passing βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Schatten Norms in Matrix Streams: Hello Sparsity, Goodbye Dimension ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Searching to Exploit Memorization Effect in Learning with Noisy Labels βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Second-Order Provable Defenses against Adversarial Attacks ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Selective Dyna-Style Planning Under Limited Model Capacity ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Self-Attentive Associative Memory ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Self-Attentive Hawkes Process ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Self-Concordant Analysis of Frank-Wolfe Algorithms βœ… βœ… βœ… ❌ βœ… βœ… ❌ 5
Self-Modulating Nonparametric Event-Tensor Factorization ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Self-supervised Label Augmentation via Input Transformations βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Semi-Supervised Learning with Normalizing Flows ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Semi-Supervised StyleGAN for Disentanglement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Semiparametric Nonlinear Bipartite Graph Representation Learning with Provable Guarantees ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Semismooth Newton Algorithm for Efficient Projections onto $\ell_1, ∞$-norm Ball βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Sequence Generation with Mixed Representations ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Sequential Cooperative Bayesian Inference βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Sequential Transfer in Reinforcement Learning with a Generative Model βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Set Functions for Time Series ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Sets Clustering βœ… βœ… βœ… ❌ βœ… βœ… ❌ 5
Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Sharp Statistical Guaratees for Adversarially Robust Gaussian Classification βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
SimGANs: Simulator-Based Generative Adversarial Networks for ECG Synthesis to Improve Deep ECG Classification ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Simple and Deep Graph Convolutional Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Simple and sharp analysis of k-means|| βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Simultaneous Inference for Massive Data: Distributed Bootstrap βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Single Point Transductive Prediction ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Small Data, Big Decisions: Model Selection in the Small-Data Regime ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Small-GAN: Speeding up GAN Training using Core-Sets βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Smaller, more accurate regression forests using tree alternating optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Soft Threshold Weight Reparameterization for Learnable Sparsity βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
SoftSort: A Continuous Relaxation for the argsort Operator βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Source Separation with Deep Generative Priors βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Sparse Convex Optimization via Adaptively Regularized Hard Thresholding βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Sparse Gaussian Processes with Spherical Harmonic Features ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Sparse Shrunk Additive Models ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Sparse Sinkhorn Attention ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Sparse Subspace Clustering with Entropy-Norm ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Sparsified Linear Programming for Zero-Sum Equilibrium Finding βœ… ❌ βœ… ❌ ❌ βœ… βœ… 4
Spectral Clustering with Graph Neural Networks for Graph Pooling ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
Spectral Frank-Wolfe Algorithm: Strict Complementarity and Linear Convergence βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Spectral Graph Matching and Regularized Quadratic Relaxations: Algorithm and Theory βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Spectral Subsampling MCMC for Stationary Time Series ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Spectrum Dependent Learning Curves in Kernel Regression and Wide Neural Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Spread Divergence ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Stabilizing Differentiable Architecture Search via Perturbation-based Regularization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Stabilizing Transformers for Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Statistically Efficient Off-Policy Policy Gradients βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Statistically Preconditioned Accelerated Gradient Method for Distributed Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Stochastic Coordinate Minimization with Progressive Precision for Stochastic Convex Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Differential Equations with Variational Wishart Diffusions ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Stochastic Flows and Geometric Optimization on the Orthogonal Group βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Frank-Wolfe for Constrained Finite-Sum Minimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Stochastic Gauss-Newton Algorithms for Nonconvex Compositional Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Gradient and Langevin Processes ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Stochastic Hamiltonian Gradient Methods for Smooth Games βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Latent Residual Video Prediction ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Stochastic Optimization for Non-convex Inf-Projection Problems βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Optimization for Regularized Wasserstein Estimators βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Stochastic Regret Minimization in Extensive-Form Games βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Stochastic Subspace Cubic Newton Method βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Stochastic bandits with arm-dependent delays βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
StochasticRank: Global Optimization of Scale-Free Discrete Functions ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Stochastically Dominant Distributional Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Strategic Classification is Causal Modeling in Disguise ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Strategyproof Mean Estimation from Multiple-Choice Questions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Streaming Coresets for Symmetric Tensor Factorization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Streaming Submodular Maximization under a k-Set System Constraint βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Streaming k-Submodular Maximization under Noise subject to Size Constraint βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Strength from Weakness: Fast Learning Using Weak Supervision βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Stronger and Faster Wasserstein Adversarial Attacks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Structural Language Models of Code ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Structure Adaptive Algorithms for Stochastic Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Structured Policy Iteration for Linear Quadratic Regulator βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Structured Prediction with Partial Labelling through the Infimum Loss ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Student Specialization in Deep Rectified Networks With Finite Width and Input Dimension ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Student-Teacher Curriculum Learning via Reinforcement Learning: Predicting Hospital Inpatient Admission Location βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Sub-Goal Trees a Framework for Goal-Based Reinforcement Learning βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Sub-linear Memory Sketches for Near Neighbor Search on Streaming Data βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Subspace Fitting Meets Regression: The Effects of Supervision and Orthonormality Constraints on Double Descent of Generalization Errors βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Super-efficiency of automatic differentiation for functions defined as a minimum ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
Superpolynomial Lower Bounds for Learning One-Layer Neural Networks using Gradient Descent ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Supervised Quantile Normalization for Low Rank Matrix Factorization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Supervised learning: no loss no cry βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Symbolic Network: Generalized Neural Policies for Relational MDPs ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
T-Basis: a Compact Representation for Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
T-GD: Transferable GAN-generated Images Detection Framework βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Tails of Lipschitz Triangular Flows ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Task Understanding from Confusing Multi-task Data ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Task-Oriented Active Perception and Planning in Environments with Partially Known Semantics βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
TaskNorm: Rethinking Batch Normalization for Meta-Learning ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Taylor Expansion Policy Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Teaching with Limited Information on the Learner’s Behaviour βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Temporal Logic Point Processes ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Temporal Phenotyping using Deep Predictive Clustering of Disease Progression βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Tensor denoising and completion based on ordinal observations βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
The Boomerang Sampler ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
The Buckley-Osthus model and the block preferential attachment model: statistical analysis and application ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The Complexity of Finding Stationary Points with Stochastic Gradient Descent ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Cost-free Nature of Optimally Tuning Tikhonov Regularizers and Other Ordered Smoothers ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
The Differentiable Cross-Entropy Method βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
The Effect of Natural Distribution Shift on Question Answering Models ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
The FAST Algorithm for Submodular Maximization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
The Impact of Neural Network Overparameterization on Gradient Confusion and Stochastic Gradient Descent ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
The Implicit Regularization of Stochastic Gradient Flow for Least Squares ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Implicit and Explicit Regularization Effects of Dropout βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Many Shapley Values for Model Explanation βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
The Neural Tangent Kernel in High Dimensions: Triple Descent and a Multi-Scale Theory of Generalization ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Non-IID Data Quagmire of Decentralized Machine Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
The Performance Analysis of Generalized Margin Maximizers on Separable Data ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Role of Regularization in Classification of High-dimensional Noisy Gaussian Mixture ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Sample Complexity of Best-$k$ Items Selection from Pairwise Comparisons βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
The Shapley Taylor Interaction Index ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The Tree Ensemble Layer: Differentiability meets Conditional Computation βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
The Usual Suspects? Reassessing Blame for VAE Posterior Collapse ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The continuous categorical: a novel simplex-valued exponential family βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
The k-tied Normal Distribution: A Compact Parameterization of Gaussian Mean Field Posteriors in Bayesian Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
Thompson Sampling Algorithms for Mean-Variance Bandits βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Thompson Sampling via Local Uncertainty βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Tight Bounds on Minimax Regret under Logarithmic Loss via Self-Concordance ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Tightening Exploration in Upper Confidence Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Time Series Deconfounder: Estimating Treatment Effects over Time in the Presence of Hidden Confounders ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
Time-Consistent Self-Supervision for Semi-Supervised Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Time-aware Large Kernel Convolutions ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Too Relaxed to Be Fair βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Topic Modeling via Full Dependence Mixtures βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Topological Autoencoders ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Topologically Densified Distributions ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Towards Accurate Post-training Network Quantization via Bit-Split and Stitching βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Towards Adaptive Residual Network Training: A Neural-ODE Perspective βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Towards Non-Parametric Drift Detection via Dynamic Adapting Window Independence Drift Detection (DAWIDD) βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Towards Understanding the Dynamics of the First-Order Adversaries ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Towards Understanding the Regularization of Adversarial Robustness on Neural Networks ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Towards a General Theory of Infinite-Width Limits of Neural Classifiers ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Training Binary Neural Networks through Learning with Noisy Supervision βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Training Binary Neural Networks using the Bayesian Learning Rule βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Training Deep Energy-Based Models with f-Divergence Minimization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Training Linear Neural Networks: Non-Local Convergence and Complexity Results ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Training Neural Networks for and by Interpolation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
TrajectoryNet: A Dynamic Optimal Transport Network for Modeling Cellular Dynamics ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Transfer Learning without Knowing: Reprogramming Black-box Machine Learning Models with Scarce Data and Limited Resources βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Transformation of ReLU-based recurrent neural networks from discrete-time to continuous-time ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Transformer Hawkes Process ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Transparency Promotion with Model-Agnostic Linear Competitors βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Two Routes to Scalable Credit Assignment without Weight Symmetry ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Two Simple Ways to Learn Individual Fairness Metrics from Data βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Uncertainty Estimation Using a Single Deep Deterministic Neural Network ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Uncertainty quantification for nonconvex tensor completion: Confidence intervals, heteroscedasticity and optimality βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Uncertainty-Aware Lookahead Factor Models for Quantitative Investing ❌ ❌ ❌ βœ… βœ… ❌ βœ… 3
Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Understanding Self-Training for Gradual Domain Adaptation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Understanding and Mitigating the Tradeoff between Robustness and Accuracy βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Understanding and Stabilizing GANs’ Training Dynamics Using Control Theory βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Understanding the Impact of Model Incoherence on Convergence of Incremental SGD with Random Reshuffle ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Undirected Graphical Models as Approximate Posteriors βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Uniform Convergence of Rank-weighted Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Unique Properties of Flat Minima in Deep Networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Universal Average-Case Optimality of Polyak Momentum ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Universal Equivariant Multilayer Perceptrons ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Unraveling Meta-Learning: Understanding Feature Representations for Few-Shot Tasks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Unsupervised Discovery of Interpretable Directions in the GAN Latent Space ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Unsupervised Speech Decomposition via Triple Information Bottleneck ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Unsupervised Transfer Learning for Spatiotemporal Predictive Networks ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Up or Down? Adaptive Rounding for Post-Training Quantization ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Upper bounds for Model-Free Row-Sparse Principal Component Analysis βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
VFlow: More Expressive Generative Flows with Variational Data Augmentation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Variable Skipping for Autoregressive Range Density Estimation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Variance Reduced Coordinate Descent with Acceleration: New Method With a Surprising Application to Finite-Sum Problems βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Variance Reduction and Quasi-Newton for Particle-Based Variational Inference βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Variance Reduction in Stochastic Particle-Optimization Sampling βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Variational Autoencoders with Riemannian Brownian Motion Priors ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Variational Bayesian Quantization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Variational Imitation Learning with Diverse-quality Demonstrations βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Variational Inference for Sequential Data with Future Likelihood Estimates ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Variational Label Enhancement ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Video Prediction via Example Guidance βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
VideoOneNet: Bidirectional Convolutional Recurrent OneNet with Trainable Data Steps for Video Processing βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Visual Grounding of Learned Physical Models ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Voice Separation with an Unknown Number of Multiple Speakers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
WaveFlow: A Compact Flow-based Model for Raw Audio ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Weakly-Supervised Disentanglement Without Compromises ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
What Can Learned Intrinsic Rewards Capture? βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
What can I do here? A Theory of Affordances in Reinforcement Learning βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
What is Local Optimality in Nonconvex-Nonconcave Minimax Optimization? βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
When Demands Evolve Larger and Noisier: Learning and Earning in a Growing Environment βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
When Does Self-Supervision Help Graph Convolutional Networks? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
When Explanations Lie: Why Many Modified BP Attributions Fail ❌ βœ… βœ… ❌ βœ… ❌ ❌ 3
When are Non-Parametric Methods Robust? βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
When deep denoising meets iterative phase retrieval βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Which Tasks Should Be Learned Together in Multi-task Learning? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Why Are Learned Indexes So Effective? ❌ βœ… βœ… ❌ βœ… ❌ ❌ 3
Why bigger is not always better: on finite and infinite neural networks ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Word-Level Speech Recognition With a Letter to Word Encoder ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Working Memory Graphs ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
XtarNet: Learning to Extract Task-Adaptive Representation for Incremental Few-Shot Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Zeno++: Robust Fully Asynchronous SGD βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
k-means++: few more steps yield constant approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
p-Norm Flow Diffusion for Local Graph Clustering βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
β€œOther-Play” for Zero-Shot Coordination ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3