Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

International Conference on Machine Learning (ICML) - 2022

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025)
Documentation Score Global mean is the average score over the seven reproducibility variables for empirical research papers.
% Empirical Percentage of papers that are empirical research vs theoretical research
% Industry Percentage of empirical research papers with at least one author from Industry
Website
ICML 2022 1233 0.58 3.97 92.94% 42.84%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
$p$-Laplacian Based Graph Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
(Non-)Convergence Results for Predictive Coding Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
3D Infomax improves GNNs for Molecular Property Prediction ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
3DLinker: An E(3) Equivariant Variational Autoencoder for Molecular Linker Design ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
3PC: Three Point Compressors for Communication-Efficient Distributed Training and a Better Theory for Lazy Aggregation βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
A Branch and Bound Framework for Stronger Adversarial Attacks of ReLU Networks ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
A Closer Look at Smoothness in Domain Adversarial Training βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
A Completely Tuning-Free and Robust Approach to Sparse Precision Matrix Estimation βœ… ❌ βœ… ❌ ❌ βœ… βœ… 4
A Consistent and Efficient Evaluation Strategy for Attribution Methods ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
A Context-Integrated Transformer-Based Neural Network for Auction Design βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
A Convergence Theory for SVGD in the Population Limit under Talagrand’s Inequality T1 βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Convergent and Dimension-Independent Min-Max Optimization Algorithm βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
A Deep Learning Approach for the Segmentation of Electroencephalography Data in Eye Tracking Applications ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
A Difference Standardization Method for Mutual Transfer Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
A Differential Entropy Estimator for Training Neural Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
A Dynamical System Perspective for Lipschitz Neural Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Framework for Learning to Request Rich and Contextually Useful Information from Humans ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
A Functional Information Perspective on Model Interpretation ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
A General Recipe for Likelihood-free Bayesian Optimization ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Hierarchical Transitive-Aligned Graph Kernel for Un-attributed Graphs ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
A Joint Exponential Mechanism For Differentially Private Top-$k$ βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Langevin-like Sampler for Discrete Distributions βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
A Model-Agnostic Randomized Learning Framework based on Random Hypothesis Subspace Sampling βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Modern Self-Referential Weight Matrix That Learns to Modify Itself ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
A Multi-objective / Multi-task Learning Framework Induced by Pareto Stationarity βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
A Natural Actor-Critic Framework for Zero-Sum Markov Games βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Neural Tangent Kernel Perspective of GANs ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
A New Perspective on the Effects of Spectrum in Graph Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
A Parametric Class of Approximate Gradient Updates for Policy Optimization ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A Psychological Theory of Explainability ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
A Random Matrix Analysis of Data Stream Clustering: Coping With Limited Memory Resources βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
A Reduction from Linear Contextual Bandits Lower Bounds to Estimations Lower Bounds ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A Regret Minimization Approach to Multi-Agent Control βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
A Resilient Distributed Boosting Algorithm βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Rigorous Study of Integrated Gradients Method and Extensions to Internal Neuron Attributions ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Simple Guard for Learned Optimizers βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
A Simple Reward-free Approach to Constrained Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Simple Unified Framework for High Dimensional Bandit Problems βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Simple yet Universal Strategy for Online Convex Optimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
A Single-Loop Gradient Descent and Perturbed Ascent Algorithm for Nonconvex Functional Constrained Optimization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Statistical Manifold Framework for Point Cloud Data ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
A Stochastic Multi-Rate Control Framework For Modeling Distributed Optimization Algorithms ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
A Study of Face Obfuscation in ImageNet ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
A Study on the Ramanujan Graph Property of Winning Lottery Tickets βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Temporal-Difference Approach to Policy Gradient Estimation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
A Theoretical Comparison of Graph Neural Network Extensions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A Tighter Analysis of Spectral Clustering, and Beyond ❌ βœ… βœ… ❌ βœ… ❌ ❌ 3
A Tree-based Model Averaging Approach for Personalized Treatment Effect Estimation from Heterogeneous Data Sources βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
A Unified View on PAC-Bayes Bounds for Meta-Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A Unified Weight Initialization Paradigm for Tensorial Convolutional Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
A data-driven approach for learning to control computers ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A deep convolutional neural network that is invariant to time rescaling ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A new similarity measure for covariate shift with applications to nonparametric regression ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A query-optimal algorithm for finding counterfactuals βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
AGNAS: Attention-Guided Micro and Macro-Architecture Search βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
ASAP.SGD: Instance-based Adaptiveness to Staleness in Asynchronous SGD βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Accelerated Federated Learning with Decoupled Adaptive Optimization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Accelerated Gradient Methods for Geodesically Convex Optimization: Tractable Algorithms and Convergence Analysis βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Accelerated, Optimal and Parallel: Some results on model-based stochastic optimization ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Accelerating Shapley Explanation via Contributive Cooperator Selection βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Accurate Quantization of Measures via Interacting Particle-based Optimization βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Achieving Fairness at No Utility Cost via Data Reweighing with Influence βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Achieving Minimax Rates in Pool-Based Batch Active Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Action-Sufficient State Representation Learning for Control with Structural Constraints βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Active Multi-Task Representation Learning βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Active Nearest Neighbor Regression Through Delaunay Refinement βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Active Sampling for Min-Max Fairness βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Active fairness auditing βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
ActiveHedge: Hedge meets Active Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Actor-Critic based Improper Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
AdAUC: End-to-end Adversarial AUC Optimization Against Long-tail Problems βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
AdaGrad Avoids Saddle Points ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Adapting k-means Algorithms for Outliers βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adapting the Linearised Laplace Model Evidence for Modern Deep Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Adapting to Mixing Time in Stochastic Optimization with Markovian Data βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Adaptive Accelerated (Extra-)Gradient Methods with Variance Reduction βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Adaptive Conformal Predictions for Time Series βœ… βœ… ❌ βœ… ❌ ❌ βœ… 4
Adaptive Data Analysis with Correlated Observations βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Adaptive Gaussian Process Change Point Detection βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adaptive Model Design for Markov Decision Process βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Adaptive Random Walk Gradient Descent for Decentralized Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Adaptive Second Order Coresets for Data-efficient Machine Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Additive Gaussian Processes Revisited βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
Adversarial Attack and Defense for Non-Parametric Two-Sample Tests βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Adversarial Attacks on Gaussian Process Bandits ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
Adversarial Masking for Self-Supervised Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Adversarial Robustness against Multiple and Single $l_p$-Threat Models via Quick Fine-Tuning of Robust Classifiers ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Adversarial Vulnerability of Randomized Ensembles βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adversarially Robust Models may not Transfer Better: Sufficient Conditions for Domain Transferability from the View of Regularization ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Adversarially Trained Actor Critic for Offline Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Adversarially trained neural representations are already as robust as biological neural representations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Agnostic Learnability of Halfspaces via Logistic Loss ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Algorithms for the Communication of Samples βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
An Analytical Update Rule for General Policy Optimization ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
An Asymptotic Test for Conditional Independence using Analytic Kernel Embeddings ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
An Equivalence Between Data Poisoning and Byzantine Gradient Attacks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
An Exact Symbolic Reduction of Linear Smart Predict+Optimize to Mixed Integer Linear Programming βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
An Initial Alignment between Neural Network and Target is Needed for Gradient Descent to Learn ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
An Intriguing Property of Geophysics Inversion ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
An iterative clustering algorithm for the Contextual Stochastic Block Model with optimality guarantees βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Analysis of Stochastic Processes through Replay Buffers βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Analyzing and Mitigating Interference in Neural Architecture Search ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Anarchic Federated Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Antibody-Antigen Docking and Design via Hierarchical Structure Refinement ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Anticorrelated Noise Injection for Improved Generalization ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Anytime Information Cascade Popularity Prediction via Self-Exciting Processes ❌ βœ… βœ… βœ… βœ… βœ… ❌ 5
Approximate Bayesian Computation with Domain Expert in the Loop βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Approximate Frank-Wolfe Algorithms over Graph-structured Support Sets βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Approximately Equivariant Networks for Imperfectly Symmetric Dynamics ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Architecture Agnostic Federated Learning for Neural Networks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Asking for Knowledge (AFK): Training RL Agents to Query External Knowledge Using Language ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Asymptotically-Optimal Gaussian Bandits with Side Observations βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Attentional Meta-learners for Few-shot Polythetic Classification βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Augment with Care: Contrastive Learning for Combinatorial Problems ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
AutoIP: A United Framework to Integrate Physics into Gaussian Processes ❌ ❌ βœ… ❌ ❌ βœ… βœ… 3
AutoSNN: Towards Energy-Efficient Spiking Neural Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Auxiliary Learning with Joint Task and Data Scheduling βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
BAMDT: Bayesian Additive Semi-Multivariate Decision Trees for Nonparametric Regression βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
BabelTower: Learning to Auto-parallelized Program Translation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Balancing Discriminability and Transferability for Source-Free Domain Adaptation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Balancing Sample Efficiency and Suboptimality in Inverse Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Batch Greenkhorn Algorithm for Entropic-Regularized Multimarginal Optimal Transport: Linear Rate of Convergence and Iteration Complexity βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Batched Dueling Bandits βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Bayesian Continuous-Time Tucker Decomposition βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Bayesian Deep Embedding Topic Meta-Learner βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Bayesian Imitation Learning for End-to-End Mobile Manipulation ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Bayesian Learning with Information Gain Provably Bounds Risk for a Robust Adversarial Defense βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Bayesian Model Selection, the Marginal Likelihood, and Generalization ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Bayesian Nonparametric Learning for Point Processes with Spatial Homogeneity: A Spatial Analysis of NBA Shot Locations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Bayesian Nonparametrics for Offline Skill Discovery βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bayesian Optimization for Distributionally Robust Chance-constrained Problem βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Bayesian Optimization under Stochastic Delayed Feedback βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Be Like Water: Adaptive Floating Point for Machine Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Being Properly Improper βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Benchmarking and Analyzing Point Cloud Classification under Corruptions ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Beyond Images: Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Biological Sequence Design with GFlowNets βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Bit Prioritization in Variational Autoencoders via Progressive Coding ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Black-Box Tuning for Language-Model-as-a-Service ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Boosting Graph Structure Learning with Dummy Nodes βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bounding Training Data Reconstruction in Private (Deep) Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bounding the Width of Neural Networks via Coupled Initialization A Worst Case Analysis ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Branching Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Breaking Down Out-of-Distribution Detection: Many Methods Based on OOD Training Data Estimate a Combination of the Same Core Quantities ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Breaking the $\sqrtT$ Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Bregman Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Bregman Power k-Means for Clustering Exponential Family Data βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bregman Proximal Langevin Monte Carlo via Bregman-Moreau Envelopes βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Building Robust Ensembles via Margin Boosting βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Burst-Dependent Plasticity and Dendritic Amplification Support Target-Based Learning and Hierarchical Imitation Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
ButterflyFlow: Building Invertible Layers with Butterfly Matrices ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Byzantine Machine Learning Made Easy By Resilient Averaging of Momentums βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
C*-algebra Net: A New Approach Generalizing Neural Network Parameters to C*-algebra ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
C-MinHash: Improving Minwise Hashing with Circulant Permutation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
CITRIS: Causal Identifiability from Temporal Intervened Sequences ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
COAT: Measuring Object Compositionality in Emergent Representations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
COLA: Consistent Learning with Opponent-Learning Awareness ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Calibrated Learning to Defer with One-vs-All Classifiers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Calibrated and Sharp Uncertainties in Deep Learning via Density Estimation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Cascaded Gaps: Towards Logarithmic Regret for Risk-Sensitive Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Causal Conceptions of Fairness and their Consequences βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Causal Dynamics Learning for Task-Independent State Abstraction ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Causal Imitation Learning under Temporally Correlated Noise βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Causal Inference Through the Structural Causal Marginal Problem ❌ βœ… ❌ ❌ ❌ βœ… βœ… 3
Causal Transformer for Estimating Counterfactual Outcomes βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Causal structure-based root cause analysis of outliers ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Centroid Approximation for Bootstrap: Improving Particle Quality at Inference βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
CerDEQ: Certifiable Deep Equilibrium Model ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Certified Adversarial Robustness Under the Bounded Support Set βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Certified Neural Network Watermarks with Randomized Smoothing βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Certified Robustness Against Natural Language Attacks by Causal Intervention βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Certifying Out-of-Domain Generalization for Blackbox Functions ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Channel Importance Matters in Few-Shot Image Classification ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Characterizing and Overcoming the Greedy Nature of Learning in Multi-modal Deep Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Choosing Answers in Epsilon-Best-Answer Identification for Linear Bandits βœ… βœ… ❌ ❌ ❌ βœ… βœ… 4
Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Closed-Form Diffeomorphic Transformations for Time Series Alignment ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Co-training Improves Prompt-based Learning for Large Language Models βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Coin Flipping Neural Networks ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
Combining Diverse Feature Priors βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Communicating via Markov Decision Processes βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Communication-Efficient Adaptive Federated Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Communication-efficient Distributed Learning for Large Batch Optimization βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Composing Partial Differential Equations with Physics-Aware Neural Networks ❌ βœ… ❌ βœ… βœ… ❌ βœ… 4
Comprehensive Analysis of Negative Sampling in Knowledge Graph Representation Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Conditional GANs with Auxiliary Discriminative Classifier ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Confidence Score for Source-Free Unsupervised Domain Adaptation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Conformal Prediction Sets with Limited False Positives βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Congested Bandits: Optimal Routing via Short-term Resets βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Connect, Not Collapse: Explaining Contrastive Learning for Unsupervised Domain Adaptation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
Consistent Polyhedral Surrogates for Top-k Classification and Variants ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Constants Matter: The Performance Gains of Active Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Constrained Discrete Black-Box Optimization using Mixed-Integer Programming βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Constrained Gradient Descent: A Powerful and Principled Evasion Attack Against Neural Networks βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Constrained Offline Policy Optimization βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Constrained Optimization with Dynamic Bound-scaling for Effective NLP Backdoor Defense βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Constrained Variational Policy Optimization for Safe Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Constraint-based graph network simulator βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Content Addressable Memory Without Catastrophic Forgetting by Heteroassociation with a Fixed Scaffold ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Context-Aware Drift Detection βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Contextual Bandits with Large Action Spaces: Made Practical βœ… βœ… βœ… ❌ βœ… ❌ ❌ 4
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Contextual Information-Directed Sampling ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Continual Learning via Sequential Function-Space Variational Inference ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Continual Learning with Guarantees via Weight Interval Constraints βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Continual Repeated Annealed Flow Transport Monte Carlo βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Continuous Control with Action Quantization from Demonstrations ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Continuous-Time Analysis of Accelerated Gradient Methods via Conservation Laws in Dilated Coordinate Systems ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Continuous-Time Modeling of Counterfactual Outcomes Using Neural Controlled Differential Equations ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Contrastive Learning with Boosted Memorization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Contrastive Mixture of Posteriors for Counterfactual Inference, Data Integration and Fairness ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Controlling Conditional Language Models without Catastrophic Forgetting βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Convergence Rates of Non-Convex Stochastic Gradient Descent Under a Generic Lojasiewicz Condition and Local Smoothness βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Convergence and Recovery Guarantees of the K-Subspaces Method for Subspace Clustering βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Convergence of Invariant Graph Networks ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Convergence of Uncertainty Sampling for Active Learning βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Convolutional and Residual Networks Provably Contain Lottery Tickets βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Cooperative Online Learning in Stochastic and Adversarial MDPs βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Coordinated Double Machine Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Correct-N-Contrast: a Contrastive Approach for Improving Robustness to Spurious Correlations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Correlated Quantization for Distributed Mean Estimation and Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Correlation Clustering via Strong Triadic Closure Labeling: Fast Approximation Algorithms and Practical Lower Bounds βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Counterfactual Prediction for Outcome-Oriented Treatments βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Counterfactual Transportability: A Formal Approach βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Cross-Space Active Learning on Graph Convolutional Networks βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Curriculum Reinforcement Learning via Constrained Optimal Transport βœ… βœ… ❌ ❌ βœ… βœ… βœ… 5
Cycle Representation Learning for Inductive Relation Prediction ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
DAVINZ: Data Valuation using Deep Neural Networks at Initialization βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
DNA: Domain Generalization with Diversified Neural Averaging βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
DNNR: Differential Nearest Neighbors Regression βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
DRIBO: Robust Deep Reinforcement Learning via Multi-View Information Bottleneck βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Data Augmentation as Feature Manipulation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP) ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Data Scaling Laws in NMT: The Effect of Noise and Architecture ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Data-SUITE: Data-centric identification of in-distribution incongruous examples βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Datamodels: Understanding Predictions with Data and Data with Predictions ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Dataset Condensation via Efficient Synthetic-Data Parameterization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Dataset Condensation with Contrastive Signals βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
De novo mass spectrometry peptide sequencing with a transformer model ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Debiaser Beware: Pitfalls of Centering Regularized Transport Maps ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Decentralized Online Convex Optimization in Networked Systems βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Deciphering Lasso-based Classification Through a Large Dimensional Analysis of the Iterative Soft-Thresholding Algorithm βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Decision-Focused Learning: Through the Lens of Learning to Rank βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Decomposing Temporal High-Order Interactions via Latent ODEs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Deconfounded Value Decomposition for Multi-Agent Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Deduplicating Training Data Mitigates Privacy Risks in Language Models ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Deep Causal Metric Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Deep Hierarchy in Bandits βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Deep Network Approximation in Terms of Intrinsic Parameters ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Deep Networks on Toroids: Removing Symmetries Reveals the Structure of Flat Regions in the Landscape Geometry βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Deep Neural Network Fusion via Graph Matching with Applications to Model Ensemble and Federated Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Deep Probability Estimation βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Deep Reference Priors: What is the best way to pretrain a model? βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Deep Safe Incomplete Multi-view Clustering: Theorem and Algorithm βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Deep Squared Euclidean Approximation to the Levenshtein Distance for DNA Storage ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Deep Variational Graph Convolutional Recurrent Network for Multivariate Time Series Anomaly Detection βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Deep and Flexible Graph Neural Architecture Search βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Deep equilibrium networks are sensitive to initialization statistics ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Deep symbolic regression for recurrence prediction ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Delay-Adaptive Step-sizes for Asynchronous Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Delayed Reinforcement Learning by Imitation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Deletion Robust Submodular Maximization over Matroids βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Demystifying the Adversarial Robustness of Random Transformation Defenses βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Denoised MDPs: Learning World Models Better Than the World Itself βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced Representations βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Describing Differences between Text Distributions with Natural Language ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Detached Error Feedback for Distributed SGD with Random Sparsification βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Detecting Adversarial Examples Is (Nearly) As Hard As Classifying Them ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Detecting Corrupted Labels Without Training a Model to Predict βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Dialog Inpainting: Turning Documents into Dialogs ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Difference Advantage Estimation for Multi-Agent Policy Gradients βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Differentiable Top-k Classification Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Differentially Private Approximate Quantiles βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Differentially Private Community Detection for Stochastic Block Models βœ… ❌ βœ… ❌ ❌ βœ… βœ… 4
Differentially Private Coordinate Descent for Composite Empirical Risk Minimization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Differentially Private Maximal Information Coefficients ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Diffusion Models for Adversarial Purification ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Diffusion bridges vector quantized variational autoencoders βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Dimension-free Complexity Bounds for High-order Nonconvex Finite-sum Optimization βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Direct Behavior Specification via Constrained Reinforcement Learning βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Directed Acyclic Transformer for Non-Autoregressive Machine Translation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
DisPFL: Towards Communication-Efficient Personalized Federated Learning via Decentralized Sparse Training βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Discrete Probabilistic Inverse Optimal Transport βœ… ❌ βœ… ❌ ❌ βœ… βœ… 4
Discrete Tree Flows via Tree-Structured Permutations βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Disentangled Federated Learning for Tackling Attributes Skew via Invariant Aggregation and Diversity Transferring βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Disentangling Disease-related Representation from Obscure for Disease Prediction ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Disentangling Sources of Risk for Distributional Multi-Agent Reinforcement Learning βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Distinguishing rule and exemplar-based generalization in learning systems ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Distribution Regression with Sliced Wasserstein Kernels βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Distributionally Robust $Q$-Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Distributionally-Aware Kernelized Bandit Problems for Risk Aversion βœ… βœ… ❌ βœ… βœ… ❌ βœ… 5
Divergence-Regularized Multi-Agent Actor-Critic βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Diversified Adversarial Attacks based on Conjugate Gradient Method βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Do Differentiable Simulators Give Better Policy Gradients? ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Do More Negative Samples Necessarily Hurt In Contrastive Learning? ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Does the Data Induce Capacity Control in Deep Learning? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Domain Adaptation for Time Series Forecasting via Attention Sharing βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Double Sampling Randomized Smoothing βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning βœ… βœ… ❌ βœ… ❌ ❌ βœ… 4
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Dual Perspective of Label-Specific Feature Learning for Multi-Label Classification ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
DynaMixer: A Vision MLP Architecture with Dynamic Mixing βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Dynamic Regret of Online Markov Decision Processes βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Dynamic Topic Models for Temporal Document Networks βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Learning ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Easy Variational Inference for Categorical Models via an Independent Binary Approximation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Efficient Approximate Inference for Stationary Kernel on Frequency Domain βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Efficient Computation of Higher-Order Subgraph Attribution via Message Passing βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Efficient Distributionally Robust Bayesian Optimization with Worst-case Sensitivity βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Efficient Learning for AlphaZero via Path Consistency βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Efficient Learning of CNNs using Patch Based Features βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Efficient Low Rank Convex Bounds for Pairwise Discrete Graphical Models ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Efficient Online ML API Selection for Multi-Label Classification Tasks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Efficient PAC Learning from the Crowd with Pairwise Comparisons βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Efficient Representation Learning via Adaptive Context Pooling ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Efficient Test-Time Model Adaptation without Forgetting βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Efficient Variance Reduction for Meta-learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Efficiently Learning the Topology and Behavior of a Networked Dynamical System Via Active Queries βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
End-to-End Balancing for Causal Continuous Treatment-Effect Estimation βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Entropic Causal Inference: Graph Identifiability βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Entropic Gromov-Wasserstein between Gaussian Distributions ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
EqR: Equivariant Representations for Data-Efficient Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Equivariance versus Augmentation for Spherical Images ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Equivariant Diffusion for Molecule Generation in 3D βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Equivariant Priors for compressed sensing with unknown orientation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Equivariant Quantum Graph Circuits ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Error-driven Input Modulation: Solving the Credit Assignment Problem without a Backward Pass βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Estimating Instance-dependent Bayes-label Transition Matrix using a Deep Neural Network βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Estimating and Penalizing Induced Preference Shifts in Recommender Systems βœ… ❌ ❌ βœ… βœ… βœ… βœ… 5
Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Estimation in Rotationally Invariant Generalized Linear Models via Approximate Message Passing ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Evaluating the Adversarial Robustness of Adaptive Test-time Defenses ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Evolving Curricula with Regret-Based Environment Design βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Exact Learning of Preference Structure: Single-peaked Preferences and Beyond ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Exact Optimal Accelerated Complexity for Fixed-Point Iterations ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Examining Scaling and Transfer of Language Model Architectures for Machine Translation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Exploiting Independent Instruments: Identification and Distribution Generalization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Exploiting Redundancy: Separable Group Convolutional Networks on Lie Groups ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Exploring and Exploiting Hubness Priors for High-Quality GAN Latent Sampling βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Exploring the Gap between Collapsed & Whitened Features in Self-Supervised Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Extended Unconstrained Features Model for Exploring Deep Neural Collapse ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Extracting Latent State Representations with Linear Dynamics from Rich Observations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
FITNESS: (Fine Tune on New and Similar Samples) to detect anomalies in streams with drift and outliers βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
FOCUS: Familiar Objects in Common and Uncommon Settings ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Failure and success of the spectral bias prediction for Laplace Kernel Ridge Regression: the case of low-dimensional data ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Fair Generalized Linear Models with a Convex Penalty βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Fair Representation Learning through Implicit Path Alignment βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Fair and Fast k-Center Clustering for Data Summarization βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Fairness Interventions as (Dis)Incentives for Strategic Manipulation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Fairness with Adaptive Weights βœ… ❌ βœ… βœ… ❌ ❌ ❌ 3
Fast Aquatic Swimmer Optimization with Differentiable Projective Dynamics and Neural Network Hydrodynamic Models ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Fast Composite Optimization and Statistical Recovery in Federated Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Fast Finite Width Neural Tangent Kernel ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Fast Lossless Neural Compression with Integer-Only Discrete Flows βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Fast Population-Based Reinforcement Learning on a Single Machine βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Fast Provably Robust Decision Trees and Boosting βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Fast Relative Entropy Coding with A* coding βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Fast and Provable Nonconvex Tensor RPCA βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Fast and Reliable Evaluation of Adversarial Robustness with Minimum-Margin Attack βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Fast rates for noisy interpolation require rethinking the effect of inductive bias ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Fast-Rate PAC-Bayesian Generalization Bounds for Meta-Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Faster Algorithms for Learning Convex Functions βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Faster Fundamental Graph Algorithms via Learned Predictions βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Faster Privacy Accounting via Evolving Discretization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Fat–Tailed Variational Inference with Anisotropic Tail Adaptive Flows ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Feature Learning and Signal Propagation in Deep Neural Networks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Feature Space Particle Inference for Neural Network Ensembles βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Feature and Parameter Selection in Stochastic Linear Bandits βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Feature selection using e-values βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
FedNL: Making Newton-Type Methods Applicable to Federated Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
FedNest: Federated Bilevel, Minimax, and Compositional Optimization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
FedScale: Benchmarking Model and System Performance of Federated Learning at Scale βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Federated Learning with Label Distribution Skew via Logits Calibration βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Federated Learning with Partial Model Personalization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Federated Learning with Positive and Unlabeled Data βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Federated Minimax Optimization: Improved Convergence Analyses and Algorithms βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Fenrir: Physics-Enhanced Regression for Initial Value Problems ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Fictitious Play and Best-Response Dynamics in Identical Interest and Zero-Sum Stochastic Games βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Finding Global Homophily in Graph Neural Networks When Meeting Heterophily βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Finite-Sum Coupled Compositional Stochastic Optimization: Theory and Applications βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Fisher SAM: Information Geometry and Sharpness Aware Minimisation βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Fishing for User Data in Large-Batch Federated Learning via Gradient Magnification βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Fishr: Invariant Gradient Variances for Out-of-Distribution Generalization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Flashlight: Enabling Innovation in Tools for Machine Learning βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Flow-Guided Sparse Transformer for Video Deblurring ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Flow-based Recurrent Belief State Learning for POMDPs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Flowformer: Linearizing Transformers with Conservation Flows βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics for Convex Losses in High-Dimension ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Forget-free Continual Learning with Winning Subnetworks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Forward Operator Estimation in Generative Models with Kernel Transfer Operators βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Fourier Learning with Cyclical Data βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Framework for Evaluating Faithfulness of Local Explanations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
FriendlyCore: Practical Differentially Private Aggregation βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
From Noisy Prediction to True Label: Noisy Prediction Calibration via Generative Model βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
From data to functa: Your data point is a function and you can treat it like one βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Frustratingly Easy Transferability Estimation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Fully-Connected Network on Noncompact Symmetric Space and Ridgelet Transform based on Helgason-Fourier Analysis ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Function-space Inference with Sparse Implicit Processes ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Functional Generalized Empirical Likelihood Estimation for Conditional Moment Restrictions βœ… βœ… ❌ βœ… ❌ ❌ βœ… 4
Functional Output Regression with Infimal Convolution: Exploring the Huber and $Ξ΅$-insensitive Losses βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
G$^2$CN: Graph Gaussian Convolution Networks with Concentrated Graph Filters ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
G-Mixup: Graph Data Augmentation for Graph Classification βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
GACT: Activation Compressed Training for Generic Network Architectures βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
GALAXY: Graph-based Active Learning at the Extreme βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
GNNRank: Learning Global Rankings from Pairwise Comparisons via Directed Graph Neural Networks βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized Smoothing ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Gaussian Mixture Variational Autoencoder with Contrastive Learning for Multi-Label Classification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for Safety-Critical Applications ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
GenLabel: Mixup Relabeling using Generative Models βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
General-purpose, long-context autoregressive modeling with Perceiver AR ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Generalised Policy Improvement with Geometric Policy Composition βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Generalization Bounds using Lower Tail Exponents in Stochastic Optimizers βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Generalization and Robustness Implications in Object-Centric Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Generalized Beliefs for Cooperative AI ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Generalized Data Distribution Iteration βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Generalized Federated Learning via Sharpness Aware Minimization βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Generalized Leverage Scores: Geometric Interpretation and Applications βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Generalized Results for the Existence and Consistency of the MLE in the Bradley-Terry-Luce Model ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Generalized Strategic Classification and the Case of Aligned Incentives βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Generalizing Gaussian Smoothing for Random Search ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Generalizing to Evolving Domains with Latent Structure-Aware Sequential Autoencoder βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Generalizing to New Physical Systems via Context-Informed Dynamics Model βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Generating 3D Molecules for Target Protein Binding ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Generating Distributional Adversarial Examples to Evade Statistical Detectors ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Generative Coarse-Graining of Molecular Conformations ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Generative Cooperative Networks for Natural Language Generation βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Generative Flow Networks for Discrete Probabilistic Modeling βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Generative Modeling for Multi-task Visual Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Generative Trees: Adversarial and Copycat βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Generic Coreset for Scalable Learning of Monotonic Kernels: Logistic Regression, Sigmoid and more βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Geometric Multimodal Contrastive Representation Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Global Optimization Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Global Optimization of K-Center Clustering βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Goal Misgeneralization in Deep Reinforcement Learning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Going Deeper into Permutation-Sensitive Graph Neural Networks ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Gradient Based Clustering ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Gradient Descent on Neurons and its Link to Approximate Second-order Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Gradient-Free Method for Heavily Constrained Nonconvex Optimization βœ… ❌ ❌ βœ… βœ… ❌ βœ… 4
Graph Neural Architecture Search Under Distribution Shifts βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Graph-Coupled Oscillator Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
GraphFM: Improving Large-Scale GNN Training via Feature Momentum βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Greedy when Sure and Conservative when Uncertain about the Opponents βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
H-Consistency Bounds for Surrogate Loss Minimizers ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Hardness and Algorithms for Robust and Sparse Optimization βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Hermite Polynomial Features for Private Data Generation ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Hessian-Free High-Resolution Nesterov Acceleration For Sampling βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Hierarchical Shrinkage: Improving the accuracy and interpretability of tree-based models. ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
High Probability Guarantees for Nonconvex Stochastic Gradient Descent with Heavy Tails βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Hindering Adversarial Attacks with Implicit Neural Representations βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
History Compression via Language Models in Reinforcement Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
HousE: Knowledge Graph Embedding with Householder Parameterization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
How Faithful is your Synthetic Data? Sample-level Metrics for Evaluating and Auditing Generative Models ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
How Powerful are Spectral Graph Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
How Tempering Fixes Data Augmentation in Bayesian Neural Networks ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
How to Fill the Optimum Set? Population Gradient Descent with Harmless Diversity βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
How to Leverage Unlabeled Data in Offline Reinforcement Learning ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
How to Steer Your Adversary: Targeted and Efficient Model Stealing Defenses with Gradient Redirection βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
How to Train Your Wide Neural Network Without Backprop: An Input-Weight Alignment Perspective βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
HyperImpute: Generalized Iterative Imputation with Automatic Model Selection βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
HyperPrompt: Prompt-based Task-Conditioning of Transformers ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
IDYNO: Learning Nonparametric DAGs from Interventional Dynamic Data ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Identifiability Conditions for Domain Adaptation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Identification of Linear Non-Gaussian Latent Hierarchical Structure βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Identity-Disentangled Adversarial Augmentation for Self-supervised Learning ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Imitation Learning by Estimating Expertise of Demonstrators ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Implicit Bias of Linear Equivariant Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Implicit Bias of the Step Size in Linear Diagonal Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Implicit Regularization with Polynomial Growth in Deep Tensor Factorization ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Importance Weighted Kernel Bayes’ Rule βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Improve Single-Point Zeroth-Order Optimization Using High-Pass and Low-Pass Filters ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Improved Rates for Differentially Private Stochastic Convex Optimization with Heavy-Tailed Data βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Improved Regret for Differentially Private Exploration in Linear MDP βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Improved StyleGAN-v2 based Inversion for Out-of-Distribution Images βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Improving Adversarial Robustness via Mutual Information Estimation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Improving Ensemble Distillation With Weight Averaging and Diversifying Perturbation βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Improving Language Models by Retrieving from Trillions of Tokens βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Improving Mini-batch Optimal Transport via Partial Transportation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Improving Out-of-Distribution Robustness via Selective Augmentation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Improving Policy Optimization with Generalist-Specialist Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Improving Robustness against Real-World and Worst-Case Distribution Shifts through Decision Region Quantification βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Improving Screening Processes via Calibrated Subset Selection βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Improving Task-free Continual Learning by Distributionally Robust Memory Evolution βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Improving Transformers with Probabilistic Attention Keys ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
In defense of dual-encoders for neural ranking ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Individual Preference Stability for Clustering βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Individual Reward Assisted Multi-Agent Reinforcement Learning βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Inducing Causal Structure for Interpretable Neural Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Inductive Biases and Variable Creation in Self-Attention Mechanisms ❌ ❌ ❌ βœ… βœ… ❌ βœ… 3
Inductive Matrix Completion: No Bad Local Minima and a Fast Algorithm βœ… βœ… ❌ ❌ ❌ βœ… βœ… 4
Inferring Cause and Effect in the Presence of Heteroscedastic Noise βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems βœ… ❌ ❌ ❌ βœ… ❌ ❌ 2
Information Discrepancy in Strategic Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Informed Learning by Wide Neural Networks: Convergence, Generalization and Sampling Complexity βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Injecting Logical Constraints into Neural Networks via Straight-Through Estimators ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Input Dependent Sparse Gaussian Processes βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Input-agnostic Certified Group Fairness via Gaussian Parameter Smoothing ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Instance Dependent Regret Analysis of Kernelized Bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Instrumental Variable Regression with Confounder Balancing βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Interactive Correlation Clustering with Existential Cluster Constraints βœ… ❌ βœ… βœ… ❌ βœ… βœ… 5
Interactive Inverse Reinforcement Learning for Cooperative Games βœ… βœ… ❌ ❌ βœ… βœ… βœ… 5
Interactively Learning Preference Constraints in Linear Bandits βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Interpretable Off-Policy Learning via Hyperbox Search βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Interventional Contrastive Learning with Meta Semantic Regularizer βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Intriguing Properties of Input-Dependent Randomized Smoothing βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Invariant Ancestry Search βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Inverse Contextual Bandits: Learning How Behavior Evolves over Time βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Investigating Generalization by Controlling Normalized Margin βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Investigating Why Contrastive Learning Benefits Robustness against Label Noise ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Iterative Double Sketching for Faster Least-Squares Optimization βœ… ❌ ❌ ❌ ❌ βœ… βœ… 3
Iterative Hard Thresholding with Adaptive Regularization: Sparser Solutions Without Sacrificing Runtime βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
It’s Raw! Audio Generation with State-Space Models ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Kernel Methods for Radial Transformed Compositional Data with Many Zeros ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Kill a Bird with Two Stones: Closing the Convergence Gaps in Non-Strongly Convex Optimization by Directly Accelerated SVRG with Double Compensation and Snapshots βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Knowledge Base Question Answering by Case-based Reasoning over Subgraphs ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Knowledge-Grounded Self-Rationalization via Extractive and Natural Language Explanations ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
LCANets: Lateral Competition Improves Robustness Against Corruption and Attack ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
LIMO: Latent Inceptionism for Targeted Molecule Generation βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
LSB: Local Self-Balancing MCMC in Discrete Spaces βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Label Ranking through Nonparametric Regression βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
Label-Descriptive Patterns and Their Application to Characterizing Classification Errors βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Label-Free Explainability for Unsupervised Models βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Langevin Monte Carlo for Contextual Bandits βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Large Batch Experience Replay βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Large-Scale Graph Neural Architecture Search βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Large-scale Stochastic Optimization of NDCG Surrogates for Deep Learning with Provable Convergence βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Latent Diffusion Energy-Based Model for Interpretable Text Modelling βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Latent Outlier Exposure for Anomaly Detection with Contaminated Data βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Lazy Estimation of Variable Importance for Large Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Augmented Binary Search Trees ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
Learning Bellman Complete Representations for Offline Policy Evaluation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Domain Adaptive Object Detection with Probabilistic Teacher βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning Dynamics and Generalization in Deep Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning Efficient and Robust Ordinary Differential Equations via Invertible Neural Networks βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Learning General Halfspaces with Adversarial Label Noise via Online Gradient Descent βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning Iterative Reasoning through Energy Minimization βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Learning Mixtures of Linear Dynamical Systems βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning Multiscale Transformer Models for Sequence Generation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning Stable Classifiers by Transferring Unstable Features ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning Stochastic Shortest Path with Linear Function Approximation βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning Symmetric Embeddings for Equivariant World Models ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Learning fair representation with a parametric integral probability metric βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning from Counterfactual Links for Link Prediction βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Learning from a Learning User for Optimal Recommendations βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Learning inverse folding from millions of predicted structures ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning of Cluster-based Feature Importance for Electronic Health Record Time-series ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning to Cut by Looking Ahead: Cutting Plane Selection via Imitation Learning ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
Learning to Estimate and Refine Fluid Motion with Physical Dynamics ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Learning to Hash Robustly, Guaranteed βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Learning to Incorporate Texture Saliency Adaptive Attention to Image Cartoonization ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning to Infer Structures of Network Games ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning to Separate Voices by Spatial Regions βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Learning to Solve PDE-constrained Inverse Problems with Graph Networks ❌ ❌ ❌ βœ… βœ… ❌ βœ… 3
Learning-based Optimisation of Particle Accelerators Under Partial Observability Without Real-World Training ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
Least Squares Estimation using Sketched Data with Heteroskedastic Errors ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Let Invariant Rationale Discovery Inspire Graph Contrastive Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Leverage Score Sampling for Tensor Product Matrices in Input Sparsity Time βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill Diversity βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Lie Point Symmetry Data Augmentation for Neural PDE Solvers ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Lightweight Projective Derivative Codes for Compressed Asynchronous Gradient Descent βœ… ❌ ❌ ❌ βœ… ❌ ❌ 2
Linear Adversarial Concept Erasure βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Linear Bandit Algorithms with Sublinear Time Complexity βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Linear Complexity Randomized Self-attention Mechanism βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Linear-Time Gromov Wasserstein Distances using Low Rank Couplings and Costs βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Local Augmentation for Graph Neural Networks βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Local Linear Convergence of Douglas-Rachford for Linear Programming: a Probabilistic Analysis βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Locally Sparse Neural Networks for Tabular Biomedical Data βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Log-Euclidean Signatures for Intrinsic Distances Between Unaligned Datasets βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Loss Function Learning for Domain Generalization by Implicit Gradient βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Low-Complexity Deep Convolutional Neural Networks on Fully Homomorphic Encryption Using Multiplexed Parallel Convolutions βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Low-Precision Stochastic Gradient Langevin Dynamics βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
LyaNet: A Lyapunov Framework for Training Neural ODEs βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
MAML and ANIL Provably Learn Representations ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Making Linear MDPs Practical via Contrastive Representation Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Marginal Tail-Adaptive Normalizing Flows βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Markov Chain Monte Carlo for Continuous-Time Switching Dynamical Systems βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Maslow’s Hammer in Catastrophic Forgetting: Node Re-Use vs. Node Activation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Massively Parallel $k$-Means Clustering for Perturbation Resilient Instances βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Matching Learned Causal Effects of Neural Networks with Domain Priors βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Matching Normalizing Flows and Probability Paths on Manifolds ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Matching Structure for Dual Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Maximum Likelihood Training for Score-based Diffusion ODEs by High Order Denoising Score Matching βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Meaningfully debugging model mistakes using conceptual counterfactual explanations βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Measure Estimation in the Barycentric Coding Model βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Measuring Representational Robustness of Neural Networks Through Shared Invariances ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Measuring dissimilarity with diffeomorphism invariance ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
Measuring the Effect of Training Data on Deep Learning Predictions via Randomized Experiments βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
MemSR: Training Memory-efficient Lightweight Model for Image Super-Resolution βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Memory-Based Model Editing at Scale ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
MetAug: Contrastive Learning via Meta Feature Augmentation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Meta-Learning Hypothesis Spaces for Sequential Decision-making ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Metric-Fair Active Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Metric-Fair Classifier Derandomization ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Minimax Classification under Concept Drift with Multidimensional Adaptation and Performance Guarantees βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Minimax M-estimation under Adversarial Contamination βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Minimizing Control for Credit Assignment with Strong Feedback βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Minimum Cost Intervention Design for Causal Effect Identification βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Mirror Learning: A Unifying Framework of Policy Optimisation ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
Mitigating Gender Bias in Face Recognition using the von Mises-Fisher Mixture Model ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Mitigating Modality Collapse in Multimodal VAEs via Impartial Optimization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Mitigating Neural Network Overconfidence with Logit Normalization ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
ModLaNets: Learning Generalisable Dynamics via Modularity and Physical Inductive Bias βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably) ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Model Agnostic Sample Reweighting for Out-of-Distribution Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Model Selection in Batch Policy Optimization βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Model-Free Opponent Shaping βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Model-Value Inconsistency as a Signal for Epistemic Uncertainty ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Modeling Adversarial Noise for Adversarial Training βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Modeling Irregular Time Series with Continuous Recurrent Units βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Modeling Strong and Human-Like Gameplay with KL-Regularized Search βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Modeling Structure with Undirected Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Modular Conformal Calibration βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Molecular Representation Learning via Heterogeneous Motif Graph Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Monarch: Expressive Structured Matrices for Efficient and Accurate Training βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
More Efficient Sampling for Tensor Decomposition With Worst-Case Guarantees βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Multi Resolution Analysis (MRA) for Approximate Self-Attention βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Multi-Level Branched Regularization for Federated Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Multi-Task Learning as a Bargaining Game βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Multi-scale Feature Learning Dynamics: Insights for Double Descent ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Multi-slots Online Matching with High Entropy βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Multiclass learning with margin: exponential rates with no bias-variance trade-off ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Multicoated Supermasks Enhance Hidden Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Multirate Training of Neural Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
N-Penetrate: Active Learning of Neural Collision Handler for Complex 3D Mesh Deformations βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
NAFS: A Simple yet Tough-to-beat Baseline for Graph Representation Learning βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
NISPA: Neuro-Inspired Stability-Plasticity Adaptation for Continual Learning in Sparse Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
NOMU: Neural Optimization-based Model Uncertainty βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
NP-Match: When Neural Processes meet Semi-Supervised Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Near-Exact Recovery for Tomographic Inverse Problems via Deep Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Near-Optimal Learning of Extensive-Form Games with Imperfect Information βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Near-optimal rate of consistency for linear models with missing values ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Nearly Optimal Catoni’s M-estimator for Infinite Variance βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Nested Bandits βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Nesterov Accelerated Shuffling Gradient Method for Convex Optimization βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Neural Fisher Discriminant Analysis: Optimal Neural Network Embeddings in Polynomial Time βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Neural Implicit Dictionary Learning via Mixture-of-Expert Training ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Neural Inverse Kinematic ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Neural Inverse Transform Sampler βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Neural Language Models are not Born Equal to Fit Brain Data, but Training Helps ❌ ❌ βœ… βœ… ❌ βœ… βœ… 4
Neural Laplace: Learning diverse classes of differential equations in the Laplace domain βœ… βœ… ❌ βœ… βœ… ❌ βœ… 5
Neural Network Poisson Models for Behavioural and Neural Spike Train Data ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Neural Network Pruning Denoises the Features and Makes Local Connectivity Emerge in Visual Tasks ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
Neural Network Weights Do Not Converge to Stationary Points: An Invariant Measure Perspective ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Neural Tangent Kernel Analysis of Deep Narrow Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Neural Tangent Kernel Beyond the Infinite-Width Limit: Effects of Depth and Initialization ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Neural Tangent Kernel Empowered Federated Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Neural-Symbolic Models for Logical Queries on Knowledge Graphs βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
NeuralEF: Deconstructing Kernels by Deep Neural Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Neuro-Symbolic Hierarchical Rule Induction ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Neurocoder: General-Purpose Computation Using Stored Neural Programs ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Neuron Dependency Graphs: A Causal Abstraction of Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Neurotoxin: Durable Backdoors in Federated Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
No-Regret Learning in Partially-Informed Auctions βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
No-Regret Learning in Time-Varying Zero-Sum Games βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Non-Vacuous Generalisation Bounds for Shallow Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Nonlinear Feature Diffusion on Hypergraphs βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Nonparametric Embeddings of Sparse High-Order Interaction Events ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Nonparametric Factor Trajectory Learning for Dynamic Tensor Decomposition ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Nonparametric Involutive Markov Chain Monte Carlo βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Nonparametric Sparse Tensor Factorization with Hierarchical Gamma Processes ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Not All Poisons are Created Equal: Robust Training against Data Poisoning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
NysADMM: faster composite convex optimization via low-rank approximation βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
NystrΓΆm Kernel Mean Embeddings ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Object Permanence Emerges in a Random Walk along Memory βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Off-Policy Evaluation for Large Action Spaces via Embeddings βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Off-Policy Reinforcement Learning with Delayed Rewards βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Offline Meta-Reinforcement Learning with Online Self-Supervision βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Offline RL Policies Should Be Trained to be Adaptive βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Omni-Granular Ego-Semantic Propagation for Self-Supervised Graph Representation Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
On Collective Robustness of Bagging Against Data Poisoning βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
On Convergence of Gradient Descent Ascent: A Tight Local Analysis ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Distribution Shift in Learning-based Bug Detectors ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On Finite-Sample Identifiability of Contrastive Learning-Based Nonlinear Independent Component Analysis ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Implicit Bias in Overparameterized Bilevel Optimization βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Last-Iterate Convergence Beyond Zero-Sum Games ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Learning Mixture of Linear Regressions in the Non-Realizable Setting βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
On Measuring Causal Contributions via do-interventions βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
On Non-local Convergence Analysis of Deep Linear Networks ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
On Numerical Integration in Neural Ordinary Differential Equations ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
On Transportation of Mini-batches: A Hierarchical Approach βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
On Well-posedness and Minimax Optimal Rates of Nonparametric Q-function Estimation in Off-policy Evaluation ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Adversarial Robustness of Causal Algorithmic Recourse βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
On the Convergence of Inexact Predictor-Corrector Methods for Linear Programming βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
On the Convergence of Local Stochastic Compositional Gradient Descent with Momentum βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
On the Convergence of the Shapley Value in Parametric Bayesian Learning Games ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
On the Difficulty of Defending Self-Supervised Learning against Model Extraction βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
On the Effects of Artificial Data Modification ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
On the Equivalence Between Temporal and Static Equivariant Graph Representations ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On the Finite-Time Complexity and Practical Computation of Approximate Stationarity Concepts of Lipschitz Functions βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
On the Finite-Time Performance of the Knowledge Gradient Algorithm βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
On the Generalization Analysis of Adversarial Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Learning of Non-Autoregressive Transformers ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
On the Optimization Landscape of Neural Collapse under MSE Loss: Global Optimality with Unconstrained Features ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
On the Practicality of Deterministic Epistemic Uncertainty ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
On the Robustness of CountSketch to Adaptive Inputs βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
On the Role of Discount Factor in Offline Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
On the Statistical Benefits of Curriculum Learning βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
On the Surrogate Gap between Contrastive and Supervised Losses ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
One-Pass Algorithms for MAP Inference of Nonsymmetric Determinantal Point Processes βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
One-Pass Diversified Sampling with Application to Terabyte-Scale Genomic Sequence Streams βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Online Active Regression βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Online Algorithms with Multiple Predictions βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Online Balanced Experimental Design βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Online Continual Learning through Mutual Information Maximization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Online Decision Transformer βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Online Learning and Pricing with Reusable Resources: Linear Bandits with Sub-Exponential Rewards βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Online Learning for Min Sum Set Cover and Pandora’s Box βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Online Learning with Knapsacks: the Best of Both Worlds βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Online and Consistent Correlation Clustering βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Only tails matter: Average-Case Universality and Robustness in the Convex Regime βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Optimal Algorithms for Mean Estimation under Local Differential Privacy βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Optimal Algorithms for Stochastic Multi-Level Compositional Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Optimal Clustering with Noisy Queries via Multi-Armed Bandit βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Optimal Estimation of Policy Gradient via Double Fitted Iteration βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Optimally Controllable Perceptual Lossy Compression ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Optimization-Derived Learning with Essential Convergence Analysis of Training and Hyper-training βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Optimization-Induced Graph Implicit Nonlinear Diffusion ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Optimizing Sequential Experimental Design with Deep Reinforcement Learning ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Optimizing Tensor Network Contraction Using Reinforcement Learning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Orchestra: Unsupervised Federated Learning via Globally Consistent Clustering βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Order Constraints in Optimal Transport βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Out-of-Distribution Detection with Deep Nearest Neighbors βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Overcoming Oscillations in Quantization-Aware Training βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
PAC-Bayesian Bounds on Rate-Efficient Classifiers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
PAC-Net: A Model Pruning Approach to Inductive Transfer Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
PDE-Based Optimal Strategy for Unconstrained Online Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
PDO-s3DCNNs: Partial Differential Operator Based Steerable 3D CNNs ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
PINs: Progressive Implicit Networks for Multi-Scale Neural Representations ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
PLATINUM: Semi-Supervised Model Agnostic Meta-Learning using Submodular Mutual Information βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
POEM: Out-of-Distribution Detection with Posterior Sampling βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Pairwise Conditional Gradients without Swap Steps and Sparser Kernel Herding βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Parametric Visual Program Induction with Function Modularization βœ… ❌ βœ… ❌ βœ… ❌ ❌ 3
Parsimonious Learning-Augmented Caching βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Partial Counterfactual Identification from Observational and Experimental Data βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Partial Label Learning via Label Influence Function βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Partial disentanglement for domain adaptation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Particle Transformer for Jet Tagging ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Path-Aware and Structure-Preserving Generation of Synthetically Accessible Molecules ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Path-Gradient Estimators for Continuous Normalizing Flows βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Permutation Search of Tensor Network Structures via Local Sampling βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Personalization Improves Privacy-Accuracy Tradeoffs in Federated Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Personalized Federated Learning through Local Memorization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Personalized Federated Learning via Variational Bayesian Inference βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Planning with Diffusion for Flexible Behavior Synthesis βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Plug & Play Attacks: Towards Robust and Flexible Model Inversion Attacks ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
PoF: Post-Training of Feature Extractor for Improving Generalization βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Policy Gradient Method For Robust Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Popular decision tree algorithms are provably noise tolerant βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Position Prediction as an Effective Pretraining Strategy βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Power-Law Escape Rate of SGD ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Practical Almost-Linear-Time Approximation Algorithms for Hybrid and Overlapping Graph Clustering βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Preconditioning for Scalable Gaussian Process Hyperparameter Optimization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Predicting Out-of-Distribution Error with the Projection Norm βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Principal Component Flows βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Principled Knowledge Extrapolation with GANs βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Prioritized Training on Points that are Learnable, Worth Learning, and not yet Learnt βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Privacy for Free: How does Dataset Condensation Help Privacy? ❌ ❌ βœ… ❌ ❌ βœ… βœ… 3
Private Adaptive Optimization with Side information βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Private Streaming SCO in $\ell_p$ geometry with Applications in High Dimensional Online Decision Making βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Private frequency estimation via projective geometry ❌ βœ… ❌ ❌ βœ… βœ… βœ… 4
Private optimization in the interpolation regime: faster rates and hardness results βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
ProGCL: Rethinking Hard Negative Mining in Graph Contrastive Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Probabilistic Bilevel Coreset Selection βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Probabilistic ODE Solutions in Millions of Dimensions ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Probabilistically Robust Learning: Balancing Average and Worst-case Performance βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
ProgFed: Effective, Communication, and Computation Efficient Federated Learning by Progressive Training βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Prompting Decision Transformer for Few-Shot Policy Generalization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Prototype Based Classification from Hierarchy to Fairness ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Prototype-Anchored Learning for Learning with Imperfect Annotations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Lojasiewicz Functions when the Non-Convexity is Averaged-Out βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Provable Domain Generalization via Invariant-Feature Subspace Recovery βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Provable Reinforcement Learning with a Short-Term Memory βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Provably Adversarially Robust Nearest Prototype Classifiers βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Proving Theorems using Incremental Learning and Hindsight Experience Replay βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
ProxSkip: Yes! Local Gradient Steps Provably Lead to Communication Acceleration! Finally! βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Proximal Denoiser for Convergent Plug-and-Play Optimization with Nonconvex Regularization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Proximal Exploration for Model-guided Protein Sequence Design βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Proximal and Federated Random Reshuffling βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Public Data-Assisted Mirror Descent for Private Model Training βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Pure Noise to the Rescue of Insufficient Data: Improving Imbalanced Classification by Training on Random Noise Images βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
QSFL: A Two-Level Uplink Communication Optimization Framework for Federated Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Quant-BnB: A Scalable Branch-and-Bound Method for Optimal Decision Trees with Continuous Features βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Quantifying and Learning Linear Symmetry-Based Disentanglement ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Quantum-Inspired Algorithms from Randomized Numerical Linear Algebra βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
RECAPP: Crafting a More Efficient Catalyst for Convex Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
ROCK: Causal Inference Principles for Reasoning about Commonsense Causality ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
RUMs from Head-to-Head Contests βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Random Forest Density Estimation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Random Gegenbauer Features for Scalable Kernel Methods ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Re-evaluating Word Mover’s Distance ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Reachability Constrained Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Reconstructing Nonlinear Dynamical Systems from Multi-Modal Time Series ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Refined Convergence Rates for Maximum Likelihood Estimation under Finite Mixture Models βœ… βœ… ❌ ❌ ❌ βœ… βœ… 4
Region-Based Semantic Factorization in GANs ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Regret Minimization with Performative Feedback βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Reinforcement Learning with Action-Free Pre-Training from Videos ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Removing Batch Normalization Boosts Adversarial Training ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Representation Topology Divergence: A Method for Comparing Neural Network Representations. βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Residual-Based Sampling for Online Outlier-Robust PCA βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Resilient and Communication Efficient Learning for Heterogeneous Federated Systems βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Restarted Nonconvex Accelerated Gradient Descent: No More Polylogarithmic Factor in the $O(Ξ΅^-7/4)$ Complexity βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Rethinking Attention-Model Explainability through Faithfulness Violation Test ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Rethinking Fano’s Inequality in Ensemble Learning ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Rethinking Graph Neural Networks for Anomaly Detection ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Retrieval-Augmented Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Retroformer: Pushing the Limits of End-to-end Retrosynthesis Transformer βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Reverse Engineering $\ell_p$ attacks: A block-sparse optimization approach with recovery guarantees βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Reverse Engineering the Neural Tangent Kernel ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Revisiting Consistency Regularization for Deep Partial Label Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Revisiting End-to-End Speech-to-Text Translation From Scratch ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing? βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Revisiting Online Submodular Minimization: Gap-Dependent Regret Bounds, Best of Both Worlds and Adversarial Robustness βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Revisiting and Advancing Fast Adversarial Training Through The Lens of Bi-Level Optimization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Revisiting the Effects of Stochasticity for Hamiltonian Samplers ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Rich Feature Construction for the Optimization-Generalization Dilemma βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
RieszNet and ForestRiesz: Automatic Debiased Machine Learning with Neural Nets and Random Forests ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Ripple Attention for Visual Perception with Sub-quadratic Complexity βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Risk-Averse No-Regret Learning in Online Convex Games βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Robin Hood and Matthew Effects: Differential Privacy Has Disparate Impact on Synthetic Data ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Robust Counterfactual Explanations for Tree-Based Ensembles βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Robust Group Synchronization via Quadratic Programming βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Robust Imitation Learning against Variations in Environment Dynamics βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Robust Kernel Density Estimation with Median-of-Means principle ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Robust Meta-learning with Sampling Noise and Label Noise via Eigen-Reptile βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Robust Models Are More Interpretable Because Attributions Look Normal ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Robust Multi-Objective Bayesian Optimization Under Input Noise ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Robust Policy Learning over Multiple Uncertainty Sets βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Robust SDE-Based Variational Formulations for Solving Linear PDEs via Deep Learning βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Robust Training of Neural Networks Using Scale Invariant Architectures βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robust Training under Label Noise by Over-parameterization βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Robust alignment of cross-session recordings of neural population activity by behaviour via unsupervised domain adaptation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Robustness Implies Generalization via Data-Dependent Generalization Bounds ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Robustness Verification for Contrastive Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Robustness and Accuracy Could Be Reconcilable by (Proper) Definition ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Robustness in Multi-Objective Submodular Optimization: a Quantile Approach βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Role-based Multiplex Network Embedding βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Rotting Infinitely Many-Armed Bandits βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
SCHA-VAE: Hierarchical Context Aggregation for Few-Shot Generation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
SDQ: Stochastic Differentiable Quantization with Mixed Precision βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
SE(3) Equivariant Graph Neural Networks with Complete Local Frames βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
SPDY: Accurate Pruning with Speedup Guarantees βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
SPECTRE: Spectral Conditioning Helps to Overcome the Expressivity Limits of One-shot Graph Generators ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Safe Exploration for Efficient Policy Evaluation and Comparison βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Safe Learning in Tree-Form Sequential Decision Making: Handling Hard and Soft Constraints βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Sample Efficient Learning of Predictors that Complement Humans βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Sanity Simulations for Saliency Methods ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Scalable Computation of Causal Bounds βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Scalable Deep Gaussian Markov Random Fields for General Graphs ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Scalable First-Order Bayesian Optimization via Structured Automatic Differentiation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Scalable MCMC Sampling for Nonsymmetric Determinantal Point Processes βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Scalable Spike-and-Slab βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Scaling Out-of-Distribution Detection for Real-World Settings ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Scaling Structured Inference with Randomization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Scaling-up Diverse Orthogonal Convolutional Networks by a Paraunitary Framework βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Score Matching Enables Causal Discovery of Nonlinear Additive Noise Models βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Score-Guided Intermediate Level Optimization: Fast Langevin Mixing for Inverse Problems βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Searching for BurgerFormer with Micro-Meso-Macro Space Design ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Secure Distributed Training at Scale βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Secure Quantized Training for Deep Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Selective Network Linearization for Efficient Private Inference βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Selective Regression under Fairness Criteria βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Self-Organized Polynomial-Time Coordination Graphs βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Self-Supervised Models of Audio Effectively Explain Human Cortical Responses to Speech ❌ ❌ βœ… βœ… ❌ βœ… βœ… 4
Self-Supervised Representation Learning via Latent Graph Prediction βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Self-conditioning Pre-Trained Language Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Self-supervised Models are Good Teaching Assistants for Vision Transformers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Self-supervised learning with random-projection quantizer for speech recognition ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Selling Data To a Machine Learner: Pricing via Costly Signaling ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Sequential Covariate Shift Detection Using Classifier Two-Sample Tests βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Sequential and Parallel Constrained Max-value Entropy Search via Information Lower Bound βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Set Based Stochastic Subsampling βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Set Norm and Equivariant Skip Connections: Putting the Deep in Deep Sets ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Sharpened Quasi-Newton Methods: Faster Superlinear Rate and Larger Local Convergence Neighborhood βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Short-Term Plasticity Neurons Learning to Learn and Forget βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Shuffle Private Linear Contextual Bandits βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Simple and near-optimal algorithms for hidden stratification and multi-group learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Simultaneous Graph Signal Clustering and Graph Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Sketching Algorithms and Lower Bounds for Ridge Regression βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Skin Deep Unlearning: Artefact and Instrument Debiasing in the Context of Melanoma Classification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Smoothed Adaptive Weighting for Imbalanced Semi-Supervised Learning: Improve Reliability Against Unknown Distribution Data βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Smoothed Adversarial Linear Contextual Bandits with Knapsacks βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
SoQal: Selective Oracle Questioning for Consistency Based Active Learning of Cardiac Signals βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Solving Stackelberg Prediction Game with Least Squares Loss via Spherically Constrained Least Squares Reformulation βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
SpaceMAP: Visualizing High-Dimensional Data by Space Expansion βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Sparse Double Descent: Where Network Pruning Aggravates Overfitting ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Sparse Invariant Risk Minimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Sparse Mixed Linear Regression with Guarantees: Taming an Intractable Problem with Invex Relaxation ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Sparsity in Partially Controllable Linear Systems βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Spatial-Channel Token Distillation for Vision MLPs βœ… ❌ βœ… ❌ βœ… ❌ ❌ 3
Spectral Representation of Robustness Measures for Optimization Under Input Uncertainty βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
SpeqNets: Sparsity-aware permutation-equivariant graph networks βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Stability Based Generalization Bounds for Exponential Family Langevin Dynamics ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Stabilizing Q-learning with Linear Architectures for Provable Efficient Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Stable Conformal Prediction Sets βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Staged Training for Transformer Language Models βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
State Transition of Dendritic Spines Improves Learning of Sparse Spiking Neural Networks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Statistical inference with implicit SGD: proximal Robbins-Monro vs. Polyak-Ruppert ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Steerable 3D Spherical Neurons ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Stochastic Contextual Dueling Bandits under Linear Stochastic Transitivity Models βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Stochastic Continuous Submodular Maximization: Boosting via Non-oblivious Function βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Deep Networks with Linear Competing Units for Model-Agnostic Meta-Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Stochastic Reweighted Gradient Descent βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Stochastic Rising Bandits βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Stochastic smoothing of the top-K calibrated hinge loss for deep imbalanced classification ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Strategic Instrumental Variable Regression: Recovering Causal Relationships From Strategic Responses ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Strategic Representation βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Streaming Algorithm for Monotone k-Submodular Maximization with Cardinality Constraints βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Streaming Algorithms for High-Dimensional Robust Statistics βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Streaming Algorithms for Support-Aware Histograms βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Streaming Inference for Infinite Feature Models ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in Question Answering Models ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Structural Entropy Guided Graph Hierarchical Pooling βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Structure Preserving Neural Networks: A Case Study in the Entropy Closure of the Boltzmann Equation βœ… βœ… ❌ βœ… βœ… βœ… βœ… 6
Structure-Aware Transformer for Graph Representation Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Structure-preserving GANs ❌ ❌ βœ… βœ… ❌ βœ… βœ… 4
Structured Stochastic Gradient MCMC βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Sublinear-Time Clustering Oracle for Signed Graphs βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Subspace Learning for Effective Meta-Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Supervised Learning with General Risk Functionals ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Supervised Off-Policy Ranking βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Surrogate Likelihoods for Variational Annealed Importance Sampling βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Symmetric Machine Theory of Mind βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
TACTiS: Transformer-Attentional Copulas for Time Series ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
TAM: Topology-Aware Margin Loss for Class-Imbalanced Node Classification βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
TPC: Transformation-Specific Smoothing for Point Cloud Models ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
TSPipe: Learn from Teacher Faster with Pipelines ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
TURF: Two-Factor, Universal, Robust, Fast Distribution Learning Algorithm βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Tackling Data Heterogeneity: A New Unified Framework for Decentralized SGD with Sample-induced Topology ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Tackling covariate shift with node-based Bayesian neural networks ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Task-aware Privacy Preservation for Multi-dimensional Data βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Tell me why! Explanations support learning relational and causal structure ❌ ❌ ❌ ❌ βœ… βœ… βœ… 3
Temporal Difference Learning for Model Predictive Control βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Test-Time Training Can Close the Natural Distribution Shift Performance Gap in Deep Learning Based Compressed Sensing ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
The Algebraic Path Problem for Graph Metrics ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The CLRS Algorithmic Reasoning Benchmark βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
The Complexity of k-Means Clustering when Little is Known ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns via Spotlights of Attention ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
The Fundamental Price of Secure Aggregation in Differentially Private Federated Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
The Geometry of Robust Value Functions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Importance of Non-Markovianity in Maximum State Entropy Exploration ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The Infinite Contextual Graph Markov Model βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
The Multivariate Community Hawkes Model for Dependent Relational Events in Continuous-time Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
The Neural Race Reduction: Dynamics of Abstraction in Gated Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
The Poisson Binomial Mechanism for Unbiased Federated Learning with Secure Aggregation βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
The Primacy Bias in Deep Reinforcement Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
The Role of Deconfounding in Meta-learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
The State of Sparse Training in Deep Reinforcement Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
The Teaching Dimension of Regularized Kernel Learners ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
The Unsurprising Effectiveness of Pre-Trained Vision Models for Control ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
The dynamics of representation learning in shallow, non-linear autoencoders ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
The power of first-order smooth optimization for black-box non-smooth problems ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Thompson Sampling for (Combinatorial) Pure Exploration βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Thompson Sampling for Robust Transfer in Multi-Task Bandits βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Three-stage Evolution and Fast Equilibrium for SGD with Non-degerate Critical Points ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Thresholded Lasso Bandit βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Tight and Robust Private Mean Estimation with Few Users βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Time Is MattEr: Temporal Self-supervision for Video Transformers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
To Smooth or Not? When Label Smoothing Meets Noisy Labels ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Topology-aware Generalization of Decentralized SGD ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Toward Compositional Generalization in Object-Oriented World Modeling ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Towards Coherent and Consistent Use of Entities in Narrative Generation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Towards Noise-adaptive, Problem-adaptive (Accelerated) Stochastic Gradient Descent ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Towards Scaling Difference Target Propagation by Learning Backprop Targets βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Towards Theoretical Analysis of Transformation Complexity of ReLU DNNs ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Towards Understanding Sharpness-Aware Minimization ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Towards Uniformly Superhuman Autonomy via Subdominance Minimization βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Towards understanding how momentum improves generalization in deep learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Tractable Dendritic RNNs for Reconstructing Nonlinear Dynamical Systems ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Tractable Uncertainty for Structure Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Training Characteristic Functions with Reinforcement Learning: XAI-methods play Connect Four ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Training Discrete Deep Generative Models via Gapped Straight-Through Estimator βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Training OOD Detectors in their Natural Habitats βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Training Your Sparse Neural Network Better with Any Mask ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Transfer Learning In Differential Privacy’s Hybrid-Model βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Transfer and Marginalize: Explaining Away Label Noise with Privileged Information ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Transformer Quality in Linear Time βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Transformers are Meta-Reinforcement Learners βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Translating Robot Skills: Learning Unsupervised Skill Correspondences Across Robots βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
UAST: Uncertainty-Aware Siamese Tracking βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
UNIREX: A Unified Learning Framework for Language Model Rationale Extraction ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Unaligned Supervision for Automatic Music Transcription in The Wild βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Uncertainty Modeling in Generative Compressed Sensing βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
UnderGrad: A Universal Black-Box Optimization Method with Almost Dimension-Free Convergence Rate Guarantees βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Understanding Clipping for Federated Learning: Convergence and Client-Level Differential Privacy βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Understanding Contrastive Learning Requires Incorporating Inductive Biases ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Understanding Doubly Stochastic Clustering ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Understanding Gradient Descent on the Edge of Stability in Deep Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Understanding Instance-Level Impact of Fairness Constraints ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Understanding Robust Generalization in Learning Regular Languages ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Understanding Robust Overfitting of Adversarial Training and Beyond βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Understanding The Robustness in Vision Transformers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Understanding and Improving Knowledge Graph Embedding for Entity Alignment βœ… βœ… βœ… βœ… βœ… ❌ ❌ 5
Understanding the unstable convergence of gradient descent ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
UniRank: Unimodal Bandit Algorithms for Online Ranking βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Unified Fourier-based Kernel and Nonlinearity Design for Equivariant Networks on Homogeneous Spaces ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Unified Scaling Laws for Routed Language Models ❌ βœ… ❌ ❌ βœ… βœ… βœ… 4
Universal Hopfield Networks: A General Framework for Single-Shot Associative Memory Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Universal Joint Approximation of Manifolds and Densities by Simple Injective Flows ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Universal and data-adaptive algorithms for model selection in linear contextual bandits βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Universality of Winning Tickets: A Renormalization Group Perspective ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Unsupervised Flow-Aligned Sequence-to-Sequence Learning for Video Restoration βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Unsupervised Ground Metric Learning Using Wasserstein Singular Vectors ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Unsupervised Image Representation Learning with Deep Latent Particles βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Unsupervised Time-Series Representation Learning with Iterative Bilinear Temporal-Spectral Fusion ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
Utility Theory for Sequential Decision Making ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Utilizing Expert Features for Contrastive Learning of Time-Series Representations ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Validating Causal Inference Methods ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Value Function based Difference-of-Convex Algorithm for Bilevel Hyperparameter Selection Problems βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
VarScene: A Deep Generative Model for Realistic Scene Graph Synthesis ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
VariGrow: Variational Architecture Growing for Task-Agnostic Continual Learning based on Bayesian Novelty ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Variational Feature Pyramid Networks ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Variational Inference for Infinitely Deep Neural Networks βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Variational Inference with Locally Enhanced Bounds for Hierarchical Models ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Variational Mixtures of ODEs for Inferring Cellular Gene Expression Dynamics ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Variational On-the-Fly Personalization βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Variational Sparse Coding with Learned Thresholding βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Variational Wasserstein gradient flow βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Variational nearest neighbor Gaussian process ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
ViT-NeT: Interpretable Vision Transformers with Neural Tree Decoder βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Virtual Homogeneity Learning: Defending against Data Heterogeneity in Federated Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Visual Attention Emerges from Recurrent Sparse Reconstruction ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Volatility Based Kernels and Moving Average Means for Accurate Forecasting with Gaussian Processes ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Weisfeiler-Lehman Meets Gromov-Wasserstein βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
What Can Linear Interpolation of Neural Network Loss Landscapes Tell Us? ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
What Dense Graph Do You Need for Self-Attention? βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
What Language Model Architecture and Pretraining Objective Works Best for Zero-Shot Generalization? ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
When AUC meets DRO: Optimizing Partial AUC for Deep Learning with Non-Convex Convergence Guarantee βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
When Are Linear Stochastic Bandits Attackable? βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
When and How Mixup Improves Calibration βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error ❌ ❌ ❌ ❌ ❌ βœ… βœ… 2
Why the Rich Get Richer? On the Balancedness of Random Partition Models ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Wide Bayesian neural networks have a simple weight posterior: theory and accelerated sampling ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Wide Neural Networks Forget Less Catastrophically ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Winning the Lottery Ahead of Time: Efficient Early Network Pruning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
XAI for Transformers: Better Explanations through Conservative Propagation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
You Only Cut Once: Boosting Data Augmentation with a Single Cut ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Zero-Shot Reward Specification via Grounded Natural Language ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Zero-shot AutoML with Pretrained Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
pathGCN: Learning General Graph Spatial Operators from Paths ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4