Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

International Conference on Learning Representations (ICLR) - 2022

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
ICLR 2022 1094 0.66 4.3 97.62% 50.94%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
$\beta$-Intact-VAE: Identifying and Estimating Causal Effects under Limited Overlap ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
$\mathrm{SO}(2)$-Equivariant Reinforcement Learning ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
$\pi$BO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
8-bit Optimizers via Block-wise Quantization ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
A Biologically Interpretable Graph Convolutional Network to Link Genetic Risk Pathways and Imaging Phenotypes of Disease ❌ ❌ ❌ βœ… βœ… βœ… βœ… 4
A Class of Short-term Recurrence Anderson Mixing Methods and Their Applications βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
A Comparison of Hamming Errors of Representative Variable Selection Methods ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
A Deep Variational Approach to Clustering Survival Data ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
A Fine-Grained Analysis on Distribution Shift ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
A Fine-Tuning Approach to Belief State Modeling βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A First-Occupancy Representation for Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
A General Analysis of Example-Selection for Stochastic Gradient Descent βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
A Generalized Weighted Optimization Method for Computational Learning and Inversion ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
A Johnson-Lindenstrauss Framework for Randomly Initialized CNNs ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A Loss Curvature Perspective on Training Instabilities of Deep Learning Models ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
A NON-PARAMETRIC REGRESSION VIEWPOINT : GENERALIZATION OF OVERPARAMETRIZED DEEP RELU NETWORK UNDER NOISY OBSERVATIONS ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
A Neural Tangent Kernel Perspective of Infinite Tree Ensembles ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
A New Perspective on "How Graph Neural Networks Go Beyond Weisfeiler-Lehman?" ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
A Program to Build E(N)-Equivariant Steerable CNNs βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Statistical Framework for Efficient Out of Distribution Detection in Deep Neural Networks βœ… ❌ βœ… βœ… βœ… βœ… ❌ 5
A Tale of Two Flows: Cooperative Learning of Langevin Flow and Normalizing Flow Toward Energy-Based Model βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
A Theoretical Analysis on Feature Learning in Neural Networks: Emergence from Inputs and Advantage over Fixed Features βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
A Theory of Tournament Representations ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial Training ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
A Unified Wasserstein Distributional Robustness Framework for Adversarial Training βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A Zest of LIME: Towards Architecture-Independent Model Distances βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
A fast and accurate splitting method for optimal transport: analysis and implementation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
A generalization of the randomized singular value decomposition βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
A global convergence theory for deep ReLU implicit networks via over-parameterization ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
ADAVI: Automatic Dual Amortized Variational Inference Applied To Pyramidal Bayesian Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
AEVA: Black-box Backdoor Detection Using Adversarial Extreme Value Analysis βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
AS-MLP: An Axial Shifted MLP Architecture for Vision βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Ab-Initio Potential Energy Surfaces by Pairing GNNs with Neural Wave Functions ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Accelerated Policy Learning with Parallel Differentiable Simulation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Acceleration of Federated Learning with Alleviated Forgetting in Local Training βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Active Hierarchical Exploration with Stable Subgoal Representation Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Actor-critic is implicitly biased towards high entropy optimal policies βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure Space ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
AdaAug: Learning Class- and Instance-adaptive Data Augmentation Policies βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adaptive Wavelet Transformer Network for 3D Shape Representation Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Adversarial Retriever-Ranker for Dense Text Retrieval βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Adversarial Robustness Through the Lens of Causality ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Adversarial Support Alignment βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Adversarial Unlearning of Backdoors via Implicit Hypergradient βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Adversarially Robust Conformal Prediction βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Almost Tight L0-norm Certified Robustness of Top-k Predictions against Adversarial Perturbations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
AlphaZero-based Proof Cost Network to Aid Game Solving ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Amortized Implicit Differentiation for Stochastic Bilevel Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Amortized Tree Generation for Bottom-up Synthesis Planning and Synthesizable Molecular Design βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
An Agnostic Approach to Federated Learning with Class Imbalance βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
An Autoregressive Flow Model for 3D Molecular Geometry Generation from Scratch βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
An Experimental Design Perspective on Model-Based Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
An Explanation of In-context Learning as Implicit Bayesian Inference ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
An Information Fusion Approach to Learning with Instance-Dependent Label Noise βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
An Operator Theoretic View On Pruning Deep Neural Networks βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
An Unconstrained Layer-Peeled Perspective on Neural Collapse ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Ancestral protein sequence reconstruction using a tree-structured Ornstein-Uhlenbeck variational autoencoder βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Anisotropic Random Feature Regression in High Dimensions ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Anomaly Detection for Tabular Data with Internal Contrastive Learning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Anti-Concentrated Confidence Bonuses For Scalable Exploration βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Anytime Dense Prediction with Confidence Adaptivity ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Approximation and Learning with Deep Convolutional Models: a Kernel Perspective ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Assessing Generalization of SGD via Disagreement ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Associated Learning: an Alternative to End-to-End Backpropagation that Works on CNN, RNN, and Transformer βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Asymmetry Learning for Counterfactually-invariant Classification in OOD Tasks ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Attacking deep networks with surrogate-based adversarial black-box methods is easy βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Attention-based Interpretability with Concept Transformers βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Augmented Sliced Wasserstein Distances βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Auto-Transfer: Learning to Route Transferable Representations βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Auto-scaling Vision Transformers without Training βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Automated Self-Supervised Learning for Graphs βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Automatic Loss Function Search for Predict-Then-Optimize Problems with Strong Ranking Property βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Autonomous Learning of Object-Centric Abstractions for High-Level Planning βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Autonomous Reinforcement Learning: Formalism and Benchmarking ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Autoregressive Diffusion Models βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Autoregressive Quantile Flows for Predictive Uncertainty Estimation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Axiomatic Explanations for Visual Search, Retrieval, and Similarity Learning ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
BAM: Bayes with Adaptive Memory βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
BEiT: BERT Pre-Training of Image Transformers βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Back2Future: Leveraging Backfill Dynamics for Improving Real-time Predictions in Future ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Backdoor Defense via Decoupling the Training Process ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation Models βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Bag of Instances Aggregation Boosts Self-supervised Distillation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Bandit Learning with Joint Effect of Incentivized Sampling, Delayed Sampling Feedback, and Self-Reinforcing User Preferences βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Bayesian Framework for Gradient Leakage βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Bayesian Neural Network Priors Revisited ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Benchmarking the Spectrum of Agent Capabilities ❌ βœ… βœ… ❌ ❌ βœ… ❌ 3
Better Supervisory Signals by Observing Learning Paths βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Bi-linear Value Networks for Multi-goal Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
BiBERT: Accurate Fully Binarized BERT ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Blaschke Product Neural Networks (BPNN): A Physics-Infused Neural Network for Phase Retrieval of Meromorphic Functions ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Boosted Curriculum Reinforcement Learning βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Boosting Randomized Smoothing with Variance Reduced Classifiers βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Boosting the Certified Robustness of L-infinity Distance Nets ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Bootstrapped Meta-Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Bootstrapping Semantic Segmentation with Regional Contrast ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Bregman Gradient Policy Optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Bridging Recommendation and Marketing via Recurrent Intensity Modeling βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Bundle Networks: Fiber Bundles, Local Trivializations, and a Generative Approach to Exploring Many-to-one Maps ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Byzantine-Robust Learning on Heterogeneous Datasets via Bucketing βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
CADDA: Class-wise Automatic Differentiable Data Augmentation for EEG Signals βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
CKConv: Continuous Kernel Convolution For Sequential Data ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
CLEVA-Compass: A Continual Learning Evaluation Assessment Compass to Promote Research Transparency and Comparability ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
CROP: Certifying Robust Policies for Reinforcement Learning through Functional Smoothing βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
CURVATURE-GUIDED DYNAMIC SCALE NETWORKS FOR MULTI-VIEW STEREO ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Can an Image Classifier Suffice For Action Recognition? ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views? ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
Capturing Structural Locality in Non-parametric Language Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Case-based reasoning for better generalization in textual reinforcement learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Causal Contextual Bandits with Targeted Interventions βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Certified Robustness for Deep Equilibrium Models via Interval Bound Propagation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Chemical-Reaction-Aware Molecule Representation Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Chunked Autoregressive GAN for Conditional Waveform Synthesis ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Churn Reduction via Distillation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Clean Images are Hard to Reblur: Exploiting the Ill-Posed Inverse Task for Dynamic Scene Deblurring βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Closed-form Sample Probing for Learning Generative Models in Zero-shot Learning ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
CoBERL: Contrastive BERT for Reinforcement Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
CoMPS: Continual Meta Policy Search βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
CodeTrek: Flexible Modeling of Code using an Extensible Relational Representation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Coherence-based Label Propagation over Time Series for Accelerated Active Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Collapse by Conditioning: Training Class-conditional GANs with Limited Data ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos ❌ ❌ βœ… βœ… βœ… ❌ ❌ 3
Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Comparing Distributions by Measuring Differences that Affect Decision Making ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Complete Verification via Multi-Neuron Relaxation Guided Branch-and-Bound ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Compositional Attention: Disentangling Search and Retrieval ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Compositional Training for End-to-End Deep AUC Maximization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
ConFeSS: A Framework for Single Source Cross-Domain Few-Shot Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Concurrent Adversarial Learning for Large-Batch Training βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Conditional Contrastive Learning with Kernel ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Conditional Image Generation by Conditioning Variational Auto-Encoders ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Conditional Object-Centric Learning from Video ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Conditioning Sequence-to-sequence Networks with Learned Activations ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Connectome-constrained Latent Variable Model of Whole-Brain Neural Activity βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Consistent Counterfactuals for Deep Models ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Constrained Physical-Statistics Models for Dynamical System Identification and Prediction βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Constrained Policy Optimization via Bayesian World Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Constraining Linear-chain CRFs to Regular Languages βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Constructing Orthogonal Convolutions in an Explicit Manner βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Context-Aware Sparse Deep Coordination Graphs ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Contextualized Scene Imagination for Generative Commonsense Reasoning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Continual Learning with Filter Atom Swapping βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Continual Learning with Recursive Gradient Optimization βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Continual Normalization: Rethinking Batch Normalization for Online Continual Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Continuous-Time Meta-Learning with Forward Mode Differentiation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Contrastive Fine-grained Class Clustering via Generative Adversarial Networks ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Controlling Directions Orthogonal to a Classifier βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Controlling the Complexity and Lipschitz Constant improves Polynomial Nets βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Convergent Graph Solvers βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Convergent and Efficient Deep Q Learning Algorithm βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
CoordX: Accelerating Implicit Neural Representation with a Split MLP Architecture ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Coordination Among Neural Modules Through a Shared Global Workspace βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Counterfactual Plans under Distributional Ambiguity ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
Creating Training Sets via Weak Indirect Supervision βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Critical Points in Quantum Generative Models ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Cross-Domain Imitation Learning via Optimal Transport βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Cross-Lingual Transfer with Class-Weighted Language-Invariant Representations ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
CrossBeam: Learning to Search in Bottom-Up Program Synthesis βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
CrossMatch: Cross-Classifier Consistency Regularization for Open-Set Single Domain Generalization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
CrowdPlay: Crowdsourcing Human Demonstrations for Offline Learning ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
Crystal Diffusion Variational Autoencoder for Periodic Material Generation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Curriculum learning as a tool to uncover learning principles in the brain ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
CycleMLP: A MLP-like Architecture for Dense Prediction ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
D-CODE: Discovering Closed-form ODEs from Observed Trajectories βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
DEGREE: Decomposition Based Explanation for Graph Neural Networks βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
DEPTS: Deep Expansion Learning for Periodic Time Series Forecasting βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
DISCOVERING AND EXPLAINING THE REPRESENTATION BOTTLENECK OF DNNS ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
DIVA: Dataset Derivative of a Learning Task ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
DKM: Differentiable k-Means Clustering Layer for Neural Network Compression ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Data Poisoning Won’t Save You From Facial Recognition βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Data-Driven Offline Optimization for Architecting Hardware Accelerators βœ… ❌ ❌ βœ… ❌ ❌ βœ… 3
Data-Efficient Graph Grammar Learning for Molecular Generation ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
DeSKO: Stability-Assured Robust Control with a Deep Stochastic Koopman Operator βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Declarative nets that are equilibrium models ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Deconstructing the Inductive Biases of Hamiltonian Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Decoupled Adaptation for Cross-Domain Object Detection βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Deep Attentive Variational Inference ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Deep AutoAugment ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Deep Point Cloud Reconstruction ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Deep ReLU Networks Preserve Expected Length ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Defending Against Image Corruptions Through Adversarial Augmentations βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Delaunay Component Analysis for Evaluation of Data Representations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit Regularization ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Demystifying Limited Adversarial Transferability in Automatic Speech Recognition Systems ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Denoising Likelihood Score Matching for Conditional Score-based Data Generation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
DictFormer: Tiny Transformer with Shared Dictionary ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Differentiable DAG Sampling βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Differentiable Expectation-Maximization for Set Representation Learning ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Differentiable Gradient Sampling for Learning Implicit 3D Scene Reconstructions from a Single Image ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Differentiable Scaffolding Tree for Molecule Optimization βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Differentially Private Fine-tuning of Language Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Differentially Private Fractional Frequency Moments Estimation with Polylogarithmic Space βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Discovering Invariant Rationales for Graph Neural Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Discovering Latent Concepts Learned in BERT βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Discovering Nonlinear PDEs from Scarce Data with Physics-encoded Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Discrepancy-Based Active Learning for Domain Adaptation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Discrete Representations Strengthen Vision Transformer Robustness βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Discriminative Similarity for Data Clustering βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Disentanglement Analysis with Partial Information Decomposition ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Distilling GANs with Style-Mixed Triplets for X2I Translation with Limited Data ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Distribution Compression in Near-Linear Time βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Distributional Reinforcement Learning with Monotonic Splines βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Distributionally Robust Fair Principal Components via Geodesic Descents βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Distributionally Robust Models with Parametric Likelihood Ratios ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Diurnal or Nocturnal? Federated Learning of Multi-branch Networks from Periodically Shifting Distributions βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Dive Deeper Into Integral Pose Regression ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Divergence-aware Federated Self-Supervised Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Diverse Client Selection for Federated Learning via Submodular Maximization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Divisive Feature Normalization Improves Image Recognition Performance in AlexNet ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Do Not Escape From the Manifold: Discovering the Local Coordinates on the Latent Space of GANs βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Do Users Benefit From Interpretable Vision? A User Study, Baseline, And Dataset βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Do We Need Anisotropic Graph Neural Networks? βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Do deep networks transfer invariances across classes? βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node features βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Domain Adversarial Training: A Game Perspective βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Domino: Discovering Systematic Errors with Cross-Modal Embeddings βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Doubly Adaptive Scaled Algorithm for Machine Learning Using Second-Order Information βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG Signals βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Dropout Q-Functions for Doubly Efficient Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Dual Lottery Ticket Hypothesis ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Dynamic Token Normalization improves Vision Transformers βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Dynamics-Aware Comparison of Learned Reward Functions ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
EViT: Expediting Vision Transformers via Token Reorganizations βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
EXACT: Scalable Graph Neural Networks Training via Extreme Activation Compression βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Effect of scale on catastrophic forgetting in neural networks ❌ ❌ βœ… ❌ ❌ βœ… βœ… 3
Effective Model Sparsification by Scheduled Grow-and-Prune Methods βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Efficient Active Search for Combinatorial Optimization Problems ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Efficient Computation of Deep Nonlinear Infinite-Width Neural Networks that Learn Features ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Efficient Neural Causal Discovery without Acyclicity Constraints βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Efficient Self-supervised Vision Transformers for Representation Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Efficient Sharpness-aware Minimization for Improved Training of Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Efficient Split-Mix Federated Learning for On-Demand and In-Situ Customization βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Efficient and Differentiable Conformal Prediction with General Function Classes βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Efficiently Modeling Long Sequences with Structured State Spaces βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
EigenGame Unloaded: When playing games is better than optimizing βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Einops: Clear and Reliable Tensor Manipulations with Einstein-like Notation ❌ βœ… ❌ ❌ βœ… βœ… βœ… 4
Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Embedded-model flows: Combining the inductive biases of model-free deep learning and explicit probabilistic modeling βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Emergent Communication at Scale βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Enabling Arbitrary Translation Objectives with Adaptive Tree Search ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
End-to-End Learning of Probabilistic Hierarchies on Graphs ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Energy-Based Learning for Cooperative Games, with Applications to Valuation Problems in Machine Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Energy-Inspired Molecular Conformation Optimization ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Enhancing Cross-lingual Transfer by Manifold Mixup ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
EntQA: Entity Linking as Question Answering ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Entroformer: A Transformer-based Entropy Model for Learned Image Compression ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Environment Predictive Coding for Visual Navigation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Equivariant Graph Mechanics Networks with Constraints βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Equivariant Self-Supervised Learning: Encouraging Equivariance in Representations βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Equivariant Subgraph Aggregation Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Equivariant Transformers for Neural Network based Molecular Potentials ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Equivariant and Stable Positional Encoding for More Powerful Graph Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Evading Adversarial Example Detection Defenses with Orthogonal Projected Gradient Descent ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Evaluating Disentanglement of Structured Representations βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Evaluating Distributional Distortion in Neural Language Modeling ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Evaluating Model-Based Planning and Planner Amortization for Continuous Control βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Evaluation Metrics for Graph Generative Models: Problems, Pitfalls, and Practical Solutions ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Evidential Turing Processes βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Evolutionary Diversity Optimization with Clustering-based Selection for Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Explainable GNN-Based Models over Knowledge Graphs βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Explaining Point Processes by Learning Interpretable Temporal Logic Rules βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Explanations of Black-Box Models based on Directional Feature Interactions βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Exploiting Class Activation Value for Partial-Label Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Exploring Memorization in Adversarial Training ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Exploring extreme parameter compression for pre-trained language models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Exploring the Limits of Large Scale Pre-training ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--Hastings βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Expressiveness and Approximation Properties of Graph Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Expressivity of Emergent Languages is a Trade-off between Contextual Complexity and Unpredictability βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Extending the WILDS Benchmark for Unsupervised Adaptation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
FILIP: Fine-grained Interactive Language-Image Pre-Training ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
FILM: Following Instructions in Language with Modular Methods βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
FP-DETR: Detection Transformer Advanced by Fully Pre-training ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Fair Normalizing Flows βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
FairCal: Fairness Calibration for Face Verification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Fairness Guarantees under Demographic Shift βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Fairness in Representation for Multilingual NLP: Insights from Controlled Experiments on Conditional Language Modeling ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Fast AdvProp βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Fast Differentiable Matrix Square Root βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Fast Generic Interaction Detection for Model Interpretability and Compression βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Fast Model Editing at Scale βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Fast Regression for Structured Inputs βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
Fast topological clustering with Wasserstein distance ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
FastSHAP: Real-Time Shapley Value Estimation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Feature Kernel Distillation βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
FedBABU: Toward Enhanced Representation for Federated Image Classification βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
FedChain: Chained Algorithms for Near-optimal Communication Cost in Federated Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
FedPara: Low-rank Hadamard Product for Communication-Efficient Federated Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Federated Learning from Only Unlabeled Data with Class-conditional-sharing Clients βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Few-Shot Backdoor Attacks on Visual Object Tracking ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Few-shot Learning via Dirichlet Tessellation Ensemble βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural Networks ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel Space ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Finding Biological Plausibility for Adversarially Robust Features via Metameric Tasks ❌ βœ… βœ… ❌ ❌ βœ… βœ… 4
Finding an Unsupervised Image Segmenter in each of your Deep Generative Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Fine-Tuning can Distort Pretrained Features and Underperform Out-of-Distribution ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Fine-grained Differentiable Physics: A Yarn-level Model for Fabrics ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Finetuned Language Models are Zero-Shot Learners ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Fixed Neural Network Steganography: Train the images, not the network βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Focus on the Common Good: Group Distributional Robustness Follows βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Fooling Explanations in Text Classifiers βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Fortuitous Forgetting in Connectionist Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Frame Averaging for Invariant and Equivariant Network Design ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
From Intervention to Domain Transportation: A Novel Perspective to Optimize Recommendation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
From Stars to Subgraphs: Uplifting Any GNN with Local Structure Awareness ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
GATSBI: Generative Adversarial Training for Simulation-Based Inference βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
GDA-AM: ON THE EFFECTIVENESS OF SOLVING MIN-IMAX OPTIMIZATION VIA ANDERSON MIXING βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
GLASS: GNN with Labeling Tricks for Subgraph Representation Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
GNN is a Counter? Revisiting GNN for Question Answering βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
GNN-LM: Language Modeling based on Global Contexts via GNN ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
GRAND++: Graph Neural Diffusion with A Source Term ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Gaussian Mixture Convolution Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
GeneDisco: A Benchmark for Experimental Design in Drug Discovery ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Generalisation in Lifelong Reinforcement Learning through Logical Composition βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Generalization Through the Lens of Leave-One-Out Error ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Generalization of Neural Combinatorial Solvers Through the Lens of Adversarial Robustness βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Generalized Decision Transformer for Offline Hindsight Information Matching βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Generalized Demographic Parity for Group Fairness ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Generalized Kernel Thinning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Generalized Natural Gradient Flows in Hidden Convex-Concave Games and GANs ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Generalized rectifier wavelet covariance models for texture synthesis ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Generalizing Few-Shot NAS with Gradient Matching βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Generative Modeling with Optimal Transport Maps βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Generative Models as a Data Source for Multiview Representation Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Generative Principal Component Analysis βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Generative Pseudo-Inverse Memory βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
GeoDiff: A Geometric Diffusion Model for Molecular Conformation Generation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Geometric Transformers for Protein Interface Contact Prediction ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Geometric and Physical Quantities improve E(3) Equivariant Message Passing βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
GiraffeDet: A Heavy-Neck Paradigm for Object Detection ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Givens Coordinate Descent Methods for Rotation Matrix Learning in Trainable Embedding Indexes βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Goal-Directed Planning via Hindsight Experience Replay βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
GradMax: Growing Neural Networks using Gradient Information ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
GradSign: Model Performance Inference with Theoretical Insights βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Gradient Importance Learning for Incomplete Observations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Gradient Information Matters in Policy Optimization by Back-propagating through Model βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Gradient Matching for Domain Generalization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Gradient Step Denoiser for convergent Plug-and-Play βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Granger causal inference on DAGs identifies genomic loci regulating transcription ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Graph Auto-Encoder via Neighborhood Wasserstein Reconstruction ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Graph Condensation for Graph Neural Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Graph Neural Network Guided Local Search for the Traveling Salesperson Problem ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Graph Neural Networks with Learnable Structural and Positional Representations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Graph-Guided Network for Irregularly Sampled Multivariate Time Series ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Graph-Relational Domain Adaptation ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Graph-based Nearest Neighbor Search in Hyperbolic Spaces ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Graph-less Neural Networks: Teaching Old MLPs New Tricks Via Distillation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
GraphENS: Neighbor-Aware Ego Network Synthesis for Class-Imbalanced Node Classification βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Graphon based Clustering and Testing of Networks: Algorithms and Theory βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
GreaseLM: Graph REASoning Enhanced Language Models ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Group equivariant neural posterior estimation ❌ βœ… ❌ βœ… βœ… ❌ βœ… 4
Group-based Interleaved Pipeline Parallelism for Large-scale DNN Training ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
HTLM: Hyper-Text Pre-Training and Prompting of Language Models ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Half-Inverse Gradients for Physical Deep Learning ❌ βœ… ❌ ❌ βœ… βœ… βœ… 4
Handling Distribution Shifts on Graphs: An Invariance Perspective βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Heteroscedastic Temporal Variational Autoencoder For Irregularly Sampled Time Series ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Hidden Parameter Recurrent State Space Models For Changing Dynamics Scenarios βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Hierarchical Few-Shot Imitation with Skill Transition Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Hierarchical Variational Memory for Few-shot Learning Across Domains ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
High Probability Bounds for a Class of Nonconvex Algorithms with AdaGrad Stepsize βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
High Probability Generalization Bounds with Fast Rates for Minimax Problems ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Hindsight Foresight Relabeling for Meta-Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Hindsight: Posterior-guided training of retrievers for improved open-ended generation ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Hot-Refresh Model Upgrades with Regression-Free Compatible Training in Image Retrieval βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
How Attentive are Graph Attention Networks? ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
How Did the Model Change? Efficiently Assessing Machine Learning API Shifts βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
How Do Vision Transformers Work? ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
How Low Can We Go: Trading Memory for Error in Low-Precision Training βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
How Much Can CLIP Benefit Vision-and-Language Tasks? ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
How Well Does Self-Supervised Pre-Training Perform with Streaming Data? ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
How many degrees of freedom do we need to train deep networks: a loss landscape perspective ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
How to Train Your MAML to Excel in Few-Shot Classification βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
How to deal with missing data in supervised deep learning? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Huber Additive Models for Non-stationary Time Series Analysis βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Hybrid Local SGD for Federated Learning with Heterogeneous Communications βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Hybrid Random Features ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Hyperparameter Tuning with Renyi Differential Privacy ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor Scenes ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
IGLU: Efficient GCN Training via Lazy Updates βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Igeood: An Information Geometry Approach to Out-of-Distribution Detection βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Illiterate DALL-E Learns to Compose ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Image BERT Pre-training with Online Tokenizer βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Imbedding Deep Neural Networks βœ… βœ… βœ… ❌ ❌ βœ… βœ… 5
Imitation Learning by Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Imitation Learning from Observations under Transition Model Disparity βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Implicit Bias of Adversarial Training for Deep Neural Networks βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Implicit Bias of MSE Gradient Optimization in Underparameterized Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown Codimension βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100 ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Improving Federated Learning Face Recognition via Privacy-Agnostic Clusters βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Improving Mutual Information Estimation with Annealed and Energy-Based Bounds ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Improving Non-Autoregressive Translation Models Without Distillation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Improving the Accuracy of Learning Example Weights for Imbalance Classification βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Increasing the Cost of Model Extraction with Calibrated Proof of Work ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Incremental False Negative Detection for Contrastive Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Inductive Relation Prediction Using Analogy Subgraph Embeddings βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
InfinityGAN: Towards Infinite-Pixel Image Synthesis βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Information Bottleneck: Exact Analysis of (Quantized) Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Information Gain Propagation: a New Way to Graph Active Learning with Soft Labels βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Information Prioritization through Empowerment in Visual Model-based RL βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Information-theoretic Online Memory Selection for Continual Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
IntSGD: Adaptive Floatless Compression of Stochastic Gradients βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Interacting Contour Stochastic Gradient Langevin Dynamics βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Interpretable Unsupervised Diversity Denoising and Artefact Removal ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Invariant Causal Representation Learning for Out-of-Distribution Generalization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies βœ… βœ… βœ… βœ… βœ… ❌ ❌ 5
Is Fairness Only Metric Deep? Evaluating and Addressing Subgroup Gaps in Deep Metric Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Is High Variance Unavoidable in RL? A Case Study in Continuous Control ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
Is Homophily a Necessity for Graph Neural Networks? βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Is Importance Weighting Incompatible with Interpolating Classifiers? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
It Takes Two to Tango: Mixup for Deep Metric Learning ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Iterative Refinement Graph Neural Network for Antibody Sequence-Structure Co-design βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Joint Shapley values: a measure of joint feature importance ❌ βœ… βœ… ❌ βœ… ❌ ❌ 3
KL Guided Domain Adaptation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Know Thyself: Transferable Visual Control Policies Through Robot-Awareness βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Know Your Action Set: Learning Action Relations for Reinforcement Learning βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Knowledge Infused Decoding βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Knowledge Removal in Sampling-based Bayesian Inference βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
L0-Sparse Canonical Correlation Analysis βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
LEARNING GUARANTEES FOR GRAPH CONVOLUTIONAL NETWORKS ON THE STOCHASTIC BLOCK MODEL ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
LFPT5: A Unified Framework for Lifelong Few-shot Language Learning Based on Prompt Tuning of T5 ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential Equations βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
LOSSY COMPRESSION WITH DISTRIBUTION SHIFT AS ENTROPY CONSTRAINED OPTIMAL TRANSPORT ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Label Encoding for Regression Networks ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Label Leakage and Protection in Two-party Split Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Label-Efficient Semantic Segmentation with Diffusion Models ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Language model compression with weighted low-rank factorization ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Language modeling via stochastic processes ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Language-biased image classification: evaluation based on semantic representations ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Language-driven Semantic Segmentation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Large Language Models Can Be Strong Differentially Private Learners βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Large-Scale Representation Learning on Graphs via Bootstrapping ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Latent Image Animator: Learning to Animate Images via Latent Space Navigation ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Latent Variable Sequential Set Transformers for Joint Multi-Agent Motion Prediction ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Learn Locally, Correct Globally: A Distributed Algorithm for Training Graph Neural Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learnability Lock: Authorized Learnability Control Through Adversarial Invertible Transformations βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learnability of convolutional neural networks for infinite dimensional input via mixed and anisotropic smoothness ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Learned Simulators for Turbulence ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning Altruistic Behaviours in Reinforcement Learning without External Rewards ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning Causal Models from Conditional Moment Restrictions by Importance Weighting ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning Continuous Environment Fields via Implicit Functions βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning Curves for Gaussian Process Regression with Power-Law Priors and Targets ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Learning Curves for SGD on Structured Features ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning View ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Distributionally Robust Models at Scale via Composite Optimization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning Efficient Image Super-Resolution Networks via Structure-Regularized Pruning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning Features with Parameter-Free Layers ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Graphon Mean Field Games and Approximate Nash Equilibria βœ… βœ… ❌ ❌ ❌ βœ… βœ… 4
Learning Hierarchical Structures with Differentiable Nondeterministic Stacks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning Long-Term Reward Redistribution via Randomized Return Decomposition βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Multimodal VAEs through Mutual Supervision ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Learning Neural Contextual Bandits through Perturbed Rewards βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Learning Object-Oriented Dynamics for Planning from Text βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning Optimal Conformal Classifiers βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Learning Prototype-oriented Set Representations for Meta-Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, And No Retraining βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Representation from Neural Fisher Kernel with Low-rank Approximation βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning Scenario Representation for Solving Two-stage Stochastic Integer Programs βœ… ❌ ❌ ❌ βœ… βœ… βœ… 4
Learning State Representations via Retracing in Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning Strides in Convolutional Neural Networks βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Learning Super-Features for Image Retrieval ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning Synthetic Environments and Reward Networks for Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Learning Temporally Causal Latent Processes from General Temporal Data ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Learning Towards The Largest Margins ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Learning Transferable Reward for Query Object Localization with Policy Adaptation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning Value Functions from Undirected State-only Experience βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning Versatile Neural Architectures by Propagating Network Codes βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Learning Weakly-supervised Contrastive Representations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning a subspace of policies for online adaptation in Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Learning by Directional Gradient Descent βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Learning curves for continual learning in neural networks: Self-knowledge transfer and forgetting ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning meta-features for AutoML βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning more skills through optimistic exploration βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Learning the Dynamics of Physical Systems from Sparse Observations with Finite Element Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning to Annotate Part Segmentation with Gradient Matching βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Learning to Complete Code with Sketches βœ… ❌ ❌ βœ… βœ… βœ… βœ… 5
Learning to Dequantise with Truncated Flows βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Learning to Downsample for Segmentation of Ultra-High Resolution Images ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning to Extend Molecular Scaffolds with Structural Motifs βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Learning to Generalize across Domains on Single Test Samples βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning to Guide and to be Guided in the Architect-Builder Problem βœ… βœ… ❌ βœ… βœ… ❌ βœ… 5
Learning to Map for Active Semantic Goal Navigation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic Forecasting ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Learning to Schedule Learning rate with Graph Neural Networks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Learning transferable motor skills with hierarchical latent mixture policies ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Learning-Augmented $k$-means Clustering βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Leveraging Automated Unit Tests for Unsupervised Code Translation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Leveraging unlabeled data to predict out-of-distribution performance ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Likelihood Training of SchrΓΆdinger Bridge using Forward-Backward SDEs Theory βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Linking Emergent and Natural Languages via Corpus Transfer ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Lipschitz-constrained Unsupervised Skill Discovery βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
LoRA: Low-Rank Adaptation of Large Language Models ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Local Feature Swapping for Generalization in Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Long Expressive Memory for Sequence Modeling ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Looking Back on Learned Experiences For Class/task Incremental Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Lossless Compression with Probabilistic Circuits βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Low-Budget Active Learning via Wasserstein Distance: An Integer Programming Approach βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
MAML is a Noisy Contrastive Learner in Classification βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
MCMC Should Mix: Learning Energy-Based Model with Neural Transport Latent Space MCMC βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
MT3: Multi-Task Multitrack Music Transcription ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without Retraining βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax Optimality ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Map Induction: Compositional spatial submap learning for efficient exploration in novel environments ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Mapping Language Models to Grounded Conceptual Spaces ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Mapping conditional distributions for domain adaptation under generalized target shift βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Maximizing Ensemble Diversity in Deep Reinforcement Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Maximum Entropy RL (Provably) Solves Some Robust RL Problems ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Maximum n-times Coverage for Vaccine Design βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Measuring CLEVRness: Black-box Testing of Visual Reasoning Models βœ… ❌ βœ… βœ… ❌ βœ… βœ… 5
Measuring the Interpretability of Unsupervised Representations via Quantized Reversed Probing ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Memorizing Transformers ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Memory Augmented Optimizers for Deep Learning βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Memory Replay with Data Compression for Continual Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Mention Memory: incorporating textual knowledge into Transformers through entity mention attention ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Message Passing Neural PDE Solvers βœ… βœ… ❌ βœ… βœ… ❌ βœ… 5
Meta Discovery: Learning to Discover Novel Classes given Very Limited Data βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Meta Learning Low Rank Covariance Factors for Energy Based Deterministic Uncertainty βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Meta-Imitation Learning by Watching Video Demonstrations ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Meta-Learning with Fewer Tasks through Task Interpolation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
MetaMorph: Learning Universal Controllers with Transformers βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial Networks ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Minimax Optimality (Probably) Doesn't Imply Distribution Learning for GANs ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Minimax Optimization with Smooth Algorithmic Adversaries βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Mirror Descent Policy Optimization βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Missingness Bias in Model Debugging ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
MoReL: Multi-omics Relational Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Model Agnostic Interpretability for Multiple Instance Learning ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Model Zoo: A Growing Brain That Learns Continually ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Model-Based Offline Meta-Reinforcement Learning with Regularization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Model-augmented Prioritized Experience Replay βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Modeling Label Space Interactions in Multi-label Classification using Box Embeddings ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Modular Lifelong Reinforcement Learning via Neural Composition βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
MonoDistill: Learning Spatial Features for Monocular 3D Object Detection ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Monotonic Differentiable Sorting Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Multi-Agent MDP Homomorphic Networks ❌ βœ… ❌ ❌ ❌ βœ… βœ… 3
Multi-Critic Actor Learning: Teaching RL Policies to Act with Style ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Multi-Mode Deep Matrix and Tensor Factorization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Multi-Stage Episodic Control for Strategic Exploration in Text Games βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Multi-Task Processes ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Multi-objective Optimization by Learning Space Partition βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Multimeasurement Generative Models βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Multiset-Equivariant Set Prediction with Approximate Implicit Differentiation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Multitask Prompted Training Enables Zero-Shot Task Generalization ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy βœ… βœ… βœ… βœ… ❌ ❌ ❌ 4
NASI: Label- and Data-agnostic Neural Architecture Search at Initialization βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
NASPY: Automated Extraction of Automated Machine Learning Models βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
NASViT: Neural Architecture Search for Efficient Vision Transformers with Gradient Conflict aware Supernet Training βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
NETWORK INSENSITIVITY TO PARAMETER NOISE VIA PARAMETER ATTACK DURING TRAINING βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
NODE-GAM: Neural Generalized Additive Model for Interpretable Deep Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Natural Language Descriptions of Deep Visual Features ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Network Augmentation for Tiny Deep Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
NeuPL: Neural Population Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Neural Contextual Bandits with Deep Representation and Shallow Exploration βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Neural Deep Equilibrium Solvers βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Neural Link Prediction with Walk Pooling ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Neural Markov Controlled SDE: Stochastic Optimization for Continuous-Time Data βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Neural Methods for Logical Reasoning over Knowledge Graphs ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Neural Models for Output-Space Invariance in Combinatorial Problems ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Neural Network Approximation based on Hausdorff distance of Tropical Zonotopes βœ… ❌ βœ… ❌ ❌ ❌ ❌ 2
Neural Networks as Kernel Learners: The Silent Alignment Effect ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Neural Parameter Allocation Search ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Neural Processes with Stochastic Attention: Paying more attention to the context dataset βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Neural Program Synthesis with Query βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Neural Relational Inference with Node-Specific Information ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Neural Solvers for Fast and Accurate Numerical Optimal Control ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Neural Spectral Marked Point Processes βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Neural Stochastic Dual Dynamic Programming βœ… ❌ ❌ ❌ βœ… ❌ βœ… 3
Neural Structured Prediction for Inductive Node Classification ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Neural Variational Dropout Processes ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Neural graphical modelling in continuous-time: consistency guarantees and algorithms ❌ βœ… ❌ βœ… ❌ ❌ βœ… 3
New Insights on Reducing Abrupt Representation Change in Online Continual Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
No One Representation to Rule Them All: Overlapping Features of Training Methods ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Node Feature Extraction by Self-Supervised Multi-scale Neighborhood Prediction ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Noisy Feature Mixup ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Non-Linear Operator Approximations for Initial Value Problems ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Non-Parallel Text Style Transfer with Self-Parallel Supervision βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability Authorization βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Nonlinear ICA Using Volume-Preserving Transformations ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Normalization of Language Embeddings for Cross-Lingual Alignment βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
OBJECT DYNAMICS DISTILLATION FOR SCENE DECOMPOSITION AND REPRESENTATION βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Object Pursuit: Building a Space of Objects via Discriminative Weight Generation βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Objects in Semantic Topology ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Offline Reinforcement Learning with Implicit Q-Learning βœ… βœ… βœ… ❌ βœ… ❌ ❌ 4
Offline Reinforcement Learning with Value-based Episodic Memory βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Omni-Dimensional Dynamic Convolution ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
On Bridging Generic and Personalized Federated Learning for Image Classification βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Distributed Adaptive Optimization with Gradient Compression βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
On Evaluation Metrics for Graph Generative Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On Improving Adversarial Transferability of Vision Transformers βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
On Incorporating Inductive Biases into VAEs ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On Non-Random Missing Labels in Semi-Supervised Learning ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
On Predicting Generalization using GANs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
On Redundancy and Diversity in Cell-based Neural Architecture Search ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On Robust Prefix-Tuning for Text Classification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On feature learning in neural networks with global convergence guarantees ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
On the Certified Robustness for Ensemble Models and Beyond ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On the Connection between Local Attention and Dynamic Depth-wise Convolution ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
On the Convergence of Certified Robust Training with Interval Bound Propagation ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
On the Convergence of mSGD and AdaGrad for Stochastic Optimization ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
On the Existence of Universal Lottery Tickets ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
On the Importance of Difficulty Calibration in Membership Inference Attacks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
On the Importance of Firth Bias Reduction in Few-Shot Classification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On the Learning and Learnability of Quasimetrics βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
On the Limitations of Multimodal VAEs ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
On the Optimal Memorization Power of ReLU Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the Pitfalls of Analyzing Individual Neurons in Language Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
On the Role of Neural Collapse in Transfer Learning ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
On the Uncomputability of Partition Functions in Energy-Based Sequence Models ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
On the approximation properties of recurrent encoder-decoder architectures ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
On the benefits of maximum likelihood estimation for Regression and Forecasting βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
On the relation between statistical learning and perceptual distances ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
On the role of population heterogeneity in emergent communication ❌ βœ… ❌ ❌ ❌ βœ… βœ… 3
On-Policy Model Errors in Reinforcement Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
One After Another: Learning Incremental Skills for a Changing World βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Online Ad Hoc Teamwork under Partial Observability βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Online Adversarial Attacks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Online Coreset Selection for Rehearsal-based Continual Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Online Facility Location with Predictions βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Online Hyperparameter Meta-Learning with Hypergradient Distillation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
OntoProtein: Protein Pretraining With Gene Ontology Embedding ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Open-Set Recognition: A Good Closed-Set Classifier is All You Need ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Open-World Semi-Supervised Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Optimal Representations for Covariate Shift βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Optimal Transport for Causal Discovery βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Optimal Transport for Long-Tailed Recognition with Learnable Cost Matrix βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Optimization and Adaptive Generalization of Three layer Neural Networks βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Optimization inspired Multi-Branch Equilibrium Models βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Optimizer Amalgamation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Optimizing Neural Networks with Gradient Lexicase Selection βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Orchestrated Value Mapping for Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious Correlations βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Overcoming The Spectral Bias of Neural Value Approximation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
PAC Prediction Sets Under Covariate Shift βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
PAC-Bayes Information Bottleneck βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction Learning βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
PF-GNN: Differentiable particle filtering based approximation of universal graph representations βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
PI3NN: Out-of-distribution-aware Prediction Intervals from Three Neural Networks ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
POETREE: Interpretable Policy Learning with Adaptive Decision Trees βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
PSA-GAN: Progressive Self Attention GANs for Synthetic Time Series ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Pareto Policy Adaptation βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Pareto Policy Pool for Model-based Offline Reinforcement Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Pareto Set Learning for Neural Multi-Objective Combinatorial Optimization βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Partial Wasserstein Adversarial Network for Non-rigid Point Set Registration βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Particle Stochastic Dual Coordinate Ascent: Exponential convergent algorithm for mean field neural network optimization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Patch-Fool: Are Vision Transformers Always Robust Against Adversarial Perturbations? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Path Auxiliary Proposal for MCMC in Discrete Space βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Path Integral Sampler: A Stochastic Control Approach For Sampling βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Peek-a-Boo: What (More) is Disguised in a Randomly Weighted Neural Network, and How to Find It Efficiently βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Perceiver IO: A General Architecture for Structured Inputs & Outputs ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Permutation Compressors for Provably Faster Distributed Nonconvex Optimization βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Permutation-Based SGD: Is Random Optimal? βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Phase Collapse in Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Phenomenology of Double Descent in Finite-Width Neural Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
PiCO: Contrastive Label Disambiguation for Partial Label Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Pix2seq: A Language Modeling Framework for Object Detection βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Planning in Stochastic Environments with a Learned Model βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Plant 'n' Seek: Can You Find the Winning Ticket? βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Poisoning and Backdooring Contrastive Learning ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Policy Gradients Incorporating the Future βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Policy Smoothing for Provably Robust Reinforcement Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Policy improvement by planning with Gumbel βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Possibility Before Utility: Learning And Using Hierarchical Affordances βœ… βœ… ❌ ❌ ❌ ❌ βœ… 3
Post hoc Explanations may be Ineffective for Detecting Unknown Spurious Correlation ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Post-Training Detection of Backdoor Attacks for Two-Class and Multi-Attack Scenarios βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Practical Conditional Neural Process Via Tractable Dependent Predictions ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Practical Integration via Separable Bijective Networks ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Pre-training Molecular Graph Representation with 3D Geometry ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Predicting Physics in Mesh-reduced Space with Temporal Attention βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Pretrained Language Model in Continual Learning: A Comparative Study βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Privacy Implications of Shuffling βœ… βœ… βœ… ❌ ❌ ❌ ❌ 3
Probabilistic Implicit Scene Completion ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Procedural generalization by planning with self-supervised world models ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Programmatic Reinforcement Learning without Oracles βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Progressive Distillation for Fast Sampling of Diffusion Models βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Promoting Saliency From Depth: Deep Unsupervised RGB-D Saliency Detection ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Proof Artifact Co-Training for Theorem Proving with Language Models ❌ βœ… ❌ βœ… βœ… ❌ βœ… 4
Properties from mechanisms: an equivariance perspective on identifiable representation learning ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Prospect Pruning: Finding Trainable Weights at Initialization using Meta-Gradients βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
ProtoRes: Proto-Residual Network for Pose Authoring via Learned Inverse Kinematics βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Prototype memory and attention mechanisms for few shot image generation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Prototypical Contrastive Predictive Coding βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Provable Adaptation across Multiway Domains via Representation Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Provable Learning-based Algorithm For Sparse Recovery βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Provably Filtering Exogenous Distractors using Multistep Inverse Dynamics βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Provably Robust Adversarial Examples βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Provably convergent quasistatic dynamics for mean-field two-player zero-sum games βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
Proving the Lottery Ticket Hypothesis for Convolutional Neural Networks ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Pseudo Numerical Methods for Diffusion Models on Manifolds βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Pyraformer: Low-Complexity Pyramidal Attention for Long-Range Time Series Modeling and Forecasting ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
QUERY EFFICIENT DECISION BASED SPARSE ATTACKS AGAINST BLACK-BOX DEEP LEARNING MODELS βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Quadtree Attention for Vision Transformers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Quantitative Performance Assessment of CNN Units via Topological Entropy Calculation ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Query Embedding on Hyper-Relational Knowledge Graphs ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
R4D: Utilizing Reference Objects for Long-Range Distance Estimation ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
R5: Rule Discovery with Reinforced and Recurrent Relational Reasoning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Random matrices in service of ML footprint: ternary random features with no performance loss βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Real-Time Neural Voice Camouflage ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Recursive Disentanglement Network ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank? βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Reducing Excessive Margin to Achieve a Better Accuracy vs. Robustness Trade-off βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
RegionViT: Regional-to-Local Attention for Vision Transformers ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Regularized Autoencoders for Isometric Representation Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Reinforcement Learning in Presence of Discrete Markovian Context Evolution βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Reinforcement Learning under a Multi-agent Predictive State Representation Model: Method and Theory βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Relating transformers to models and neural representations of the hippocampal formation ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
Relational Learning with Variational Bayes βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Relational Multi-Task Learning: Modeling Relations between Data and Tasks βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Relational Surrogate Loss Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
RelaxLoss: Defending Membership Inference Attacks without Losing Utility βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Reliable Adversarial Distillation with Unreliable Teachers βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Representation Learning for Online and Offline RL in Low-rank MDPs βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Representation-Agnostic Shape Fields ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Representational Continuity for Unsupervised Continual Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Resolving Training Biases via Influence-based Data Relabeling βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum ❌ βœ… ❌ ❌ ❌ ❌ βœ… 2
Responsible Disclosure of Generative Models Using Scalable Fingerprinting ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Rethinking Adversarial Transferability from a Data Distribution Perspective βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Rethinking Class-Prior Estimation for Positive-Unlabeled Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Rethinking Supervised Pre-Training for Better Downstream Transferring ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Reverse Engineering of Imperceptible Adversarial Image Perturbations ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Reversible Instance Normalization for Accurate Time-Series Forecasting against Distribution Shift βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Revisiting Design Choices in Offline Model Based Reinforcement Learning ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Revisiting Over-smoothing in BERT from the Perspective of Graph ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Revisiting flow generative models for Out-of-distribution detection βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Robbing the Fed: Directly Obtaining Private Data in Federated Learning with Modified Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness? ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Robust and Scalable SDE Learning: A Functional Perspective βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
RotoGrad: Gradient Homogenization in Multitask Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
RvS: What is Essential for Offline RL via Supervised Learning? βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
SGD Can Converge to Local Maxima ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
SOSP: Efficiently Capturing Global Correlations by Second-Order Structured Pruning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
SUMNAS: Supernet with Unbiased Meta-Features for Neural Architecture Search βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Safe Neurosymbolic Learning with Differentiable Symbolic Execution βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Salient ImageNet: How to discover spurious features in Deep Learning? ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Sample Efficient Stochastic Policy Extragradient Algorithm for Zero-Sum Markov Game βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Sample Selection with Uncertainty of Losses for Learning with Noisy Labels βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Sample and Computation Redistribution for Efficient Face Detection βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Sampling with Mirrored Stein Operators βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Scalable Sampling for Nonsymmetric Determinantal Point Processes βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Scale Efficiently: Insights from Pretraining and Finetuning Transformers ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Scale Mixtures of Neural Network Gaussian Processes ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Scaling Laws for Neural Machine Translation ❌ ❌ ❌ ❌ βœ… ❌ βœ… 2
Scarf: Self-Supervised Contrastive Learning using Random Feature Corruption βœ… ❌ βœ… βœ… ❌ βœ… βœ… 5
Scattering Networks on the Sphere for Scalable and Rotationally Equivariant Spherical CNNs ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Scene Transformer: A unified architecture for predicting future trajectories of multiple agents βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Score-Based Generative Modeling with Critically-Damped Langevin Diffusion βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Selective Ensembles for Consistent Predictions βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Self-Joint Supervised Learning βœ… βœ… βœ… βœ… ❌ βœ… βœ… 6
Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Self-Supervised Inference in State-Space Models βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Self-Supervision Enhanced Feature Selection with Correlated Gates βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Self-ensemble Adversarial Training for Improved Robustness βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Self-supervised Learning is More Robust to Dataset Imbalance βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Semi-relaxed Gromov-Wasserstein divergence and applications on graphs βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Sequence Approximation using Feedforward Spiking Neural Network for Spatiotemporal Learning: Theory and Optimization Methods ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Shallow and Deep Networks are Near-Optimal Approximators of Korobov Functions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Should I Run Offline Reinforcement Learning or Behavioral Cloning? βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Should We Be Pre-training? An Argument for End-task Aware Training as an Alternative βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Shuffle Private Stochastic Convex Optimization βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Signing the Supermask: Keep, Hide, Invert ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Simple GNN Regularisation for 3D Molecular Property Prediction and Beyond βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
SketchODE: Learning neural sketch representation in continuous time ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Skill-based Meta-Reinforcement Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Solving Inverse Problems in Medical Imaging with Score-Based Generative Models βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Sound Adversarial Audio-Visual Navigation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Sound and Complete Neural Network Repair with Minimality and Locality Guarantees βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
Source-Free Adaptation to Measurement Shift via Bottom-Up Feature Restoration βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Space-Time Graph Neural Networks ❌ ❌ ❌ βœ… ❌ ❌ βœ… 2
Spanning Tree-based Graph Generation for Molecules βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Sparse Attention with Learning to Hash βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Sparse Communication via Mixed Distributions ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Sparsity Winning Twice: Better Robust Generalization from More Efficient Training βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
SphereFace2: Binary Classification is All You Need for Deep Face Recognition ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Spherical Message Passing for 3D Molecular Graphs ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Spike-inspired rank coding for fast and accurate recurrent neural networks βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute Estimation βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Sqrt(d) Dimension Dependence of Langevin Monte Carlo ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Stability Regularization for Discrete Representation Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Steerable Partial Differential Operators for Equivariant Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Stein Latent Optimization for Generative Adversarial Networks βœ… βœ… βœ… ❌ βœ… βœ… βœ… 6
Step-unrolled Denoising Autoencoders for Text Generation βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Stiffness-aware neural network for learning Hamiltonian systems ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Stochastic Training is Not Necessary for Generalization ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Strength of Minibatch Noise in SGD ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
StyleAlign: Analysis and Applications of Aligned StyleGAN Models ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
StyleNeRF: A Style-based 3D Aware Generator for High-resolution Image Synthesis ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Subspace Regularizers for Few-Shot Class Incremental Learning ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Superclass-Conditional Gaussian Mixture Model For Learning Fine-Grained Embeddings βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Surreal-GAN:Semi-Supervised Representation Learning via GAN for uncovering heterogeneous disease-related imaging patterns βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Surrogate Gap Minimization Improves Sharpness-Aware Training βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Surrogate NAS Benchmarks: Going Beyond the Limited Search Spaces of Tabular NAS Benchmarks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Switch to Generalize: Domain-Switch Learning for Cross-Domain Few-Shot Classification ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Symbolic Learning to Optimize: Towards Interpretability and Scalability βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Synchromesh: Reliable Code Generation from Pre-trained Language Models βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
T-WaveNet: A Tree-Structured Wavelet Neural Network for Time Series Signal Analysis ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
TAMP-S2GCNets: Coupling Time-Aware Multipersistence Knowledge Representation with Spatio-Supra Graph Convolutional Networks for Time-Series Forecasting ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
TAPEX: Table Pre-training via Learning a Neural SQL Executor ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
TAda! Temporally-Adaptive Convolutions for Video Understanding ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
TPU-GAN: Learning temporal coherence from dynamic point cloud sequences ❌ ❌ βœ… βœ… βœ… βœ… βœ… 5
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
TRGP: Trust Region Gradient Projection for Continual Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Taming Sparsely Activated Transformer with Stochastic Experts ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Target-Side Input Augmentation for Sequence to Sequence Generation ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Task Affinity with Maximum Bipartite Matching in Few-Shot Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Task Relatedness-Based Generalization Bounds for Meta Learning ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Task-Induced Representation Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
The Close Relationship Between Contrastive Learning and Meta-Learning βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
The Convex Geometry of Backpropagation: Neural Network Gradient Flows Converge to Extreme Points of the Dual Convex Program ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Effects of Invertibility on the Representational Complexity of Encoders in Variational Autoencoders ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
The Efficiency Misnomer ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
The Evolution of Uncertainty of Learning in Games ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Geometry of Memoryless Stochastic Policy Optimization in Infinite-Horizon POMDPs βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
The Hidden Convex Optimization Landscape of Regularized Two-Layer ReLU Networks: an Exact Characterization of Optimal Solutions ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
The Information Geometry of Unsupervised Reinforcement Learning ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
The MultiBERTs: BERT Reproductions for Robustness Analysis βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
The Rich Get Richer: Disparate Impact of Semi-Supervised Learning ❌ βœ… βœ… βœ… ❌ ❌ ❌ 3
The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
The Role of Pretrained Representations for the OOD Generalization of RL Agents ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
The Spectral Bias of Polynomial Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Three Stages of Learning Dynamics in High-dimensional Kernel Methods ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
The Uncanny Similarity of Recurrence and Depth ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Tighter Sparse Approximation Bounds for ReLU Neural Networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind ❌ βœ… ❌ ❌ βœ… ❌ βœ… 3
Top-N: Equivariant Set and Graph Generation without Exchangeability ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Top-label calibration and multiclass-to-binary reductions βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Topological Experience Replay βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Topological Graph Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Topologically Regularized Data Embeddings ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Toward Efficient Low-Precision Training: Data Format Optimization and Hysteresis Quantization ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Toward Faithful Case-based Reasoning through Learning Prototypes in a Nearest Neighbor-friendly Space. ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Towards Better Understanding and Better Generalization of Low-shot Classification in Histology Images with Contrastive Learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Towards Building A Group-based Unsupervised Representation Disentanglement Framework ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Towards Continual Knowledge Learning of Language Models ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Towards Deepening Graph Neural Networks: A GNTK-based Optimization Perspective ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Towards Empirical Sandwich Bounds on the Rate-Distortion Function βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Towards Evaluating the Robustness of Neural Networks Learned by Transduction βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Towards General Function Approximation in Zero-Sum Markov Games βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Towards Model Agnostic Federated Learning Using Knowledge Distillation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Towards Training Billion Parameter Graph Neural Networks for Atomic Simulations ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Towards Understanding Generalization via Decomposing Excess Risk Dynamics ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Towards Understanding the Data Dependency of Mixup-style Training ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Towards Understanding the Robustness Against Evasion Attack on Categorical Data βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Towards a Unified View of Parameter-Efficient Transfer Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Tracking the risk of a deployed model and detecting harmful distribution shifts βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Training Data Generating Networks: Shape Reconstruction via Bi-level Optimization βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Training Structured Neural Networks Through Manifold Identification and Variance Reduction βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Training Transition Policies via Distribution Matching for Complex Tasks βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Training invariances and the low-rank phenomenon: beyond linear networks ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Transfer RL across Observation Feature Spaces via Model-Based Regularization βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Transferable Adversarial Attack based on Integrated Gradients ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design βœ… βœ… ❌ ❌ βœ… ❌ βœ… 4
Transformer Embeddings of Irregularly Spaced Events and Their Participants ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Transformer-based Transform Coding ❌ ❌ βœ… ❌ βœ… βœ… βœ… 4
Transformers Can Do Bayesian Inference βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Triangle and Four Cycle Counting with Predictions in Graph Streams βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Trigger Hunting with a Topological Prior for Trojan Detection ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Trivial or Impossible --- dichotomous data difficulty masks model differences (on ImageNet and beyond) ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Tuformer: Data-driven Design of Transformers for Improved Generalization or Efficiency ❌ βœ… βœ… ❌ βœ… ❌ βœ… 4
Uncertainty Modeling for Out-of-Distribution Generalization βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Understanding Dimensional Collapse in Contrastive Self-supervised Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Understanding Domain Randomization for Sim-to-real Transfer βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
Understanding Intrinsic Robustness Using Label Uncertainty βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability Perspective βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Understanding and Improving Graph Injection Attack by Promoting Unnoticeability βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Understanding and Leveraging Overparameterization in Recursive Value Estimation ❌ ❌ βœ… ❌ ❌ ❌ βœ… 2
Understanding and Preventing Capacity Loss in Reinforcement Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Understanding approximate and unrolled dictionary learning for pattern recovery βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Understanding over-squashing and bottlenecks on graphs via curvature βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Understanding the Role of Self Attention for Efficient Speech Recognition ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Understanding the Variance Collapse of SVGD in High Dimensions βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Unified Visual Transformer Compression βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Unifying Likelihood-free Inference with Black-box Optimization and Beyond βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Universal Approximation Under Constraints is Possible with Transformers ❌ βœ… ❌ ❌ ❌ ❌ ❌ 1
Universalizing Weak Supervision βœ… ❌ βœ… ❌ βœ… ❌ βœ… 4
Unraveling Model-Agnostic Meta-Learning via The Adaptation Learning Rate ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Unrolling PALM for Sparse Semi-Blind Source Separation βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Unsupervised Discovery of Object Radiance Fields βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Unsupervised Disentanglement with Tensor Product Representations on the Torus ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
Unsupervised Semantic Segmentation by Distilling Feature Correspondences ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Using Graph Representation Learning with Schema Encoders to Measure the Severity of Depressive Symptoms ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
VAE Approximation Error: ELBO and Exponential Families ❌ ❌ βœ… ❌ ❌ βœ… βœ… 3
VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated Objects ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
VC dimension of partially quantized neural networks in the overparametrized regime ❌ βœ… βœ… βœ… ❌ βœ… βœ… 5
VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
VOS: Learning What You Don't Know by Virtual Outlier Synthesis βœ… βœ… βœ… βœ… βœ… βœ… βœ… 7
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning ❌ ❌ βœ… ❌ ❌ ❌ ❌ 1
Value Gradient weighted Model-Based Reinforcement Learning βœ… βœ… βœ… ❌ ❌ ❌ βœ… 4
Variational Inference for Discriminative Learning with Generative Modeling of Feature Incompletion βœ… ❌ βœ… βœ… βœ… βœ… βœ… 6
Variational Neural Cellular Automata βœ… βœ… βœ… βœ… ❌ ❌ βœ… 5
Variational Predictive Routing with Nested Subjective Timescales βœ… ❌ βœ… ❌ ❌ ❌ βœ… 3
Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias ❌ ❌ ❌ ❌ ❌ ❌ βœ… 1
Variational methods for simulation-based inference βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Variational oracle guiding for reinforcement learning ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Vector-quantized Image Modeling with Improved VQGAN ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
ViDT: An Efficient and Effective Fully Transformer-based Object Detector ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
ViTGAN: Training GANs with Vision Transformers ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Vision-Based Manipulators Need to Also See from Their Hands ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
Visual Correspondence Hallucination ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Visual Representation Learning Does Not Generalize Strongly Within the Same Domain ❌ βœ… βœ… ❌ βœ… βœ… βœ… 5
Visual Representation Learning over Latent Domains ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
Visual hyperacuity with moving sensor and recurrent neural computations ❌ βœ… βœ… ❌ ❌ ❌ ❌ 2
Vitruvion: A Generative Model of Parametric CAD Sketches ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
W-CTC: a Connectionist Temporal Classification Loss with Wild Cards ❌ βœ… βœ… ❌ ❌ ❌ βœ… 3
WeakM3D: Towards Weakly Supervised Monocular 3D Object Detection ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Weighted Training for Cross-Task Learning βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
What Do We Mean by Generalization in Federated Learning? ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
What Happens after SGD Reaches Zero Loss? --A Mathematical Framework ❌ ❌ ❌ ❌ ❌ ❌ ❌ 0
What Makes Better Augmentation Strategies? Augment Difficult but Not too Different βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
What’s Wrong with Deep Learning in Tree Search for Combinatorial Optimization βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently? βœ… ❌ ❌ ❌ ❌ ❌ ❌ 1
When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
When should agents explore? βœ… ❌ βœ… ❌ βœ… βœ… βœ… 5
When, Why, and Which Pretrained GANs Are Useful? ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective ❌ ❌ βœ… ❌ ❌ βœ… βœ… 3
Who Is Your Right Mixup Partner in Positive and Unlabeled Learning βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL βœ… βœ… βœ… ❌ βœ… ❌ βœ… 5
Why Propagate Alone? Parallel Use of Labels and Features on Graphs ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Wiring Up Vision: Minimizing Supervised Synaptic Updates Needed to Produce a Primate Ventral Stream ❌ βœ… βœ… βœ… βœ… βœ… βœ… 6
Wisdom of Committees: An Overlooked Approach To Faster and More Accurate Models βœ… ❌ βœ… βœ… βœ… ❌ βœ… 5
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation βœ… ❌ ❌ ❌ ❌ ❌ βœ… 2
X-model: Improving Data Efficiency in Deep Learning with A Minimax Model ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction ❌ ❌ βœ… βœ… ❌ ❌ ❌ 2
You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks ❌ βœ… βœ… βœ… βœ… ❌ βœ… 5
Zero Pixel Directional Boundary by Vector Transform ❌ ❌ βœ… βœ… ❌ ❌ βœ… 3
Zero-CL: Instance and Feature decorrelation for negative-free symmetric contrastive learning ❌ ❌ βœ… ❌ βœ… ❌ βœ… 3
Zero-Shot Self-Supervised Learning for MRI Reconstruction ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
ZeroFL: Efficient On-Device Training for Federated Learning with Local Sparsity βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
cosFormer: Rethinking Softmax In Attention βœ… βœ… βœ… βœ… βœ… ❌ βœ… 6
iFlood: A Stable and Effective Regularizer ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4
iLQR-VAE : control-based learning of input-driven dynamics with applications to neural data βœ… ❌ βœ… βœ… ❌ ❌ βœ… 4
miniF2F: a cross-system benchmark for formal Olympiad-level mathematics ❌ βœ… βœ… βœ… ❌ ❌ βœ… 4
switch-GLAT: Multilingual Parallel Machine Translation Via Code-Switch Decoder ❌ ❌ βœ… βœ… βœ… ❌ βœ… 4