International Conference on Learning Representations (ICLR) - 2022

Conference Proceedings:

Key: PC - Pseudocode, OSC - Open Source Code, OSD - Open Datasets, DS - Dataset Splits, HS - Hardware Specification, SD - Software Dependencies, ES - Experiment Setup

$\beta$-Intact-VAE: Identifying and Estimating Causal Effects under Limited Overlap 4
$\mathrm{SO}(2)$-Equivariant Reinforcement Learning 2
$\pi$BO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization 5
8-bit Optimizers via Block-wise Quantization 5
A Biologically Interpretable Graph Convolutional Network to Link Genetic Risk Pathways and Imaging Phenotypes of Disease 4
A Class of Short-term Recurrence Anderson Mixing Methods and Their Applications 5
A Comparison of Hamming Errors of Representative Variable Selection Methods 1
A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion 4
A Deep Variational Approach to Clustering Survival Data 5
A Fine-Grained Analysis on Distribution Shift 4
A Fine-Tuning Approach to Belief State Modeling 4
A First-Occupancy Representation for Reinforcement Learning 5
A General Analysis of Example-Selection for Stochastic Gradient Descent 7
A Generalized Weighted Optimization Method for Computational Learning and Inversion 0
A Johnson-Lindenstrauss Framework for Randomly Initialized CNNs 2
A Loss Curvature Perspective on Training Instabilities of Deep Learning Models 4
A NON-PARAMETRIC REGRESSION VIEWPOINT : GENERALIZATION OF OVERPARAMETRIZED DEEP RELU NETWORK UNDER NOISY OBSERVATIONS 1
A Neural Tangent Kernel Perspective of Infinite Tree Ensembles 5
A New Perspective on "How Graph Neural Networks Go Beyond Weisfeiler-Lehman?" 4
A Program to Build E(N)-Equivariant Steerable CNNs 5
A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning 1
A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning 4
A Statistical Framework for Efficient Out of Distribution Detection in Deep Neural Networks 5
A Tale of Two Flows: Cooperative Learning of Langevin Flow and Normalizing Flow Toward Energy-Based Model 3
A Theoretical Analysis on Feature Learning in Neural Networks: Emergence from Inputs and Advantage over Fixed Features 5
A Theory of Tournament Representations 2
A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial Training 2
A Unified Wasserstein Distributional Robustness Framework for Adversarial Training 4
A Zest of LIME: Towards Architecture-Independent Model Distances 4
A fast and accurate splitting method for optimal transport: analysis and implementation 5
A generalization of the randomized singular value decomposition 4
A global convergence theory for deep ReLU implicit networks via over-parameterization 1
ADAVI: Automatic Dual Amortized Variational Inference Applied To Pyramidal Bayesian Models 5
AEVA: Black-box Backdoor Detection Using Adversarial Extreme Value Analysis 4
ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity 5
AS-MLP: An Axial Shifted MLP Architecture for Vision 6
Ab-Initio Potential Energy Surfaces by Pairing GNNs with Neural Wave Functions 4
Accelerated Policy Learning with Parallel Differentiable Simulation 5
Acceleration of Federated Learning with Alleviated Forgetting in Local Training 6
Active Hierarchical Exploration with Stable Subgoal Representation Learning 5
Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game 5
Actor-critic is implicitly biased towards high entropy optimal policies 1
Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure Space 4
AdaAug: Learning Class- and Instance-adaptive Data Augmentation Policies 6
AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation 4
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning 5
Adaptive Wavelet Transformer Network for 3D Shape Representation Learning 4
Adversarial Retriever-Ranker for Dense Text Retrieval 5
Adversarial Robustness Through the Lens of Causality 3
Adversarial Support Alignment 6
Adversarial Unlearning of Backdoors via Implicit Hypergradient 6
Adversarially Robust Conformal Prediction 6
Almost Tight L0-norm Certified Robustness of Top-k Predictions against Adversarial Perturbations 2
AlphaZero-based Proof Cost Network to Aid Game Solving 4
Amortized Implicit Differentiation for Stochastic Bilevel Optimization 4
Amortized Tree Generation for Bottom-up Synthesis Planning and Synthesizable Molecular Design 6
An Agnostic Approach to Federated Learning with Class Imbalance 3
An Autoregressive Flow Model for 3D Molecular Geometry Generation from Scratch 5
An Experimental Design Perspective on Model-Based Reinforcement Learning 3
An Explanation of In-context Learning as Implicit Bayesian Inference 5
An Information Fusion Approach to Learning with Instance-Dependent Label Noise 5
An Operator Theoretic View On Pruning Deep Neural Networks 6
An Unconstrained Layer-Peeled Perspective on Neural Collapse 2
Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models 6
Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation 2
Ancestral protein sequence reconstruction using a tree-structured Ornstein-Uhlenbeck variational autoencoder 5
Anisotropic Random Feature Regression in High Dimensions 1
Anomaly Detection for Tabular Data with Internal Contrastive Learning 4
Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy 5
Anti-Concentrated Confidence Bonuses For Scalable Exploration 5
Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice 4
Anytime Dense Prediction with Confidence Adaptivity 5
Approximation and Learning with Deep Convolutional Models: a Kernel Perspective 4
Assessing Generalization of SGD via Disagreement 2
Associated Learning: an Alternative to End-to-End Backpropagation that Works on CNN, RNN, and Transformer 6
Asymmetry Learning for Counterfactually-invariant Classification in OOD Tasks 1
Attacking deep networks with surrogate-based adversarial black-box methods is easy 5
Attention-based Interpretability with Concept Transformers 5
Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable 4
Augmented Sliced Wasserstein Distances 5
Auto-Transfer: Learning to Route Transferable Representations 5
Auto-scaling Vision Transformers without Training 6
Automated Self-Supervised Learning for Graphs 6
Automatic Loss Function Search for Predict-Then-Optimize Problems with Strong Ranking Property 5
Autonomous Learning of Object-Centric Abstractions for High-Level Planning 3
Autonomous Reinforcement Learning: Formalism and Benchmarking 3
Autoregressive Diffusion Models 6
Autoregressive Quantile Flows for Predictive Uncertainty Estimation 3
Axiomatic Explanations for Visual Search, Retrieval, and Similarity Learning 6
BAM: Bayes with Adaptive Memory 3
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis 6
BEiT: BERT Pre-Training of Image Transformers 5
Back2Future: Leveraging Backfill Dynamics for Improving Real-time Predictions in Future 5
Backdoor Defense via Decoupling the Training Process 4
BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation Models 4
Bag of Instances Aggregation Boosts Self-supervised Distillation 4
Bandit Learning with Joint Effect of Incentivized Sampling, Delayed Sampling Feedback, and Self-Reinforcing User Preferences 3
Bayesian Framework for Gradient Leakage 4
Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How 5
Bayesian Neural Network Priors Revisited 4
Benchmarking the Spectrum of Agent Capabilities 3
Better Supervisory Signals by Observing Learning Paths 5
Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains 3
Bi-linear Value Networks for Multi-goal Reinforcement Learning 2
BiBERT: Accurate Fully Binarized BERT 4
Blaschke Product Neural Networks (BPNN): A Physics-Infused Neural Network for Phase Retrieval of Meromorphic Functions 3
Boosted Curriculum Reinforcement Learning 4
Boosting Randomized Smoothing with Variance Reduced Classifiers 6
Boosting the Certified Robustness of L-infinity Distance Nets 5
Bootstrapped Meta-Learning 5
Bootstrapping Semantic Segmentation with Regional Contrast 4
Bregman Gradient Policy Optimization 4
Bridging Recommendation and Marketing via Recurrent Intensity Modeling 5
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations 5
Bundle Networks: Fiber Bundles, Local Trivializations, and a Generative Approach to Exploring Many-to-one Maps 4
Byzantine-Robust Learning on Heterogeneous Datasets via Bucketing 4
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks 4
CADDA: Class-wise Automatic Differentiable Data Augmentation for EEG Signals 7
CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation 3
CKConv: Continuous Kernel Convolution For Sequential Data 5
CLEVA-Compass: A Continual Learning Evaluation Assessment Compass to Promote Research Transparency and Comparability 2
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks 4
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation 3
CROP: Certifying Robust Policies for Reinforcement Learning through Functional Smoothing 5
CURVATURE-GUIDED DYNAMIC SCALE NETWORKS FOR MULTI-VIEW STEREO 5
Can an Image Classifier Suffice For Action Recognition? 5
Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views? 5
Capturing Structural Locality in Non-parametric Language Models 5
Case-based reasoning for better generalization in textual reinforcement learning 5
Causal Contextual Bandits with Targeted Interventions 4
Certified Robustness for Deep Equilibrium Models via Interval Bound Propagation 5
Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap 2
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization 6
Chemical-Reaction-Aware Molecule Representation Learning 5
Chunked Autoregressive GAN for Conditional Waveform Synthesis 5
Churn Reduction via Distillation 6
Clean Images are Hard to Reblur: Exploiting the Ill-Posed Inverse Task for Dynamic Scene Deblurring 6
ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods 2
Closed-form Sample Probing for Learning Generative Models in Zero-shot Learning 2
CoBERL: Contrastive BERT for Reinforcement Learning 4
CoMPS: Continual Meta Policy Search 4
CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting 5
CodeTrek: Flexible Modeling of Code using an Extensible Relational Representation 5
Coherence-based Label Propagation over Time Series for Accelerated Active Learning 6
Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods 3
Collapse by Conditioning: Training Class-conditional GANs with Limited Data 3
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos 3
Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games 3
Comparing Distributions by Measuring Differences that Affect Decision Making 4
Complete Verification via Multi-Neuron Relaxation Guided Branch-and-Bound 4
Compositional Attention: Disentangling Search and Retrieval 4
Compositional Training for End-to-End Deep AUC Maximization 6
ConFeSS: A Framework for Single Source Cross-Domain Few-Shot Learning 5
Concurrent Adversarial Learning for Large-Batch Training 4
Conditional Contrastive Learning with Kernel 5
Conditional Image Generation by Conditioning Variational Auto-Encoders 4
Conditional Object-Centric Learning from Video 5
Conditioning Sequence-to-sequence Networks with Learned Activations 2
Connectome-constrained Latent Variable Model of Whole-Brain Neural Activity 6
Consistent Counterfactuals for Deep Models 6
Constrained Physical-Statistics Models for Dynamical System Identification and Prediction 5
Constrained Policy Optimization via Bayesian World Models 5
Constraining Linear-chain CRFs to Regular Languages 6
Constructing Orthogonal Convolutions in an Explicit Manner 4
Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates 3
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics 3
Context-Aware Sparse Deep Coordination Graphs 4
Contextualized Scene Imagination for Generative Commonsense Reasoning 4
Continual Learning with Filter Atom Swapping 5
Continual Learning with Recursive Gradient Optimization 6
Continual Normalization: Rethinking Batch Normalization for Online Continual Learning 5
Continuous-Time Meta-Learning with Forward Mode Differentiation 6
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization 3
Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation 4
Contrastive Fine-grained Class Clustering via Generative Adversarial Networks 4
Controlling Directions Orthogonal to a Classifier 4
Controlling the Complexity and Lipschitz Constant improves Polynomial Nets 3
Convergent Graph Solvers 6
Convergent and Efficient Deep Q Learning Algorithm 4
CoordX: Accelerating Implicit Neural Representation with a Split MLP Architecture 3
Coordination Among Neural Modules Through a Shared Global Workspace 6
Counterfactual Plans under Distributional Ambiguity 5
Creating Training Sets via Weak Indirect Supervision 4
Critical Points in Quantum Generative Models 1
Cross-Domain Imitation Learning via Optimal Transport 3
Cross-Lingual Transfer with Class-Weighted Language-Invariant Representations 4
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL 4
CrossBeam: Learning to Search in Bottom-Up Program Synthesis 6
CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention 6
CrossMatch: Cross-Classifier Consistency Regularization for Open-Set Single Domain Generalization 3
CrowdPlay: Crowdsourcing Human Demonstrations for Offline Learning 4
Crystal Diffusion Variational Autoencoder for Periodic Material Generation 6
Curriculum learning as a tool to uncover learning principles in the brain 1
CycleMLP: A MLP-like Architecture for Dense Prediction 5
D-CODE: Discovering Closed-form ODEs from Observed Trajectories 6
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR 5
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning 4
DEGREE: Decomposition Based Explanation for Graph Neural Networks 7
DEPTS: Deep Expansion Learning for Periodic Time Series Forecasting 5
DISCOVERING AND EXPLAINING THE REPRESENTATION BOTTLENECK OF DNNS 4
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals 5
DIVA: Dataset Derivative of a Learning Task 3
DKM: Differentiable k-Means Clustering Layer for Neural Network Compression 3
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization 2
Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation 5
Data Poisoning Won’t Save You From Facial Recognition 4
Data-Driven Offline Optimization for Architecting Hardware Accelerators 3
Data-Efficient Graph Grammar Learning for Molecular Generation 3
DeSKO: Stability-Assured Robust Control with a Deep Stochastic Koopman Operator 4
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition 4
Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach 4
Declarative nets that are equilibrium models 4
Deconstructing the Inductive Biases of Hamiltonian Neural Networks 3
Decoupled Adaptation for Cross-Domain Object Detection 6
Deep Attentive Variational Inference 4
Deep AutoAugment 3
Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity 5
Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers 5
Deep Point Cloud Reconstruction 3
Deep ReLU Networks Preserve Expected Length 1
Defending Against Image Corruptions Through Adversarial Augmentations 3
Delaunay Component Analysis for Evaluation of Data Representations 6
DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations 4
Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit Regularization 3
Demystifying Limited Adversarial Transferability in Automatic Speech Recognition Systems 2
Denoising Likelihood Score Matching for Conditional Score-based Data Generation 4
DictFormer: Tiny Transformer with Shared Dictionary 5
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools 3
Differentiable DAG Sampling 6
Differentiable Expectation-Maximization for Set Representation Learning 4
Differentiable Gradient Sampling for Learning Implicit 3D Scene Reconstructions from a Single Image 4
Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners 5
Differentiable Scaffolding Tree for Molecule Optimization 6
Differentially Private Fine-tuning of Language Models 5
Differentially Private Fractional Frequency Moments Estimation with Polylogarithmic Space 5
Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme 3
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching 3
Discovering Invariant Rationales for Graph Neural Networks 6
Discovering Latent Concepts Learned in BERT 5
Discovering Nonlinear PDEs from Scarce Data with Physics-encoded Learning 4
Discrepancy-Based Active Learning for Domain Adaptation 5
Discrete Representations Strengthen Vision Transformer Robustness 5
Discriminative Similarity for Data Clustering 4
Disentanglement Analysis with Partial Information Decomposition 2
Distilling GANs with Style-Mixed Triplets for X2I Translation with Limited Data 5
Distribution Compression in Near-Linear Time 5
Distributional Reinforcement Learning with Monotonic Splines 4
Distributionally Robust Fair Principal Components via Geodesic Descents 5
Distributionally Robust Models with Parametric Likelihood Ratios 4
Diurnal or Nocturnal? Federated Learning of Multi-branch Networks from Periodically Shifting Distributions 4
Dive Deeper Into Integral Pose Regression 3
Divergence-aware Federated Self-Supervised Learning 5
Diverse Client Selection for Federated Learning via Submodular Maximization 4
Divisive Feature Normalization Improves Image Recognition Performance in AlexNet 3
Do Not Escape From the Manifold: Discovering the Local Coordinates on the Latent Space of GANs 5
Do Users Benefit From Interpretable Vision? A User Study, Baseline, And Dataset 4
Do We Need Anisotropic Graph Neural Networks? 6
Do deep networks transfer invariances across classes? 5
Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node features 4
Domain Adversarial Training: A Game Perspective 7
Domino: Discovering Systematic Errors with Cross-Modal Embeddings 4
Doubly Adaptive Scaled Algorithm for Machine Learning Using Second-Order Information 4
DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG Signals 5
Dropout Q-Functions for Doubly Efficient Reinforcement Learning 5
Dual Lottery Ticket Hypothesis 5
Dynamic Token Normalization improves Vision Transformers 5
Dynamics-Aware Comparison of Learned Reward Functions 4
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits 4
EViT: Expediting Vision Transformers via Token Reorganizations 7
EXACT: Scalable Graph Neural Networks Training via Extreme Activation Compression 7
Effect of scale on catastrophic forgetting in neural networks 3
Effective Model Sparsification by Scheduled Grow-and-Prune Methods 6
Efficient Active Search for Combinatorial Optimization Problems 5
Efficient Computation of Deep Nonlinear Infinite-Width Neural Networks that Learn Features 5
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization 5
Efficient Neural Causal Discovery without Acyclicity Constraints 5
Efficient Self-supervised Vision Transformers for Representation Learning 5
Efficient Sharpness-aware Minimization for Improved Training of Neural Networks 5
Efficient Split-Mix Federated Learning for On-Demand and In-Situ Customization 7
Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators 5
Efficient and Differentiable Conformal Prediction with General Function Classes 5
Efficiently Modeling Long Sequences with Structured State Spaces 6
EigenGame Unloaded: When playing games is better than optimizing 4
Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums 6
Einops: Clear and Reliable Tensor Manipulations with Einstein-like Notation 4
Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise 3
Embedded-model flows: Combining the inductive biases of model-free deep learning and explicit probabilistic modeling 6
Emergent Communication at Scale 6
Enabling Arbitrary Translation Objectives with Adaptive Tree Search 3
Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression 5
End-to-End Learning of Probabilistic Hierarchies on Graphs 4
Energy-Based Learning for Cooperative Games, with Applications to Valuation Problems in Machine Learning 4
Energy-Inspired Molecular Conformation Optimization 5
Enhancing Cross-lingual Transfer by Manifold Mixup 5
EntQA: Entity Linking as Question Answering 5
Entroformer: A Transformer-based Entropy Model for Learned Image Compression 4
Environment Predictive Coding for Visual Navigation 4
Equivariant Graph Mechanics Networks with Constraints 5
Equivariant Self-Supervised Learning: Encouraging Equivariance in Representations 5
Equivariant Subgraph Aggregation Networks 6
Equivariant Transformers for Neural Network based Molecular Potentials 5
Equivariant and Stable Positional Encoding for More Powerful Graph Neural Networks 5
Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems 3
Evading Adversarial Example Detection Defenses with Orthogonal Projected Gradient Descent 2
Evaluating Disentanglement of Structured Representations 5
Evaluating Distributional Distortion in Neural Language Modeling 4
Evaluating Model-Based Planning and Planner Amortization for Continuous Control 3
Evaluation Metrics for Graph Generative Models: Problems, Pitfalls, and Practical Solutions 3
Evidential Turing Processes 6
Evolutionary Diversity Optimization with Clustering-based Selection for Reinforcement Learning 3
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning 4
Explainable GNN-Based Models over Knowledge Graphs 7
Explaining Point Processes by Learning Interpretable Temporal Logic Rules 5
Explanations of Black-Box Models based on Directional Feature Interactions 5
Exploiting Class Activation Value for Partial-Label Learning 5
Exploring Memorization in Adversarial Training 4
Exploring extreme parameter compression for pre-trained language models 5
Exploring the Limits of Large Scale Pre-training 3
Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--Hastings 4
Expressiveness and Approximation Properties of Graph Neural Networks 0
Expressivity of Emergent Languages is a Trade-off between Contextual Complexity and Unpredictability 4
Extending the WILDS Benchmark for Unsupervised Adaptation 6
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization 5
FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations 4
FILIP: Fine-grained Interactive Language-Image Pre-Training 3
FILM: Following Instructions in Language with Modular Methods 5
FP-DETR: Detection Transformer Advanced by Fully Pre-training 4
Fair Normalizing Flows 7
FairCal: Fairness Calibration for Face Verification 5
Fairness Guarantees under Demographic Shift 5
Fairness in Representation for Multilingual NLP: Insights from Controlled Experiments on Conditional Language Modeling 6
Fast AdvProp 5
Fast Differentiable Matrix Square Root 6
Fast Generic Interaction Detection for Model Interpretability and Compression 4
Fast Model Editing at Scale 7
Fast Regression for Structured Inputs 4
Fast topological clustering with Wasserstein distance 3
FastSHAP: Real-Time Shapley Value Estimation 6
Feature Kernel Distillation 4
FedBABU: Toward Enhanced Representation for Federated Image Classification 6
FedChain: Chained Algorithms for Near-optimal Communication Cost in Federated Learning 3
FedPara: Low-rank Hadamard Product for Communication-Efficient Federated Learning 6
Federated Learning from Only Unlabeled Data with Class-conditional-sharing Clients 6
Few-Shot Backdoor Attacks on Visual Object Tracking 5
Few-shot Learning via Dirichlet Tessellation Ensemble 7
Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural Networks 5
Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel Space 3
Finding Biological Plausibility for Adversarially Robust Features via Metameric Tasks 4
Finding an Unsupervised Image Segmenter in each of your Deep Generative Models 3
Fine-Tuning can Distort Pretrained Features and Underperform Out-of-Distribution 4
Fine-grained Differentiable Physics: A Yarn-level Model for Fabrics 2
Finetuned Language Models are Zero-Shot Learners 5
Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward 2
Fixed Neural Network Steganography: Train the images, not the network 6
FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes 4
Focus on the Common Good: Group Distributional Robustness Follows 5
Fooling Explanations in Text Classifiers 2
Fortuitous Forgetting in Connectionist Networks 4
Frame Averaging for Invariant and Equivariant Network Design 5
Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits 4
From Intervention to Domain Transportation: A Novel Perspective to Optimize Recommendation 6
From Stars to Subgraphs: Uplifting Any GNN with Local Structure Awareness 5
GATSBI: Generative Adversarial Training for Simulation-Based Inference 6
GDA-AM: ON THE EFFECTIVENESS OF SOLVING MIN-IMAX OPTIMIZATION VIA ANDERSON MIXING 5
GLASS: GNN with Labeling Tricks for Subgraph Representation Learning 5
GNN is a Counter? Revisiting GNN for Question Answering 5
GNN-LM: Language Modeling based on Global Contexts via GNN 3
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems 4
GRAND++: Graph Neural Diffusion with A Source Term 4
Gaussian Mixture Convolution Networks 5
GeneDisco: A Benchmark for Experimental Design in Drug Discovery 5
Generalisation in Lifelong Reinforcement Learning through Logical Composition 2
Generalization Through the Lens of Leave-One-Out Error 2
Generalization of Neural Combinatorial Solvers Through the Lens of Adversarial Robustness 4
Generalized Decision Transformer for Offline Hindsight Information Matching 4
Generalized Demographic Parity for Group Fairness 4
Generalized Kernel Thinning 4
Generalized Natural Gradient Flows in Hidden Convex-Concave Games and GANs 3
Generalized rectifier wavelet covariance models for texture synthesis 3
Generalizing Few-Shot NAS with Gradient Matching 5
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks 5
Generative Modeling with Optimal Transport Maps 5
Generative Models as a Data Source for Multiview Representation Learning 4
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning 4
Generative Principal Component Analysis 6
Generative Pseudo-Inverse Memory 5
GeoDiff: A Geometric Diffusion Model for Molecular Conformation Generation 6
Geometric Transformers for Protein Interface Contact Prediction 6
Geometric and Physical Quantities improve E(3) Equivariant Message Passing 7
Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields 4
GiraffeDet: A Heavy-Neck Paradigm for Object Detection 4
Givens Coordinate Descent Methods for Rotation Matrix Learning in Trainable Embedding Indexes 5
Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games 2
Goal-Directed Planning via Hindsight Experience Replay 2
GradMax: Growing Neural Networks using Gradient Information 4
GradSign: Model Performance Inference with Theoretical Insights 6
Gradient Importance Learning for Incomplete Observations 6
Gradient Information Matters in Policy Optimization by Back-propagating through Model 4
Gradient Matching for Domain Generalization 6
Gradient Step Denoiser for convergent Plug-and-Play 5
Granger causal inference on DAGs identifies genomic loci regulating transcription 4
Graph Auto-Encoder via Neighborhood Wasserstein Reconstruction 5
Graph Condensation for Graph Neural Networks 6
Graph Neural Network Guided Local Search for the Traveling Salesperson Problem 5
Graph Neural Networks with Learnable Structural and Positional Representations 6
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series 5
Graph-Guided Network for Irregularly Sampled Multivariate Time Series 4
Graph-Relational Domain Adaptation 3
Graph-based Nearest Neighbor Search in Hyperbolic Spaces 4
Graph-less Neural Networks: Teaching Old MLPs New Tricks Via Distillation 5
GraphENS: Neighbor-Aware Ego Network Synthesis for Class-Imbalanced Node Classification 4
Graphon based Clustering and Testing of Networks: Algorithms and Theory 5
GreaseLM: Graph REASoning Enhanced Language Models 4
Group equivariant neural posterior estimation 4
Group-based Interleaved Pipeline Parallelism for Large-scale DNN Training 4
HTLM: Hyper-Text Pre-Training and Prompting of Language Models 2
Half-Inverse Gradients for Physical Deep Learning 4
Handling Distribution Shifts on Graphs: An Invariance Perspective 7
Heteroscedastic Temporal Variational Autoencoder For Irregularly Sampled Time Series 5
Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions 4
Hidden Parameter Recurrent State Space Models For Changing Dynamics Scenarios 2
Hierarchical Few-Shot Imitation with Skill Transition Models 5
Hierarchical Variational Memory for Few-shot Learning Across Domains 3
High Probability Bounds for a Class of Nonconvex Algorithms with AdaGrad Stepsize 1
High Probability Generalization Bounds with Fast Rates for Minimax Problems 1
Hindsight Foresight Relabeling for Meta-Reinforcement Learning 4
Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception 4
Hindsight: Posterior-guided training of retrievers for improved open-ended generation 3
Hot-Refresh Model Upgrades with Regression-Free Compatible Training in Image Retrieval 5
How Attentive are Graph Attention Networks? 5
How Did the Model Change? Efficiently Assessing Machine Learning API Shifts 6
How Do Vision Transformers Work? 4
How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning 3
How Low Can We Go: Trading Memory for Error in Low-Precision Training 7
How Much Can CLIP Benefit Vision-and-Language Tasks? 5
How Well Does Self-Supervised Pre-Training Perform with Streaming Data? 2
How many degrees of freedom do we need to train deep networks: a loss landscape perspective 5
How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data 2
How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective 4
How to Train Your MAML to Excel in Few-Shot Classification 5
How to deal with missing data in supervised deep learning? 4
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis 3
Huber Additive Models for Non-stationary Time Series Analysis 4
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation 6
Hybrid Local SGD for Federated Learning with Heterogeneous Communications 4
Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface 2
Hybrid Random Features 3
HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning 3
Hyperparameter Tuning with Renyi Differential Privacy 2
IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor Scenes 2
IGLU: Efficient GCN Training via Lazy Updates 7
Igeood: An Information Geometry Approach to Out-of-Distribution Detection 5
Illiterate DALL-E Learns to Compose 4
Image BERT Pre-training with Online Tokenizer 6
Imbedding Deep Neural Networks 5
Imitation Learning by Reinforcement Learning 4
Imitation Learning from Observations under Transition Model Disparity 2
Implicit Bias of Adversarial Training for Deep Neural Networks 3
Implicit Bias of MSE Gradient Optimization in Underparameterized Neural Networks 0
Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown Codimension 5
Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100 5
Improving Federated Learning Face Recognition via Privacy-Agnostic Clusters 4
Improving Mutual Information Estimation with Annealed and Energy-Based Bounds 4
Improving Non-Autoregressive Translation Models Without Distillation 6
Improving the Accuracy of Learning Example Weights for Imbalance Classification 6
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications 5
Increasing the Cost of Model Extraction with Calibrated Proof of Work 5
Incremental False Negative Detection for Contrastive Learning 6
Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking 5
Inductive Relation Prediction Using Analogy Subgraph Embeddings 5
InfinityGAN: Towards Infinite-Pixel Image Synthesis 7
Information Bottleneck: Exact Analysis of (Quantized) Neural Networks 5
Information Gain Propagation: a New Way to Graph Active Learning with Soft Labels 7
Information Prioritization through Empowerment in Visual Model-based RL 4
Information-theoretic Online Memory Selection for Continual Learning 4
IntSGD: Adaptive Floatless Compression of Stochastic Gradients 6
Interacting Contour Stochastic Gradient Langevin Dynamics 4
Interpretable Unsupervised Diversity Denoising and Artefact Removal 4
Invariant Causal Representation Learning for Out-of-Distribution Generalization 4
Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies 5
Is Fairness Only Metric Deep? Evaluating and Addressing Subgroup Gaps in Deep Metric Learning 4
Is High Variance Unavoidable in RL? A Case Study in Continuous Control 5
Is Homophily a Necessity for Graph Neural Networks? 5
Is Importance Weighting Incompatible with Interpolating Classifiers? 4
It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation 5
It Takes Two to Tango: Mixup for Deep Metric Learning 3
Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming 5
Iterative Refinement Graph Neural Network for Antibody Sequence-Structure Co-design 5
Joint Shapley values: a measure of joint feature importance 3
KL Guided Domain Adaptation 5
Know Thyself: Transferable Visual Control Policies Through Robot-Awareness 4
Know Your Action Set: Learning Action Relations for Reinforcement Learning 7
Knowledge Infused Decoding 5
Knowledge Removal in Sampling-based Bayesian Inference 4
L0-Sparse Canonical Correlation Analysis 5
LEARNING GUARANTEES FOR GRAPH CONVOLUTIONAL NETWORKS ON THE STOCHASTIC BLOCK MODEL 2
LFPT5: A Unified Framework for Lifelong Few-shot Language Learning Based on Prompt Tuning of T5 4
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning 1
LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential Equations 7
LOSSY COMPRESSION WITH DISTRIBUTION SHIFT AS ENTROPY CONSTRAINED OPTIMAL TRANSPORT 2
Label Encoding for Regression Networks 6
Label Leakage and Protection in Two-party Split Learning 5
Label-Efficient Semantic Segmentation with Diffusion Models 4
Language model compression with weighted low-rank factorization 3
Language modeling via stochastic processes 4
Language-biased image classification: evaluation based on semantic representations 2
Language-driven Semantic Segmentation 5
Large Language Models Can Be Strong Differentially Private Learners 7
Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect 1
Large-Scale Representation Learning on Graphs via Bootstrapping 5
Latent Image Animator: Learning to Animate Images via Latent Space Navigation 4
Latent Variable Sequential Set Transformers for Joint Multi-Agent Motion Prediction 4
Learn Locally, Correct Globally: A Distributed Algorithm for Training Graph Neural Networks 4
Learnability Lock: Authorized Learnability Control Through Adversarial Invertible Transformations 3
Learnability of convolutional neural networks for infinite dimensional input via mixed and anisotropic smoothness 1
Learned Simulators for Turbulence 4
Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations 4
Learning Altruistic Behaviours in Reinforcement Learning without External Rewards 3
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction 5
Learning Causal Models from Conditional Moment Restrictions by Importance Weighting 2
Learning Continuous Environment Fields via Implicit Functions 4
Learning Curves for Gaussian Process Regression with Power-Law Priors and Targets 0
Learning Curves for SGD on Structured Features 3
Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies 5
Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning View 3
Learning Distributionally Robust Models at Scale via Composite Optimization 5
Learning Efficient Image Super-Resolution Networks via Structure-Regularized Pruning 4
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees 4
Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality 2
Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System 5
Learning Features with Parameter-Free Layers 5
Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities 4
Learning Graphon Mean Field Games and Approximate Nash Equilibria 4
Learning Hierarchical Structures with Differentiable Nondeterministic Stacks 5
Learning Long-Term Reward Redistribution via Randomized Return Decomposition 4
Learning Multimodal VAEs through Mutual Supervision 3
Learning Neural Contextual Bandits through Perturbed Rewards 4
Learning Object-Oriented Dynamics for Planning from Text 6
Learning Optimal Conformal Classifiers 6
Learning Prototype-oriented Set Representations for Meta-Learning 5
Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, And No Retraining 4
Learning Representation from Neural Fisher Kernel with Low-rank Approximation 3
Learning Scenario Representation for Solving Two-stage Stochastic Integer Programs 4
Learning State Representations via Retracing in Reinforcement Learning 4
Learning Strides in Convolutional Neural Networks 7
Learning Super-Features for Image Retrieval 4
Learning Synthetic Environments and Reward Networks for Reinforcement Learning 5
Learning Temporally Causal Latent Processes from General Temporal Data 6
Learning Towards The Largest Margins 4
Learning Transferable Reward for Query Object Localization with Policy Adaptation 6
Learning Value Functions from Undirected State-only Experience 3
Learning Versatile Neural Architectures by Propagating Network Codes 6
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers 2
Learning Weakly-supervised Contrastive Representations 6
Learning a subspace of policies for online adaptation in Reinforcement Learning 4
Learning by Directional Gradient Descent 3
Learning curves for continual learning in neural networks: Self-knowledge transfer and forgetting 2
Learning meta-features for AutoML 6
Learning more skills through optimistic exploration 5
Learning the Dynamics of Physical Systems from Sparse Observations with Finite Element Networks 6
Learning to Annotate Part Segmentation with Gradient Matching 5
Learning to Complete Code with Sketches 5
Learning to Dequantise with Truncated Flows 4
Learning to Downsample for Segmentation of Ultra-High Resolution Images 5
Learning to Extend Molecular Scaffolds with Structural Motifs 7
Learning to Generalize across Domains on Single Test Samples 6
Learning to Guide and to be Guided in the Architect-Builder Problem 5
Learning to Map for Active Semantic Goal Navigation 6
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic Forecasting 4
Learning to Schedule Learning rate with Graph Neural Networks 6
Learning transferable motor skills with hierarchical latent mixture policies 2
Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations 5
Learning-Augmented $k$-means Clustering 4
Leveraging Automated Unit Tests for Unsupervised Code Translation 5
Leveraging unlabeled data to predict out-of-distribution performance 4
Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs Theory 5
Linking Emergent and Natural Languages via Corpus Transfer 5
Lipschitz-constrained Unsupervised Skill Discovery 4
LoRA: Low-Rank Adaptation of Large Language Models 4
Local Feature Swapping for Generalization in Reinforcement Learning 5
Long Expressive Memory for Sequence Modeling 5
Looking Back on Learned Experiences For Class/task Incremental Learning 5
Lossless Compression with Probabilistic Circuits 5
Low-Budget Active Learning via Wasserstein Distance: An Integer Programming Approach 4
MAML is a Noisy Contrastive Learner in Classification 6
MCMC Should Mix: Learning Energy-Based Model with Neural Transport Latent Space MCMC 3
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling 3
MT3: Multi-Task Multitrack Music Transcription 5
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without Retraining 6
Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax Optimality 1
Map Induction: Compositional spatial submap learning for efficient exploration in novel environments 3
Mapping Language Models to Grounded Conceptual Spaces 2
Mapping conditional distributions for domain adaptation under generalized target shift 6
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning 4
Maximizing Ensemble Diversity in Deep Reinforcement Learning 4
Maximum Entropy RL (Provably) Solves Some Robust RL Problems 2
Maximum n-times Coverage for Vaccine Design 5
Measuring CLEVRness: Black-box Testing of Visual Reasoning Models 5
Measuring the Interpretability of Unsupervised Representations via Quantized Reversed Probing 5
Memorizing Transformers 3
Memory Augmented Optimizers for Deep Learning 7
Memory Replay with Data Compression for Continual Learning 5
Mention Memory: incorporating textual knowledge into Transformers through entity mention attention 5
Message Passing Neural PDE Solvers 5
Meta Discovery: Learning to Discover Novel Classes given Very Limited Data 6
Meta Learning Low Rank Covariance Factors for Energy Based Deterministic Uncertainty 4
Meta-Imitation Learning by Watching Video Demonstrations 2
Meta-Learning with Fewer Tasks through Task Interpolation 5
MetaMorph: Learning Universal Controllers with Transformers 5
MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts 5
Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial Networks 4
Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond 2
Minimax Optimality (Probably) Doesn't Imply Distribution Learning for GANs 2
Minimax Optimization with Smooth Algorithmic Adversaries 5
Mirror Descent Policy Optimization 3
Missingness Bias in Model Debugging 5
MoReL: Multi-omics Relational Learning 5
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer 6
Model Agnostic Interpretability for Multiple Instance Learning 6
Model Zoo: A Growing Brain That Learns Continually 5
Model-Based Offline Meta-Reinforcement Learning with Regularization 4
Model-augmented Prioritized Experience Replay 4
Modeling Label Space Interactions in Multi-label Classification using Box Embeddings 5
Modular Lifelong Reinforcement Learning via Neural Composition 5
MonoDistill: Learning Spatial Features for Monocular 3D Object Detection 5
Monotonic Differentiable Sorting Networks 3
Multi-Agent MDP Homomorphic Networks 3
Multi-Critic Actor Learning: Teaching RL Policies to Act with Style 3
Multi-Mode Deep Matrix and Tensor Factorization 4
Multi-Stage Episodic Control for Strategic Exploration in Text Games 4
Multi-Task Processes 4
Multi-objective Optimization by Learning Space Partition 3
Multimeasurement Generative Models 6
Multiset-Equivariant Set Prediction with Approximate Implicit Differentiation 5
Multitask Prompted Training Enables Zero-Shot Task Generalization 5
NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy 4
NASI: Label- and Data-agnostic Neural Architecture Search at Initialization 5
NASPY: Automated Extraction of Automated Machine Learning Models 7
NASViT: Neural Architecture Search for Efficient Vision Transformers with Gradient Conflict aware Supernet Training 6
NETWORK INSENSITIVITY TO PARAMETER NOISE VIA PARAMETER ATTACK DURING TRAINING 4
NODE-GAM: Neural Generalized Additive Model for Interpretable Deep Learning 6
Natural Language Descriptions of Deep Visual Features 5
Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions 5
Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver 1
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism 2
Network Augmentation for Tiny Deep Learning 4
NeuPL: Neural Population Learning 5
Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path 2
Neural Contextual Bandits with Deep Representation and Shallow Exploration 4
Neural Deep Equilibrium Solvers 6
Neural Link Prediction with Walk Pooling 4
Neural Markov Controlled SDE: Stochastic Optimization for Continuous-Time Data 4
Neural Methods for Logical Reasoning over Knowledge Graphs 5
Neural Models for Output-Space Invariance in Combinatorial Problems 4
Neural Network Approximation based on Hausdorff distance of Tropical Zonotopes 2
Neural Networks as Kernel Learners: The Silent Alignment Effect 2
Neural Parameter Allocation Search 5
Neural Processes with Stochastic Attention: Paying more attention to the context dataset 6
Neural Program Synthesis with Query 3
Neural Relational Inference with Node-Specific Information 4
Neural Solvers for Fast and Accurate Numerical Optimal Control 4
Neural Spectral Marked Point Processes 5
Neural Stochastic Dual Dynamic Programming 3
Neural Structured Prediction for Inductive Node Classification 6
Neural Variational Dropout Processes 3
Neural graphical modelling in continuous-time: consistency guarantees and algorithms 3
New Insights on Reducing Abrupt Representation Change in Online Continual Learning 5
No One Representation to Rule Them All: Overlapping Features of Training Methods 2
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models 6
Node Feature Extraction by Self-Supervised Multi-scale Neighborhood Prediction 5
NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs 6
Noisy Feature Mixup 3
Non-Linear Operator Approximations for Initial Value Problems 4
Non-Parallel Text Style Transfer with Self-Parallel Supervision 6
Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability Authorization 6
Nonlinear ICA Using Volume-Preserving Transformations 1
Normalization of Language Embeddings for Cross-Lingual Alignment 6
OBJECT DYNAMICS DISTILLATION FOR SCENE DECOMPOSITION AND REPRESENTATION 5
Object Pursuit: Building a Space of Objects via Discriminative Weight Generation 5
Objects in Semantic Topology 3
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization 4
Offline Reinforcement Learning with Implicit Q-Learning 4
Offline Reinforcement Learning with Value-based Episodic Memory 4
Omni-Dimensional Dynamic Convolution 5
Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification 4
On Bridging Generic and Personalized Federated Learning for Image Classification 6
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning 3
On Distributed Adaptive Optimization with Gradient Compression 4
On Evaluation Metrics for Graph Generative Models 5
On Improving Adversarial Transferability of Vision Transformers 6
On Incorporating Inductive Biases into VAEs 5
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning 2
On Non-Random Missing Labels in Semi-Supervised Learning 3
On Predicting Generalization using GANs 3
On Redundancy and Diversity in Cell-based Neural Architecture Search 5
On Robust Prefix-Tuning for Text Classification 5
On feature learning in neural networks with global convergence guarantees 2
On the Certified Robustness for Ensemble Models and Beyond 5
On the Connection between Local Attention and Dynamic Depth-wise Convolution 4
On the Convergence of Certified Robust Training with Interval Bound Propagation 3
On the Convergence of mSGD and AdaGrad for Stochastic Optimization 0
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning 4
On the Existence of Universal Lottery Tickets 4
On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications 4
On the Importance of Difficulty Calibration in Membership Inference Attacks 4
On the Importance of Firth Bias Reduction in Few-Shot Classification 5
On the Learning and Learnability of Quasimetrics 5
On the Limitations of Multimodal VAEs 4
On the Optimal Memorization Power of ReLU Neural Networks 0
On the Pitfalls of Analyzing Individual Neurons in Language Models 5
On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks 5
On the Role of Neural Collapse in Transfer Learning 3
On the Uncomputability of Partition Functions in Energy-Based Sequence Models 0
On the approximation properties of recurrent encoder-decoder architectures 2
On the benefits of maximum likelihood estimation for Regression and Forecasting 5
On the relation between statistical learning and perceptual distances 3
On the role of population heterogeneity in emergent communication 3
On-Policy Model Errors in Reinforcement Learning 4
One After Another: Learning Incremental Skills for a Changing World 6
Online Ad Hoc Teamwork under Partial Observability 2
Online Adversarial Attacks 4
Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference 4
Online Coreset Selection for Rehearsal-based Continual Learning 4
Online Facility Location with Predictions 5
Online Hyperparameter Meta-Learning with Hypergradient Distillation 6
Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs 3
OntoProtein: Protein Pretraining With Gene Ontology Embedding 5
Open-Set Recognition: A Good Closed-Set Classifier is All You Need 5
Open-World Semi-Supervised Learning 6
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation 4
Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks 4
Optimal Representations for Covariate Shift 5
Optimal Transport for Causal Discovery 4
Optimal Transport for Long-Tailed Recognition with Learnable Cost Matrix 5
Optimization and Adaptive Generalization of Three layer Neural Networks 1
Optimization inspired Multi-Branch Equilibrium Models 4
Optimizer Amalgamation 5
Optimizing Neural Networks with Gradient Lexicase Selection 4
Orchestrated Value Mapping for Reinforcement Learning 4
Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious Correlations 5
Overcoming The Spectral Bias of Neural Value Approximation 4
P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts 4
PAC Prediction Sets Under Covariate Shift 5
PAC-Bayes Information Bottleneck 5
PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction Learning 4
PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method 3
PF-GNN: Differentiable particle filtering based approximation of universal graph representations 5
PI3NN: Out-of-distribution-aware Prediction Intervals from Three Neural Networks 4
POETREE: Interpretable Policy Learning with Adaptive Decision Trees 6
PSA-GAN: Progressive Self Attention GANs for Synthetic Time Series 4
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences 6
Pareto Policy Adaptation 4
Pareto Policy Pool for Model-based Offline Reinforcement Learning 5
Pareto Set Learning for Neural Multi-Objective Combinatorial Optimization 4
Partial Wasserstein Adversarial Network for Non-rigid Point Set Registration 4
Particle Stochastic Dual Coordinate Ascent: Exponential convergent algorithm for mean field neural network optimization 4
Patch-Fool: Are Vision Transformers Always Robust Against Adversarial Perturbations? 4
Path Auxiliary Proposal for MCMC in Discrete Space 4
Path Integral Sampler: A Stochastic Control Approach For Sampling 5
Peek-a-Boo: What (More) is Disguised in a Randomly Weighted Neural Network, and How to Find It Efficiently 4
Perceiver IO: A General Architecture for Structured Inputs & Outputs 4
Permutation Compressors for Provably Faster Distributed Nonconvex Optimization 5
Permutation-Based SGD: Is Random Optimal? 3
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning 5
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage 1
Phase Collapse in Neural Networks 5
Phenomenology of Double Descent in Finite-Width Neural Networks 3
PiCO: Contrastive Label Disambiguation for Partial Label Learning 6
PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication 5
Pix2seq: A Language Modeling Framework for Object Detection 5
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models 6
Planning in Stochastic Environments with a Learned Model 4
Plant 'n' Seek: Can You Find the Winning Ticket? 4
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences 5
Poisoning and Backdooring Contrastive Learning 4
Policy Gradients Incorporating the Future 3
Policy Smoothing for Provably Robust Reinforcement Learning 6
Policy improvement by planning with Gumbel 5
PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions 5
Possibility Before Utility: Learning And Using Hierarchical Affordances 3
Post hoc Explanations may be Ineffective for Detecting Unknown Spurious Correlation 2
Post-Training Detection of Backdoor Attacks for Two-Class and Multi-Attack Scenarios 5
Practical Conditional Neural Process Via Tractable Dependent Predictions 5
Practical Integration via Separable Bijective Networks 3
Pre-training Molecular Graph Representation with 3D Geometry 4
Predicting Physics in Mesh-reduced Space with Temporal Attention 2
Pretrained Language Model in Continual Learning: A Comparative Study 5
Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators 5
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior 5
Privacy Implications of Shuffling 3
Probabilistic Implicit Scene Completion 4
Procedural generalization by planning with self-supervised world models 3
Programmatic Reinforcement Learning without Oracles 4
Progressive Distillation for Fast Sampling of Diffusion Models 5
Promoting Saliency From Depth: Deep Unsupervised RGB-D Saliency Detection 4
Proof Artifact Co-Training for Theorem Proving with Language Models 4
Properties from mechanisms: an equivariance perspective on identifiable representation learning 1
Prospect Pruning: Finding Trainable Weights at Initialization using Meta-Gradients 6
ProtoRes: Proto-Residual Network for Pose Authoring via Learned Inverse Kinematics 6
Prototype memory and attention mechanisms for few shot image generation 5
Prototypical Contrastive Predictive Coding 4
Provable Adaptation across Multiway Domains via Representation Learning 2
Provable Learning-based Algorithm For Sparse Recovery 5
Provably Filtering Exogenous Distractors using Multistep Inverse Dynamics 2
Provably Robust Adversarial Examples 6
Provably convergent quasistatic dynamics for mean-field two-player zero-sum games 2
Proving the Lottery Ticket Hypothesis for Convolutional Neural Networks 4
Pseudo Numerical Methods for Diffusion Models on Manifolds 5
Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization 5
Pyraformer: Low-Complexity Pyramidal Attention for Long-Range Time Series Modeling and Forecasting 6
QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization 5
QUERY EFFICIENT DECISION BASED SPARSE ATTACKS AGAINST BLACK-BOX DEEP LEARNING MODELS 4
Quadtree Attention for Vision Transformers 4
Quantitative Performance Assessment of CNN Units via Topological Entropy Calculation 3
Query Embedding on Hyper-Relational Knowledge Graphs 5
R4D: Utilizing Reference Objects for Long-Range Distance Estimation 4
R5: Rule Discovery with Reinforced and Recurrent Relational Reasoning 4
RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation 2
Random matrices in service of ML footprint: ternary random features with no performance loss 4
Real-Time Neural Voice Camouflage 5
Recursive Disentanglement Network 2
Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank? 5
Reducing Excessive Margin to Achieve a Better Accuracy vs. Robustness Trade-off 5
RegionViT: Regional-to-Local Attention for Vision Transformers 4
Regularized Autoencoders for Isometric Representation Learning 6
Reinforcement Learning in Presence of Discrete Markovian Context Evolution 3
Reinforcement Learning under a Multi-agent Predictive State Representation Model: Method and Theory 4
Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration 4
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning 4
Relating transformers to models and neural representations of the hippocampal formation 1
Relational Learning with Variational Bayes 5
Relational Multi-Task Learning: Modeling Relations between Data and Tasks 6
Relational Surrogate Loss Learning 6
RelaxLoss: Defending Membership Inference Attacks without Losing Utility 6
Reliable Adversarial Distillation with Unreliable Teachers 4
Representation Learning for Online and Offline RL in Low-rank MDPs 1
Representation-Agnostic Shape Fields 5
Representational Continuity for Unsupervised Continual Learning 3
Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings 5
Resolving Training Biases via Influence-based Data Relabeling 6
Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum 2
Responsible Disclosure of Generative Models Using Scalable Fingerprinting 4
Rethinking Adversarial Transferability from a Data Distribution Perspective 5
Rethinking Class-Prior Estimation for Positive-Unlabeled Learning 5
Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL 5
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework 4
Rethinking Supervised Pre-Training for Better Downstream Transferring 3
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph 3
Reverse Engineering of Imperceptible Adversarial Image Perturbations 5
Reversible Instance Normalization for Accurate Time-Series Forecasting against Distribution Shift 7
Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions 5
Revisiting Design Choices in Offline Model Based Reinforcement Learning 3
Revisiting Over-smoothing in BERT from the Perspective of Graph 4
Revisiting flow generative models for Out-of-distribution detection 4
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning 3
Robbing the Fed: Directly Obtaining Private Data in Federated Learning with Modified Models 3
Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness? 5
Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial Learning 5
Robust and Scalable SDE Learning: A Functional Perspective 5
RotoGrad: Gradient Homogenization in Multitask Learning 6
RvS: What is Essential for Offline RL via Supervised Learning? 5
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations 6
SGD Can Converge to Local Maxima 1
SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models 6
SOSP: Efficiently Capturing Global Correlations by Second-Order Structured Pruning 3
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training 5
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation 5
SUMNAS: Supernet with Unbiased Meta-Features for Neural Architecture Search 4
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning 5
Safe Neurosymbolic Learning with Differentiable Symbolic Execution 4
Salient ImageNet: How to discover spurious features in Deep Learning? 3
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation 5
Sample Efficient Stochastic Policy Extragradient Algorithm for Zero-Sum Markov Game 1
Sample Selection with Uncertainty of Losses for Learning with Noisy Labels 5
Sample and Computation Redistribution for Efficient Face Detection 6
Sampling with Mirrored Stein Operators 6
Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation 7
Scalable Sampling for Nonsymmetric Determinantal Point Processes 5
Scale Efficiently: Insights from Pretraining and Finetuning Transformers 5
Scale Mixtures of Neural Network Gaussian Processes 5
Scaling Laws for Neural Machine Translation 2
Scarf: Self-Supervised Contrastive Learning using Random Feature Corruption 5
Scattering Networks on the Sphere for Scalable and Rotationally Equivariant Spherical CNNs 2
Scene Transformer: A unified architecture for predicting future trajectories of multiple agents 5
Score-Based Generative Modeling with Critically-Damped Langevin Diffusion 4
Selective Ensembles for Consistent Predictions 6
Self-Joint Supervised Learning 6
Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis 5
Self-Supervised Inference in State-Space Models 4
Self-Supervision Enhanced Feature Selection with Correlated Gates 6
Self-ensemble Adversarial Training for Improved Robustness 6
Self-supervised Learning is More Robust to Dataset Imbalance 5
Semi-relaxed Gromov-Wasserstein divergence and applications on graphs 6
Sequence Approximation using Feedforward Spiking Neural Network for Spatiotemporal Learning: Theory and Optimization Methods 3
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning 4
Shallow and Deep Networks are Near-Optimal Approximators of Korobov Functions 0
Should I Run Offline Reinforcement Learning or Behavioral Cloning? 3
Should We Be Pre-training? An Argument for End-task Aware Training as an Alternative 5
Shuffle Private Stochastic Convex Optimization 1
Signing the Supermask: Keep, Hide, Invert 3
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision 4
Simple GNN Regularisation for 3D Molecular Property Prediction and Beyond 6
SketchODE: Learning neural sketch representation in continuous time 2
Skill-based Meta-Reinforcement Learning 4
Solving Inverse Problems in Medical Imaging with Score-Based Generative Models 4
Sound Adversarial Audio-Visual Navigation 4
Sound and Complete Neural Network Repair with Minimality and Locality Guarantees 5
Source-Free Adaptation to Measurement Shift via Bottom-Up Feature Restoration 5
Space-Time Graph Neural Networks 2
Spanning Tree-based Graph Generation for Molecules 5
Sparse Attention with Learning to Hash 6
Sparse Communication via Mixed Distributions 6
Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity 5
Sparsity Winning Twice: Better Robust Generalization from More Efficient Training 5
Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery 4
SphereFace2: Binary Classification is All You Need for Deep Face Recognition 4
Spherical Message Passing for 3D Molecular Graphs 5
Spike-inspired rank coding for fast and accurate recurrent neural networks 4
Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute Estimation 6
Sqrt(d) Dimension Dependence of Langevin Monte Carlo 1
Stability Regularization for Discrete Representation Learning 4
Steerable Partial Differential Operators for Equivariant Neural Networks 5
Stein Latent Optimization for Generative Adversarial Networks 6
Step-unrolled Denoising Autoencoders for Text Generation 5
Stiffness-aware neural network for learning Hamiltonian systems 1
Stochastic Training is Not Necessary for Generalization 6
Strength of Minibatch Noise in SGD 2
Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning 2
StyleAlign: Analysis and Applications of Aligned StyleGAN Models 4
StyleNeRF: A Style-based 3D Aware Generator for High-resolution Image Synthesis 4
Subspace Regularizers for Few-Shot Class Incremental Learning 5
Superclass-Conditional Gaussian Mixture Model For Learning Fine-Grained Embeddings 6
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm 6
Surreal-GAN:Semi-Supervised Representation Learning via GAN for uncovering heterogeneous disease-related imaging patterns 4
Surrogate Gap Minimization Improves Sharpness-Aware Training 4
Surrogate NAS Benchmarks: Going Beyond the Limited Search Spaces of Tabular NAS Benchmarks 5
Switch to Generalize: Domain-Switch Learning for Cross-Domain Few-Shot Classification 3
Symbolic Learning to Optimize: Towards Interpretability and Scalability 5
Synchromesh: Reliable Code Generation from Pre-trained Language Models 4
T-WaveNet: A Tree-Structured Wavelet Neural Network for Time Series Signal Analysis 2
TAMP-S2GCNets: Coupling Time-Aware Multipersistence Knowledge Representation with Spatio-Supra Graph Convolutional Networks for Time-Series Forecasting 5
TAPEX: Table Pre-training via Learning a Neural SQL Executor 5
TAda! Temporally-Adaptive Convolutions for Video Understanding 3
THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling 3
TPU-GAN: Learning temporal coherence from dynamic point cloud sequences 5
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data 2
TRGP: Trust Region Gradient Projection for Continual Learning 4
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs 5
Taming Sparsely Activated Transformer with Stochastic Experts 5
Target-Side Input Augmentation for Sequence to Sequence Generation 4
Task Affinity with Maximum Bipartite Matching in Few-Shot Learning 4
Task Relatedness-Based Generalization Bounds for Meta Learning 0
Task-Induced Representation Learning 3
Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification 5
Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting 5
The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models 3
The Close Relationship Between Contrastive Learning and Meta-Learning 5
The Convex Geometry of Backpropagation: Neural Network Gradient Flows Converge to Extreme Points of the Dual Convex Program 0
The Effects of Invertibility on the Representational Complexity of Encoders in Variational Autoencoders 0
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models 1
The Efficiency Misnomer 3
The Evolution of Uncertainty of Learning in Games 1
The Geometry of Memoryless Stochastic Policy Optimization in Infinite-Horizon POMDPs 4
The Hidden Convex Optimization Landscape of Regularized Two-Layer ReLU Networks: an Exact Characterization of Optimal Solutions 0
The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design 2
The Information Geometry of Unsupervised Reinforcement Learning 1
The MultiBERTs: BERT Reproductions for Robustness Analysis 7
The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization 4
The Rich Get Richer: Disparate Impact of Semi-Supervised Learning 3
The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks 5
The Role of Pretrained Representations for the OOD Generalization of RL Agents 4
The Spectral Bias of Polynomial Neural Networks 1
The Three Stages of Learning Dynamics in High-dimensional Kernel Methods 1
The Uncanny Similarity of Recurrence and Depth 4
The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training 3
Tighter Sparse Approximation Bounds for ReLU Neural Networks 0
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind 3
Top-N: Equivariant Set and Graph Generation without Exchangeability 4
Top-label calibration and multiclass-to-binary reductions 5
Topological Experience Replay 4
Topological Graph Neural Networks 5
Topologically Regularized Data Embeddings 5
Toward Efficient Low-Precision Training: Data Format Optimization and Hysteresis Quantization 4
Toward Faithful Case-based Reasoning through Learning Prototypes in a Nearest Neighbor-friendly Space. 3
Towards Better Understanding and Better Generalization of Low-shot Classification in Histology Images with Contrastive Learning 3
Towards Building A Group-based Unsupervised Representation Disentanglement Framework 2
Towards Continual Knowledge Learning of Language Models 5
Towards Deepening Graph Neural Networks: A GNTK-based Optimization Perspective 4
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality 1
Towards Empirical Sandwich Bounds on the Rate-Distortion Function 6
Towards Evaluating the Robustness of Neural Networks Learned by Transduction 6
Towards General Function Approximation in Zero-Sum Markov Games 1
Towards Model Agnostic Federated Learning Using Knowledge Distillation 2
Towards Training Billion Parameter Graph Neural Networks for Atomic Simulations 3
Towards Understanding Generalization via Decomposing Excess Risk Dynamics 2
Towards Understanding the Data Dependency of Mixup-style Training 3
Towards Understanding the Robustness Against Evasion Attack on Categorical Data 5
Towards a Unified View of Parameter-Efficient Transfer Learning 4
Tracking the risk of a deployed model and detecting harmful distribution shifts 5
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation 5
Training Data Generating Networks: Shape Reconstruction via Bi-level Optimization 4
Training Structured Neural Networks Through Manifold Identification and Variance Reduction 6
Training Transition Policies via Distribution Matching for Complex Tasks 4
Training invariances and the low-rank phenomenon: beyond linear networks 0
Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations 5
Transfer RL across Observation Feature Spaces via Model-Based Regularization 4
Transferable Adversarial Attack based on Integrated Gradients 4
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design 4
Transformer Embeddings of Irregularly Spaced Events and Their Participants 6
Transformer-based Transform Coding 4
Transformers Can Do Bayesian Inference 6
Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models 1
Triangle and Four Cycle Counting with Predictions in Graph Streams 5
Trigger Hunting with a Topological Prior for Trojan Detection 4
Trivial or Impossible --- dichotomous data difficulty masks model differences (on ImageNet and beyond) 5
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning 4
Tuformer: Data-driven Design of Transformers for Improved Generalization or Efficiency 4
Uncertainty Modeling for Out-of-Distribution Generalization 5
Understanding Dimensional Collapse in Contrastive Self-supervised Learning 4
Understanding Domain Randomization for Sim-to-real Transfer 1
Understanding Intrinsic Robustness Using Label Uncertainty 6
Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability Perspective 5
Understanding and Improving Graph Injection Attack by Promoting Unnoticeability 7
Understanding and Leveraging Overparameterization in Recursive Value Estimation 2
Understanding and Preventing Capacity Loss in Reinforcement Learning 4
Understanding approximate and unrolled dictionary learning for pattern recovery 5
Understanding over-squashing and bottlenecks on graphs via curvature 5
Understanding the Role of Self Attention for Efficient Speech Recognition 5
Understanding the Variance Collapse of SVGD in High Dimensions 2
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning 4
Unified Visual Transformer Compression 4
Unifying Likelihood-free Inference with Black-box Optimization and Beyond 4
Universal Approximation Under Constraints is Possible with Transformers 1
Universalizing Weak Supervision 4
Unraveling Model-Agnostic Meta-Learning via The Adaptation Learning Rate 3
Unrolling PALM for Sparse Semi-Blind Source Separation 4
Unsupervised Discovery of Object Radiance Fields 5
Unsupervised Disentanglement with Tensor Product Representations on the Torus 4
Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop 4
Unsupervised Semantic Segmentation by Distilling Feature Correspondences 6
Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling 4
Using Graph Representation Learning with Schema Encoders to Measure the Severity of Depressive Symptoms 4
VAE Approximation Error: ELBO and Exponential Families 3
VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated Objects 3
VC dimension of partially quantized neural networks in the overparametrized regime 5
VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning 5
VOS: Learning What You Don't Know by Virtual Outlier Synthesis 7
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning 1
Value Gradient weighted Model-Based Reinforcement Learning 4
Variational Inference for Discriminative Learning with Generative Modeling of Feature Incompletion 6
Variational Neural Cellular Automata 5
Variational Predictive Routing with Nested Subjective Timescales 3
Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias 1
Variational methods for simulation-based inference 5
Variational oracle guiding for reinforcement learning 3
Vector-quantized Image Modeling with Improved VQGAN 4
ViDT: An Efficient and Effective Fully Transformer-based Object Detector 5
ViTGAN: Training GANs with Vision Transformers 6
Vision-Based Manipulators Need to Also See from Their Hands 3
Visual Correspondence Hallucination 5
Visual Representation Learning Does Not Generalize Strongly Within the Same Domain 5
Visual Representation Learning over Latent Domains 4
Visual hyperacuity with moving sensor and recurrent neural computations 2
Vitruvion: A Generative Model of Parametric CAD Sketches 5
W-CTC: a Connectionist Temporal Classification Loss with Wild Cards 3
WeakM3D: Towards Weakly Supervised Monocular 3D Object Detection 5
Weighted Training for Cross-Task Learning 6
What Do We Mean by Generalization in Federated Learning? 4
What Happens after SGD Reaches Zero Loss? --A Mathematical Framework 0
What Makes Better Augmentation Strategies? Augment Difficult but Not too Different 6
What’s Wrong with Deep Learning in Tree Search for Combinatorial Optimization 6
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently? 1
When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations 5
When should agents explore? 5
When, Why, and Which Pretrained GANs Are Useful? 5
Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective 3
Who Is Your Right Mixup Partner in Positive and Unlabeled Learning 4
Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL 5
Why Propagate Alone? Parallel Use of Labels and Features on Graphs 3
Wiring Up Vision: Minimizing Supervised Synaptic Updates Needed to Produce a Primate Ventral Stream 6
Wisdom of Committees: An Overlooked Approach To Faster and More Accurate Models 5
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation 2
X-model: Improving Data Efficiency in Deep Learning with A Minimax Model 4
You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction 2
You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks 5
Zero Pixel Directional Boundary by Vector Transform 3
Zero-CL: Instance and Feature decorrelation for negative-free symmetric contrastive learning 3
Zero-Shot Self-Supervised Learning for MRI Reconstruction 4
ZeroFL: Efficient On-Device Training for Federated Learning with Local Sparsity 4
cosFormer: Rethinking Softmax In Attention 6
iFlood: A Stable and Effective Regularizer 4
iLQR-VAE : control-based learning of input-driven dynamics with applications to neural data 4
miniF2F: a cross-system benchmark for formal Olympiad-level mathematics 4
switch-GLAT: Multilingual Parallel Machine Translation Via Code-Switch Decoder 4