Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et al. (K. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," under review, 2026).

International Conference on Learning Representations (ICLR) - 2025

[Figure: Documentation Rate of Empirical Papers by Reproducibility Variable]

[Figure: Distribution of Empirical Papers by Number of Documented Variables]

Venue: ICLR
Year: 2025
Papers: 3700
Reproducibility Score: 0.66 (based on Gundersen et al. (2025); see Methods for details)
Documentation Score: 4.54 (the average score over the seven reproducibility variables for empirical research papers; see Methods for details)
% Empirical: 97.97% (percentage of papers that are empirical rather than theoretical research)
% Industry: 42.65% (percentage of empirical research papers with at least one author from industry)
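The per-paper rows on this page can be turned into summary statistics with a few lines of Python. The sketch below is illustrative only: it assumes that the integer at the end of each paper row is the number of documented reproducibility variables (0 to 7), and that the Documentation Score is the mean of those counts over empirical papers. The helper names `parse_row` and `documentation_score` are hypothetical, and the page's Methods section remains the authoritative definition.

```python
from collections import Counter


def parse_row(line: str) -> tuple[str, int]:
    """Split a 'Title N' row into (title, count) at the last space."""
    title, _, count = line.rstrip().rpartition(" ")
    return title, int(count)


def documentation_score(counts: list[int]) -> float:
    """Mean number of documented variables per paper (assumed definition)."""
    return sum(counts) / len(counts)


# Four rows copied from the listing on this page.
rows = [
    "A Coefficient Makes SVRG Effective 5",
    "A Generalist Hanabi Agent 3",
    "Attention as a Hypernetwork 7",
    "Associative memory and dead neurons 0",
]

parsed = [parse_row(r) for r in rows]
counts = [count for _, count in parsed]

score = documentation_score(counts)   # mean over this small sample only
distribution = Counter(counts)        # papers per number of documented variables

print(f"sample documentation score: {score:.2f}")
print(sorted(distribution.items()))
```

This sample does not filter out theoretical papers, so it will not reproduce the reported 4.54, which is computed over empirical papers only.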
Reproducibility variables:
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
$F^3Set$: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos 3
$InterLCM$: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration 5
$R^2$-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning 5
$\gamma-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models 4
$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs 5
$\phi$-Update: A Class of Policy Update Methods with Policy Convergence Guarantee 1
$\sigma$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples 6
$\text{D}_{2}\text{O}$: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models 4
$\text{I}^2\text{AM}$: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps 3
$q$-exponential family for policy optimization 4
(Mis)Fitting Scaling Laws: A Survey of Scaling Law Fitting Techniques in Deep Learning 4
3D StreetUnveiler with Semantic-aware 2DGS - a simple baseline 3
3D Vision-Language Gaussian Splatting 3
3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds 3
3D-MolT5: Leveraging Discrete Structural Information for Molecule-Text Modeling 6
3D-Properties: Identifying Challenges in DPO and Charting a Path Forward 5
3D-SPATIAL MULTIMODAL MEMORY 3
3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing 4
3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation 3
3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery 5
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation 3
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting 3
4K4DGen: Panoramic 4D Generation at 4K Resolution 3
6D Object Pose Tracking in Internet Videos for Robotic Manipulation 2
6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering 5
A Benchmark for Semantic Sensitive Information in LLMs Outputs 3
A Black Swan Hypothesis: The Role of Human Irrationality in AI Safety 1
A CLIP-Powered Framework for Robust and Generalizable Data Selection 6
A Causal Lens for Learning Long-term Fair Policies 3
A Closer Look at Machine Unlearning for Large Language Models 5
A Coefficient Makes SVRG Effective 5
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement 5
A Computational Framework for Modeling Emergence of Color Vision in the Human Brain 2
A Conditional Independence Test in the Presence of Discretization 5
A Decade's Battle on Dataset Bias: Are We There Yet? 4
A Deep Generative Learning Approach for Two-stage Adaptive Robust Optimization 5
A Differentiable Rank-Based Objective for Better Feature Learning 5
A Distributional Approach to Uncertainty-Aware Preference Alignment Using Offline Demonstrations 5
A Formal Framework for Understanding Length Generalization in Transformers 5
A General Framework for Off-Policy Learning with Partially-Observed Reward 3
A General Framework for Producing Interpretable Semantic Text Embeddings 5
A Generalist Hanabi Agent 3
A Generic Framework for Conformal Fairness 6
A Geometric Framework for Understanding Memorization in Generative Models 2
A Graph Enhanced Symbolic Discovery Framework For Efficient Logic Optimization 6
A Large-scale Dataset and Benchmark for Commuting Origin-Destination Flow Generation 6
A Large-scale Training Paradigm for Graph Generative Models 6
A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts 4
A Meta-Learning Approach to Bayesian Causal Discovery 3
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules 3
A Multiscale Frequency Domain Causal Framework for Enhanced Pathological Analysis 5
A New Perspective on Shampoo's Preconditioner 2
A Non-Contrastive Learning Framework for Sequential Recommendation with Preference-Preserving Profile Generation 4
A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language 4
A Periodic Bayesian Flow for Material Generation 6
A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence 5
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models 6
A Quantum Circuit-Based Compression Perspective for Parameter-Efficient Learning 3
A Riemannian Framework for Learning Reduced-order Lagrangian Dynamics 3
A Robust Method to Discover Causal or Anticausal Relation 4
A Sanity Check for AI-generated Image Detection 4
A Second-Order Perspective on Model Compositionality and Incremental Learning 5
A Simple Approach to Unifying Diffusion-based Conditional Generation 3
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation 5
A Simple yet Effective $\Delta\Delta G$ Predictor is An Unsupervised Antibody Optimizer and Explainer 7
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals 4
A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery 4
A Solvable Attention for Neural Scaling Laws 1
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation 5
A Statistical Approach for Controlled Training Data Detection 5
A Statistical Framework for Ranking LLM-based Chatbots 5
A Stochastic Approach to the Subset Selection Problem via Mirror Descent 5
A Theoretical Analysis of Self-Supervised Learning for Vision Transformers 1
A Theoretical Framework for Partially-Observed Reward States in RLHF 1
A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops 1
A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules 5
A Theory for Token-Level Harmonization in Retrieval-Augmented Generation 5
A Theory of Initialisation's Impact on Specialisation 4
A Tight Convergence Analysis of Inexact Stochastic Proximal Point Algorithm for Stochastic Composite Optimization Problems 3
A Training-Free Sub-quadratic Cost Transformer Model Serving Framework with Hierarchically Pruned Attention 5
A Transfer Attack to Image Watermarks 6
A Truncated Newton Method for Optimal Transport 5
A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations 4
A Unified Theory of Quantum Neural Network Loss Landscapes 0
A Unifying Framework for Representation Learning 4
A Watermark for Order-Agnostic Language Models 3
A deep inverse-mapping model for a flapping robotic wing 6
A new framework for evaluating model out-of-distribution generalisation for the biochemical domain 6
A transfer learning framework for weak to strong generalization 4
A-Bench: Are LMMs Masters at Evaluating AI-generated Images? 4
A3D: Does Diffusion Dream about 3D Alignment? 4
ACC-Collab: An Actor-Critic Approach to Multi-Agent LLM Collaboration 6
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer 1
ACES: Automatic Cohort Extraction System for Event-Stream Datasets 4
ACTIVE: Offline Reinforcement Learning via Adaptive Imitation and In-sample $V$-Ensemble 5
ADAM Optimization with Adaptive Batch Selection 4
ADAM: An Embodied Causal Agent in Open-World Environments 3
ADAPT: Attentive Self-Distillation and Dual-Decoder Prediction Fusion for Continual Panoptic Segmentation 6
ADBM: Adversarial Diffusion Bridge Model for Reliable Adversarial Purification 6
ADIFF: Explaining audio difference using natural language 5
ADMM for Nonconvex Optimization under Minimal Continuity Assumption 5
ADMM for Structured Fractional Minimization 4
ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning 4
AFlow: Automating Agentic Workflow Generation 5
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation 4
AI Sandbagging: Language Models can Strategically Underperform on Evaluations 4
AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text 4
AI2TALE: An Innovative Information Theory-based Approach for Learning to Localize Phishing Attacks 7
AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements 4
AIR-BENCH 2024: A Safety Benchmark based on Regulation and Policies Specified Risk Categories 2
ALBAR: Adversarial Learning approach to mitigate Biases in Action Recognition 4
ALLaM: Large Language Models for Arabic and English 5
ANaGRAM: A Natural Gradient Relative to Adapted Model for efficient PINNs learning 5
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding 5
API Pack: A Massive Multi-Programming Language Dataset for API Call Generation 5
ARB-LLM: Alternating Refined Binarizations for Large Language Models 6
ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation 4
ASTrA: Adversarial Self-supervised Training with Adaptive-Attacks 7
AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models 5
Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization 4
Accelerated training through iterative gradient propagation along the residual path 4
Accelerating 3D Molecule Generation via Jointly Geometric Optimal Transport 6
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding 5
Accelerating Diffusion Transformers with Token-wise Feature Caching 6
Accelerating Goal-Conditioned Reinforcement Learning Algorithms and Research 5
Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection 4
Accelerating Neural ODEs: A Variational Formulation-based Approach 6
Accelerating Task Generalisation with Multi-Level Skill Hierarchies 4
Accelerating Training with Neuron Interaction and Nowcasting Networks 5
Accelerating neural network training: An analysis of the AlgoPerf competition 5
Accessing Vision Foundation Models via ImageNet-1K 5
Accurate and Scalable Graph Neural Networks via Message Invariance 6
Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization 5
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning 4
Action Sequence Augmentation for Action Anticipation 2
Action abstractions for amortized sampling 4
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints 3
Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes 4
Activation Gradient based Poisoned Sample Detection Against Backdoor Attacks 6
Active Learning for Continual Learning: Keeping the Past Alive in the Present 7
Active Learning for Neural PDE Solvers 6
Active Task Disambiguation with LLMs 5
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs 4
AdaFisher: Adaptive Second Order Optimization via Fisher Information 6
AdaGrad under Anisotropic Smoothness 5
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation 5
AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning 2
AdaRankGrad: Adaptive Gradient Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning 6
AdaWM: Adaptive World Model based Planning for Autonomous Driving 5
Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity 4
Adam-mini: Use Fewer Learning Rates To Gain More 6
Adapt-$\infty$: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection 4
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most? 5
Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards 5
Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning 4
Adaptive Batch Size for Privately Finding Second-Order Stationary Points 1
Adaptive Camera Sensor for Vision Models 5
Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws 6
Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats 6
Adaptive Energy Alignment for Accelerating Test-Time Adaptation 5
Adaptive Gradient Clipping for Robust Federated Learning 3
Adaptive Length Image Tokenization via Recurrent Allocation 5
Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise 3
Adaptive Pruning of Pretrained Transformer via Differential Inclusions 5
Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters 5
Adaptive Retention & Correction: Test-Time Training for Continual Learning 5
Adaptive Shrinkage Estimation for Personalized Deep Kernel Regression in Modeling Brain Trajectories 5
Adaptive Transformer Programs: Bridging the Gap Between Performance and Interpretability in Transformers 3
Adaptive backtracking for faster optimization 4
Adaptive teachers for amortized samplers 4
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models 2
Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models 6
Adding Conditional Control to Diffusion Models with Reinforcement Learning 6
Addressing Label Shift in Distributed Learning via Entropy Regularization 6
Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control 3
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption 4
AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models 2
Advancing Graph Generation through Beta Diffusion 6
Advancing LLM Reasoning Generalists with Preference Trees 2
Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages 4
Advancing Out-of-Distribution Detection via Local Neuroplasticity 6
Advancing Prompt-Based Methods for Replay-Independent General Continual Learning 5
Advantage Alignment Algorithms 5
Advantage-Guided Distillation for Preference Alignment in Small Language Models 5
Adversarial Attacks on Data Attribution 4
Adversarial Generative Flow Network for Solving Vehicle Routing Problems 5
Adversarial Latent Feature Augmentation for Fairness 6
Adversarial Machine Unlearning 7
Adversarial Mixup Unlearning 5
Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI 4
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning 5
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step 6
Adversarial Search Engine Optimization for Large Language Models 0
Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data 3
Adversarial Training for Defense Against Label Poisoning Attacks 6
Adversarially Robust Anomaly Detection through Spurious Negative Pair Mitigation 5
Adversarially Robust Out-of-Distribution Detection Using Lyapunov-Stabilized Embeddings 5
Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness 5
Affine Steerable Equivariant Layer for Canonicalization of Neural Networks 5
Agent S: An Open Agentic Framework that Uses Computers Like a Human 3
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents 3
Agent Skill Acquisition for Large Language Models via CycleQD 7
Agent-Oriented Planning in Multi-Agent Systems 5
Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos 3
AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents 5
AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents 4
AgentRefine: Enhancing Agent Generalization through Refinement Tuning 5
AgentSquare: Automatic LLM Agent Search in Modular Design Space 4
AgentStudio: A Toolkit for Building General Virtual Agents 3
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials 2
Agents' Room: Narrative Generation through Multi-step Collaboration 4
Agree to Disagree: Demystifying Homogeneous Deep Ensembles through Distributional Equivalence 5
Aioli: A Unified Optimization Framework for Language Model Data Mixing 5
Air Quality Prediction with Physics-Guided Dual Neural ODEs in Open Systems 6
Alchemy: Amplifying Theorem-Proving Capability Through Symbolic Mutation 7
Algorithmic Stability Based Generalization Bounds for Adversarial Training 4
Aligned Better, Listen Better for Audio-Visual Large Language Models 5
Aligned Datasets Improve Detection of Latent Diffusion-Generated Images 3
Aligned LLMs Are Not Aligned Browser Agents 4
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception 6
Aligning Human Motion Generation with Human Perceptions 5
Aligning Language Models with Demonstrated Feedback 6
Aligning Visual Contrastive learning models via Preference Optimization 5
Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits 1
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models 5
Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models Trained on Corrupted Data 4
Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series 6
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs 5
An Asynchronous Bundle Method for Distributed Learning Problems 4
An Auditing Test to Detect Behavioral Shift in Language Models 5
An Effective Manifold-based Optimization Method for Distributionally Robust Classification 4
An Effective Theory of Bias Amplification 4
An Efficient Framework for Crediting Data Contributors of Diffusion Models 5
An Empirical Analysis of Uncertainty in Large Language Model Evaluations 5
An Engorgio Prompt Makes Large Language Model Babble on 4
An Evolved Universal Transformer Memory 6
An Exploration with Entropy Constrained 3D Gaussians for 2D Video Compression 5
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels 3
An Information Criterion for Controlled Disentanglement of Multimodal Data 6
An Intelligent Agentic System for Complex Image Restoration Problems 6
An Online Learning Theory of Trading-Volume Maximization 1
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning 3
An Undetectable Watermark for Generative Image Models 7
AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies 3
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods 4
Analytic DAG Constraints for Differentiable DAG Learning 5
Analyzing Neural Scaling Laws in Two-Layer Networks with Power-Law Data Spectra 2
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models 5
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents 4
AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction 3
Animate Your Thoughts: Reconstruction of Dynamic Natural Vision from Human Brain Activity 6
Animate-X: Universal Character Image Animation with Enhanced Motion Representation 4
AnoLLM: Large Language Models for Tabular Anomaly Detection 4
Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions 4
Anti-Exposure Bias in Diffusion Models 5
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning 5
AnyTouch: Learning Unified Static-Dynamic Representation across Multiple Visuo-tactile Sensors 5
Anyprefer: An Agentic Framework for Preference Data Synthesis 4
Apollo-MILP: An Alternating Prediction-Correction Neural Solving Framework for Mixed-Integer Linear Programming 5
Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding 4
Approximating Full Conformal Prediction for Neural Network Regression with Gauss-Newton Influence 5
Approximation algorithms for combinatorial optimization with predictions 4
Are Large Vision Language Models Good Game Players? 4
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data? 5
Aria-MIDI: A Dataset of Piano MIDI Files for Symbolic Music Modeling 4
Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count 4
Arithmetic Without Algorithms: Language Models Solve Math with a Bag of Heuristics 4
Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model 5
Artificial Kuramoto Oscillatory Neurons 4
As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative Feedback Loss 6
Ask, and it shall be given: On the Turing completeness of prompting 2
AssembleFlow: Rigid Flow Matching with Inertial Frames for Molecular Assembly 4
Associative memory and dead neurons 0
AstroCompress: A benchmark dataset for multi-purpose compression of astronomical data 6
Asymmetric Factorized Bilinear Operation for Vision Transformer 5
Asymptotic Analysis of Two-Layer Neural Networks after One Gradient Step under Gaussian Mixtures Data with Structure 3
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis 4
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models 5
Atlas Gaussians Diffusion for 3D Generation 5
AtomSurf: Surface Representation for Learning on Protein Structures 5
Atomas: Hierarchical Adaptive Alignment on Molecule-Text for Unified Molecule Understanding and Generation 6
Attention as a Hypernetwork 7
Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers 5
Attention layers provably solve single-location regression 3
Attention with Markov: A Curious Case of Single-layer Transformers 2
AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution 4
Attribute-based Visual Reprogramming for Vision-Language Models 6
Attributing Culture-Conditioned Generations to Pretraining Corpora 4
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators 3
AugKD: Ingenious Augmentations Empower Knowledge Distillation for Image Super-Resolution 5
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark 4
Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval-Augmented Generation 6
AutoBencher: Towards Declarative Benchmark Construction 6
AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations 5
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs 7
AutoG: Towards automatic graph construction from tabular data 2
AutoUAD: Hyper-parameter Optimization for Unsupervised Anomaly Detection 5
Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models 3
Automated Design of Agentic Systems 5
Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models 7
Automated Proof Generation for Rust Code via Self-Evolution 3
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning 6
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks 6
Autoregressive Pretraining with Mamba in Vision 5
Autoregressive Video Generation without Vector Quantization 4
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation 5
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners 5
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games 3
BAMDP Shaping: a Unified Framework for Intrinsic Motivation and Reward Shaping 3
BANGS: Game-theoretic Node Selection for Graph Self-Training 7
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts 5
BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models 5
BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation 4
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL 5
BOND: Aligning LLMs with Best-of-N Distillation 3
BP-Modified Local Loss for Efficient Training of Deep Neural Networks 4
BRAID: Input-driven Nonlinear Dynamical Modeling of Neural-Behavioral Data 4
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval 5
BTBS-LNS: Binarized-Tightening, Branch and Search on Learning LNS Policies for MIP 6
BaB-ND: Long-Horizon Motion Planning with Branch-and-Bound and Neural Dynamics 2
Backdooring Vision-Language Models with Out-Of-Distribution Data 3
Backtracking Improves Generation Safety 4
Bad-PFL: Exploiting Backdoor Attacks against Personalized Federated Learning 6
BadJudge: Backdoor Vulnerabilities of LLM-As-A-Judge 6
BadRobot: Jailbreaking Embodied LLM Agents in the Physical World 6
Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations 7
Balanced Ranking with Relative Centrality: A multi-core periphery perspective 4
Balancing Act: Diversity and Consistency in Large Language Model Ensembles 6
Balancing Bias in Two-sided Markets for Fair Stable Matchings 5
Bandit Learning in Matching Markets with Indifference 2
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression 3
Bayesian Analysis of Combinatorial Gaussian Process Bandits 3
Bayesian Experimental Design Via Contrastive Diffusions 5
Bayesian Image Regression with Soft-thresholded Conditional Autoregressive Prior 4
Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences 5
Bayesian Optimization via Continual Variational Last Layer Training 3
Bayesian Regularization of Latent Representation 4
Bayesian Treatment of the Spectrum of the Empirical Kernel in (Sub)Linear-Width Neural Networks 2
Bayesian WeakS-to-Strong from Text Classification to Generation 5
Be More Diverse than the Most Diverse: Optimal Mixtures of Generative Models via Mixture-UCB Bandit Algorithms 4
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning 2
BenTo: Benchmark Reduction with In-Context Transferability 5
Benchmarking Agentic Workflow Generation 6
Benchmarking LLMs' Judgments with No Gold Standard 7
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent 4
Benchmarking Predictive Coding Networks -- Made Simple 6
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset 6
Benign Overfitting in Out-of-Distribution Generalization of Linear Models 2
Better Instruction-Following Through Minimum Bayes Risk 5
Better autoregressive regression with LLMs via regression-aware fine-tuning 3
Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback 6
Beware of Calibration Data for Pruning Large Language Models 4
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning 6
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time 4
Beyond Canonicalization: How Tensorial Messages Improve Equivariant Message Passing 4
Beyond Circuit Connections: A Non-Message Passing Graph Transformer Approach for Quantum Error Mitigation 5
Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models 3
Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality 6
Beyond Graphs: Can Large Language Models Comprehend Hypergraphs? 1
Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness 4
Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix 4
Beyond Mere Token Analysis: A Hypergraph Metric Space Framework for Defending Against Socially Engineered LLM Attacks 5
Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification 4
Beyond Next Token Prediction: Patch-Level Training for Large Language Models 6
Beyond Random Augmentations: Pretraining with Hard Views 7
Beyond Random Masking: When Dropout meets Graph Convolutional Networks 5
Beyond Sequence: Impact of Geometric Context for RNA Property Prediction 4
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution 5
Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks 2
Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension ability 5
Beyond Worst-Case Dimensionality Reduction for Sparse Vectors 0
Beyond correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge 5
Beyond single neurons: population response geometry in digital twins of mouse visual cortex 3
Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints 6
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration 5
Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models 5
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities 6
Bias Mitigation in Graph Diffusion Models 3
Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling 5
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions 5
BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks 5
Bilinear MLPs enable weight-based mechanistic interpretability 4
Binary Losses for Density Ratio Estimation 5
BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models 3
BingoGuard: LLM Content Moderation Tools with Risk Levels 4
Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences 5
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments 4
Biologically Constrained Barrel Cortex Model Integrates Whisker Inputs and Replicates Key Brain Network Dynamics 5
Biologically Plausible Brain Graph Transformer 6
BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics 6
Bisimulation Metric for Model Predictive Control 5
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments 6
Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition 4
Black-Box Detection of Language Model Watermarks 3
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning 5
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models 7
Block Verification Accelerates Speculative Decoding 4
Block-Attention for Efficient Prefilling 4
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks 4
BodyGen: Advancing Towards Efficient Embodiment Co-Design 5
Boltzmann Semantic Score: A Semantic Metric for Evaluating Large Vision Models Using Large Language Models 7
Boltzmann priors for Implicit Transfer Operators 5
Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions 5
BoneMet: An Open Large-Scale Multi-Modal Murine Dataset for Breast Cancer Bone Metastasis Diagnosis and Prognosis 5
Bonsai: Gradient-free Graph Condensation for Node Classification 7
Boost Self-Supervised Dataset Distillation via Parameterization, Predefined Augmentation, and Approximation 5
Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation 5
Boosting Latent Diffusion with Perceptual Objectives 3
Boosting Methods for Interval-censored Data with Regression and Classification 5
Boosting Multiple Views for pretrained-based Continual Learning 5
Boosting Neural Combinatorial Optimization for Large-Scale Vehicle Routing Problems 6
Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games 4
Boosting Ray Search Procedure of Hard-label Attacks with Transfer-based Priors 7
Boosting the visual interpretability of CLIP via adversarial fine-tuning 5
Bootstrapped Model Predictive Control 5
Bootstrapping Language Models with DPO Implicit Rewards 5
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel 6
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation 5
Bounds on $L_p$ Errors in Density Ratio Estimation via $f$-Divergence Loss Functions 4
Brain Bandit: A Biologically Grounded Neural Network for Efficient Control of Exploration 4
Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers 5
Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex 6
BrainACTIV: Identifying visuo-semantic properties driving cortical selectivity using diffusion-based image manipulation 4
BrainOOD: Out-of-distribution Generalizable Brain Network Analysis 4
BrainUICL: An Unsupervised Individual Continual Learning Framework for EEG Applications 5
Breach By A Thousand Leaks: Unsafe Information Leakage in 'Safe' AI Responses 3
Breaking Class Barriers: Efficient Dataset Distillation via Inter-Class Feature Compensator 5
Breaking Free from MMI: A New Frontier in Rationalization by Probing Input Utilization 5
Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate 5
Breaking Neural Network Scaling Laws with Modularity 4
Breaking the $\log(1/\Delta_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids 3
Breaking the Reclustering Barrier in Centroid-based Deep Clustering 6
Bridging Compressed Image Latents and Multimodal Large Language Models 3
Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding 5
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach 5
Bridging Jensen Gap for Max-Min Group Fairness Optimization in Recommendation 7
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization 5
Bridging the Data Provenance Gap Across Text, Speech, and Video 2
Bridging the Gap Between f-divergences and Bayes Hilbert Spaces 4
Bridging the Gap between Database Search and De Novo Peptide Sequencing with SearchNovo 5
Bridging the Gap between Variational Inference and Stochastic Gradient MCMC in Function Space 4
Bridging the Semantic Gap Between Text and Table: A Case Study on NL2SQL 6
Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder 4
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space 5
Broadening Target Distributions for Accelerated Diffusion Models via a Novel Analysis Approach 1
Budgeted Online Continual Learning by Adaptive Layer Freezing and Frequency-based Sampling 6
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation 5
Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting 4
Building Math Agents with Multi-Turn Iterative Preference Learning 7
Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences 4
Bundle Neural Network for message diffusion on graphs 6
C-CLIP: Multimodal Continual Learning for Vision-Language Model 5
CAKE: Cascading and Adaptive KV Cache Eviction with Layer Preferences 6
CAMEx: Curvature-aware Merging of Experts 6
CARTS: Advancing Neural Theorem Proving with Diversified Tactic Calibration and Bias-Resistant Tree Search 6
CAT-3DGS: A Context-Adaptive Triplane Approach to Rate-Distortion-Optimized 3DGS Compression 3
CATCH: Channel-Aware Multivariate Time Series Anomaly Detection via Frequency Patching 7
CAX: Cellular Automata Accelerated in JAX 6
CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph 6
CBMA: Improving Conformal Prediction through Bayesian Model Averaging 5
CBQ: Cross-Block Quantization for Large Language Models 4
CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding 6
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models 6
CFD: Learning Generalized Molecular Representation via Concept-Enhanced Feedback Disentanglement 5
CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models 4
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding 2
CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators 4
CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL 5
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs 4
CL-DiffPhyCon: Closed-loop Diffusion Control of Complex Physical Systems 5
CL-MFAP: A Contrastive Learning-Based Multimodal Foundation Model for Molecular Property Prediction and Antibiotic Screening 6
CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models 4
CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale 5
CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing 5
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification 6
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control 4
CO-MOT: Boosting End-to-end Transformer-based Multi-Object Tracking via Coopetition Label Assignment and Shadow Sets 5
COAT: Compressing Optimizer states and Activations for Memory-Efficient FP8 Training 4
COFlowNet: Conservative Constraints on Flows Enable High-Quality Candidate Generation 3
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation 4
COME: Test-time Adaption by Conservatively Minimizing Entropy 6
CONDA: Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts 6
CONGO: Compressive Online Gradient Optimization 5
CONTRA: Conformal Prediction Region via Normalizing Flow Transformation 5
COPER: Correlation-based Permutations for Multi-View Clustering 4
CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion 6
CR-CTC: Consistency regularization on CTC for improved speech recognition 5
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning 6
CREAM: Consistency Regularized Self-Rewarding Language Models 4
CREIMBO: Cross-Regional Ensemble Interactions in Multi-view Brain Observations 4
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion 5
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery 5
CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features 5
CTSyn: A Foundation Model for Cross Tabular Data Generation 4
CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning 4
CViT: Continuous Vision Transformer for Operator Learning 5
CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation 3
Cached Multi-Lora Composition for Multi-Concept Image Generation 4
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control 5
Calibrating Expressions of Certainty 5
Calibrating LLMs with Information-Theoretic Evidential Deep Learning 5
CameraCtrl: Enabling Camera Control for Video Diffusion Models 4
Can Generative AI Solve Your In-Context Learning Problem? A Martingale Perspective 4
Can In-context Learning Really Generalize to Out-of-distribution Tasks? 2
Can Knowledge Editing Really Correct Hallucinations? 3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers 3
Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book? 3
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That? 5
Can LLMs Solve Longer Math Word Problems Better? 5
Can LLMs Understand Time Series Anomalies? 5
Can Large Language Models Understand Symbolic Graphics Programs? 4
Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model 2
Can One Modality Model Synergize Training of Other Modality Models? 6
Can Reinforcement Learning Solve Asymmetric Combinatorial-Continuous Zero-Sum Games? 6
Can Textual Gradient Work in Federated Learning? 4
Can Transformers Do Enumerative Geometry? 2
Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models 5
Can Watermarked LLMs be Identified by Users via Crafted Prompts? 4
Can Watermarks be Used to Detect LLM IP Infringement For Free? 3
Can We Ignore Labels in Out of Distribution Detection? 4
Can We Talk Models Into Seeing the World Differently? 4
Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-Based Decision-Making Systems 4
Can a Large Language Model be a Gaslighter? 6
Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning 5
Capability Localization: Capabilities Can be Localized rather than Individual Knowledge 3
CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation 5
Captured by Captions: On Memorization and its Mitigation in CLIP Models 4
Capturing the Temporal Dependence of Training Data Influence 4
CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux Modelling 5
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models 6
Catastrophic Failure of LLM Unlearning via Quantization 3
Cauchy-Schwarz Regularizers 5
Causal Concept Graph Models: Beyond Causal Opacity in Deep Learning 5
Causal Discovery via Bayesian Optimization 5
Causal Effect Estimation with Mixed Latent Confounders and Post-treatment Variables 2
Causal Graph Transformer for Treatment Effect Estimation Under Unknown Interference 7
Causal Graphical Models for Vision-Language Compositional Understanding 5
Causal Identification for Complex Functional Longitudinal Studies 2
Causal Information Prioritization for Efficient Reinforcement Learning 5
Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference 4
Causal Representation Learning from Multimodal Biomedical Observations 3
CausalRivers - Scaling up benchmarking of causal discovery for real-world time-series 4
Causally Motivated Sycophancy Mitigation for Large Language Models 4
Centrality-guided Pre-training for Graph 5
Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images 3
CertainlyUncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness 4
Certified Robustness Under Bounded Levenshtein Distance 5
Certifying Counterfactual Bias in LLMs 6
Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks 4
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models 4
Chain-of-Focus Prompting: Leveraging Sequential Visual Cues to Prompt Large Autoregressive Vision Models 4
Chain-of-Thought Provably Enables Learning the (Otherwise) Unlearnable 4
Chain-of-region: Visual Language Models Need Details for Diagram Analysis 4
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation 7
ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding 5
Charting the Design Space of Neural Graph Representations for Subgraph Matching 6
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities 3
CheapNet: Cross-attention on Hierarchical representations for Efficient protein-ligand binding Affinity Prediction 6
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates 7
ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning 5
Chemistry-Inspired Diffusion with Non-Differentiable Guidance 5
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains 6
Chunk-Distilled Language Modeling 6
CipherPrune: Efficient and Scalable Private Transformer Inference 7
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer 5
Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment 5
Circuit Transformer: A Transformer That Preserves Logical Equivalence 6
CircuitFusion: Multimodal Circuit Representation Learning for Agile Chip Design 6
CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs 5
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes 5
Class Distribution-induced Attention Map for Open-vocabulary Semantic Segmentations 5
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance 5
Classic but Everlasting: Traditional Gradient-Based Algorithms Converge Fast Even in Time-Varying Multi-Player Games 2
ClawMachine: Learning to Fetch Visual Tokens for Referential Comprehension 5
ClimaQA: An Automated Evaluation Framework for Climate Question Answering Models 4
Clique Number Estimation via Differentiable Functions of Adjacency Matrix Permutations 7
Closed-Form Merging of Parameter-Efficient Modules for Federated Continual Learning 5
Co$^{\mathbf{3}}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion 5
CoInD: Enabling Logical Compositions in Diffusion Models 5
CoMRes: Semi-Supervised Time Series Forecasting Utilizing Consensus Promotion of Multi-Resolution 5
CoMotion: Concurrent Multi-person 3D Motion 4
CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking 5
CoTFormer: A Chain of Thought Driven Architecture with Budget-Adaptive Computation Cost at Inference 4
Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion 4
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding & Reasoning Capabilities of CodeLLMs 3
CodePlan: Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning 5
CofCA: A STEP-WISE Counterfactual Multi-hop QA benchmark 4
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning 7
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer 5
ColPali: Efficient Document Retrieval with Vision Language Models 5
Collab: Controlled Decoding using Mixture of Agents for LLM Alignment 5
CollabEdit: Towards Non-destructive Collaborative Knowledge Editing 4
Collaborative Discrete-Continuous Black-Box Prompt Learning for Language Models 6
Collapsed Language Models Promote Fairness 3
ComLoRA: A Competitive Learning Approach for Enhancing LoRA 4
ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors 5
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization 1
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection 5
Combining Induction and Transduction for Abstract Reasoning 6
Commit0: Library Generation from Scratch 4
Comparing Targeting Strategies for Maximizing Social Welfare with Limited Resources 2
Comparing noisy neural population dynamics using optimal transport distances 3
Competing Large Language Models in Multi-Agent Gaming Environments 2
Competition Dynamics Shape Algorithmic Phases of In-Context Learning 5
Competitive Fair Scheduling with Predictions 4
Complementary Label Learning with Positive Label Guessing and Negative Label Enhancement 6
Complexity Lower Bounds of Adaptive Gradient Algorithms for Non-convex Stochastic Optimization under Relaxed Smoothness 0
Composable Interventions for Language Models 4
Composing Unbalanced Flows for Flexible Docking and Relaxation 7
Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering 2
Compositional Entailment Learning for Hyperbolic Vision-Language Models 5
Compositional simulation-based inference for time series 4
Computational Explorations of Total Variation Distance 0
Computational Limits of Low-Rank Adaptation (LoRA) Fine-Tuning for Transformer Models 4
Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics 1
Compute-Constrained Data Selection 4
Compute-Optimal LLMs Provably Generalize Better with Scale 3
Computing Circuits Optimization via Model-Based Circuit Genetic Evolution 4
ConFIG: Towards Conflict-free Training of Physics Informed Neural Networks 5
ConMix: Contrastive Mixup at Representation Level for Long-tailed Deep Clustering 5
Concept Bottleneck Language Models For Protein Design 4
Concept Bottleneck Large Language Models 3
Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate 5
Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing 5
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning 3
ConcreTizer: Model Inversion Attack via Occupancy Classification and Dispersion Control for 3D Point Cloud Restoration 2
Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation 0
Conditional Diffusion with Ordinal Regression: Longitudinal Data Generation for Neurodegenerative Disease Studies 5
Conditional Testing based on Localized Conformal $p$-values 5
Confidence Elicitation: A New Attack Vector for Large Language Models 5
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning 4
Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering 5
Conformal Language Model Reasoning with Coherent Factuality 5
Conformal Prediction Sets Can Cause Disparate Impact 6
Conformal Structured Prediction 3
Conformalized Interactive Imitation Learning: Handling Expert Shift and Intermittent Feedback 3
Conformalized Survival Analysis for General Right-Censored Data 6
Connecting Federated ADMM to Bayes 4
Connectome Mapping: Shape-Memory Network via Interpretation of Contextual Semantic Information 7
Conservative Contextual Bandits: Beyond Linear Representations 3
Consistency Checks for Language Model Forecasters 6
Consistency Models Made Easy 5
Consistent Flow Distillation for Text-to-3D Generation 3
Constraint-Conditioned Actor-Critic for Offline Safe Reinforcement Learning 5
Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets 5
Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions 4
Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHR Data 5
Context Steering: Controllable Personalization at Inference Time 4
Context-Alignment: Activating and Enhancing LLMs Capabilities in Time Series 5
Context-Parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance 2
Context-aware Dynamic Pruning for Speech Foundation Models 6
ContextGNN: Beyond Two-Tower Recommendation Systems 5
Contextual Document Embeddings 3
Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding 5
Contextualizing biological perturbation experiments through language 4
Continual Slow-and-Fast Adaptation of Latent Neural Dynamics (CoSFan): Meta-Learning What-How & When to Adapt 5
Continuity-Preserving Convolutional Autoencoders for Learning Continuous Latent Dynamical Models from Images 4
Continuous Autoregressive Modeling with Stochastic Monotonic Alignment for Speech Synthesis 4
Continuous Diffusion for Mixed-Type Tabular Data 7
Continuous Ensemble Weather Forecasting with Diffusion models 6
Continuous Exposure Learning for Low-light Image Enhancement using Neural ODEs 6
ContraDiff: Planning Towards High Return States via Contrastive Learning 6
Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery 7
Contrastive Learning from Synthetic Audio Doppelgängers 5
Control-oriented Clustering of Visual Latent Representation 4
ControlAR: Controllable Image Generation with Autoregressive Models 4
Controllable Blur Data Augmentation Using 3D-Aware Motion Estimation 4
Controllable Context Sensitivity and the Knob Behind It 5
Controllable Generation via Locally Constrained Resampling 5
Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements 5
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control 5
Controllable Unlearning for Image-to-Image Generative Models via $\epsilon$-Constrained Optimization 5
Controlled LLM Decoding via Discrete Auto-regressive Biasing 4
Controlling Language and Diffusion Models by Transporting Activations 4
Controlling Space and Time with Diffusion Models 4
ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments 3
Convergence and Implicit Bias of Gradient Descent on Continual Linear Classification 4
Convergence of Distributed Adaptive Optimization with Local Updates 1
Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis 1
Convergent Privacy Loss of Noisy-SGD without Convexity and Smoothness 2
Convex Formulations for Training Two-Layer ReLU Neural Networks 6
Copyright-Protected Language Generation via Adaptive Model Fusion 5
Coreset Selection via Reducible Loss in Continual Learning 6
Coreset Spectral Clustering 5
Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization 4
Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking 4
Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain) 5
Correlation and Navigation in the Vocabulary Key Representation Space of Language Models 3
Counterfactual Concept Bottleneck Models 5
Counterfactual Generative Modeling with Variational Causal Inference 5
Counterfactual Realizability 1
CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code Repair 5
Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification 7
Credit-based self organizing maps: training deep topographic networks with minimal performance degradation 4
Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion 6
Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models 4
Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits 2
Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint 6
Cross-Embodiment Dexterous Grasping with Reinforcement Learning 5
Cross-Entropy Is All You Need To Invert the Data Generating Process 4
Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models 7
CrossMPT: Cross-attention Message-passing Transformer for Error Correcting Codes 4
CryoFM: A Flow-based Foundation Model for Cryo-EM Densities 5
CryoGEN: Generative Energy-based Models for Cryogenic Electron Tomography Reconstruction 5
CtD: Composition through Decomposition in Emergent Communication 4
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation 5
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model 7
Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling 4
CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation 4
Curriculum-aware Training for Discriminating Molecular Property Prediction Models 5
Cut Your Losses in Large-Vocabulary Language Models 7
Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems 6
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models 3
CyberHost: A One-stage Diffusion Framework for Audio-driven Talking Body Generation 5
CycleResearcher: Improving Automated Research via Automated Review 5
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection 5
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 5
DAMO: Decoding by Accumulating Activations Momentum for Mitigating Hallucinations in Vision-Language Models 3
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models 6
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking head Video Generation 5
DCT-CryptoNets: Scaling Private Inference in the Frequency Domain 4
DECO: Unleashing the Potential of ConvNets for Query-based Detection and Segmentation 4
DEEM: Diffusion models serve as the eyes of large language models for image perception 5
DELIFT: Data Efficient Language model Instruction Fine-Tuning 5
DELTA: DENSE EFFICIENT LONG-RANGE 3D TRACKING FOR ANY VIDEO 4
DEPT: Decoupled Embeddings for Pre-training Language Models 5
DEPfold: RNA Secondary Structure Prediction as Dependency Parsing. 6
DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models 5
DICE: Data Influence Cascade in Decentralized Learning 4
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image 5
DLEFT-MKC: Dynamic Late Fusion Multiple Kernel Clustering with Robust Tensor Learning via Min-Max Optimization 2
DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models 2
DON’T STOP ME NOW: EMBEDDING BASED SCHEDULING FOR LLMS 5
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback 4
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search 5
DPLM-2: A Multimodal Diffusion Protein Language Model 5
DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle 6
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing 7
DRL: Decomposed Representation Learning for Tabular Anomaly Detection 6
DRoC: Elevating Large Language Models for Complex Vehicle Routing via Decomposed Retrieval of Constraints 4
DRoP: Distributionally Robust Data Pruning 6
DS-LLM: Leveraging Dynamical Systems to Enhance Both Training and Inference of Large Language Models 3
DSBench: How Far Are Data Science Agents from Becoming Data Science Experts? 5
DSPO: Direct Score Preference Optimization for Diffusion Model Alignment 5
DUALFormer: Dual Graph Transformer 5
DUET: Decentralized Bilevel Optimization without Lower-Level Strong Convexity 5
DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation 6
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life 3
DarkBench: Benchmarking Dark Patterns in Large Language Models 3
DartControl: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control 6
Data Center Cooling System Optimization Using Offline Reinforcement Learning 2
Data Distillation for extrapolative protein design through exact preference optimization 4
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance 5
Data Pruning by Information Maximization 4
Data Scaling Laws in Imitation Learning for Robotic Manipulation 5
Data Selection via Optimal Control for Language Models 4
Data Shapley in One Training Run 4
Data Taggants: Dataset Ownership Verification Via Harmless Targeted Data Poisoning 3
Data Unlearning in Diffusion Models 4
Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning 5
Data-centric Prediction Explanation via Kernelized Stein Discrepancy 4
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback 6
DataGen: Unified Synthetic Dataset Generation via Large Language Models 4
DataMan: Data Manager for Pre-training Large Language Models 5
Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-training of Deep Networks 5
Dataset Ownership Verification in Contrastive Pre-trained Models 5
DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference 5
DeLLMa: Decision Making Under Uncertainty with Large Language Models 4
DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery 5
Debiasing Federated Learning with Correlated Client Participation 4
Debiasing Mini-Batch Quadratics for Applications in Deep Learning 4
Decentralized Optimization with Coupled Constraints 4
Decentralized Sporadic Federated Learning: A Unified Algorithmic Framework with Convergence Guarantees 5
DeciMamba: Exploring the Length Extrapolation Potential of Mamba 6
Decision Information Meets Large Language Models: The Future of Explainable Operations Research 2
Decision Tree Induction Through LLMs via Semantically-Aware Evolution 6
Decoding Game: On Minimax Optimality of Heuristic Text Generation Strategies 5
Decomposition Polyhedra of Piecewise Linear Functions 0
Deconstructing Denoising Diffusion Models for Self-Supervised Learning 4
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models 3
Decoupled Finetuning for Domain Generalizable Semantic Segmentation 4
Decoupled Graph Energy-based Model for Node Out-of-Distribution Detection on Heterophilic Graphs 6
Decoupled Subgraph Federated Learning 6
Decoupling Angles and Strength in Low-rank Adaptation 5
Decoupling Layout from Glyph in Online Chinese Handwriting Generation 6
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models 5
Deep Distributed Optimization for Large-Scale Quadratic Programming 5
Deep Incomplete Multi-view Learning via Cyclic Permutation of VAEs 5
Deep Kernel Posterior Learning under Infinite Variance Prior Weights 5
Deep Kernel Relative Test for Machine-generated Text Detection 7
Deep Learning Alternatives Of The Kolmogorov Superposition Theorem 4
Deep Linear Probe Generators for Weight Space Learning 4
Deep MMD Gradient Flow without adversarial training 5
Deep Networks Learn Features From Local Discontinuities in the Label Function 5
Deep Random Features for Scalable Interpolation of Spatiotemporal Data 5
Deep Signature: Characterization of Large-Scale Molecular Dynamics 3
Deep Weight Factorization: Sparse Learning Through the Lens of Artificial Symmetries 5
DeepGate4: Efficient and Effective Representation Learning for Circuit Design at Scale 5
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL 4
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model 5
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search 5
DeepTAGE: Deep Temporal-Aligned Gradient Enhancement for Optimizing Spiking Neural Networks 4
DeeperForward: Enhanced Forward-Forward Training for Deeper and Better Performance 6
DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory 7
Democratic Training Against Universal Adversarial Perturbations 6
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts 4
Demystifying Topological Message-Passing with Relational Structures: A Case Study on Oversquashing in Simplicial Message-Passing 6
Demystifying the Token Dynamics of Deep Selective State Space Models 6
DenoiseVAE: Learning Molecule-Adaptive Noise Distributions for Denoising-based 3D Molecular Pre-training 5
Denoising Autoregressive Transformers for Scalable Text-to-Image Generation 4
Denoising Levy Probabilistic Models 5
Denoising Task Difficulty-based Curriculum for Training Diffusion Models 4
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration 5
Denoising with a Joint-Embedding Predictive Architecture 6
Dense Video Object Captioning from Disjoint Supervision 6
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-centric 3D Visual Grounding 3
DenseMatcher: Learning 3D Semantic Correspondence for Category-Level Manipulation from a Single Demo 4
Density estimation with LLMs: a geometric investigation of in-context learning trajectories 2
Depth Any Video with Scalable Synthetic Data 5
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second 5
Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm 6
Descent with Misaligned Gradients and Applications to Hidden Convexity 1
Designing Concise ConvNets with Columnar Stages 6
Designing Mechanical Meta-Materials by Learning Equivariant Flows 2
Detecting Backdoor Samples in Contrastive Language Image Pretraining 6
Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling 5
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References 5
DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction 6
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors 4
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models 5
Diff-PIC: Revolutionizing Particle-In-Cell Nuclear Fusion Simulation with Diffusion Models 3
Diff-Prompt: Diffusion-driven Prompt Generator with Mask Supervision 4
Diff3DS: Generating View-Consistent 3D Sketch via Differentiable Curve Rendering 3
DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector 6
DiffPC: Diffusion-based High Perceptual Fidelity Image Compression with Semantic Refinement 6
DiffPuter: Empowering Diffusion Models for Missing Data Imputation 7
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation 3
Difference-of-submodular Bregman Divergence 4
Differentiable Causal Discovery for Latent Hierarchical Causal Models 4
Differentiable Integer Linear Programming 6
Differentiable Optimization of Similarity Scores Between Models and Brains 4
Differentiable Rule Induction from Raw Sequence Inputs 4
Differentiable and Learnable Wireless Simulation with Geometric Transformers 5
Differential Transformer 6
Differential learning kinetics govern the transition from memorization to generalization during in-context learning 1
Differentially Private Federated Learning with Time-Adaptive Privacy Spending 5
Differentially Private Steering for Large Language Model Alignment 5
Differentially private learners for heterogeneous treatment effects 5
Differentially private optimization for non-decomposable objective functions 6
Differentiation and Specialization of Attention Heads via the Refined Local Learning Coefficient 3
Diffusing States and Matching Scores: A New Framework for Imitation Learning 4
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning 5
Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Models 5
Diffusion Bridge AutoEncoders for Unsupervised Representation Learning 6
Diffusion Bridge Implicit Models 6
Diffusion Feedback Helps CLIP See Better 5
Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images 4
Diffusion Models Are Real-Time Game Engines 3
Diffusion Models are Evolutionary Algorithms 4
Diffusion Models as Cartoonists: The Curious Case of High Density Regions 4
Diffusion On Syntax Trees For Program Synthesis 3
Diffusion Policy Policy Optimization 5
Diffusion State-Guided Projected Gradient for Inverse Problems 6
Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data 2
Diffusion Transformers for Tabular Data Time Series Generation 5
Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models 5
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance 4
Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models 4
Diffusion-based Decoupled Deterministic and Uncertain Framework for Probabilistic Multivariate Time Series Forecasting 5
Diffusion-based Neural Network Weights Generation 5
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing 5
Digi-Q: Learning VLM Q-Value Functions for Training Device-Control Agents 4
Dimension Agnostic Neural Processes 5
Direct Distributional Optimization for Provable Alignment of Diffusion Models 5
Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Model Using Implicit Feedback from Pre-training Demonstrations 2
Directional Gradient Projection for Robust Fine-Tuning of Foundation Models 6
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation 4
DisPose: Disentangling Pose Guidance for Controllable Human Image Animation 4
Discovering Clone Negatives via Adaptive Contrastive Learning for Image-Text Matching 5
Discovering Group Structures via Unitary Representation Learning 5
Discovering Influential Neuron Path in Vision Transformers 6
Discovering Temporally Compositional Neural Manifolds with Switching Infinite GPFA 3
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models 3
Discrete Codebook World Models for Continuous Control 5
Discrete Copula Diffusion 5
Discrete Diffusion Schrödinger Bridge Matching for Graph Transformation 6
Discrete Distribution Networks 5
Discrete GCBF Proximal Policy Optimization for Multi-agent Safe Optimal Control 4
Discrete Latent Plans via Semantic Skill Abstractions 5
Discretization-invariance? On the Discretization Mismatch Errors in Neural Operators 5
Discriminating image representations with principal distortions 7
Discriminator-Guided Embodied Planning for LLM Agent 4
Disentangled Representation Learning with the Gromov-Monge Gap 5
Disentangling 3D Animal Pose Dynamics with Scrubbed Conditional Latent Variables 6
Disentangling Representations through Multi-task Learning 3
Dissecting Adversarial Robustness of Multimodal LM Agents 5
Dist Loss: Enhancing Regression in Few-Shot Region through Distribution Distance Constraint 5
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent 5
Distance-Based Tree-Sliced Wasserstein Distance 5
DistillHGNN: A Knowledge Distillation Approach for High-Speed Hypergraph Neural Networks 4
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching 5
Distilling Dataset into Neural Field 6
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning 5
Distilling Structural Representations into Protein Sequence Models 5
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference 5
Distribution Backtracking Builds A Faster Convergence Trajectory for Diffusion Distillation 6
Distribution-Free Data Uncertainty for Neural Network Regression 5
Distribution-Specific Agnostic Conditional Classification With Halfspaces 1
Distributional Associations vs In-Context Reasoning: A Study of Feed-forward and Attention Layers 5
Divergence of Neural Tangent Kernel in Classification Problems 2
Divergence-Regularized Discounted Aggregation: Equilibrium Finding in Multiplayer Partially Observable Stochastic Games 4
Divergence-enhanced Knowledge-guided Context Optimization for Visual-Language Prompt Tuning 5
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning 4
Diverse Preference Learning for Capabilities and Alignment 3
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents 4
Diversity-Rewarded CFG Distillation 2
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning 6
Do Contemporary Causal Inference Models Capture Real-World Heterogeneity? Findings from a Large-Scale Benchmark 4
Do Deep Neural Network Solutions Form a Star Domain? 5
Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions? 5
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models 4
Do LLM Agents Have Regret? A Case Study in Online Learning and Games 1
Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs 5
Do LLMs "know" internally when they follow instructions? 4
Do LLMs estimate uncertainty well in instruction-following? 4
Do LLMs have Consistent Values? 3
Do Large Language Models Truly Understand Geometric Structures? 5
Do Mice Grok? Glimpses of Hidden Progress in Sensory Cortex 3
Do Stochastic, Feel Noiseless: Stable Stochastic Optimization via a Double Momentum Mechanism 5
Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations? 4
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities 2
Do WGANs succeed because they minimize the Wasserstein Distance? Lessons from Discrete Generators 4
Do You Keep an Eye on What I Ask? Mitigating Multimodal Hallucination via Attention-Guided Ensemble Decoding 4
Do as I do (Safely): Mitigating Task-Specific Fine-tuning Risks in Large Language Models 5
Do as We Do, Not as You Think: the Conformity of Large Language Models 3
DoF: A Diffusion Factorization Framework for Offline Multi-Agent Reinforcement Learning 6
Dobi-SVD: Differentiable SVD for LLM Compression and Some New Perspectives 4
DocMIA: Document-Level Membership Inference Attacks against DocVQA Models 6
Does Refusal Training in LLMs Generalize to the Past Tense? 5
Does SGD really happen in tiny subspaces? 4
Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts? 3
Does Spatial Cognition Emerge in Frontier Models? 2
Does Training with Synthetic Data Truly Protect Privacy? 4
Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model 5
Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models 4
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL 4
Doubly Optimal Policy Evaluation for Reinforcement Learning 3
Doubly robust identification of treatment effects from multiple environments 5
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient 6
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want 5
Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination 4
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation 2
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation 4
DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation 5
Dreamweaver: Learning Compositional World Models from Pixels 3
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving 5
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization 5
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting 5
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces 5
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads 4
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images 5
Durable Quantization Conditioned Misalignment Attack on Large Language Models 2
DyCAST: Learning Dynamic Causal Structure from Time Series 3
DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation 5
DynFrs: An Efficient Framework for Machine Unlearning in Random Forest 7
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models 2
DynaPrompt: Dynamic Test-Time Prompt Tuning 6
Dynamic Assortment Selection and Pricing with Censored Preference Feedback 3
Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment 3
Dynamic Diffusion Transformer 5
Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Dynamic Scenes 5
Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining 4
Dynamic Low-Rank Sparse Adaptation for Large Language Models 5
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models 6
Dynamic Modeling of Patients, Modalities and Tasks via Multi-modal Multi-task Mixture of Experts 5
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping 3
Dynamic Negative Guidance of Diffusion Models 4
Dynamic Neural Fortresses: An Adaptive Shield for Model Extraction Defense 4
Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness 4
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification 4
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks 4
DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes 4
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models 5
Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs 3
E(3)-equivariant models cannot learn chirality: Field-based molecular generation 6
E(n) Equivariant Topological Neural Networks 5
EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing 4
EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation 5
ECD: A Machine Learning Benchmark for Predicting Enhanced-Precision Electronic Charge Density in Crystalline Inorganic Materials 5
ECHOPulse: ECG Controlled Echocardio-gram Video Generation 5
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models 5
Efficient Jailbreak Attack Sequences on Large Language Models via Multi-Armed Bandit-Based Context Switching 5
EG4D: Explicit Generation of 4D Object without Score Distillation 4
EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage 3
ELBOing Stein: Variational Bayes with Stein Mixture Inference 5
ELFS: Label-Free Coreset Selection with Proxy Training Dynamics 5
ELICIT: LLM Augmentation Via External In-context Capability 4
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment 5
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents 4
ESE: Espresso Sentence Embeddings 4
ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy 3
ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time 6
EVA: Geometric Inverse Design for Fast Protein Motif-Scaffolding with Coupled Flow 5
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders 4
Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective 4
Easing Training Process of Rectified Flow Models Via Lengthening Inter-Path Distance 4
EcoFace: Audio-Visual Emotional Co-Disentanglement Speech-Driven 3D Talking Face Generation 5
Edge Prompt Tuning for Graph Neural Networks 4
Edge-aware Image Smoothing with Relative Wavelet Domain Representation 3
EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation 4
EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing 3
Effective Interplay between Sparsity and Quantization: From Theory to Practice 4
Effective and Efficient Time-Varying Counterfactual Prediction with State-Space Models 4
Effective post-training embedding compression via temperature control in contrastive training 3
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs 4
Efficient Active Imitation Learning with Random Network Distillation 4
Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation 2
Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition 6
Efficient Biological Data Acquisition through Inference Set Design 6
Efficient Causal Decision Making with One-sided Feedback 2
Efficient Cross-Episode Meta-RL 6
Efficient Dictionary Learning with Switch Sparse Autoencoders 3
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning 5
Efficient Discovery of Pareto Front for Multi-Objective Reinforcement Learning 5
Efficient Distribution Matching of Representations via Noise-Injected Deep InfoMax 3
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets 5
Efficient Evolutionary Search Over Chemical Space with Large Language Models 6
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction 4
Efficient Imitation under Misspecification 4
Efficient Inference for Large Language Model-based Generative Recommendation 6
Efficient Interpolation between Extragradient and Proximal Methods for Weak MVIs 2
Efficient Learning with Sine-Activated Low-Rank Matrices 5
Efficient Low-Bit Quantization with Adaptive Scales for Multi-Task Co-Training 5
Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark 6
Efficient Model Editing with Task-Localized Sparse Fine-tuning 6
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling 4
Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching 4
Efficient Neuron Segmentation in Electron Microscopy by Affinity-Guided Queries 5
Efficient Off-Policy Learning for High-Dimensional Action Spaces 4
Efficient Online Pruning and Abstraction for Imperfect Information Extensive-Form Games 3
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data 4
Efficient Perplexity Bound and Ratio Matching in Discrete Diffusion Language Models 5
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning 3
Efficient Reinforcement Learning with Large Language Model Priors 6
Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping 5
Efficient Source-Free Time-Series Adaptation via Parameter Subspace Disentanglement 4
Efficient Sparse PCA via Block-Diagonalization 4
Efficient Top-m Data Values Identification for Data Selection 6
Efficient Training of Neural Stochastic Differential Equations by Matching Finite Dimensional Distributions 6
Efficient and Accurate Explanation Estimation with Distribution Compression 6
Efficient and Context-Aware Label Propagation for Zero-/Few-Shot Training-Free Adaptation of Vision-Language Model 6
Efficient and Robust Neural Combinatorial Optimization via Wasserstein-Based Coresets 6
Efficient and Trustworthy Causal Discovery with Latent Variables and Complex Relations 2
Efficient stagewise pretraining via progressive subnetworks 4
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts 5
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs 7
Efficiently Parameterized Neural Metriplectic Systems 6
EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition 4
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos 3
EgoSim: Egocentric Exploration in Virtual Worlds with Multi-modal Conditioning 4
ElasticTok: Adaptive Tokenization for Image and Video 4
Eliciting Human Preferences with Language Models 4
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models 6
Eliminating Position Bias of Language Models: A Mechanistic Approach 5
Elliptic Loss Regularization 4
Elucidating the Preconditioning in Consistency Distillation 3
EmbedLLM: Learning Compact Representations of Large Language Models 5
EmbodiedSAM: Online Segment Any 3D Thing in Real Time 4
Emergence of a High-Dimensional Abstraction Phase in Language Transformers 5
Emergence of meta-stable clustering in mean-field transformer models 3
Emergent Orientation Maps —— Mechanisms, Coding Efficiency and Robustness 4
Emerging Safety Attack and Defense in Federated Instruction Tuning of Large Language Models 5
Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning 4
Empowering Users in Digital Privacy Management through Interactive LLM-Based Agents 2
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference 5
Encryption-Friendly LLM Architecture 5
End-to-end Learning of Gaussian Mixture Priors for Diffusion Sampler 3
Endless Jailbreaks with Bijection Learning 3
Endowing Visual Reprogramming with Adversarial Robustness 5
Energy-Based Diffusion Language Models for Text Generation 5
Energy-Weighted Flow Matching for Offline Reinforcement Learning 3
Energy-based Backdoor Defense Against Federated Graph Learning 5
Enhance Multi-View Classification Through Multi-Scale Alignment and Expanded Boundary 5
Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions 6
Enhancing Clustered Federated Learning: Integration of Strategies and Improved Methodologies 6
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data 6
Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds 4
Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information 5
Enhancing End-to-End Autonomous Driving with Latent World Model 4
Enhancing Federated Domain Adaptation with Multi-Domain Prototype-Based Federated Fine-Tuning 6
Enhancing Graph Of Thought: Enhancing Prompts with LLM Rationales and Dynamic Temperature Control 1
Enhancing Language Model Agents using Diversity of Thoughts 5
Enhancing Learning with Label Differential Privacy by Vector Approximation 2
Enhancing Pre-trained Representation Classifiability can Boost its Interpretability 3
Enhancing Prediction Performance through Influence Measure 4
Enhancing Robust Fairness via Confusional Spectral Regularization 4
Enhancing Uncertainty Estimation and Interpretability with Bayesian Non-negative Decision Layer 5
Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures 5
Enhancing the Scalability and Applicability of Kohn-Sham Hamiltonians for Molecular Systems 5
Ensembles of Low-Rank Expert Adapters 4
Ensembling Diffusion Models via Adaptive Feature Aggregation 6
Entropy-based Activation Function Optimization: A Method on Searching Better Activation Functions 5
Episodic Memories Generation and Evaluation Benchmark for Large Language Models 4
Episodic Novelty Through Temporal Distance 5
Epistemic Monte Carlo Tree Search 5
EqNIO: Subequivariant Neural Inertial Odometry 5
Equivariant Denoisers Cannot Copy Graphs: Align Your Graph Diffusion Models 6
Equivariant Masked Position Prediction for Efficient Molecular Representation 4
Equivariant Neural Functional Networks for Transformers 4
Erasing Concept Combination from Text-to-Image Diffusion Model 6
Error-quantified Conformal Inference for Time Series 4
Estimating the Probabilities of Rare Outputs in Language Models 5
Estimation of single-cell and tissue perturbation effect in spatial transcriptomics via Spatial Causal Disentanglement 4
EvA: Erasing Spurious Correlations with Activations 5
Evaluating Large Language Models through Role-Guide and Self-Reflection: A Comparative Study 5
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective 5
Event-Driven Online Vertical Federated Learning 3
Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models 5
Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable? 3
Evidential Learning-based Certainty Estimation for Robust Dense Feature Matching 4
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning 6
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles 5
Exact Certification of (Graph) Neural Networks Against Label Poisoning 5
Exact Community Recovery under Side Information: Optimality of Spectral Algorithms 2
Exact Computation of Any-Order Shapley Interactions for Graph Neural Networks 6
Examining Alignment of Large Language Models through Representative Heuristics: the case of political stereotypes 3
Execution-guided within-prompt search for programming-by-example 2
Expand and Compress: Exploring Tuning Principles for Continual Spatio-Temporal Graph Forecasting 6
Expected Return Symmetries 5
Expected Sliced Transport Plans 3
Explain Yourself, Briefly! Self-Explaining Neural Networks with Concise Sufficient Reasons 5
Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation 3
Explanations of GNN on Evolving Graphs via Axiomatic Layer edges 4
Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval 2
Exploiting Hidden Symmetry to Improve Objective Perturbation for DP Linear Learners with a Nonsmooth L1-Norm 4
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank 2
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF 1
Explore Theory of Mind: program-guided adversarial data generation for theory of mind reasoning 4
Exploring Learning Complexity for Efficient Downstream Dataset Pruning 5
Exploring Local Memorization in Diffusion Models via Bright Ending Attention 3
Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View 4
Exploring The Forgetting in Adversarial Training: A Novel Method for Enhancing Robustness 5
Exploring The Loss Landscape Of Regularized Neural Networks Via Convex Duality 2
Exploring a Principled Framework for Deep Subspace Clustering 6
Exploring channel distinguishability in local neighborhoods of the model space in quantum neural networks 4
Exploring the Camera Bias of Person Re-identification 6
Exploring the Design Space of Visual Context Representation in Video MLLMs 4
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models 4
Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning 5
Exposure Bracketing Is All You Need For A High-Quality Image 5
Expressivity of Neural Networks with Random Weights and Learned Biases 3
Extendable and Iterative Structure Learning Strategy for Bayesian Networks 2
Extending Mercer's expansion to indefinite and asymmetric kernels 1
F-Fidelity: A Robust Framework for Faithfulness Evaluation of Explainable AI 5
FACTS: A Factored State-Space Framework for World Modelling 6
FIG: Flow with Interpolant Guidance for Linear Inverse Problems 6
FIRING-Net: A filtered feature recycling network for speech enhancement 3
FLIP: Flow-Centric Generative Planning as General-Purpose Manipulation World Model 4
FLOPS: Forward Learning with OPtimal Sampling 6
FOSP: Fine-tuning Offline Safe Policy through World Models 5
FaceShot: Bring Any Character into Life 3
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning 4
Factor Graph-based Interpretable Neural Networks 6
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models 3
Fair Clustering in the Sliding Window Model 4
Fair Submodular Cover 5
FairDen: Fair Density-Based Clustering 5
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs 6
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" 3
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models 5
Fantastic Copyrighted Beasts and How (Not) to Generate Them 5
Fantastic Targets for Concept Erasure in Diffusion Models and Where To Find Them 5
Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation 4
Fast Feedforward 3D Gaussian Splatting Compression 5
Fast Summation of Radial Kernels via QMC Slicing 4
Fast Training of Sinusoidal Neural Fields via Scaling Initialization 4
Fast Uncovering of Protein Sequence Diversity from Structure 5
Fast and Accurate Blind Flexible Docking 6
Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage 6
Fast training and sampling of Restricted Boltzmann Machines 6
Fast unsupervised ground metric learning with tree-Wasserstein distance 5
Faster Algorithms for Structured Linear and Kernel Support Vector Machines 1
Faster Cascades via Speculative Decoding 5
Faster Diffusion Sampling with Randomized Midpoints: Sequential and Parallel 3
Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling 6
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality 4
Fat-to-Thin Policy Optimization: Offline Reinforcement Learning with Sparse Policies 4
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models 4
Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks 2
Feature Responsiveness Scores: Model-Agnostic Explanations for Recourse 5
Feature-Based Online Bilateral Trade 1
FedLWS: Federated Learning with Adaptive Layer-wise Weight Shrinking 6
FedTMOS: Efficient One-Shot Federated Learning with Tsetlin Machine 5
Federated $Q$-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost 4
Federated Class-Incremental Learning: A Hybrid Approach Using Latent Exemplars and Data-Free Techniques to Address Local and Global Forgetting 3
Federated Continual Learning Goes Online: Uncertainty-Aware Memory Management for Vision Tasks and Beyond 5
Federated Domain Generalization with Data-free On-server Matching Gradient 5
Federated Few-Shot Class-Incremental Learning 6
Federated Granger Causality Learning For Interdependent Clients With State Space Representation 4
Federated Residual Low-Rank Adaption of Large Language Models 6
Feedback Favors the Generalization of Neural ODEs 6
Feedback Schrödinger Bridge Matching 2
Fengbo: a Clifford Neural Operator pipeline for 3D PDEs in Computational Fluid Dynamics 4
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms 3
Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization 4
Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement 5
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset 4
Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning 5
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models 5
Field-DiT: Diffusion Transformer on Unified Video, 3D, and Game Field Generation 2
Filtered not Mixed: Filtering-Based Online Gating for Mixture of Large Language Models 4
Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment 2
Find A Winning Sign: Sign Is All We Need to Win the Lottery 4
Finding Shared Decodable Concepts and their Negations in the Brain 6
Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment 6
Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic 5
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design 6
Fine-tuning can Help Detect Pretraining Data from Large Language Models 5
Fine-tuning with Reserved Majority for Noise Reduction 5
First-Person Fairness in Chatbots 4
Fitting Networks with a Cancellation Trick 2
Flash Inference: Near Linear Time Inference for Long Convolution Sequence Models and Beyond 3
FlashMask: Efficient and Rich Mask Extension of FlashAttention 6
FlashRNN: I/O-Aware Optimization of Traditional RNNs on modern hardware 7
Flat Reward in Policy Parameter Space Implies Robust Reinforcement Learning 5
Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks 3
FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models 4
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference 5
FlickerFusion: Intra-trajectory Domain Generalizing Multi-agent Reinforcement Learning 6
Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors 5
Flow Matching with Gaussian Process Priors for Probabilistic Time Series Forecasting 5
Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective 3
Flow matching achieves almost minimax optimal convergence 0
Flow-based Variational Mutual Information: Fast and Flexible Approximations 6
Flow: Modularized Agentic Workflow Automation 3
FlowDec: A flow-based full-band general audio codec with high perceptual quality 5
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens 3
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems 3
Following the Human Thread in Social Navigation 4
For Better or For Worse? Learning Minimum Variance Features With Label Augmentation 5
ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities 6
Forewarned is Forearmed: Harnessing LLMs for Data Synthesis via Failure-induced Exploration 6
Forget the Data and Fine-Tuning! Just Fold the Network to Compress 6
Forgetting Transformer: Softmax Attention with a Forget Gate 6
Forking Paths in Neural Text Generation 3
FormalAlign: Automated Alignment Evaluation for Autoformalization 3
Formation of Representations in Neural Networks 3
Forte: Finding Outliers with Representation Typicality Estimation 5
Foundation Models Secretly Understand Neural Network Weights: Enhancing Hypernetwork Architectures with Foundation Models 2
Fourier Head: Helping Large Language Models Learn Complex Probability Distributions 5
Fourier Sliced-Wasserstein Embedding for Multisets and Measures 4
Fragment and Geometry Aware Tokenization of Molecules for Structure-Based Drug Design Using Language Models 4
Frame-Voyager: Learning to Query Frames for Video Large Language Models 4
Framer: Interactive Frame Interpolation 5
FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling 5
FreDF: Learning to Forecast in the Frequency Domain 5
FreSh: Frequency Shifting for Accelerated Neural Representation Learning 4
Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs 6
FreeCG: Free the Design Space of Clebsch-Gordan Transform for Machine Learning Force Fields 6
FreeVS: Generative View Synthesis on Free Driving Trajectory 5
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise 5
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning 6
From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data 4
From Attention to Activation: Unraveling the Enigmas of Large Language Models 6
From Commands to Prompts: LLM-based Semantic File System for AIOS 2
From Decoupling to Adaptive Transformation: a Wider Optimization Space for PTQ 5
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions 4
From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation 4
From GNNs to Trees: Multi-Granular Interpretability for Graph Neural Networks 4
From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs 4
From Layers to States: A State Space Model Perspective to Deep Neural Network Layer Dynamics 5
From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks 3
From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question-Answering 6
From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities 5
From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy 1
From Promise to Practice: Realizing High-performance Decentralized Training 7
From Risk to Uncertainty: Generating Predictive Uncertainty Measures via Bayesian Estimation 3
From Search to Sampling: Generative Models for Robust Algorithmic Recourse 5
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency 5
From Tokens to Lattices: Emergent Lattice Structures in Language Models 5
From Tokens to Words: On the Inner Lexicon of LLMs 4
From an LLM Swarm to a PDDL-empowered Hive: Planning Self-executed Instructions in a Multi-modal Jungle 3
Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation 4
Fugatto 1: Foundational Generative Audio Transformer Opus 1 5
Fully-inductive Node Classification on Arbitrary Graphs 6
Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks 6
Fundamental Limitations on Subquadratic Alternatives to Transformers 0
Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency 0
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model 6
GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation 6
GANDALF: Generative AttentioN based Data Augmentation and predictive modeLing Framework for personalized cancer treatment 5
GDrag: Towards General-Purpose Interactive Editing with Anti-ambiguity Point Diffusion 4
GETS: Ensemble Temperature Scaling for Calibration in Graph Neural Networks 5
GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation 4
GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering 4
GIFT: Unlocking Full Potential of Labels in Distilled Dataset at Near-zero Cost 6
GLOMA: Global Video Text Spotting with Morphological Association 4
GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs 6
GMValuator: Similarity-based Data Valuation for Generative Models 6
GNNs Getting ComFy: Community and Feature Similarity Guided Rewiring 5
GOAL: A Generalist Combinatorial Optimization Agent Learner 4
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling 5
GOLD: Graph Out-of-Distribution Detection via Implicit Adversarial Latent Generation 7
GOttack: Universal Adversarial Attacks on Graph Neural Networks via Graph Orbits Learning 6
GPS: A Probabilistic Distributional Similarity with Gumbel Priors for Set-to-Set Matching 5
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS 5
GPromptShield: Elevating Resilience in Graph Prompt Tuning Against Adversarial Attacks 3
GRAIN: Exact Graph Reconstruction from Gradients 6
GROOT-2: Weakly Supervised Multimodal Instruction Following Agents 4
GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers 6
GS-CPR: Efficient Camera Pose Refinement via 3D Gaussian Splatting 5
GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting 5
GSBA$^K$: $top$-$K$ Geometric Score-based Black-box Attack 5
GSE: Group-wise Sparse and Explainable Adversarial Attacks 6
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models 4
GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement 3
GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding 5
GameArena: Evaluating LLM Reasoning through Live Computer Games 3
GameGen-X: Interactive Open-world Game Video Generation 6
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher 6
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition 4
Gated Delta Networks: Improving Mamba2 with Delta Rule 4
Gaussian Differentially Private Human Faces Under a Face Radial Curve Representation 3
Gaussian Ensemble Belief Propagation for Efficient Inference in High-Dimensional, Black-box Systems 4
Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping 2
Gaussian Mixture Counterfactual Generator 3
Gaussian Splatting Lucas-Kanade 4
Gaussian-Based Instance-Adaptive Intensity Modeling for Point-Supervised Facial Expression Spotting 4
Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection 3
GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation 3
GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians 5
GeSubNet: Gene Interaction Inference for Disease Subtype Network Generation 4
GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-Time Alignment 3
GenDataAgent: On-the-fly Dataset Augmentation with Synthetic Data 5
GenEx: Generating an Explorable World 4
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling 4
GenVP: Generating Visual Puzzles with Contrastive Hierarchical VAEs 4
GenXD: Generating Any 3D and 4D Scenes 4
General Scene Adaptation for Vision-and-Language Navigation 5
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Power 3
Generalizable Human Gaussians from Single-View Image 6
Generalizable Motion Planning via Operator Learning 5
Generalization Bounds and Model Complexity for Kolmogorov–Arnold Networks 3
Generalization Bounds for Canonicalization: A Comparative Study with Group Averaging 2
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors 5
Generalization and Distributed Learning of GFlowNets 5
Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis 5
Generalization through variance: how noise shapes inductive biases in diffusion models 1
Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data 5
Generalization, Expressivity, and Universality of Graph Neural Networks on Attributed Graphs 4
Generalized Behavior Learning from Diverse Demonstrations 5
Generalized Consistency Trajectory Models for Image Manipulation 5
Generalized Principal-Agent Problem with a Learning Agent 1
Generalized Video Moment Retrieval 4
Generalizing Reasoning Problems to Longer Lengths 4
Generalizing Weisfeiler-Lehman Kernels to Subgraphs 6
Generating Graphs via Spectral Diffusion 5
Generating CAD Code with Vision-Language Models for 3D Designs 4
Generating Freeform Endoskeletal Robots 4
Generating Likely Counterfactuals Using Sum-Product Networks 5
Generating Physical Dynamics under Priors 3
Generation and Comprehension Hand-in-Hand: Vision-guided Expression Diffusion for Boosting Referring Expression Generation and Comprehension 5
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass 5
Generative Classifiers Avoid Shortcut Solutions 5
Generative Flows on Synthetic Pathway for Drug Design 7
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation 4
Generative Monoculture in Large Language Models 5
Generative Representational Instruction Tuning 6
Generative Verifiers: Reward Modeling as Next-Token Prediction 3
Generator Matching: Generative modeling with arbitrary Markov processes 3
GeoILP: A Synthetic Dataset to Guide Large-Scale Rule Induction 5
GeoLoRA: Geometric integration for parameter efficient fine-tuning 4
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training 5
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture 6
Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation 3
Geometry of Lightning Self-Attention: Identifiability and Dimension 2
Geometry of Long-Tailed Representation Learning: Rebalancing Features for Skewed Distributions 4
Geometry of Neural Reinforcement Learning in Continuous State and Action Spaces 2
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits 3
Geometry-aware RL for Manipulation of Varying Shapes and Deformable Objects 4
Glad: A Streaming Scene Generator for Autonomous Driving 5
Glauber Generative Model: Discrete Diffusion Models via Binary Classification 4
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection 6
Global Convergence in Neural ODEs: Impact of Activation Functions 3
Global Convergence of Policy Gradient in Average Reward MDPs 0
Global Identifiability of Overcomplete Dictionary Learning via L1 and Volume Minimization 2
Global Well-posedness and Convergence Analysis of Score-based Generative Models via Sharp Lipschitz Estimates 0
GlycanML: A Multi-Task and Multi-Structure Benchmark for Glycan Machine Learning 5
Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers 4
Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information 6
Going Beyond Static: Understanding Shifts with Time-Series Attribution 3
GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models 5
GotenNet: Rethinking Efficient 3D Equivariant Graph Neural Networks 5
GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision 5
Gradient correlation is a key ingredient to accelerate SGD with momentum 5
Gradient descent with generalized Newton’s method 5
Gradient-Free Generation for Hard-Constrained Systems 5
Gramian Multimodal Representation Learning and Alignment 5
Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach 4
Graph Assisted Offline-Online Deep Reinforcement Learning for Dynamic Workflow Scheduling 7
Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective 4
Graph Neural Networks Can (Often) Count Substructures 5
Graph Neural Networks Gone Hogwild 5
Graph Neural Networks for Edge Signals: Orientation Equivariance and Invariance 5
Graph Neural Preconditioners for Iterative Solutions of Sparse Linear Systems 5
Graph Neural Ricci Flow: Evolving Feature from a Curvature Perspective 7
Graph Sparsification via Mixture of Graphs 6
Graph Transformers Dream of Electric Flow 5
Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting 5
Graph-based Document Structure Analysis 5
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation 4
GraphBridge: Towards Arbitrary Transfer Learning in GNNs 4
GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation 6
GraphRouter: A Graph-based Router for LLM Selections 6
GravMAD: Grounded Spatial Value Maps Guided Action Diffusion for Generalized 3D Manipulation 5
Greener GRASS: Enhancing GNNs with Encoding, Rewiring, and Attention 6
GridMix: Exploring Spatial Modulation for Neural Fields in PDE Modeling 5
Grokking at the Edge of Numerical Stability 4
Grounding Continuous Representations in Geometry: Equivariant Neural Fields 6
Grounding Multimodal Large Language Model in GUI World 5
Grounding Video Models to Actions through Goal Conditioned Exploration 6
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval 4
Group Distributionally Robust Dataset Distillation with Risk Minimization 6
Group Downsampling with Equivariant Anti-aliasing 6
Group Ligands Docking to Protein Pockets 5
Group-robust Sample Reweighting for Subpopulation Shifts via Influence Functions 6
Growth Inhibitors for Suppressing Inappropriate Image Concepts in Diffusion Models 4
Guaranteed Generation from Large Language Models 4
Guided Score identity Distillation for Data-Free One-Step Text-to-Image Generation 6
Gumbel Counterfactual Generation From Language Models 6
Gyrogroup Batch Normalization 6
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs 5
HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis 6
HAMSTER: Hierarchical Action Models for Open-World Robot Manipulation 4
HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics 5
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer 4
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents 5
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models 4
HELM: Hierarchical Encoding for mRNA Language Modeling 4
HELMET: How to Evaluate Long-context Models Effectively and Thoroughly 5
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning 4
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks with Dual Adapters 6
HGM³: Hierarchical Generative Masked Motion Modeling with Hard Token Mining 4
HMoRA: Making LLMs More Effective with Hierarchical Mixture of LoRA Experts 5
HOPE for a Robust Parameterization of Long-memory State Space Models 4
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing 4
HQGS: High-Quality Novel View Synthesis with Gaussian Splatting in Degraded Scenes 5
HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting 4
HShare: Fast LLM Decoding by Hierarchical Key-Value Sharing 5
HaDeMiF: Hallucination Detection and Mitigation in Large Language Models 3
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation 4
Halton Scheduler for Masked Generative Image Transformer 4
Handling Delay in Real-Time Reinforcement Learning 5
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models 5
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models 4
Harnessing Webpage UIs for Text-Rich Visual Understanding 4
Has the Deep Neural Network learned the Stochastic Process? An Evaluation Viewpoint 4
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs 4
HeadMap: Locating and Enhancing Knowledge Circuits in LLMs 5
Heavy-Tailed Diffusion Models 5
HelpSteer2-Preference: Complementing Ratings with Preferences 5
Herald: A Natural Language Annotated Lean 4 Dataset 4
Hessian-Free Online Certified Unlearning 7
HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment 3
HiBug2: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging 5
HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts 4
HiRA: Parameter-Efficient Hadamard High-Rank Adaptation for Large Language Models 5
HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction 5
Hidden in the Noise: Two-Stage Robust Watermarking for Images 6
Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models 3
Hierarchical Uncertainty Estimation for Learning-based Registration in Neuroimaging 5
Hierarchical World Models as Visual Whole-Body Humanoid Controllers 4
Hierarchically Encapsulated Representation for Protocol Design in Self-Driving Labs 6
High-Dimensional Bayesian Optimisation with Gaussian Process Prior Variational Autoencoders 6
High-Dynamic Radar Sequence Prediction for Weather Nowcasting Using Spatiotemporal Coherent Gaussian Representation 5
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity 5
High-Quality Joint Image and Video Tokenization with Causal VAE 3
High-dimension Prototype is a Better Incremental Object Detection Learner 4
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws 3
High-quality Text-to-3D Character Generation with SparseCubes and Sparse Transformers. 4
Higher-Order Graphon Neural Networks: Approximation and Cut Distance 3
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning 5
Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data 3
Holistically Evaluating the Environmental Impact of Creating Language Models 3
Holographic Node Representations: Pre-training Task-Agnostic Node Embeddings 7
Homomorphism Counts as Structural Encodings for Graph Learning 5
Homomorphism Expressivity of Spectral Invariant Graph Neural Networks 3
Horizon Generalization in Reinforcement Learning 4
Hot-pluggable Federated Learning: Bridging General and Personalized FL via Dynamic Selection 7
Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive Extension 6
How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning 5
How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework 1
How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension 5
How Does Critical Batch Size Scale in Pre-training? 4
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models? 5
How Far Are We from True Unlearnability? 4
How Feature Learning Can Improve Neural Scaling Laws 2
How Gradient descent balances features: A dynamical analysis for two-layer neural networks 1
How Learnable Grids Recover Fine Detail in Low Dimensions: A Neural Tangent Kernel Analysis of Multigrid Parametric Encodings 4
How Low Can You Go? Searching for the Intrinsic Dimensionality of Complex Networks using Metric Node Embeddings 5
How Much is Unseen Depends Chiefly on Information About the Seen 4
How Much is a Noisy Image Worth? Data Scaling Laws for Ambient Diffusion. 3
How efficient is LLM-generated code? A rigorous & high-standard benchmark 6
How many samples are needed to train a deep neural network? 4
How much of my dataset did you use? Quantitative Data Usage Inference in Machine Learning 6
How new data permeates LLM knowledge and how to dilute it 3
How to Evaluate Reward Models for RLHF 5
How to Find the Exact Pareto Front for Multi-Objective MDPs? 2
How to Probe: Simple Yet Effective Techniques for Improving Post-hoc Explanations 4
How to Verify Any (Reasonable) Distribution Property: Computationally Sound Argument Systems for Distributions 1
Human Simulacra: Benchmarking the Personification of Large Language Models 5
Human-Aligned Chess With a Bit of Search 5
Human-inspired Episodic Memory for Infinite Context LLMs 5
Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors 5
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment 5
HyPoGen: Optimization-Biased Hypernetworks for Generalizable Policy Generation 5
Hybrid Regularization Improves Diffusion-based Inverse Problem Solving 6
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation 5
Hymba: A Hybrid-head Architecture for Small Language Models 6
Hyper-Connections 4
HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks 3
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere 6
HyperPLR: Hypergraph Generation through Projection, Learning, and Reconstruction 5
Hyperbolic Genome Embeddings 4
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models 5
I Can Hear You: Selective Robust Training for Deepfake Audio Detection 5
I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength 6
ICLR: In-Context Learning of Representations 3
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model 4
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations 3
IDInit: A Universal and Stable Initialization Method for Neural Network Training 3
iFormer: Integrating ConvNet and Transformer for Mobile Application 4
IGL-Bench: Establishing the Comprehensive Benchmark for Imbalanced Graph Learning 6
ILLUSION: Unveiling Truth with a Comprehensive Multi-Modal, Multi-Lingual Deepfake Dataset 4
IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning 4
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge 4
INFER: A Neural-symbolic Model For Extrapolation Reasoning on Temporal Knowledge Graph 6
INS: Interaction-aware Synthesis to Enhance Offline Multi-agent Reinforcement Learning 5
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts 3
IRIS: LLM-Assisted Static Analysis for Detecting Security Vulnerabilities 5
IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis 6
Identifiability for Gaussian Processes with Holomorphic Kernels 4
Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning 4
Identification of Intermittent Temporal Latent Process 5
Identifying latent state transitions in non-linear dynamical systems 5
IgGM: A Generative Model for Functional Antibody and Nanobody Design 6
ImDy: Human Inverse Dynamics from Imitated Observations 4
ImProver: Agent-Based Automated Proof Optimization 3
Image Watermarks are Removable using Controllable Regeneration from Clean Noise 7
Image and Video Tokenization with Binary Spherical Quantization 5
Image-level Memorization Detection via Inversion-based Inference Perturbation 5
ImageFolder: Autoregressive Image Generation with Folded Tokens 5
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination 3
Immunogenicity Prediction with Dual Attention Enables Vaccine Target Selection 5
ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences 5
Implicit Bias of Mirror Flow for Shallow Neural Networks in Univariate Regression 3
Implicit In-context Learning 6
Implicit Neural Surface Deformation with Explicit Velocity Fields 4
Implicit Search via Discrete Diffusion: A Study on Chess 6
Improved Algorithms for Kernel Matrix-Vector Multiplication Under Sparsity Assumptions 4
Improved Approximation Algorithms for $k$-Submodular Maximization via Multilinear Extension 1
Improved Convergence Rate for Diffusion Probabilistic Models 1
Improved Diffusion-based Generative Model with Better Adversarial Robustness 6
Improved Finite-Particle Convergence Rates for Stein Variational Gradient Descent 0
Improved Sampling Algorithms for Lévy-Itô Diffusion Models 3
Improved Sampling Of Diffusion Models In Fluid Dynamics With Tweedie's Formula 6
Improved Techniques for Optimization-Based Jailbreaking on Large Language Models 5
Improved Training Technique for Latent Consistency Models 3
Improving Complex Reasoning with Dynamic Prompt Corruption: A Soft Prompt Optimization Approach 3
Improving Convergence Guarantees of Random Subspace Second-order Algorithm for Nonconvex Optimization 4
Improving Data Efficiency via Curating LLM-Driven Rating Systems 6
Improving Deep Regression with Tightness 4
Improving Equivariant Networks with Probabilistic Symmetry Breaking 6
Improving Generalization and Robustness in SNNs Through Signed Rate Encoding and Sparse Encoding Attacks 5
Improving Graph Neural Networks by Learning Continuous Edge Directions 5
Improving Instruction-Following in Language Models through Activation Steering 4
Improving Language Model Distillation through Hidden State Matching 5
Improving Large Language Model Planning with Action Sequence Similarity 5
Improving Long-Text Alignment for Text-to-Image Diffusion Models 6
Improving Neural Network Accuracy by Concurrently Training with a Twin Network 3
Improving Neural Optimal Transport via Displacement Interpolation 5
Improving Pretraining Data Using Perplexity Correlations 6
Improving Probabilistic Diffusion Models With Optimal Diagonal Covariance Matching 6
Improving Reasoning Performance in Large Language Models via Representation Engineering 4
Improving Semantic Understanding in Speech Language Models via Brain-tuning 5
Improving Uncertainty Estimation through Semantically Diverse Language Generation 5
Improving Unsupervised Constituency Parsing via Maximizing Semantic Information 5
Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency 4
Imputation for prediction: beware of diminishing returns. 4
In Search of Forgotten Domain Generalization 5
In vivo cell-type and brain region classification via multimodal contrastive learning 4
In-Context Editing: Learning Knowledge from Self-Induced Distributions 5
In-context Time Series Predictor 5
InCoDe: Interpretable Compressed Descriptions For Image Generation 6
Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On 6
Incremental Causal Effect for Time to Treatment Initialization 2
Indirect Gradient Matching for Adversarial Robust Distillation 4
Inference Optimal VLMs Need Fewer Visual Tokens and More Parameters 3
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving 4
Inference Scaling for Long-Context Retrieval Augmented Generation 2
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models 4
Infilling Score: A Pretraining Data Detection Algorithm for Large Language Models 5
Infinite-Resolution Integral Noise Warping for Diffusion Models 4
Influence Functions for Scalable Data Attribution in Diffusion Models 6
Influence-Guided Diffusion for Dataset Distillation 6
InfoGS: Efficient Structure-Aware 3D Gaussians via Lightweight Information Shaping 4
Information Theoretic Text-to-Image Alignment 6
Injecting Universal Jailbreak Backdoors into LLMs in Minutes 6
Injective flows for star-like manifolds 4
Inner Information Analysis Algorithm for Deep Neural Network based on Community 3
Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps 5
Input Space Mode Connectivity in Deep Neural Networks 4
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation 5
Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct 1
InstaRevive: One-Step Image Enhancement via Dynamic Score Matching 4
InstaSHAP: Interpretable Additive Models Explain Shapley Values Instantly 2
InstaTrain: Adaptive Training via Ultra-Fast Natural Annealing within Dynamical Systems 5
Instance-dependent Early Stopping 6
Instant Policy: In-Context Imitation Learning via Graph Diffusion 5
InstantPortrait: One-Step Portrait Editing via Diffusion Multi-Objective Distillation 3
InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting 4
InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences 3
Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning 5
InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales 6
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy 5
Integral Performance Approximation for Continuous-Time Reinforcement Learning Control 7
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows 4
Integrative Decoding: Improving Factuality via Implicit Self-consistency 5
Intelligence at the Edge of Chaos 6
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models 5
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention 4
InterMask: 3D Human Interaction Generation via Collaborative Masked Modeling 3
Interaction Asymmetry: A General Principle for Learning Composable Abstractions 4
Interactive Adjustment for Human Trajectory Prediction with Individual Feedback 5
Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface 4
Interference Among First-Price Pacing Equilibria: A Bias and Variance Analysis 2
Interleaved Scene Graphs for Interleaved Text-and-Image Generation Assessment 4
Intermediate Layer Classifiers for OOD generalization 6
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence 3
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks 6
Interpretable Causal Representation Learning for Biological Data in the Pathway Space 4
Interpretable Unsupervised Joint Denoising and Enhancement for Real-World low-light Scenarios 3
Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology 5
Interpreting Emergent Planning in Model-Free Reinforcement Learning 4
Interpreting Language Reward Models via Contrastive Explanations 2
Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations 5
Interpreting the Second-Order Effects of Neurons in CLIP 4
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning 5
Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs 6
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations 5
Intrinsic User-Centric Interpretability through Global Mixture of Experts 4
Inverse Attention Agents for Multi-Agent Systems 5
Inverse Constitutional AI: Compressing Preferences into Principles 4
Inverse Rendering using Multi-Bounce Path Tracing and Reservoir Sampling 3
Inverse decision-making using neural amortized Bayesian actors 4
InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences 5
InversionGNN: A Dual Path Network for Multi-Property Molecular Optimization 5
InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma 2
Investigating Pattern Neurons in Urban Time Series Forecasting 6
Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning 4
Is Factuality Enhancement a Free Lunch For LLMs? Better Factuality Can Lead to Worse Context-Faithfulness 3
Is In-Context Learning Sufficient for Instruction Following in LLMs? 4
Is Large-scale Pretraining the Secret to Good Domain Generalization? 5
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist 4
Is Your Multimodal Language Model Oversensitive to Safe Queries? 5
Is Your Video Language Model a Reliable Judge? 1
Is uniform expressivity too restrictive? Towards efficient expressivity of GNNs 5
Isometric Regularization for Manifolds of Functional Data 7
It Helps to Take a Second Opinion: Teaching Smaller LLMs To Deliberate Mutually via Selective Rationale Optimisation 5
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation 5
IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking 5
Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision 5
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning 3
Iterative Substructure Extraction for Molecular Relational Learning with Interactive Graph Information Bottleneck 5
JPEG Inspired Deep Learning 5
Jailbreak Antidote: Runtime Safety-Utility Balance via Sparse Representation Adjustment in Large Language Models 4
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks 6
Jailbreaking as a Reward Misspecification Problem 6
Jamba: Hybrid Transformer-Mamba Language Models 5
JetFormer: An autoregressive generative model of raw images and text 3
Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity 6
Joint Gradient Balancing for Data Ordering in Finite-Sum Multi-Objective Optimization 5
Joint Graph Rewiring and Feature Denoising via Spectral Resonance 6
Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment 5
Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment 3
JudgeBench: A Benchmark for Evaluating LLM-Based Judges 3
JudgeLM: Fine-tuned Large Language Models are Scalable Judges 5
Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models 5
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge 5
K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models 3
KAA: Kolmogorov-Arnold Attention for Enhancing Attentive Graph Neural Networks 4
KAN: Kolmogorov–Arnold Networks 4
KBLaM: Knowledge Base augmented Language Model 5
KGARevion: An AI Agent for Knowledge-Intensive Biomedical QA 6
KLay: Accelerating Arithmetic Circuits for Neurosymbolic AI 5
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks 1
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models 6
Kernel-based Optimally Weighted Conformal Time-Series Prediction 4
KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models 5
KinFormer: Generalizable Dynamical Symbolic Regression for Catalytic Organic Reaction Kinetics 5
KinPFN: Bayesian Approximation of RNA Folding Kinetics using Prior-Data Fitted Networks 5
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks 6
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding 5
Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution 3
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition 5
Knowledge Graph Finetuning Enhances Knowledge Manipulation in Large Language Models 4
Knowledge Localization: Mission Not Accomplished? Enter Query Localization! 4
Kolmogorov-Arnold Transformer 4
KooNPro: A Variance-Aware Koopman Probabilistic Model Enhanced by Neural Process for Time Series Forecasting 5
Kronecker Mask and Interpretive Prompts are Language-Action Video Learners 5
L-WISE: Boosting Human Visual Category Learning Through Model-Based Image Selection and Enhancement 4
L3Ms — Lagrange Large Language Models 5
LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding 6
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior 2
LASER: A Neuro-Symbolic Framework for Learning Spatio-Temporal Scene Graphs with Weak Supervision 4
LASeR: Towards Diversified and Generalizable Robot Design with Large Language Models 6
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics 5
LICO: Large Language Models for In-Context Molecular Optimization 5
LICORICE: Label-Efficient Concept-Based Interpretable Reinforcement Learning 5
LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh 4
LLM Unlearning via Loss Adjustment with Only Forget Data 4
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models 6
LLM-based Typed Hyperresolution for Commonsense Reasoning with Knowledge Bases 4
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension 5
LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch 5
LLMs Can Plan Only If We Tell Them 2
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations 4
LLaMA-Omni: Seamless Speech Interaction with Large Language Models 6
LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing 3
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy 4
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models 5
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token 5
LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation 5
LOIRE: LifelOng learning on Incremental data via pre-trained language model gRowth Efficiently 5
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models 3
LR0.FM: LOW-RESOLUTION ZERO-SHOT CLASSIFICATION BENCHMARK FOR FOUNDATION MODELS 3
LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias 5
LaGeM: A Large Geometry Model for 3D Representation Learning and Diffusion 6
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning 4
LaMPlace: Learning to Optimize Cross-Stage Metrics in Macro Placement 6
Lambda-Skip Connections: the architectural component that prevents Rank Collapse 2
LancBiO: Dynamic Lanczos-aided Bilevel Optimization via Krylov Subspace 6
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning 6
Language Agents Meet Causality -- Bridging LLMs and Causal World Models 6
Language Guided Skill Discovery 2
Language Imbalance Driven Rewarding for Multilingual Self-improving 5
Language Model Alignment in Multilingual Trolley Problems 3
Language Models Are Implicitly Continuous 3
Language Models Learn to Mislead Humans via RLHF 1
Language Models Need Inductive Biases to Count Inductively 4
Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice 4
Language Models are Advanced Anonymizers 5
Language Representations Can be What Recommenders Need: Findings and Potentials 4
Language models scale reliably with over-training and on downstream tasks 4
Language-Assisted Feature Transformation for Anomaly Detection 6
Language-Image Models with 3D Understanding 5
Laplace Sample Information: Data Informativeness Through a Bayesian Lens 5
Large (Vision) Language Models are Unsupervised In-Context Learners 6
Large Convolutional Model Tuning via Filter Subspace 4
Large Language Models Assume People are More Rational than We Really are 2
Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation 4
Large Language Models Often Say One Thing and Do Another 5
Large Language Models are Interpretable Learners 4
Large Language Models can Become Strong Self-Detoxifiers 3
Large Scale Knowledge Washing 4
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding 4
Lasso Bandit with Compatibility Condition on Optimal Arm 4
Last Iterate Convergence of Incremental Methods as a Model of Forgetting 2
Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games 5
Latent Action Pretraining from Videos 5
Latent Bayesian Optimization via Autoregressive Normalizing Flows 5
Latent Radiance Fields with 3D-aware 2D Representations 5
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning 5
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation 5
Latent-EnSF: A Latent Ensemble Score Filter for High-Dimensional Data Assimilation with Sparse Observation Data 4
Law of the Weakest Link: Cross Capabilities of Large Language Models 5
Lawma: The Power of Specialization for Legal Annotation 5
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models 3
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation 5
Layerwise Recurrent Router for Mixture-of-Experts 5
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint 3
LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models 6
Lean-STaR: Learning to Interleave Thinking and Proving 4
LeanAgent: Lifelong Learning for Formal Theorem Proving 5
LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid 6
Learn Your Reference Model for Real Good Alignment 5
Learn hybrid prototypes for multivariate time series anomaly detection 4
Learn-by-interact: A Data-Centric Framework For Self-Adaptive Agents in Realistic Environments 4
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion 4
Learned Reference-based Diffusion Sampler for multi-modal distributions 6
Learning 3D Perception from Others' Predictions 4
Learning Causal Alignment for Reliable Disease Diagnosis 5
Learning Chaos In A Linear Way 7
Learning Clustering-based Prototypes for Compositional Zero-Shot Learning 6
Learning Color Equivariant Representations 5
Learning Continually by Spectral Regularization 4
Learning Diagrams: A Graphical Language for Compositional Training Regimes 4
Learning Distributions of Complex Fluid Simulations with Diffusion Graph Networks 7
Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning 5
Learning Dynamics of Deep Matrix Factorization Beyond the Edge of Stability 2
Learning Dynamics of LLM Finetuning 4
Learning Efficient Positional Encodings with Graph Neural Networks 6
Learning Equivariant Non-Local Electron Density Functionals 5
Learning Evolving Tools for Large Language Models 7
Learning Fine-Grained Representations through Textual Token Disentanglement in Composed Video Retrieval 5
Learning Gain Map for Inverse Tone Mapping 5
Learning General-purpose Biomedical Volume Representations using Randomized Synthesis 6
Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation 6
Learning Geometric Reasoning Networks For Robot Task And Motion Planning 5
Learning Graph Invariance by Harnessing Spuriosity 6
Learning Graph Quantized Tokenizers 6
Learning Harmonized Representations for Speculative Sampling 5
Learning Hierarchical Polynomials of Multiple Nonlinear Features 3
Learning High-Degree Parities: The Crucial Role of the Initialization 2
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation 3
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models 2
Learning Interpretable Hierarchical Dynamical Systems Models from Time Series Data 5
Learning LLM-as-a-Judge for Preference Alignment 6
Learning Long Range Dependencies on Graphs via Random Walks 5
Learning Mask Invariant Mutual Information for Masked Image Modeling 5
Learning Molecular Representation in a Cell 5
Learning Multi-Index Models with Neural Networks via Mean-Field Langevin Dynamics 3
Learning Neural Networks with Distribution Shift: Efficiently Certifiable Guarantees 1
Learning Partial Graph Matching via Optimal Partial Transport 5
Learning Randomized Algorithms with Transformers 2
Learning Robust Representations with Long-Term Information for Generalization in Visual Reinforcement Learning 6
Learning Shape-Independent Transformation via Spherical Representations for Category-Level Object Pose Estimation 4
Learning Spatial-Semantic Features for Robust Video Object Segmentation 4
Learning Spatiotemporal Dynamical Systems from Point Process Observations 5
Learning Splitting Heuristics in Divide-and-Conquer SAT Solvers with Reinforcement Learning 4
Learning Structured Representations by Embedding Class Hierarchy with Fast Optimal Transport 6
Learning Structured Universe Graph with Outlier OOD Detection for Partial Matching 5
Learning Successor Features with Distributed Hebbian Temporal Memory 6
Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning 6
Learning Transformer-based World Models with Contrastive Predictive Coding 3
Learning Video-Conditioned Policy on Unlabelled Data with Joint Embedding Predictive Transformer 5
Learning View-invariant World Models for Visual Robotic Manipulation 5
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory 3
Learning a Neural Solver for Parametric PDEs to Enhance Physics-Informed Methods 6
Learning and aligning single-neuron invariance manifolds in visual cortex 5
Learning from End User Data with Shuffled Differential Privacy over Kernel Densities 6
Learning from Imperfect Human Feedback: A Tale from Corruption-Robust Dueling 2
Learning from negative feedback, or positive feedback or both 2
Learning from weak labelers as constraints 4
Learning local equivariant representations for quantum operators 5
Learning mirror maps in policy mirror descent 5
Learning on One Mode: Addressing Multi-modality in Offline Reinforcement Learning 5
Learning stochastic dynamics from snapshots through regularized unbalanced optimal transport 5
Learning system dynamics without forgetting 5
Learning the Complexity of Weakly Noisy Quantum States 3
Learning the Optimal Stopping for Early Classification within Finite Horizons via Sequential Probability Ratio Test 7
Learning to Adapt Frozen CLIP for Few-Shot Test-Time Domain Adaptation 5
Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training 5
Learning to Communicate Through Implicit Communication Channels 5
Learning to Contextualize Web Pages for Enhanced Decision Making by LLM Agents 5
Learning to Discover Regulatory Elements for Gene Expression Prediction 5
Learning to Discretize Denoising Diffusion ODEs 6
Learning to Explore and Exploit with GNNs for Unsupervised Combinatorial Optimization 5
Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels 3
Learning to Help in Multi-Class Settings 5
Learning to Plan Before Answering: Self-Teaching LLMs to Learn Abstract Plans for Problem Solving 3
Learning to Search from Demonstration Sequences 4
Learning to Select Nodes in Branch and Bound with Sufficient Tree Representation 6
Learning to Solve Differential Equation Constrained Optimization Problems 4
Learning to Steer Markovian Agents under Model Uncertainty 4
Learning to engineer protein flexibility 4
Learning under Temporal Label Noise 5
Learning vector fields of differential equations on manifolds with geometrically constrained operator-valued kernels 6
Learning-Augmented Frequent Directions 3
Learning-Augmented Search Data Structures 6
Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling 6
Leave-One-Out Stable Conformal Prediction 5
Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models 5
Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model 5
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions 7
Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting 5
Let the Code LLM Edit Itself When You Edit the Code 4
LevAttention: Time, Space and Streaming Efficient Algorithm for Heavy Attentions 3
Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction 5
Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD 5
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning 5
Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs 4
Leveraging Variable Sparsity to Refine Pareto Stationarity in Multi-Objective Optimization 5
LiFT: Learning to Fine-Tune via Bayesian Parameter Efficient Meta Fine-Tuning 5
LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging 5
Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups 4
Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space 6
Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models 4
Lightweight Neural App Control 2
Lightweight Predictive 3D Gaussian Splats 4
Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory 6
Limits to scalable evaluation at the frontier: LLM as judge won’t beat twice the data 4
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better 4
Linear Mode Connectivity in Differentiable Tree Ensembles 6
Linear Multistep Solver Distillation for Fast Sampling of Diffusion Models 5
Linear Partial Gromov-Wasserstein Embedding 5
Linear Representations of Political Perspective Emerge in Large Language Models 4
Linear SCM Identification in the Presence of Confounders and Gaussian Noise 1
Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data 5
Linear Transformer Topological Masking with Graph Random Features 3
Linear combinations of latents in generative models: subspaces and beyond 3
Lines of Thought in Large Language Models 5
Lipschitz Bandits in Optimal Space 2
LiveBench: A Challenging, Contamination-Limited LLM Benchmark 4
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code 4
LiveXiv - A Multi-Modal live benchmark based on Arxiv papers content 3
LoCA: Location-Aware Cosine Adaptation for Parameter-Efficient Fine-Tuning 5
LoCoDL: Communication-Efficient Distributed Learning with Local Training and Compression 3
LoLCATs: On Low-Rank Linearizing of Large Language Models 6
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation 5
LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization 6
LoRA-Pro: Are Low-Rank Adapters Properly Optimized? 6
LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation 4
LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation models 4
Local Loss Optimization in the Infinite Width: Stable Parameterization of Predictive Coding Networks and Target Propagation 3
Local Patterns Generalize Better for Novel Anomalies 6
Local Steps Speed Up Local GD for Heterogeneous Distributed Logistic Regression 4
Local convergence of simultaneous min-max algorithms to differential equilibrium on Riemannian manifold 4
Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection 5
Locality Alignment Improves Vision-Language Models 5
Locality Sensitive Avatars From Video 5
Locality-aware Gaussian Compression for Fast and High-quality Rendering 5
Locally Connected Echo State Networks for Time Series Forecasting 5
LocoVR: Multiuser Indoor Locomotion Dataset in Virtual Reality 5
Logic-Logit: A Logic-Based Approach to Choice Modeling 5
Logical Consistency of Large Language Models in Fact-Checking 6
Logically Consistent Language Models via Neuro-Symbolic Integration 5
Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference 5
Long Context Compression with Activation Beacon 5
Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG 4
Long-Context Linear System Identification 1
Long-Sequence Recommendation Models Need Decoupled Embeddings 4
Long-Short Decision Transformer: Bridging Global and Local Dependencies for Generalized Decision-Making 4
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection 5
Long-tailed Adversarial Training with Self-Distillation 3
Long-time asymptotics of noisy SVGD outside the population limit 4
LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs 6
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement 4
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory 4
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization 5
LongVILA: Scaling Long-Context Visual Language Models for Long Videos 5
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs 3
Longhorn: State Space Models are Amortized Online Learners 4
Look Before You Leap: Universal Emergent Mechanism for Retrieval in Language Models 1
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets 4
Looking Backward: Streaming Video-to-Video Translation with Feature Banks 5
Looking Inward: Language Models Can Learn About Themselves by Introspection 3
Looking into User’s Long-term Interests through the Lens of Conservative Evidential Learning 6
Looped Transformers for Length Generalization 4
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency 3
Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding 2
Lossy Compression with Pretrained Diffusion Models 5
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction 3
LucidPPN: Unambiguous Prototypical Parts Network for User-centric Interpretable Computer Vision 4
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation 4
MA$^2$E: Addressing Partial Observability in Multi-Agent Reinforcement Learning with Masked Auto-Encoder 5
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions 5
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization 6
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL 4
MADGEN: Mass-Spec attends to De Novo Molecular generation 4
MAESTRO: Masked Encoding Set Transformer with Self-Distillation 5
MAGE: Model-Level Graph Neural Networks Explanations via Motif-based Graph Generation 5
MAGNet: Motif-Agnostic Generation of Molecules from Scaffolds 4
MAI: A Multi-turn Aggregation-Iteration Model for Composed Image Retrieval 5
MANTRA: The Manifold Triangulations Assemblage 4
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation 4
MAP: Multi-Human-Value Alignment Palette 5
MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science 4
MAST: model-agnostic sparsified training 6
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine 4
MCNC: Manifold-Constrained Reparameterization for Neural Compression 6
MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation 5
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks 3
MELODI: Exploring Memory Compression for Long Contexts 3
MGCFNN: A Neural MultiGrid Solver with Novel Fourier Neural Network for High Wave Number Helmholtz Equations 6
MGDA Converges under Generalized Smoothness, Provably 4
MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction 4
MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs 4
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models 3
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Masked Image Modeling Representations 6
MIND over Body: Adaptive Thinking using Dynamic Computation 5
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs 3
MIRACLE 3D: Memory-efficient Integrated Robust Approach for Continual Learning on 3D Point Clouds via Shape Model Construction 4
MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models 4
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering 5
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents 6
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation 5
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs 6
MLPs Learn In-Context on Regression and Classification Tasks 3
MM-EMBED: UNIVERSAL MULTIMODAL RETRIEVAL WITH MULTIMODAL LLMS 5
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning 3
MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection 4
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark 5
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models 4
MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation 5
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? 2
MMEgo: Towards Building Egocentric Multimodal LLMs for Video QA 3
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs 6
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models 3
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models 4
MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge 3
MMQA: Evaluating LLMs with Multi-Table Multi-Hop Complex Questions 5
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation 4
MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents 5
MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines 4
MMTEB: Massive Multilingual Text Embedding Benchmark 5
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos 3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models 6
MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks 6
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses 1
MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection 6
MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation 6
MQuAKE-Remastered: Multi-Hop Knowledge Editing Can Only Be Advanced with Reliable Evaluations 5
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation 4
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models 2
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance 4
MTSAM: Multi-Task Fine-Tuning for Segment Anything Model 3
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models 3
MUSE: Machine Unlearning Six-Way Evaluation for Language Models 4
MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow 3
M^3PC: Test-time Model Predictive Control using Pretrained Masked Trajectory Model 5
MaRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers 4
Machine Unlearning Fails to Remove Data Poisoning Attacks 4
Machine Unlearning via Simulated Oracle Matching 5
MaestroMotif: Skill Design from Artificial Intelligence Feedback 3
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding 3
MagicPIG: LSH Sampling for Efficient LLM Generation 5
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment 4
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing 4
Maintaining Structural Integrity in Parameter Spaces for Parameter Efficient Fine-tuning 4
Make Haste Slowly: A Theory of Emergent Structured Mixed Selectivity in Feature Learning ReLU Networks 4
Making Text Embedders Few-Shot Learners 3
Making Transformer Decoders Better Differentiable Indexers 2
MallowsPO: Fine-Tune Your LLM with Preference Dispersions 4
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations 5
MamKO: Mamba-based Koopman operator for modeling and predictive control 4
MambaExtend: A Training-Free Approach to Improve Long Context Extension of Mamba 6
MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba 6
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods 3
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks 5
Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling 4
Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images 6
Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion 4
Many-Objective Multi-Solution Transport 6
MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model 4
Mask in the Mirror: Implicit Sparsification 4
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs 5
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer 5
Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling 4
Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos 5
Mastering Task Arithmetic: $\tau$Jp as a Key Indicator for Weight Disentanglement 4
MatExpert: Decomposing Materials Discovery By Mimicking Human Experts 6
Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation 6
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code 3
MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs 2
Matrix Product Sketching via Coordinated Sampling 2
Matryoshka Multimodal Models 5
MatryoshkaKV: Adaptive KV Compression via Trainable Orthogonal Projection 4
Matérn Kernels for Tunable Implicit Surface Reconstruction 5
MaxCutPool: differentiable feature-aware Maxcut for pooling in graph neural networks 6
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization 5
Maximizing the Potential of Synthetic Data: Insights from Random Matrix Theory 3
McEval: Massively Multilingual Code Evaluation 6
MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction 3
Measuring And Improving Engagement of Text-to-Image Generation Models 4
Measuring And Improving Persuasiveness Of Large Language Models 6
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models 3
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse 5
Measuring memorization in RLHF for code completion 4
Mechanism and Emergence of Stacked Attention Heads in Multi-Layer Transformers 1
Mechanistic Permutability: Match Features Across Layers 2
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine 3
MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models 5
Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets 5
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis 4
Memory Efficient Transformer Adapter for Dense Predictions 4
Memory Mosaics 5
Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering 3
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers 4
MeshMask: Physics-Based Simulations with Masked Graph Neural Networks 3
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold 6
Meta-Continual Learning of Neural Fields 6
Meta-Dynamical State Space Models for Integrative Neural Data Analysis 3
MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis 2
MetaMetrics: Calibrating Metrics for Generation Tasks Using Human Preferences 6
MetaOOD: Automatic Selection of OOD Detection Models 6
MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility 5
Metalic: Meta-Learning In-Context with Protein Language Models 5
Metamizer: A Versatile Neural Optimizer for Fast and Accurate Physics Simulations 3
MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models 6
Methods for Convex $(L_0,L_1)$-Smooth Optimization: Clipping, Acceleration, and Adaptivity 3
Methods with Local Steps and Random Reshuffling for Generally Smooth Non-Convex Federated Optimization 4
Metric-Driven Attributions for Vision Transformers 5
Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks 5
Min-K%++: Improved Baseline for Pre-Training Data Detection from Large Language Models 4
Mind Control through Causal Inference: Predicting Clean Images from Poisoned Data 5
Mind the GAP: Glimpse-based Active Perception improves generalization and sample efficiency of visual reasoning 4
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models 5
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher 3
MindSimulator: Exploring Brain Concept Localization via Synthetic fMRI 4
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid 5
Mini-batch Coresets for Memory-efficient Language Model Training on Data Mixtures 4
MiniPLM: Knowledge Distillation for Pre-training Language Models 5
Minimal Impact ControlNet: Advancing Multi-ControlNet Integration 3
Minimal Variance Model Aggregation: A principled, non-intrusive, and versatile integration of black box models 3
Minimalistic Predictions for Online Class Constraint Scheduling 0
Minimax Optimal Reinforcement Learning with Quasi-Optimism 3
Minimax Optimal Two-Stage Algorithm For Moment Estimation Under Covariate Shift 4
Mining your own secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models 4
Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error 1
Mitigate the Gap: Improving Cross-Modal Alignment in CLIP 5
Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization 5
Mitigating Memorization in Language Models 6
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality 2
Mitigating Object Hallucination in MLLMs via Data-augmented Phrase-level Alignment 5
Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning 5
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization 4
Mitigating Spurious Correlations in Zero-Shot Multimodal Models 6
Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace 4
Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment 3
Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN 3
MixEval-X: Any-to-any Evaluations from Real-world Data Mixture 3
MixMax: Distributional Robustness in Function Space via Optimal Data Mixtures 5
Mixture Compressor for Mixture-of-Experts LLMs Gains More 5
Mixture of Attentions For Speculative Decoding 4
Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models 5
Mixture of In-Context Prompters for Tabular PFNs 5
Mixture of Parrots: Experts improve memorization more than reasoning 3
Mixture-of-Agents Enhances Large Language Model Capabilities 3
MoDGS: Dynamic Gaussian Splatting from Casually-captured Monocular Videos with Depth Priors 4
MoDeGPT: Modular Decomposition for Large Language Model Compression 5
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts 4
MoLEx: Mixture of Layer Experts for Fine-tuning with Sparse Upcycling 5
MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards 5
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists 5
Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity 6
Model Equality Testing: Which Model is this API Serving? 4
Model Risk-sensitive Offline Reinforcement Learning 4
Model merging with SVD to tie the Knots 5
Model-Agnostic Knowledge Guided Correction for Improved Neural Surrogate Rollout 5
Model-Free Offline Reinforcement Learning with Enhanced Robustness 3
Model-agnostic meta-learners for estimating heterogeneous treatment effects over time 5
Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning 6
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds 1
Modeling Complex System Dynamics with Flow Matching Across Time and Conditions 5
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning 5
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions 5
Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning 4
Modeling dynamic social vision highlights gaps between deep learning and humans 5
MolSpectra: Pre-training 3D Molecular Representation with Multi-modal Energy Spectra 6
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion 4
Moner: Motion Correction in Undersampled Radial MRI with Unsupervised Neural Representation 5
Monet: Mixture of Monosemantic Experts for Transformers 5
Monitoring Latent World States in Language Models with Propositional Probes 5
Monte Carlo Planning with Large Language Model for Text-Based Game Agents 5
Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning 4
Moral Alignment for LLM Agents 5
More Experts Than Galaxies: Conditionally-Overlapping Experts with Biologically-Inspired Fixed Routing 5
More RLHF, More Trust? On The Impact of Preference Alignment On Trustworthiness 4
Morphing Tokens Draw Strong Masked Image Models 6
MorphoDiff: Cellular Morphology Painting with Diffusion Models 5
MotherNet: Fast Training and Inference via Hyper-Network Transformers 5
Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning 4
Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs 3
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion 5
MotionClone: Training-Free Motion Cloning for Controllable Video Generation 3
MotionDreamer: One-to-Many Motion Synthesis with Localized Generative Masked Transformer 5
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences 4
MrSteve: Instruction-Following Agents in Minecraft with What-Where-When Memory 4
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models 4
MuHBoost: Multi-Label Boosting For Practical Longitudinal Human Behavior Modeling 7
MuPT: A Generative Symbolic Music Pretrained Transformer 4
Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM 3
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding 4
Multi-Dimensional Conformal Prediction 5
Multi-Draft Speculative Sampling: Canonical Decomposition and Theoretical Limits 5
Multi-Field Adaptive Retrieval 6
Multi-Label Node Classification with Label Influence Propagation 5
Multi-Label Test-Time Adaptation with Bound Entropy Minimization 5
Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen 7
Multi-Perspective Data Augmentation for Few-shot Object Detection 5
Multi-Resolution Decomposable Diffusion Model for Non-Stationary Time Series Anomaly Detection 4
Multi-Reward as Condition for Instruction-based Image Editing 4
Multi-Robot Motion Planning with Diffusion Models 5
Multi-Scale Fusion for Object Representation 4
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation 5
Multi-Task Dense Predictions via Unleashing the Power of Diffusion 6
Multi-agent cooperation through learning-aware policy gradients 3
Multi-domain Distribution Learning for De Novo Drug Design 4
Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning 4
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage 4
Multi-modal brain encoding models for multi-modal stimuli 5
Multi-objective Differentiable Neural Architecture Search 6
Multi-objective antibody design with constrained preference optimization 5
Multi-session, multi-task neural decoding from distinct cell-types and brain regions 5
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains 5
Multilevel Generative Samplers for Investigating Critical Phenomena 4
Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning 5
Multimodal Lego: Model Merging and Fine-Tuning Across Topologies and Modalities in Biomedicine 5
Multimodal Quantitative Language for Generative Recommendation 5
Multimodal Situational Safety 3
Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap 6
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation 5
Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning 5
Multiplicative Logit Adjustment Approximates Neural-Collapse-Aware Decision Boundary Adjustment 6
Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning 3
MuseGNN: Forming Scalable, Convergent GNN Layers that Minimize a Sampling-Based Energy 7
Mutual Effort for Efficiency: A Similarity-based Token Pruning for Vision Transformers in Self-Supervised Learning 4
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver 5
N-ForGOT: Towards Not-forgetting and Generalization of Open Temporal Graph Learning 6
ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction 4
NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance 4
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation 6
NExUME: Adaptive Training and Inference for DNNs under Intermittent Power Environments 4
NL-Eye: Abductive NLI For Images 2
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals 6
NRGBoost: Energy-Based Generative Boosted Trees 6
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval 6
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models 4
NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer 5
NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative 5
Narrowing Information Bottleneck Theory for Multimodal Image-Text Representations Interpretability 3
Natural Language Inference Improves Compositionality in Vision-Language Models 4
NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics 4
Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence 5
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents 6
Navigation-Guided Sparse Scene Representation for End-to-End Autonomous Driving 4
NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields 5
NeSyC: A Neuro-symbolic Continual Learner For Complex Embodied Tasks in Open Domains 4
Near, far: Patch-ordering enhances vision foundation models' scene understanding 4
Near-Exact Privacy Amplification for Matrix Mechanisms 5
Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency 4
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form 3
Near-optimal Active Regression of Single-Index Models 1
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs 5
Needle Threading: Can LLMs Follow Threads Through Near-Million-Scale Haystacks? 3
Nesterov acceleration in benignly non-convex landscapes 1
NetFormer: An interpretable model for recovering dynamical connectivity in neuronal population dynamics 6
NetMoE: Accelerating MoE Training through Dynamic Sample Placement 3
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions 5
Neural Approximate Mirror Maps for Constrained Diffusion Models 5
Neural Causal Graph for Interpretable and Intervenable Classification 6
Neural Context Flows for Meta-Learning of Dynamical Systems 6
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback 3
Neural Eulerian Scene Flow Fields 4
Neural Exploratory Landscape Analysis for Meta-Black-Box-Optimization 6
Neural Fluid Simulation on Geometric Surfaces 4
Neural Functions for Learning Periodic Signal 6
Neural Interactive Proofs 4
Neural Multi-Objective Combinatorial Optimization via Graph-Image Multimodal Fusion 4
Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning 5
Neural Phylogeny: Fine-Tuning Relationship Detection among Neural Networks 4
Neural Sampling from Boltzmann Densities: Fisher-Rao Curves in the Wasserstein Geometry 2
Neural Spacetimes for DAG Representation Learning 4
Neural Stochastic Differential Equations for Uncertainty-Aware Offline RL 5
Neural Wave Equation for Irregularly Sampled Sequence Data 5
Neural networks on Symmetric Spaces of Noncompact Type 5
NeuralPlane: Structured 3D Reconstruction in Planar Primitives with Neural Fields 5
Neuralized Markov Random Field for Interaction-Aware Stochastic Human Trajectory Prediction 5
NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals 6
Neuron Platonic Intrinsic Representation From Dynamics Using Contrastive Learning 5
Neuron based Personality Trait Induction in Large Language Models 5
Neuron-based Multifractal Analysis of Neuron Interaction Dynamics in Large Models 4
Neuroplastic Expansion in Deep Reinforcement Learning 6
New Algorithms for the Learning-Augmented k-means Problem 4
Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing 3
NextBestPath: Efficient 3D Mapping of Unseen Environments 6
No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs 5
No Free Lunch: Fundamental Limits of Learning Non-Hallucinating Generative Models 0
No Location Left Behind: Measuring and Improving the Fairness of Implicit Representations for Earth Data 4
No Need to Talk: Asynchronous Mixture of Language Models 3
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images 5
No Preference Left Behind: Group Distributional Preference Optimization 5
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models 4
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models 5
Node Identifiers: Compact, Discrete Representations for Efficient Graph Learning 5
Node Similarities under Random Projections: Limits and Pathological Cases 2
Node-Time Conditional Prompt Learning in Dynamic Graphs 5
Noise Separation guided Candidate Label Reconstruction for Noisy Partial Label Learning 7
Noise-conditioned Energy-based Annealed Rewards (NEAR): A Generative Framework for Imitation Learning from Observation 5
Noisy Test-Time Adaptation in Vision-Language Models 7
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching 6
Non-Equilibrium Dynamics of Hybrid Continuous-Discrete Ground-State Sampling 5
Non-myopic Generation of Language Models for Reasoning and Planning 6
Nonasymptotic Analysis of Stochastic Gradient Descent with the Richardson–Romberg Extrapolation 2
Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping 1
Nonlinear Sequence Embedding by Monotone Variational Inequality 6
Nonlinear multiregion neural dynamics with parametric impulse response communication channels 3
Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning 5
Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification 6
Not All Language Model Features Are One-Dimensionally Linear 5
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models 5
Not-So-Optimal Transport Flows for 3D Point Cloud Generation 4
Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning 5
NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens 4
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning 4
Number Cookbook: Number Understanding of Language Models and How to Improve It 6
NutriBench: A Dataset for Evaluating Large Language Models in Nutrition Estimation from Meal Descriptions 3
O(d/T) Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions 0
OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes 3
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition 6
OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones? 5
OCCAM: Towards Cost-Efficient and Accuracy-Aware Classification Inference 6
OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models 3
ODE-based Smoothing Neural Network for Reinforcement Learning Tasks 5
OGBench: Benchmarking Offline Goal-Conditioned RL 4
OLMoE: Open Mixture-of-Experts Language Models 5
OMG: Opacity Matters in Material Modeling with Gaussian Splatting 4
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code 4
ONLINE EPSILON NET & PIERCING SET FOR GEOMETRIC CONCEPTS 1
OPTAMI: Global Superlinear Convergence of High-order Methods 6
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization 3
OS-ATLAS: Foundation Action Model for Generalist GUI Agents 3
OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning 3
OSDA Agent: Leveraging Large Language Models for De Novo Design of Organic Structure Directing Agents 5
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting 6
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer 5
Object-Centric Pretraining via Target Encoder Bootstrapping 4
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding 3
OccProphet: Pushing the Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with an Observer-Forecaster-Refiner Framework 5
Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation Correntropy 5
Offline Hierarchical Reinforcement Learning via Inverse Optimization 6
Offline Model-Based Optimization by Learning to Rank 5
Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics 3
Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood 5
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models 4
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces 4
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text 6
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision 5
OmniKV: Dynamic Context Selection for Efficient Long-Context LLMs 5
OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation 4
OmniRe: Omni Urban Scene Reconstruction 5
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup 4
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities 3
On Bits and Bandits: Quantifying the Regret-Information Trade-off 4
On Calibration of LLM-based Guard Models for Reliable Content Moderation 5
On Conformal Isometry of Grid Cells: Learning Distance-Preserving Position Embedding 4
On Designing General and Expressive Quantum Graph Neural Networks with Applications to MILP Instance Representation 3
On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning 6
On Disentangled Training for Nonlinear Transform in Learned Image Compression 3
On Evaluating the Durability of Safeguards for Open-Weight LLMs 5
On Generalization Across Environments In Multi-Objective Reinforcement Learning 5
On Large Language Model Continual Unlearning 5
On Linear Representations and Pretraining Data Frequency in Language Models 5
On Minimizing Adversarial Counterfactual Error in Adversarial Reinforcement Learning 5
On Quantizing Neural Representation for Variable-Rate Video Coding 5
On Rollouts in Model-Based Reinforcement Learning 4
On Scaling Up 3D Gaussian Splatting Training 6
On Speeding Up Language Model Evaluation 6
On Statistical Rates of Conditional Diffusion Transformers: Approximation, Estimation and Minimax Optimality 0
On Stochastic Contextual Bandits with Knapsacks in Small Budget Regime 3
On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback 5
On a Connection Between Imitation Learning and RLHF 4
On the Adversarial Risk of Test Time Adaptation: An Investigation into Realistic Test-Time Data Poisoning 5
On the Adversarial Vulnerability of Label-Free Test-Time Adaptation 5
On the Almost Sure Convergence of the Stochastic Three Points Algorithm 3
On the Benefits of Attribute-Driven Graph Domain Adaptation 4
On the Benefits of Memory for Modeling Time-Dependent PDEs 7
On the Byzantine-Resilience of Distillation-Based Federated Learning 5
On the Completeness of Invariant Geometric Deep Learning Models 4
On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions 2
On the Crucial Role of Initialization for Matrix Factorization 6
On the Expressive Power of Sparse Geometric MPNNs 5
On the Expressiveness of Rational ReLU Neural Networks With Bounded Depth 0
On the Feature Learning in Diffusion Models 3
On the Fourier analysis in the SO(3) space : the EquiLoPO Network 5
On the Hölder Stability of Multiset and Graph Neural Networks 5
On the Identification of Temporal Causal Representation with Instantaneous Dependence 4
On the Importance of Language-driven Representation Learning for Heterogeneous Federated Learning 6
On the Learn-to-Optimize Capabilities of Transformers in In-Context Sparse Recovery 2
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations 4
On the Modeling Capabilities of Large Language Models for Sequential Decision Making 1
On the Optimal Memorization Capacity of Transformers 2
On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models 5
On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descent 3
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective 5
On the Price of Differential Privacy for Hierarchical Clustering 5
On the Relation between Trainability and Dequantization of Variational Quantum Learning Models 1
On the Role of Attention Heads in Large Language Model Safety 5
On the Transfer of Object-Centric Representation Learning 4
On the expressiveness and spectral bias of KANs 3
On the self-verification limitations of large language models on reasoning and planning tasks 3
On-the-fly Preference Alignment via Principle-Guided Decoding 6
Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation 3
One Hundred Neural Networks and Brains Watching Videos: Lessons from Alignment 4
One Model Transfer to All: On Robust Jailbreak Prompts Generation against LLMs 5
One Step Diffusion via Shortcut Models 6
One for all and all for one: Efficient computation of partial Wasserstein distances on the line 4
One-for-All Few-Shot Anomaly Detection via Instance-Induced Prompt Learning 5
Online Clustering with Nearly Optimal Consistency 2
Online Preference Alignment for Language Models via Count-based Exploration 5
Online Reinforcement Learning in Non-Stationary Context-Driven Environments 6
Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization 4
Online-to-Offline RL for Agent Alignment 2
Open-CK: A Large Multi-Physics Fields Coupling benchmarks in Combustion Kinetics 5
Open-Set Graph Anomaly Detection via Normal Structure Regularisation 7
Open-Vocabulary Customization from CLIP via Data-Free Knowledge Distillation 3
Open-World Reinforcement Learning over Long Short-Term Imagination 5
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation 4
OpenHands: An Open Platform for AI Software Developers as Generalist Agents 4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data 4
OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees 3
OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures? 4
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation 3
Operator Deep Smoothing for Implied Volatility 6
OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling 2
Optimal Brain Apoptosis 5
Optimal Flow Transport and its Entropic Regularization: a GPU-friendly Matrix Iterative Algorithm for Flow Balance Satisfaction 5
Optimal Learning of Kernel Logistic Regression for Complex Classification Scenarios 3
Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes 0
Optimal Protocols for Continual Learning via Statistical Physics and Control Theory 3
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization 1
Optimal Transport for Time Series Imputation 6
Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression 0
Optimality of Matrix Mechanism on $\ell_p^p$-metric 0
Optimistic Games for Combinatorial Bayesian Optimization with Application to Protein Design 6
Optimization by Parallel Quasi-Quantum Annealing with Gradient-Based Sampling 3
Optimized Multi-Token Joint Decoding With Auxiliary Model for LLM Inference 5
Optimizing $(L_0, L_1)$-Smooth Functions by Gradient Methods 2
Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images 4
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization 6
Optimizing Neural Network Representations of Boolean Networks 4
Optimizing Posterior Samples for Bayesian Optimization via Rootfinding 5
Optimizing importance weighting in the presence of sub-population shifts 4
OptionZero: Planning with Learned Options 4
Oracle efficient truncated statistics 1
Order-aware Interactive Segmentation 3
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution 4
Oscillatory State-Space Models 6
Out-of-distribution Generalization for Total Variation based Invariant Risk Minimization 5
Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection 6
Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model 5
Overcoming Lower-Level Constraints in Bilevel Optimization: A Novel Approach with Regularized Gap Functions 6
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control 5
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination 6
P-SPIKESSM: HARNESSING PROBABILISTIC SPIKING STATE SPACE MODELS FOR LONG-RANGE DEPENDENCY TASKS 5
PABBO: Preferential Amortized Black-Box Optimization 6
PAD: Personalized Alignment of LLMs at Decoding-time 5
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer 4
PAL: Sample-Efficient Personalized Reward Modeling for Pluralistic Alignment 6
PALMBENCH: A COMPREHENSIVE BENCHMARK OF COMPRESSED LARGE LANGUAGE MODELS ON MOBILE PLATFORMS 5
PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks 7
PEAR: Primitive Enabled Adaptive Relabeling for Boosting Hierarchical Reinforcement Learning 5
PEARL: Parallel Speculative Decoding with Adaptive Draft Length 6
PEARL: Towards Permutation-Resilient LLMs 6
PETRA: Parallel End-to-end Training with Reversible Architectures 6
PFDiff: Training-Free Acceleration of Diffusion Models Combining Past and Future Scores 6
PFGuard: A Generative Framework with Privacy and Fairness Safeguards 4
PICASO: Permutation-Invariant Context Composition with State Space Models 5
PIED: Physics-Informed Experimental Design for Inverse Problems 5
PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh Representations 3
PIN: Prolate Spheroidal Wave Function-based Implicit Neural Representations 3
PINP: Physics-Informed Neural Predictor with latent estimation of fluid flows 4
PIORF: Physics-Informed Ollivier-Ricci Flow for Long–Range Interactions in Mesh Graph Neural Networks 6
PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations 6
POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding 5
POTEC: Off-Policy Contextual Bandits for Large Action Spaces via Policy Decomposition 4
PPT: Patch Order Do Matters In Time Series Pretext Task 7
PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation 6
PRDP: Progressively Refined Differentiable Physics 4
PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models 4
PT-T2I/V: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Image/Video-Task 5
PWM: Policy Learning with Multi-Task World Models 5
PaCA: Partial Connection Adaptation for Efficient Fine-Tuning 4
PaLD: Detection of Text Partially Written by Large Language Models 6
PaPaGei: Open Foundation Models for Optical Physiological Signals 5
PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction 3
Pacmann: Efficient Private Approximate Nearest Neighbor Search 6
Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning 4
Pairwise Elimination with Instance-Dependent Guarantees for Bandits with Cost Subsidy 4
Palu: KV-Cache Compression with Low-Rank Projection 4
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages 5
ParFam -- (Neural Guided) Symbolic Regression via Continuous Global Optimization 6
ParaSolver: A Hierarchical Parallel Integral Solver for Diffusion Models 6
Param$\Delta$ for Direct Mixing: Post-Train Large Language Model At Zero Cost 3
Parameter Expanded Stochastic Gradient Markov Chain Monte Carlo 6
Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization 6
Pareto Low-Rank Adapters: Efficient Multi-Task Learning with Preferences 6
Pareto Prompt Optimization 5
ParetoFlow: Guided Flows in Multi-Objective Optimization 4
Partial Gromov-Wasserstein Metric 6
Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior 5
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration 6
Pedestrian Motion Reconstruction: A Large-scale Benchmark via Mixed Reality Rendering with Multiple Perspectives and Modalities 3
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation 5
Periodic Materials Generation using Text-Guided Joint Diffusion Model 6
Perm: A Parametric Representation for Multi-Style 3D Hair Modeling 4
Permute-and-Flip: An optimally stable and watermarkable decoder for LLMs 6
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models 4
Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents 6
Persistent Pre-training Poisoning of LLMs 4
PersonalLLM: Tailoring LLMs to Individual Preferences 5
Personality Alignment of Large Language Models 6
Personalized Representation from Personalized Generation 6
Personalized Visual Instruction Tuning 4
Perturbation-Restrained Sequential Model Editing 4
PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training 2
PharmacoMatch: Efficient 3D Pharmacophore Screening via Neural Subgraph Matching 6
PhiNets: Brain-inspired Non-contrastive Learning Based on Temporal Prediction Hypothesis 6
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion 4
PhyMPGN: Physics-encoded Message Passing Graph Network for spatiotemporal PDE systems 5
PhyloLM: Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks 6
PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational Autoencoders 5
PhysPDE: Rethinking PDE Discovery and a Physical HYpothesis Selection Benchmark 5
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process 4
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems 4
Physics of Language Models: Part 3.2, Knowledge Manipulation 0
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws 3
Physics-Informed Deep Inverse Operator Networks for Solving PDE Inverse Problems 4
Physics-Informed Diffusion Models 6
Physics-aligned field reconstruction with diffusion bridge 6
Physics-informed Temporal Difference Metric Learning for Robot Motion Planning 5
Physiome-ODE: A Benchmark for Irregularly Sampled Multivariate Time-Series Forecasting Based on Biological ODEs 5
PiCO: Peer Review in LLMs based on Consistency Optimization 5
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance 5
PivotMesh: Generic 3D Mesh Generation via Pivot Vertices Guidance 4
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions 4
Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming 3
Planning in Natural Language Improves LLM Search for Code Generation 3
Plastic Learning with Deep Fourier Features 4
PnP-Flow: Plug-and-Play Image Restoration with Flow Matching 6
Point Cluster: A Compact Message Unit for Communication-Efficient Collaborative Perception 5
Point-SAM: Promptable 3D Segmentation Model for Point Clouds 4
Point-based Instance Completion with Scene Constraints 3
PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection 4
Poison-splat: Computation Cost Attack on 3D Gaussian Splatting 5
Poisson-Dirac Neural Networks for Modeling Coupled Dynamical Systems across Domains 5
PolaFormer: Polarity-aware Linear Attention for Vision Transformers 5
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model 4
Policy Design in Long-run Welfare Dynamics 4
Policy Optimization under Imperfect Human Interactions with Agent-Gated Shared Autonomy 4
PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization 5
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs 3
PolyhedronNet: Representation Learning for Polyhedra with Surface-attributed Graph 4
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models 5
Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation 3
PooDLe🐩: Pooled and dense self-supervised learning from naturalistic videos 4
Population Transformer: Learning Population-level Representations of Neural Activity 6
Port-Hamiltonian Architectural Bias for Long-Range Propagation in Deep Graph Networks 5
PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches 5
Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation 6
Post-hoc Reward Calibration: A Case Study on Length Bias 5
PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling 5
PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing 6
Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration 6
Preble: Efficient Distributed Prompt Scheduling for LLM Serving 6
Precedence-Constrained Winter Value for Effective Graph Data Valuation 5
Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs 5
Precise Parameter Localization for Textual Generation in Diffusion Models 4
Predicate Hierarchies Improve Few-Shot State Classification 3
Predicting the Energy Landscape of Stochastic Dynamical System via Physics-informed Self-supervised Learning 5
Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on Regression Errors 4
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation 5
Predictive Uncertainty Quantification for Bird's Eye View Segmentation: A Benchmark and Novel Loss Function 5
Preference Diffusion for Recommendation 6
Preference Elicitation for Offline Reinforcement Learning 4
Preference Optimization for Reasoning with Pseudo Feedback 4
Preserving Deep Representations in One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework 6
Preserving Diversity in Supervised Fine-Tuning of Large Language Models 6
Presto! Distilling Steps and Layers for Accelerating Music Generation 5
Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline 6
Prioritized Generative Replay 4
Privacy Auditing of Large Language Models 4
Privacy-Aware Lifelong Learning 6
Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models 5
Private Mechanism Design via Quantile Estimation 2
Privately Counting Partially Ordered Data 2
ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs 4
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance 4
Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility 5
Probabilistic Conformal Prediction with Approximate Conditional Validity 5
Probabilistic Geometric Principal Component Analysis with application to neural data 4
Probabilistic Language-Image Pre-Training 6
Probabilistic Learning to Defer: Handling Missing Expert Annotations and Controlling Workload Distribution 5
Probabilistic Neural Pruning via Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation 5
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing 6
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models 5
Probing the Latent Hierarchical Structure of Data via Diffusion Models 3
Problem-Parameter-Free Federated Learning 3
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models 3
Procedural Synthesis of Synthesizable Molecules 5
Process Reward Model with Q-value Rankings 6
Programming Refusal with Conditional Activation Steering 7
Progress or Regress? Self-Improvement Reversal in Post-training 5
Progressive Compositionality in Text-to-Image Generative Models 4
Progressive Compression with Universally Quantized Diffusion Models 6
Progressive Mixed-Precision Decoding for Efficient LLM Inference 5
Progressive Parameter Efficient Transfer Learning for Semantic Segmentation 5
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation 5
Progressive distillation induces an implicit curriculum 5
Projection Head is Secretly an Information Bottleneck 5
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection 4
Prompting Fairness: Integrating Causality to Debias Large Language Models 4
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models 6
ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids 6
ProtPainter: Draw or Drag Protein via Topology-guided Diffusion 5
Protecting against simultaneous data poisoning attacks 5
Protein Language Model Fitness is a Matter of Preference 4
ProteinBench: A Holistic Evaluation of Protein Foundation Models 5
Proteina: Scaling Flow-based Protein Structure Generative Models 6
ProtoSnap: Prototype Alignment For Cuneiform Signs 5
Prototype antithesis for biological few-shot class-incremental learning 5
Provable Benefit of Annealed Langevin Monte Carlo for Non-log-concave Sampling 2
Provable Convergence Bounds for Hybrid Dynamical Sampling and Optimization 4
Provable Convergence and Limitations of Geometric Tempering for Langevin Dynamics 0
Provable Robust Overfitting Mitigation in Wasserstein Distributionally Robust Optimization 5
Provable Uncertainty Decomposition via Higher-Order Calibration 3
Provable unlearning in topic modeling and downstream tasks 1
Provable weak-to-strong generalization via benign overfitting 2
Provably Accurate Shapley Value Estimation via Leverage Score Sampling 4
Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning 6
Provably Robust Explainable Graph Neural Networks against Graph Perturbation Attacks 5
Provably Safeguarding a Classifier from OOD and Adversarial Samples 5
Provence: efficient and robust context pruning for retrieval-augmented generation 5
Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning 6
Proximal Mapping Loss: Understanding Loss Functions in Crowd Counting & Localization 4
Proxy Denoising for Source-Free Domain Adaptation 5
PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection 6
Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount 4
Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection 4
Pushing the Limits of All-Atom Geometric Graph Neural Networks: Pre-Training, Scaling, and Zero-Shot Transfer 4
PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and Verify 5
PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition 5
Pyramidal Flow Matching for Efficient Video Generative Modeling 5
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation 5
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning 5
QA-Calibration of Language Model Confidence Scores 5
QERA: an Analytical Framework for Quantization Error Reconstruction 5
QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing 5
QP-SNN: Quantized and Pruned Spiking Neural Networks 4
QPM: Discrete Optimization for Globally Interpretable Image Classification 5
Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks 3
QuaDiM: A Conditional Diffusion Model For Quantum State Property Estimation 3
Quality Measures for Dynamic Graph Generative Models 5
Quality over Quantity in Attention Layers: When Adding More Heads Hurts 3
Quamba: A Post-Training Quantization Recipe for Selective State Space Models 5
Quantifying Generalization Complexity for Large Language Models 3
Quantitative Approximation for Neural Operators in Nonlinear Parabolic Equations 0
Quantized Spike-driven Transformer 4
Quantum (Inspired) $D^2$-sampling with Applications 3
Quantum-PEFT: Ultra parameter-efficient fine-tuning 4
Query-based Knowledge Transfer for Heterogeneous Learning Environments 3
Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model 4
R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference 6
R2Det: Exploring Relaxed Rotation Equivariance in 2D Object Detection 6
RA-TTA: Retrieval-Augmented Test-Time Adaptation for Vision-Language Models 7
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards 4
RAG-SR: Retrieval-Augmented Generation for Neural Symbolic Regression 5
RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models 6
RB-Modulation: Training-Free Stylization using Reference-Based Modulation 5
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation 5
REBIND: Enhancing Ground-state Molecular Conformation Prediction via Force-Based Graph Rewiring 5
RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks 6
REEF: Representation Encoding Fingerprints for Large Language Models 5
REFINE: Inversion-Free Backdoor Defense via Model Reprogramming 6
REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments 4
REMEDY: Recipe Merging Dynamics in Large Vision-Language Models 3
RESfM: Robust Deep Equivariant Structure from Motion 5
RESuM: A Rare Event Surrogate Model for Physics Detector Design 2
REVISITING MULTI-PERMUTATION EQUIVARIANCE THROUGH THE LENS OF IRREDUCIBLE REPRESENTATIONS 4
REvolve: Reward Evolution with Large Language Models using Human Feedback 4
RFMamba: Frequency-Aware State Space Model for RF-Based Human-Centric Perception 3
RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction 6
RGB-Event ISP: The Dataset and Benchmark 5
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style 4
RMB: Comprehensively benchmarking reward models in LLM alignment 2
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything 5
RNNs are not Transformers (Yet): The Key Bottleneck on In-Context Retrieval 4
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL 5
RRM: Robust Reward Model Training Mitigates Reward Hacking 4
RTDiff: Reverse Trajectory Synthesis via Diffusion for Offline Reinforcement Learning 5
RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs 6
RaSA: Rank-Sharing Low-Rank Adaptation 4
Radar: Fast Long-Context Decoding for Any Transformer 6
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization 4
RandLoRA: Full rank parameter-efficient fine-tuning of large models 3
Random Is All You Need: Random Noise Injection on Feature Statistics for Generalizable Deep Image Denoising 4
Random-Set Neural Networks 6
Range, not Independence, Drives Modularity in Biologically Inspired Representations 3
RankSHAP: Shapley Value Based Feature Attributions for Learning to Rank 3
Ranking-aware adapter for text-driven image ordering with CLIP 5
Rapid Selection and Ordering of In-Context Demonstrations via Prompt Embedding Clustering 3
Rapidly Adapting Policies to the Real-World via Simulation-Guided Fine-Tuning 3
Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure? 6
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance 6
Rational Decision-Making Agent with Learning Internal Utility Judgment 6
Rationalizing and Augmenting Dynamic Graph Neural Networks 5
RazorAttention: Efficient KV Cache Compression Through Retrieval Heads 4
Re-Aligning Language to Visual Objects with an Agentic Workflow 6
Re-Evaluating the Impact of Unseen-Class Unlabeled Data on Semi-Supervised Learning Model 5
Re-Imagining Multimodal Instruction Tuning: A Representation View 5
Re-evaluating Open-ended Evaluation of Large Language Models 3
ReAttention: Training-Free Infinite Context with Finite Attention Scope 7
ReCogLab: a framework testing relational reasoning & cognitive hypotheses on LLMs 5
ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability 6
ReGen: Generative Robot Simulation via Inverse Design 6
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement 4
ReMatching Dynamic Reconstruction Flow 5
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing 4
ReNovo: Retrieval-Based De Novo Mass Spectrometry Peptide Sequencing 5
ReSi: A Comprehensive Benchmark for Representational Similarity Measures 5
Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model 5
Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation 4
Real-Time Video Generation with Pyramid Attention Broadcast 5
Real-time design of architectural structures with differentiable mechanics and neural networks 3
Real2Code: Reconstruct Articulated Objects via Code Generation 4
Realistic Evaluation of Deep Partial-Label Learning Algorithms 6
Reasoning Elicitation in Language Models via Counterfactual Feedback 2
Reasoning of Large Language Models over Knowledge Graphs with Super-Relations 4
Reasoning with Latent Thoughts: On the Power of Looped Transformers 4
Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval 7
Reassessing How to Compare and Improve the Calibration of Machine Learning Models 5
RecDreamer: Consistent Text-to-3D Generation via Uniform Score Distillation 4
RecFlow: An Industrial Full Flow Recommendation Dataset 5
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon 3
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data 6
Reconciling Model Multiplicity for Downstream Decision Making 5
Reconsidering Faithfulness in Regular, Self-Explainable and Domain Invariant GNNs 5
Reconstruction-Guided Policy: Enhancing Decision-Making through Agent-Wise State Consistency 5
Reconstructive Visual Instruction Tuning 3
Recovering Manifold Structure Using Ollivier Ricci Curvature 6
Recovery of Causal Graph Involving Latent Variables via Homologous Surrogates 3
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow 6
Redefining the task of Bioactivity Prediction 5
Reducing Hallucinations in Large Vision-Language Models via Latent Space Steering 4
RefactorBench: Evaluating Stateful Reasoning in Language Agents Through Code 4
Refine Knowledge of Large Language Models via Adaptive Contrastive Learning 4
Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment 2
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective 5
Reflective Gaussian Splatting 3
Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation 6
Reframing Structure-Based Drug Design Model Evaluation via Metrics Correlated to Practical Needs 5
RegMix: Data Mixture as Regression for Language Model Pre-training 4
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF 7
Regret Bounds for Episodic Risk-Sensitive Linear Quadratic Regulator 3
Regret-Optimal List Replicable Bandit Learning: Matching Upper and Lower Bounds 1
Regretful Decisions under Label Noise 4
Regularization by Texts for Latent Diffusion Inverse Solvers 5
Regularizing Energy among Training Samples for Out-of-Distribution Generalization 4
Regulatory DNA Sequence Design with Reinforcement Learning 6
Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics 2
Reinforcement Learning from Imperfect Corrective Actions and Proxy Rewards 5
Reinforcement learning with combinatorial actions for coupled restless bandits 5
RelCon: Relative Contrastive Learning for a Motion Foundation Model for Wearable Data 5
Relation-Aware Diffusion for Heterogeneous Graphs with Partially Observed Features 5
Relax and Merge: A Simple Yet Effective Framework for Solving Fair $k$-Means and $k$-sparse Wasserstein Barycenter Problems 5
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA 3
Release the Powers of Prompt Tuning: Cross-Modality Prompt Transfer 4
Reliable and Diverse Evaluation of LLM Medical Knowledge Mastery 4
RelitLRM: Generative Relightable Radiance for Large Reconstruction Models 5
Remove Symmetries to Control Model Expressivity and Improve Optimization 5
Repetition Improves Language Model Embeddings 4
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph 3
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think 5
Representational Similarity via Interpretable Visual Concepts 6
Representative Guidance: Diffusion Model Sampling with Coherence 6
Repulsive Latent Score Distillation for Solving Inverse Problems 6
Residual Connections and Normalization Can Provably Prevent Oversmoothing in GNNs 5
Residual Deep Gaussian Processes on Manifolds 5
Residual Kernel Policy Network: Enhancing Stability and Robustness in RKHS-Based Reinforcement Learning 4
Residual Stream Analysis with Multi-Layer SAEs 4
Residual-MPPI: Online Policy Customization for Continuous Control 4
Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks 4
Restructuring Vector Quantization with the Rotation Trick 5
Restyling Unsupervised Concept Based Interpretable Networks with Generative Models 4
Rethinking Artistic Copyright Infringements In the Era Of Text-to-Image Generative Models 4
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives 3
Rethinking Classifier Re-Training in Long-Tailed Recognition: Label Over-Smooth Can Balance 5
Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior 7
Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words 3
Rethinking Fair Representation Learning for Performance-Sensitive Tasks 3
Rethinking Graph Neural Networks From A Geometric Perspective Of Node Features 5
Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off 4
Rethinking Invariance in In-context Learning 4
Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond 6
Rethinking Light Decoder-based Solvers for Vehicle Routing Problems 5
Rethinking Multiple-Instance Learning From Feature Space to Probability Space 5
Rethinking Neural Multi-Objective Combinatorial Optimization via Neat Weight Embedding 6
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree? 3
Rethinking Reward Modeling in Preference-based Large Language Model Alignment 4
Rethinking Self-Distillation: Label Averaging and Enhanced Soft Label Refinement with Partial Labels 6
Rethinking Shapley Value for Negative Interactions in Non-convex Games 4
Rethinking Spiking Neural Networks from an Ensemble Learning Perspective 5
Rethinking Visual Counterfactual Explanations Through Region Constraint 7
Rethinking and Improving Autoformalization: Towards a Faithful Metric and a Dependency Retrieval-based Approach 4
Rethinking the generalization of drug target affinity prediction algorithms via similarity aware evaluation 5
Rethinking the role of frames for SE(3)-invariant crystal structure modeling 5
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model 5
Retri3D: 3D Neural Graphics Representation Retrieval 4
Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization 6
Retrieval Head Mechanistically Explains Long-Context Factuality 3
RetroInText: A Multimodal Large Language Model Enhanced Framework for Retrosynthetic Planning via In-Context Representation Learning 7
Reveal Object in Lensless Photography via Region Gaze and Amplification 6
Revealing and Mitigating Over-Attention in Knowledge Editing 7
Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs) 5
Revealing the 3D Cosmic Web through Gravitationally Constrained Neural Fields 2
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References 3
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models 3
Revisit Micro-batch Clipping: Adaptive Data Pruning via Gradient Manipulation 4
Revisit the Open Nature of Open Vocabulary Semantic Segmentation 5
Revisiting Convolution Architecture in the Realm of DNA Foundation Models 5
Revisiting In-context Learning Inference Circuit in Large Language Models 4
Revisiting Large-Scale Non-convex Distributionally Robust Optimization 4
Revisiting Mode Connectivity in Neural Networks with Bezier Surface 5
Revisiting Nearest Neighbor for Tabular Data: A Deep Tabular Baseline Two Decades Later 5
Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts 4
Revisiting Random Walks for Learning on Graphs 6
Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control 6
Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations 6
Revisiting a Design Choice in Gradient Temporal Difference Learning 2
Revisiting text-to-image evaluation with Gecko: on metrics, prompts, and human rating 4
Revolutionizing EMCCD Denoising through a Novel Physics-Based Learning Framework for Noise Modeling 4
Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning 4
Reward Learning from Multiple Feedback Types 6
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning 3
Risk-Sensitive Diffusion: Robustly Optimizing Diffusion Models with Noisy Samples 3
Risk-Sensitive Variational Actor-Critic: A Model-Based Approach 4
Robotouille: An Asynchronous Planning Benchmark for LLM Agents 4
Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets 4
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection 4
Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport 6
Robust Conformal Prediction with a Single Binary Certificate 5
Robust Feature Learning for Multi-Index Models in High Dimensions 4
Robust Function-Calling for On-Device Language Model via Function Masking 5
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning 4
Robust LLM safeguarding via refusal feature adversarial training 4
Robust Representation Consistency Model via Contrastive Denoising 6
Robust Root Cause Diagnosis using In-Distribution Interventions 4
Robust Simulation-Based Inference under Missing Data via Neural Processes 5
Robust System Identification: Finite-sample Guarantees and Connection to Regularization 5
Robust Transfer of Safety-Constrained Reinforcement Learning Agents 4
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances 6
Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis 4
Robust-PIFu: Robust Pixel-aligned Implicit Function for 3D Human Digitalization from a Single Image 4
RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction 4
Robustness Auditing for Linear Regression: To Singularity and Beyond 5
Robustness Inspired Graph Backdoor Defense 6
Robustness Reprogramming for Representation Learning 4
Robustness of Quantum Algorithms for Nonconvex Optimization 1
RocketEval: Efficient automated LLM evaluation via grading checklist 4
Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions 3
Root Cause Analysis of Anomalies in Multivariate Time Series through Granger Causal Discovery 6
Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference 4
Round and Round We Go! What makes Rotary Positional Encodings useful? 4
RouteLLM: Learning to Route LLMs from Preference Data 6
Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models 5
RuAG: Learned-rule-augmented Generation for Large Language Models 4
S4M: S4 for multivariate time series forecasting with Missing values 5
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation 5
SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection 6
SAM 2: Segment Anything in Images and Videos 6
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation 7
SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement 6
SANA: Efficient High-Resolution Text-to-Image Synthesis with Linear Diffusion Transformers 4
SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP 3
SAVA: Scalable Learning-Agnostic Data Valuation 6
SBSC: Step-by-Step Coding for Improving Mathematical Olympiad Performance 4
SC-OmniGS: Self-Calibrating Omnidirectional Gaussian Splatting 5
SCBench: A KV Cache-Centric Analysis of Long-Context Methods 5
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation 6
SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning 5
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection 6
SEBRA: Debiasing through Self-Guided Bias Ranking 6
SELF-EVOLVED REWARD LEARNING FOR LLMS 6
SEMDICE: Off-policy State Entropy Maximization via Stationary Distribution Correction Estimation 4
SEPARATE: A Simple Low-rank Projection for Gradient Compression in Modern Large-scale Model Training Process 4
SFESS: Score Function Estimators for $k$-Subset Sampling 5
SFS: Smarter Code Space Search improves LLM Inference Scaling 3
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation 4
SGD with memory: fundamental properties and stochastic acceleration 2
SIM: Surface-based fMRI Analysis for Inter-Subject Multimodal Decoding from Movie-Watching Experiments 5
SIMPL: Scalable and hassle-free optimisation of neural representations from behaviour 5
SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects 5
SINGER: Stochastic Network Graph Evolving Operator for High Dimensional PDEs 4
SLMRec: Distilling Large Language Models into Small for Sequential Recommendation 5
SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMs 5
SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision 5
SMITE: Segment Me In TimE 5
SMT: Fine-Tuning Large Language Models with Sparse Matrices 6
SOAP: Improving and Stabilizing Shampoo using Adam for Language Modeling 5
SONICS: Synthetic Or Not - Identifying Counterfeit Songs 6
SOO-Bench: Benchmarks for Evaluating the Stability of Offline Black-Box Optimization 6
SOREL: A Stochastic Algorithm for Spectral Risks Minimization 6
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal 5
SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION 5
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation 4
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training 6
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Model 5
SPDIM: Source-Free Unsupervised Conditional and Label Shift Adaptation in EEG 4
SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models 3
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models 6
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks 4
SSLAM: Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes 6
SSOLE: Rethinking Orthogonal Low-rank Embedding for Self-Supervised Learning 4
ST-GCond: Self-supervised and Transferable Graph Dataset Condensation 7
STAFF: Speculative Coreset Selection for Task-Specific Fine-tuning 6
STAMP: Scalable Task- And Model-agnostic Collaborative Perception 4
STAR: Stability-Inducing Weight Perturbation for Continual Learning 6
STAR: Synthesis of Tailored Architectures 2
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs 6
STORM: Spatio-TempOral Reconstruction Model For Large-Scale Outdoor Scenes 4
STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning 5
SV-RAG: LoRA-Contextualizing Adaptation of MLLMs for Long Document Understanding 5
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency 4
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding 3
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression 4
SVDQuant: Absorbing Outliers by Low-Rank Component for 4-Bit Diffusion Models 4
SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix 3
SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement 4
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? 5
SWEb: A Large Web Dataset for the Scandinavian Languages 5
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration 6
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation 5
SaMer: A Scenario-aware Multi-dimensional Evaluator for Large Language Models 3
SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation 3
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models 6
SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations 4
Safety Alignment Should be Made More Than Just a Few Tokens Deep 4
Safety Layers in Aligned Large Language Models: The Key to LLM Security 4
Safety Representations for Safer Policy Learning 2
Safety-Prioritizing Curricula for Constrained Reinforcement Learning 6
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration 6
Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking 4
Salvage: Shapley-distribution Approximation Learning Via Attribution Guided Exploration for Explainable Image Classification 3
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling 5
Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking 4
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models 5
Satisficing Regret Minimization in Bandits 3
ScImage: How good are multimodal large language models at scientific text-to-image generation? 4
Scalable Bayesian Learning with posteriors 6
Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video 6
Scalable Decentralized Learning with Teleportation 4
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction 2
Scalable Discrete Diffusion Samplers: Combinatorial Optimization and Statistical Physics 6
Scalable Extraction of Training Data from Aligned, Production Language Models 3
Scalable Influence and Fact Tracing for Large Language Model Pretraining 4
Scalable Mechanistic Neural Networks 6
Scalable Universal T-Cell Receptor Embeddings from Adaptive Immune Repertoires 6
Scalable and Certifiable Graph Unlearning: Overcoming the Approximation Error Barrier 6
Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection 6
Scale-Free Graph-Language Models 5
Scale-aware Recognition in Satellite Images under Resource Constraints 4
Scaling Autonomous Agents via Automatic Reward Modeling And Planning 4
Scaling Diffusion Language Models via Adaptation from Autoregressive Models 6
Scaling FP8 training to trillion-token LLMs 4
Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport 4
Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation 6
Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning 3
Scaling Large Language Model-based Multi-Agent Collaboration 3
Scaling Laws for Adversarial Attacks on Language Model Activations and Tokens 4
Scaling Laws for Downstream Task Performance in Machine Translation 3
Scaling Laws for Precision 3
Scaling Long Context Training Data by Long-Distance Referrals 5
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining 5
Scaling Optimal LR Across Token Horizons 2
Scaling Speech-Text Pre-training with Synthetic Interleaved Data 5
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study 6
Scaling Transformers for Low-Bitrate High-Quality Speech Coding 5
Scaling Wearable Foundation Models 3
Scaling and evaluating sparse autoencoders 3
Scaling up Masked Diffusion Models on Text 6
Scaling up the Banded Matrix Factorization Mechanism for Large Scale Differentially Private ML 4
Schur's Positive-Definite Network: Deep Learning in the SPD cone with structure 5
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding 5
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery 4
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models 6
Score-based Self-supervised MRI Denoising 5
Score-based free-form architectures for high-dimensional Fokker-Planck equations 5
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning 6
SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents 3
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction 7
SeRA: Self-Reviewing and Alignment of LLMs using Implicit Reward Margins 5
Searching for Optimal Solutions with LLMs via Bayesian Optimization 4
Second Order Bounds for Contextual Bandits with Function Approximation 1
Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer 5
Second-Order Min-Max Optimization with Lazy Hessians 5
SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography 4
See It from My Perspective: How Language Affects Cultural Bias in Image Understanding 3
See What You Are Told: Visual Attention Sink in Large Multimodal Models 5
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators 6
Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models 5
SegLLM: Multi-round Reasoning Segmentation with Large Language Models 5
Segment Any 3D Object with Language 4
SelKD: Selective Knowledge Distillation via Optimal Transport Perspective 6
Select before Act: Spatially Decoupled Action Repetition for Continuous Control 5
SelectFormer in Data Markets: Privacy-Preserving and Efficient Data Selection for Transformers with Multi-Party Computation 3
Selective Aggregation for Low-Rank Adaptation in Federated Learning 5
Selective Attention Improves Transformer 4
Selective Label Enhancement Learning for Test-Time Adaptation 6
Selective Task Group Updates for Multi-Task Optimization 4
Selective Unlearning via Representation Erasure Using Domain Adversarial Training 3
Selective induction Heads: How Transformers Select Causal Structures in Context 2
Self-Attention-Based Contextual Modulation Improves Neural System Identification 5
Self-Boosting Large Language Models with Synthetic Preference Data 3
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models 5
Self-Evolving Multi-Agent Collaboration Networks for Software Development 3
Self-Improvement in Language Models: The Sharpening Mechanism 5
Self-Improving Robust Preference Optimization 5
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models 5
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts 3
Self-Normalized Resets for Plasticity in Continual Learning 4
Self-Play Preference Optimization for Language Model Alignment 5
Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement 6
Self-Supervised Diffusion Models for Electron-Aware Molecular Representation Learning 7
Self-Updatable Large Language Models by Integrating Context into Model Parameters 5
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models 5
Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining 4
Self-supervised contrastive learning performs non-linear system identification 5
Semantic Aware Representation Learning for Lifelong Learning 4
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations 5
Semantic Loss Guided Data Efficient Supervised Fine Tuning for Safe Responses in LLMs 3
Semantic Temporal Abstraction via Vision-Language Model Guidance for Efficient Reinforcement Learning 6
Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors 6
Semantix: An Energy-guided Sampler for Semantic Style Transfer 4
Semi-Parametric Retrieval via Binary Bag-of-Tokens Index 5
Semi-Supervised CLIP Adaptation by Enforcing Semantic and Trapezoidal Consistency 5
Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving 5
Semialgebraic Neural Networks: From roots to representations 2
Sensitivity Verification for Additive Decision Tree Ensembles 4
Sensitivity-Constrained Fourier Neural Operators for Forward and Inverse Problems in Parametric Differential Equations 5
Sensor-Invariant Tactile Representation 1
Separation Power of Equivariant Neural Networks 0
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning 4
Sequential Controlled Langevin Diffusions 5
Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning 3
Severing Spurious Correlations with Data Pruning 4
ShEPhERD: Diffusing shape, electrostatics, and pharmacophores for bioisosteric drug design 6
Shallow diffusion networks provably learn hidden low-dimensional structure 1
Shape as Line Segments: Accurate and Flexible Implicit Surface Representation 4
Shapley-Guided Utility Learning for Effective Graph Inference Data Valuation 4
Shared-AE: Automatic Identification of Shared Subspaces in High-dimensional Neural and Behavioral Activity 6
Sharper Guarantees for Learning Neural Network Classifiers with Gradient Methods 2
Sharpness-Aware Black-Box Optimization 5
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training 4
Sharpness-Aware Minimization: General Analysis and Improved Rates 5
Shedding Light on Time Series Classification using Interpretability Gated Networks 6
Shh, don't say that! Domain Certification in LLMs 6
Shifting the Paradigm: A Diffeomorphism Between Time Series Data Manifolds for Achieving Shift-Invariancy in Deep Learning 6
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents 4
Shot2Story: A New Benchmark for Comprehensive Understanding of Multi-shot Videos 5
Should VLMs be Pre-trained with Image Data? 4
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation 5
SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training 5
SiReRAG: Indexing Similar and Related Information for Multihop Reasoning 5
SigDiffusions: Score-Based Diffusion Models for Time Series via Log-Signature Embeddings 4
Signature Kernel Conditional Independence Tests in Causal Discovery for Stochastic Processes 5
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning 5
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters 5
SimXRD-4M: Big Simulated X-ray Diffraction Data and Crystal Symmetry Classification Benchmark 5
Simple Guidance Mechanisms for Discrete Diffusion Models 5
Simple ReFlow: Improved Techniques for Fast Flow Models 4
Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation 5
Simple yet Effective Incomplete Multi-view Clustering: Similarity-level Imputation and Intra-view Hybrid-group Prototype Construction 3
Simple, Good, Fast: Self-Supervised World Models Free of Baggage 6
SimpleTM: A Simple Baseline for Multivariate Time Series Forecasting 5
Simplifying Deep Temporal Difference Learning 5
Simplifying, Stabilizing and Scaling Continuous-time Consistency Models 4
SimulPL: Aligning Human Preferences in Simultaneous Machine Translation 5
Simulating Human-like Daily Activities with Desire-driven Autonomy 3
Simulating Training Dynamics to Reconstruct Training Data from Deep Neural Networks 5
Single Teacher, Multiple Perspectives: Teacher Knowledge Augmentation for Enhanced Knowledge Distillation 5
Single-agent Poisoning Attacks Suffice to Ruin Multi-Agent Learning 2
Singular Subspace Perturbation Bounds via Rectangular Random Matrix Diffusions 1
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes 4
Size-Generalizable RNA Structure Evaluation by Exploring Hierarchical Geometries 4
Sketch2Diagram: Generating Vector Diagrams from Hand-Drawn Sketches 5
Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees 4
Skill Expansion and Composition in Parameter Space 4
SleepSMC: Ubiquitous Sleep Staging via Supervised Multimodal Coordination 6
Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation 4
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation 3
Small Models are LLM Knowledge Triggers for Medical Tabular Prediction 7
Small-to-Large Generalization: Training Data Influences Models Consistently Across Scale 4
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling 3
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction 5
SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback 5
Smoothing the Shift: Towards Stable Test-Time Adaptation under Complex Multimodal Noises 5
SoftCVI: Contrastive variational inference with self-generated soft labels 6
SoftMatcha: A Soft and Fast Pattern Matcher for Billion-Scale Corpus Searches 5
Solving Differential Equations with Constrained Learning 6
Solving New Tasks by Adapting Internet Video Knowledge 4
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model 5
Solving Video Inverse Problems Using Image Diffusion Models 6
Solving hidden monotone variational inequalities with surrogate losses 3
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios 5
Sort-free Gaussian Splatting via Weighted Sum Rendering 3
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation 6
SpaceGNN: Multi-Space Graph Neural Network for Node Anomaly Detection with Extremely Limited Labels 5
Sparse Autoencoders Do Not Find Canonical Units of Analysis 2
Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models 3
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models 4
Sparse Learning for State Space Models on Mobile 4
Sparse autoencoders reveal selective remapping of visual concepts during adaptation 4
Sparse components distinguish visual pathways & their alignment to neural networks 5
SparsyFed: Sparse Adaptive Federated Learning 5
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion 5
Specialized Foundation Models Struggle to Beat Supervised Baselines 6
Spectral Compressive Imaging via Unmixing-driven Subspace Diffusion Refinement 7
Spectral-Refiner: Accurate Fine-Tuning of Spatiotemporal Fourier Neural Operator for Turbulent Flows 6
Spectro-Riemannian Graph Neural Networks 5
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling 5
Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting 4
Speech Robust Bench: A Robustness Benchmark For Speech Recognition 6
Spherical Tree-Sliced Wasserstein Distance 6
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows 3
SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking 5
Spiking Vision Transformer with Saccadic Attention 3
SpinQuant: LLM Quantization with Learned Rotations 5
SplatFormer: Point Transformer for Robust 3D Gaussian Splatting 5
SplineGS: Learning Smooth Trajectories in Gaussian Splatting for Dynamic Scene Reconstruction 4
Sports-Traj: A Unified Trajectory Generation Model for Multi-Agent Movement in Sports 6
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment 6
Spreading Out-of-Distribution Detection on Graphs 5
Spurious Forgetting in Continual Learning of Language Models 5
SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget 5
Stabilized Neural Prediction of Potential Outcomes in Continuous Time 5
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation 3
Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning 6
Stable Segment Anything Model 4
Standard Gaussian Process is All You Need for High-Dimensional Bayesian Optimization 6
Standardizing Structural Causal Models 4
Start Smart: Leveraging Gradients For Enhancing Mask-based XAI Methods 6
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection 5
State Space Models are Provably Comparable to Transformers in Dynamic Token Selection 5
Statistical Advantages of Perturbing Cosine Router in Mixture of Experts 4
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs 0
Stealthy Shield Defense: A Conditional Mutual Information-Based Approach against Black-Box Model Inversion Attacks 6
Steering Large Language Models between Code Execution and Textual Reasoning 4
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction 5
Steering Protein Family Design through Profile Bayesian Flow 3
Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion 4
Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo 5
Stiefel Flow Matching for Moment-Constrained Structure Elucidation 6
StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces 5
Stochastic Bandits Robust to Adversarial Attacks 2
Stochastic Polyak Step-sizes and Momentum: Convergence Guarantees and Practical Performance 5
Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation 3
Stochastic variance-reduced Gaussian variational inference on the Bures-Wasserstein manifold 3
Storybooth: Training-Free Multi-Subject Consistency for Improved Visual Storytelling 3
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs 4
Strategic Classification With Externalities 2
Strategist: Self-improvement of LLM Decision Making via Bi-Level Tree Search 3
Streaming Algorithms For $\ell_p$ Flows and $\ell_p$ Regression 0
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval 3
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge 4
Streamlining Prediction in Bayesian Deep Learning 5
Streamlining Redundant Layers to Compress Large Language Models 5
Strength Estimation and Human-Like Strength Adjustment in Games 5
StringLLM: Understanding the String Processing Capability of Large Language Models 3
Strong Model Collapse 3
Strong Preferences Affect the Robustness of Preference Models and Value Alignment 4
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization 4
Structural-Entropy-Based Sample Selection for Efficient and Effective Learning 5
Structure Language Models for Protein Conformation Generation 6
Structuring Benchmark into Knowledge Graphs to Assist Large Language Models in Retrieving and Designing Models 3
Student-Informed Teacher Training 2
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning 5
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking 4
Subgraph Federated Learning for Local Generalization 7
Subtask-Aware Visual Reward Learning from Segmented Demonstrations 5
Sufficient Context: A New Lens on Retrieval Augmented Generation Systems 4
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization 5
SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction 5
Supervised and Semi-Supervised Diffusion Maps with Label-Driven Diffusion 6
Support is All You Need for Certified VAE Training 6
SurFhead: Affine Rig Blending for Geometrically Accurate 2D Gaussian Surfel Head Avatars 4
Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation 4
Surprising Effectiveness of pretraining Ternary Language Model at Scale 4
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models 5
Swift4D: Adaptive divide-and-conquer Gaussian Splatting for compact and efficient reconstruction of dynamic scene 4
Swing-by Dynamics in Concept Learning and Compositional Generalization 3
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning 5
Sylber: Syllabic Embedding Representation of Speech from Raw Audio 6
SyllableLM: Learning Coarse Semantic Units for Speech Language Models 5
SymDiff: Equivariant Diffusion via Stochastic Symmetrisation 6
Symbolic regression via MDLformer-guided search: from minimizing prediction error to minimizing description length 5
SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models 7
SymmetricDiffusers: Learning Discrete Diffusion on Finite Symmetric Groups 5
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints 3
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints 6
SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning 6
Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning 4
Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling 4
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo 6
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search 5
Synthesizing Realistic fMRI: A Physiological Dynamics-Driven Hierarchical Diffusion Model for Efficient fMRI Acquisition 4
Synthetic continued pretraining 4
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data 5
SysBench: Can LLMs Follow System Message? 3
SysCaps: Language Interfaces for Simulation Surrogates of Complex Systems 5
System 1.x: Learning to Balance Fast and Slow Planning with Language Models 6
Systematic Outliers in Large Language Models 3
Systematic Relational Reasoning With Epistemic Graph Neural Networks 5
Systems with Switching Causal Relations: A Meta-Causal Perspective 3
T-JEPA: Augmentation-Free Self-Supervised Learning for Tabular Data 6
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching 4
T2V-Turbo-v2: Enhancing Video Model Post-Training through Data, Reward, and Conditional Guidance Design 5
T2V2: A Unified Non-Autoregressive Model for Speech Recognition and Synthesis via Multitask Learning 4
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models 5
TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio Motion Embedding and Diffusion Interpolation 6
TASAR: Transfer-based Attack on Skeletal Action Recognition 6
TAU-106K: A New Dataset for Comprehensive Understanding of Traffic Accident 6
TC-MoE: Augmenting Mixture of Experts with Ternary Expert Choice 4
TD-Paint: Faster Diffusion Inpainting Through Time-Aware Pixel Conditioning 5
TDDBench: A Benchmark for Training data detection 5
TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction 5
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data 5
TFG-Flow: Training-free Guidance in Multimodal Generative Flow 6
TGB-Seq Benchmark: Challenging Temporal GNNs with Complex Sequential Dynamics 5
THE ROBUSTNESS OF DIFFERENTIABLE CAUSAL DISCOVERY IN MISSPECIFIED SCENARIOS 3
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation 6
TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models 4
TIPS: Text-Image Pretraining with Spatial awareness 5
TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights 4
TLDR: Token-Level Detective Reward Model for Large Vision Language Models 4
TODO: Enhancing LLM Alignment with Ternary Preferences 4
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models 4
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning 3
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees 5
TRACE: Temporal Grounding Video LLM via Causal Event Modeling 6
TRENDy: Temporal Regression of Effective Nonlinear Dynamics 5
TS-LIF: A Temporal Segment Spiking Neuron Network for Time Series Forecasting 4
TSC-Net: Prediction of Pedestrian Trajectories by Trajectory-Scene-Cell Classification 3
TSVD: Bridging Theory and Practice in Continual Learning with Pre-trained Models 5
TTVD: Towards a Geometric Framework for Test-Time Adaptation Based on Voronoi Diagram 5
TULIP: Token-length Upgraded CLIP 4
TVNet: A Novel Time Series Analysis Method Based on Dynamic Convolution and 3D-Variation 5
TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation 6
TabM: Advancing tabular deep learning with parameter-efficient ensembling 5
TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks 4
TabWak: A Watermark for Tabular Diffusion Models 4
Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling 6
Tailoring Mixup to Data for Calibration 6
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics 4
Taming Overconfidence in LLMs: Reward Calibration in RLHF 5
Taming Transformer Without Using Learning Rate Warmup 5
Tamper-Resistant Safeguards for Open-Weight LLMs 6
Targeted Attack Improves Protection against Unauthorized Diffusion Customization 4
Task Descriptors Help Transformers Learn Linear Models In-Context 1
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling 5
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types 4
Teaching Human Behavior Improves Content Understanding Abilities Of VLMs 4
Teaching LLMs How to Learn with Contextual Fine-Tuning 5
TeaserGen: Generating Teasers for Long Documentaries 5
Tell me about yourself: LLMs are aware of their learned behaviors 1
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval 5
Temporal Difference Learning: Why It Can Be Fast and How It Will Be Faster 1
Temporal Flexibility in Spiking Neural Networks: Towards Generalization Across Time Steps and Deployment Friendliness 6
Temporal Heterogeneous Graph Generation with Privacy, Utility, and Efficiency 5
Temporal Reasoning Transfer from Text to Video 5
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning 3
Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos 5
Test-Time Ensemble via Linear Mode Connectivity: A Path to Better Adaptation 4
Test-time Adaptation for Cross-modal Retrieval with Query Shift 5
Test-time Adaptation for Image Compression with Distribution Regularization 4
Test-time Adaptation for Regression by Subspace Alignment 6
Test-time Alignment of Diffusion Models without Reward Over-optimization 4
TestGenEval: A Real World Unit Test Generation and Test Completion Benchmark 4
TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes 5
TexTailor: Customized Text-aligned Texturing via Effective Resampling 4
Text-to-Image Rectified Flow as Plug-and-Play Priors 5
Text2PDE: Latent Diffusion Models for Accessible Physics Simulation 5
Text4Seg: Reimagining Image Segmentation as Text Generation 5
The "Law" of the Unconscious Contrastive Learner: Probabilistic Alignment of Unpaired Modalities 4
The 3D-PC: a benchmark for visual perspective taking in humans and machines 5
The AdEMAMix Optimizer: Better, Faster, Older 5
The Belief State Transformer 6
The Breakdown of Gaussian Universality in Classification of High-dimensional Linear Factor Mixtures 2
The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG to Noisier EEG 4
The Complexity of Two-Team Polymatrix Games with Independent Adversaries 0
The Computational Complexity of Circuit Discovery for Inner Interpretability 0
The Computational Complexity of Positive Non-Clashing Teaching in Graphs 0
The Crucial Role of Samplers in Online Direct Preference Optimization 4
The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise 5
The Directionality of Optimization Trajectories in Neural Networks 3
The Effectiveness of Curvature-Based Rewiring and the Role of Hyperparameters in GNNs Revisited 4
The Foundations of Tokenization: Statistical and Computational Concerns 0
The Geometry of Categorical and Hierarchical Concepts in Large Language Models 5
The Hidden Cost of Waiting for Accurate Predictions 4
The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation 4
The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws 3
The KoLMogorov Test: Compression by Code Generation 5
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs 5
The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD 4
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling 4
The Optimization Landscape of SGD Across the Feature Learning Strength 5
The Pitfalls of Memorization: When Memorization Hurts Generalization 5
The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions 5
The Ramanujan Library - Automated Discovery on the Hypergraph of Integer Relations 5
The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model 3
The Same but Different: Structural Similarities and Differences in Multilingual Language Modeling 2
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities 2
The Superposition of Diffusion Models Using the Itô Density Estimator 4
The Unreasonable Ineffectiveness of the Deeper Layers 5
The Utility and Complexity of In- and Out-of-Distribution Machine Unlearning 3
The Value of Sensory Information to a Robot 3
The impact of allocation strategies in subset learning on the expressive power of neural networks 3
Theory on Mixture-of-Experts in Continual Learning 6
Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers 1
Theory, Analysis, and Best Practices for Sigmoid Self-Attention 6
ThermalGaussian: Thermal 3D Gaussian Splatting 4
ThinK: Thinner Key Cache by Query-Driven Pruning 4
Think Then React: Towards Unconstrained Action-to-Reaction Motion Generation 5
Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models 5
Think while You Generate: Discrete Diffusion with Planned Denoising 5
Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation 4
ThinkBot: Embodied Instruction Following with Thought Chain Reasoning 6
Three Mechanisms of Feature Learning in a Linear Network 3
Three-in-One: Fast and Accurate Transducer for Hybrid-Autoregressive ASR 5
ThunderKittens: Simple, Fast, and $\textit{Adorable}$ Kernels 5
TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention 6
Tight Clusters Make Specialized Experts 5
Tight Lower Bounds under Asymmetric High-Order Hölder Smoothness and Uniform Convexity 0
Tight Time Complexities in Parallel Stochastic Optimization with Arbitrary Computation Dynamics 1
Tighter Privacy Auditing of DP-SGD in the Hidden State Threat Model 3
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do 2
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts 6
Time-to-Event Pretraining for 3D Medical Imaging 6
TimeInf: Time Series Data Contribution via Influence Functions 4
TimeKAN: KAN-based Frequency Decomposition Learning Architecture for Long-term Time Series Forecasting 4
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis 5
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning 6
Timer-XL: Long-Context Transformers for Unified Time Series Forecasting 5
To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions 4
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning 4
To Code or Not To Code? Exploring Impact of Code in Pre-training 4
To Tackle Adversarial Transferability: A Novel Ensemble Training Method with Fourier Transformation 5
To Trust or Not to Trust? Enhancing Large Language Models' Situated Faithfulness to External Contexts 5
ToVE: Efficient Vision-Language Learning via Knowledge Transfer from Vision Experts 3
ToddlerDiffusion: Interactive Structured Image Generation with Cascaded Schrödinger Bridge 4
Token Statistics Transformer: Linear-Time Attention via Variational Rate Reduction 6
Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models 5
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters 5
Tool-Planner: Task Planning with Clusters across Multiple Tools 5
ToolACE: Winning the Points of LLM Function Calling 4
ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models 3
ToolGen: Unified Tool Retrieval and Calling via Generation 6
TopoDiffusionNet: A Topology-aware Diffusion Model 3
TopoGaussian: Inferring Internal Topology Structures from Visual Clues 2
TopoLM: brain-like spatio-functional organization in a topographic language model 5
TopoNets: High performing vision and language models with brain-like topography 4
Topograph: An Efficient Graph-Based Framework for Strictly Topology Preserving Image Segmentation 4
Topological Blindspots: Understanding and Extending Topological Deep Learning Through the Lens of Expressivity 5
Topological Schrödinger Bridge Matching 5
Topological Zigzag Spaghetti for Diffusion-based Generation and Prediction on Graphs 5
TorchTitan: One-stop PyTorch native solution for production ready LLM pretraining 5
Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization 6
Toward Exploratory Inverse Constraint Inference with Generative Diffusion Verifiers 5
Toward Generalizing Visual Brain Decoding to Unseen Subjects 4
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment 6
Toward Understanding In-context vs. In-weight Learning 5
Towards Auto-Regressive Next-Token Prediction: In-context Learning Emerges from Generalization 4
Towards Automated Knowledge Integration From Human-Interpretable Representations 5
Towards Bridging Generalization and Expressivity of Graph Neural Networks 5
Towards Calibrated Deep Clustering Network 6
Towards Certification of Uncertainty Calibration under Adversarial Attacks 6
Towards Continuous Reuse of Graph Models via Holistic Memory Diversification 5
Towards Domain Adaptive Neural Contextual Bandits 5
Towards Effective Evaluations and Comparisons for LLM Unlearning Methods 7
Towards Empowerment Gain through Causal Structure Learning in Model-Based Reinforcement Learning 5
Towards Explaining the Power of Constant-depth Graph Neural Networks for Structured Linear Programming 6
Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians 4
Towards Faster Decentralized Stochastic Optimization with Communication Compression 6
Towards Federated RLHF with Aggregated Client Preference for LLMs 6
Towards Foundation Models for Mixed Integer Linear Programming 6
Towards General-Purpose Model-Free Reinforcement Learning 6
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations 6
Towards Generalization Bounds of GCNs for Adversarially Robust Node Classification 4
Towards Hierarchical Rectified Flow 6
Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings 3
Towards Improving Exploration through Sibling Augmented GFlowNets 4
Towards Interpreting Visual Information Processing in Vision-Language Models 4
Towards Learning High-Precision Least Squares Algorithms with Sequence Models 4
Towards Marginal Fairness Sliced Wasserstein Barycenter 6
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling 4
Towards Neural Scaling Laws for Time Series Foundation Models 4
Towards Optimal Multi-draft Speculative Decoding 4
Towards Out-of-Modal Generalization without Instance-level Modal Correspondence 5
Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control 3
Towards Realistic Data Generation for Real-World Super-Resolution 5
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology 5
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization 5
Towards Robust Multimodal Open-set Test-time Adaptation via Adaptive Entropy-aware Optimization 6
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs 5
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning 6
Towards Scalable Topological Regularizers 5
Towards Self-Supervised Covariance Estimation in Deep Heteroscedastic Regression 4
Towards Semantic Equivalence of Tokenization in Multimodal LLM 5
Towards Synergistic Path-based Explanations for Knowledge Graph Completion: Exploration and Evaluation 6
Towards Unbiased Learning in Semi-Supervised Semantic Segmentation 6
Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias 3
Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning 5
Towards Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It 5
Towards Understanding the Robustness of Diffusion-Based Purification: A Stochastic Perspective 6
Towards Understanding the Universality of Transformers for Next-Token Prediction 2
Towards Unified Human Motion-Language Understanding via Sparse Interpretable Characterization 4
Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures 2
Towards a Complete Logical Framework for GNN Expressiveness 3
Towards a General Time Series Anomaly Detector with Adaptive Bottlenecks and Dual Adversarial Decoders 5
Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective 4
Towards a Unified and Verified Understanding of Group-Operation Networks 4
Towards a learning theory of representation alignment 0
Towards counterfactual fairness through auxiliary variables 3
Towards hyperparameter-free optimization with differential privacy 4
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies 4
Tracing Representation Progression: Analyzing and Enhancing Layer-Wise Similarity 4
Track-On: Transformer-based Online Point Tracking with Memory 4
Tracking objects that change in appearance with phase synchrony 5
Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images 3
Tractable Multi-Agent Reinforcement Learning through Behavioral Economics 2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models 7
Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-Context 3
Training Free Exponential Context Extension via Cascading KV Cache 6
Training Free Guided Flow-Matching with Optimal Control 4
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis 5
Training Language Models to Self-Correct via Reinforcement Learning 3
Training Large Language Models for Retrieval-Augmented Question Answering through Backtracking Correction 6
Training Neural Networks as Recognizers of Formal Languages 5
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis 2
Training One-Dimensional Graph Neural Networks is NP-Hard 0
Training Robust Ensembles Requires Rethinking Lipschitz Continuity 5
Training on the Test Task Confounds Evaluation and Emergence 4
Training-Free Activation Sparsity in Large Language Models 6
Training-Free Dataset Pruning for Instance Segmentation 7
Training-Free Diffusion Model Alignment with Sampling Demons 5
Training-Free Message Passing for Learning on Hypergraphs 5
Training-free Camera Control for Video Generation 6
Training-free LLM-generated Text Detection by Mining Token Probability Sequences 5
Trajectory attention for fine-grained video motion control 4
Trajectory-Class-Aware Multi-Agent Reinforcement Learning 5
Trajectory-LLM: A Language-based Data Generator for Trajectory Prediction in Autonomous Driving 3
Transformer Block Coupling and its Correlation with Generalization in LLMs 5
Transformer Encoder Satisfiability: Complexity and Impact on Formal Reasoning 0
Transformer Learns Optimal Variable Selection in Group-Sparse Classification 3
Transformer Meets Twicing: Harnessing Unattended Residual Information 5
Transformer-Squared: Self-adaptive LLMs 4
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning 6
Transformers Handle Endogeneity in In-Context Linear Regression 5
Transformers Learn Low Sensitivity Functions: Investigations and Implications 5
Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought 2
Transformers Provably Learn Two-Mixture of Linear Classification via Gradient Flow 3
Transformers Provably Solve Parity Efficiently with Chain of Thought 2
Transformers Struggle to Learn to Search 4
Transformers are Universal In-context Learners 0
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model 3
Transition Path Sampling with Improved Off-Policy Training of Diffusion Path Samplers 6
Tree of Attributes Prompt Learning for Vision-Language Models 5
Tree-Wasserstein Distance for High Dimensional Data with a Latent Feature Hierarchy 6
Triples as the Key: Structuring Makes Decomposition and Verification Easier in LLM-based TableQA 4
Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups 6
Truncated Consistency Models 4
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement 5
Trusted Multi-View Classification via Evolutionary Multi-View Fusion 6
Tuning Frequency Bias of State Space Models 3
Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization 6
Tuning-Free Bilevel Optimization: New Algorithms and Convergence Analysis 3
Turning Up the Heat: Min-p Sampling for Creative and Coherent LLM Outputs 5
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation 4
Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models 5
Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization 6
TypedThinker: Diversify Large Language Model Reasoning with Typed Thinking 4
U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models 1
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models 4
UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models 5
UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition 3
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation 5
UNSURE: self-supervised learning with Unknown Noise level and Stein's Unbiased Risk Estimate 5
URLOST: Unsupervised Representation Learning without Stationarity or Topology 6
UTILITY: Utilizing Explainable Reinforcement Learning to Improve Reinforcement Learning 4
UV-Attack: Physical-World Adversarial Attacks on Person Detection via Dynamic-NeRF-based UV Mapping 6
Ultra-Sparse Memory Network 4
Unbounded: A Generative Infinite Game of Character Life Simulation 3
Uncertainty Herding: One Active Learning Method for All Label Budgets 3
Uncertainty Modeling in Graph Neural Networks via Stochastic Differential Equations 3
Uncertainty and Influence aware Reward Model Refinement for Reinforcement Learning from Human Feedback 5
Uncertainty modeling for fine-tuned implicit functions 6
Uncertainty-Aware Decoding with Minimum Bayes Risk 5
Uncovering Gaps in How Humans and LLMs Interpret Subjective Language 5
Uncovering Latent Memories in Large Language Models 4
Uncovering Overfitting in Large Language Model Editing 2
Underdamped Diffusion Bridges with Applications to Sampling 4
Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning 4
Understanding Factual Recall in Transformers via Associative Memories 2
Understanding Long Videos with Multimodal Language Models 6
Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of Riemannian Geometry 5
Understanding Optimization in Deep Learning with Central Flows 3
Understanding Virtual Nodes: Oversquashing and Node Heterogeneity 4
Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape View 3
Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron 4
Understanding and Enhancing the Transferability of Jailbreaking Attacks 4
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing 4
Understanding and Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention 6
Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study 4
Understanding the Stability-based Generalization of Personalized Federated Learning 4
Unearthing Skill-level Insights for Understanding Trade-offs of Foundation Models 3
Unhackable Temporal Reward for Scalable Video MLLMs 4
Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection 5
Uni-Sign: Toward Unified Sign Language Understanding at Scale 5
UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization 3
UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP 6
UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation 5
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models 5
UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation 5
UniDrive: Towards Universal Driving Perception Across Camera Configurations 2
UniGEM: A Unified Approach to Generation and Property Prediction for Molecules 5
UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting 5
UniMatch: Universal Matching from Atom to Task for Few-Shot Drug Discovery 7
UniRestore3D: A Scalable Framework For General Shape Restoration 4
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation 4
Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers 0
Unified Parameter-Efficient Unlearning for LLMs 5
Unify ML4TSP: Drawing Methodological Principles for TSP and Beyond from Streamlined Design Space of Learning and Search 4
Unifying Causal Representation Learning with the Invariance Principle 3
Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark 6
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization 5
Union-over-Intersections: Object Detection beyond Winner-Takes-All 5
Universal Image Restoration Pre-training via Degradation Classification 6
Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos 5
Universal generalization guarantees for Wasserstein distributionally robust models 0
Unlearn and Burn: Adversarial Machine Unlearning Requests Destroy Model Accuracy 6
Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning 3
Unlearning-based Neural Interpretations 3
Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment 5
Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning 5
Unlocking Efficient, Scalable, and Continual Knowledge Editing with Basis-Level Representation Fine-Tuning 3
Unlocking Global Optimality in Bilevel Optimization: A Pilot Study 3
Unlocking Guidance for Discrete State-Space Diffusion and Flow Models 6
Unlocking Point Processes through Point Set Diffusion 6
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues 5
Unlocking the Potential of Model Calibration in Federated Learning 5
Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning 6
Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model 5
Unsupervised Disentanglement of Content and Style via Variance-Invariance Constraints 5
Unsupervised Meta-Learning via In-Context Learning 5
Unsupervised Model Tree Heritage Recovery 4
Unsupervised Multiple Kernel Learning for Graphs via Ordinality Preservation 5
Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation 4
Unveiling the Magic of Code Reasoning through Hypothesis Decomposition and Amendment 6
Unveiling the Secret Recipe: A Guide For Supervised Fine-Tuning Small LLMs 3
Utilitarian Algorithm Configuration for Infinite Parameter Spaces 4
Utility-Directed Conformal Prediction: A Decision-Aware Framework for Actionable Uncertainty Quantification 4
VAE-Var: Variational Autoencoder-Enhanced Variational Methods for Data Assimilation in Meteorology 6
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text 5
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control 5
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning 5
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation 4
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation 3
VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration 4
VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning 3
VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation 5
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks 4
VLMaterial: Procedural Material Generation with Large Vision-Language Models 7
VOILA: Evaluation of MLLMs For Perceptual Understanding and Analogical Reasoning 5
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis 2
VTDexManip: A Dataset and Benchmark for Visual-tactile Pretraining and Dexterous Manipulation with Reinforcement Learning 5
VVC-Gym: A Fixed-Wing UAV Reinforcement Learning Environment for Multi-Goal Long-Horizon Problems 5
Valid Conformal Prediction for Dynamic GNNs 5
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF 5
Value-aligned Behavior Cloning for Offline Reinforcement Learning via Bi-level Optimization 5
Variance-Reducing Couplings for Random Features 4
Variational Bayesian Pseudo-Coreset 5
Variational Best-of-N Alignment 6
Variational Diffusion Posterior Sampling with Midpoint Guidance 5
Variational Search Distributions 4
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only 5
Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors 4
Vector-ICL: In-context Learning with Continuous Vector Representations 4
Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization 6
Vertical Federated Learning with Missing Features During Training and Inference 3
Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement 3
ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler 4
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation 6
ViSAGe: Video-to-Spatial Audio Generation 4
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models 4
Video Action Differencing 6
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators 5
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision 4
VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing 5
VideoPhy: Evaluating Physical Commonsense for Video Generation 5
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking 5
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks 3
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents 5
Vision CNNs trained to estimate spatial latents learned similar ventral-stream-aligned representations 3
Vision Language Models are In-Context Value Learners 2
Vision and Language Synergy for Rehearsal Free Continual Learning 7
Vision-LSTM: xLSTM as Generic Vision Backbone 4
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures 6
Visual Agents as Fast and Slow Thinkers 5
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs 7
Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark 4
Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning 3
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents 5
VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning 3
Visually Consistent Hierarchical Image Classification 4
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models 5
VoxDialogue: Can Spoken Dialogue Systems Understand Information Beyond Words? 3
W-PCA Based Gradient-Free Proxy for Efficient Search of Lightweight Language Models 6
Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations 4
Ward: Provable RAG Dataset Inference via LLM Watermarks 4
WardropNet: Traffic Flow Predictions via Equilibrium-Augmented Learning 6
Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models 4
Wasserstein Distances, Neuronal Entanglement, and Sparsity 5
Wasserstein-Regularized Conformal Prediction under General Distribution Shift 4
Watch Less, Do More: Implicit Skill Discovery for Video-Conditioned Policy 4
Watermark Anything With Localized Messages 5
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling 5
Wavelet Diffusion Neural Operator 7
Wavelet-based Positional Representation for Long Context 2
Wayward Concepts In Multimodal Models 3
Weak to Strong Generalization for Large Language Models with Multi-capabilities 4
Weak-to-Strong Generalization Through the Data-Centric Lens 5
Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model 5
Weakly Supervised Video Scene Graph Generation via Natural Language Supervision 6
Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors 5
WeatherGFM: Learning a Weather Generalist Foundation Model via In-context Learning 5
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation 7
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning 5
Weighted Multi-Prompt Learning with Description-free Large Language Model Distillation 5
Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric 5
Weighted-Reward Preference Optimization for Implicit Model Fusion 5
What Are Good Positional Encodings for Directed Graphs? 5
What Do You See in Common? Learning Hierarchical Prototypes over Tree-of-Life to Discover Evolutionary Traits 5
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis 2
What Makes Large Language Models Reason in (Multi-Turn) Code Generation? 4
What Makes a Good Diffusion Planner for Decision Making? 5
What Makes a Maze Look Like a Maze? 5
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? 4
What Matters in Learning from Large-Scale Datasets for Robot Manipulation 3
What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models 4
What is Wrong with Perplexity for Long-context Language Modeling? 4
What should a neuron aim for? Designing local objective functions based on information theory 6
What to align in multimodal contrastive learning? 6
What's New in My Data? Novelty Exploration via Contrastive Generation 3
What's the Move? Hybrid Imitation Learning via Salient Points 3
When Attention Sink Emerges in Language Models: An Empirical View 4
When GNNs meet symmetry in ILPs: an orbit-based feature augmentation approach 5
When Graph Neural Networks Meet Dynamic Mode Decomposition 7
When LLMs Play the Telephone Game: Cultural Attractors as Conceptual Tools to Evaluate LLMs in Multi-turn Settings 4
When Prompt Engineering Meets Software Engineering: CNL-P as Natural and Robust "APIs" for Human-AI Interaction 4
When Selection Meets Intervention: Additional Complexities in Causal Discovery 5
When do GFlowNets learn the right distribution? 3
When does compositional structure yield compositional generalization? A kernel theory. 4
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers 2
When narrower is better: the narrow width limit of Bayesian parallel branching neural networks 4
Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction 2
Which Tasks Should Be Compressed Together? A Causal Discovery Approach for Efficient Multi-Task Representation Compression 4
Why Does the Effective Context Length of LLMs Fall Short? 7
Why In-Context Learning Models are Good Few-Shot Learners? 5
Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks 4
Wide Neural Networks Trained with Weight Decay Provably Exhibit Neural Collapse 2
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild 3
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct 4
Words in Motion: Extracting Interpretable Control Vectors for Motion Transformers 4
WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models 5
World Model on Million-Length Video And Language With Blockwise RingAttention 4
X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale 4
X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios 6
X-Fi: A Modality-Invariant Foundation Model for Multimodal Human Sensing 5
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention 3
XAIguiFormer: explainable artificial intelligence guided transformer for brain disorder identification 6
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning 5
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary 6
You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning 5
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANs 5
YouTube-SL-25: A Large-Scale, Open-Domain Multilingual Sign Language Parallel Corpus 5
Youku Dense Caption: A Large-scale Chinese Video Dense Caption Dataset and Benchmarks 6
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data 5
Your Mixture-of-Experts LLM Is Secretly an Embedding Model for Free 4
Your Weak LLM is Secretly a Strong Teacher for Alignment 6
ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish 4
ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention 4
ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models 6
Zero-Shot Natural Language Explanations 4
Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models 5
Zero-cost Proxy for Adversarial Robustness Evaluation 4
Zero-shot Imputation with Foundation Inference Models for Dynamical Systems 5
Zero-shot Model-based Reinforcement Learning using Large Language Models 5
Zero-shot forecasting of chaotic systems 5
ZeroDiff: Solidified Visual-semantic Correlation in Zero-Shot Learning 6
Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity 7
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference 1
Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection 5
ZooProbe: A Data Engine for Evaluating, Exploring, and Evolving Large-scale Training Data for Multimodal LLMs 3
cryoSPHERE: Single-Particle HEterogeneous REconstruction from cryo EM 4
dEBORA: Efficient Bilevel Optimization-based low-Rank Adaptation 4
eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels 5
econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians 5
gRNAde: Geometric Deep Learning for 3D RNA inverse design 6
h4rm3l: A Language for Composable Jailbreak Attack Synthesis 5
kNN Attention Demystified: A Theoretical Exploration for Scalable Transformers 5
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models 5
metabench - A Sparse Benchmark of Reasoning and Knowledge in Large Language Models 5
miniCTX: Neural Theorem Proving with (Long-)Contexts 5
nGPT: Normalized Transformer with Representation Learning on the Hypersphere 4
pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation 5
qNBO: quasi-Newton Meets Bilevel Optimization 5
u-$\mu$P: The Unit-Scaled Maximal Update Parametrization 5
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs 1
xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation 5
$\tau$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains 4