| $O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| $\Delta \mathrm{Energy}$: Optimizing Energy Change During Vision-Language Alignment Improves both OOD Detection and OOD Generalization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| $\Psi$-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| $\boldsymbol{\lambda}$-Orthogonality Regularization for Compatible Representation Learning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| $\epsilon$-Seg: Sparsely Supervised Semantic Segmentation of Microscopy Data |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| $\mu$PC: Scaling Predictive Coding to 100+ Layer Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| $\textit{HiMaCon:}$ Discovering Hierarchical Manipulation Concepts from Unlabeled Multi-Modal Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| $\textit{Hyper-GoalNet}$: Goal-Conditioned Manipulation Policy Learning with HyperNetworks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| $\texttt{BetaConform}$: Efficient MAP Estimation of LLM Ensemble Judgment Performance with Prior Transfer |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| $\texttt{G1}$: Teaching LLMs to Reason on Graphs with Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| $\texttt{STRCMP}$: Integrating Graph Structural Priors with Language Models for Combinatorial Optimization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| $\text{G}^2\text{M}$: A Generalized Gaussian Mirror Method to Boost Feature Selection Power |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| $\text{S}^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| $i$MIND: Insightful Multi-subject Invariant Neural Decoding |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| 1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities |
β |
β
|
β
|
β |
β |
β
|
β
|
4 |
| 1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| 3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| 3D Equivariant Visuomotor Policy Learning via Spherical Projection |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| 3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| 3D Gaussian Splatting based Scene-independent Relocalization with Unidirectional and Bidirectional Feature Fusion |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| 3D Human Pose Estimation with Muscles |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| 3D Interaction Geometric Pre-training for Molecular Relational Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| 3D Visual Illusion Depth Estimation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| 3D-Agent: A Tri-Modal Multi-Agent Responsive Framework for Comprehensive 3D Object Annotation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| 3D-GSRD: 3D Molecular Graph Auto-Encoder with Selective Re-mask Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| 3D-Prover: Diversity Driven Theorem Proving With Determinantal Point Processes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| 3DID: Direct 3D Inverse Design for Aerodynamics with Physics-Aware Optimization |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| 3DOT: Texture Transfer for 3DGS Objects from a Single Reference Image |
β |
β |
β
|
β |
β |
β |
β |
1 |
| 3DPE-Gaze:Unlocking the Potential of 3D Facial Priors for Generalized Gaze Estimation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| 4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| 4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| 4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| 4DGCPro: Efficient Hierarchical 4D Gaussian Compression for Progressive Volumetric Video Streaming |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| 4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| 4KAgent: Agentic Any Image to 4K Super-Resolution |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| 70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11) |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Single-Swap Local Search Algorithm for k-Means of Lines |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| A Bayesian Approach to Contextual Dynamic Pricing using the Proportional Hazards Model with Discrete Price Data |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| A Bayesian Fast-Slow Framework to Mitigate Interference in Non-Stationary Reinforcement Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| A Beyond-Worst-Case Analysis of Greedy k-means++ |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| A Black-Box Debiasing Framework for Conditional Sampling |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| A CLT for Polynomial GNNs on Community-Based Graphs |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| A Cautionary Tale on Integrating Studies with Disparate Outcome Measures for Causal Inference |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| A Circular Argument: Does RoPE need to be Equivariant for Vision? |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| A Clean Slate for Offline Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| A Closed-Form Solution for Fast and Reliable Adaptive Testing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Closer Look at Graph Transformers: Cross-Aggregation and Beyond |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A Closer Look at NTK Alignment: Linking Phase Transitions in Deep Image Regression |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| A Closer Look at TabPFN v2: Understanding Its Strengths and Extending Its Capabilities |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| A Closer Look to Positive-Unlabeled Learning from Fine-grained Perspectives: An Empirical Study |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| A Compressive-Expressive Communication Framework for Compositional Representations |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| A Computationally Viable Numerical Gradient-based Technique for Optimal Covering Problems |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| A Counterfactual Semantics for Hybrid Dynamical Systems |
β |
β
|
β |
β |
β
|
β |
β |
2 |
| A CramΓ©rβvon Mises Approach to Incentivizing Truthful Data Sharing |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| A Data-Driven Prism: Multi-View Source Separation with Diffusion Model Priors |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A Difference-of-Convex Functions Approach to Energy-Based Iterative Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Differential and Pointwise Control Approach to Reinforcement Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and Masking |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| A Driving-Style-Adaptive Framework for Vehicle Trajectory Prediction |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| A Dynamic Learning Strategy for Dempster-Shafer Theory with Applications in Classification and Enhancement |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Fair Federated Learning Method for Handling Client Participation Probability Inconsistencies in Heterogeneous Environments |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| A Few Moments Please: Scalable Graphon Learning via Moment Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1 |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging |
β |
β |
β |
β |
β |
β |
β |
0 |
| A Generalist Intracortical Motor Decoder |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A Generalized Binary Tree Mechanism for Private Approximation of All-Pair Shortest Distances |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications |
β |
β |
β |
β |
β |
β |
β
|
1 |
| A Generalized Label Shift Perspective for Cross-Domain Gaze Estimation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| A Geometric Analysis of PCA |
β |
β |
β |
β |
β |
β |
β |
0 |
| A Geometry-Aware Metric for Mode Collapse in Time Series Generative Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Gradient Guidance Perspective on Stepwise Preference Optimization for Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A Gradient Guided Diffusion Framework for Chance Constrained Programming |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| A Hierarchy of Graphical Models for Counterfactual Inferences |
β
|
β |
β |
β |
β |
β |
β |
1 |
| A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| A Latent Multilayer Graphical Model For Complex, Interdependent Systems |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| A Learning-Augmented Approach to Online Allocation Problems |
β |
β |
β |
β |
β |
β |
β |
0 |
| A Learning-Augmented Dynamic Programming Approach for Orienteering Problem with Time Windows |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| A Markov Decision Process for Variable Selection in Branch & Bound |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| A Minimalist Example of Edge-of-Stability and Progressive Sharpening |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| A Minimalistic Unified Framework for Incremental Learning across Image Restoration Tasks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| A Multimodal BiMamba Network with Test-Time Adaptation for Emotion Recognition Based on Physiological Signals |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Near-Optimal Algorithm for Decentralized Convex-Concave Finite-Sum Minimax Optimization |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| A Near-optimal, Scalable and Parallelizable Framework for Stochastic Bandits Robust to Adversarial Corruptions and Beyond |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| A Novel General Framework for Sharp Lower Bounds in Succinct Stochastic Bandits |
β |
β |
β |
β |
β |
β |
β |
0 |
| A Partition Cover Approach to Tokenization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Physics-preserved Transfer Learning Method for Differential Equations |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| A Plug-and-Play Query Synthesis Active Learning Framework for Neural PDE Solvers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Practical Guide for Incorporating Symmetry in Diffusion Policy |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| A Pre-training Framework for Relational Data with Information-theoretic Principles |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Principled Approach to Randomized Selection under Uncertainty: Applications to Peer Review and Grant Funding |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| A Principled Path to Fitted Distributional Evaluation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| A Private Approximation of the 2nd-Moment Matrix of Any Subsamplable Input |
β
|
β |
β |
β |
β |
β |
β |
1 |
| A Provable Approach for End-to-End Safe Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| A Reinforcement Learning-based Bidding Strategy for Data Consumers in Auction-based Federated Learning |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| A Reliable Cryptographic Framework for Empirical Machine Unlearning Evaluation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Scalable, Causal, and Energy Efficient Framework for Neural Decoding with Spiking Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A Semantic Parsing Framework for End-to-End Time Normalization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A Set of Generalized Components to Achieve Effective Poison-only Clean-label Backdoor Attacks with Collaborative Sample Selection and Triggers |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| A Signed Graph Approach to Understanding and Mitigating Oversmoothing |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| A Simple Linear Patch Revives Layer-Pruned Large Language Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| A Single-Loop First-Order Algorithm for Linearly Constrained Bilevel Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Single-Loop Gradient Algorithm for Pessimistic Bilevel Optimization via Smooth Approximation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| A Stable Whitening Optimizer for Efficient Neural Network Training |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A Statistical Theory of Contrastive Learning via Approximate Sufficient Statistics |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| A TRIANGLE Enables Multimodal Alignment Beyond Cosine Similarity |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A Tale of Two Symmetries: Exploring the Loss Landscape of Equivariant Models |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| A Temporal Difference Method for Stochastic Continuous Dynamics |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A Theoretical Framework for Grokking: Interpolation followed by Riemannian Norm Minimisation |
β |
β
|
β |
β
|
β |
β |
β
|
3 |
| A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Theory for Worst-Case vs. Average-Case Guarantees for LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Unified Analysis of Stochastic Gradient Descent with Arbitrary Data Permutations and Beyond |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A Unified Approach to Submodular Maximization Under Noise |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| A Unified Framework for Fair Graph Generation: Theoretical Guarantees and Empirical Advances |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| A Unified Framework for Provably Efficient Algorithms to Estimate Shapley Values |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| A Unified Framework for Variable Selection in Model-Based Clustering with Missing Not at Random |
β
|
β
|
β
|
β |
β |
β
|
β
|
5 |
| A Unified Framework for the Transportability of Population-Level Causal Measures |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| A Unified Reasoning Framework for Holistic Zero-Shot Video Anomaly Analysis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A Unified Stability Analysis of SAM vs SGD: Role of Data Coherence and Emergence of Simplicity Bias |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| A Unifying View of Linear Function Approximation in Off-Policy RL Through Matrix Splitting and Preconditioning |
β |
β |
β |
β |
β |
β |
β |
0 |
| A data and task-constrained mechanistic model of the mouse outer retina shows robustness to contrast variations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| A faster training algorithm for regression trees with linear leaves, and an analysis of its complexity |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| A geometric framework for momentum-based optimizers for low-rank training |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A learnability analysis on neuro-symbolic learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A machine learning approach that beats Rubik's cubes |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| A multiscale analysis of mean-field transformers in the moderate interaction regime |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| A solvable model of learning generative diffusion: theory and insights |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| A unified framework for establishing the universal approximation of transformer-type architectures |
β |
β |
β |
β |
β |
β |
β |
0 |
| A$^3$E: Towards Compositional Model Editing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| A-Mem: Agentic Memory for LLM Agents |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| AC-LoRA: (Almost) Training-Free Access Control Aware Multi-Modal LLMs |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ADMN: A Layer-Wise Adaptive Multimodal Network for Dynamic Input Noise and Compute Resources |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ADPretrain: Advancing Industrial Anomaly Detection via Anomaly Representation Pretraining |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| AF-UMC: An Alignment-Free Fusion Framework for Unaligned Multi-View Clustering |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| AI Debate Aids Assessment of Controversial Claims |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AI-Generated Video Detection via Perceptual Straightening |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AI-Researcher: Autonomous Scientific Innovation |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| AION-1: Omnimodal Foundation Model for Astronomical Sciences |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ALINE: Joint Amortization for Bayesian Inference and Active Data Acquisition |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for AudioβLanguage Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| APML: Adaptive Probabilistic Matching Loss for Robust 3D Point Cloud Reconstruction |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| AR-RAG: Autoregressive Retrieval Augmentation for Image Generation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| ARECHO: Autoregressive Evaluation via Chain-Based Hypothesis Optimization for Speech Multi-Metric Estimation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ARGenSeg: Image Segmentation with Autoregressive Image Generation Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ARIA: Training Language Agents with Intention-driven Reward Aggregation |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| ARM: Adaptive Reasoning Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ARMesh: Autoregressive Mesh Generation via Next-Level-of-Detail Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ASDSV: Multimodal Generation Made Efficient with Approximate Speculative Diffusion and Speculative Verification |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| ASGO: Adaptive Structured Gradient Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ATLAS: Autoformalizing Theorems through Lifting, Augmentation, and Synthesis of Data |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Absolute Zero: Reinforced Self-play Reasoning with Zero Data |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Absorb and Converge: Provable Convergence Guarantee for Absorbing Discrete Diffusion Models |
β |
β |
β |
β |
β |
β |
β |
0 |
| Abstain Mask Retain Core: Time Series Prediction by Adaptive Masking Loss with Representation Consistency |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Abstract Counterfactuals for Language Model Agents |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Abstract Rendering: Certified Rendering Under 3D Semantic Uncertainty |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Accelerated Distance-adaptive Methods for HΓΆlder Smooth and Convex Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Accelerated Evolving Set Processes for Local PageRank Computation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Accelerated Vertical Federated Adversarial Learning through Decoupling Layer-Wise Dependencies |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Accelerating 3D Molecule Generative Models with Trajectory Diagnosis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Accelerating Block Coordinate Descent for LLM Finetuning via Landscape Expansion |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Accelerating Diffusion LLMs via Adaptive Parallel Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Accelerating Feature Conformal Prediction via Taylor Approximation |
β
|
β
|
β
|
β
|
β
|
β |
β |
5 |
| Accelerating Model-Free Optimization via Averaging of Cost Samples |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Accelerating Optimization via Differentiable Stopping Time |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Accelerating Parallel Diffusion Model Serving with Residual Compression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Accelerating RL for LLM Reasoning with Optimal Advantage Regression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Accelerating Visual-Policy Learning through Parallel Differentiable Simulation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Accelerating data-driven algorithm selection for combinatorial partitioning problems |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Acceleration via silver step-size on Riemannian manifolds with applications to Wasserstein space |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Accident Anticipation via Temporal Occurrence Prediction |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Accurate KV Cache Eviction via Anchor Direction Projection for Efficient LLM Inference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Accurate and Efficient Low-Rank Model Merging in Core Space |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Accurately Predicting Protein Mutational Effects via a Hierarchical Many-Body Attention Network |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Achieving $\tilde{\mathcal{O}}(1/N)$ Optimality Gap in Restless Bandits through Gaussian Approximation |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Achilles' Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Act to See, See to Act: Diffusion-Driven Perception-Action Interplay for Adaptive Policies |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Activated LoRA: Fine-tuned LLMs for Intrinsics |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Activation-Guided Consensus Merging for Large Language Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Activation-Informed Merging of Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Active Measurement: Efficient Estimation at Scale |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Active Seriation: Efficient Ordering Recovery with Statistical Guarantees |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Active Target Discovery under Uninformative Priors: The Power of Permanent and Transient Memory |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Active Test-time Vision-Language Navigation |
β
|
β |
β
|
β
|
β |
β |
β |
3 |
| ActiveVOO: Value of Observation Guided Active Knowledge Acquisition for Open-World Embodied Lifted Regression Planning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Activity Pruning for Efficient Spiking Neural Networks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Actor-Free Continuous Control via Structurally Maximizable Q-Functions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| AcuRank: Uncertainty-Aware Adaptive Computation for Listwise Reranking |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AdaLRS: Loss-Guided Adaptive Learning Rate Search for Efficient Foundation Model Pretraining |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| AdaMSS: Adaptive Multi-Subspace Approach for Parameter-Efficient Fine-Tuning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AdaTS: Learning Adaptive Time Series Representations via Dynamic Soft Contrasts |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| AdaptDel: Adaptable Deletion Rate Randomized Smoothing for Certified Robustness |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AdaptGrad: Adaptive Sampling to Reduce Noise |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Adaptable Safe Policy Learning from Multi-task Data with Constraint Prioritized Decision Transformer |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Adaptive 3D Reconstruction via Diffusion Priors and Forward Curvature-Matching Likelihood Updates |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adaptive Algorithms with Sharp Convergence Rates for Stochastic Hierarchical Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Cannistraci-Hebb Network Automata Modelling of Complex Networks for Path-based Link Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Data Analysis for Growing Data |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Adaptive Data-Borrowing for Improving Treatment Effect Estimation using External Controls |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adaptive Discretization for Consistency Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Distraction: Probing LLM Contextual Robustness with Automated Tree Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Adaptive Fission: Post-training Encoding for Low-latency Spike Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adaptive Gradient Masking for Balancing ID and MLLM-based Representations in Recommendation |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Adaptive Inference-Time Scaling via Cyclic Diffusion Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Kernel Design for Bayesian Optimization Is a Piece of CAKE with LLMs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adaptive Latent-Space Constraints in Personalized Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive LoRA Experts Allocation and Selection for Federated Fine-Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Prediction-Powered AutoEval with Reliability and Efficiency Guarantees |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Preference Arithmetic: A Personalized Agent with Adaptive Preference Arithmetic for Dynamic Preference Modeling |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Adaptive Quantization in Generative Flow Networks for Probabilistic Sequential Prediction |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Adaptive Re-calibration Learning for Balanced Multimodal Intention Recognition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Riemannian ADMM for Nonsmooth Optimization: Optimal Complexity without Smoothing |
β
|
β |
β |
β |
β
|
β
|
β
|
4 |
| Adaptive Sigmoid Clipping for Balancing the DirectionβMagnitude Mismatch Trade-off in Differentially Private Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Adaptive Time Encoding for Irregular Multivariate Time-Series Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adaptive Variance Inflation in Thompson Sampling: Efficiency, Safety, Robustness, and Beyond |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Adaptive and Multi-scale Affinity Alignment for Hierarchical Contrastive Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adaptively Coordinating with Novel Partners via Learned Latent Strategies |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Additive Models Explained: A Computational Complexity Approach |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Addressing Mark Imbalance in Integration-free Marked Temporal Point Processes |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adjacent Words, Divergent Intents: Jailbreaking Large Language Models via Task Concurrency |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adjoint SchrΓΆdinger Bridge Sampler |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Adjusted Count Quantification Learning on Graphs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Adjusting Initial Noise to Mitigate Memorization in Text-to-Image Diffusion Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| AdmTree: Compressing Lengthy Context with Adaptive Semantic Trees |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Adv-SSL: Adversarial Self-Supervised Representation Learning with Theoretical Guarantees |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AdvEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| AdvPrefix: An Objective for Nuanced LLM Jailbreaks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Advanced Sign Language Video Generation with Compressed and Quantized Multi-Condition Tokenization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Advancing Compositional Awareness in CLIP with Efficient Fine-Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Advancing Expert Specialization for Better MoE |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Advancing Interpretability of CLIP Representations with Concept Surrogate Model |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Advancing Machine-Generated Text Detection from an Easy to Hard Supervision Perspective |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adversarial Diffusion for Robust Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Adversarial Graph Fusion for Incomplete Multi-view Semi-supervised Learning with Tensorial Imputation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Adversarial Locomotion and Motion Imitation for Humanoid Policy Learning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Adversarial Robustness of Nonparametric Regression |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Adversarial generalization of unfolding (model-based) networks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Adversary Aware Optimization for Robust Defense |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AegisGuard: RL-Guided Adapter Tuning for TEE-Based Efficient & Secure On-Device Inference |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Affine-Invariant Global Non-Asymptotic Convergence Analysis of BFGS under Self-Concordance |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| AgentAuditor: Human-level Safety and Security Evaluation for LLM Agents |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| AgentNet: Decentralized Evolutionary Coordination for LLM-based Multi-Agent Systems |
β
|
β |
β
|
β
|
β |
β
|
β
|
5 |
| AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Agentic Plan Caching: Test-Time Memory for Fast and Cost-Efficient LLM Agents |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Agentic RL Scaling Law: Spontaneous Code Execution for Mathematical Problem Solving |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Agents Robust to Distribution Shifts Learn Causal World Models Even Under Mediation |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Aggregation Hides Out-of-Distribution Generalization Failures from Spurious Correlations |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Agnostic Active Learning Is Always Better Than Passive Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Agnostic Continuous-Time Online Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Aha! - Predicting What Matters Next: Online Highlight Detection Without Looking Ahead |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Algorithm- and Data-Dependent Generalization Bounds for Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Algorithms and SQ Lower Bounds for Robustly Learning Real-valued Multi-Index Models |
β
|
β |
β |
β |
β |
β |
β |
1 |
| AliO: Output Alignment Matters in Long-Term Time Series Forecasting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Alias-Free ViT: Fractional Shift Invariance via Linear Attention |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Align Your Flow: Scaling Continuous-Time Flow Map Distillation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Document Understanding |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| AlignedGen: Aligning Style Across Generated Images |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Aligning Compound AI Systems via System-level DPO |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Aligning Evaluation with Clinical Priorities: Calibration, Label Shift, and Error Costs |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Aligning Text to Image in Diffusion Models is Easier Than You Think |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Aligning Text-to-Image Diffusion Models to Human Preference by Classification |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Aligning Transformers with Continuous Feedback via Energy Rank Alignment |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Aligning What Matters: Masked Latent Adaptation for Text-to-Audio-Video Generation |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Aligning by Misaligning: Boundary-aware Curriculum Learning for Multimodal Alignment |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Alignment of Large Language Models with Constrained Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| All You Need is One: Capsule Prompt Tuning with a Single Vector |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Alleviating Hallucinations in Large Language Models through Multi-Model Contrastive Decoding and Dynamic Hallucination Detection |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Alligat0R: Pre-Training through Covisibility Segmentation for Relative Camera Pose Regression |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AlphaBeta is not as good as you think: a simple class of synthetic games for a better analysis of deterministic game-solving algorithms |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| AlphaFold Database Debiasing for Robust Inverse Folding |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| AlphaZero Neural Scaling and Zipf's Law: a Tale of Board Games and Power Laws |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| AltLoRA: Towards Better Gradient Approximation in Low-Rank Adaptation with Alternating Projections |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Ambient Diffusion Omni: Training Good Models with Bad Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Ambient Proteins - Training Diffusion Models on Noisy Structures |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Among Us: A Sandbox for Measuring and Detecting Agentic Deception |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AmorLIP: Efficient Language-Image Pretraining via Amortization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Amortized Active Generation of Pareto Sets |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Amortized Sampling with Transferable Normalizing Flows |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Amplifying Prominent Representations in Multimodal Learning via Variational Dirichlet Process |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| An Adaptive Algorithm for Bilevel Optimization on Riemannian Manifolds |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| An Adaptive Quantum Circuit of Dempster's Rule of Combination for Uncertain Pattern Classification |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| An Analysis of Concept Bottleneck Models: Measuring, Understanding, and Mitigating the Impact of Noisy Annotations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| An Analytical Theory of Spectral Bias in the Learning Dynamics of Diffusion Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| An Effective Levelling Paradigm for Unlabeled Scenarios |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| An Efficient Local Search Approach for Polarized Community Discovery in Signed Networks |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| An Efficient Orlicz-Sobolev Approach for Transporting Unbalanced Measures on a Graph |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| An Ellipsoid Algorithm for Online Convex Optimization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| An Evidence-Based Post-Hoc Adjustment Framework for Anomaly Detection Under Data Contamination |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| An Improved Algorithm for Adversarial Linear Contextual Bandits via Reduction |
β
|
β |
β |
β |
β |
β |
β |
1 |
| An Information-theoretical Framework for Understanding Out-of-distribution Detection with Pretrained Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| An Investigation of Memorization Risk in Healthcare Foundation Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| An Iterative Algorithm for Differentially Private $k$-PCA with Adaptive Noise |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| An Optimized Franz-Parisi Criterion and its Equivalence with SQ Lower Bounds |
β |
β |
β |
β |
β |
β |
β |
0 |
| AnaCP: Toward Upper-Bound Continual Learning via Analytic Contrastive Projection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Analog Foundation Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Analogy-based Multi-Turn Jailbreak against Large Language Models |
β |
β
|
β
|
β |
β |
β
|
β
|
4 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Analyzing Fine-Grained Alignment and Enhancing Vision Understanding in Multimodal Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Analyzing Similarity Metrics for Data Selection for Language Model Pretraining |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Analyzing the Power of Chain of Thought through Memorization Capabilities |
β |
β |
β |
β |
β |
β |
β |
0 |
| Anatomically inspired digital twins capture hierarchical object representations in visual cortex |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Anchor-based Maximum Discrepancy for Relative Similarity Testing |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Anchored Diffusion Language Model |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| AngleRoCL: Angle-Robust Concept Learning for Physically View-Invariant Adversarial Patches |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Angles Donβt Lie: Unlocking TrainingβEfficient RL Through the Modelβs Own Signals |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Angular Constraint Embedding via SpherePair Loss for Constrained Clustering |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Angular Steering: Behavior Control via Rotation in Activation Space |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AnimateQR: Bridging Aesthetics and Functionality in Dynamic QR Code Generation |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Anomaly Detection by an Ensemble of Random Pairs of Hyperspheres |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Anti-Aliased 2D Gaussian Splatting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Antidistillation Sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Any-stepsize Gradient Descent for Separable Data under FenchelβYoung Losses |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Anytime-valid, Bayes-assisted, Prediction-Powered Inference |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Approximate Domain Unlearning for Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Approximate Gradient Coding for Distributed Learning with Heterogeneous Stragglers |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Approximately Aligned Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Approximating Shapley Explanations in Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Approximation and Generalization Abilities of Score-based Neural Network Generative Models for Sub-Gaussian Distributions |
β |
β |
β |
β |
β |
β |
β |
0 |
| Approximation theory for 1-Lipschitz ResNets |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ArchCAD-400K: A Large-Scale CAD drawings Dataset and New Baseline for Panoptic Symbol Spotting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Architectural and Inferential Inductive Biases for Exchangeable Sequence Modeling |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Are Greedy Task Orderings Better Than Random in Continual Linear Regression? |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Are Language Models Efficient Reasoners? A Perspective from Logic Programming |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Are Large Language Models Sensitive to the Motives Behind Communication? |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Are Pixel-Wise Metrics Reliable for Computerized Tomography Reconstruction? |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Ascent Fails to Forget |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Ask a Strong LLM Judge when Your Reward Model is Uncertain |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Assessing the quality of denoising diffusion models in Wasserstein distance: noisy score and optimal bounds |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Assignments for Congestion-Averse Agents: Seeking Competitive and Envy-Free Solutions |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Association-Focused Path Aggregation for Graph Fraud Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Asymmetric Dual Self-Distillation for 3D Self-Supervised Representation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Asymmetric Dual-Lens Video Deblurring |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Asymmetric Duos: Sidekicks Improve Uncertainty |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Asymptotic theory of SGD with a general learning-rate |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Asymptotically Stable Quaternion-valued Hopfield-structured Neural Network with Periodic Projection-based Supervised Learning Rules |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Asymptotically exact variational flows via involutive MCMC kernels |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Asymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention Networks |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Atom of Thoughts for Markov LLM Test-Time Scaling |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Atomic Diffusion Models for Small Molecule Structure Elucidation from NMR Spectra |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Atomic Thinking of LLMs: Decoupling and Exploring Mathematical Reasoning Abilities |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Attack by Yourself: Effective and Unnoticeable Multi-Category Graph Backdoor Attacks with Subgraph Triggers Pool |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Attack via Overfitting: 10-shot Benign Fine-tuning to Jailbreak LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Attention (as Discrete-Time Markov) Chains |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Attention Mechanism, Max-Affine Partition, and Universal Approximation |
β |
β
|
β
|
β |
β |
β |
β |
2 |
| Attention Sinks: A 'Catch, Tag, Release' Mechanism for Embeddings |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Attention on the Sphere |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Attention with Trained Embeddings Provably Selects Important Tokens |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Attention! Your Vision Language Model Could Be Maliciously Manipulated |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Attention-based clustering |
β |
β
|
β |
β |
β |
β
|
β
|
3 |
| AttentionPredictor: Temporal Patterns Matter for KV Cache Compression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Attractive Metadata Attack: Inducing LLM Agents to Invoke Malicious Tools |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Attribution-Driven Adaptive Token Pruning for Transformers |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| AudSemThinker: Enhancing Audio-Language Models Through Reasoning over Semantics of Sound |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Audio Super-Resolution with Latent Bridge Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Audio-Sync Video Generation with Multi-Stream Temporal Control |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Auditing Meta-Cognitive Hallucinations in Reasoning Large Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Audits Under Resource, Data, and Access Constraints: Scaling Laws For Less Discriminatory Alternatives |
β |
β |
β |
β
|
β |
β |
β
|
2 |
| AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| AuroRA: Breaking Low-Rank Bottleneck of LoRA with Nonlinear Mapping |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Auto-Compressing Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Auto-Connect: Connectivity-Preserving RigFormer with Direct Preference Optimization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Auto-Search and Refinement: An Automated Framework for Gender Bias Mitigation in Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AutoData: A Multi-Agent System for Open Web Data Collection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| AutoEdit: Automatic Hyperparameter Tuning for Image Editing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| AutoJudge: Judge Decoding Without Manual Annotation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| AutoPartGen: Autoregressive 3D Part Generation and Discovery |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| AutoSciDACT: Automated Scientific Discovery through Contrastive Embedding and Hypothesis Testing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Autoencoding Random Forests |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Automated Composition of Agents: A Knapsack Approach for Agentic Component Selection |
β
|
β |
β
|
β |
β |
β
|
β
|
4 |
| Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Automated Model Discovery via Multi-modal & Multi-step Pipeline |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Automatic Auxiliary Task Selection and Adaptive Weighting Boost Molecular Property Prediction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Automatic Synthetic Data and Fine-grained Adaptive Feature Alignment for Composed Person Retrieval |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Automatic Visual Instrumental Variable Learning for Confounding-Resistant Domain Generalization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Automaton Constrained Q-Learning |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Autoregressive Motion Generation with Gaussian Mixture-Guided Latent Sampling |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Availability-aware Sensor Fusion via Unified Canonical Space |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Avoiding exp(R) scaling in RLHF through Preference-based Exploration |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Axial Neural Networks for Dimension-Free Foundation Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| BADiff: Bandwidth Adaptive Diffusion Model |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| BAM-ICL: Causal Hijacking In-Context Learning with Budgeted Adversarial Manipulation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BIPNN: Learning to Solve Binary Integer Programming via Hypergraph Neural Networks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| BLEUBERI: BLEU is a surprisingly effective reward for instruction following |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BMW: Bidirectionally Memory bank reWriting for Unsupervised Person Re-Identification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BNMusic: Blending Environmental Noises into Personalized Music |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| BaRISTA: Brain Scale Informed Spatiotemporal Representation of Human Intracranial Neural Activity |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Backdoor Cleaning without External Guidance in MLLM Fine-tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Backdoor Mitigation via Invertible Pruning Masks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Backpropagation-Free Test-Time Adaptation via Probabilistic Gaussian Alignment |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Backward Conformal Prediction |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Balanced Active Inference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Balanced Conic Rectified Flow |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Balancing Gradient and Hessian Queries in Non-Convex Optimization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Balancing Multimodal Training Through Game-Theoretic Regularization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Balancing Performance and Costs in Best Arm Identification |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Balancing Positive and Negative Classification Error Rates in Positive-Unlabeled Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bandit Guided Submodular Curriculum for Adaptive Subset Selection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Bandit and Delayed Feedback in Online Structured Prediction |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| BayeSQP: Bayesian Optimization through Sequential Quadratic Programming |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Bayes optimal learning of attention-indexed models |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Bayesian Concept Bottleneck Models with LLM Priors |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Bayesian Ego-graph Inference for Networked Multi-Agent Reinforcement Learning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Bayesian Optimization with Preference Exploration using a Monotonic Neural Network Ensemble |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| BecomingLit: Relightable Gaussian Avatars with Hybrid Neural Shading |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Behavior Injection: Preparing Language Models for Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Belief-Calibrated Multi-Agent Consensus Seeking for Complex NLP Tasks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| BeliefMapNav: 3D Voxel-Based Belief Map for Zero-Shot Object Navigation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Benfordβs Curse: Tracing Digit Bias to Numerical Hallucination in LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Benign Overfitting in Single-Head Attention |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Bernsteinβvon Mises for Adaptively Collected Data |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Best-of-N Jailbreaking |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Better Estimation of the Kullback--Leibler Divergence Between Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Better Language Model Inversion by Compactly Representing Next-Token Distributions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Better NTK Conditioning: A Free Lunch from (ReLU) Nonlinear Activation in Wide Neural Networks |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Better Training Data Attribution via Better Inverse Hessian-Vector Products |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| BevSplat: Resolving Height Ambiguity via Feature-Based Gaussian Primitives for Weakly-Supervised Cross-View Localization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Beyond $\tilde{O}(\sqrt{T})$ Constraint Violation for Online Convex Optimization with Adversarial Constraints |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Beyond Average Value Function in Precision Medicine: Maximum Probability-Driven Reinforcement Learning for Survival Analysis |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Beyond Benign Overfitting in Nadaraya-Watson Interpolators |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Beyond Components: Singular Vector-Based Interpretability of Transformer Circuits |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Beyond Expectations: Quantile-Guided Alignment for Risk-Calibrated Language Models |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Beyond Higher Rank: Token-wise Input-Output Projections for Efficient Low-Rank Adaptation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Beyond Last-Click: An Optimal Mechanism for Ad Attribution |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Beyond Least Squares: Uniform Approximation and the Hidden Cost of Misspecification |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Beyond Modality Collapse: Representation Blending for Multimodal Dataset Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Beyond Node-Centric Modeling: Sketching Signed Networks with Simplicial Complexes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Beyond Oracle: Verifier-Supervision for Instruction Hierarchy in Reasoning and Instruction-Tuned LLMs |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Beyond Pairwise Connections: Extracting High-Order Functional Brain Network Structures under Global Constraints |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Beyond Prediction: Managing the Repercussions of Machine Learning Applications |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Beyond Random: Automatic Inner-loop Optimization in Dataset Distillation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Beyond Scalar Rewards: An Axiomatic Framework for Lexicographic MDPs |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Beyond Scores: Proximal Diffusion Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Beyond Value Functions: Single-Loop Bilevel Optimization under Flatness Conditions |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Beyond Verifiable Rewards: Scaling Reinforcement Learning in Language Models to Unverifiable Data |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Beyond the Average: Distributional Causal Inference under Imperfect Compliance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Beyond the Surface: Enhancing LLM-as-a-Judge Alignment with Human via Internal Representations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BeyondMix: Leveraging Structural Priors and Long-Range Dependencies for Domain-Invariant LiDAR Segmentation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Bi-Directional Communication-Efficient Stochastic FL via Remote Source Generation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Bi-Level Decision-Focused Causal Learning for Large-Scale Marketing Optimization: Bridging Observational and Experimental Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bi-Level Knowledge Transfer for Multi-Task Multi-Agent Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bidirectional Representations Augmented Autoregressive Biological Sequence Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bigram Subnetworks: Mapping to Next Tokens in Transformer Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bilevel Network Learning via Hierarchically Structured Sparsity |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bilevel Optimization for Adversarial Learning Problems: Sharpness, Generation, and Beyond |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Bio-Inspired Image Restoration |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BioCG: Constrained Generative Modeling for Biochemical Interaction Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BioOSS: A Bio-Inspired Oscillatory State System with Spatio-Temporal Dynamics |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bipolar Self-attention for Spiking Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bisecle: Binding and Separation in Continual Learning for Video Language Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bit-swapping Oriented Twin-memory Multi-view Clustering in Lifelong Incomplete Scenarios |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| BitMark: Watermarking Bitwise Autoregressive Image Generative Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bits Leaked per Query: Information-Theoretic Bounds for Adversarial Attacks on LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bivariate Matrix-valued Linear Regression (BMLR): Finite-sample performance under Identifiability and Sparsity Assumptions |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Black-Box Membership Inference Attack for LVLMs via Prior Knowledge-Calibrated Memory Probing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Blackbox Model Provenance via Palimpsestic Membership Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Blameless Users in a Clean Room: Defining Copyright Protection for Generative Models |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Blending Complementary Memory Systems in Hybrid Quadratic-Linear Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Blindfolded Experts Generalize Better: Insights from Robotic Manipulation and Videogames |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Block Coordinate Descent for Neural Networks Provably Finds Global Minima |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Block-Biased Mamba for Long-Range Sequence Processing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Block-Diagonal LoRA for Eliminating Communication Overhead in Tensor Parallel LoRA Serving |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| BlockDecoder: Boosting ASR Decoders with Context and Merger Modules |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BlockScan: Detecting Anomalies in Blockchain Transactions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Blockwise Flow Matching: Improving Flow Matching Models For Efficient High-Quality Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| BlurDM: A Blur Diffusion Model for Image Deblurring |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BlurGuard: A Simple Approach for Robustifying Image Protection Against AI-Powered Editing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| BoltzNCE: Learning likelihoods for Boltzmann Generation with Stochastic Interpolants and Noise Contrastive Estimation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Boosting Adversarial Transferability with Spatial Adversarial Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Boosting Generative Image Modeling via Joint Image-Feature Synthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Boosting Knowledge Utilization in Multimodal Large Language Models via Adaptive Logits Fusion and Attention Reallocation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Boosting Resilience of Large Language Models through Causality-Driven Robust Optimization |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Boosting Skeleton-based Zero-Shot Action Recognition with Training-Free Test-Time Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Boosting the Uniqueness of Neural Networks Fingerprints with Informative Triggers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Bootstrap Off-policy with World Model |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Bootstrap Your Uncertainty: Adaptive Robust Classification Driven by Optimal-Transport |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Bootstrapping Hierarchical Autoregressive Formal Reasoner with Chain-of-Proxy-Autoformalization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Born a Transformer -- Always a Transformer? On the Effect of Pretraining on Architectural Abilities |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Boundary-Value PDEs Meet Higher-Order Differential Topology-aware GNNs |
β |
β
|
β |
β
|
β
|
β
|
β
|
5 |
| Boundary-to-Region Supervision for Offline Safe Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Bounds on the computational complexity of neurons due to dendritic morphology |
β |
β |
β |
β |
β |
β |
β
|
1 |
| BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Brain Harmony: A Multimodal Foundation Model Unifying Morphology and Function into 1D Tokens |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Brain network science modelling of sparse neural networks enables Transformers and LLMs to perform as fully connected |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Brain-Informed Fine-Tuning for Improved Multilingual Understanding in Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Brain-Inspired fMRI-to-Text Decoding via Incremental and Wrap-Up Language Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Brain-Like Processing Pathways Form in Models With Heterogeneous Experts |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Brain-like Variational Inference |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Brain-tuning Improves Generalizability and Efficiency of Brain Alignment in Speech Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| BrainEC-LLM: Brain Effective Connectivity Estimation by Multiscale Mixing LLM |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| BrainFlow: A Holistic Pathway of Dynamic Neural System on Manifold |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| BrainMoE: Cognition Joint Embedding via Mixture-of-Expert Towards Robust Brain Foundation Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BrainODE: Neural Shape Dynamics for Age- and Disease-aware Brain Trajectories |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Breaking ARβs Sampling Bottleneck: Provable Acceleration via Diffusion Language Models |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Breaking Latent Prior Bias in Detectors for Generalizable AIGC Image Detection |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Breaking the Compression Ceiling: Data-Free Pipeline for Ultra-Efficient Delta Compression |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Breaking the Discretization Barrier of Continuous Physics Simulation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Breaking the Frozen Subspace: Importance Sampling for Low-Rank Optimization in LLM Pretraining |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Breaking the Gradient Barrier: Unveiling Large Language Models for Strategic Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Breaking the Order Barrier: Off-Policy Evaluation for Confounded POMDPs |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Breakthrough Sensor-Limited Single View: Towards Implicit Temporal Dynamics for Time Series Domain Adaptation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BridgePure: Limited Protection Leakage Can Break Black-Box Data Protection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Bridging Arbitrary and Tree Metrics via Differentiable Gromov Hyperbolicity |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Bridging Brains and Concepts: Interpretable Visual Decoding from fMRI with Semantic Bottlenecks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bridging Critical Gaps in Convergent Learning: How Representational Alignment Evolves Across Layers, Training, and Distribution Shifts |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bridging Equivariant GNNs and Spherical CNNs for Structured Physical Domains |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Bridging Expressivity and Scalability with Adaptive Unitary SSMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bridging Human and LLM Judgments: Understanding and Narrowing the Gap |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Bridging Scales: Spectral Theory Reveals How Local Connectivity Rules Sculpt Global Neural Dynamics in Spatially Extended Networks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Bridging Sign and Spoken Languages: Pseudo Gloss Generation for Sign Language Translation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bridging Theory and Practice in Link Representation with Graph Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bridging Time and Linguistics: LLMs as Time Series Analyzer through Symbolization and Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bridging the Gap Between Cross-Domain Theory and Practical Application: A Case Study on Molecular Dissolution |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bridging the gap to real-world language-grounded visual concept learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Bringing SAM to new heights: leveraging elevation data for tree crown segmentation from drone imagery |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Buffer layers for Test-Time Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Building 3D Representations and Generating Motions From a Single Image via Video-Generation |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| BΓ©zier Splatting for Fast and Differentiable Vector Graphics Rendering |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| C$^2$Prompt: Class-aware Client Knowledge Interaction for Federated Continual Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| C-LoRA: Contextual Low-Rank Adaptation for Uncertainty Estimation in Large Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| C-NAV: Towards Self-Evolving Continual Object Navigation in Open World |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| C-SafeGen: Certified Safe LLM Generation with Claim-Based Streaming Guardrails |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| C3PO: Optimized Large Language Model Cascades with Probabilistic Cost Constraints for Reasoning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward |
β
|
β |
β |
β
|
β
|
β
|
β
|
5 |
| CADGrasp: Learning Contact and Collision Aware General Dexterous Grasping in Cluttered Scenes |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CADMorph: GeometryβDriven Parametric CAD Editing via a PlanβGenerateβVerify Loop |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CALM-PDE: Continuous and Adaptive Convolutions for Latent Space Modeling of Time-dependent PDEs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CALM: Culturally Self-Aware Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CAMILA: Context-Aware Masking for Image Editing with Language Alignment |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CAMO: Convergence-Aware Multi-Fidelity Bayesian Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| CARE: Decoding-Time Safety Alignment via Rollback and Introspection Intervention |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| CAS-Spec: Cascade Adaptive Self-Speculative Decoding for On-the-Fly Lossless Inference Acceleration of LLMs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| CAT: Circular-Convolutional Attention for Sub-Quadratic Transformers |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| CAT: Content-Adaptive Image Tokenization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CCL: Causal-aware In-context Learning for Out-of-Distribution Generalization |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CDFlow: Building Invertible Layers with Circulant and Diagonal Matrices |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CF-VLM:CounterFactual Vision-Language Fine-tuning |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| CG-SSL: Concept-Guided Self-Supervised Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CGS-GAN: 3D Consistent Gaussian Splatting GANs for High Resolution Human Head Synthesis |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| CHPO: Constrained Hybrid-action Policy Optimization for Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| CHiQPM: Calibrated Hierarchical Interpretable Image Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CIDD: Collaborative Intelligence for Structure-Based Drug Design Empowered by LLMs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| CLAWS:Creativity detection for LLM-generated solutions using Attention Window of Sections |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CLIPTTA: Robust Contrastive Vision-Language Test-Time Adaptation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| CMoB: Modality Valuation via Causal Effect for Balanced Multimodal Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| COLA: Towards Efficient Multi-Objective Reinforcement Learning with Conflict Objective Regularization in Latent Space |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| COME: Adding Scene-Centric Forecasting Control to Occupancy World Model |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| COOPERA: Continual Open-Ended Human-Robot Assistance |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| CORAL: Disentangling Latent Representations in Long-Tailed Diffusion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CORE: Collaborative Optimization with Reinforcement Learning and Evolutionary Algorithm for Floorplanning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| CORE: Reducing UI Exposure in Mobile Agents via Collaboration Between Cloud and Local LLMs |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| COS3D: Collaborative Open-Vocabulary 3D Segmentation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| CPO: Condition Preference Optimization for Controllable Image Generation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CREA: A Collaborative Multi-Agent Framework for Creative Image Editing and Generation |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| CRRL: Learning Channel-invariant Neural Representations for High-performance Cross-day Decoding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CReFT-CAD: Boosting Orthographic Projection Reasoning for CAD via Reinforcement Fine-Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CSBrain: A Cross-scale Spatiotemporal Brain Foundation Model for EEG Decoding |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| CSGO: Content-Style Composition in Text-to-Image Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CSPCL: Category Semantic Prior Contrastive Learning for Deformable DETR-Based Prohibited Item Detectors |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| CTRL-ALT-DECEIT Sabotage Evaluations for Automated AI R&D |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| CTSketch: Compositional Tensor Sketching for Scalable Neurosymbolic Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| CURV: Coherent Uncertainty-Aware Reasoning in Vision-Language Models for X-Ray Report Generation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| CVGL: Causal Learning and Geometric Topology |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CaliGCL: Calibrated Graph Contrastive Learning via Partitioned Similarity and Consistency Discrimination |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Calibrating Translation Decoding with Quality Estimation on LLMs |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| CamEdit: Continuous Camera Parameter Control for Photorealistic Image Editing |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| CamSAM2: Segment Anything Accurately in Camouflaged Videos |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cameras as Relative Positional Encoding |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Can Agent Fix Agent Issues? |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Can Class-Priors Help Single-Positive Multi-Label Learning? |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Can DPO Learn Diverse Human Values? A Theoretical Scaling Law |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Can Dependencies Induced by LLM-Agent Workflows Be Trusted? |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Can Diffusion Models Disentangle? A Theoretical Perspective |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Can Knowledge-Graph-based Retrieval Augmented Generation Really Retrieve What You Need? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Can LLMs Reason Over Non-Text Modalities in a Training-Free Manner? A Case Study with In-Context Representation Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Can Large Language Models Master Complex Card Games? |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Can MLLMs Absorb Math Reasoning Abilities from LLMs as Free Lunch? |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Can Multi-Modal LLMs Provide Live Step-by-Step Task Guidance? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Can NeRFs "See" without Cameras? |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Can We Infer Confidential Properties of Training Data from LLMs? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cancer Survival Analysis via Zero-shot Tumor Microenvironment Segmentation on Low-resolution Whole Slide Pathology Images |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Caption This, Reason That: VLMs Caught in the Middle |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Capturing Individual Human Preferences with Reward Features |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cascaded Language Models for Cost-Effective HumanβAI Decision-Making |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Causal Climate Emulation with Bayesian Filtering |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Causal Differentiating Concepts: Interpreting LM Behavior via Causal Representation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Causal Discovery and Inference through Next-Token Prediction |
β
|
β
|
β |
β
|
β
|
β
|
β
|
6 |
| Causal Discovery over Clusters of Variables in Markovian Systems |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Causal Explanation-Guided Learning for Organ Allocation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Causal LLM Routing: End-to-End Regret Minimization from Observational Data |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Causal Mixture Models: Characterization and Discovery |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Causal Spatio-Temporal Prediction: An Effective and Efficient Multi-Modal Approach |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Causal-R: A Causal-Reasoning Geometry Problem Solver for Optimized Solution Exploration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CausalPFN: Amortized Causal Effect Estimation via In-Context Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CausalVTG: Towards Robust Video Temporal Grounding via Causal Inference |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Causality Meets the Table: Debiasing LLMs for Faithful TableQA via Front-Door Intervention |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Causality-Induced Positional Encoding for Transformer-Based Representation Learning of Non-Sequential Features |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Causally Reliable Concept Bottleneck Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CellCLIP - Learning Perturbation Effects in Cell Painting via Text-Guided Contrastive Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Centralized Reward Agent for Knowledge Sharing and Transfer in Multi-Task Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Certifying Concavity and Monotonicity in Games via Sum-of-Squares Hierarchies |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Certifying Deep Network Risks and Individual Predictions with PAC-Bayes Loss via Localized Priors |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ChA-MAEViT: Unifying Channel-Aware Masked Autoencoders and Multi-Channel Vision Transformers for Improved Cross-Channel Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Chain of Execution Supervision Promotes General Reasoning in Large Language Models |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Chain-of-Model Learning for Language Model |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Chain-of-Retrieval Augmented Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Channel Matters: Estimating Channel Influence for Multivariate Time Series |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Channel Simulation and Distributed Compression with Ensemble Rejection Sampling |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Characterization and Learning of Causal Graphs from Hard Interventions |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Characterizing control between interacting subsystems with deep Jacobian estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Characterizing the Expressivity of Fixed-Precision Transformer Language Models |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| ChatVLA-2: Vision-Language-Action Model with Open-World Reasoning |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| ChatbotID: Identifying Chatbots with Granger Causality Test |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Checklists Are Better Than Reward Models For Aligning Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ChemOrch: Empowering LLMs with Chemical Intelligence via Groundbreaking Synthetic Instructions |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative Search |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ChromFound: Towards A Universal Foundation Model for Single-Cell Chromatin Accessibiltiy Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Class conditional conformal prediction for multiple inputs by p-value aggregation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Class-aware Domain Knowledge Fusion and Fission for Continual Test-Time Adaptation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Class-wise Balancing Data Replay for Federated Class-Incremental Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Clip-and-Verify: Linear Constraint-Driven Domain Clipping for Accelerating Neural Network Verification |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Closed-Form Training Dynamics Reveal Learned Features and Linear Structure in Word2Vec-like Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Cloud4D: Estimating Cloud Properties at a High Spatial and Temporal Resolution |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Clustering via Hedonic Games: New Concepts and Algorithms |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Co-PatcheR: Collaborative Software Patching with Component-specific Small Reasoning Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Co-Regularization Enhances Knowledge Transfer in High Dimensions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Co-Reinforcement Learning for Unified Multimodal Understanding and Generation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| CoC-VLA: Delving into Adversarial Domain Transfer for Explainable Autonomous Driving via Chain-of-Causality Visual-Language-Action Model |
β |
β
|
β
|
β
|
β
|
β |
β |
4 |
| CoCoA: A Minimum Bayes Risk Framework Bridging Confidence and Consistency for Uncertainty Quantification in LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| CoFFT: Chain of Foresight-Focus Thought for Visual Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| CoIDO: Efficient Data Selection for Visual Instruction Tuning via Coupled Importance-Diversity Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CoLT: The conditional localization test for assessing the accuracy of neural posterior estimates |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| CoP: Agentic Red-teaming for Large Language Models using Composition of Principles |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| CoT Information: Improved Sample Complexity under Chain-of-Thought Supervision |
β |
β |
β |
β |
β |
β |
β
|
1 |
| CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| CoUn: Empowering Machine Unlearning via Contrastive Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Coarse-to-Fine 3D Part Assembly via Semantic Super-Parts and Symmetry-Aware Pose Estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Coarse-to-fine Q-Network with Action Sequence for Data-Efficient Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CodeCrash: Exposing LLM Fragility to Misleading Natural Language in Code Reasoning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Codifying Character Logic in Role-Playing |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Cognitive Mirrors: Exploring the Diverse Functional Roles of Attention Heads in LLM Reasoning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Cognitive Predictive Processing: A Human-inspired Framework for Adaptive Exploration in Open-World Reinforcement Learning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Collaborative Reasoner: Self-Improving Social Agents with Synthetic Conversations |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Collaborative and Confidential Junction Trees for Hybrid Bayesian Networks |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| Collapsing Taylor Mode Automatic Differentiation |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Collective Counterfactual Explanations: Balancing Individual Goals and Collective Dynamics |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Color Conditional Generation with Sliced Wasserstein Guidance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Coloring Learning for Heterophilic Graph Representation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| ComPO: Preference Alignment via Comparison Oracles |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| ComRank: Ranking Loss for Multi-Label Complementary Label Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Combinatorial Ski Rental Problem: Robust and Learning-Augmented Algorithms |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Combining Cost Constrained Runtime Monitors for AI Safety |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Compact Memory for Continual Logistic Regression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Comparator-Adaptive $\Phi$-Regret: Improved Bounds, Simpler Algorithms, and Applications to Games |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Comparing Uniform Price and Discriminatory Multi-Unit Auctions through Regret Minimization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Competitive Advantage Attacks to Decentralized Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Complete Structure Guided Point Cloud Completion via Cluster- and Instance-Level Contrastive Learning |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Complexity Scaling Laws for Neural Models using Combinatorial Optimization |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Composing Global Solutions to Reasoning Tasks via Algebraic Objects in Neural Nets |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Composing Linear Layers from Irreducibles |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Composition and Alignment of Diffusion Models using Constrained Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Compositional Monte Carlo Tree Diffusion for Extendable Planning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Compositional Neural Network Verification via Assume-Guarantee Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Compositional Reasoning with Transformers, RNNs, and Chain of Thought |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Compress & Cache: Vision token compression for efficient generation and retrieval |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Compress Large Language Models via Collaboration Between Learning and Matrix Approximation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Compress, Gather, and Recompute: REFORMing Long-Context Processing in Transformers |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Computable universal online learning |
β |
β |
β |
β |
β |
β |
β |
0 |
| Computation and Memory-Efficient Model Compression with Gradient Reweighting |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Computational Algebra with Attention: Transformer Oracles for Border Basis Algorithms |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Computational Budget Should Be Considered in Data Selection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Computational Efficiency under Covariate Shift in Kernel Ridge Regression |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Computational Hardness of Reinforcement Learning with Partial $q^{\pi}$-Realizability |
β |
β |
β |
β |
β |
β |
β |
0 |
| Compute-Optimal Scaling for Value-Based Deep RL |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ConTextTab: A Semantics-Aware Tabular In-Context Learner |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Concentration and excess risk bounds for imbalanced classification with synthetic oversampling |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Concept Incongruence: An Exploration of Time and Death in Role Playing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Concept-Guided Interpretability via Neural Chunking |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Conditional Diffusion Anomaly Modeling on Graphs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Conditional Distribution Compression via the Kernel Conditional Mean Embedding |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Conditional Forecasts and Proper Scoring Rules for Reliable and Accurate Performative Predictions |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Conditional Gradient Methods with Standard LMO for Stochastic Simple Bilevel Optimization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Conditional Panoramic Image Generation via Masked Autoregressive Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Conditional Representation Learning for Customized Tasks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Conditioning Matters: Training Diffusion Policies is Faster Than You Think |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| ConfTuner: Training Large Language Models to Express Their Confidence Verbally |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Confidence-Aware With Prototype Alignment for Partial Multi-label Learning |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Conflict-Aware Knowledge Editing in the Wild: Semantic-Augmented Graph Representation for Unstructured Text |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Conformal Arbitrage: Risk-Controlled Balancing of Competing Objectives in Language Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Conformal Inference under High-Dimensional Covariate Shifts via Likelihood-Ratio Regularization |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Conformal Information Pursuit for Interactively Guiding Large Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Conformal Linguistic Calibration: Trading-off between Factuality and Specificity |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Conformal Mixed-Integer Constraint Learning with Feasibility Guarantees |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Conformal Online Learning of Deep Koopman Linear Embeddings |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Conformal Prediction Beyond the Seen: A Missing Mass Perspective for Uncertainty Quantification in Generative Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Conformal Prediction for Causal Effects of Continuous Treatments |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Conformal Prediction for Time-series Forecasting with Change Points |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Conformal Prediction in The Loop: A Feedback-Based Uncertainty Model for Trajectory Optimization |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Conformal Prediction under LΓ©vy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Conformal Risk Training: End-to-End Optimization of Conformal Risk Control |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Confounding Robust Deep Reinforcement Learning: A Causal Approach |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Confusion-Driven Self-Supervised Progressively Weighted Ensemble Learning for Non-Exemplar Class Incremental Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Connecting JensenβShannon and KullbackβLeibler Divergences: A New Bound for Representation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Connecting Neural Models Latent Geometries with Relative Geodesic Representations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Connectome-Based Modelling Reveals Orientation Maps in the Drosophila Optic Lobe |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Consensus-Robust Transfer Attacks via Parameter and Representation Perturbations |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Consistency Conditions for Differentiable Surrogate Losses |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Consistency of Physics-Informed Neural Networks for Second-Order Elliptic Equations |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Consistency of the $k_n$-nearest neighbor rule under adaptive sampling |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Consistent Story Generation: Unlocking the Potential of Zigzag Sampling |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Constant Bit-size Transformers Are Turing Complete |
β |
β |
β |
β |
β |
β |
β |
0 |
| Constrained Best Arm Identification |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Constrained Diffusers for Safe Planning and Control |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Constrained Discrete Diffusion |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Constrained Linear Thompson Sampling |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Constrained Optimization From a Control Perspective via Feedback Linearization |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Constrained Posterior Sampling: Time Series Generation with Hard Constraints |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Constrained Sampling for Language Models Should Be Easy: An MCMC Perspective |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Constructing an Optimal Behavior Basis for the Option Keyboard |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Contact Map Transfer with Conditional Diffusion Model for Generalizable Dexterous Grasp Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Context-Aware Regularization with Markovian Integration for Attention-Based Nucleotide Analysis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ContextAgent: Context-Aware Proactive LLM Agents with Open-world Sensory Perceptions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Contextual Dynamic Pricing with Heterogeneous Buyers |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Contextual Integrity in LLMs via Reasoning and Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Contextual Online Pricing with (Biased) Offline Data |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Contextual Thompson Sampling via Generation of Missing Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Contextual Tokenization for Graph Inverted Indices |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Contimask: Explaining Irregular Time Series via Perturbations in Continuous Time |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Continual Gaussian Mixture Distribution Modeling for Class Incremental Semantic Segmentation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Continual Knowledge Adaptation for Reinforcement Learning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Continual Model Merging without Data: Dual Projections for Balancing Stability and Plasticity |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Continual Multimodal Contrastive Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Continual Optimization with Symmetry Teleportation for Multi-Task Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Continual Release Moment Estimation with Differential Privacy |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Continuity and Isolation Lead to Doubts or Dilemmas in Large Language Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Continuous Concepts Removal in Text-to-image Diffusion Models |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Continuous Diffusion Model for Language Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Continuous Domain Generalization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Continuous Simplicial Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Continuous Soft Actor-Critic: An Off-Policy Learning Method Robust to Time Discretization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Continuous Subspace Optimization for Continual Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Continuous Thought Machines |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Continuous-time Riemannian SGD and SVRG Flows on Wasserstein Probabilistic Space |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Contrastive Consolidation of Top-Down Modulations Achieves Sparsely Supervised Continual Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Contrastive Learning with Data Misalignment: Feature Purity, Training Dynamics and Theoretical Generalization Guarantees |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Contrastive Representations for Temporal Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Contrastive Self-Supervised Learning As Neural Manifold Packing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Contribution of task-irrelevant stimuli to drift of neural representations |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ControlFusion: A Controllable Image Fusion Network with Language-Vision Degradation Prompts |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Controllable 3D Molecular Generation for Structure-Based Drug Design Through Bayesian Flow Networks and Gradient Integration |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Controllable Human-centric Keyframe Interpolation with Generative Prior |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Controlled Visual Hallucination via Thalamus-Driven Decoupling Network for Domain Adaptation of Black-Box Predictors |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Controlling The Spread of Epidemics on Networks with Differential Privacy |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Controlling Thinking Speed in Reasoning Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Controlling the Flow: Stability and Convergence for Stochastic Gradient Descent with Decaying Regularization |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Convergence Rates for Gradient Descent on the Edge of Stability for Overparametrised Least Squares |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Convergence Rates of Constrained Expected Improvement |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Convergence of Clipped SGD on Convex $(L_0,L_1)$-Smooth Functions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Convergence of the Gradient Flow for Shallow ReLU Networks on Weakly Interacting Data |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Convergent Functions, Divergent Forms |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Convex Approximation of Two-Layer ReLU Networks for Hidden State Differential Privacy |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Convex Potential Mirror Langevin Algorithm for Efficient Sampling of Energy-Based Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Convolution Goes Higher-Order: A Biologically Inspired Mechanism Empowers Image Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cooperative Bargaining Games Without Utilities: Mediated Solutions from Direction Oracles |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Cooperative Retrieval-Augmented Generation for Question Answering: Mutual Information Exchange and Ranking by Contrasting Layers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Copresheaf Topological Neural Networks: A Generalized Deep Learning Framework |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment |
β |
β |
β
|
β |
β
|
β |
β |
2 |
| Coreset for Robust Geometric Median: Eliminating Size Dependency on Outliers |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Coresets for Clustering Under Stochastic Noise |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Corporate Needs You to Find the Difference: Revisiting Submodular and Supermodular Ratio Optimization Problems |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Correcting misinterpretations of additive models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Corrector Sampling in Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Correlated Low-Rank Adaptation for ConvNets |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Correlation Dimension of Autoregressive Large Language Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Cosmos: Compressed and Smooth Latent Space for Text Diffusion Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cost-Aware Contrastive Routing for LLMs |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Cost-Sensitive Freeze-thaw Bayesian Optimization for Efficient Hyperparameter Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Cost-aware LLM-based Online Dataset Annotation |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Counteractive RL: Rethinking Core Principles for Efficient and Scalable Deep Reinforcement Learning |
β
|
β
|
β
|
β |
β |
β |
β |
3 |
| Counterfactual Evolution of Multimodal Datasets via Visual Programming |
β
|
β
|
β
|
β
|
β |
β |
β |
4 |
| Counterfactual Identifiability via Dynamic Optimal Transport |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Counterfactual Image Editing with Disentangled Causal Latent Space |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Counterfactual Implicit Feedback Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Counterfactual reasoning: an analysis of in-context emergence |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Coupled Data and Measurement Space Dynamics for Enhanced Diffusion Posterior Sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Coupling Generative Modeling and an Autoencoder with the Causal Bridge |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Covariances for Free: Exploiting Mean Distributions for Training-free Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Covariate-moderated Empirical Bayes Matrix Factorization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Covering Multiple Objectives with a Small Set of Solutions Using Bayesian Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Creativity or Brute Force? Using Brainteasers as a Window into the Problem-Solving Abilities of Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Credal Prediction based on Relative Likelihood |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| CroPe: Cross-Modal Semantic Compensation Adaptation for All Adverse Scene Understanding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Cross City Traffic Flow Generation via Retrieval Augmented Diffusion Model |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cross-Modal Representational Knowledge Distillation for Enhanced Spike-informed LFP Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cross-fluctuation phase transitions reveal sampling dynamics in diffusion models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Cross-modal Associations in Vision and Language Models: Revisiting the Bouba-Kiki Effect |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| CrossAD: Time Series Anomaly Detection with Cross-scale Associations and Cross-window Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CrossSpectra: Exploiting Cross-Layer Smoothness for Parameter-Efficient Fine-Tuning |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Crucible: Quantifying the Potential of Control Algorithms through LLM Agents |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Ctrl-DNA: Controllable Cell-Type-Specific Regulatory DNA Design via Constrained RL |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Curious Causality-Seeking Agents in Open-ended Worlds |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Curl Descent : Non-Gradient Learning Dynamics with Sign-Diverse Plasticity |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Curly Flow Matching for Learning Non-gradient Field Dynamics |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Curriculum Abductive Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Curriculum Model Merging: Harmonizing Chemical LLMs for Enhanced Cross-Task Generalization |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Curvature Tuning: Provable Training-free Model Steering From a Single Parameter |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| CyIN: Cyclic Informative Latent Space for Bridging Complete and Incomplete Multimodal Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Cyclic Counterfactuals under ShiftβScale Interventions |
β |
β |
β |
β |
β |
β |
β |
0 |
| CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Cypher-RI: Reinforcement Learning for Integrating Schema Selection into Cypher Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| D-VST: Diffusion Transformer for Pathology-Correct Tone-Controllable Cross-Dye Virtual Staining of Whole Slide Images |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| D2SA: Dual-Stage Distribution and Slice Adaptation for Efficient Test-Time Adaptation in MRI Reconstruction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DAA: Amplifying Unknown Discrepancy for Test-Time Discovery |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| DAAC: Discrepancy-Aware Adaptive Contrastive Learning for Medical Time series |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DAPO : Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage-Based Policy Optimization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| DAPO: An Open-Source LLM Reinforcement Learning System at Scale |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| DBLoss: Decomposition-based Loss Function for Time Series Forecasting |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DCA: Graph-Guided Deep Embedding Clustering for Brain Atlases |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DCI: Dual-Conditional Inversion for Boosting Diffusion-Based Image Editing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DEAL: Diffusion Evolution Adversarial Learning for Sim-to-Real Transfer |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| DEGauss: Defending Against Malicious 3D Editing for Gaussian Splatting |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| DERD-Net: Learning Depth from Event-based Ray Densities |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DEXTER: Diffusion-Guided EXplanations with TExtual Reasoning for Vision Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| DGH: Dynamic Gaussian Hair |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| DIFFSSR: Stereo Image Super-resolution Using Differential Transformer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DINGO: Constrained Inference for Diffusion LLMs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DINO-Foresight: Looking into the Future with DINO |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DISC: Dynamic Decomposition Improves LLM Inference Scaling |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| DISCO: DISCrete nOise for Conditional Control in Text-to-Image Diffusion Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| DISCO: Disentangled Communication Steering for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| DIsoN: Decentralized Isolation Networks for Out-of-Distribution Detection in Medical Imaging |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DKDR: Dynamic Knowledge Distillation for Reliability in Federated Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DLoFT: Gradient-Decoupled Fine-Tuning for Generalizable Long Chain-of-Thought Reasoning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| DMWM: Dual-Mind World Model with Long-Term Imagination |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| DMol: A Highly Efficient and Chemical Motif-Preserving Molecule Generation Platform |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DNA-DetectLLM: Unveiling AI-Generated Text via a DNA-Inspired Mutation-Repair Paradigm |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| DONβT NEED RETRAINING: A Mixture of DETR and Vision Foundation Models for Cross-Domain Few-Shot Object Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DOTA: Distributional Test-time Adaptation of Vision-Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| DOVTrack: Data-Efficient Open-Vocabulary Tracking |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| DP-LLM: Runtime Model Adaptation with Dynamic Layer-wise Precision Assignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DPA: A one-stop metric to measure bias amplification in classification datasets |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DPAIL: Training Diffusion Policy for Adversarial Imitation Learning without Policy Optimization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| DPΒ²O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DREAM: Drafting with Refined Target Features and Entropy-Adaptive Cross-Attention Fusion for Multimodal Speculative Decoding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| DSAS: A Universal Plug-and-Play Framework for Attention Optimization in Multi-Document Question Answering |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| DSCS: Fast CPDAG-Based Verification of Collapsible Submodels in High-Dimensional Bayesian Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DSRF: A Dynamic and Scalable Reasoning Framework for Solving RPMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DUAL: Learning Diverse Kernels for Aggregated Two-sample and Independence Testing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DUET: Dual-Perspective Pseudo Labeling and Uncertainty-aware Exploration & Exploitation Training for Source-Free Domain Adaptation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DUO: No Compromise to Accuracy Degradation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Data Fusion for Partial Identification of Causal Effects |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Data Mixing Can Induce Phase Transitions in Knowledge Acquisition |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Data Selection Matters: Towards Robust Instruction Tuning of Large Multimodal Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Data-Adaptive Exposure Thresholds under Network Interference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Data-Dependent Regret Bounds for Constrained MABs |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Data-Free Model Extraction for Black-box Recommender Systems via Graph Convolutions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DataRater: Meta-Learned Dataset Curation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Dataset Distillation for Pre-Trained Self-Supervised Vision Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Dataset Distillation of 3D Point Clouds via Distribution Matching |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DeCaFlow: A deconfounding causal generative model |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DePass: Unified Feature Attributing by Simple Decomposed Forward Pass |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Debate or Vote: Which Yields Better Decisions in Multi-Agent Large Language Models? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DeblurDiff: Real-Word Image Deblurring with Generative Diffusion Models |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| Decentralized Dynamic Cooperation of Personalized Models for Federated Continual Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Deciphering the Extremes: A Novel Approach for Pathological Long-tailed Recognition in Scientific Discovery |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Decoding Causal Structure: End-to-End Mediation Pathways Inference |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| DecompNet: Enhancing Time Series Forecasting Models with Implicit Decomposition |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Decomposing Interventional Causality into Synergistic, Redundant, and Unique Components |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Decomposing motor units through elimination for real-time intention driven assistive neurotechnology |
β
|
β |
β |
β |
β
|
β
|
β
|
4 |
| Decomposing stimulus-specific sensory neural information via diffusion models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Decoupled Entropy Minimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Decreasing Entropic Regularization Averaged Gradient for Semi-Discrete Optimal Transport |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Deep Compositional Phase Diffusion for Long Motion Sequence Generation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Deep Continuous-Time State-Space Models for Marked Event Sequences |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Deep Edge Filter: Return of the Human-Crafted Layer in Deep Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Deep Gaussian from Motion: Exploring 3D Geometric Foundation Models for Gaussian Splatting |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Deep Learning with Plausible Deniability |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Deep Legendre Transform |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Deep Taxonomic Networks for Unsupervised Hierarchical Prototype Discovery |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Deep Tree Tensor Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Deep Value Benchmark: Measuring Whether Models Generalize Deep Values or Shallow Preferences |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Deep learning for continuous-time stochastic control with jumps |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| DeepDiver: Adaptive Web-Search Intensity Scaling via Reinforcement Learning |
β |
β |
β
|
β
|
β |
β
|
β
|
4 |
| DeepHalo: A Neural Choice Model with Controllable Context Effects |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Deeper with Riemannian Geometry: Overcoming Oversmoothing and Oversquashing for Graph Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Deferring Concept Bottleneck Models: Learning to Defer Interventions to Inaccurate Experts |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Defining and Discovering Hyper-meta-paths for Heterogeneous Hypergraphs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Degradation-Aware Dynamic SchrΓΆdinger Bridge for Unpaired Image Restoration |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Degrees of Freedom for Linear Attention: Distilling Softmax Attention with Optimal Feature Efficiency |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DeltaFormer: Unlock the state space of Transformer |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DeltaPhi: Physical States Residual Learning for Neural Operators in Data-Limited PDE Solving |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DeltaProduct: Improving State-Tracking in Linear RNNs via Householder Products |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Delving into Large Language Models for Effective Time-Series Anomaly Detection |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Democratizing Clinical Risk Prediction with Cross-Cohort Cross-Modal Knowledge Transfer |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Demystifying Language Model Forgetting with Low-rank Example Associations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Demystifying Spectral Feature Learning for Instrumental Variable Regression |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Deno-IF: Unsupervised Noisy Visible and Infrared Image Fusion Method |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Denoising Trajectory Biases for Zero-Shot AI-Generated Image Detection |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Dense Associative Memory with Epanechnikov Energy |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Dense Backpropagation Improves Training for Sparse Mixture-of-Experts |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Dense Metric Depth Estimation via Event-based Differential Focus Volume Prompting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Dense SAE Latents Are Features, Not Bugs |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Density Ratio-Free Doubly Robust Proxy Causal Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Dependency Matters: Enhancing LLM Reasoning with Explicit Knowledge Grounding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Dependency Parsing is More Parameter-Efficient with Normalization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Deployment Efficient Reward-Free Exploration with Linear Function Approximation |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Depth-Bounds for Neural Networks via the Braid Arrangement |
β |
β |
β |
β |
β |
β |
β |
0 |
| Depth-Supervised Fusion Network for Seamless-Free Image Stitching |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Depth-Width Tradeoffs for Transformers on Graph Tasks |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-based Decoding |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Design-Based Bandits Under Network Interference: Trade-Off Between Regret and Statistical Inference |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Detecting Data Deviations in Electronic Health Records |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Detecting Generated Images by Fitting Natural Image Distributions |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Detecting High-Stakes Interactions with Activation Probes |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DevFD : Developmental Face Forgery Detection by Learning Shared and Orthogonal LoRA Subspaces |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DiCoFlex: Model-Agnostic Diverse Counterfactuals with Flexible Control |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| DictPFL: Efficient and Private Federated Learning on Encrypted Gradients |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| DiffBreak: Is Diffusion-Based Purification Robust? |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DiffE2E: Rethinking End-to-End Driving with a Hybrid Diffusion-Regression-Classification Policy |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| DiffLiG: Diffusion-enhanced Liquid Graph with Attention Propagation for Grid-to-Station Precipitation Correction |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Differentiable Constraint-Based Causal Discovery |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Differentiable Cyclic Causal Discovery Under Unmeasured Confounders |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Differentiable Decision Tree via "ReLU+Argmin" Reformulation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Differentiable Generalized Sliced Wasserstein Plans |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Differentiable Hierarchical Visual Tokenization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Differentiable Sparsity via $D$-Gating: Simple and Versatile Structured Penalization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Differentiable Structure Learning and Causal Discovery for General Binary Data |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Differentiable extensions with rounding guarantees for combinatorial optimization over permutations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Differential Privacy for Euclidean Jordan Algebra with Applications to Private Symmetric Cone Programming |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Differential Privacy on Fully Dynamic Streams |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Differentially Private Bilevel Optimization: Efficient Algorithms with Near-Optimal Rates |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Differentially Private Federated Low Rank Adaptation Beyond Fixed-Matrix |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Differentially Private Gomory-Hu Trees |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Differentially Private High-dimensional Variable Selection via Integer Programming |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Differentially Private Quantiles with Smaller Error |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Differentially Private Relational Learning with Entity-level Privacy Guarantees |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Differentiation Through Black-Box Quadratic Programming Solvers |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Diffusing DeBias: Synthetic Bias Amplification for Model Debiasing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Diffusion Beats Autoregressive in Data-Constrained Settings |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Diffusion Feature Field for Text-based 3D Editing with Gaussian Splatting |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Diffusion Federated Dataset |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Diffusion Generative Modeling on Lie Group Representations |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Diffusion Guided Adversarial State Perturbations in Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Diffusion Models Meet Contextual Bandits |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Diffusion Network Inference for Cross-layer Cascades |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Diffusion Transformers as Open-World Spatiotemporal Foundation Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Diffusion Tree Sampling: Scalable inferenceβtime alignment of diffusion models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Diffusion on Demand: Selective Caching and Modulation for Efficient Generation |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Diffusion-Based Hierarchical Graph Neural Networks for Simulating Nonlinear Solid Mechanics |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Diffusion-Classifier Synergy: Reward-Aligned Learning via Mutual Boosting Loop for FSCIL |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Diffusion-Driven Progressive Target Manipulation for Source-Free Domain Adaptation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Diffusion-Guided Graph Data Augmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dimension-adapted Momentum Outscales SGD |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Dimension-free Score Matching and Time Bootstrapping for Diffusion Models |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Dimensional Collapse in VQVAEs: Evidence and Remedies |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Dimensionality Mismatch Between Brains and Artificial Neural Networks |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Direct Alignment with Heterogeneous Preferences |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Direct Fisher Score Estimation for Likelihood Maximization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| Directed-Tokens: A Robust Multi-Modality Alignment Approach to Large Language-Vision Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DisMo: Disentangled Motion Representations for Open-World Motion Transfer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Discovering Compositional Hallucinations in LVLMs |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Discovering Data Structures: Nearest Neighbor Search and Beyond |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Discovering Important Experts for Mixture-of-Experts Models Pruning Through a Theoretical Perspective |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Discovering Latent Graphs with GFlowNets for Diverse Conditional Image Generation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Discovering Opinion Intervals from Conflicts in Signed Graphs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Discovering Symbolic Partial Differential Equation by Abductive Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Discrete Neural Flow Samplers with Locally Equivariant Transformer |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Discretization-free Multicalibration through Loss Minimization over Tree Ensembles |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition |
β |
β
|
β
|
β
|
β
|
β |
β |
4 |
| Disentangled Cross-Modal Representation Learning with Enhanced Mutual Supervision |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Disentangled Representation Learning via Modular Compositional Bias |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Disentangling Hyperedges through the Lens of Category Theory |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Disentangling Latent Shifts of In-Context Learning with Weak Supervision |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Disentangling Superpositions: Interpretable Brain Encoding Model with Sparse Concept Atoms |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Disentangling misreporting from genuine adaptation in strategic settings: a causal approach |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor Search |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Distance-informed Neural Processes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Distances for Markov chains from sample streams |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Distil-E2D: Distilling Image-to-Depth Priors for Event-Based Monocular Depth Estimation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Distillation Robustifies Unlearning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Distilling LLM Agent into Small Models with Retrieval and Code Tools |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Distilling LLM Prior to Flow Model for Generalizable Agentβs Imagination in Object Goal Navigation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences? |
β |
β |
β |
β |
β |
β |
β |
0 |
| Distributed Multi-Agent Bandits Over ErdΕs-RΓ©nyi Random Networks |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Distributed mediation analysis with communication efficiency |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Distribution Learning Meets Graph Structure Sampling |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Distribution-Aligned Decoding for Efficient LLM Task Adaptation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Distribution-Aware Tensor Decomposition for Compression of Convolutional Neural Networks |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Distributional Adversarial Attacks and Training in Deep Hedging |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Distributional Autoencoders Know the Score |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Distributional LLM-as-a-Judge |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Distributional Training Data Attribution: What do Influence Functions Sample? |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Distributionally Robust Feature Selection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Distributionally Robust Performative Optimization |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Distributive Fairness in Large Language Models: Evaluating Alignment with Human Values |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Ditch the Denoiser: Emergence of Noise Robustness in Self-Supervised Learning from Data Curriculum |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Diverse Influence Component Analysis: A Geometric Approach to Nonlinear Mixture Identifiability |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Diversifying Parallel Ergodic Search: A Signature Kernel Evolution Strategy |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Diversity-Aware Policy Optimization for Large Language Model Reasoning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Diversity-oriented Deep Multi-modal Clustering |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Do LVLMs Truly Understand Video Anomalies? Revealing Hallucination via Co-Occurrence Patterns |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Do Language Models Use Their Depth Efficiently? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Do Neural Networks Need Gradient Descent to Generalize? A Theoretical Study |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Do different prompting methods yield a common task representation in language models? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Do-PFN: In-Context Learning for Causal Effect Estimation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| DoDo-Code: an Efficient Levenshtein Distance Embedding-based Code for 4-ary IDS Channel |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Doctor Approved: Generating Medically Accurate Skin Disease Images through AI-Expert Feedback |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Document Summarization with Conformal Importance Guarantees |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Does Object Binding Naturally Emerge in Large Pretrained Vision Transformers? |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Does Representation Guarantee Welfare? |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Does Stochastic Gradient really succeed for bandits? |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Domain Adaptive Hashing Retrieval via VLM Assisted Pseudo-Labeling and Dual Space Adaptation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Don't Just Chase βHighlighted Tokensβ in MLLMs: Revisiting Visual Holistic Context Retention |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Don't be lazy: CompleteP enables compute-efficient deep transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Donβt Forget the Enjoin: FocalLoRA for Instruction Hierarchical Alignment in Large Language Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Donβt Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Donβt Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Donβt Trade Off Safety: Diffusion Regularization for Constrained Offline RL |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Doodle to Detect: A Goofy but Powerful Approach to Skeleton-based Hand Gesture Recognition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DoseSurv: Predicting Personalized Survival Outcomes under Continuous-Valued Treatments |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the Role of Model Complexity |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Doubly Robust Alignment for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Dr. RAW: Towards General High-Level Vision from RAW with Efficient Task Conditioning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DreamLight: Towards Harmonious and Consistent Image Relighting |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| DreamPRM: Domain-reweighted Process Reward Model for Multimodal Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| DuSA: Fast and Accurate Dual-Stage Sparse Attention Mechanism Accelerating Both Training and Inference |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dual Alignment Framework for Few-shot Learning with Inter-Set and Intra-Set Shifts |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Dual Data Alignment Makes AI-Generated Image Detector Easier Generalizable |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Dual Prototype-Enhanced Contrastive Framework for Class-Imbalanced Graph Domain Adaptation |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Dual-Comb Ghost Imaging with Transformer-Based Reconstruction for Optical Fiber Endomicroscopy |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via $\textit{In-the-wild}$ Cascading Flow Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Dual-Path Temporal Decoder for End-to-End Multi-Object Tracking |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dual-Res Tandem Mamba-3D: Bilateral Breast Lesion Detection and Classification on Non-contrast Chest CT |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Dual-Space Semantic Synergy Distillation for Continual Learning of Unlabeled Streams |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DualCnst: Enhancing Zero-Shot Out-of-Distribution Detection via Text-Image Consistency in Vision-Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| DualEqui: A Dual-Space Hierarchical Equivariant Network for Large Biomolecules |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| DualFocus: Depth from Focus with Spatio-Focal Dual Variational Constraints |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| DualMPNN: Harnessing Structural Alignments for High-Recovery Inverse Protein Folding |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DuetGraph: Coarse-to-Fine Knowledge Graph Reasoning with Dual-Pathway Global-Local Fusion |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| DyFlow: Dynamic Workflow Framework for Agentic Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| DyMU: Dynamic Merging and Virtual Unmerging for Efficient Variable-Length VLMs |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| DyMoDreamer: World Modeling with Dynamic Modulation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dyn-O: Building Structured World Models with Object-Centric Representations |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| DynaAct: Large Language Model Reasoning with Dynamic Action Spaces |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| DynaGuide: Steering Diffusion Polices with Active Dynamic Guidance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DynaNav: Dynamic Feature and Layer Selection for Efficient Visual Navigation |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| DynaPhArM: Adaptive and Physics-Constrained Modeling for Target-Drug Complexes with Drug-Specific Adaptations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DynaPipe: Dynamic Layer Redistribution for Efficient Serving of LLMs with Pipeline Parallelism |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| DynaRend: Learning 3D Dynamics via Masked Future Rendering for Robotic Manipulation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dynamic Algorithm for Explainable $k$-medians Clustering under $\ell_p$ Norm |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Dynamic Bundling with Large Language Models for Zero-Shot Inference on Text-Attributed Graphs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Dynamic Configuration for Cutting Plane Separators via Reinforcement Learning on Incremental Graph |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Dynamic Diameter in High-Dimensions against Adaptive Adversary and Beyond |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Dynamic Diffusion SchrΓΆdinger Bridge in Astrophysical Observational Inversions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dynamic Focused Masking for Autoregressive Embodied Occupancy Prediction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Dynamic Gaussian Splatting from Defocused and Motion-blurred Monocular Videos |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Dynamic Masking and Auxiliary Hash Learning for Enhanced Cross-Modal Retrieval |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Dynamic Regret Reduces to Kernelized Static Regret |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Dynamic Semantic-Aware Correlation Modeling for UAV Tracking |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dynamic Shadow Unveils Invisible Semantics for Video Outpainting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Dynamic Siamese Expansion Framework for Improving Robustness in Online Continual Learning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Dynamic Test-Time Compute Scaling in Control Policy: Difficulty-Aware Stochastic Interpolant Policy |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Dynamic View Synthesis as an Inverse Problem |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Dynamic and Chemical Constraints to Enhance the Molecular Masked Graph Autoencoders |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Dynamical Low-Rank Compression of Neural Networks with Robustness under Adversarial Attacks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Dynamical Properties of Tokens in Self-Attention and Effects of Positional Encoding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dynamical modeling of nonlinear latent factors in multiscale neural activity with real-time inference |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Dynamics of Spontaneous Topic Changes in Next Token Prediction with Self-Attention |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Dynamics-Aligned Latent Imagination in Contextual World Models for Zero-Shot Generalization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| E2E-VGuard: Adversarial Prevention for Production LLM-based End-To-End Speech Synthesis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| E2Former: An Efficient and Equivariant Transformer with Linear-Scaling Tensor Products |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| EA3D: Online Open-World 3D Object Extraction from Streaming Videos |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| EAP-GP: Mitigating Saturation Effect in Gradient-based Automated Circuit Identification |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| EAReranker: Efficient Embedding Adequacy Assessment for Retrieval Augmented Generation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| ECO: Evolving Core Knowledge for Efficient Transfer |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| EDELINE: Enhancing Memory in Diffusion-based World Models via Linear-Time Sequence Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ELDET: Early-Learning Distillation with Noisy Labels for Object Detection |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ELECTRA: A Cartesian Network for 3D Charge Density Prediction with Floating Orbitals |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| ENMA: Tokenwise Autoregression for Continuous Neural PDE Operators |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| EPA: Boosting Event-based Video Frame Interpolation with Perceptually Aligned Learning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| ESCA: Contextualizing Embodied Agents via Scene-Graph Generation |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| ESCA: Enabling Seamless Codec Avatar Execution through Algorithm and Hardware Co-Optimization for Virtual Reality |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ESCORT: Efficient Stein-variational and Sliced Consistency-Optimized Temporal Belief Representation for POMDPs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| EUGens: Efficient, Unified and General Dense Layers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EVODiff: Entropy-aware Variance Optimized Diffusion Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Each Complexity Deserves a Pruning Policy |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| EchoShot: Multi-Shot Portrait Video Generation |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| EddyFormer: Accelerated Neural Simulations of Three-Dimensional Turbulence at Scale |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Edit Flows: Variable Length Discrete Flow Matching with Sequence-Level Edit Operations |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EditInfinity: Image Editing with Binary-Quantized Generative Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Effective Neural Approximations for Geometric Optimization Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Effects of Dropout on Performance in Long-range Graph Learning Tasks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Efficient $k$-Sparse BandβLimited Interpolation with Improved Approximation Ratio |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Efficient Adaptive Experimentation with Noncompliance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Adaptive Federated Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Algorithms for Robust and Partial Semi-Discrete Optimal Transport |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Efficient Allocation of Working Memory Resource for Utility Maximization in Humans and Recurrent Neural Networks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Efficient Bayesian Experiment Design with Equivariant Networks |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Efficient Data Selection at Scale via Influence Distillation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Efficient Fairness-Performance Pareto Front Computation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Efficient Federated Learning against Byzantine Attacks and Data Heterogeneity via Aggregating Normalized Gradients |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Efficient Kernelized Learning in Polyhedral Games beyond Full Information: From Colonel Blotto to Congestion Games |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Efficient Knowledge Transfer in Federated Recommendation for Joint Venture Ecosystem |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Efficient Large Language Model Inference with Neural Block Linearization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Last-Iterate Convergence in Solving Extensive-Form Games |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Efficient Low Rank Attention for Long-Context Inference in Large Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Multi-modal Large Language Models via Progressive Consistency Distillation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Efficient Multimodal Dataset Distillation via Generative Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Efficient PAC Learning for Realizable-Statistic Models via Convex Surrogates |
β |
β |
β |
β |
β |
β |
β |
0 |
| Efficient Parametric SVD of Koopman Operator for Stochastic Dynamical Systems |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Efficient Part-level 3D Object Generation via Dual Volume Packing |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Efficient Pre-Training of LLMs via Topology-Aware Communication Alignment on More Than 9600 GPUs |
β
|
β |
β |
β
|
β
|
β
|
β
|
5 |
| Efficient Preference-Based Reinforcement Learning: Randomized Exploration meets Experimental Design |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Quadratic Corrections for Frank-Wolfe Algorithms |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Efficient RAW Image Deblurring with Adaptive Frequency Modulation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Efficient Randomized Experiments Using Foundation Models |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Efficient Rectified Flow for Image Fusion |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Efficient Representativeness-Aware Coreset Selection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Safe Meta-Reinforcement Learning: Provable Near-Optimality and Anytime Safety |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Spectral Control of Partially Observed Linear Dynamical Systems |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Efficient Training of Minimal and Maximal Low-Rank Recurrent Neural Networks |
β
|
β
|
β |
β
|
β |
β |
β
|
4 |
| Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Utility-Preserving Machine Unlearning with Implicit Gradient Surgery |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient Verified Unlearning For Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Efficient and Generalizable Mixed-Precision Quantization via Topological Entropy |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Efficient and Near-Optimal Algorithm for Contextual Dueling Bandits with Offline Regression Oracles |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Efficient semantic uncertainty quantification in language models via diversity-steered sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Efficiently Escaping Saddle Points under Generalized Smoothness via Self-Bounding Regularity |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Efficiently Maintaining the Multilingual Capacity of MCLIP in Downstream Cross-Modal Retrieval Tasks |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Efficiently Scaling LLM Reasoning Programs with Certaindex |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Efficiently Verifiable Proofs of Data Attribution |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| EgoBridge: Domain Adaptation for Generalizable Imitation from Egocentric Human Data |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Elastic Robust Unlearning of Specific Knowledge in Large Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Elastic ViTs from Pretrained Models without Retraining |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ElasticMM: Efficient Multimodal LLMs Serving with Elastic Multimodal Parallelism |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Eliciting Reasoning in Language Models with Cognitive Tools |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| ElliCE: Efficient and Provably Robust Algorithmic Recourse via the Rashomon Sets |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Elucidated Rolling Diffusion Models for Probabilistic Forecasting of Complex Dynamics |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Eluder dimension: localise it! |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Embedding Principle of Homogeneous Neural Network for Classification Problem |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Embeddings as Probabilistic Equivalence in Logic Programs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Embodied Cognition Augmented End2End Autonomous Driving |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Embodied Crowd Counting |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Emergence and Evolution of Interpretable Concepts in Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Emergence and scaling laws in SGD learning of shallow neural networks |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Emergence of Linear Truth Encodings in Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Emergent Risk Awareness in Rational Agents under Resource Constraints |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Emergent Temporal Correspondences from Video Diffusion Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Empower Words: DualGround for Structured Phrase and Sentence-Level Temporal Grounding |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Empowering Decision Trees via Shape Function Branching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| EnCompass: Enhancing Agent Programming with Search Over Program Execution Paths |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers, and Gradient Clipping |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Encoder-Decoder Diffusion Language Models for Efficient Training and Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Encouraging metric-aware diversity in contrastive representation space |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| End-to-End Low-Light Enhancement for Object Detection with Learned Metadata from RAWs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| End-to-End Vision Tokenizer Tuning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Energy Landscape-Aware Vision Transformers: Layerwise Dynamics and Adaptive Task-Specific Training via Hopfield States |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Energy Loss Functions for Physical Systems |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Energy-based generator matching: A neural sampler for general state space |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enforcing Hard Linear Constraints in Deep Learning Models with Decision Rules |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Enforcing convex constraints in Graph Neural Networks |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Enhanced Cyclic Coordinate Descent Methods for Elastic Net Penalized Linear Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Enhanced Expert Merging for Mixture-of-Experts in Graph Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhanced Self-Distillation Framework for Efficient Spiking Neural Network Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing 3D Reconstruction for Dynamic Scenes |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Enhancing Bioactivity Prediction via Spatial Emptiness Representation of Protein-ligand Complex and Union of Multiple Pockets |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Enhancing CLIP Robustness via Cross-Modality Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Enhancing Consistency of Flow-Based Image Editing through Kalman Control |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Enhancing Contrastive Learning with Variable Similarity |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Enhancing Deep Batch Active Learning for Regression with Imperfect Data Guided Selection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Enhancing GUI Agent with Uncertainty-Aware Self-Trained Evaluator |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Graph Classification Robustness with Singular Pooling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Infrared Vision: Progressive Prompt Fusion Network and Benchmark |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Interpretability in Deep Reinforcement Learning through Semantic Clustering |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing LLM Planning for Robotics Manipulation through Hierarchical Procedural Knowledge Graphs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Enhancing LLM Watermark Resilience Against Both Scrubbing and Spoofing Attacks |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Enhancing Privacy in Multimodal Federated Learning with Information Theory |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Sample Selection Against Label Noise by Cutting Mislabeled Easy Examples |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Tactile-based Reinforcement Learning for Robotic Control |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Enhancing Temporal Understanding in Video-LLMs through Stacked Temporal Attention in Vision Encoders |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Enhancing Time Series Forecasting through Selective Representation Spaces: A Patch Perspective |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Enhancing Training Data Attribution with Representational Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Enhancing Visual Prompting through Expanded Transformation Space and Overfitting Mitigation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Enhancing Zero-Shot Black-Box Optimization via Pretrained Models with Efficient Population Modeling, Interaction, and Stable Gradient Approximation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Enhancing the Maximum Effective Window for Long-Term Time Series Forecasting |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Entropic Time Schedulers for Generative Diffusion Models |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Entropy Rectifying Guidance for Diffusion and Flow Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Entropy-Calibrated Label Distribution Learning |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Environment Inference for Learning Generalizable Dynamical System |
β
|
β
|
β
|
β
|
β
|
β |
β |
5 |
| EnzyControl: Adding Functional and Substrate-Specific Control for Enzyme Backbone Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Epistemic Uncertainty Estimation in Regression Ensemble Models with Pairwise Epistemic Estimators |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Epistemic Uncertainty for Generated Image Detection |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Equi-mRNA: Protein Translation Equivariant Encoding for mRNA Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Network |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Equivariance Everywhere All At Once: A Recipe for Graph Foundation Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Equivariance by Contrast: Identifiable Equivariant Embeddings from Unlabeled Finite Group Actions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Equivariant Eikonal Neural Networks: Grid-Free, Scalable Travel-Time Prediction on Homogeneous Spaces |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| EraseFlow: Learning Concept Erasure Policies via GFlowNet-Driven Alignment |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Erasing Conceptual Knowledge from Language Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Error Broadcast and Decorrelation as a Potential Artificial and Natural Learning Mechanism |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Error Feedback under $(L_0,L_1)$-Smoothness: Normalization and Momentum |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Error Forcing in Recurrent Neural Networks |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| ErrorTrace: A Black-Box Traceability Mechanism Based on Model Family Error Space |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Escaping Collapse: The Strength of Weak Data for Large Language Model Training |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Escaping saddle points without Lipschitz smoothness: the power of nonlinear preconditioning |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Establishing Linear Surrogate Regret Bounds for Convex Smooth Losses via Convolutional FenchelβYoung Losses |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Estimating Hitting Times Locally at Scale |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Estimating Interventional Distributions with Uncertain Causal Graphs through Meta-Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Estimating Model Performance Under Covariate Shift Without Labels |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Estimating cognitive biases with attention-aware inverse planning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Estimation of Stochastic Optimal Transport Maps |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Estimation of Treatment Effects in Extreme and Unobserved Data |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Eulerian Neural Network Informed by Chemical Transport for Air Quality Forecasting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Evaluating LLM-contaminated Crowdsourcing Data Without Ground Truth |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Evaluating LLMs in Open-Source Games |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Evaluating Robustness of Monocular Depth Estimation with Procedural Scene Perturbations |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Evaluating and Learning Optimal Dynamic Treatment Regimes under Truncation by Death |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Evaluating multiple models using labeled and unlabeled data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Evaluating the Inductive Abilities of Large Language Models: Why Chain-of-Thought Reasoning Sometimes Hurts More Than Helps |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Eve3D: Elevating Vision Models for Enhanced 3D Surface Reconstruction via Gaussian Splatting |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Event-Driven Dynamic Scene Depth Completion |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Event-Guided Consistent Video Enhancement with Modality-Adaptive Diffusion Pipeline |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Event-based HDR Structured Light |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EventMG: Efficient Multilevel Mamba-Graph Learning for Spatiotemporal Event Representation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| EverybodyDance: Bipartite GraphβBased Identity Correspondence for Multi-Character Animation |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| EvoBrain: Dynamic Multi-Channel EEG Graph Modeling for Time-Evolving Brain Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EvoLM: In Search of Lost Training Dynamics for Language Model Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Evolutionary Prediction Games |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Evolutionary Reasoning Does Not Arise in Standard Usage of Protein Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| EvolvedGRPO: Unlocking Reasoning in LVLMs via Progressive Instruction Evolution |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Evolving and Regularizing Meta-Environment Learner for Fine-Grained Few-Shot Class-Incremental Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ExGra-Med: Extended Context Graph Alignment for Medical Vision-Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Exact Expressive Power of Transformers with Padding |
β |
β |
β |
β |
β |
β |
β |
0 |
| Exact and Linear Convergence for Federated Learning under Arbitrary Client Participation is Attainable |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Execution Guided Line-by-Line Code Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Explainable Reinforcement Learning from Human Feedback to Improve Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Explainably Safe Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Explaining Similarity in Vision-Language Encoders with Weighted Banzhaf Interactions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Explaining and Mitigating Crosslingual Tokenizer Inequities |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Explaining the Law of Supply and Demand via Online Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Exploiting Dynamic Sparsity in Einsum |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Exploiting LLMs for Automatic Hypothesis Assessment via a Logit-Based Calibrated Prior |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Exploiting Task Relationships in Continual Learning via Transferability-Aware Task Embeddings |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Exploiting the Asymmetric Uncertainty Structure of Pre-trained VLMs on the Unit Hypersphere |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Exploration via Feature Perturbation in Contextual Bandits |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Explore In-Context Message Passing Operator for Graph Neural Networks in A Mean Field Game |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Exploring Diffusion Transformer Designs via Grafting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Exploring Landscapes for Better Minima along Valleys |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Exploring Neural Granger Causality with xLSTMs: Unveiling Temporal Dependencies in Complex Data |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Exploring Semantic-constrained Adversarial Example with Instruction Uncertainty Reduction |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Exploring Structural Degradation in Dense Representations for Self-supervised Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Exploring Tradeoffs through Mode Connectivity for Multi-Task Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Exploring and Exploiting Model Uncertainty in Bayesian Optimization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Exploring and Leveraging Class Vectors for Classifier Editing |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Exploring the Design Space of Diffusion Bridge Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Exploring the Limits of Vision-Language-Action Manipulation in Cross-task Generalization |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Exploring the Noise Robustness of Online Conformal Prediction |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Exploring the Translation Mechanism of Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Exploring the limits of strong membership inference attacks on large language models |
β |
β |
β
|
β
|
β |
β
|
β
|
4 |
| Exponential Convergence Guarantees for Iterative Markovian Fitting |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Exponential Dynamic Energy Network for High Capacity Sequence Memory |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Extracting task-relevant preserved dynamics from contrastive aligned neural recordings |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Extragradient Method for $(L_0, L_1)$-Lipschitz Root-finding Problems |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Extrapolation by Association: Length Generalization Transfer In Transformers |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| F-Adapter: Frequency-Adaptive Parameter-Efficient Fine-Tuning in Scientific Machine Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| FACE: A General Framework for Mapping Collaborative Filtering Embeddings into LLM Tokens |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FACE: Faithful Automatic Concept Extraction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FACT: Mitigating Inconsistent Hallucinations in LLMs via Fact-Driven Alternating Code-Text Training |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| FAN: Fourier Analysis Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FANS: A Flatness-Aware Network Structure for Generalization in Offline Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FAPEX: Fractional Amplitude-Phase Expressor for Robust Cross-Subject Seizure Prediction |
β |
β |
β
|
β
|
β |
β |
β |
2 |
| FAST: Foregroundβaware Diffusion with Accelerated Sampling Trajectory for Segmentationβoriented Anomaly Synthesis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FEAT: Free energy Estimators with Adaptive Transport |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| FEEDBACK FRICTION: LLMs Struggle to Fully Incorporate External Feedback |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FFN Fusion: Rethinking Sequential Computation in Large Language Models |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| FHGS: Feature-Homogenized Gaussian Splatting |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FIGRDock: Fast Interaction-Guided Regression for Flexible Docking |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FIPER: Factorized Features for Robust Image Super-Resolution and Compression |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FLAME: Fast Long-context Adaptive Memory for Event-based Vision |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| FLOWING: Implicit Neural Flows for Structure-Preserving Morphing |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| FLUX: Efficient Descriptor-Driven Clustered Federated Learning under Arbitrary Distribution Shifts |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| FNOPE: Simulation-based inference on function spaces with Fourier Neural Operators |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FORLA: Federated Object-Centric Representation Learning with Slot Attention |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FP4 All the Way: Fully Quantized Training of Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FP64 is All You Need: Rethinking Failure Modes in Physics-Informed Neural Networks |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| FRAM: Frobenius-Regularized Assignment Matching with Mixed-Precision Computing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FRBNet: Revisiting Low-Light Vision through Frequency-Domain Radial Basis Network |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FRN: Fractal-Based Recursive Spectral Reconstruction Network |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FSEO: Few-Shot Evolutionary Optimization via Meta-Learning for Expensive Multi-Objective Optimization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| FSI-Edit: Frequency and Stochasticity Injection for Flexible Diffusion-Based Image Editing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FSNet: Feasibility-Seeking Neural Network for Constrained Optimization with Guarantees |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| FaCT: Faithful Concept Traces for Explaining Neural Network Decisions |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Factor Decorrelation Enhanced Data Removal from Deep Predictive Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fading to Grow: Growing Preference Ratios via Preference Fading Discrete Diffusion for Recommendation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Failure Prediction at Runtime for Generative Robot Policies |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fair Continuous Resource Allocation with Equality of Impact |
β
|
β |
β |
β |
β
|
β
|
β
|
4 |
| Fair Cooperation in Mixed-Motive Games via Conflict-Aware Gradient Adjustment |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Fair Deepfake Detectors Can Generalize |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fair Matroid Selection |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Fair Minimum Labeling: Efficient Temporal Network Activations for Reachability and Equity |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Fair Representation Learning with Controllable High Confidence Guarantees via Adversarial Inference |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FairDD: Fair Dataset Distillation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FairDICE: Fairness-Driven Offline Multi-Objective Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FairImagen: Post-Processing for Bias Mitigation in Text-to-Image Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FairNet: Dynamic Fairness Correction without Performance Loss via Contrastive Conditional LoRA |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Fairness under Competition |
β |
β
|
β
|
β |
β |
β |
β |
2 |
| Fairness-Regularized Online Optimization with Switching Costs |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Fairness-aware Anomaly Detection via Fair Projection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Fairness-aware Bayes Optimal Functional Classification |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Fairshare Data Pricing via Data Valuation for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Faithful Dynamic Imitation Learning from Human Intervention with Dynamic Regret Minimization |
β
|
β
|
β
|
β
|
β
|
β |
β |
5 |
| Faithful Group Shapley Value |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fantastic Features and Where to Find Them: A Probing Method to combine Features from Multiple Foundation Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Fast Computation and Optimization for Opinion-Based Quantities of Friedkin-Johnsen Model |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Fast Data Attribution for Text-to-Image Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Fast Inference for Augmented Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fast Last-Iterate Convergence of SGD in the Smooth Interpolation Regime |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Fast Local Search Algorithms for Clustering with Adaptive Sampling and Bandit Strategies |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Fast MRI for All: Bridging Access Gaps by Training without Raw Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Fast Monte Carlo Tree Diffusion: 100Γ Speedup via Parallel and Sparse Planning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fast Non-Log-Concave Sampling under Nonconvex Equality and Inequality Constraints with Landing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Fast Projection-Free Approach (without Optimization Oracle) for Optimization over Compact Convex Set |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Fast Solvers for Discrete Diffusion Models: Theory and Applications of High-Order Algorithms |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fast Training of Large Kernel Models with Delayed Projections |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fast Zeroth-Order Convex Optimization with Quantum Gradient Methods |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fast attention mechanisms: a tale of parallelism |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Fast constrained sampling in pre-trained diffusion models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fast exact recovery of noisy matrix from few entries: the infinity norm approach |
β
|
β |
β
|
β |
β |
β
|
β
|
4 |
| Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fast-in-Slow: A Dual-System VLA Model Unifying Fast Manipulation within Slow Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FastJAM: a Fast Joint Alignment Model for Images |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FastVID: Dynamic Density Pruning for Fast Video Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Faster Algorithms for Structured John Ellipsoid Computation |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Faster Fixed-Point Methods for Multichain MDPs |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Faster Generic Identification in Tree-Shaped Structural Causal Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Faster Video Diffusion with Trainable Sparse Attention |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Feasibility-Aware Decision-Focused Learning for Predicting Parameters in the Constraints |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Feature Unlearning: Theoretical Foundations and Practical Applications with Shuffling |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Feature-Based Instance Neighbor Discovery: Advanced Stable Test-Time Adaptation in Dynamic World |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Feature-aware Modulation for Learning from Temporal Tabular Data |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FedEL: Federated Elastic Learning for Heterogeneous Devices |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| FedFACT: A Provable Framework for Controllable Group-Fairness Calibration in Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FedFree: Breaking Knowledge-sharing Barriers through Layer-wise Alignment in Heterogeneous Federated Learning |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| FedGPS: Statistical Rectification Against Data Heterogeneity in Federated Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| FedIGL: Federated Invariant Graph Learning for Non-IID Graphs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FedLPA: Local Prior Alignment for Heterogeneous Federated Generalized Category Discovery |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FedRACE: A Hierarchical and Statistical Framework for Robust Federated Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| FedRAM: Federated Reweighting and Aggregation for Multi-Task Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FedRW: Efficient Privacy-Preserving Data Reweighting for Enhancing Federated Learning of Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FedWMSAM: Fast and Flat Federated Learning via Weighted Momentum and Sharpness-Aware Minimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Federated Continual Learning via Orchestrating Multi-Scale Expertise |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Federated Dialogue-Semantic Diffusion for Emotion Recognition under Incomplete Modalities |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Federated Multi-armed Bandits with Efficient Bit-Level Communications |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Feedback Guidance of Diffusion Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Feedback-Aware MCTS for Goal-Oriented Information Seeking |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Filter Like You Test: Data-Driven Data Filtering for CLIP Pretraining |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Finding Low-Rank Matrix Weights in DNNs via Riemannian Optimization: RAdaGrad and RAdamW |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Finding separatrices of dynamical flows with Deep Koopman Eigenfunctions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fine-grained List-wise Alignment for Generative Medication Recommendation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Finite Sample Analyses for Continuous-time Linear Systems: System Identification and Online Control |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Finite-Time Analysis of Stochastic Nonconvex Nonsmooth Optimization on the Riemannian Manifolds |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Finite-Time Bounds for Average-Reward Fitted Q-Iteration |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| First Attentions Last: Better Exploiting First Attentions for Efficient Parallel Training |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| First SFT, Second RL, Third UPT: Continual Improving Multi-Modal LLM Reasoning via Unsupervised Post-Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fisher meets Feynman: score-based variational inference with a product of experts |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Fit the Distribution: Cross-Image/Prompt Adversarial Attacks on Multimodal Large Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Fix False Transparency by Noise Guided Splatting |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Fixed-Point RNNs: Interpolating from Diagonal to Dense |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Flash Invariant Point Attention |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FlashBias: Fast Computation of Attention with Bias |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FlashMD: long-stride, universal prediction of molecular dynamics |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FlashMo: Geometric Interpolants and Frequency-Aware Sparsity for Scalable Efficient Motion Generation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| FlashMoE: Fast Distributed MoE in a Single Kernel |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Flat Channels to Infinity in Neural Loss Landscapes |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Flatness is Necessary, Neural Collapse is Not: Rethinking Generalization via Grokking |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Flatten Graphs as Sequences: Transformers are Scalable Graph Generators |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Flattening Hierarchies with Policy Bootstrapping |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Flex-Judge: Text-Only Reasoning Unleashes Zero-Shot Multimodal Evaluators |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| FlexEvent: Towards Flexible Event-Frame Object Detection at Varying Operational Frequencies |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FlexOLMo: Open Language Models for Flexible Data Use |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FlexSelect: Flexible Token Selection for Efficient Long Video Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FlexWorld: Progressively Expanding 3D Scenes for Flexible-View Exploration |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Flexible Language Modeling in Continuous Space with Transformer-based Autoregressive Flows |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Flexible MOF Generation with Torsion-Aware Flow Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Flexible Realignment of Language Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Flexible inference for animal learning rules using neural networks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Flick: Empowering Federated Learning with Commonsense Knowledge |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Flow Equivariant Recurrent Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Flow Field Reconstruction with Sensor Placement Policy Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Flow Matching Neural Processes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Flow based approach for Dynamic Temporal Causal models with non-Gaussian or Heteroscedastic Noises |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Flow-Based Policy for Online Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Flow-GRPO: Training Flow Matching Models via Online RL |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FlowDAS: A Stochastic Interpolant-based Framework for Data Assimilation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FlowFeat: Pixel-Dense Embedding of Motion Profiles |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FlowMixer: A Depth-Agnostic Neural Architecture for Interpretable Spatiotemporal Forecasting |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FlowMoE: A Scalable Pipeline Scheduling Framework for Distributed Mixture-of-Experts Training |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FlowNet: Modeling Dynamic Spatio-Temporal Systems via Flow Propagation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FlowPrune: Accelerating Attention Flow Calculation by Pruning Flow Network |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| FlowRefiner: A Robust Traffic Classification Framework against Label Noise |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Flux4D: Flow-based Unsupervised 4D Reconstruction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FoGE: Fock Space inspired encoding for graph prompting |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Focus-Then-Reuse: Fast Adaptation in Visual Perturbation Environments |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Follow the Energy, Find the Path: Riemannian Metrics from Energy-Based Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Follow-the-Perturbed-Leader Nearly Achieves Best-of-Both-Worlds for the m-Set Semi-Bandit Problems |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| For Better or for Worse, Transformers Seek Patterns for Memorization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Force Prompting: Video Generation Models Can Learn And Generalize Physics-based Control Signals |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| ForceFM: Enhancing Protein-Ligand Predictions through Force-Guided Flow Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Forecasting in Offline Reinforcement Learning for Non-stationary Environments |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Foresight: Adaptive Layer Reuse for Accelerated and High-Quality Text-to-Video Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Formal Models of Active Learning from Contrastive Examples |
β |
β |
β |
β |
β |
β |
β |
0 |
| Fortifying Time Series: DTW-Certified Robust Anomaly Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Foundation Cures Personalization: Improving Personalized Modelsβ Prompt Consistency via Hidden Foundation Knowledge |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Foundations of Top-$k$ Decoding for Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Fourier Clouds: Fast Bias Correction for Imbalanced Semi-Supervised Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Fourier Token Merging: Understanding and Capitalizing Frequency Domain for Efficient Image Generation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| FraPPE: Fast and Efficient Preference-Based Pure Exploration |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FracFace: Breaking the Visual CluesβFractal-Based Privacy-Preserving Face Recognition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Fractional Diffusion Bridge Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Fractional Langevin Dynamics for Combinatorial Optimization via Polynomial-Time Escape |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Frame Context Packing and Drift Prevention in Next-Frame-Prediction Video Diffusion Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Frame In-N-Out: Unbounded Controllable Image-to-Video Generation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FrameShield: Adversarially Robust Video Anomaly Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Free-Lunch Color-Texture Disentanglement for Stylized Image Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FreeControl: Efficient, Training-Free Structural Control via One-Step Attention Extraction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| FreeInv: Free Lunch for Improving DDIM Inversion |
β |
β
|
β
|
β |
β |
β
|
β
|
4 |
| FreqExit: Enabling Early-Exit Inference for Visual Autoregressive Models via Frequency-Aware Guidance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Frequency-Aware Token Reduction for Efficient Vision Transformer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Average-Iterate to Last-Iterate Convergence in Games: A Reduction and Its Applications |
β
|
β |
β |
β |
β |
β |
β |
1 |
| From Black-box to Causal-box: Towards Building More Interpretable Models |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| From Bytes to Ideas: Language Modeling with Autoregressive U-Nets |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| From Contextual Combinatorial Semi-Bandits to Bandit List Classification: Improved Sample Complexity with Sparse Rewards |
β
|
β |
β |
β |
β |
β |
β |
1 |
| From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| From Dormant to Deleted: Tamper-Resistant Unlearning Through Weight-Space Regularization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Euler to AI: Unifying Formulas for Mathematical Constants |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| From Experts to a Generalist: Toward General Whole-Body Control for Humanoid Robots |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| From Faults to Features: Pretraining to Learn Robust Representations against Sensor Failures |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Human Attention to Diagnosis: Semantic Patch-Level Integration of Vision-Language Models in Medical Imaging |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| From Indicators to Insights: Diversity-Optimized for Medical Series-Text Decoding via LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Information to Generative Exponent: Learning Rate Induces Phase Transitions in SGD |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Kolmogorov to Cauchy: Shallow XNet Surpasses KANs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Likelihood to Fitness: Improving Variant Effect Prediction in Protein and Genome Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| From Linear to Nonlinear: Provable Weak-to-Strong Generalization through Feature Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Pixels to Views: Learning Angular-Aware and Physics-Consistent Representations for Light Field Microscopy |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Pose to Muscle: Multimodal Learning for Piano Hand Muscle Electromyography |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Pretraining to Pathology: How Noise Leads to Catastrophic Inheritance in Medical Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Programs to Poses: Factored Real-World Scene Generation via Learned Program Libraries |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| From Self-Check to Consensus: Bayesian Strategic Decoding in Large Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| From Sequence to Structure: Uncovering Substructure Reasoning in Transformers |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| From Shortcut to Induction Head: How Data Diversity Shapes Algorithm Selection in Transformers |
β
|
β |
β |
β
|
β |
β |
β
|
3 |
| From Softmax to Score: Transformers Can Effectively Implement In-Context Denoising Steps |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| From Synapses to Dynamics: Obtaining Function from Structure in a Connectome Constrained Model of the Head Direction Circuit |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| From stability of Langevin diffusion to convergence of proximal MCMC for non-log-concave sampling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| FrΓ©chet Geodesic Boosting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| FuXi-Ocean: A Global Ocean Forecasting System with Sub-Daily Resolution |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Fully Autonomous Neuromorphic Navigation and Dynamic Obstacle Avoidance |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Fully Dynamic Algorithms for Chamfer Distance |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Fully Spiking Neural Networks for Unified Frame-Event Object Tracking |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| FuncGenFoil: Airfoil Generation and Editing Model in Function Space |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Functional Complexity-adaptive Temporal Tensor Decomposition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Functional Matching of Logic Subgraphs: Beyond Structural Isomorphism |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Functional Scaling Laws in Kernel Regression: Loss Dynamics and Learning Rate Schedules |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Functional Virtual Adversarial Training for Semi-Supervised Time Series Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Functional data analysis for multivariate distributions through Wasserstein slicing |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Fundamental Limitations in Pointwise Defences of LLM Finetuning APIs |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Fuse2Match: Training-Free Fusion of Flow, Diffusion, and Contrastive Models for Zero-Shot Semantic Matching |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Future Link Prediction Without Memory or Aggregation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| G-Net: A Provably Easy Construction of High-Accuracy Random Binary Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| GAMMA: Gated Multi-hop Message Passing for Homophily-Agnostic Node Representation in GNNs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GD$^2$: Robust Graph Learning under Label Noise via Dual-View Prediction Discrepancy |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| GLID$^2$E: A Gradient-Free Lightweight Fine-tune Approach for Discrete Biological Sequence Design |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| GLNCD: Graph-Level Novel Category Discovery |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| GLSim: Detecting Object Hallucinations in LVLMs via Global-Local Similarity |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| GLVD: Guided Learned Vertex Descent |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| GMM-based VAE model with Normalising Flow for effective stochastic segmentation |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| GMV: A Unified and Efficient Graph Multi-View Learning Framework |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GOATex: Geometry & Occlusion-Aware Texturing |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection |
β
|
β |
β
|
β
|
β |
β
|
β
|
5 |
| GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GPO: Learning from Critical Steps to Improve LLM Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GRAPE: Optimize Data Mixture for Group Robust Multi-target Adaptive Pretraining |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GRAVER: Generative Graph Vocabularies for Robust Graph Foundation Models Fine-tuning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GRIFFIN: Effective Token Alignment for Faster Speculative Decoding |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| GRIP: A Graph-Based Reasoning Instruction Producer |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| GRIT: Teaching MLLMs to Think with Images |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| GSAlign: Geometric and Semantic Alignment Network for Aerial-Ground Person Re-Identification |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| GSPN-2: Efficient Parallel Sequence Modeling |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| GSRF: Complex-Valued 3D Gaussian Splatting for Efficient Radio-Frequency Data Synthesis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| GTR-Loc: Geospatial Text Regularization Assisted Outdoor LiDAR Localization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| GVPO: Group Variance Policy Optimization for Large Language Model Post-Training |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Gains: Fine-grained Federated Domain Adaptation in Open Set |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Gate to the Vessel: Residual Experts Restore What SAM Overlooks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Gated Integration of Low-Rank Adaptation for Continual Learning of Large Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Gatekeeper: Improving Model Cascades Through Confidence Tuning |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| GauDP: Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GauSAM: ContourβGuided 2D Gaussian Fields for MultiβScale Medical Image Segmentation with Segment Anything |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Gaussian Approximation and Concentration of Constant Learning-Rate Stochastic Gradient Descent |
β |
β
|
β |
β |
β
|
β
|
β
|
4 |
| Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Gaussian Process Upper Confidence Bound Achieves Nearly-Optimal Regret in Noise-Free Gaussian Process Bandits |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Gaussian Processes for Shuffled Regression |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Gaussian Regression-Driven Tensorized Incomplete Multi-View Clustering with Dual Manifold Regularization |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Gaussian-Augmented Physics Simulation and System Identification with Complex Colliders |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| GaussianFusion: Gaussian-Based Multi-Sensor Fusion for End-to-End Autonomous Driving |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Gaze-VLM: Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GeGS-PCR: Fast and Robust Color 3D Point Cloud Registration with Two-Stage Geometric-3DGS Fusion |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeRaF: Neural Geometry Reconstruction from Radio Frequency Signals |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| Gemstones: A Model Suite for Multi-Faceted Scaling Laws |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| GenColor: Generative and Expressive Color Enhancement with Pixel-Perfect Texture Preservation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| GenIR: Generative Visual Feedback for Mental Image Retrieval |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Gene Regulatory Network Inference in the Presence of Selection Bias and Latent Confounders |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| GeneFlow: Translation of Single-cell Gene Expression to Histopathological Images via Rectified Flow |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| General-Reasoner: Advancing LLM Reasoning Across All Domains |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Generalizable Domain Adaptation for Sim-and-Real Policy Co-Training |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Generalizable Hand-Object Modeling from Monocular RGB Images via 3D Gaussians |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Generalizable Insights for Graph Transformers in Theory and Practice |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Generalizable Reasoning through Compositional Energy Minimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generalizable, real-time neural decoding with hybrid state-space models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Generalization Bounds for Kolmogorov-Arnold Networks (KANs) and Enhanced KANs with Lower Lipschitz Complexity |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Generalization Bounds for Model-based Algorithm Configuration |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Generalization Bounds for Rank-sparse Neural Networks |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Generalization Guarantees for Learning Score-Based Branch-and-Cut Policies in Integer Programming |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Generalization vs Specialization under Concept Shift |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Generalized Category Discovery under Domain Shift: A Frequency Domain Perspective |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Generalized Contrastive Learning for Universal Multimodal Retrieval |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$-Smoothness |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generalized Linear Bandits: Almost Optimal Regret with One-Pass Update |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Generalized Linear Mode Connectivity for Transformers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generalized Top-k Mallows Model for Ranked Choices |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generalized and Invariant Single-Neuron In-Vivo Activity Representation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Generalizing Experience for Language Agents with Hierarchical MetaFlows |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Generalizing Single-Frame Supervision to Event-Level Understanding for Video Anomaly Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generalizing while preserving monotonicity in comparison-based preference learning models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Generating Computational Cognitive models using Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generating Creative Chess Puzzles |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generating Informative Samples for Risk-Averse Fine-Tuning of Downstream Tasks |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Generating Multi-Table Time Series EHR from Latent Space with Minimal Preprocessing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generating Physically Sound Designs from Text and a Set of Physical Constraints |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Generating and Checking DNN Verification Proofs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generation as Search Operator for Test-Time Scaling of Diffusion-based Combinatorial Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generative Caching for Structurally Similar Prompts and Responses |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Generative Data Augmentation via Diffusion Distillation, Adversarial Alignment, and Importance Reweighting |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Generative Distribution Embeddings: Lifting autoencoders to the space of distributions for multiscale representation learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Generative Graph Pattern Machine |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Generative Model Inversion Through the Lens of the Manifold Hypothesis |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Generative Modeling of Full-Atom Protein Conformations using Latent Diffusion on Graph Embeddings |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Generative Perception of Shape and Material from Differential Motion |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Generative Pre-trained Autoregressive Diffusion Transformer |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Generative RLHF-V: Learning Principles from Multi-modal Human Preference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Generative Trajectory Stitching through Diffusion Composition |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| Generative diffusion for perceptron problems: statistical physics analysis and efficient algorithms |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Generative property enhancer: implicit guided generation through conditional density estimation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Geo-Sign: Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeoCAD: Local Geometry-Controllable CAD Generation with Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeoClip: Geometry-Aware Clipping for Differentially Private SGD |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GeoComplete: Geometry-Aware Diffusion for Reference-Driven Image Completion |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| GeoDynamics: A Geometric StateβSpace Neural Network for Understanding Brain Dynamics on Riemannian Manifolds |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| GeoLink: Empowering Remote Sensing Foundation Model with OpenStreetMap Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeoRemover: Removing Objects and Their Causal Visual Artifacts |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GeoVideo: Introducing Geometric Regularization into Video Generation Model |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Geometric Algebra-Enhanced Bayesian Flow Network for RNA Inverse Design |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Geometric Algorithms for Neural Combinatorial Optimization with Constraints |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Geometric Imbalance in Semi-Supervised Node Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Geometric Logit Decoupling for Energy-Based Graph Out-of-distribution Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Geometric Mixture Models for Electrolyte Conductivity Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Geometry Aware Operator Transformer as an efficient and accurate neural surrogate for PDEs on arbitrary domains |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Geometry of Decision Making in Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Geometry-Aware Collaborative Multi-Solutions Optimizer for Model Fine-Tuning with Parameter Efficiency |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Geometry-Aware Edge Pooling for Graph Neural Networks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Glance2Gaze: Efficient Vision-Language Models from Glance Fusion to Gaze Compression |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Global Convergence for Average Reward Constrained MDPs with Primal-Dual Actor Critic Algorithm |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Global Minimizers of $\ell^p$-Regularized Objectives Yield the Sparsest ReLU Neural Networks |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Global Minimizers of Sigmoid Contrastive Loss |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Global Prompt Refinement with Non-Interfering Attention Masking for One-Shot Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Globally Optimal Policy Gradient Algorithms for Reinforcement Learning with PID Control Policies |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Glocal Information Bottleneck for Time Series Imputation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| GnnXemplar: Exemplars to Explanations - Natural Language Rules for Global GNN Interpretability |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Go With the Flow: Fast Diffusion for Gaussian Mixture Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| GoRA: Gradient-driven Adaptive Low Rank Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GoT: Unleashing Reasoning Capability of MLLM for Visual Generation and Editing |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| GoalLadder: Incremental Goal Discovery with Vision-Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GraSS: Scalable Data Attribution with Gradient Sparsification and Sparse Projection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GradMetaNet: An Equivariant Architecture for Learning on Gradients |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Gradient Multi-Normalization for Efficient LLM Training |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Gradient Variance Reveals Failure Modes in Flow-Based Generative Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Gradient-Guided Epsilon Constraint Method for Online Continual Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Gradient-Variation Online Adaptivity for Accelerated Optimization with HΓΆlder Smoothness |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Gradient-Weight Alignment as a Train-Time Proxy for Generalization in Classification Tasks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| Graph Alignment via Birkhoff Relaxation |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Graph Data Selection for Domain Adaptation: A Model-Free Approach |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Graph Diffusion that can Insert and Delete |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Graph Few-Shot Learning via Adaptive Spectrum Experts and Cross-Set Distribution Calibration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Graph Neural Network Based Action Ranking for Planning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Graph Persistence goes Spectral |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Graph Your Own Prompt |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Graph-Based Attention for Differentiable MaxSAT Solving |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Graph-Theoretic Insights into Bayesian Personalized Ranking for Recommendation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Graph-based Symbolic Regression with Invariance and Constraint Encoding |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| GraphChain: Large Language Models for Large-scale Graph Analysis via Tool Chaining |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| GraphKeeper: Graph Domain-Incremental Learning via Knowledge Disentanglement and Preservation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GraphTOP: Graph Topology-Oriented Prompting for Graph Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Graphs Help Graphs: Multi-Agent Graph Socialized Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| GraphβSmoothed Bayesian Black-Box Shift Estimator and Its Information Geometry |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Grasp2Grasp: Vision-Based Dexterous Grasp Translation via SchrΓΆdinger Bridges |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Greed is Good: A Unifying Perspective on Guided Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Greedy Algorithms for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure |
β |
β |
β |
β |
β |
β |
β |
0 |
| Greedy Sampling Is Provably Efficient For RLHF |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Grids Often Outperform Implicit Neural Representation at Compressing Dense Signals |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Ground-Compose-Reinforce: Grounding Language in Agentic Behaviours using Limited Data |
β
|
β
|
β |
β
|
β |
β |
β
|
4 |
| Grounded Reinforcement Learning for Visual Reasoning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Group-Level Data Selection for Efficient Pretraining |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Group-in-Group Policy Optimization for LLM Agent Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Guarantees for Alternating Least Squares in Overparameterized Tensor Decompositions |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Guided Diffusion Sampling on Function Spaces with Applications to PDEs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Guiding Cross-Modal Representations with MLLM Priors via Preference Alignment |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Guiding LLM Decision-Making with Fairness Reward Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| H-SPLID: HSIC-based Saliency Preserving Latent Information Decomposition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| H3D-DGS: Exploring Heterogeneous 3D Motion Representation for Deformable 3D Gaussian Splatting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| HBLLM: Wavelet-Enhanced High-Fidelity 1-Bit Quantization for LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HCRMP: An LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| HEIR: Learning Graph-Based Motion Hierarchies |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| HIDISC: A Hyperbolic Framework for Domain Generalization with Generalized Category Discovery |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HMARL-CBF β Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| HMVLM:Human Motion-Vision-Language Model via MoE LoRA |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HOComp: Interaction-Aware Human-Object Composition |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HPSERec: A Hierarchical Partitioning and Stepwise Enhancement Framework for Long-tailed Sequential Recommendation |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| HQA-VLAttack: Towards High Quality Adversarial Attack on Vision-Language Pre-Trained Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| HYPERION: Fine-Grained Hypersphere Alignment for Robust Federated Graph Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Hadamard Test is Sufficient for Efficient Quantum Gradient Estimation with Lie Algebraic Symmetries |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Hadamax Encoding: Elevating Performance in Model-Free Atari |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HairFree: Compositional 2D Head Prior for Text-Driven 360Β° Bald Texture Synthesis |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hamiltonian Descent Algorithms for Optimization: Accelerated Rates via Randomized Integration Time |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Hamiltonian Neural PDE Solvers through Functional Approximation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Handling Missing Responses under Cluster Dependence with Applications to Language Model Evaluation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Hankel Singular Value Regularization for Highly Compressible State Space Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Hardware-aligned Hierarchical Sparse Attention for Efficient Long-term Memory Access |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Harmony in Divergence: Towards Fast, Accurate, and Memory-efficient Zeroth-order LLM Fine-tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Harnessing Feature Resonance under Arbitrary Target Alignment for Out-of-Distribution Node Detection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Harnessing the Computation Redundancy in ViTs to Boost Adversarial Transferability |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Harnessing the Universal Geometry of Embeddings |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hawaii: Hierarchical Visual Knowledge Transfer for Efficient Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Head Pursuit: Probing Attention Specialization in Multimodal Transformers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Heavy-Ball Momentum Method in Continuous Time and Discretization Error Analysis |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| HeavyWater and SimplexWater: Distortion-free LLM Watermarks for Low-Entropy Distributions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| HeroFilter: Adaptive Spectral Graph Filter for Varying Heterophilic Relations |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Hessian-guided Perturbed Wasserstein Gradient Flows for Escaping Saddle Points |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| HetSyn: Versatile Timescale Integration in Spiking Neural Networks via Heterogeneous Synapses |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Heterogeneous Adversarial Play in Interactive Environments |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Heterogeneous Graph Transformers for Simultaneous Mobile Multi-Robot Task Allocation and Scheduling under Temporal Constraints |
β
|
β
|
β |
β
|
β
|
β
|
β
|
6 |
| Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| HiFC: High-efficiency Flash-based KV Cache Swapping for Scaling LLM Inference |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| HiMoLE: Towards OOD-Robust LoRA via Hierarchical Mixture of Experts |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| HiPoNet: A Multi-View Simplicial Complex Network for High Dimensional Point-Cloud and Single-Cell data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Hierachical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Hierarchical Demonstration Order Optimization for Many-shot In-Context Learning |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hierarchical Implicit Neural Emulators |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Hierarchical Information Aggregation for Incomplete Multimodal Alzheimer's Disease Diagnosis |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| Hierarchical Koopman Diffusion: Fast Generation with Interpretable Diffusion Trajectory |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Hierarchical Optimization via LLM-Guided Objective Evolution for Mobility-on-Demand Systems |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Hierarchical Retrieval: The Geometry and a Pretrain-Finetune Recipe |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Hierarchical Self-Attention: Generalizing Neural Attention Mechanics to Multi-Scale Problems |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Hierarchical Semantic-Augmented Navigation: Optimal Transport and Graph-Driven Reasoning for Vision-Language Navigation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Hierarchical Shortest-Path Graph Kernel Network |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| High Dynamic Range Imaging with Time-Encoding Spike Camera |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| High Resolution UDF Meshing via Iterative Networks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| High-Dimensional Calibration from Swap Regret |
β
|
β |
β |
β |
β |
β |
β |
1 |
| High-Order Flow Matching: Unified Framework and Sharp Statistical Rates |
β |
β |
β
|
β |
β |
β |
β |
1 |
| High-Performance Arithmetic Circuit Optimization via Differentiable Architecture Search |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| High-dimensional neuronal activity from low-dimensional latent dynamics: a solvable model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| High-order Equivariant Flow Matching for Density Functional Theory Hamiltonian Prediction |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| High-order Interactions Modeling for Interpretable Multi-Agent Q-Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Higher-Order Learning with Graph Neural Networks via Hypergraph Encodings |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hippocampal-like Sequential Editing for Continual Knowledge Updates in Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| HoT-VI: Reparameterizable Variational Inference for Capturing Instance-Level High-Order Correlations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Hogwild! Inference: Parallel LLM Generation via Concurrent Attention |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| HoliTom: Holistic Token Merging for Fast Video Large Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Holistic Large-Scale Scene Reconstruction via Mixed Gaussian Splatting |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Holistic Order Prediction in Natural Scenes |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HollowFlow: Efficient Sample Likelihood Evaluation using Hollow Message Passing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HoloScene: SimulationβReady Interactive 3D Worlds from a Single Video |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Homogeneous Algorithms Can Reduce Competition in Personalized Pricing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Homogeneous Keys, Heterogeneous Values: Exploiting Local KV Cache Asymmetry for Long-Context LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Horizon Reduction Makes RL Scalable |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| How Benchmark Prediction from Fewer Data Misses the Mark |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| How Classifier Features Transfer to Downstream: An Asymptotic Analysis in a Two-Layer Model |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| How Data Mixing Shapes In-Context Learning: Asymptotic Equivalence for Transformers with MLPs |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| How Different from the Past? Spatio-Temporal Time Series Forecasting with Self-Supervised Deviation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| How Does Label Noise Gradient Descent Improve Generalization in the Low SNR Regime? |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| How Does Sequence Modeling Architecture Influence Base Capabilities of Pre-trained Language Models? Exploring Key Architecture Design Principles to Avoid Base Capabilities Degradation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| How Does Topology Bias Distort Message Passing in Graph Recommender? A Dirichlet Energy Perspective |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| How Far Are We from Optimal Reasoning Efficiency? |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| How Many Domains Suffice for Domain Generalization? A Tight Characterization via the Domain Shattering Dimension |
β |
β |
β |
β |
β |
β |
β |
0 |
| How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need? |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| How Memory in Optimization Algorithms Implicitly Modifies the Loss |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| How Particle System Theory Enhances Hypergraph Message Passing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| How Patterns Dictate Learnability in Sequential Data |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| How Well Can Differential Privacy Be Audited in One Run? |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| How do Transformers Learn Implicit Reasoning? |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| How many measurements are enough? Bayesian recovery in inverse problems with general distributions |
β |
β |
β |
β |
β |
β |
β |
0 |
| How to Auto-optimize Prompts for Domain Tasks? Adaptive Prompting and Reasoning through Evolutionary Domain Knowledge Adaptation |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| How to Learn a Star: Binary Classification with Starshaped Polyhedral Sets |
β
|
β |
β |
β |
β
|
β
|
β
|
4 |
| How to Train Your LLM Web Agent: A Statistical Diagnosis |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| How to build a consistency model: Learning flow maps via self-distillation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| HubGT: Fast Graph Transformer with Decoupled Hierarchy Labeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Human Texts Are Outliers: Detecting LLM-generated Texts via Out-of-distribution Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Human-assisted Robotic Policy Refinement via Action Preference Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| HyGen: Efficient LLM Serving via Elastic Online-Offline Request Co-location |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| HyPINO: Multi-Physics Neural Operators via HyperPINNs and the Method of Manufactured Solutions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HyPlaneHead: Rethinking Tri-plane-like Representations in Full-Head Image Synthesis |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Hybrid Autoencoders for Tabular Data: Leveraging Model-Based Augmentation in Low-Label Settings |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Hybrid Boundary Physics-Informed Neural Networks for Solving Navier-Stokes Equations with Complex Boundary |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Hybrid Latent Reasoning via Reinforcement Learning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Hybrid Latent Representations for PDE Emulation |
β |
β
|
β |
β
|
β
|
β
|
β
|
5 |
| Hybrid Re-matching for Continual Learning with Parameter-Efficient Tuning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Hybrid-Balance GFlowNet for Solving Vehicle Routing Problems |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hybrid-Collaborative Augmentation and Contrastive Sample Adaptive-Differential Awareness for Robust Attributed Graph Clustering |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| HypRL: Reinforcement Learning of Control Policies for Hyperproperties |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Hyper-Modality Enhancement for Multimodal Sentiment Analysis with Missing Modalities |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| HyperGraphRAG: Retrieval-Augmented Generation via Hypergraph-Structured Knowledge Representation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| HyperMARL: Adaptive Hypernetworks for Multi-Agent RL |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| HyperMixup: Hypergraph-Augmented with Higher-order Information Mixup |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hyperbolic Dataset Distillation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hyperbolic Fine-Tuning for Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Hypergraph-Enhanced Contrastive Learning for Multi-View Clustering with Hyper-Laplacian Regularization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Hyperparameter Transfer Enables Consistent Gains of Matrix-Preconditioned Optimizers Across Scales |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| HypoBootstrap: A Bootstrapping Framework for Inductive Reasoning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| IA-GGAD: Zero-shot Generalist Graph Anomaly Detection via Invariant and Affinity Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| IBGS: Image-Based Gaussian Splatting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ICLScan: Detecting Backdoors in Black-Box Large Language Models via Targeted In-context Illumination |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| IDOL: Meeting Diverse Distribution Shifts with Prior Physics for Tropical Cyclone Multi-Task Estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| IF-Guide: Influence Function-Guided Detoxification of LLMs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| IMPACT: Irregular Multi-Patch Adversarial Composition Based on TwoβPhase Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| INC: An Indirect Neural Corrector for Auto-Regressive Hybrid PDE Solvers |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| IOSTOM: Offline Imitation Learning from Observations via State Transition Occupancy Matching |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| IPAD: Inverse Prompt for AI Detection - A Robust and Interpretable LLM-Generated Text Detector |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| IPFormer: Visual 3D Panoptic Scene Completion with Context-Adaptive Instance Proposals |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| IPSI: Enhancing Structural Inference with Automatically Learned Structural Priors |
β |
β
|
β
|
β
|
β
|
β |
β |
4 |
| Identifiability of Deep Polynomial Neural Networks |
β |
β |
β |
β |
β |
β |
β |
0 |
| Identifying Macro Causal Effects in C-DMGs over DMGs |
β |
β |
β |
β |
β |
β |
β |
0 |
| Identifying interactions across brain areas while accounting for individual-neuron dynamics with a Transformer-based variational autoencoder |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Identifying multi-compartment Hodgkin-Huxley models with high-density extracellular voltage recordings |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Image Editing As Programs with Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Image Stitching in Adverse Condition: A Bidirectional-Consistency Learning Framework and Benchmark |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Image Super-Resolution with Guarantees via Conformalized Generative Models |
β
|
β
|
β
|
β
|
β
|
β |
β |
5 |
| Image Token Matters: Mitigating Hallucination in Discrete Tokenizer-based Large Vision-Language Models via Latent Editing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Image as a World: Generating Interactive World from Single Image via Panoramic Video Generation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| ImageNet-trained CNNs are not biased towards texture: Revisiting feature reliance through controlled suppression |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Imagine Beyond ! Distributionally Robust Autoencoding for State Space Coverage in Online Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Imagine360: Immersive 360 Video Generation from Perspective Anchor |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Imagined Autocurricula |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Imbalances in Neurosymbolic Learning: Characterization and Mitigating Strategies |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Imitation Beyond Expectation Using Pluralistic Stochastic Dominance |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Imitation Learning with Temporal Logic Constraints |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Impact of Dataset Properties on Membership Inference Vulnerability of Deep Transfer Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Impact of Layer Norm on Memorization and Generalization in Transformers |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Impartial Selection with Predictions |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Implicit Bias of Spectral Descent and Muon on Multiclass Separable Data |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Implicit Modeling for Transferability Estimation of Vision Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Implicit-ARAP: Efficient Handle-Guided Neural Field Deformation via Local Patch Meshing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Improve Temporal Reasoning in Multimodal Large Language Models via Video Contrastive Decoding |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Improved Algorithms for Fair Matroid Submodular Maximization |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Improved Algorithms for Overlapping and Robust Clustering of Edge-Colored Hypergraphs: An LP-Based Combinatorial Approach |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Improved Approximation Algorithms for Chromatic and Pseudometric-Weighted Correlation Clustering |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Improved Balanced Classification with Theoretically Grounded Loss Functions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Improved Bounds for Swap Multicalibration and Swap Omniprediction |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Improved Confidence Regions and Optimal Algorithms for Online and Offline Linear MNL Bandits |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Improved Regret Bounds for Gaussian Process Upper Confidence Bound in Bayesian Optimization |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Improved Representation Steering for Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Improved Robust Estimation for ErdΕs-RΓ©nyi Graphs: The Sparse Regime and Optimal Breakdown Point |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Improved Scaling Laws in Linear Regression via Data Reuse |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Improved Training Technique for Shortcut Models |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Improving Bilinear RNN with Closed-loop Control |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Improving Decision Trees through the Lens of Parameterized Local Search |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Linear Extrapolation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Improving Energy Natural Gradient Descent through Woodbury, Momentum, and Randomization |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| Improving Evolutionary Multi-View Classification via Eliminating Individual Fitness Bias |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Improving Generalization of Neural Combinatorial Optimization for Vehicle Routing Problems via Test-Time Projection Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Improving LLM General Preference Alignment via Optimistic Online Mirror Descent |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Improving Model-Based Reinforcement Learning by Converging to Flatter Minima |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Improving Monte Carlo Tree Search for Symbolic Regression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Improving Perturbation-based Explanations by Understanding the Role of Uncertainty Calibration |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Improving Progressive Generation with Decomposable Flow Matching |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Improving Regret Approximation for Unsupervised Dynamic Environment Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Improving Target Sound Extraction via Disentangled Codec Representations with Privileged Knowledge Distillation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Improving Task-Specific Multimodal Sentiment Analysis with General MLLMs via Prompting |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Improving Time Series Forecasting via Instance-aware Post-hoc Revision |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Improving Video Generation with Human Feedback |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Improving planning and MBRL with temporally-extended actions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Improving the Generation and Evaluation of Synthetic Data for Downstream Medical Causal Inference |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Improving the Straight-Through Estimator with Zeroth-Order Information |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| In Search of Adamβs Secret Sauce |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| In Silico Mapping of Visual Categorical Selectivity Across the Whole Brain |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| In-Context Compositional Learning vis Sparse Coding Transformer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| In-Context Fully Decentralized Cooperative Multi-Agent Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| In-Context Learning Strategies Emerge Rationally |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| In-Context Learning of Stochastic Differential Equations with Foundation Inference Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| In-context Learning of Linear Dynamical Systems with Transformers: Approximation Bounds and Depth-separation |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Incentive-Aware Dynamic Resource Allocation under Long-Term Cost Constraints |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Incentivizing LLMs to Self-Verify Their Answers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Incentivizing Time-Aware Fairness in Data Sharing |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Incentivizing Truthful Language Models via Peer Elicitation Games |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Incomplete Multi-view Clustering via Hierarchical Semantic Alignment and Cooperative Completion |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Incomplete Multi-view Deep Clustering with Data Imputation and Alignment |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Increasing the Utility of Synthetic Images through Chamfer Guidance |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Incremental Sequence Classification with Temporal Consistency |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Individual Fairness In Strategic Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Individual Regret in Cooperative Stochastic Multi-Armed Bandits |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Individually Fair Diversity Maximization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Inductive Domain Transfer In Misspecified Simulation-Based Inference |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| IneqSearch: Hybrid Reasoning for Olympiad Inequality Proofs |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Inexact Column Generation for Bayesian Network Structure Learning via Difference-of-Submodular Optimization |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| InfMasking: Unleashing Synergistic Information by Contrastive Multimodal Interactions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Inference of Whole Brain Electrophysiological Networks Through Multimodal Integration of Simultaneous Scalp and Intracranial EEG |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Inference with correlated priors using sisters cells |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Inference-Time Hyper-Scaling with KV Cache Compression |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| Inference-Time Personalized Alignment with a Few User Preference Queries |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Inference-Time Reward Hacking in Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Inference-time Alignment in Continuous Space |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Inferring stochastic dynamics with growth from cross-sectional data |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Infinite Neural Operators: Gaussian processes on functions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Infinite-Width Limit of a Single Attention Layer: Analysis via Tensor Programs |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| InfinityStar: Uniο¬ed Spacetime AutoRegressive Modeling for Visual Generation |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Influence Functions for Edge Edits in Non-Convex Graph Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Influence Guided Context Selection for Effective Retrieval-Augmented Generation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Information Retrieval Induced Safety Degradation in AI Agents |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Information Theoretic Learning for Diffusion Models with Warm Start |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Information-Computation Tradeoffs for Noiseless Linear Regression with Oblivious Contamination |
β |
β |
β |
β |
β |
β |
β |
0 |
| Information-Driven Design of Imaging Systems |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Information-Theoretic Discrete Diffusion |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Information-Theoretic Reward Decomposition for Generalizable RLHF |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Information-theoretic Generalization Analysis for VQ-VAEs: A Role of Latent Variables |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Informed Correctors for Discrete Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Informed Initialization for Bayesian Optimization and Active Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Infrequent Exploration in Linear Bandits |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Injecting Frame-Event Complementary Fusion into Diffusion for Optical Flow in Challenging Scenes |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Inpainting the Neural Picture: Inferring Unrecorded Brain Area Dynamics from Multi-Animal Datasets |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Instance-Dependent Regret Bounds for Nonstochastic Linear Partial Monitoring |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Instance-Level Composed Image Retrieval |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Instance-Optimality for Private KL Distribution Estimation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Instant4D: 4D Gaussian Splatting in Minutes |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| InstructFlow: Adaptive Symbolic Constraint-Guided Code Generation for Long-Horizon Planning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| InstructHOI: Context-Aware Instruction for Multi-Modal Reasoning in Human-Object Interaction Detection |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| InstructRestore: Region-Customized Image Restoration with Human Instructions |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| InstructSAM: A Training-free Framework for Instruction-Oriented Remote Sensing Object Recognition |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Integral Imprecise Probability Metrics |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Integrating Drug Substructures and Longitudinal Electronic Health Records for Personalized Drug Recommendation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Integration Matters for Learning PDEs with Backward SDEs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Interaction-Centric Knowledge Infusion and Transfer for Open Vocabulary Scene Graph Generation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Interactive Anomaly Detection for Articulated Objects via Motion Anticipation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Interactive Cross-modal Learning for Text-3D Scene Retrieval |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Intermediate Domain Alignment and Morphology Analogy for Patent-Product Image Retrieval |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Interpretable Next-token Prediction via the Generalized Induction Head |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Interpretable and Parameter Efficient Graph Neural Additive Models with Random Fourier Features |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Interpreting Arithmetic Reasoning in Large Language Models using Game-Theoretic Interactions |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Interpreting Emergent Features in Deep Learning-based Side-channel Analysis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Interpreting vision transformers via residual replacement model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| IntrinsiX: High-Quality PBR Generation using Image Priors |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Intrinsic Benefits of Categorical Distributional Loss: Uncertainty-aware Regularized Exploration in Reinforcement Learning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Intrinsic Goals for Autonomous Agents: Model-Based Exploration in Virtual Zebrafish Predicts Ethological Behavior and Whole-Brain Dynamics |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| InvFusion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Inverse Optimization Latent Variable Models for Learning Costs Applied to Route Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Inverse Q-Learning Done Right: Offline Imitation Learning in $Q^\pi$-Realizable MDPs |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Investigating Hallucinations of Time Series Foundation Models through Signal Subspace Analysis |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Investigating and Mitigating Catastrophic Forgetting in Medical Knowledge Injection through Internal Knowledge Augmentation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Irrational Complex Rotations Empower Low-bit Optimizers |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Is Grokking a Computational Glass Relaxation? |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Is Limited Participant Diversity Impeding EEG-based Machine Learning? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Is Noise Conditioning Necessary? A Unified Theory of Unconditional Graph Diffusion Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Is Your Diffusion Model Actually Denoising? |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Is the acquisition worth the cost? Surrogate losses for Consistent Two-stage Classifiers |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Isotropic Noise in Stochastic and Quantum Convex Optimization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Iterative Foundation Model Fine-Tuning on Multiple Rewards |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Itβs Hard to Be Normal: The Impact of Noise on Structure-agnostic Estimation |
β
|
β
|
β |
β
|
β |
β |
β
|
4 |
| JADE: Joint Alignment and Deep Embedding for Multi-Slice Spatial Transcriptomics |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| JAFAR: Jack up Any Feature at Any Resolution |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| JAMUN: Bridging Smoothed Molecular Dynamics and Score-Based Learning for Conformational Ensemble Generation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Jacobian-Based Interpretation of Nonlinear Neural Encoding Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| JailBound: Jailbreaking Internal Safety Boundaries of Vision-Language Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Janus-Pro-R1: Advancing Collaborative Visual Comprehension and Generation via Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Johnson-Lindenstrauss Lemma Beyond Euclidean Geometry |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Joint Design of Protein Surface and Backbone Using a Diffusion Bridge Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Joint Hierarchical Representation Learning of Samples and Features via Informed Tree-Wasserstein Distance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Joint Modeling of fMRI and EEG Imaging Using Ordinary Differential Equation-Based Hypergraph Neural Networks |
β |
β |
β
|
β
|
β |
β
|
β
|
4 |
| Joint Relational Database Generation via Graph-Conditional Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Joint Velocity-Growth Flow Matching for Single-Cell Dynamics Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| JointβEmbedding vs Reconstruction: Provable Benefits of Latent Space Prediction for SelfβSupervised Learning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Jury-and-Judge Chain-of-Thought for Uncovering Toxic Data in 3D Visual Grounding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Just One Layer Norm Guarantees Stable Extrapolation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| K-DeCore: Facilitating Knowledge Transfer in Continual Structured Knowledge Reasoning via Knowledge Decoupling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KAIROS: Scalable Model-Agnostic Data Valuation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment |
β
|
β
|
β
|
β |
β |
β
|
β
|
5 |
| KGGen: Extracting Knowledge Graphs from Plain Text with Language Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| KINDLE: Knowledge-Guided Distillation for Prior-Free Gene Regulatory Network Inference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| KL Penalty Control via Perturbation for Direct Preference Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KLASS: KL-Guided Fast Inference in Masked Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KOALA++: Efficient Kalman-Based Optimization with Gradient-Covariance Products |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| KSP: Kolmogorov-Smirnov metric-based Post-Hoc Calibration for Survival Analysis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KScope: A Framework for Characterizing the Knowledge Status of Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| KVLink: Accelerating Large Language Models via Efficient KV Cache Reuse |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| KaRF: Weakly-Supervised Kolmogorov-Arnold Networks-based Radiance Fields for Local Color Editing |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| KeeA*: Epistemic Exploratory A* Search via Knowledge Calibration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Keep It on a Leash: Controllable Pseudo-label Generation Towards Realistic Long-Tailed Semi-Supervised Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Keeping an Eye on LLM Unlearning: The Hidden Risk and Remedy |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Kernel Density Steering: Inference-Time Scaling via Mode Seeking for Image Restoration |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Kernel Learning with Adversarial Features: Numerical Efficiency and Adaptive Regularization |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Kernel Regression in Structured Non-IID Settings: Theory and Implications for Denoising Score Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Kernel conditional tests from learning-theoretic bounds |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Kernel von Mises Formula of the Influence Function |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Kernel-based Equalized Odds: A Quantification of Accuracy-Fairness Trade-off in Fair Representation Learning |
β |
β |
β |
β |
β |
β |
β |
0 |
| KeyDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Kinaema: a recurrent sequence model for memory and pose in motion |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Kinetics: Rethinking Test-Time Scaling Law |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Knee-Deep in C-RASP: A Transformer Depth Hierarchy |
β |
β
|
β |
β
|
β |
β |
β
|
3 |
| Know Thyself by Knowing Others: Learning Neuron Identity from Population Context |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Know What You Don't Know: Uncertainty Calibration of Process Reward Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Knowing When to Stop: Efficient Context Processing via Latent Sufficiency Signals |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Knowledge Distillation Detection for Open-weights Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Knowledge Distillation of Uncertainty using Deep Latent Factor Model |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Knowledge Graph Enhanced Generative Multi-modal Models for Class-Incremental Learning |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Knowledge Starts with Practice: Knowledge-Aware Exercise Generative Recommendation with Adaptive Multi-Agent Cooperation |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Kuramoto Orientation Diffusion Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| L$^2$M: Mutual Information Scaling Law for Long-Context Language Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| L2DGCN: Learnable Enhancement and Label Selection Dynamic Graph Convolutional Networks for Mitigating Degree Bias |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| LABridge: TextβImage Latent Alignment Framework via Mean-Conditioned OU Process |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| LARGO: Latent Adversarial Reflection through Gradient Optimization for Jailbreaking LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LBMKGC: Large Model-Driven Balanced Multimodal Knowledge Graph Completion |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| LD-RoViS: Training-free Robust Video Steganography for Deterministic Latent Diffusion Model |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| LILO: Learning to Reason at the Frontier of Learnability |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| LLM Interpretability with Identifiable Temporal-Instantaneous Representation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| LLM Layers Immediately Correct Each Other |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| LLM Meeting Decision Trees on Tabular Data |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| LLM Query Scheduling with Prefix Reuse and Latency Constraints |
β
|
β |
β |
β
|
β
|
β
|
β
|
5 |
| LLM Safety Alignment is Divergence Estimation in Disguise |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| LLM Strategic Reasoning: Agentic Study through Behavioral Game Theory |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| LLM Unlearning via Neural Activation Redirection |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| LLM at Network Edge: A Layer-wise Efficient Federated Fine-tuning Approach |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| LLM-DAMVC: A Large Language Model Assisted Dynamic Agent for Multi-View Clustering |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| LLMs Encode Harmfulness and Refusal Separately |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LMFusion: Adapting Pretrained Language Models for Multimodal Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| LOMIA: Label-Only Membership Inference Attacks against Pre-trained Large Vision-Language Models |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| LOPT: Learning Optimal Pigovian Tax in Sequential Social Dilemmas |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LT-Soups: Bridging Head and Tail Classes via Subsampled Model Soups |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| LVLM-Driven Attribute-Aware Modeling for Visible-Infrared Person Re-Identification |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| LaViDa: A Large Diffusion Language Model for Multimodal Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LaX: Boosting Low-Rank Training of Foundation Models via Latent Crossing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LabelAny3D: Label Any Object 3D in the Wild |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LangHOPS: Language Grounded Hierarchical Open-Vocabulary Part Segmentation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Language Modeling by Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Language Models Can Predict Their Own Behavior |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Language Models can Self-Improve at State-Value Estimation for Better Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Language Ranker: A Lightweight Ranking framework for LLM Decoding |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| LanguageβBiasβResilient Visual Question Answering via Adaptive MultiβMargin Collaborative Debiasing |
β |
β |
β
|
β |
β |
β |
β |
1 |
| Large Language Bayes |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Large Language Diffusion Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Large Language Models Think Too Fast To Explore Effectively |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Large Language Models as End-to-end Combinatorial Optimization Solvers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Large Language Models as Model Organisms for Human Associative Learning |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Large Stepsizes Accelerate Gradient Descent for Regularized Logistic Regression |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Large language models can learn and generalize steganographic chain-of-thought under process supervision |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Last Iterate Convergence in Monotone Mean Field Games |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Last-Iterate Convergence of Smooth Regret Matching$^+$ Variants in Learning Nash Equilibria |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Latency NMS Attacks: Is It Real Life or Is It Just Fantasy? |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Latent Chain-of-Thought for Visual Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Latent Harmony: Synergistic Unified UHD Image Restoration via Latent Space Regularization and Controllable Refinement |
β |
β |
β
|
β |
β |
β |
β |
1 |
| Latent Mixture of Symmetries for Sample-Efficient Dynamic Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Latent Principle Discovery for Language Model Self-Improvement |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Latent Refinement via Flow Matching for Training-free Linear Inverse Problem Solving |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Latent Retrieval Augmented Generation of Cross-Domain Protein Binders |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Latent Space Factorization in LoRA |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Lattice Boltzmann Model for Learning Real-World Pixel Dynamicity |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Layer-wise Update Aggregation with Recycling for Communication-Efficient Federated Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| LayerNavigator: Finding Promising Intervention Layers for Efficient Activation Steering in Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LeVo: High-Quality Song Generation with Multi-Preference Alignment |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learn and Ensemble Bridge Adapters for Multi-domain Task Incremental Learning |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| Learn2Mix: Training Neural Networks Using Adaptive Data Integration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learnable Burst-Encodable Time-of-Flight Imaging for High-Fidelity Long-Distance Depth Sensing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learnable Sampler Distillation for Discrete Diffusion Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Learned Prefix Caching for Efficient LLM Inference |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Learning (Approximately) Equivariant Networks via Constrained Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning 3D Anisotropic Noise Distributions Improves Molecular Force Fields |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning 3D Persistent Embodied World Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Learning Across the Gap: Hybrid Multi-armed Bandits with Heterogeneous Offline and Online Data |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Learning CAD Modeling Sequences via Projection and Part Awareness |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning Chern Numbers of Multiband Topological Insulators with Gauge Equivariant Neural Networks |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Learning Cocoercive Conservative Denoisers via Helmholtz Decomposition for Poisson Imaging Inverse Problems |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Learning Counterfactual Outcomes Under Rank Preservation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Learning Crossmodal Interaction Patterns via Attributed Bipartite Graphs for Single-Cell Omics |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Learning Dense Hand Contact Estimation from Imbalanced Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Differential Pyramid Representation for Tone Mapping |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning Diffusion Models with Flexible Representation Guidance |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Dynamics of RNNs in Closed-Loop Environments |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Learning Efficient Fuse-and-Refine for Feed-Forward 3D Gaussian Splatting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Learning Expandable and Adaptable Representations for Continual Learning |
β
|
β
|
β
|
β
|
β |
β |
β |
4 |
| Learning Generalizable Shape Completion with SIM(3) Equivariance |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Learning Gradient Boosted Decision Trees with Algorithmic Recourse |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning Human-Object Interaction as Groups |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Learning Interactive World Model for Object-Centric Reinforcement Learning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Learning Interestingness in Automated Mathematical Theory Formation |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Learning Juntas under Markov Random Fields |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Learning Latent Variable Models via Jarzynski-adjusted Langevin Algorithm |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Linear Attention in Polynomial Time |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Learning Memory-Enhanced Improvement Heuristics for Flexible Job Shop Scheduling |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Learning Multi-Source and Robust Representations for Continual Learning |
β
|
β
|
β
|
β |
β |
β |
β |
3 |
| Learning Neural Exposure Fields for View Synthesis |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning Orthogonal Multi-Index Models: A Fine-Grained Information Exponent Analysis |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Learning Parameterized Skills from Demonstrations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Personalized Ad Impact via Contextual Reinforcement Learning under Delayed Rewards |
β
|
β |
β |
β
|
β |
β |
β
|
3 |
| Learning Preferences without Interaction for Cooperative AI: A Hybrid Offline-Online Approach |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| Learning Provably Improves the Convergence of Gradient Descent |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Learning Reconfigurable Representations for Multimodal Federated Learning with Missing Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Relative Gene Expression Trends from Pathology Images in Spatial Transcriptomics |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Repetition-Invariant Representations for Polymer Informatics |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Learning Robust Spectral Dynamics for Temporal Domain Generalization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Robust Vision-Language Models from Natural Latent Spaces |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Shared Representations from Unpaired Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Simple Interpolants for Linear Integer Arithmetic |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Skill-Attributes for Transferable Assessment in Video |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning Source-Free Domain Adaptation for Visible-Infrared Person Re-Identification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Learning Spatial-Aware Manipulation Ordering |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Stochastic Multiscale Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning Task-Agnostic Representations through Multi-Teacher Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning Theory for Kernel Bilevel Optimization |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Learning Urban Climate Dynamics via Physics-Guided Urban SurfaceβAtmosphere Interactions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Learning Without Augmenting: Unsupervised Time Series Representation Learning via Frame Projections |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Learning World Models for Interactive Video Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning a Cross-Modal SchrΓΆdinger Bridge for Visual Domain Generalization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Learning and Planning Multi-Agent Tasks via an MoE-based World Model |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Learning conformational ensembles of proteins based on backbone geometry |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Learning from A Single Markovian Trajectory: Optimality and Variance Reduction |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Learning from Delayed Feedback in Games via Extra Prediction |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Learning from Demonstrations via Capability-Aware Goal Sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning from Disjoint Views: A Contrastive Prototype Matching Network for Fully Incomplete Multi-View Clustering |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning from Interval Targets |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning from positive and unlabeled examples -Finite size sample bounds |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Learning in Compact Spaces with Approximately Normalized Transformer |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Learning long range dependencies through time reversal symmetry breaking |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning non-equilibrium diffusions with SchrΓΆdinger bridges: from exactly solvable to simulation-free |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning normalized image densities via dual score matching |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning quadratic neural networks in high dimensions: SGD dynamics and scaling laws |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Learning single index models via harmonic decomposition |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Learning the Plasticity: Plasticity-Driven Learning Framework in Spiking Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Learning to Add, Multiply, and Execute Algorithmic Instructions Exactly with Neural Networks |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Learning to Better Search with Language Models via Guided Reinforced Self-Training |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Learning to Clean: Reinforcement Learning for Noisy Label Correction |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Condition: A Neural Heuristic for Scalable MPE Inference |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Learning to Control Free-Form Soft Swimmers |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Learning to Factorize Spatio-Temporal Foundation Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Learning to Flow from Generative Pretext Tasks for Neural Architecture Encoding |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Learning to Focus: Causal Attention Distillation via GradientβGuided Token Pruning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Generalize: An Information Perspective on Neural Processes |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Learning to Generate Human-Human-Object Interactions from Textual Descriptions |
β |
β |
β
|
β |
β |
β
|
β
|
3 |
| Learning to Insert for Constructive Neural Vehicle Routing Solver |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Instruct for Visual Instruction Tuning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Learning to Integrate Diffusion ODEs by Averaging the Derivatives |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning to Learn with Contrastive Meta-Objective |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Learning to Plan Like the Human Brain via Visuospatial Perception and Semantic-Episodic Synergistic Decision-Making |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning to Rank for In-Context Example Retrieval |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Reason under Off-Policy Guidance |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction |
β
|
β
|
β |
β
|
β |
β |
β |
3 |
| Learning to Solve Complex Problems via Dataset Decomposition |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Steer: Input-dependent Steering for Multimodal LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning to Watermark: A Selective Watermarking Framework for Large Language Models via Multi-Objective Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning to Zoom with Anatomical Relations for Medical Structure Detection |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Learning to cluster neuronal function |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning to price with resource constraints: from full information to machine-learned prices |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Learning with Calibration: Exploring Test-Time Computing of Spatio-Temporal Forecasting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Learning with Statistical Equality Constraints |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning βPartner-Awareβ Collaborators in Multi-Party Collaboration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Learning-Augmented Algorithms for $k$-median via Online Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Learning-Augmented Facility Location Mechanisms for Envy Ratio |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Learning-Augmented Online Bidding in Stochastic Settings |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Learning-Augmented Online Bipartite Fractional Matching |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Learning-Augmented Streaming Algorithms for Correlation Clustering |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Least squares variational inference |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Leaving No OOD Instance Behind: Instance-Level OOD Fine-Tuning for Anomaly Segmentation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Length Generalization via Auxiliary Tasks |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Less Greedy Equivalence Search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Less but More: Linear Adaptive Graph Learning Empowering Spatiotemporal Forecasting |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Less is More: Improving LLM Alignment via Preference Data Selection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Less is More: Local Intrinsic Dimensions of Contextual Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Less is More: Unlocking Specialization of Time Series Foundation Models via Structured Pruning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Less is More: an Attention-free Sequence Prediction Modeling for Offline Embodied Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Lessons Learned: A Multi-Agent Framework for Code LLMs to Learn and Improve |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Let Brain Rhythm Shape Machine Intelligence for Connecting Dots on Graphs |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Let LRMs Break Free from Overthinking via Self-Braking Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Let Me Think! A Long Chain of Thought Can Be Worth Exponentially Many Short Ones |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Let a Neural Network be Your Invariant |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Let the LLM Stick to Its Strengths: Learning to Route Economical LLM |
β |
β
|
β
|
β |
β |
β |
β |
2 |
| Let's Revise Step-by-Step: A Unified Local Search Framework for Code Generation with LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Leveraging Conditional Dependence for Efficient World Model Denoising |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Leveraging robust optimization for llm alignment under distribution shifts |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Leveraging semantic similarity for experimentation with AI-generated treatments |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Lie Detector: Unified Backdoor Detection via Cross-Examination Framework |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Lifelong Safety Alignment for Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Lifelong Test-Time Adaptation via Online Learning in Tracked Low-Dimensional Subspace |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LightFair: Towards an Efficient Alternative for Fair T2I Diffusion via Debiasing Pre-trained Text Encoders |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Limitations of Normalization in Attention |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| LinPrim: Linear Primitives for Differentiable Volumetric Rendering |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Linear Attention for Efficient Bidirectional Sequence Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Linear Mixture Distributionally Robust Markov Decision Processes |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Linear Transformers Implicitly Discover Unified Numerical Algorithms |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Linearization Explains Fine-Tuning in Large Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Linearly Constrained Diffusion Implicit Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| List-Level Distribution Coupling with Applications to Speculative Decoding and Lossy Compression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Listwise Preference Diffusion Optimization for User Behavior Trajectories Prediction |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LittleBit: Ultra Low-Bit Quantization via Latent Factorization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LiveStar: Live Streaming Assistant for Real-World Online Video Understanding |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| LoMix: Learnable Weighted Multi-Scale Logits Mixing for Medical Image Segmentation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| LoRA vs Full Fine-tuning: An Illusion of Equivalence |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LoRA-EnVar: Parameter-Efficient Hybrid Ensemble Variational Assimilation for Weather Forecasting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LoRATv2: Enabling Low-Cost Temporal Modeling in One-Stream Trackers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LoRO: Real-Time on-Device Secure Inference for LLMs via TEE-Based Low Rank Obfuscation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LoSplit: Loss-Guided Dynamic Split for Training-Time Defense Against Graph Backdoor Attacks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| LocDiff: Identifying Locations on Earth by Diffusing in the Hilbert Space |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Local Curvature Descent: Squeezing More Curvature out of Standard and Polyak Gradient Descent |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Local Learning for Covariate Selection in Nonparametric Causal Effect Estimation with Latent Variables |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Local-Global Associative Frames for Symmetry-Preserving Crystal Structure Modeling |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Local-Global Coupling Spiking Graph Transformer for Brain Disorders Diagnosis from Two Perspectives |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Localist Topographic Expert Routing: A Barrel Cortex-Inspired Modular Network for Sensorimotor Processing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Locality in Image Diffusion Models Emerges from Data Statistics |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Localized Data Shapley: Accelerating Valuation for Nearest Neighbor Algorithms |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Localizing Knowledge in Diffusion Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Locally Optimal Private Sampling: Beyond the Global Minimax |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Logic.py: Bridging the Gap between LLMs and Constraint Solvers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| LogicTree: Improving Complex Reasoning of LLMs via Instantiated Multi-step Synthetic Logical Data |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Logical Expressiveness of Graph Neural Networks with Hierarchical Node Individualization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Long-Tailed Recognition via Information-Preservable Two-Stage Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Long-tailed Recognition with Model Rebalancing |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| LongMagpie: A Self-synthesis Method for Generating Large-scale Long-context Instructions |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Look-Ahead Reasoning on Learning Platforms |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Lookahead Routing for Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Looking Beyond the Known: Towards a Data Discovery Guided Open-World Object Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Looking Into the Water by Unsupervised Learning of the Surface Shape |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Lorentz Local Canonicalization: How to make any Network Lorentz-Equivariant |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Lost in Transmission: When and Why LLMs Fail to Reason Globally |
β |
β
|
β
|
β |
β |
β
|
β
|
4 |
| Low Precision Streaming PCA |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Low Rank Gradients and Where to Find Them |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Low-Rank Graphon Learning for Networks |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Low-Rank Head Avatar Personalization with Registers |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Low-degree evidence for computational transition of recovery rate in stochastic block model |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Lua-LLM: Learning Unstructured-Sparsity Allocation for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Luminance-Aware Statistical Quantization: Unsupervised Hierarchical Learning for Illumination Enhancement |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| LuxDiT: Lighting Estimation with Video Diffusion Transformer |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Lyapunov-Stable Adaptive Control for Multimodal Concept Drift |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| MAESTRO : Adaptive Sparse Attention and Robust Learning for Multimodal Dynamic Time Series |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MANGO: Multimodal Attention-based Normalizing Flow Approach to Fusion Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| MAP Estimation with Denoisers: Convergence Rates and Guarantees |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| MAPLE: Multi-scale Attribute-enhanced Prompt Learning for Few-shot Whole Slide Image Classification |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| MARS: A Malignity-Aware Backdoor Defense in Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MAT-Agent: Adaptive Multi-Agent Training Optimization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MEIcoder: Decoding Visual Stimuli from Neural Activity by Leveraging Most Exciting Inputs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MGUP: A Momentum-Gradient Alignment Update Policy for Stochastic Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MI-TRQR: Mutual Information-Based Temporal Redundancy Quantification and Reduction for Energy-Efficient Spiking Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MIBP-Cert: Certified Training against Data Perturbations with Mixed-Integer Bilinear Programs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MIDAS: Misalignment-based Data Augmentation Strategy for Imbalanced Multimodal Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MIHC: Multi-View Interpretable Hypergraph Neural Networks with Information Bottleneck for Chip Congestion Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MIND: Material Interface Generation from UDFs for Non-Manifold Surface Reconstruction |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| MIP against Agent: Malicious Image Patches Hijacking Multimodal OS Agents |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MIRA: Medical Time Series Foundation Model for Real-World Health Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MIX: A Multi-view Time-Frequency Interactive Explanation Framework for Time Series Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MJ-Video: Benchmarking and Rewarding Video Generation with Fine-Grained Video Preference |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| MLE-STAR: Machine Learning Engineering Agent via Search and Targeted Refinement |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MLEP: Multi-granularity Local Entropy Patterns for Generalized AI-generated Image Detection |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MLZero: A Multi-Agent System for End-to-end Machine Learning Automation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| MMaDA: Multimodal Large Diffusion Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MOBO-OSD: Batch Multi-Objective Bayesian Optimization via Orthogonal Search Directions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MODEL SHAPLEY: Find Your Ideal Parameter Player via One Gradient Backpropagation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| MOF-BFN: Metal-Organic Frameworks Structure Prediction via Bayesian Flow Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| MOSDT: Self-Distillation-Based Decision Transformer for Multi-Agent Offline Safe Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MOTION: Multi-Sculpt Evolutionary Coarsening for Federated Continual Graph Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| MPCache: MPC-Friendly KV Cache Eviction for Efficient Private LLM Inference |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| MR. Video: MapReduce as an Effective Principle for Long Video Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| MS-BART: Unified Modeling of Mass Spectra and Molecules for Structure Elucidation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MTL-KD: Multi-Task Learning Via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MTRec: Learning to Align with User Preferences via Mental Reward Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| MURKA: Multi-Reward Reinforcement Learning with Knowledge Alignment for Optimization Tasks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MUSTAFAR: Promoting Unstructured Sparsity for KV Cache Pruning in LLM Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| MVSMamba: Multi-View Stereo with State Space Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MaNGO β Adaptable Graph Network Simulators via Meta-Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Machine Unlearning in 3D Generation: A Perspective-Coherent Acceleration Framework |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Machine Unlearning under Overparameterization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Machine Unlearning via Task Simplex Arithmetic |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| MagCache: Fast Video Generation with Magnitude-Aware Cache |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Magical: Medical Lay Language Generation via Semantic Invariance and Layperson-tailored Adaptation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MaintainCoder: Maintainable Code Generation Under Dynamic Requirements |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Majority of the Bests: Improving Best-of-N via Bootstrapping |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Make Information Diffusion Explainable: LLM-based Causal Framework for Diffusion Prediction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Making Classic GNNs Strong Baselines Across Varying Homophily: A SmoothnessβGeneralization Perspective |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Mamba Goes HoME: Hierarchical Soft Mixture-of-Experts for 3D Medical Image Segmentation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Mamba Modulation: On the Length Generalization of Mamba Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Mamba Only Glances Once (MOGO): A Lightweight Framework for Efficient Video Action Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Manipulating 3D Molecules in a Fixed-Dimensional E(3)-Equivariant Latent Space |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Manipulating Feature Visualizations with Gradient Slingshots |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Many LLMs Are More Utilitarian Than One |
β |
β
|
β
|
β |
β |
β
|
β
|
4 |
| Many Minds, One Goal: Time Series Forecasting via Sub-task Specialization and Inter-agent Cooperation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Marginal-Nonuniform PAC Learnability |
β |
β |
β |
β |
β |
β |
β |
0 |
| Markov Persuasion Processes: Learning to Persuade From Scratch |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Martingale Posterior Neural Networks for Fast Sequential Decision Making |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Martingale Score: An Unsupervised Metric for Bayesian Rationality in LLM Reasoning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Mask Image Watermarking |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Masked Diffusion Models as Energy Minimization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Masked Gated Linear Unit |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Matching Markets Meet LLMs: Algorithmic Reasoning with Ranked Preferences |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Matchings Under Biased and Correlated Evaluations |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| MaterialRefGS: Reflective Gaussian Splatting with Multi-view Consistent Material Inference |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Max Entropy Moment Kalman Filter for Polynomial Systems with Arbitrary Noise |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| MaxSup: Overcoming Representation Collapse in Label Smoothing |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Maximizing the Value of Predictions in Control: Accuracy Is Not Enough |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Mean Flows for One-step Generative Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Measure-Theoretic Anti-Causal Representation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Measuring AI Ability to Complete Long Software Tasks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Measuring and Guiding Monosemanticity |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Mechanism Design for LLM Fine-tuning with Multiple Reward Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Mechanism Design via the Interim Relaxation |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Mechanistic Interpretability of RNNs emulating Hidden Markov Models |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Median Selection with Noisy and Structural Information |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Mellow: a small audio language model for reasoning |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| MemEIC: A Step Toward Continual and Compositional Knowledge Editing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants |
β |
β
|
β
|
β |
β |
β |
β |
2 |
| Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Memorization in Graph Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Memory Injection Attacks on LLM Agents via Query-Only Interaction |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Memory Mosaics at scale |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Memory by accident: a theory of learning as a byproduct of network stabilization |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Memory-Augmented Potential Field Theory: A Framework for Adaptive Control in Non-Convex Domains |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Memory-Efficient Training with In-Place FFT Implementation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Memory-Enhanced Neural Solvers for Routing Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Memory-Integrated Reconfigurable Adapters: A Unified Framework for Settings with Multiple Tasks |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Merging on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning |
β |
β |
β |
β
|
β
|
β
|
β
|
4 |
| Mesh Interpolation Graph Network for Dynamic and Spatially Irregular Global Weather Forecasting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Meta CLIP 2: A Worldwide Scaling Recipe |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Meta Guidance: Incorporating Inductive Biases into Deep Time Series Imputers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Meta-D2AG: Causal Graph Learning with Interventional Dynamic Data |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Meta-Learning Objectives for Preference Optimization |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Meta-learning how to Share Credit among Macro-Actions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MetaDefense: Defending Fine-tuning based Jailbreak Attack Before and During Generation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MetaFind: Scene-Aware 3D Asset Retrieval for Coherent Metaverse Scene Generation |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| MetaGS: A Meta-Learned Gaussian-Phong Model for Out-of-Distribution 3D Scene Relighting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MetaKoopman: Bayesian Meta-Learning of Koopman Operators for Modeling Structured Dynamics under Distribution Shifts |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| MetaSlot: Break Through the Fixed Number of Slots in Object-Centric Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Metis: A Foundation Speech Generation Model with Masked Generative Pre-training |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Metric Automata Theory: A Unifying Theory of RNNs |
β |
β
|
β
|
β |
β |
β |
β |
2 |
| Metritocracy: Representative Metrics for Lite Benchmarks |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Metropolis Adjusted Microcanonical Hamiltonian Monte Carlo |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Metropolis-Hastings Sampling for 3D Gaussian Reconstruction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MiCADangelo: Fine-Grained Reconstruction of Constrained CAD Models from 3D Scans |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| MiCo: Multi-image Contrast for Reinforcement Visual Reasoning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| MigGPT: Harnessing Large Language Models for Automated Migration of Out-of-Tree Linux Kernel Patches Across Versions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Cultural Learning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| MindJourney: Test-Time Scaling with World Models for Spatial Reasoning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| MiniMax-Remover: Taming Bad Noise Helps Video Object Removal |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Minimal Semantic Sufficiency Meets Unsupervised Domain Generalization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Minimax Adaptive Online Nonparametric Regression over Besov spaces |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Minimax-Optimal Univariate Function Selection in Sparse Additive Models: Rates, Adaptation, and the Estimation-Selection Gap |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Minimizing False-Positive Attributions in Explanations of Non-Linear Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Minimum Width for Deep, Narrow MLP: A Diffeomorphism Approach |
β |
β |
β |
β |
β |
β |
β |
0 |
| Mint: A Simple Test-Time Adaptation of Vision-Language Models against Common Corruptions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MisoDICE: Multi-Agent Imitation from Mixed-Quality Demonstrations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Miss-ReID: Delivering Robust Multi-Modality Object Re-Identification Despite Missing Modalities |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Missing Data Imputation by Reducing Mutual Information with Rectified Flows |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| Mitigating Forgetting in LLM Fine-Tuning via Low-Perplexity Token Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Mitigating Instability in High Residual Adaptive Sampling for PINNs via Langevin Dynamics |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Mitigating Occlusions in Virtual Try-On via A Simple-Yet-Effective Mask-Free Framework |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Mitigating Overthinking in Large Reasoning Models via Manifold Steering |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Mitigating Reward Over-optimization in Direct Alignment Algorithms with Importance Sampling |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Mitigating Semantic Collapse in Partially Relevant Video Retrieval |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Mitigating Sexual Content Generation via Embedding Distortion in Text-conditioned Diffusion Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Mitigating Spurious Features in Contrastive Learning with Spectral Regularization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Mitigating the PrivacyβUtility Trade-off in Decentralized Federated Learning via f-Differential Privacy |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Mitra: Mixed Synthetic Priors for Enhancing Tabular Foundation Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| MixAT: Combining Continuous and Discrete Adversarial Training for LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MixPrompt: Efficient Mixed Prompting for Multimodal Semantic Segmentation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MixSignGraph: A Sign Sequence is Worth Mixed Graphs of Nodes |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Mixed-Sample SGD: an End-to-end Analysis of Supervised Transfer Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Mixture of Inputs: Text Generation Beyond Discrete Token Sampling |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Mixture of Noise for Pre-Trained Model-Based Class-Incremental Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Mixture of Scope Experts at Test: Generalizing Deeper Graph Neural Networks with Shallow Variants |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Mixture-of-Experts Meets In-Context Reinforcement Learning |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Mixtures of Subspaces for Bandwidth Efficient Context Parallel Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MoBA: Mixture of Block Attention for Long-Context LLMs |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| MoCha: Towards Movie-Grade Talking Character Generation |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| MoE-Gyro: Self-Supervised Over-Range Reconstruction and Denoising for MEMS Gyroscopes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MoEMeta: Mixture-of-Experts Meta Learning for Few-Shot Relational Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MoESD: Unveil Speculative Decoding's Potential for Accelerating Sparse MoE |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MoFo: Empowering Long-term Time Series Forecasting with Periodic Pattern Modeling |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| MoRIC: A Modular Region-based Implicit Codec for Image Compression |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| MobileODE: An Extra Lightweight Network |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MobileUse: A Hierarchical Reflection-Driven GUI Agent for Autonomous Mobile Operation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ModHiFi: Identifying High Fidelity predictive components for Model Modification |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Modality-Aware SAM: Sharpness-Aware-Minimization Driven Gradient Modulation for Harmonized Multimodal Learning |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Model Editing for Vision Transformers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Model Inversion with Layer-Specific Modeling and Alignment for Data-Free Continual Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Model Merging in Pre-training of Large Language Models |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Model Provenance Testing for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Model Reconciliation via Cost-Optimal Explanations in Probabilistic Logic Programming |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Model-Based Policy Adaptation for Closed-Loop End-to-end Autonomous Driving |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Model-Informed Flows for Bayesian Inference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Modeling Cell Dynamics and Interactions with Unbalanced Mean Field SchrΓΆdinger Bridge |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Modeling Dynamic Neural Activity by combining Naturalistic Video Stimuli and Stimulus-independent Latent Factors |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Modeling Microenvironment Trajectories on Spatial Transcriptomics with NicheFlow |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Modeling Neural Activity with Conditionally Linear Dynamical Systems |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Modeling the Economic Impacts of AI Openness Regulation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Modelling the control of offline processing with reinforcement learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ModelβBehavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isnβt the Right One |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MokA: Multimodal Low-Rank Adaptation for MLLMs |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| MoleBridge: Synthetic Space Projecting with Discrete Markov Bridges |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Momentum Multi-Marginal SchrΓΆdinger Bridge Matching |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Momentum-SAM: Sharpness Aware Minimization without Computational Overhead |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Monitoring Risks in Test-Time Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| MonoLift: Learning 3D Manipulation Policies from Monocular RGB via Distillation |
β |
β |
β
|
β |
β |
β |
β |
1 |
| Monoculture or Multiplicity: Which Is It? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Monotone and Separable Set Functions: Characterizations and Neural Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MoodAngels: A Retrieval-augmented Multi-agent Framework for Psychiatry Diagnosis |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| MoonCast: High-Quality Zero-Shot Podcast Generation |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| More Than Just Functional: LLM-as-a-Critique for Efficient Code Generation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| More of the Same: Persistent Representational Harms Under Increased Representation |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| MotionBind: Multi-Modal Human Motion Alignment for Retrieval, Recognition, and Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Mozart: Modularized and Efficient MoE Training on 3.5D Wafer-Scale Chiplet Architectures |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MuSLR: Multimodal Symbolic Logical Reasoning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Multi-Agent Collaboration via Evolving Orchestration |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Multi-Agent Debate for LLM Judges with Adaptive Stability Detection |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Multi-Agent Imitation by Learning and Sampling from Factorized Soft Q-Function |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Multi-Agent Learning under Uncertainty: Recurrence vs. Concentration |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Multi-Agent Reinforcement Learning with Communication-Constrained Priors |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Multi-Class Support Vector Machine with Differential Privacy |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Multi-Expert Distributionally Robust Optimization for Out-of-Distribution Generalization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Multi-Kernel Correlation-Attention Vision Transformer for Enhanced Contextual Understanding and Multi-Scale Integration |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Multi-Modal Interactive Agent Layer for Few-Shot Universal Cross-Domain Retrieval and Beyond |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Multi-Modal View Enhanced Large Vision Models for Long-Term Time Series Forecasting |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Multi-Objective Hyperparameter Selection via Hypothesis Testing on Reliability Graphs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multi-Objective One-Shot Pruning for Large Language Models |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Multi-Objective Reinforcement Learning with Max-Min Criterion: A Game-Theoretic Approach |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multi-Scale Finetuning for Encoder-based Time Series Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multi-Token Prediction Needs Registers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multi-View Oriented GPLVM: Expressiveness and Efficiency |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multi-agent KTO: Enhancing Strategic Interactions of Large Language Model in Language Game |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multi-agent Markov Entanglement |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multi-dataset Joint Pre-training of Emotional EEG Enables Generalizable Affective Computing |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Multi-head Temporal Latent Attention |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Multi-modal contrastive learning adapts to intrinsic dimensions of shared latent variables |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Multi-order Orchestrated Curriculum Distillation for Model-Heterogeneous Federated Graph Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Multi-scale Temporal Prediction via Incremental Generation and Multi-agent Collaboration |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Multi-step Visual Reasoning with Visual Tokens Scaling and Verification |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| MultiNet: Adaptive Multi-Viewed Subgraph Convolutional Networks for Graph Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| MultiScale Contextual Bandits for Long Term Objectives |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multiclass Loss Geometry Matters for Generalization of Gradient Descent in Separable Classification |
β |
β |
β |
β |
β |
β |
β |
0 |
| Multidimensional Bayesian Utility Maximization: Tight Approximations to Welfare |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Multilevel neural simulation-based inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multimodal 3D Genome Pre-training |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Multimodal Causal Reasoning for UAV Object Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multimodal Disease Progression Modeling via Spatiotemporal Disentanglement and Multiscale Alignment |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Multimodal LiDAR-Camera Novel View Synthesis with Unified Pose-free Neural Fields |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Multimodal Negative Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multimodal Tabular Reasoning with Privileged Structured Information |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multiplayer Federated Learning: Reaching Equilibrium with Less Communication |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Multiplication-Free Parallelizable Spiking Neurons with Efficient Spatio-Temporal Dynamics |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multipole Attention for Efficient Long Context Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Multiresolution Analysis and Statistical Thresholding on Dynamic Networks |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Multiscale guidance of protein structure prediction with heterogeneous cryo-EM data |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Multitask Learning with Stochastic Interpolants |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multivariate Latent Recalibration for Conditional Normalizing Flows |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Multivariate Time Series Anomaly Detection with Idempotent Reconstruction |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| MutualVPR: A Mutual Learning Framework for Resolving Supervision Inconsistencies via Adaptive Clustering |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Mysteries of the Deep: Role of Intermediate Representations in Out of Distribution Detection |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| NEED: Cross-Subject and Cross-Task Generalization for Video and Image Reconstruction from EEG Signals |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| NEP: Autoregressive Image Editing via Next Editing Token Prediction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| NFIG: Multi-Scale Autoregressive Image Generation via Frequency Ordering |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| NFL-BA: Near-Field Light Bundle Adjustment for SLAM in Dynamic Lighting |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| NOBLE - Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| NPN: Non-Linear Projections of the Null-Space for Imaging Inverse Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| NUTS: Eddy-Robust Reconstruction of Surface Ocean Nutrients via Two-Scale Modeling |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| NaDRO: Leveraging Dual-Reward Strategies for LLMs Training on Noisy Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Native Segmentation Vision Transformers |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Native-Resolution Image Synthesis |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Natural Gradient VI: Guarantees for Non-Conjugate Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| NavBench: Probing Multimodal Large Language Models for Embodied Navigation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Navigating the MIL Trade-Off: Flexible Pooling for Whole Slide Image Classification |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| NeSyPr: Neurosymbolic Proceduralization For Efficient Embodied Reasoning |
β
|
β
|
β
|
β
|
β |
β |
β |
4 |
| Near-Exponential Savings for Population Mean Estimation with Active Learning |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Near-Optimal Experiment Design in Linear non-Gaussian Cyclic Models |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Near-Optimal Quantum Algorithms for Computing (Coarse) Correlated Equilibria of General-Sum Games |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Near-Optimal Sample Complexity for Online Constrained MDPs |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Nearly Dimension-Independent Convergence of Mean-Field Black-Box Variational Inference |
β |
β |
β |
β |
β |
β |
β |
0 |
| Nearly-Linear Time Private Hypothesis Selection with the Optimal Approximation Factor |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Nearly-Linear Time and Massively Parallel Algorithms for $k$-anonymity |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Negative Feedback Really Matters: Signed Dual-Channel Graph Contrastive Learning Framework for Recommendation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| NegoCollab: A Common Representation Negotiation Approach for Heterogeneous Collaborative Perception |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Neighbor-aware Contrastive Disambiguation for Cross-Modal Hashing with Redundant Annotations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Neighborhood Self-Dissimilarity Attention for Medical Image Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Nested Learning: The Illusion of Deep Learning Architectures |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| NestedFP: High-Performance, Memory-Efficient Dual-Precision Floating Point Support for LLMs |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Network two-sample test for block models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| NeuSymEA: Neuro-symbolic Entity Alignment via Variational Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| NeurIPT: Foundation Model for Neural Interfaces |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Neural Atlas Graphs for Dynamic Scene Decomposition and Editing |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Neural Attention Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Neural B-frame Video Compression with Bi-directional Reference Harmonization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Neural Collapse in Cumulative Link Models for Ordinal Regression: An Analysis with Unconstrained Feature Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Neural Collapse is Globally Optimal in Deep Regularized ResNets and Transformers |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Neural Collapse under Gradient Flow on Shallow ReLU Networks for Orthogonally Separable Data |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Neural Combinatorial Optimization for Time-Dependent Traveling Salesman Problem |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Neural Correlates of Serial Dependence: Synaptic Short-term Plasticity Orchestrates Repulsion and Attraction |
β |
β |
β |
β |
β
|
β
|
β
|
3 |
| Neural Emulator Superiority: When Machine Learning for PDEs Surpasses its Training Data |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Neural Entropy |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Neural Evolution Strategy for Black-box Pareto Set Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Neural Fractional Attention Differential Equations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Neural Greenβs Functions |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Neural Hamiltonian Diffusions for Modeling Structured Geometric Dynamics |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Neural MJD: Neural Non-Stationary Merton Jump Diffusion for Time Series Prediction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Neural Mutual Information Estimation with Vector Copulas |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Neural Networks for Learnable and Scalable Influence Estimation of Instruction Fine-Tuning Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Neural Rule Lists: Learning Discretizations, Rules, and Order in One Go |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Neural Stochastic Flows: Solver-Free Modelling and Inference for SDE Solutions |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Neural Tangent Knowledge Distillation for Optical Convolutional Networks |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Neural Thermodynamics: Entropic Forces in Deep and Universal Representation Learning |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Neural-Driven Image Editing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| NeuralSurv: Deep Survival Analysis with Bayesian Uncertainty Quantification |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Neuro-Spectral Architectures for Causal Physics-Informed Networks |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| NeuroH-TGL: Neuro-Heterogeneity Guided Temporal Graph Learning Strategy for Brain Disease Diagnosis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| NeuroPath: Neurobiology-Inspired Path Tracking and Reflection for Semantically Coherent Retrieval |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Neurons as Detectors of Coherent Sets in Sensory Dynamics |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Neurosymbolic Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| New Parallel and Streaming Algorithms for Directed Densest Subgraph |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| New Perspectives on the Polyak Stepsize: Surrogate Functions and Negative Results |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Next Semantic Scale Prediction via Hierarchical Diffusion Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| No Experts, No Problem: Avoidance Learning from Bad Demonstrations |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| No Loss, No Gain: Gated Refinement and Adaptive Compression for Prompt Optimization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| No Object Is an Island: Enhancing 3D Semantic Segmentation Generalization with Diffusion Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need! |
β
|
β |
β |
β |
β |
β |
β |
1 |
| No-Regret Online Autobidding Algorithms in First-price Auctions |
β
|
β |
β |
β |
β |
β |
β |
1 |
| No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Noise Consistency Training: A Native Approach for One-step Generator in Learning Additional Controls |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Noise Matters: Optimizing Matching Noise for Diffusion Classifiers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Noise-Robustness Through Noise: A Framework combining Asymmetric LoRA with Poisoning MoE |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Noisy Multi-Label Learning through Co-Occurrence-Aware Diffusion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Non-Adaptive Adversarial Face Generation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Non-Asymptotic Analysis Of Data Augmentation For Precision Matrix Estimation |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Non-Asymptotic Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Non-Clairvoyant Scheduling with Progress Bars |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Non-Convex Tensor Recovery from Tube-Wise Sensing |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Non-Line-of-Sight 3D Reconstruction with Radar |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Non-Markovian Discrete Diffusion with Causal Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Non-Singularity of the Gradient Descent Map for Neural Networks with Piecewise Analytic Activations |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Non-Stationary Lipschitz Bandits |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Non-Stationary Structural Causal Bandits |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Non-Uniform Multiclass Learning with Bandit Feedback |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Non-convex entropic mean-field optimization via Best Response flow |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Non-equilibrium Annealed Adjoint Sampler |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Non-exchangeable Conformal Prediction with Optimal Transport: Tackling Distribution Shift with Unlabeled Data |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Non-monotone Submodular Optimization: $p$-Matchoid Constraints and Fully Dynamic Setting |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Non-rectangular Robust MDPs with Normed Uncertainty Sets |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Non-stationary Bandit Convex Optimization: A Comprehensive Study |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Non-stationary Equivariant Graph Neural Networks for Physical Dynamics Simulation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Nonlinear Laplacians: Tunable principal component analysis under directional prior information |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic Analysis |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Nonparametric Quantile Regression with ReLU-Activated Recurrent Neural Networks |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| NopeRoomGS: Indoor 3D Gaussian Splatting Optimization without Camera Pose Input |
β |
β |
β
|
β |
β |
β |
β |
1 |
| NormFit: A Lightweight Solution for Few-Shot Federated Learning with Non-IID Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Normal-Abnormal Guided Generalist Anomaly Detection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Normalization in Attention Dynamics |
β |
β |
β |
β |
β |
β |
β |
0 |
| Normalize Filters! Classical Wisdom for Deep Vision |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Normalizing Flows are Capable Models for Continuous Control |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Not All Data are Good Labels: On the Self-supervised Labeling for Time Series Forecasting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Novel Exploration via Orthogonality |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Novel View Synthesis from A Few Glimpses via Test-Time Natural Video Completion |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| NystrΓΆm-Accelerated Primal LS-SVMs: Breaking the $O(an^3)$ Complexity Bottleneck for Scalable ODEs Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| OASIS: One-Shot Federated Graph Learning via Wasserstein Assisted Knowledge Integration |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OCTDiff: Bridged Diffusion Model for Portable OCT Super-Resolution and Enhancement |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| ODG: Occupancy Prediction Using Dual Gaussians |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| OMiSO: Adaptive optimization of state-dependent brain stimulation to shape neural population states |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| OOD-Barrier: Build a Middle-Barrier for Open-Set Single-Image Test Time Adaptation via Vision Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OPHR: Mastering Volatility Trading with Multi-Agent Deep Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OPMapper: Enhancing Open-Vocabulary Semantic Segmentation with Multi-Guidance Information |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| OPTFM: A Scalable Multi-View Graph Transformer for Hierarchical Pre-Training in Combinatorial Optimization |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| ORIGAMISPACE: Benchmarking Multimodal LLMs in Multi-Step Spatial Reasoning with Mathematical Constraints |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| OSCAR: One-Step Diffusion Codec Across Multiple Bit-rates |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OSTAR: Optimized Statistical Text-classifier with Adversarial Resistance |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ObCLIP: Oblivious CLoud-Device Hybrid Image Generation with Privacy Preservation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Object Concepts Emerge from Motion |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Object-Centric Concept-Bottlenecks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Object-Centric Representation Learning for Enhanced 3D Semantic Scene Graph Prediction |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Object-centric 3D Motion Field for Robot Learning from Human Videos |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Object-centric binding in Contrastive Language-Image Pretraining |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Objective Soups: Multilingual Multi-Task Modeling for Speech Processing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Obliviator Reveals the Cost of Nonlinear Guardedness in Concept Erasure |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Off-policy Reinforcement Learning with Model-based Exploration Augmentation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Offline Actor-Critic for Average Reward MDPs |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Omni-DNA: A Genomic Model Supporting Sequence Understanding, Long-context, and Textual Annotation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Omni-Mol: Multitask Molecular Model for Any-to-any Modalities |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| OmniCast: A Masked Latent Diffusion Model for Weather Forecasting Across Time Scales |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data |
β |
β
|
β |
β |
β |
β
|
β
|
3 |
| OmniDraft: A cross-vocabulary, online adaptive drafter for on-device speculative decoding |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| OmniFC: Rethinking Federated Clustering via Lossless and Secure Distance Reconstruction |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| OmniGaze: Reward-inspired Generalizable Gaze Estimation in the Wild |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OmniGen-AR: AutoRegressive Any-to-Image Generation |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| OmniSVG: A Unified Scalable Vector Graphics Generation Model |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| OmniTalker: One-shot Real-time Text-Driven Talking Audio-Video Generation With Multimodal Style Mimicking |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| OmniTry: Virtual Try-On Anything without Masks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| OmniZoom: A Universal Plug-and-Play Paradigm for Cross-Device Smooth Zoom Interpolation |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Omnidirectional 3D Scene Reconstruction from Single Image |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Omnipresent Yet Overlooked: Heat Kernels in Combinatorial Bayesian Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On Agnostic PAC Learning in the Small Error Regime |
β
|
β |
β |
β |
β |
β |
β |
1 |
| On Efficiency-Effectiveness Trade-off of Diffusion-based Recommenders |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On Evaluating LLM Alignment by Evaluating LLMs as Judges |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| On Evaluating Policies for Robust POMDPs |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| On Extending Direct Preference Optimization to Accommodate Ties |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On Fairness of Unified Multimodal Large Language Model for Image Generation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| On Feasible Rewards in Multi-Agent Inverse Reinforcement Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On Group Sufficiency Under Label Bias |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| On Hierarchies of Fairness Notions in Cake Cutting: From Proportionality to Super Envy-Freeness |
β
|
β |
β |
β |
β |
β |
β |
1 |
| On Inductive Biases That Enable Generalization in Diffusion Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On Learning Verifiers and Implications to Chain-of-Thought Reasoning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| On Linear Mode Connectivity of Mixture-of-Experts Architectures |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| On Local Limits of Sparse Random Graphs: Color Convergence and the Refined Configuration Model |
β |
β |
β |
β |
β |
β |
β |
0 |
| On Logic-based Self-Explainable Graph Neural Networks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| On Optimal Steering to Achieve Exact Fairness |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| On Reasoning Strength Planning in Large Reasoning Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On Traceability in $\ell_p$ Stochastic Convex Optimization |
β |
β |
β |
β |
β |
β |
β |
0 |
| On Transferring Transferability: Towards a Theory for Size Generalization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On Union-Closedness of Language Generation |
β |
β |
β |
β |
β |
β |
β |
0 |
| On Universality Classes of Equivariant Networks |
β |
β |
β |
β |
β |
β |
β |
0 |
| On Vanishing Gradients, Over-Smoothing, and Over-Squashing in GNNs: Bridging Recurrent and Graph Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On scalable and efficient training of diffusion samplers |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| On the Closed-Form of Flow Matching: Generalization Does Not Arise from Target Stochasticity |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| On the Coexistence and Ensembling of Watermarks |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization |
β |
β
|
β |
β |
β
|
β
|
β
|
4 |
| On the Convergence of Single-Timescale Actor-Critic |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| On the Convergence of Stochastic Smoothed Multi-Level Compositional Gradient Descent Ascent |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| On the Edge of Memorization in Diffusion Models |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| On the Emergence of Linear Analogies in Word Embeddings |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| On the Entropy Calibration of Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| On the Existence and Complexity of Core-Stable Data Exchanges |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| On the Expressive Power of Mixture-of-Experts for Structured Complex Tasks |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| On the Global Optimality of Policy Gradient Methods in General Utility Reinforcement Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| On the Hardness of Approximating Distributions with Tractable Probabilistic Models |
β |
β |
β |
β |
β |
β |
β |
0 |
| On the Hardness of Conditional Independence Testing In Practice |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| On the Integration of Spatial-Temporal Knowledge: A Lightweight Approach to Atmospheric Time Series Forecasting |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| On the Loss of Context Awareness in General Instruction Fine-tuning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| On the Mechanisms of Weak-to-Strong Generalization: A Theoretical Perspective |
β |
β |
β |
β |
β |
β |
β
|
1 |
| On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| On the Optimality of the Median-of-Means Estimator under Adversarial Contamination |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| On the Relation between Rectified Flows and Optimal Transport |
β |
β |
β |
β |
β |
β |
β
|
1 |
| On the Robustness of Transformers against Context Hijacking for Linear Classification |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| On the Robustness of Verbal Confidence of LLMs in Adversarial Attacks |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| On the Role of Hidden States of Modern Hopfield Network in Transformer |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| On the SAC-BL Algorithm for Anomaly Detection |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| On the Sample Complexity Bounds of Bilevel Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| On the Sample Complexity of Differentially Private Policy Optimization |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| On the Stability and Generalization of Meta-Learning: the Impact of Inner-Levels |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| On the Stability of Graph Convolutional Neural Networks: A Probabilistic Perspective |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| On the Surprising Effectiveness of Large Learning Rates under Standard Width Scaling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On the Universal Near Optimality of Hedge in Combinatorial Settings |
β |
β |
β |
β |
β |
β |
β |
0 |
| On the VC dimension of deep group convolutional neural networks |
β |
β |
β |
β |
β |
β |
β |
0 |
| On the Value of Cross-Modal Misalignment in Multimodal Representation Learning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| On the creation of narrow AI: hierarchy and nonlocality of neural network skills |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On the necessity of adaptive regularisation: Optimal anytime online learning on $\boldsymbol{\ell_p}$-balls |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| On the rankability of visual embeddings |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| On the sample complexity of semi-supervised multi-objective learning |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| On topological descriptors for graph products |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Once Upon an Input: Reasoning via Per-Instance Program Synthesis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| One Filters All: A Generalist Filter For State Estimation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| One Head to Rule Them All: Amplifying LVLM Safety through a Single Critical Attention Head |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| One Prompt Fits All: Universal Graph Adaptation for Pretrained Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| One SPACE to Rule Them All: Jointly Mitigating Factuality and Faithfulness Hallucinations in LLMs |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| One Sample is Enough to Make Conformal Prediction Robust |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| One Stone with Two Birds: A Null-Text-Null Frequency-Aware Diffusion Models for Text-Guided Image Inpainting |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| One Token Embedding Is Enough to Deadlock Your Large Reasoning Model |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| One for All: Universal Topological Primitive Transfer for Graph Structure Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| One-Step Diffusion-Based Image Compression with Semantic Distillation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| One-Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Online Bilateral Trade With Minimal Feedback: Donβt Waste Sellerβs Time |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Online Experimental Design With Estimation-Regret Trade-off Under Network Interference |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Online Feedback Efficient Active Target Discovery in Partially Observable Environments |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Online Functional Tensor Decomposition via Continual Learning for Streaming Data Completion |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Online Inverse Linear Optimization: Efficient Logarithmic-Regret Algorithm, Robustness to Suboptimality, and Lower Bound |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Online Learning in the Repeated Mediated Newsvendor Problem |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Online Learning of Neural Networks |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Online Learning of Pure States is as Hard as Mixed States |
β |
β |
β |
β |
β |
β |
β |
0 |
| Online Locally Differentially Private Conformal Prediction via Binary Inquiries |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Online Mixture of Experts: No-Regret Learning for Optimal Collective Decision-Making |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| Online Multi-Class Selection with Group Fairness Guarantee |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Online Optimization for Offline Safe Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Online Portfolio Selection with ML Predictions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Online Prediction with Limited Selectivity |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Online Segment Any 3D Thing as Instance Tracking |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Online Strategic Classification With Noise and Partial Feedback |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Online Time Series Forecasting with Theoretical Guarantees |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| Online Two-Stage Submodular Maximization |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Online robust locally differentially private learning for nonparametric regression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OnlineSplatter: Pose-Free Online 3D Reconstruction for Free-Moving Objects |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Open-Vocabulary Part Segmentation via Progressive and Boundary-Aware Strategy |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Open-World Drone Active Tracking with Goal-Centered Rewards |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| OpenBox: Annotate Any Bounding Boxes in 3D |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OpenCUA: Open Foundations for Computer-Use Agents |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| OpenMMEgo: Enhancing Egocentric Understanding for LMMs with Open Weights and Data |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-time Emotional Speech Synthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OpenVLThinker: Complex Vision-Language Reasoning via Iterative SFT-RL Cycles |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Opinion Maximization in Social Networks by Modifying Internal Opinions |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization |
β |
β
|
β |
β
|
β |
β
|
β
|
4 |
| OptiTree: Hierarchical Thoughts Generation with Tree Search for LLM Optimization Modeling |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Optical Coherence Tomography Harmonization with Anatomy-Guided Latent Metric SchrΓΆdinger Bridges |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Optimal Adjustment Sets for Nonparametric Estimation of Weighted Controlled Direct Effect |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Optimal Best Arm Identification under Differential Privacy |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Optimal Control for Transformer Architectures: Enhancing Generalization, Robustness and Efficiency |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning |
β
|
β |
β |
β
|
β |
β |
β
|
3 |
| Optimal Estimation of the Best Mean in Multi-Armed Bandits |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Optimal Graph Clustering without Edge Density Signals |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Optimal Minimum Width for the Universal Approximation of Continuously Differentiable Functions by Deep Narrow MLPs |
β |
β |
β |
β |
β |
β |
β |
0 |
| Optimal Mistake Bounds for Transductive Online Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Optimal Neural Compressors for the Rate-Distortion-Perception Tradeoff |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Optimal Nuisance Function Tuning for Estimating a Doubly Robust Functional under Proportional Asymptotics |
β
|
β
|
β |
β
|
β |
β |
β
|
4 |
| Optimal Online Change Detection via Random Fourier Features |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Optimal Rates for Generalization of Gradient Descent for Deep ReLU Classification |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Optimal Rates in Continual Linear Regression via Increasing Regularization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Optimal Regret Bounds via Low-Rank Structured Variation in Non-Stationary Reinforcement Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Optimal Regret of Bandits under Differential Privacy |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Optimal Spectral Transitions in High-Dimensional Multi-Index Models |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Optimal and Provable Calibration in High-Dimensional Binary Classification: Angular Calibration and Platt Scaling |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Optimal community detection in dense bipartite graphs |
β |
β |
β |
β |
β |
β |
β |
0 |
| Optimal kernel regression bounds under energy-bounded noise |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Optimality and NP-Hardness of Transformers in Learning Markovian Dynamical Functions |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Optimism Without Regularization: Constant Regret in Zero-Sum Games |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Optimistic Online-to-Batch Conversions for Accelerated Convergence and Universality |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Optimization Inspired Few-Shot Adaptation for Large Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Optimize the Unseen - Fast NeRF Cleanup with Free Space Prior |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Optimized Minimal 3D Gaussian Splatting |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Optimizing Anytime Reasoning via Budget Relative Policy Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Optimizing Distributional Geometry Alignment with Optimal Transport for Generative Dataset Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Optimizing Retrieval for RAG via Reinforcement Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Optimizing the Unknown: Black Box Bayesian Optimization with Energy-Based Model and Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Oracle-Efficient Combinatorial Semi-Bandits |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| OrbitZoo: Real Orbital Systems Challenges for Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| OrdShap: Feature Position Importance for Sequential Black-Box Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Order-Level Attention Similarity Across Language Models: A Latent Commonality |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Orient Anything V2: Unifying Orientation and Rotation Understanding |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Orientation Matters: Making 3D Generative Models Orientation-Aligned |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Orochi: Versatile Biomedical Image Processor |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Orthogonal Contrastive Learning for Multi-Representation fMRI Analysis |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Orthogonal Survival Learners for Estimating Heterogeneous Treatment Effects from Time-to-Event Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Out-of-Distribution Detection with Relative Angles |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Out-of-Distribution Generalized Graph Anomaly Detection with Homophily-aware Environment Mixup |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Over-squashing in Spatiotemporal Graph Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Overcoming Challenges of Long-Horizon Prediction in Driving World Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Overcoming Long Context Limitations of State Space Models via Context Dependent Sparse Attention |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| P-Law: Predicting Quantitative Scaling Law with Entropy Guidance in Large Recommendation Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| PAC-Bayes Bounds for Multivariate Linear Regression and Linear Autoencoders |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PAID: Pairwise Angular-Invariant Decomposition for Continual Test-Time Adaptation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PALQO: Physics-informed model for Accelerating Large-scale Quantum Optimization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| PANGEA: Projection-Based Augmentation with Non-Relevant General Data for Enhanced Domain Adaptation in LLMs |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| PANTHER: Generative Pretraining Beyond Language for Sequential User Behavior Modeling |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| PARCO: Parallel AutoRegressive Models for Multi-Agent Combinatorial Optimization |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PARTONOMY: Large Multimodal Models with Part-Level Visual Understanding |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| PASS: Path-selective State Space Model for Event-based Recognition |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| PBR-SR: Mesh PBR Texture Super Resolution from 2D Image Priors |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PC-Net: Weakly Supervised Compositional Moment Retrieval via Proposal-Centric Network |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PDEfuncta: Spectrally-Aware Neural Representation for PDE Solution Modeling |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| PDPO: Parametric Density Path Optimization |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| PID-controlled Langevin Dynamics for Faster Sampling of Generative Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| PINN Balls: Scaling Second-Order Methods for PINNs with Domain Decomposition and Adaptive Sampling |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PINNs with Learnable Quadrature |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| PIPE: Physics-Informed Position Encoding for Alignment of Satellite Images and Time Series in Typhoon Forecasting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PIVNO: Particle Image Velocimetry Neural Operator |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-forward Planar Splatting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PLD: A Choice-Theoretic List-Wise Knowledge Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PLEIADES: Building Temporal Kernels with Orthogonal Polynomials |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PLMTrajRec: A Scalable and Generalizable Trajectory Recovery Method with Pre-trained Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PMLF: A Physics-Guided Multiscale Loss Framework for Structurally Heterogeneous Time Series |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| POCO: Scalable Neural Forecasting through Population Conditioning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PPMStereo: Pick-and-Play Memory Construction for Consistent Dynamic Stereo Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PREAMBLE: Private and Efficient Aggregation via Block Sparse Vectors |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| PRESCRIBE: Predicting Single-Cell Responses with Bayesian Estimation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| PRESTO: Preimage-Informed Instruction Optimization for Prompting Black-Box LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PRIMT: Preference-based Reinforcement Learning with Multimodal Feedback and Trajectory Synthesis from Foundation Models |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| PROFIT: A Specialized Optimizer for Deep Fine Tuning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| PRSformer: Disease Prediction from Million-Scale Individual Genotypes |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| PUATE: Efficient ATE Estimation from Treated (Positive) and Unlabeled Units |
β
|
β |
β
|
β
|
β
|
β |
β |
4 |
| PaTH Attention: Position Encoding via Accumulating Householder Transformations |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| PaZO: Preconditioned Accelerated Zeroth-Order Optimization for Fine-Tuning LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PaceLLM: Brain-Inspired Large Language Models for Long-Context Understanding |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| PairEdit: Learning Semantic Variations for Exemplar-based Image Editing |
β |
β
|
β |
β |
β
|
β
|
β
|
4 |
| Pairwise Calibrated Rewards for Pluralistic Alignment |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Pairwise Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Pancakes: Consistent Multi-Protocol Image Segmentation Across Biomedical Domains |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PandaPose: 3D Human Pose Lifting from a Single Image via Propagating 2D Pose Prior to 3D Anchor Space |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PanoWan: Lifting Diffusion Video Generation Models to 360$^\circ$ with Latitude/Longitude-aware Mechanisms |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Panoptic Captioning: An Equivalence Bridge for Image and Text |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Parallel Scaling Law for Language Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Parallelization of Non-linear State-Space Models: Scaling Up Liquid-Resistance Liquid-Capacitance Networks for Efficient Sequence Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Parallelizing MCMC Across the Sequence Length |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Parameter Dynamics of Online Machine Learning and Test-time Adaptation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Parameter Efficient Fine-tuning via Explained Variance Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Parameter-Free Hypergraph Neural Network for Few-Shot Node Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Parameter-free Algorithms for the Stochastically Extended Adversarial Model |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Pareto Optimal Risk-Agnostic Distributional Bandits with Heavy-Tail Rewards |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| ParetoQ: Improving Scaling Laws in Extremely Low-bit LLM Quantization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Parsimonious Predictions for Strategyproof Scheduling |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Partial Correlation Network Estimation by Semismooth Newton Methods |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Partial Physics Informed Diffusion Model for Ocean Chlorophyll Concentration Reconstruction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Partition to Evolve: Niching-enhanced Evolution with LLMs for Automated Algorithm Discovery |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Partition-Then-Adapt: Combating Prediction Bias for Reliable Multi-Modal Test-Time Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Partner Modelling Emerges in Recurrent Agents (But Only When It Matters) |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Pass@K Policy Optimization: Solving Harder Reinforcement Learning Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Path Gradients after Flow Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Path-Enhanced Contrastive Learning for Recommendation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Path-specific effects for pulse-oximetry guided decisions in critical care |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PathVQ: Reforming Computational Pathology Foundation Model for Whole Slide Image Analysis via Vector Quantization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Pattern-Guided Adaptive Prior for Structure Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Pause Tokens Strictly Increase the Expressivity of Constant-Depth Transformers |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Pay Attention to Small Weights |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Per-Architecture Training-Free Metric Optimization for Neural Architecture Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Perception Encoder: The best visual embeddings are not at the output of the network |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Perception-R1: Pioneering Perception Policy with Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Performative Risk Control: Calibrating Models for Reliable Deployment under Performativity |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Performative Validity of Recourse Explanations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Periodic Skill Discovery |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| PermLLM: Learnable Channel Permutation for N:M Sparse Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Permissioned LLMs: Enforcing Access Control in Large Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Permutation Equivariant Neural Controlled Differential Equations for Dynamic Graph Representation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Personalized Decision Modeling: Utility Optimization or Textualized-Symbolic Reasoning |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Personalized Federated Conformal Prediction with Localization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Personalized Subgraph Federated Learning with Differentiable Auxiliary Projections |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Personalized Visual Content Generation in Conversational Systems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Perturb a Model, Not an Image: Towards Robust Privacy Protection via Anti-Personalized Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Perturbation Bounds for Low-Rank Inverse Approximations under Noise |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Pessimistic Data Integration for Policy Evaluation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Photography Perspective Composition: Towards Aesthetic Perspective Recommendation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PhySense: Sensor Placement Optimization for Accurate Physics Sensing |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| PhySwin: An Efficient and Physically-Informed Foundation Model for Multispectral Earth Observation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PhysDiff-VTON: Cross-Domain Physics Modeling and Trajectory Optimization for Virtual Try-On |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| PhysDiff: A Physically-Guided Diffusion Model for Multivariate Time Series Anomaly Detection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| PhysX-3D: Physical-Grounded 3D Asset Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Physics-informed Neural Operator for Pansharpening |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Physics-informed Reduced Order Modeling of Time-dependent PDEs via Differentiable Solvers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Physics-informed machine learning with domain decomposition and global dynamics for three-dimensional intersecting flows |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PiKE: Adaptive Data Mixing for Large-Scale Multi-Task Learning Under Low Gradient Conflicts |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Pin the Tail on the Model: Blindfolded Repair of User-Flagged Failures in Text-to-Image Services |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Pinpointing Attention-Causal Communication in Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| PipeFusion: Patch-level Pipeline Parallelism for Diffusion Transformers Inference |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| PixPerfect: Seamless Latent Diffusion Local Editing with Discriminative Pixel-Space Refinement |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Pixel Reasoner: Incentivizing Pixel Space Reasoning via Curiosity-Driven Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Place Cells as Multi-Scale Position Embeddings: Random Walk Transition Kernels for Path Planning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| PlanU: Large Language Model Reasoning through Planning under Uncertainty |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Planning and Learning in Average Risk-aware MDPs |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Planning with Quantized Opponent Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Plasticity as the Mirror of Empowerment |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| PlayerOne: Egocentric World Simulator |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Plenodium: Underwater 3D Scene Reconstruction with Plenoptic Medium Representation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Plug-and-Play Context Feature Reuse for Efficient Masked Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Plug-and-play Feature Causality Decomposition for Multimodal Representation Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| PoE-World: Compositional World Modeling with Products of Programmatic Experts |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| PoLAR: Polar-Decomposed Low-Rank Adapter Representation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PocketSR: The Super-Resolution Expert in Your Pocket Mobiles |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Point Cloud Synthesis Using Inner Product Transforms |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Point-MaDi: Masked Autoencoding with Diffusion for Point Cloud Pre-training |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Point4Bit: Post Training 4-bit Quantization for Point Cloud 3D Detection |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| PointMAC: Meta-Learned Adaptation for Robust Test-Time Point Cloud Completion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PointMapPolicy: Structured Point Cloud Processing for Multi-Modal Imitation Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| PointTruss: K-Truss for Point Cloud Registration |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PolarQuant: Leveraging Polar Transformation for Key Cache Quantization and Decoding Acceleration |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Policy Compatible Skill Incremental Learning via Lazy Learning Interface |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Policy Gradient Methods Converge Globally in Imperfect-Information Extensive-Form Games |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Policy Optimized Text-to-Image Pipeline Design |
β |
β |
β |
β
|
β |
β |
β
|
2 |
| PolyJuice Makes It Real: Black-Box, Universal Red Teaming for Synthetic Image Detectors |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| PolyPose: Deformable 2D/3D Registration via Polyrigid Transformations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Polyline Path Masked Attention for Vision Transformer |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PoseCrafter: Extreme Pose Estimation with Hybrid Video Synthesis |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Post Hoc Regression Refinement via Pairwise Rankings |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Posterior Contraction for Sparse Neural Networks in Besov Spaces with Intrinsic Dimensionality |
β |
β |
β |
β |
β |
β |
β |
0 |
| Posterior Sampling by Combining Diffusion Models with Annealed Langevin Dynamics |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Power Lines: Scaling laws for weight decay and batch size in LLM pre-training |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Practical Bayes-Optimal Membership Inference Attacks |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Practical Kernel Selection for Kernel-based Conditional Independence Test |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Practical and Effective Code Watermarking for Large Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Practical do-Shapley Explanations with Estimand-Agnostic Causal Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Praxis-VLM: Vision-Grounded Decision Making via Text-Driven Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Pre-Trained Policy Discriminators are General Reward Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Pre-trained Large Language Models Learn to Predict Hidden Markov Models In-context |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Precise Asymptotics and Refined Regret of Variance-Aware UCB |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Precise Diffusion Inversion: Towards Novel Samples and Few-Step Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Precise Information Control in Long-Form Text Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Preconditioned Langevin Dynamics with Score-based Generative Models for Infinite-Dimensional Linear Bayesian Inverse Problems |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Predictability Enables Parallelization of Nonlinear State Space Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Predictable Scale (Part II) --- Farseer: A Refined Scaling Law in LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Predicting Empirical AI Research Outcomes with Language Models |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| Predicting Functional Brain Connectivity with Context-Aware Deep Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Predicting partially observable dynamical systems via diffusion models with a multiscale inference scheme |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Predicting the Performance of Black-box Language Models with Follow-up Queries |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Prediction with expert advice under additive noise |
β |
β |
β |
β |
β |
β |
β |
0 |
| Prediction-Powered Causal Inferences |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Prediction-Powered Semi-Supervised Learning with Online Power Tuning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Predictive Coding Enhances Meta-RL To Achieve Interpretable Bayes-Optimal Belief Representation Under Partial Observability |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Predictive Preference Learning from Human Interventions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Preference Distillation via Value based Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Preference Learning with Lie Detectors can Induce Honesty or Evasion |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Preference Learning with Response Time: Robust Losses and Guarantees |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Preference Optimization by Estimating the Ratio of the Data Distribution |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Preference Optimization on Pareto Sets: On a Theory of Multi-Objective Optimization |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Preference-Based Dynamic Ranking Structure Recognition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Preference-Guided Diffusion for Multi-Objective Offline Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Preference-driven Knowledge Distillation for Few-shot Node Classification |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Preserving LLM Capabilities through Calibration Data Curation: From Analysis to Optimization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Preserving Task-Relevant Information Under Linear Concept Removal |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Preventing Shortcuts in Adapter Training via Providing the Shortcuts |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Price of Parsimony: Complexity of Fourier Sparsity Testing |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Principled Data Augmentation for Learning to Solve Quadratic Programming Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Principled Long-Tailed Generative Modeling via Diffusion Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Principled Model Routing for Unknown Mixtures of Source Domains |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Prior Forgetting and In-Context Overfitting |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Prior-Guided Diffusion Planning for Offline Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Prior-Guided Flow Matching for Target-Aware Molecule Design with Learnable Atom Number |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Prioritizing Perception-Guided Self-Supervision: A New Paradigm for Causal Modeling in End-to-End Autonomous Driving |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| Privacy Reasoning in Ambiguous Contexts |
β |
β |
β
|
β |
β |
β
|
β
|
3 |
| Privacy amplification by random allocation |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Private Continual Counting of Unbounded Streams |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Private Evolution Converges |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Private Geometric Median in Nearly-Linear Time |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Private Hyperparameter Tuning with Ex-Post Guarantee |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Private Online Learning against an Adaptive Adversary: Realizable and Agnostic Settings |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Private Set Union with Multiple Contributions |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Private Statistical Estimation via Truncation |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Private Training Large-scale Models with Efficient DP-SGD |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Private Zeroth-Order Optimization with Public Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Pro3D-Editor: A Progressive-Views Perspective for Consistent and Precise 3D Editing |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| ProDAG: Projected Variational Inference for Directed Acyclic Graphs |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Probabilistic Reasoning with LLMs for Privacy Risk Estimation |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Probabilistic Stability Guarantees for Feature Attributions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Probabilistic Token Alignment for Large Language Model Fusion |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Probably Approximately Precision and Recall Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Probing Equivariance and Symmetry Breaking in Convolutional Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Probing Hidden Knowledge Holes in Unlearned LLMs |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Probing Neural Combinatorial Optimization Models |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Problem-Parameter-Free Decentralized Bilevel Optimization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning |
β
|
β
|
β
|
β
|
β |
β |
β |
4 |
| Procurement Auctions with Predictions: Improved Frugality for Facility Location |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Product Distribution Learning with Imperfect Advice |
β
|
β |
β |
β |
β |
β |
β |
1 |
| ProfiX: Improving Profile-Guided Optimization in Compilers with Graph Neural Networks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Program Synthesis via Test-Time Transduction |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Progress Reward Model for Reinforcement Learning via Large Language Models |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Progressive Data Dropout: An Embarrassingly Simple Approach to Train Faster |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Progressive Inference-Time Annealing of Diffusion Models for Sampling from Boltzmann Densities |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Projection-Manifold Regularized Latent Diffusion for Robust General Image Fusion |
β
|
β
|
β
|
β
|
β
|
β |
β |
5 |
| Projection-based Lyapunov method for fully heterogeneous weakly-coupled MDPs |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Projective Equivariant Networks via Second-order Fundamental Differential Invariants |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Prompt Tuning Decision Transformers with Structured and Scalable Bandits |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Prompt Tuning Transformers for Data Memorization |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Prompt-Guided Alignment with Information Bottleneck Makes Image Compression Also a Restorer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Prompt-guided Disentangled Representation for Action Recognition |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Promptable 3-D Object Localization with Latent Diffusion Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Prompted Policy Search: Reinforcement Learning through Linguistic and Numerical Reasoning in LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Proper HΓΆlder-Kullback Dirichlet Diffusion: A Framework for High Dimensional Generative Modeling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Prot2Text-V2: Protein Function Prediction with Multimodal Contrastive Alignment |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ProtInvTree: Deliberate Protein Inverse Folding with Reward-guided Tree Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Protein Design with Dynamic Protein Vocabulary |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Protein Inverse Folding From Structure Feedback |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ProtoPairNet: Interpretable Regression through Prototypical Pair Reasoning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Protocols for Verifying Smooth Strategies in Bandits and Games |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Provable Gradient Editing of Deep Neural Networks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Provable Meta-Learning with Low-Rank Adaptations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Provable Ordering and Continuity in Vision-Language Pretraining for Generalizable Embodied Agents |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Provable Sample-Efficient Transfer Learning Conditional Diffusion Models via Representation Learning |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Provable Scaling Laws for the Test-Time Compute of Large Language Models |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Provable Watermarking for Data Poisoning Attacks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Provably Efficient Multi-Task Meta Bandit Learning via Shared Representations |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Provably Efficient Online RLHF with One-Pass Reward Modeling |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ProxySPEX: Inference-Efficient Interpretability via Sparse Feature Interactions in LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Pruning Spurious Subgraphs for Graph Out-of-Distribution Generalization |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Pruning-Robust Mamba with Asymmetric Multi-Scale Scanning Paths |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PseuZO: Pseudo-Zeroth-Order Algorithm for Training Deep Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Pseudo-Riemannian Graph Transformer |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PubSub-VFL: Towards Efficient Two-Party Split Learning in Heterogeneous Environments via Publisher/Subscriber Architecture |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Puppeteer: Rig and Animate Your 3D Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Purest Quantum State Identification |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Purifying Approximate Differential Privacy with Randomized Post-processing |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Purity Law for Neural Routing Problem Solvers with Enhanced Generalizability |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| PurpCode: Reasoning for Safer Code Generation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Put CASH on Bandits: A Max K-Armed Problem for Automated Machine Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Puzzles: Unbounded Video-Depth Augmentation for Scalable End-to-End 3D Reconstruction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| PyraMotion: Attentional Pyramid-Structured Motion Integration for Co-Speech 3D Gesture Synthesis |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Q-Insight: Understanding Image Quality via Visual Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| QBasicVSR: Temporal Awareness Adaptation Quantization for Video Super-Resolution |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| QFFT, Question-Free Fine-Tuning for Adaptive Reasoning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| QSCA: Quantization with Self-Compensating Auxiliary for Monocular Depth Estimation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| QiMeng-CodeV-R1: Reasoning-Enhanced Verilog Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| QiMeng-MuPa: Mutual-Supervised Learning for Sequential-to-Parallel Code Translation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| QiMeng-SALV: Signal-Aware Learning for Verilog Code Generation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| QuARI: Query Adaptive Retrieval Improvement |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| QuadEnhancer: Leveraging Quadratic Transformations to Enhance Deep Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Quadratic Coreset Selection: Certifying and Reconciling Sequence and Token Mining for Efficient Instruction Tuning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Quality-Driven Curation of Remote Sensing Vision-Language Data via Learned Scoring Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| QuanDA: Quantile-Based Discriminant Analysis for High-Dimensional Imbalanced Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Quantifying Cross-Modality Memorization in Vision-Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Quantifying Distributional Invariance in Causal Subgraph for IRM-Free Graph Generalization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Quantifying Elicitation of Latent Capabilities in Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Quantifying Statistical Significance of Deep Nearest Neighbor Anomaly Detection via Selective Inference |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Quantifying Task-relevant Similarities in Representations Using Decision Variable Correlations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Quantifying Uncertainty in Error Consistency: Towards Reliable Behavioral Comparison of Classifiers |
β
|
β
|
β
|
β |
β |
β |
β |
3 |
| Quantifying Uncertainty in the Presence of Distribution Shifts |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Quantitative convergence of trained neural networks to Gaussian processes |
β |
β
|
β |
β |
β |
β
|
β
|
3 |
| Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Quantization-Free Autoregressive Action Transformer |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Quantum Doubly Stochastic Transformers |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Quantum Speedups for Minimax Optimization and Beyond |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Quantum Visual Fields with Neural Amplitude Encoding |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Quantum speedup of non-linear Monte Carlo problems |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Quartet: Native FP4 Training Can Be Optimal for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Quasi-Self-Concordant Optimization with $\ell_{\infty}$ Lewis Weights |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Query-Efficient Locally Private Hypothesis Selection via the Scheffe Graph |
β |
β |
β |
β |
β |
β |
β |
0 |
| R$^2$ec: Towards Large Recommender Models with Reasoning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| R-KV: Redundancy-aware KV Cache Compression for Reasoning Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| R1-ShareVL: Incentivizing Reasoning Capabilities of Multimodal Large Language Models via Share-GRPO |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| RAG4GFM: Bridging Knowledge Gaps in Graph Foundation Models through Graph Retrieval Augmented Generation |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| RAGRouter: Learning to Route Queries to Multiple Retrieval-Augmented Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RANK++LETR: Learn to Rank and Optimize Candidates for Line Segment Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RAPID Hand: Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Embodied Intelligence |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| RAPTR: Radar-based 3D Pose Estimation using Transformer |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| RAST: Reasoning Activation in LLMs via Small-model Transfer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RAT: Bridging RNN Efficiency and Attention Accuracy via Chunk-based Sequence Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RCCDA: Adaptive Model Updates in the Presence of Concept Drift under a Constrained Resource Budget |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| REArtGS: Reconstructing and Generating Articulated Objects via 3D Gaussian Splatting with Geometric and Motion Constraints |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| REDOUBT: Duo Safety Validation for Autonomous Vehicle Motion Planning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| REINFORCE Converges to Optimal Policies with Any Learning Rate |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| REMI: Reconstructing Episodic Memory During Internally Driven Path Planning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| REOrdering Patches Improves Vision Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| REP: Resource-Efficient Prompting for Rehearsal-Free Continual Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| REPA Works Until It Doesnβt: Early-Stopped, Holistic Alignment Supercharges Diffusion Training |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| RESAnything: Attribute Prompting for Arbitrary Referring Segmentation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| RETRO SYNFLOW: Discrete Flow-Matching for Accurate and Diverse Single-Step Retrosynthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| REVE: A Foundation Model for EEG - Adapting to Any Setup with Large-Scale Pretraining on 25,000 Subjects |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RF-Agent: Automated Reward Function Design via Language Agent Tree Search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| RFMPose: Generative Category-level Object Pose Estimation via Riemannian Flow Matching |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RGNMR: A Gauss-Newton method for robust matrix completion with theoretical guarantees |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RIGNO: A Graph-based Framework For Robust And Accurate Operator Learning For PDEs On Arbitrary Domains |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RLVR-World: Training World Models with Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RLZero: Direct Policy Inference from Language Without In-Domain Supervision |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| RNNs perform task computations by dynamically warping neural representations |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ROGR: Relightable 3D Objects using Generative Relighting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ROOT: Rethinking Offline Optimization as Distributional Translation via Probabilistic Bridge |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ROSE: Remove Objects with Side Effects in Videos |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| ROVER: Recursive Reasoning Over Videos with Vision-Language Models for Embodied Tasks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| RSafe: Incentivizing proactive reasoning to build robust and adaptive LLM safeguards |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RUAGO: Effective and Practical Retain-Free Unlearning via Adversarial Attack and OOD Generator |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Chest X-ray with Zero-Shot Multi-Task Capability |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Radial Attention: $\mathcal{O}(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Observation Delays |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Random Forest Autoencoders for Guided Representation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Random Search Neural Networks for Efficient and Expressive Graph Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Randomized-MLP Regularization Improves Domain Adaptation and Interpretability in DINOv2 |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RankMatch: A Novel Approach to Semi-Supervised Label Distribution Learning Leveraging Rank Correlation between Labels |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RankSEG-RMA: An Efficient Segmentation Algorithm via Reciprocal Moment Approximation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rao-Blackwell Gradient Estimators for Equivariant Denoising Diffusion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rao-Blackwellised Reparameterisation Gradients |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rare Text Semantics Were Always There in Your Diffusion Transformer |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Rationalized All-Atom Protein Design with Unified Multi-Modal Bayesian Flow |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2) |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RayFusion: Ray Fusion Enhanced Collaborative Visual Perception |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Re-coding for Uncertainties: Edge-awareness Semantic Concordance for Resilient Event-RGB Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents |
β
|
β |
β
|
β
|
β |
β
|
β
|
5 |
| ReCon-GS: Continuum-Preserved Gaussian Streaming for Fast and Compact Reconstruction of Dynamic Scenes |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| ReDi: Rectified Discrete Flow |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ReDit: Reward Dithering for Improved LLM Policy Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ReMindRAG: Low-Cost LLM-Guided Knowledge Graph Traversal for Efficient RAG |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| RePO: Understanding Preference Learning Through ReLU-Based Optimization |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ReSim: Reliable World Simulation for Autonomous Driving |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reaction Prediction via Interaction Modeling of Symmetric Difference Shingle Sets |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reading Recognition in the Wild |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Real-DRL: Teach and Learn at Runtime |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Real-Time Execution of Action Chunking Flow Policies |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Real-Time Scene-Adaptive Tone Mapping for High-Dynamic Range Object Detection |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Real-World Reinforcement Learning of Active Perception Behaviors |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning of Vision Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Reasoning Beyond Points: A Visual Introspective Approach for Few-Shot 3D Segmentation |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Reasoning Is Not a Race: When Stopping Early Beats Going Deeper |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Reasoning Models Better Express Their Confidence |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reasoning Models Sometimes Output Illegible Chains of Thought |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Reasoning Planning for Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reasoning as an Adaptive Defense for Safety |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reasoning is Periodicity? Improving Large Language Models Through Effective Periodicity Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rebalancing Contrastive Alignment with Bottlenecked Semantic Increments in Text-Video Retrieval |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rebalancing Return Coverage for Conditional Sequence Modeling in Offline Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Recognition through Reasoning: Reinforcing Image Geo-localization with Large Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reconciling Geospatial Prediction and Retrieval via Sparse Representations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reconstructing Heterogeneous Biomolecules via Hierarchical Gaussian Mixtures and Part Discovery |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Reconstruction and Secrecy under Approximate Distance Queries |
β |
β |
β |
β |
β |
β |
β |
0 |
| Rectified CFG++ for Flow Based Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Rectified Point Flow: Generic Point Cloud Pose Estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rectifying Shortcut Behaviors in Preference-based Reward Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Rectifying Soft-Label Entangled Bias in Long-Tailed Dataset Distillation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Recurrent Attention-based Token Selection for Efficient Streaming Video-LLMs |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Recurrent Memory for Online Interdomain Gaussian Processes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Recursive Transformer: Boosting Reasoning Ability with State Stack |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Red-Teaming Text-to-Image Systems by Rule-based Preference Modeling |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Redefining Experts: Interpretable Decomposition of Language Models for Toxicity Mitigation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Reducing the Probability of Undesirable Outputs in Language Models Using Probabilistic Inference |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reduction-based Pseudo-label Generation for Instance-dependent Partial Label Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Redundancy-Aware Test-Time Graph Out-of-Distribution Detection |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Refinement Methods for Distributed Distribution Estimation under $\ell^p$-Losses |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Refining Norms: A Post-hoc Framework for OOD Detection in Graph Neural Networks |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Reframing Gaussian Splatting Densification with Complexity-Density Consistency of Primitives |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Refusal Direction is Universal Across Safety-Aligned Languages |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Regional Explanations: Bridging Local and Global Variable Importance |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Register and [CLS] tokens induce a decoupling of local and global features in large ViTs |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Regression Trees Know Calculus |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Regression-adjusted Monte Carlo Estimators for Shapley Values and Probabilistic Values |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Regret Lower Bounds for Decentralized Multi-Agent Stochastic Shortest Path Problems |
β |
β |
β |
β |
β |
β |
β |
0 |
| Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Regularized least squares learning with heavy-tailed noise is minimax optimal |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Reinforced Active Learning for Large-Scale Virtual Screening with Learnable Policy Model |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reinforced Context Order Recovery for Adaptive Reasoning and Planning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Reinforcement Learning Finetunes Small Subnetworks in Large Language Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reinforcement Learning Teachers of Test Time Scaling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reinforcement Learning for Out-of-Distribution Reasoning in LLMs: An Empirical Study on Diagnosis-Related Group Coding |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Reinforcement Learning for Reasoning in Large Language Models with One Training Example |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reinforcement Learning with Action Chunking |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reinforcement Learning with Backtracking Feedback |
β |
β |
β
|
β |
β |
β |
β |
1 |
| Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Reinforcement learning for one-shot DAG scheduling with comparability identification and dense reward |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Relaxing partition admissibility in Cluster-DAGs: a causal calculus with arbitrary variable clustering |
β |
β |
β |
β |
β |
β |
β |
0 |
| ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Reliable DecisionβMaking via CalibrationβOriented RetrievalβAugmented Generation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Reliable Lifelong Multimodal Editing: Conflict-Aware Retrieval Meets Multi-Level Guidance |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Reliably detecting model failures in deployment without labels |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Relieving the Over-Aggregating Effect in Graph Transformers |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Remarkable Robustness of LLMs: Stages of Inference? |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Remasking Discrete Diffusion Models with Inference-Time Scaling |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Removing Concepts from Text-to-Image Models with Only Negative Samples |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Rendering-Aware Reinforcement Learning for Vector Graphics Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RepGuard: Adaptive Feature Decoupling for Robust Backdoor Defense in Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Efficiency, High-Resolution Image Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reparameterized LLM Training via Orthogonal Equivalence Transformation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| ReplaceMe: Network Simplification via Depth Pruning and Transformer Block Linearization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Replicable Distribution Testing |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Replicable Online Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Replicable Online pricing |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Repo2Run: Automated Building Executable Environment for Code Repository at Scale |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Representation Consistency for Accurate and Coherent LLM Answer Aggregation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Representation-Level Counterfactual Calibration for Debiased Zero-Shot Recognition |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Representational Difference Explanations |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Reproducing Kernel Banach Space Models for Neural Networks with Application to Rademacher Complexity Analysis |
β |
β |
β |
β |
β |
β |
β |
0 |
| Repurposing AlphaFold3-like Protein Folding Models for Antibody Sequence and Structure Co-design |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rescaled Influence Functions: Accurate Data Attribution in High Dimension |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Residual Stream Analysis of Overfitting And Structural Disruptions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Resolution of Simpson's paradox via the common cause principle |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Resounding Acoustic Fields with Reciprocity |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Resource-Constrained Federated Continual Learning: What Does Matter? |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RespoDiff: Dual-Module Bottleneck Transformation for Responsible & Faithful T2I Generation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Restoring Pruned Large Language Models via Lost Component Compensation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Restricted Global-Aware Graph Filters Bridging GNNs and Transformer for Node Classification |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Restricted Spectral Gap Decomposition for Simulated Tempering Targeting Mixture Distributions |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Rethinking Approximate Gaussian Inference in Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rethinking Circuit Completeness in Language Models: AND, OR, and ADDER Gates |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Rethinking Entropy in Test-Time Adaptation: The Missing Piece from Energy Duality |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Rethinking Fair Federated Learning from Parameter and Client View |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Rethinking Gradient Step Denoiser: Towards Truly Pseudo-Contractive Operator |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Rethinking Hebbian Principle: Low-Dimensional Structural Projection for Unsupervised Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rethinking Joint Maximum Mean Discrepancy for Visual Domain Adaptation |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Rethinking Losses for Diffusion Bridge Samplers |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rethinking Neural Combinatorial Optimization for Vehicle Routing Problems with Different Constraint Tightness Degrees |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rethinking Nighttime Image Deraining via Learnable Color Space Transformation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rethinking Optimal Verification Granularity for Compute-Efficient Test-Time Scaling |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Rethinking Out-of-Distribution Detection and Generalization with Collective Behavior Dynamics |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rethinking PCA Through Duality |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Rethinking Residual Distribution in Locate-then-Edit Model Editing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rethinking Scale-Aware Temporal Encoding for Event-based Object Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rethinking Tokenized Graph Transformers for Node Classification |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Rethinking Verification for LLM Code Generation: From Generation to Testing |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Rethinking the Role of Verbatim Memorization in LLM Privacy |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Retrieval is Not Enough: Enhancing RAG through Test-Time Critique and Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Retro-R1: LLM-based Agentic Retrosynthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Retrosynthesis Planning via Worst-path Policy Optimisation in Tree-structured MDPs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Retrv-R1: A Reasoning-Driven MLLM Framework for Universal and Efficient Multimodal Retrieval |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Return of ChebNet: Understanding and Improving an Overlooked GNN on Long Range Tasks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Revealing Multimodal Causality with Large Language Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Reverse Diffusion Sequential Monte Carlo Samplers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reverse Engineering Human Preferences with Reinforcement Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Reverse-Annealed Sequential Monte Carlo for Efficient Bayesian Optimal Experiment Design |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Revising and Falsifying Sparse Autoencoder Feature Explanations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Revisiting 1-peer exponential graph for enhancing decentralized learning efficiency |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Revisiting Agnostic Boosting |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Revisiting Bi-Linear State Transitions in Recurrent Neural Networks |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| Revisiting Consensus Error: A Fine-grained Analysis of Local SGD under Second-order Data Heterogeneity |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems |
β |
β
|
β |
β |
β |
β |
β |
1 |
| Revisiting Frank-Wolfe for Structured Nonconvex Optimization |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Revisiting Glorot Initialization for Long-Range Linear Recurrences |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Revisiting Logit Distributions for Reliable Out-of-Distribution Detection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Revisiting Orbital Minimization Method for Neural Operator Decomposition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Revisiting Semi-Supervised Learning in the Era of Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Revitalizing SVD for Global Covariance Pooling: Halleyβs Method to Overcome Over-Flattening |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Revolutionizing Graph Aggregation: From Suppression to Amplification via BoostGCN |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Revolutionizing Training-Free NAS: Towards Efficient Automatic Proxy Discovery via Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Reward Reasoning Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Reward-Aware Proto-Representations in Reinforcement Learning |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Reward-oriented Causal Representation Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Rewind-to-Delete: Certified Machine Unlearning for Nonconvex Functions |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents |
β |
β |
β |
β |
β |
β |
β
|
1 |
| RiboFlow: Conditional De Novo RNA Co-Design via Synergistic Flow Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Ridge Boosting is Both Robust and Efficient |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RidgeLoRA: Matrix Ridge Enhanced Low-Rank Adaptation of Large Language Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Riemannian Consistency Model |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Riemannian Proximal Sampler for High-accuracy Sampling on Manifolds |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Rig3R: Rig-Aware Conditioning and Discovery for 3D Reconstruction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RigAnyFace: Scaling Neural Facial Mesh Auto-Rigging with Unlabeled Data |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Right for the Right Reasons: Avoiding Reasoning Shortcuts via Prototypical Neurosymbolic AI |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Rising from Ashes: Generalized Federated Learning via Dynamic Parameter Reset |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Risk Bounds For Distributional Regression |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Risk-Averse Constrained Reinforcement Learning with Optimized Certainty Equivalents |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Risk-Averse Total-Reward Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| Risk-aware Direct Preference Optimization under Nested Risk Measure |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| RiverMamba: A State Space Model for Global River Discharge and Flood Forecasting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| RoME: Domain-Robust Mixture-of-Experts for MILP Solution Prediction across Domains |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| RoMa: A Robust Model Watermarking Scheme for Protecting IP in Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| RoboScape: Physics-informed Embodied World Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Robust Contextual Pricing |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Robust Cross-modal Alignment Learning for Cross-Scene Spatial Reasoning and Grounding |
β |
β
|
β
|
β |
β |
β |
β |
2 |
| Robust Distortion-Free Watermark for Autoregressive Audio Generation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robust Distributed Estimation: Extending Gossip Algorithms to Ranking and Trimmed Means |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Robust Ego-Exo Correspondence with Long-Term Memory |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Robust Equilibria in Continuous Games: From Strategic to Dynamic Robustness |
β |
β |
β |
β |
β |
β |
β |
0 |
| Robust Estimation Under Heterogeneous Corruption Rates |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Robust Explanations of Graph Neural Networks via Graph Curvatures |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robust Graph Condensation via Classification Complexity Mitigation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robust Hallucination Detection in LLMs via Adaptive Token Selection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Robust Hyperbolic Learning with Curvature-Aware Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robust Integrated Learning and Pauli Noise Mitigation for Parametrized Quantum Circuits |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Robust LLM Alignment via Distributionally Robust Direct Preference Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robust Label Proportions Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Robust Minimax Boosting with Performance Guarantees |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robust Regression of General ReLUs with Queries |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Robust Reinforcement Learning in Finance: Modeling Market Impact with Elliptic Uncertainty Sets |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Robust Sampling for Active Statistical Inference |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Robust Satisficing Gaussian Process Bandits Under Adversarial Attacks |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Robust SuperAlignment: Weak-to-Strong Robustness Generalization for Vision-Language Models |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Robust and Computation-Aware Gaussian Processes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robust and Diverse Multi-Agent Learning via Rational Policy Gradient |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Robust and Scalable Autonomous Reinforcement Learning in Irreversible Environments |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Robust learning of halfspaces under log-concave marginals |
β
|
β |
β |
β |
β |
β |
β |
1 |
| RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Robustifying Learning-Augmented Caching Efficiently without Compromising 1-Consistency |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Robustly Learning Monotone Single-Index Models |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Robustness in Both Domains: CLIP Needs a Robust Text Encoder |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Role Bias in Diffusion Models: Diagnosing and Mitigating through Intermediate Decomposition |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Role-aware Multi-agent Reinforcement Learning for Coordinated Emergency Traffic Control |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Rollout Roulette: A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| RoomEditor: High-Fidelity Furniture Synthesis with Parameter-Sharing U-Net |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Root Cause Analysis of Outliers with Missing Structural Knowledge |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Rope to Nope and Back Again: A New Hybrid Attention Strategy |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Rotary Masked Autoencoders are Versatile Learners |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Routing Mamba: Scaling State Space Models with Mixture-of-Experts Projection |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| RrED: Black-box Unsupervised Domain Adaptation via Rectifying-reasoning Errors of Diffusion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| RvLLM: LLM Runtime Verification with Domain Knowledge |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| S$^2$M-Former: Spiking Symmetric Mixing Branchformer for Brain Auditory Attention Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| S$^2$NN: Sub-bit Spiking Neural Networks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| S'MoRE: Structural Mixture of Residual Experts for Parameter-Efficient LLM Fine-tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation |
β
|
β
|
β |
β
|
β
|
β
|
β
|
6 |
| S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| SAFE: Multitask Failure Detection for Vision-Language-Action Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAGE: A Unified Framework for Generalizable Object State Recognition with State-Action Graph Embedding |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| SAINT: Sequence-Aware Integration for Spatial Transcriptomics Multi-View Clustering |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| SALMONN-omni: A Standalone Speech LLM without Codec Injection for Full-duplex Conversation |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| SALS: Sparse Attention in Latent Space for KV Cache Compression |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SALoM: Structure Aware Temporal Graph Networks with Long-Short Memory Updater |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| SAM2Flow: Interactive Optical Flow Estimation with Dual Memory for in vivo Microcirculation Analysis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAMPO: Scale-wise Autoregression with Motion Prompt for Generative World Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAO-Instruct: Free-form Audio Editing using Natural Language Instructions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SAP: Exact Sorting in Splatting via Screen-Aligned Primitives |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| SAS: Simulated Attention Score |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| SATURN: SAT-based Reinforcement Learning to Unleash LLMs Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| SCOPE: Saliency-Coverage Oriented Token Pruning for Efficient Multimodel LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| SCoT: Unifying Consistency Models and Rectified Flows via Straight-Consistent Trajectories |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SD-KDE: Score-Debiased Kernel Density Estimation |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| SD-VLM: Spatial Measuring and Understanding with Depth-Encoded Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SDPGO: Efficient Self-Distillation Training Meets Proximal Gradient Optimization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SDTagNet: Leveraging Text-Annotated Navigation Maps for Online HD Map Construction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SE-GUI: Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SEAL: Semantic-Aware Hierarchical Learning for Generalized Category Discovery |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SEGA: Shaping Semantic Geometry for Robust Hashing under Noisy Supervision |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SEMPO: Lightweight Foundation Models for Time Series Forecasting |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| SGAR: Structural Generative Augmentation for 3D Human Motion Retrieval |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| SGCD: Stain-Guided CycleDiffusion for Unsupervised Domain Adaptation of Histopathology Image Classification |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| SGN: Shifted Window-Based Hierarchical Variable Grouping for Multivariate Time Series Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SHAP Meets Tensor Networks: Provably Tractable Explanations with Parallelism |
β
|
β |
β |
β |
β |
β |
β |
1 |
| SHAP values via sparse Fourier representation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SHAP zero Explains Biological Sequence Models with Near-zero Marginal Cost for Future Queries |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SHF: Symmetrical Hierarchical Forest with Pretrained Vision Transformer Encoder for High-Resolution Medical Segmentation |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| SHGR: A Generalized Maximal Correlation Coefficient |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SIFusion: A Unified Fusion Framework for Multi-granularity Arctic Sea Ice Forecasting |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SING: SDE Inference via Natural Gradients |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SMARTraj$^2$: A Stable Multi-City Adaptive Method for Multi-View Spatio-Temporal Trajectory Representation Learning |
β |
β
|
β |
β
|
β |
β |
β
|
3 |
| SNAP: Low-Latency Test-Time Adaptation with Sparse Updates |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SNEAKDOOR: Stealthy Backdoor Attacks against Distribution Matching-based Dataset Condensation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SOMBRL: Scalable and Optimistic Model-Based RL |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| SONAR: Long-Range Graph Propagation Through Information Waves |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SORTeD Rashomon Sets of Sparse Decision Trees: Anytime Enumeration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SPACE: Noise Contrastive Estimation Stabilizes Self-Play Fine-Tuning for Large Language Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| SPACE: SPike-Aware Consistency Enhancement for Test-Time Adaptation in Spiking Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SPARKE: Scalable Prompt-Aware Diversity and Novelty Guidance in Diffusion Models via RKE Score |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SPARTAN: A Sparse Transformer World Model Attending to What Matters |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| SPFL: Sequential updates with Parallel aggregation for Enhanced Federated Learning under Category and Domain Shifts |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SPICED: A Synaptic Homeostasis-Inspired Framework for Unsupervised Continual EEG Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SPINT: Spatial Permutation-Invariant Neural Transformer for Consistent Intracortical Motor Decoding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SPMDM: Enhancing Masked Diffusion Models through Simplifying Sampling Path |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SPOT-Trip: Dual-Preference Driven Out-of-Town Trip Recommendation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SPOT: Scalable Policy Optimization with Trees for Markov Decision Processes |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| SPRINT: Enabling Interleaved Planning and Parallelized Execution in Reasoning Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SPRO: Improving Image Generation via Self-Play |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL |
β
|
β |
β
|
β
|
β |
β
|
β
|
5 |
| SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| SRHand: Super-Resolving Hand Images and 3D Shapes via View/Pose-aware Neural Image Representations and Explicit Meshes |
β
|
β
|
β
|
β
|
β |
β |
β |
4 |
| SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| SRSR: Enhancing Semantic Accuracy in Real-World Image Super-Resolution with Spatially Re-Focused Text-Conditioning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SSIMBaD: Sigma Scaling with SSIM-Guided Balanced Diffusion for AnimeFace Colorization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| SSTAG: Structure-Aware Self-Supervised Learning Method for Text-Attributed Graphs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ST$^2$360D: Spatial-to-Temporal Consistency for Training-free 360 Monocular Depth Estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| STACI: Spatio-Temporal Aleatoric Conformal Inference |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| STAIR: Addressing Stage Misalignment through Temporal-Aligned Preference Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| STAR-Bets: Sequential TArget-Recalculating Bets for Tighter Confidence Intervals |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| STAR: Efficient Preference-based Reinforcement Learning via Dual Regularization |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| STAR: Spatial-Temporal Tracklet Matching for Multi-Object Tracking |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| STEAD: Robust Provably Secure Linguistic Steganography with Diffusion Language Model |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| STNet: Spectral Transformation Network for Solving Operator Eigenvalue Problem |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| STRAP: Spatio-Temporal Pattern Retrieval for Out-of-Distribution Generalization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| STRATUS: A Multi-agent System for Autonomous Reliability Engineering of Modern Clouds |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| STRIDER: Navigation via Instruction-Aligned Structural Decision Space Optimization |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| STaRFormer: Semi-Supervised Task-Informed Representation Learning via Dynamic Attention-Based Regional Masking for Sequential Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| STree: Speculative Tree Decoding for Hybrid State Space Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models |
β
|
β
|
β
|
β |
β |
β |
β |
3 |
| Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Safe and Stable Control via Lyapunov-Guided Diffusion Models |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Safely Learning Controlled Stochastic Dynamics |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Safety Depth in Large Language Models: A Markov Chain Perspective |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Safety Pretraining: Toward the Next Generation of Safe AI |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Salient Concept-Aware Generative Data Augmentation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Sample Complexity of Distributionally Robust Average-Reward Reinforcement Learning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Sample complexity of data-driven tuning of model hyperparameters in neural networks with structured parameter-dependent dual function |
β |
β |
β |
β |
β |
β |
β |
0 |
| Sample-Adaptivity Tradeoff in On-Demand Sampling |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Sample-Conditional Coverage in Split-Conformal Prediction |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Sample-Efficient Multi-Round Generative Data Augmentation for Long-Tail Instance Segmentation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Sample-Efficient Tabular Self-Play for Offline Robust Reinforcement Learning |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Sample-efficient Learning of Concepts with Theoretical Guarantees: from Data to Concepts without Interventions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Sampled Estimators For Softmax Must Be Biased |
β |
β |
β |
β |
β |
β |
β |
0 |
| Sampling 3D Molecular Conformers with Diffusion Transformers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Sampling by averaging: A multiscale approach to score estimation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Sampling from multi-modal distributions with polynomial query complexity in fixed dimension via reverse diffusion |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scaffolding Dexterous Manipulation with Vision-Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Scalable Best-of-N Selection for Large Language Models via Self-Certainty |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scalable Cross-View Sample Alignment for Multi-View Clustering with View Structure Similarity |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Scalable Evaluation and Neural Models for Compositional Generalization |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Scalable Exploration via Ensemble++ |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Scalable Fingerprinting of Large Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Scalable In-context Ranking with Generative Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scalable Neural Incentive Design with Parameterized Mean-Field Approximation |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Scalable Neural Network Geometric Robustness Validation via HΓΆlder Optimisation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Scalable Policy-Based RL Algorithms for POMDPs |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Scalable Signature Kernel Computations via Local Neumann Series Expansions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Scalable Valuation of Human Feedback through Provably Robust Model Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Scalable and Cost-Efficient de Novo Template-Based Molecular Generation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Scalable and adaptive prediction bands with kernel sum-of-squares |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Scalable inference of functional neural connectivity at submillisecond timescales |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Scale-invariant attention |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| ScaleDiff: Higher-Resolution Image Synthesis via Efficient and Model-Agnostic Diffusion |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scaling Data-Driven Probabilistic Robustness Analysis for Semantic Segmentation Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Scaling Diffusion Transformers Efficiently via $\mu$P |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Scaling Embedding Layers in Language Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Scaling Epidemic Inference on Contact Networks: Theory and Algorithms |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Scaling Image Geo-Localization to Continent Level |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Scaling Language-centric Omnimodal Representation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scaling Law with Learning Rate Annealing |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Scaling Laws For Scalable Oversight |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Scaling Laws for Gradient Descent and Sign Descent for Linear Bigram Models under Zipfβs Law |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Scaling Laws for Optimal Data Mixtures |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Scaling Offline RL via Efficient and Expressive Shortcut Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Scaling RL to Long Videos |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scaling Speculative Decoding with Lookahead Reasoning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Scaling Unlocks Broader Generation and Deeper Functional Understanding of Proteins |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Scaling Up Active Testing to Large Language Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Scaling Up Parameter Generation: A Recurrent Diffusion Approach |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Scaling and context steer LLMs along the same computational path as the human brain |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Scaling can lead to compositional generalization |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| ScatterAD: Temporal-Topological Scattering Mechanism for Time Series Anomaly Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SceneForge: Enhancing 3D-text alignment with Structured Scene Compositions |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| SchrΓΆdinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Score-Based Diffusion Modeling for Nonparametric Empirical Bayes in Heteroscedastic Gaussian Mixtures |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Sculpting Features from Noise: Reward-Guided Hierarchical Diffusion for Task-Optimal Feature Transformation |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| SeRL: Self-play Reinforcement Learning for Large Language Models with Limited Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Searching Efficient Semantic Segmentation Architectures via Dynamic Path Selection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Searching Latent Program Spaces |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Second-Order Convergence in Private Stochastic Non-Convex Optimization |
β
|
β |
β
|
β |
β |
β |
β |
2 |
| Second-order Optimization under Heavy-Tailed Noise: Hessian Clipping and Sample Complexity Limits |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Secure and Confidential Certificates of Online Fairness |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Securing the Language of Life: Inheritable Watermarks from DNA Language Models to Proteins |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Seeds of Structure: Patch PCA Reveals Universal Compositional Cues in Diffusion Models |
β
|
β |
β
|
β
|
β |
β
|
β
|
5 |
| Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Seeing the Arrow of Time in Large Multimodal Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Seeing the Wind from a Falling Leaf |
β |
β |
β
|
β |
β |
β |
β |
1 |
| Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Seemingly Redundant Modules Enhance Robust Odor Learning in Fruit Flies |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SeerAttention: Self-distilled Attention Gating for Efficient Long-context Prefilling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Seg-VAR:Image Segmentation with Visual Autoregressive Modeling |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Seg4Diff: Unveiling Open-Vocabulary Semantic Segmentation in Text-to-Image Diffusion Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SegGraph: Leveraging Graphs of SAM Segments for Few-Shot 3D Part Segmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SegMASt3R: Geometry Grounded Segment Matching |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Segment Anything Model Meets Semi-supervised Medical Image Segmentation: A Novel Perspective |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Segment then Splat: Unified 3D Open-Vocabulary Segmentation via Gaussian Splatting |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Selective Learning for Deep Time Series Forecasting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Selective Omniprediction and Fair Abstention |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Self Iterative Label Refinement via Robust Unlabeled Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Self supervised learning for in vivo localization of microelectrode arrays using raw local field potential |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Self-Adapting Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Self-Assembling Graph Perceptrons |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Self-Boost via Optimal Retraining: An Analysis via Approximate Message Passing |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Self-Calibrating BCIs: Ranking and Recovery of Mental Targets Without Labels |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Self-Challenging Language Model Agents |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Self-Evolving Pseudo-Rehearsal for Catastrophic Forgetting with Task Similarity in LLMs |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Self-Guided Hierarchical Exploration for Generalist Foundation Model Web Agents |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Self-Improving Embodied Foundation Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Self-Perturbed Anomaly-Aware Graph Dynamics for Multivariate Time-Series Anomaly Detection |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Self-Refining Language Model Anonymizers via Adversarial Distillation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Self-Supervised Contrastive Learning is Approximately Supervised Contrastive Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Self-Supervised Direct Preference Optimization for Text-to-Image Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Self-Supervised Discovery of Neural Circuits in Spatially Patterned Neural Responses with Graph Neural Networks |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Self-Supervised Learning of Graph Representations for Network Intrusion Detection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Self-Supervised Selective-Guided Diffusion Model for Old-Photo Face Restoration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Self-Training with Dynamic Weighting for Robust Gradual Domain Adaptation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Self-Verification Provably Prevents Model Collapse in Recursive Synthetic Training |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Self-Verifying Reflection Helps Transformers with CoT Reasoning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Self-diffusion for Solving Inverse Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Self-supervised Blending Structural Context of Visual Molecules for Robust Drug Interaction Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Self-supervised Learning of Echocardiographic Video Representations via Online Cluster Distillation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Selftok-Zero: Reinforcement Learning for Visual Generation via Discrete and Autoregressive Visual Tokens |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Semantic Representation Attack against Aligned Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Semantic Retrieval Augmented Contrastive Learning for Sequential Recommendation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Semantic-guided Diverse Decoding for Large Language Model |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Semi-Supervised Regression with Heteroscedastic Pseudo-Labels |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Semi-infinite Nonconvex Constrained Min-Max Optimization |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Semi-supervised Graph Anomaly Detection via Robust Homophily Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Semi-supervised Vertex Hunting, with Applications in Network and Text Analysis |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| SensorLM: Learning the Language of Wearable Sensors |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Separating the 'what' and 'how' of compositional computation to enable reuse and continual learning |
β
|
β
|
β |
β
|
β |
β |
β
|
4 |
| Sequence Modeling with Spectral Mean Flows |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Sequential Attention-based Sampling for Histopathological Analysis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Sequential Monte Carlo for Policy Optimization in Continuous POMDPs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Sequential Multi-Agent Dynamic Algorithm Configuration |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Sequentially Auditing Differential Privacy |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Set Smoothness Unlocks Clarke Hyper-stationarity in Bilevel Optimization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Set-LLM: A Permutation-Invariant LLM |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Shallow Diffuse: Robust and Invisible Watermarking through Low-Dim Subspaces in Diffusion Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Shape it Up! Restoring LLM Safety during Finetuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Shape-Informed Clustering of Multi-Dimensional Functional Data via Deep Functional Autoencoders |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling |
β
|
β |
β
|
β
|
β |
β
|
β
|
5 |
| ShapeEmbed: a self-supervised learning framework for 2D contour quantification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ShapeX: Shapelet-Driven Post Hoc Explanations for Time Series Classification Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Shaping Sequence Attractor Schema in Recurrent Neural Networks |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Shapley-Based Data Valuation for Weighted $k$-Nearest Neighbors |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Shapley-Coop: Credit Assignment for Emergent Cooperation in Self-Interested LLM Agents |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Sharp Analysis for KL-Regularized Contextual Bandits and RLHF |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Sharp Gap-Dependent Variance-Aware Regret Bounds for Tabular MDPs |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Sharp Gaussian approximations for Decentralized Federated Learning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Sharp Matrix Empirical Bernstein Inequalities |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only Passes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Sharper Convergence Rates for Nonconvex Optimisation via Reduction Mappings |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Sherlock: Self-Correcting Reasoning in Vision-Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ShiQ: Bringing back Bellman to LLMs |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| ShoeFit: A New Dataset and Dual-image-stream DiT Framework for Virtual Footwear Try-On |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| ShortListing Model: A Streamlined Simplex Diffusion for Discrete Variable Generation |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Shortcut Features as Top Eigenfunctions of NTK: A Linear Neural Network Case and More |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Shortcuts and Identifiability in Concept-based Models from a Neuro-Symbolic Lens |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Shortcutting Pre-trained Flow Matching Diffusion Models is Almost Free Lunch |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Show-o2: Improved Native Unified Multimodal Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Siegel Neural Networks |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Sign-In to the Lottery: Reparameterizing Sparse Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SignFlow Bipartite Subgraph Network For Large-Scale Graph Link Sign Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| SilentStriker: Toward Stealthy Bit-Flip Attacks on Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Sim-LLM: Optimizing LLM Inference at the Edge through Inter-Task KV Reuse |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SimWorld-Robotics: Synthesizing Photorealistic and Dynamic Urban Environments for Multimodal Robot Navigation and Collaboration |
β
|
β |
β |
β
|
β
|
β
|
β
|
5 |
| SimWorld: An Open-ended Simulator for Agents in Physical and Social Worlds |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Simple Distillation for One-Step Diffusion Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Simple and Effective Specialized Representations for Fair Classifiers |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Simple and Efficient Heterogeneous Temporal Graph Neural Network |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Simple and Optimal Sublinear Algorithms for Mean Estimation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| SimpleStrat: Diversifying Language Model Generation with Stratification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Simulation-Based Inference for Adaptive Experiments |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Simultaneous Statistical Inference for Off-Policy Evaluation in Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Simultaneous Swap Regret Minimization via KL-Calibration |
β
|
β |
β |
β |
β |
β |
β |
1 |
| SingRef6D: Monocular Novel Object Pose Estimation with a Single RGB Reference |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Single-Step Operator Learning for Conditioned Time-Series Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Single-Teacher View Augmentation: Boosting Knowledge Distillation via Angular Diversity |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Single-pass Adaptive Image Tokenization for Minimum Program Search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Sinusoidal Initialization, Time for a New Start |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Size-adaptive Hypothesis Testing for Fairness |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Sketch-Augmented Features Improve Learning Long-Range Dependencies in Graph Neural Networks |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| SketchMind: A Multi-Agent Cognitive Framework for Assessing Student-Drawn Scientific Sketches |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Sketched Adaptive Distributed Deep Learning: A Sharp Convergence Analysis |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Sketched Gaussian Mechanism for Private Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Skill-Driven Neurosymbolic State Abstractions |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Skrull: Towards Efficient Long Context Fine-tuning through Dynamic Data Scheduling |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| SkyLadder: Better and Faster Pretraining via Context Window Scheduling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks |
β
|
β
|
β |
β |
β |
β
|
β
|
4 |
| Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation is Wasteful |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Small Resamples, Sharp Guarantees: Convergence Rates for Resampled Studentized Quantile Estimators |
β |
β |
β |
β |
β |
β |
β |
0 |
| Small Singular Values Matter: A Random Matrix Analysis of Transformer Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| SmallKV: Small Model Assisted Compensation of KV Cache Compression for Efficient LLM Inference |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Smart Surrogate Losses for Contextual Stochastic Linear Optimization with Robust Constraints |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| SmartCache: Context-aware Semantic Cache for Efficient Multi-turn LLM Inference |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Smooth Quadratic Prediction Markets |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Smooth Regularization for Efficient Video Recognition |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Smooth Sailing: Lipschitz-Driven Uncertainty Quantification for Spatial Associations |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Smooth and Flexible Camera Movement Synthesis via Temporal Masked Generative Modeling |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Smoothed Agnostic Learning of Halfspaces over the Hypercube |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Smoothed Differentiation Efficiently Mitigates Shattered Gradients in Explanations |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| SnapMoGen: Human Motion Generation from Expressive Texts |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Social World Model-Augmented Mechanism Design Policy Learning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Soft Task-Aware Routing of Experts for Equivariant Representation Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Soft-consensual Federated Learning for Data Heterogeneity via Multiple Paths |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Solver-Free Decision-Focused Learning for Linear Optimization Problems |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Solver-Informed RL: Grounding Large Language Models for Authentic Optimization Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Solving Continuous Mean Field Games: Deep Reinforcement Learning for Non-Stationary Dynamics |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Solving Discrete (Semi) Unbalanced Optimal Transport with Equivalent Transformation Mechanism and KKT-Multiplier Regularization |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Solving Inverse Problems with FLAIR |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics |
β |
β |
β |
β |
β |
β |
β |
0 |
| Solving Partial Differential Equations via Radon Neural Operator |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Solving and Learning Partial Differential Equations with Variational Q-Exponential Processes |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Solving the Asymmetric Traveling Salesman Problem via Trace-Guided Cost Augmentation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Some Optimizers are More Equal: Understanding the Role of Optimizers in Group Fairness |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Sound Logical Explanations for Mean Aggregation Graph Neural Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SpEx: A Spectral Approach to Explainable Clustering |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Space Group Equivariant Crystal Diffusion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SpaceServe: Spatial Multiplexing of Complementary Encoders and Decoders for Multimodal LLMs |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Spark Transformer: Reactivating Sparsity in Transformer FFN and Attention |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Sparse Diffusion Autoencoder for Test-time Adapting Prediction of Complex Systems |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Sparse Gaussian Processes: Structured Approximations and Power-EP Revisited |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Sparse Image Synthesis via Joint Latent and RoI Flow |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Sparse Optimistic Information Directed Sampling |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Sparse Polyak: an adaptive step size rule for high-dimensional M-estimation |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| SparseDiT: Token Sparsification for Efficient Diffusion Transformer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SparseMVC: Probing Cross-view Sparsity Variations for Multi-view Clustering |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Sparta Alignment: Collectively Aligning Multiple Language Models through Combat |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Spatial Understanding from Videos: Structured Prompts Meet Simulation Data |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Spatial-Aware Decision-Making with Ring Attractors in Reinforcement Learning Systems |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SpatialLM: Training Large Language Models for Structured Indoor Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Spatially-aware Weights Tokenization for NeRF-Language Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Spatiotemporal Consensus with Scene Prior for Unsupervised Domain Adaptive Person Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SpecEM: Training-Free LLM Ensembling via Iterative Drafting, Verification, and Online Feedback |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SpecEdge: Scalable Edge-Assisted Serving Framework for Interactive LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| SpecMAS: A Multi-Agent System for Self-Verifying System Generation via Formal Model Checking |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| SpecMER: Fast Protein Generation with K-mer Guided Speculative Decoding |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| SpectraLDS: Provable Distillation for Linear Dynamical Systems |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Spectral Analysis of Diffusion Models with Application to Schedule Design |
β |
β |
β
|
β |
β |
β |
β |
1 |
| Spectral Analysis of Representational Similarity with Limited Neurons |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Spectral Compressive Imaging via Chromaticity-Intensity Decomposition |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Spectral Conditioning of Attention Improves Transformer Performance |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Spectral Convolutional Conditional Neural Processes |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Spectral Estimation with Free Decompression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Spectral Graph Coarsening Using Inner Product Preservation and the Grassmann Manifold |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Spectral Learning for Infinite-Horizon Average-Reward POMDPs |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Spectral Perturbation Bounds for Low-Rank Approximation with Applications to Privacy |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SpiderSolver: A Geometry-Aware Transformer for Solving PDEs on Complex Geometries |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Spik-NeRF: Spiking Neural Networks for Neural Radiance Fields |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Spike-RetinexFormer: Rethinking Low-light Image Enhancement with Spiking Neural Networks |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Spike-timing-dependent Hebbian learning as noisy gradient descent |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Spike4DGS: Towards High-Speed Dynamic Scene Rendering with 4D Gaussian Splatting via a Spike Camera Array |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Spiking Meets Attention: Efficient Remote Sensing Image Super-Resolution with Attention Spiking Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Spiking Neural Networks Need High-Frequency Information |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SpikingVTG: A Spiking Detection Transformer for Video Temporal Grounding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Spiral: Semantic-Aware Progressive LiDAR Scene Generation and Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SplashNet: SplitβandβShare Encoders for Accurate and Efficient Typing with Surface Electromyography |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Split Gibbs Discrete Diffusion Posterior Sampling |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Split conformal classification with unsupervised calibration |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Spurious-Aware Prototype Refinement for Reliable Out-of-Distribution Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Squared families are useful conjugate priors |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Stab-SGD: Noise-Adaptivity in Smooth Optimization with Stability Ratios |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Stability and Oracle Inequalities for Optimal Transport Maps between General Distributions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Stability and Sharper Risk Bounds with Convergence Rate $\tilde{O}(1/n^2)$ |
β |
β |
β |
β |
β |
β |
β |
0 |
| Stabilizing LTI Systems under Partial Observability: Sample Complexity and Fundamental Limits |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Stable Cinemetrics : Structured Taxonomy and Evaluation for Professional Video Generation |
β |
β |
β |
β
|
β |
β
|
β
|
3 |
| Stable Coresets via Posterior Sampling: Aligning Induced and Full Loss Landscapes |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Stable Matching with Ties: Approximation Ratios and Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Stable Minima of ReLU Neural Networks Suffer from the Curse of Dimensionality: The Neural Shattering Phenomenon |
β |
β |
β |
β |
β |
β |
β
|
1 |
| Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Stable Port-Hamiltonian Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Stackelberg Learning with Outcome-based Payment |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Stackelberg Self-Annotation: A Robust Approach to Data-Efficient LLM Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Staggered Environment Resets Improve Massively Parallel On-Policy Reinforcement Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| StarTrail: Concentric Ring Sequence Parallelism for Efficient Near-Infinite-Context Transformer Model Training |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| State Entropy Regularization for Robust Reinforcement Learning |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| State Size Independent Statistical Error Bound for Discrete Diffusion Models |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| State-Covering Trajectory Stitching for Diffusion Planners |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| StateSpaceDiffuser: Bringing Long Context to Diffusion World Models |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Statistical Analysis of an Adversarial Bayesian Weak Supervision Method |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Statistical Analysis of the Sinkhorn Iterations for Two-Sample Schr\"{o}dinger Bridge Estimation |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Statistical Guarantees for High-Dimensional Stochastic Gradient Descent |
β |
β |
β |
β |
β |
β |
β |
0 |
| Statistical Inference for Gradient Boosting Regression |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Statistical Inference under Performativity |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Statistical Parity with Exponential Weights |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Statistical inference for Linear Stochastic Approximation with Markovian Noise |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Statistics Caching Test-Time Adaptation for Vision-Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Stealthy Yet Effective: Distribution-Preserving Backdoor Attacks on Graph Classification |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SteerConf: Steering LLMs for Confidence Elicitation |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Steering Generative Models with Experimental Data for Protein Fitness Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Steering Information Utility in Key-Value Memory for Language Model Post-Training |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Steering When Necessary: Flexible Steering Large Language Models with Backtracking |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Stepsize anything: A unified learning rate schedule for budgeted-iteration training |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Stitch and Tell: A Structured Data Augmentation Method for Spatial Understanding |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Stochastic Forward-Forward Learning through Representational Dimensionality Compression |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Stochastic Gradients under Nuisances |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Stochastic Momentum Methods for Non-smooth Non-Convex Finite-Sum Coupled Compositional Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Stochastic Optimization in Semi-Discrete Optimal Transport: Convergence Analysis and Minimax Rate |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Stochastic Principal-Agent Problems: Computing and Learning Optimal History-Dependent Policies |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Stochastic Process Learning via Operator Flow Matching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Stochastic Regret Guarantees for Online Zeroth- and First-Order Bilevel Optimization |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Stochastic Shortest Path with Sparse Adversarial Costs |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Stochastically Dominant Peer Prediction |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Storyboard-guided Alignment for Fine-grained Video Action Recognition |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Straight-Line Diffusion Model for Efficient 3D Molecular Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Strassen Attention, Split VC Dimension and Compositionality in Transformers |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Strategic Classification with Non-Linear Classifiers |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Strategic Cost Selection in Participatory Budgeting |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Strategic Costs of Perceived Bias in Fair Selection |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Strategic Hypothesis Testing |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Strategyproof Reinforcement Learning from Human Feedback |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| Stratify or Die: Rethinking Data Splits in Image Segmentation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| StreamFlow: Streaming Audio Generation from Discrete Tokens via Streaming Flow Matching |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| StreamForest: Efficient Online Video Understanding with Persistent Event Memory |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Streaming Attention Approximation via Discrepancy Theory |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Streaming Federated Learning with Markovian Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Streaming Stochastic Submodular Maximization with On-Demand User Requests |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| StruDiCO: Structured Denoising Diffusion with Gradient-free Inference-stage Boosting for Memory and Time Efficient Combinatorial Optimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Structural Causal Bandits under Markov Equivalence |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Structure Matters: Dynamic Policy Gradient |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Structure-Aware Cooperative Ensemble Evolutionary Optimization on Combinatorial Problems with Multimodal Large Language Models |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Structure-Aware Spectral Sparsification via Uniform Edge Sampling |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Structured Initialization for Vision Transformers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Structured Linear CDEs: Maximally Expressive and Parallel-in-Time Sequence Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Structured Reinforcement Learning for Combinatorial Decision-Making |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Structured Temporal Causality for Interpretable Multivariate Time Series Anomaly Detection |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| StyleGuard: Preventing Text-to-Image-Model-based Style Mimicry Attacks by Style Perturbations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SubTrack++ : Gradient Subspace Tracking for Scalable LLM Training |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Subgraph Federated Learning via Spectral Methods |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Subsampled Ensemble Can Improve Generalization Tail Exponentially |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Subspace Networks: Scaling Decentralized Training with Communication-Efficient Model Parallelism |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Sum Estimation under Personalized Local Differential Privacy |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| SuperCLIP: CLIP with Simple Classification Supervision |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Superposition Yields Robust Neural Scaling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Support Vector Generation: Kernelizing Large Language Models for Efficient ZeroβShot NLP |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Surface-Aware Feed-Forward Quadratic Gaussian for Frame Interpolation with Large Motion |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| SurfelSplat: Learning Efficient and Generalizable Gaussian Surfel Representations for Sparse-View Surface Reconstruction |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Switchable Token-Specific Codebook Quantization For Face Image Compression |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| SymMaP: Improving Computational Efficiency in Linear Solvers through Symbolic Preconditioning |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning |
β
|
β
|
β
|
β |
β |
β
|
β
|
5 |
| Symmetry-Preserving Conformer Ensemble Networks for Molecular Representation Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| SynBrain: Enhancing Visual-to-fMRI Synthesis via Probabilistic Representation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| SynCL: A Synergistic Training Strategy with Instance-Aware Contrastive Learning for End-to-End Multi-Camera 3D Tracking |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Synergistic Tensor and Pipeline Parallelism |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Synergy Between the Strong and the Weak: Spiking Neural Networks are Inherently Self-Distillers |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Synergy over Discrepancy: A Partition-Based Approach to Multi-Domain LLM Fine-Tuning |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Synthesize Privacy-Preserving High-Resolution Images via Private Textual Intermediaries |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Synthetic Series-Symbol Data Generation for Time Series Foundation Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Synthetic-powered predictive inference |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| System Prompt Optimization with Meta-Learning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| System-Embedded Diffusion Bridge Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Systematic Reward Gap Optimization for Mitigating VLM Hallucinations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| T-REGS: Minimum Spanning Tree Regularization for Self-Supervised Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| T-norm Selection for Object Detection in Autonomous Driving with Logical Constraints |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| T2V-OptJail: Discrete Prompt Optimization for Text-to-Video Jailbreak Attacks |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| TADA: Improved Diffusion Sampling with Training-free Augmented DynAmics |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| TAI3: Testing Agent Integrity in Interpreting User Intent |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| TAMI: Taming Heterogeneity in Temporal Interactions for Temporal Graph Link Prediction |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| TANDEM: Bi-Level Data Mixture Optimization with Twin Networks |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| TAPIP3D: Tracking Any Point in Persistent 3D Geometry |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| TARFVAE: Efficient One-Step Generative Time Series Forecasting via TARFLOW based VAE |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| TC-Light: Temporally Coherent Generative Rendering for Realistic World Transfer |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| TEMPO: Temporal Multi-scale Autoregressive Generation of Protein Conformational Ensembles |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TF-MAS: Training-free Mamba2 Architecture Search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| TGA: True-to-Geometry Avatar Dynamic Reconstruction |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| THD-BAR: Topology Hierarchical Derived Brain Autoregressive Modeling for EEG Generic Representations |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| TITAN: A Trajectory-Informed Technique for Adaptive Parameter Freezing in Large-Scale VQE |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| TOMCAT: Test-time Comprehensive Knowledge Accumulation for Compositional Zero-Shot Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| TPP-SD: Accelerating Transformer Point Process Sampling with Speculative Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TRACE: Contrastive learning for multi-trial time series data in neuroscience |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| TRACE: Grounding Time Series in Context for Multimodal Embedding and Retrieval |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| TRAP: Targeted Redirecting of Agentic Preferences |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| TRIDENT: Tri-Modal Molecular Representation Learning with Taxonomic Annotations and Local Correspondence |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TRIM: Scalable 3D Gaussian Diffusion Inference with Temporal and Spatial Trimming |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TRUST: Test-Time Refinement using Uncertainty-Guided SSM Traverses |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TRiCo: Triadic Game-Theoretic Co-Training for Robust Semi-Supervised Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| TRoVe: Discovering Error-Inducing Static Feature Biases in Temporal Vision-Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| TS-MOF: Two-Stage Multi-Objective Fine-tuning for Long-Tailed Recognition |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TTRL: Test-Time Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TV-Rec: Time-Variant Convolutional Filter for Sequential Recommendation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| TabDPT: Scaling Tabular Foundation Models on Real Data |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Table as a Modality for Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Table2LaTeX-RL: High-Fidelity LaTeX Code Generation from Table Images via Reinforced Multimodal Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Tabula: A Tabular Self-Supervised Foundation Model for Single-Cell Transcriptomics |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Tackling Biased Evaluators in Dueling Bandits |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Tackling Continual Offline RL through Selective Weights Activation on Aligned Spaces |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Tackling Feature-Classifier Mismatch in Federated Learning via Prompt-Driven Feature Transformation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Tail-Optimized Caching for LLM Inference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Talk2Event: Grounded Understanding of Dynamic Scenes from Event Cameras |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Taming Adversarial Constraints in CMDPs |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Taming Hyperparameter Sensitivity in Data Attribution: Practical Selection Without Costly Retraining |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Taming generative video models for zero-shot optical flow extraction |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Tapered Off-Policy REINFORCE - Stable and efficient reinforcement learning for large language models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Targeted Maximum Likelihood Learning: An Optimization Perspective |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Task-Optimized Convolutional Recurrent Networks Align with Tactile Processing in the Rodent Brain |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Task-Specific Data Selection for Instruction Tuning via Monosemantic Neuronal Activations |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Taught Well Learned Ill: Towards Distillation-conditional Backdoor Attack |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Taxonomy of reduction matrices for Graph Coarsening |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Teaching Language Models to Evolve with Users: Dynamic Profile Modeling for Personalized Alignment |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Teaching Language Models to Reason with Tools |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Teaching Transformers to Solve Combinatorial Problems through Efficient Trial & Error |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Technical Debt in In-Context Learning: Diminishing Efficiency in Long Context |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Template-Guided 3D Molecular Pose Generation via Flow Matching and Differentiable Optimization |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Temporal Chain of Thought: Long-Video Understanding by Thinking in Frames |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Temporal InβContext FineβTuning with Temporal Reasoning for Versatile Control of Video Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Temporal Logic-Based Multi-Vehicle Backdoor Attacks against Offline RL Agents in End-to-end Autonomous Driving |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Temporal-Difference Variational Continual Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Tensor Decomposition Networks for Fast Machine Learning Interatomic Potential Computations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Tensor Product Attention Is All You Need |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Tensor-Parallelism with Partially Synchronized Activations |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| TensorRL-QAS: Reinforcement learning with tensor networks for improved quantum architecture search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Test Time Scaling for Neural Processes |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Test-Time Adaptation by Causal Trimming |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Test-Time Adaptive Object Detection with Foundation Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Test-Time Scaling of Diffusion Models via Noise Trajectory Search |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Test3R: Learning to Reconstruct 3D at Test Time |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| Text to Sketch Generation with Multi-Styles |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Text-Aware Real-World Image Super-Resolution via Diffusion Model with Joint Segmentation Decoders |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Text-to-Code Generation for Modular Building Layouts in Building Information Modeling |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The $\varphi$ Curve: The Shape of Generalization through the Lens of Norm-based Capacity Control |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Adaptive Complexity of Minimizing Relative Fisher Information |
β
|
β |
β |
β |
β |
β |
β |
1 |
| The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Best Instruction-Tuning Data are Those That Fit |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| The Bias-Variance Tradeoff in Data-Driven Optimization: A Local Misspecification Perspective |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| The Boundaries of Fair AI in Medical Image Prognosis: A Causal Perspective |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Burden of Interactive Alignment with Inconsistent Preferences |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Complexity of Correlated Equilibria in Generalized Games |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Complexity of Finding Local Optima in Contrastive Learning |
β |
β
|
β |
β |
β |
β |
β |
1 |
| The Complexity of Symmetric Equilibria in Min-Max Optimization and Team Zero-Sum Games |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Computational Advantage of Depth in Learning High-Dimensional Hierarchical Targets |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| The Computational Complexity of Counting Linear Regions in ReLU Neural Networks |
β
|
β |
β |
β |
β |
β |
β |
1 |
| The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for $\ell_2$ Norm Estimation |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| The Cost of Robustness: Tighter Bounds on Parameter Complexity for Robust Memorization in ReLU Nets |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Curse of Depth in Large Language Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| The Dual Nature of Plasticity Loss in Deep Continual Learning: Dissection and Mitigation |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model |
β
|
β
|
β
|
β |
β
|
β |
β |
4 |
| The Emergence of Abstract Thought in Large Language Models Beyond Any Language |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| The Flood Complex: Large-Scale Persistent Homology on Millions of Points |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| The Fluorescent Veil: A Stealthy and Effective Physical Adversarial Patch Against Traffic Sign Recognition |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| The Fragile Truth of Saliency: Improving LLM Input Attribution via Attention Bias Optimization |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| The Future Unmarked: Watermark Removal in AI-Generated Images via Next-Frame Prediction |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| The Gaussian Mixing Mechanism: Renyi Differential Privacy via Gaussian Sketches |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The Generative Leap: Tight Sample Complexity for Efficiently Learning Gaussian Multi-Index Models |
β
|
β |
β |
β |
β |
β |
β |
1 |
| The Good, the Bad and the Ugly: Meta-Analysis of Watermarks, Transferable Attacks and Adversarial Defenses |
β
|
β |
β |
β |
β |
β |
β |
1 |
| The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Hawthorne Effect in Reasoning Models: Evaluating and Steering Test Awareness |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity |
β
|
β |
β
|
β |
β
|
β
|
β
|
5 |
| The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| The Indra Representation Hypothesis for Multimodal Alignment |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The Logical Expressiveness of Temporal GNNs via Two-Dimensional Product Logics |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| The Mirage of Performance Gains: Why Contrastive Decoding Fails to Mitigate Object Hallucinations in MLLMs? |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability? |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The Nuclear Route: Sharp Asymptotics of ERM in Overparameterized Quadratic Networks |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| The Omni-Expert: A Computationally Efficient Approach to Achieve a Mixture of Experts in a Single Expert Model |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| The Parameterized Complexity of Computing the VC-Dimension |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Persistence of Neural Collapse Despite Low-Rank Bias |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| The Power of Iterative Filtering for Supervised Learning with (Heavy) Contamination |
β
|
β |
β |
β |
β |
β |
β |
1 |
| The Price of Opportunity Fairness in Matroid Allocation Problems |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Price of Sparsity: Sufficient Conditions for Sparse Recovery using Sparse and Sparsified Measurements |
β |
β |
β |
β |
β |
β |
β |
0 |
| The Primacy of Magnitude in Low-Rank Adaptation |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| The Promise of RL for Autoregressive Image Editing |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| The Quest for Universal Master Key Filters in DS-CNNs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The Quotient Bayesian Learning Rule |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| The Rich and the Simple: On the Implicit Bias of Adam and SGD |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| The Rise of Parameter Specialization for Knowledge Storage in Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Structural Complexity of Matrix-Vector Multiplication |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| The Structure of Relation Decoding Linear Operators in Large Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The Underappreciated Power of Vision Models for Graph Structural Understanding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| The Unseen Threat: Residual Knowledge in Machine Unlearning under Perturbed Samples |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| The VLLM Safety Paradox: Dual Ease in Jailbreak Attack and Defense |
β |
β |
β
|
β |
β |
β |
β |
1 |
| The World Is Bigger! A Computationally-Embedded Perspective on the Big World Hypothesis |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| The emergence of sparse attention: impact of data distribution and benefits of repetition |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| The quest for the GRAph Level autoEncoder (GRALE) |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| The third pillar of causal analysis? A measurement perspective on causal representations |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Theoretical Benefit and Limitation of Diffusion Language Model |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| Theoretical Guarantees for the Retention of Strict Nash Equilibria by Coevolutionary Algorithms |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Theoretical Investigation of Adafactor for Non-Convex Smooth Optimization |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Theoretically Grounded Framework for LLM Watermarking: A Distribution-Adaptive Approach |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Theory-Driven Label-Specific Representation for Incomplete Multi-View Multi-Label Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Think Only When You Need with Large Hybrid-Reasoning Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Think before Recommendation: Autonomous Reasoning-enhanced Recommender |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Think or Not? Exploring Thinking Efficiency in Large Reasoning Models via an Information-Theoretic Lens |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ThinkSound: Chain-of-Thought Reasoning in Multimodal LLMs for Audio Generation and Editing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Thinker: Learning to Think Fast and Slow |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Thinking vs. Doing: Improving Agent Reasoning by Scaling Test-Time Interaction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Thinkless: LLM Learns When to Think |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| This Time is Different: An Observability Perspective on Time Series Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Thompson Sampling for Multi-Objective Linear Contextual Bandit |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Thompson Sampling in Function Spaces via Neural Operators |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Thought Communication in Multiagent Collaboration |
β |
β |
β
|
β
|
β
|
β |
β |
3 |
| Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Thresholds for sensitive optimality and Blackwell optimality in stochastic games |
β |
β |
β |
β |
β |
β |
β |
0 |
| Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Thumb on the Scale: Optimal Loss Weighting in Last Layer Retraining |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Tight Asymptotics of Extreme Order Statistics |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Tight Bounds for Answering Adaptively Chosen Concentrated Queries |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Tight Bounds for Maximum Weight Matroid Independent Set and Matching in the Zero Communication Model |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Tight Bounds on the Distortion of Randomized and Deterministic Distributed Voting |
β |
β |
β |
β |
β |
β |
β |
0 |
| Tight Generalization Bounds for Large-Margin Halfspaces |
β |
β |
β |
β |
β |
β |
β |
0 |
| Tight High-Probability Bounds for Nonconvex Heavy-Tailed Scenario under Weaker Assumptions |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Tight Lower Bounds and Improved Convergence in Performative Prediction |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Tight analyses of first-order methods with error feedback |
β
|
β
|
β |
β |
β
|
β
|
β
|
5 |
| Tightening Regret Lower and Upper Bounds in Restless Rising Bandits |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization |
β |
β |
β |
β |
β |
β |
β |
0 |
| Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Time Series Generation Under Data Scarcity: A Unified Generative Modeling Approach |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Time-Embedded Algorithm Unrolling for Computational MRI |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Time-Evolving Dynamical System for Learning Latent Representations of Mouse Visual Neural Activity |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Time-Masked Transformers with Lightweight Test-Time Adaptation for Neural Speech Decoding |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Time-o1: Time-Series Forecasting Needs Transformed Label Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Time-uniform and Asymptotic Confidence Sequence of Quantile under Local Differential Privacy |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| TimeEmb: A Lightweight Static-Dynamic Disentanglement Framework for Time Series Forecasting |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| TimePerceiver: An Encoder-Decoder Framework for Generalized Time-Series Forecasting |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TimeWak: Temporal Chained-Hashing Watermark for Time Series Data |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Timely Clinical Diagnosis through Active Test Selection |
β
|
β
|
β
|
β |
β |
β
|
β
|
5 |
| Titans: Learning to Memorize at Test Time |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable RL |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| To Think or Not To Think: A Study of Thinking in Rule-Based Visual Reinforcement Fine-Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ToF-IP: Time-of-Flight Enhanced Sparse Inertial Poser for Real-time Human Motion Capture |
β |
β |
β |
β |
β
|
β
|
β
|
3 |
| TokMan:Tokenize Manhattan Mask Optimization for Inverse Lithography |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Token Bottleneck: One Token to Remember Dynamics |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Token Embeddings Violate the Manifold Hypothesis |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Token Perturbation Guidance for Diffusion Models |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Token-Level Self-Play with Importance-Aware Guidance for Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TokenSqueeze: Performance-Preserving Compression for Reasoning LLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| TokenSwap: A Lightweight Method to Disrupt Memorized Sequences in LLMs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Too Late to Recall: Explaining the Two-Hop Problem in Multimodal Knowledge Retrieval |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| ToolRL: Reward is All Tool Learning Needs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| TopER: Topological Embeddings in Graph Representation Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous Driving |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Topology-Aware Conformal Prediction for Stream Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Topology-Aware Learning of Tubular Manifolds via SE(3)-Equivariant Network on Ball B-Spline Curve |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Topology-aware Graph Diffusion Model with Persistent Homology |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Tortoise and Hare Guidance: Accelerating Diffusion Model Inference with Multirate Integration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Toward Artificial Palpation: Representation Learning of Touch on Soft Bodies |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Toward Efficient Inference Attacks: Shadow Model Sharing via Mixture-of-Experts |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Toward Human Deictic Gesture Target Estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Toward Interpretable Evaluation Measures for Time Series Segmentation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Toward Relative Positional Encoding in Spiking Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Toward a Unified Geometry Understanding : Riemannian Diffusion Framework for Graph Generation and Prediction |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards 3D Objectness Learning in an Open World |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Towards Accurate Time Series Forecasting via Implicit Decoding |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Building Model/Prompt-Transferable Attackers against Large Vision-Language Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Towards Comprehensive Scene Understanding: Integrating First and Third-Person Views for LVLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Doctor-Like Reasoning: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients |
β
|
β
|
β
|
β
|
β |
β
|
β
|
6 |
| Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Towards Effective Federated Graph Foundation Model via Mitigating Knowledge Entanglement |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Towards Fully FP8 GEMM LLM Training at Scale |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Towards General Continuous Memory for Vision-Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Towards Generalizable 3D Human Pose Estimation via Ensembles on Flat Loss Landscapes |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Generalizable Detector for Generated Image |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Generalizable Multi-Policy Optimization with Self-Evolution for Job Scheduling |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Towards Generalizable Retina Vessel Segmentation with Deformable Graph Priors |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Graph Foundation Models: Training on Knowledge Graphs Enables Transferability to General Graphs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Identifiability of Hierarchical Temporal Causal Representation Learning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Interpretable and Efficient Attention: Compressing All by Contracting a Few |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Towards Irreversible Attack: Fooling Scene Text Recognition via Multi-Population Coevolution Search |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Multi-Table Learning: A Novel Paradigm for Complementarity Quantification and Integration |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Towards Multiscale Graph-based Protein Learning with Geometric Secondary Structural Motifs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| Towards Physics-informed Spatial Intelligence with Human Priors: An Autonomous Driving Pilot Study |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Pre-trained Graph Condensation via Optimal Transport |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Towards Predicting Any Human Trajectory In Context |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Principled Unsupervised Multi-Agent Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Towards Prospective Medical Image Reconstruction via Knowledge-Informed Dynamic Optimal Transport |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Provable Emergence of In-Context Reinforcement Learning |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Towards Realistic Earth-Observation Constellation Scheduling: Benchmark and Methodology |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Towards Reliable Identification of Diffusion-based Image Manipulations |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Towards Reliable LLM-based Robots Planning via Combined Uncertainty Estimation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Towards Reliable and Holistic Visual In-Context Learning Prompt Selection |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Robust Parameter-Efficient Fine-Tuning for Federated Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Towards Robust Uncertainty Calibration for Composed Image Retrieval |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Towards Robust Zero-Shot Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Towards Self-Refinement of Vision-Language Models with Triangular Consistency |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Towards Straggler-Resilient Split Federated Learning: An Unbalanced Update Approach |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Towards Syn-to-Real IQA: A Novel Perspective on Reshaping Synthetic Data Distributions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards Understanding Safety Alignment: A Mechanistic Perspective from Safety Neurons |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Understanding Transformers in Learning Random Walks |
β |
β |
β |
β
|
β |
β |
β
|
2 |
| Towards Understanding the Mechanisms of Classifier-Free Guidance |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Towards Unified and Lossless Latent Space for 3D Molecular Latent Diffusion Modeling |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Unsupervised Domain Bridging via Image Degradation in Semantic Segmentation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Unsupervised Open-Set Graph Domain Adaptation via Dual Reprogramming |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Towards Unsupervised Training of Matching-based Graph Edit Distance Solver via Preference-aware GAN |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards Visualization-of-Thought Jailbreak Attack against Large Visual Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Towards a General Attention Framework on Gyrovector Spaces for Matrix Manifolds |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Towards a Geometric Understanding of Tensor Learning via the t-Product |
β |
β
|
β
|
β |
β |
β |
β |
2 |
| Towards a Golden Classifier-Free Guidance Path via Foresight Fixed Point Iterations |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Towards a Pairwise Ranking Model with Orderliness and Monotonicity for Label Enhancement |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Towards foundational LiDAR world models with efficient latent flow matching |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Towards the Resistance of Neural Network Fingerprinting to Fine-tuning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Tracing Back the Malicious Clients in Poisoning Attacks to Federated Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Tracing the Representation Geometry of Language Models from Pretraining to Post-training |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Tracing the Roots: Leveraging Temporal Dynamics in Diffusion Trajectories for Origin Attribution |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Track3R: Joint Point Map and Trajectory Prior for Spatiotemporal 3D Understanding |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Tracking and Understanding Object Transformations |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TractoTransformer: Diffusion MRI Streamline Tractography using CNN and Transformer Networks |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Tradeoffs between Mistakes and ERM Oracle Calls in Online and Transductive Online Learning |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Train on Pins and Test on Obstacles for Rectilinear Steiner Minimum Tree |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Train to Defend: First Defense Against Cryptanalytic Neural Network Parameter Extraction Attacks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Train with Perturbation, Infer after Merging: A Two-Stage Framework for Continual Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Training Language Models to Generate Quality Code with Program Analysis Feedback |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Training Language Models to Reason Efficiently |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Training Robust Graph Neural Networks by Modeling Noise Dependencies |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Training a Scientific Reasoning Model for Chemistry |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Training the Untrainable: Introducing Inductive Bias via Representational Alignment |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Training-Free Bayesianization for Low-Rank Adapters of Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Training-Free Constrained Generation With Stable Diffusion Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Training-Free Efficient Video Generation via Dynamic Token Carving |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Training-Free Safe Denoisers for Safe Use of Diffusion Models |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Training-Free Safe Text Embedding Guidance for Text-to-Image Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Training-Free Test-Time Adaptation via Shape and Style Guidance for Vision-Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Training-free Detection of AI-generated images via Cropping Robustness |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Training-free Online Video Step Grounding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| TrajAgent: An LLM-Agent Framework for Trajectory Modeling via Large-and-Small Model Collaboration |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| TrajMamba: An Efficient and Semantic-rich Vehicle Trajectory Pre-training Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Trajectory Graph Learning: Aligning with Long Trajectories in Reinforcement Learning Without Reward Design |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| TranSUN: A Preemptive Paradigm to Eradicate Retransformation Bias Intrinsically from Regression Models in Recommender Systems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TransMLA: Migrating GQA Models to MLA with Full DeepSeek Compatibility and Speedup |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Transcending Cost-Quality Tradeoff in Agent Serving via Session-Awareness |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Transductive Conformal Inference for Full Ranking |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Transfer Faster, Price Smarter: Minimax Dynamic Pricing under Cross-Market Preference Shift |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Transfer Learning for Benign Overfitting in High-Dimensional Linear Regression |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Transfer Learning on Edge Connecting Probability Estimation Under Graphon Model |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| TransferTraj: A Vehicle Trajectory Learning Model for Region and Task Transferability |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Transferable Black-Box One-Shot Forging of Watermarks via Image Preference Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Transferring Causal Effects using Proxies |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Transferring Linear Features Across Language Models With Model Stitching |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Transformer Key-Value Memories Are Nearly as Interpretable as Sparse Autoencoders |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Transformer brain encoders explain human high-level visual responses |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Transformers Learn Faster with Semantic Focus |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Transformers are almost optimal metalearners for linear classification |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Transformers for Mixed-type Event Sequences |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Transforming Gaps into Gains: Bridging Model and Data Heterogeneity in Federated Learning via Knowledge Weak-Aware Zones |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Transforming Generic Coder LLMs to Effective Binary Code Embedding Models for Similarity Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Transition Matching: Scalable and Flexible Generative Modeling |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Transstratal Adversarial Attack: Compromising Multi-Layered Defenses in Text-to-Image Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Traversal Verification for Speculative Tree Decoding |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| Treatment Effect Estimation for Optimal Decision-Making |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Tree Ensemble Explainability through the Hoeffding Functional Decomposition and TreeHFD Algorithm |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Tree of Preferences for Diversified Recommendation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Tree-Based Premise Selection for Lean4 |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Tree-Guided Diffusion Planner |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Tree-Sliced Entropy Partial Transport |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TreeGen: A Bayesian Generative Model for Hierarchies |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| TreeSplat: Mergeable Tree for Deformable Gaussian Splatting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| True Impact of Cascade Length in Contextual Cascading Bandits |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Trust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Truth over Tricks: Measuring and Mitigating Shortcut Learning in Misinformation Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Truthful Aggregation of LLMs with an Application to Online Advertising |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Turbocharging Gaussian Process Inference with Approximate Sketch-and-Project |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Turning the Tables: Enabling Backward Transfer via Causal-Aware LoRA in Continual Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets |
β
|
β
|
β
|
β |
β |
β
|
β
|
5 |
| Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Two Heads are Better than One: Simulating Large Transformers with Small Ones |
β |
β |
β |
β |
β |
β |
β |
0 |
| Two-Steps Diffusion Policy for Robotic Manipulation via Genetic Denoising |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| TwoβStage Learning of Stabilizing Neural Controllers via Zubov Sampling and Iterative Domain Expansion |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| TΓ½r-the-Pruner: Structural Pruning LLMs via Global Sparsity Distribution Optimization |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| U-CAN: Unsupervised Point Cloud Denoising with Consistency-Aware Noise2Noise Matching |
β |
β
|
β
|
β
|
β |
β |
β |
3 |
| U-REPA: Aligning Diffusion U-Nets to ViTs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| UEPI: Universal Energy-Behavior-Preserving Integrators for Energy Conservative/Dissipative Differential Equations |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| UFM: A Simple Path towards Unified Dense Correspondence with Flow |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| UFO-RL: Uncertainty-Focused Optimization for Efficient Reinforcement Learning Data Selection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| UFT: Unifying Supervised and Reinforcement Fine-Tuning |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| UGG-ReID: Uncertainty-Guided Graph Model for Multi-Modal Object Re-Identification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| UGM2N: An Unsupervised and Generalizable Mesh Movement Network via M-Uniform Loss |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| UGoDIT: Unsupervised Group Deep Image Prior Via Transferable Weights |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| UMA: A Family of Universal Models for Atoms |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| UMAMI: Unifying Masked Autoregressive Models and Deterministic Rendering for View Synthesis |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| UMoE: Unifying Attention and FFN with Shared Experts |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Ultra-high Resolution Watermarking Framework Resistant to Extreme Cropping and Scaling |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| UltraLED: Learning to See Everything in Ultra-High Dynamic Range Scenes |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Ultrametric Cluster Hierarchies: I Want βem All! |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| UnCLe: Towards Scalable Dynamic Causal Discovery in Non-linear Temporal Systems |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Unbalanced Optimal Total Variation Transport: A Theoretical Approach to Spatial Resource Allocation Problems |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Unbiased Prototype Consistency Learning for Multi-Modal and Multi-Task Object Re-Identification |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Uncertain Knowledge Graph Completion via Semi-Supervised Confidence Distribution Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncertainty Estimation by Flexible Evidential Deep Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncertainty Estimation on Graphs with Structure Informed Stochastic Partial Differential Equations |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Uncertainty Quantification for Deep Regression using Contextualised Normalizing Flows |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Uncertainty Quantification for Physics-Informed Neural Networks with Extended Fiducial Inference |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Uncertainty Quantification with the Empirical Neural Tangent Kernel |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncertainty-Aware Multi-Objective Reinforcement Learning-Guided Diffusion Models for 3D De Novo Molecular Design |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncertainty-Calibrated Prediction of Randomly-Timed Biomarker Trajectories with Conformal Bands |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Uncertainty-Guided Exploration for Efficient AlphaZero Training |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Uncertainty-Informed Meta Pseudo Labeling for Surrogate Modeling with Limited Labeled Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncertainty-Sensitive Privileged Learning |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Uncertainty-aware Preference Alignment for Diffusion Policies |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Uncover Governing Law of Pathology Propagation Mechanism Through A Mean-Field Game |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uncovering a Universal Abstract Algorithm for Modular Addition in Neural Networks |
β
|
β
|
β |
β
|
β
|
β |
β
|
5 |
| Uncovering the Spectral Bias in Diagonal State Space Models |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Under the Shadow: Exploiting Opacity Variation for Fine-grained Shadow Detection |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Understanding Adam Requires Better Rotation Dependent Assumptions |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Understanding Bias Terms in Neural Representations |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Understanding Contrastive Learning via Gaussian Mixture Models |
β |
β |
β
|
β |
β |
β |
β |
1 |
| Understanding Data Influence in Reinforcement Finetuning |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Understanding Differential Transformer Unchains Pretrained Self-Attentions |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Understanding Fairness and Prediction Error through Subspace Decomposition and Influence Analysis |
β
|
β
|
β
|
β
|
β
|
β |
β |
5 |
| Understanding Generalization in Physics Informed Models through Affine Variety Dimensions |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Understanding LLM Behaviors via Compression: Data Generation, Knowledge Acquisition and Scaling Laws |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Understanding Outer Optimizers in Local SGD: Learning Rates, Momentum, and Acceleration |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Understanding Parametric and Contextual Knowledge Reconciliation within Large Language Models |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Understanding Prompt Tuning and In-Context Learning via Meta-Learning |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Understanding Softmax Attention Layers:\\ Exact Mean-Field Analysis on a Toy Problem |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Understanding and Enhancing Mask-Based Pretraining towards Universal Representations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Understanding and Enhancing Message Passing on Heterophilic Graphs via Compatibility Matrix |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Understanding and Improving Fast Adversarial Training against $l_0$ Bounded Perturbations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Understanding and Rectifying Safety Perception Distortion in VLMs |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Understanding protein function with a multimodal retrieval-augmented foundation model |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Understanding the Evolution of the Neural Tangent Kernel at the Edge of Stability |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| Understanding the Gain from Data Filtering in Multimodal Contrastive Learning |
β
|
β |
β |
β
|
β
|
β |
β
|
4 |
| Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Understanding while Exploring: Semantics-driven Active Mapping |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Unextractable Protocol Models: Collaborative Training and Inference without Weight Materialization |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Unfolding the Black Box of Recurrent Neural Networks for Path Integration |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Uni-LoRA: One Vector is All You Need |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Uni-RL: Unifying Online and Offline RL via Implicit Value Regularization |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| UniDomain: Pretraining a Unified PDDL Domain from Real-World Demonstrations for Generalizable Robot Task Planning |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| UniGTE: Unified GraphβText Encoding for Zero-Shot Generalization across Graph Tasks and Domains |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| UniMotion: A Unified Motion Framework for Simulation, Prediction and Planning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| UniSite: The First Cross-Structure Dataset and Learning Framework for End-to-End Ligand Binding Site Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| UniTok: a Unified Tokenizer for Visual Generation and Understanding |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| UniTraj: Learning a Universal Trajectory Foundation Model from Billion-Scale Worldwide Traces |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| UniTransfer: Video Concept Transfer via Progressive Spatio-Temporal Decomposition |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| UniViT: Unifying Image and Video Understanding in One Vision Encoder |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| UniZyme: A Unified Protein Cleavage Site Predictor Enhanced with Enzyme Active-Site Knowledge |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unified 2D-3D Discrete Priors for Noise-Robust and Calibration-Free Multiview 3D Human Pose Estimation |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unified Reinforcement and Imitation Learning for Vision-Language Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Unified Scaling Laws for Compressed Representations |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Unified Transferability Metrics for Time Series Foundation Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unified all-atom molecule generation with neural fields |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Uniform Wrappers: Bridging Concave to Quadratizable Functions in Online Optimization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unifying Proportional Fairness in Centroid and Non-Centroid Clustering |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Unifying Re-Identification, Attribute Inference, and Data Reconstruction Risks in Differential Privacy |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unifying Reconstruction and Density Estimation via Invertible Contraction Mapping in One-Class Classification |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unifying Symbolic Music Arrangement: Track-Aware Reconstruction and Structured Tokenization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unifying and Enhancing Graph Transformers via a Hierarchical Mask Framework |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| UniteFormer: Unifying Node and Edge Modalities in Transformers for Vehicle Routing Problems |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Universal Causal Inference in a Topos |
β |
β |
β |
β |
β |
β |
β |
0 |
| Universal Cross-Tokenizer Distillation via Approximate Likelihood Matching |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Universal Few-shot Spatial Control for Diffusion Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Universal Sequence Preconditioning |
β
|
β |
β
|
β |
β |
β |
β
|
3 |
| Universal Video Temporal Grounding with Generative Multi-modal Large Language Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Universal Visuo-Tactile Video Understanding for Embodied Interaction |
β |
β |
β |
β
|
β
|
β |
β |
2 |
| Universally Invariant Learning in Equivariant GNNs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unlabeled Data Can Provably Enhance In-Context Learning of Transformers |
β |
β
|
β |
β |
β
|
β |
β
|
3 |
| Unlabeled Data Improves Fine-Grained Image Zero-shot Classification with Multimodal LLMs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Unlearned but Not Forgotten: Data Extraction after Exact Unlearning in LLM |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unlearning-Aware Minimization |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Unleashing Hour-Scale Video Training for Long Video-Language Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Unleashing the Power of One-Step Diffusion based Image Super-Resolution via a Large-Scale Diffusion Discriminator |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unlocker: Disentangle the Deadlock of Learning between Label-noisy and Long-tailed Data |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Unlocking Dataset Distillation with Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Unlocking Multimodal Mathematical Reasoning via Process Reward Model |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Unlocking SLM Potential for Data Analysis Code Generation via Non-Parametric Knowledge Distillation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Unmasking Puppeteers: Leveraging Biometric Leakage to Expose Impersonation in AI-Based Videoconferencing |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unraveling Metameric Dilemma for Spectral Reconstruction: A High-Fidelity Approach via Semi-Supervised Learning |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Unsupervised Federated Graph Learning |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Unsupervised Learning for Optimal Transport plan prediction between unbalanced graphs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unsupervised Trajectory Optimization for 3D Registration in Serial Section Electron Microscopy using Neural ODEs |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unveiling Concept Attribution in Diffusion Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unveiling Environmental Sensitivity of Individual Gains in Influence Maximization |
β
|
β
|
β
|
β |
β |
β
|
β
|
5 |
| Unveiling Extraneous Sampling Bias with Data Missing-Not-At-Random |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Unveiling Transformer Perception by Exploring Input Manifolds |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Unveiling m-Sharpness Through the Structure of Stochastic Gradient Noise |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Unveiling the Uncertainty in Embodied and Operational Carbon of Large AI Models through a Probabilistic Carbon Accounting Model |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| User-Instructed Disparity-aware Defocus Control |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| V-CECE: Visual Counterfactual Explanations via Conceptual Edits |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| VCM: Vision Concept Modeling with Adaptive Vision Token Compression via Instruction Fine-Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VERA: Variational Inference Framework for Jailbreaking Large Language Models |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VETA-DiT: Variance-Equalized and Temporally Adaptive Quantization for Efficient 4-bit Diffusion Transformers |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VIBE: Annotation-Free Video-to-Text Information Bottleneck Evaluation for TL;DR |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VIKING: Deep variational inference with stochastic projections |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| VIPAMIN: Visual Prompt Initialization via Embedding Selection and Subspace Expansion |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| VITRIX-CLIPIN: Enhancing Fine-Grained Visual Understanding in CLIP via Instruction-Editing Data and Long Captions |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VITRIX-UniViTAR: Unified Vision Transformer with Native Resolution |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models |
β |
β |
β |
β
|
β
|
β |
β
|
3 |
| VLM in a flash: I/O-Efficient Sparsification of Vision-Language Model via Neuron Chunking |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VLM-RΒ³: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| VLMLight: Safety-Critical Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning Architecture |
β
|
β
|
β |
β
|
β
|
β
|
β
|
6 |
| VLMs can Aggregate Scattered Training Patches |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| VORTA: Efficient Video Diffusion via Routing Sparse Attention |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| VPO: Reasoning Preferences Optimization Based on $\mathcal{V}$-Usable Information |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| VQToken: Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VTON-VLLM: Aligning Virtual Try-On Models with Human Preferences |
β |
β
|
β
|
β
|
β |
β
|
β
|
5 |
| VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Valid Inference with Imperfect Synthetic Data |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Valid Selection among Conformal Sets |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Validating LLM-as-a-Judge Systems under Rating Indeterminacy |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Value Diffusion Reinforcement Learning |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Value Gradient Guidance for Flow Matching Alignment |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| Value Improved Actor Critic Algorithms |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Value-Guided KV Compression for LLMs via Approximated CUR Decomposition |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Value-Guided Search for Efficient Chain-of-Thought Reasoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Vanish into Thin Air: Cross-prompt Universal Adversarial Attacks for SAM2 |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VaporTok: RL-Driven Adaptive Video Tokenizer with Prior & Task Awareness |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| VarFlow: Proper Scoring-Rule Diffusion Distillation via Energy Matching |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits |
β
|
β
|
β |
β |
β |
β |
β
|
3 |
| Variance-Reduced Long-Term Rehearsal Learning with Quadratic Programming Reformulation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Variational Inference with Mixtures of Isotropic Gaussians |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Variational Learning Finds Flatter Solutions at the Edge of Stability |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Variational PΓ³lya Tree |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Variational Regularized Unbalanced Optimal Transport: Single Network, Least Action |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Variational Supervised Contrastive Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Variational Task Vector Composition |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Variational Transdimensional Inference |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Variational Uncertainty Decomposition for In-Context Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Vector Database Watermarking |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Vector Quantization in the Brain: Grid-like Codes in World Models |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Venus-MAXWELL: Efficient Learning of Protein-Mutation Stability Landscapes using Protein Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| VeriLoC: Line-of-Code Level Prediction of Hardware Design Quality from Verilog Code |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VeriThinker: Learning to Verify Makes Reasoning Model Efficient |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Versatile Transferable Unlearnable Example Generator |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Vertical Federated Feature Screening |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ViSPLA: Visual Iterative Self-Prompting for Language-Guided 3D Affordance Learning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Vicinal Label Supervision for Reliable Aleatoric and Epistemic Uncertainty Estimation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Vicinity-Guided Discriminative Latent Diffusion for Privacy-Preserving Domain Adaptation |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Vid-SME: Membership Inference Attacks against Large Video Understanding Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Video Perception Models for 3D Scene Synthesis |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Video World Models with Long-term Spatial Memory |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Video-R1: Reinforcing Video Reasoning in MLLMs |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VideoLucy: Deep Memory Backtracking for Long Video Understanding |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| VideoMAR: Autoregressive Video Generation with Continuous Tokens |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| VideoTitans: Scalable Video Prediction with Integrated Short- and Long-term Memory |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VideoVLA: Video Generators Can Be Generalizable Robot Manipulators |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| ViewCraft3D: High-fidelity and View-Consistent 3D Vector Graphics Synthesis |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Vinci: Deep Thinking in Text-to-Image Generation using Unified Model with Reinforcement Learning |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| VisDiff: SDF-Guided Polygon Generation for Visibility Reconstruction, Characterization and Recognition |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Generation |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Vision Function Layer in Multimodal LLMs |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Vision Transformers Don't Need Trained Registers |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Vision Transformers with Self-Distilled Registers |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Vision-centric Token Compression in Large Language Model |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| VisionβLanguageβVision AutoβEncoder: Scalable Knowledge Distillation from Diffusion Models |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Visual Anagrams Reveal Hidden Differences in Holistic Shape Processing Across Vision Models |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| Visual Diversity and Region-aware Prompt Learning for Zero-shot HOI Detection |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Visual Instruction Bottleneck Tuning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Visual Jenga: Discovering Object Dependencies via Counterfactual Inpainting |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| Visual Structures Help Visual Reasoning: Addressing the Binding Problem in LVLMs |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Visual Sync: MultiβCamera Synchronization via CrossβView Object Motion |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| VisualLens: Personalization through Task-Agnostic Visual History |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| VividFace: A Robost and High-Fidelity Video Face Swapping Framework |
β |
β |
β
|
β |
β
|
β |
β
|
3 |
| Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding |
β |
β |
β |
β |
β |
β |
β |
0 |
| Vocabulary-Guided Gait Recognition |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Volume Transmission Implements Context Factorization to Target Online Credit Assignment and Enable Compositional Generalization |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| VoxDet: Rethinking 3D Semantic Scene Completion as Dense Object Detection |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Vulnerable Data-Aware Adversarial Training |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| WEDGE: Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| WHAT MAKES MATH PROBLEMS HARD FOR REINFORCEMENT LEARNING: A CASE STUDY |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| WISA: World simulator assistant for physics-aware text-to-video generation |
β |
β |
β |
β |
β
|
β |
β
|
2 |
| WKV-sharing embraced random shuffle RWKV high-order modeling for pan-sharpening |
β |
β |
β
|
β |
β |
β |
β |
1 |
| WMCopier: Forging Invisible Watermarks on Arbitrary Images |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| WaLRUS: Wavelets for Long range Representation Using State Space Methods |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| Walking the SchrΓΆdinger Bridge: A Direct Trajectory for Text-to-3D Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Walking the Tightrope: Autonomous Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| WarpGAN: Warping-Guided 3D GAN Inversion with Style-Based Novel View Inpainting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Wasserstein Convergence of Critically Damped Langevin Diffusions |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Wasserstein Transfer Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Watermarking Autoregressive Image Generation |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| WaveAR: Wavelet-Aware Continuous Autoregressive Diffusion for Accurate Human Motion Prediction |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Wavelet Canonical Coherence for Nonstationary Signals |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| Wavy Transformer |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Weak-shot Keypoint Estimation via Keyness and Correspondence Transfer |
β |
β |
β
|
β
|
β |
β |
β
|
3 |
| Weak-to-Strong Generalization under Distribution Shifts |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| WeatherPrompt: Multi-modality Representation Learning for All-Weather Drone Visual Geo-Localization |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Weaver: Shrinking the Generation-Verification Gap by Scaling Compute for Verification |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Web-Shepherd: Advancing PRMs for Reinforcing Web Agents |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| WebDancer: Towards Autonomous Information Seeking Agency |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| WebThinker: Empowering Large Reasoning Models with Deep Research Capability |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| WhAM: Towards A Translative Model of Sperm Whale Vocalization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| What Can RL Bring to VLA Generalization? An Empirical Study |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization |
β
|
β |
β |
β |
β |
β |
β |
1 |
| What Do Latent Action Models Actually Learn? |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| What Does It Take to Build a Performant Selective Classifier? |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| What Expressivity Theory Misses: Message Passing Complexity for GNNs |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| What Happens During the Loss Plateau? Understanding Abrupt Learning in Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| What Makes a Reward Model a Good Teacher? An Optimization Perspective |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| What Matters in Data for DPO? |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| What Moves the Eyes: Doubling Mechanistic Model Performance Using Deep Networks to Discover and Test Cognitive Hypotheses |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| What Really is a Member? Discrediting Membership Inference via Poisoning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| What We Miss Matters: Learning from the Overlooked in Point Cloud Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| What are you sinking? A geometric approach on attention sink |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| What do you know? Bayesian knowledge inference for navigating agents |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| When Additive Noise Meets Unobserved Mediators: Bivariate Denoising Diffusion for Causal Discovery |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| When Are Concepts Erased From Diffusion Models? |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| When Can Model-Free Reinforcement Learning be Enough for Thinking? |
β |
β
|
β |
β |
β
|
β
|
β
|
4 |
| When Causal Dynamics Matter: Adapting Causal Strategies through Meta-Aware Interventions |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| When Data Can't Meet: Estimating Correlation Across Privacy Barriers |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective |
β |
β
|
β |
β |
β |
β |
β
|
2 |
| When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| When Does Curriculum Learning Help? A Theoretical Perspective |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| When Kernels Multiply, Clusters Unify: Fusing Embeddings with the Kronecker Product |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| When Lower-Order Terms Dominate: Adaptive Expert Algorithms for Heavy-Tailed Losses |
β
|
β
|
β |
β |
β
|
β |
β
|
4 |
| When Models Donβt Collapse: On the Consistency of Iterative MLE |
β
|
β |
β |
β |
β |
β |
β
|
2 |
| When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| When Thinking Drifts: Evidential Grounding for Robust Video Reasoning |
β |
β |
β
|
β |
β
|
β
|
β
|
4 |
| When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| When Worse is Better: Navigating the Compression Generation Trade-off In Visual Tokenization |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| When and How Unlabeled Data Provably Improve In-Context Learning |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| When and how can inexact generative models still sample from the data manifold? |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| When majority rules, minority loses: bias amplification of gradient descent |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Where Does It Exist from the Low-Altitude: Spatial Aerial Video Grounding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Where Graph Meets Heterogeneity: Multi-View Collaborative Graph Experts |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Which Algorithms Have Tight Generalization Bounds? |
β |
β |
β
|
β |
β |
β |
β
|
2 |
| Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions |
β |
β
|
β
|
β |
β
|
β |
β |
3 |
| Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Who Reasons in the Large Language Models? |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Who Speaks for the Trigger? Dynamic Expert Routing in Backdoored Mixture-of-Experts Transformers |
β
|
β |
β
|
β
|
β
|
β
|
β
|
6 |
| Who You Are Matters: Bridging Interests and Social Roles via LLM-Enhanced Logic Recommendation |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Whole-Body Conditioned Egocentric Video Prediction |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Whose Instructions Count? Resolving Preference Bias in Instruction Fine-Tuning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Why 1 + 1 < 1 in Visual Token Pruning: Beyond Naive Integration via Multi-Objective Balanced Covering |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| Why Diffusion Models Donβt Memorize: The Role of Implicit Dynamical Regularization in Training |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| Why Do Some Language Models Fake Alignment While Others Don't? |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation |
β |
β
|
β |
β
|
β
|
β |
β
|
4 |
| Why Masking Diffusion Works: Condition on the Jump Schedule for Improved Discrete Diffusion |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Why Playing Against Diverse and Challenging Opponents Speeds Up Coevolution: A Theoretical Analysis on Combinatorial Games |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Why Popular MOEAs Are Popular: Proven Advantages in Approximating the Pareto Front |
β
|
β |
β |
β |
β |
β |
β |
1 |
| Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Wide-Horizon Thinking and Simulation-Based Evaluation for Real-World LLM Planning with Multifaceted Constraints |
β
|
β |
β
|
β |
β
|
β |
β
|
4 |
| Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| Wonder Wins Ways: Curiosity-Driven Exploration through Multi-Agent Contextual Calibration |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| World Models as Reference Trajectories for Rapid Motor Adaptation |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| World-aware Planning Narratives Enhance Large Vision-Language Model Planner |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| WorldMem: Long-term Consistent World Simulation with Memory |
β
|
β |
β
|
β
|
β |
β |
β
|
4 |
| WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Wukong's 72 Transformations: High-fidelity Textured 3D Morphing via Flow Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| X-Field: A Physically Informed Representation for 3D X-ray Reconstruction |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| X-Mahalanobis: Transformer Feature Mixing for Reliable OOD Detection |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| X2-DFD: A framework for explainable and extendable Deepfake Detection |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| YEAST: Yet Another Sequential Test |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| YOLOv12: Attention-Centric Real-Time Object Detectors |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding |
β |
β |
β
|
β |
β
|
β
|
β |
3 |
| You Can Trust Your Clustering Model: A Parameter-free Self-Boosting Plug-in for Deep Clustering |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| You Only Communicate Once: One-shot Federated Low-Rank Adaptation of MLLM |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| You Only Spectralize Once: Taking a Spectral Detour to Accelerate Graph Neural Network |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ZEBRA: Towards Zero-Shot Cross-Subject Generalization for Universal Brain Visual Decoding |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ZEUS: Zero-shot Embeddings for Unsupervised Separation of Tabular Data |
β |
β
|
β
|
β
|
β |
β |
β
|
4 |
| ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ZeCO: Zero-Communication Overhead Sequence Parallelism for Linear Attention |
β
|
β |
β |
β |
β
|
β |
β
|
3 |
| Zebra-Llama: Towards Extremely Efficient Hybrid Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Zero-Shot Blind-Spot Image Denoising via Cross-Scale Non-Local Pixel Refilling |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| Zero-Shot Context Generalization in Reinforcement Learning from Few Training Contexts |
β
|
β
|
β
|
β |
β
|
β
|
β
|
6 |
| Zero-Shot Detection of LLM-Generated Text via Implicit Reward Model |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| Zero-Shot Performance Prediction for Probabilistic Scaling Laws |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Zero-shot Denoising via Neural Compression: Theoretical and algorithmic framework |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Zero-shot World Models via Search in Memory |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| Zero-shot protein stability prediction by inverse folding models: a free energy interpretation |
β |
β
|
β
|
β |
β |
β |
β
|
3 |
| ZeroPatcher: Training-free Sampler for Video Inpainting and Editing |
β
|
β |
β
|
β
|
β
|
β |
β
|
5 |
| ZeroS: ZeroβSum Linear Attention for Efficient Transformers |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| ZeroSep: Separate Anything in Audio with Zero Training |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| Zeroth-Order Optimization Finds Flat Minima |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| \(\varepsilon\)-Optimally Solving Two-Player Zero-Sum POSGs |
β
|
β
|
β
|
β |
β |
β |
β
|
4 |
| d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| dKV-Cache: The Cache for Diffusion Language Models |
β
|
β
|
β
|
β
|
β
|
β |
β
|
6 |
| iFinder: Structured Zero-Shot Vision-Based LLM Grounding for Dash-Cam Video Reasoning |
β |
β |
β
|
β
|
β
|
β |
β
|
4 |
| macOSWorld: A Multilingual Interactive Benchmark for GUI Agents |
β
|
β
|
β
|
β
|
β
|
β
|
β
|
7 |
| metaTextGrad: Automatically optimizing language model optimizers |
β
|
β
|
β
|
β
|
β |
β |
β
|
5 |
| miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward |
β |
β |
β
|
β
|
β
|
β
|
β
|
5 |
| msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| pLSTM: parallelizable Linear Source Transition Mark networks |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset |
β
|
β
|
β
|
β |
β
|
β |
β
|
5 |
| scMRDR: A scalable and flexible framework for unpaired single-cell multi-omics data integration |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| scPilot: Large Language Model Reasoning Toward Automated Single-Cell Analysis and Discovery |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| scSplit: Bringing Severity Cognizance to Image Decomposition in Fluorescence Microscopy |
β |
β
|
β
|
β |
β
|
β |
β
|
4 |
| seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| un$^2$CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP |
β |
β
|
β
|
β
|
β
|
β |
β
|
5 |
| xLSTM-Mixer: Multivariate Time Series Forecasting by Mixing via Scalar Memories |
β |
β
|
β
|
β
|
β
|
β
|
β
|
6 |
| zip2zip: Inference-Time Adaptive Tokenization via Online Compression |
β |
β
|
β
|
β |
β
|
β
|
β
|
5 |
| π§MOSPA: Human Motion Generation Driven by Spatial Audio |
β |
β |
β |
β
|
β
|
β |
β
|
3 |