Skip to yearly menu bar
Skip to main content
Main Navigation
ICML
Help/FAQ
Contact ICML
Create Profile
Code of Conduct
Privacy Policy
Press
Journal To Conference Track
Careers
Downloads
Inclusion
Future Meetings
My Stuff
Login
Select Year: (2026)
2026
2025
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2002
1996
IMLS Archives
Dates
Calls
Call for Papers
Call for Tutorial Nominations
Call for Position Papers
Call for Workshops
Call for Socials and Mentoring
Call for Tutorials
Call for Expo Presentations
Organization
Organizing Committee
ICML Board
Program Committee
About ICML
Exhibitors
2026 Exhibitors
Portal
Resources
Poster Instructions
Author Instructions
Peer Review FAQ
Peer-review Ethics
Conflict of Interest Definitions
Policy for LLM use in Reviewing
Reviewer Instructions
Area Chair Instructions
Senior Area Chair Instructions
Research Ethics
Attend
Register
Visa Information
Volunteer and Financial Aid
Attending with Children
Hotel
Browse
Visualization
Layout:
mini
compact
topic
detail
×
No topics available
No sessions available
title
author
topic
session
shuffle
by
serendipity
bookmarked first
visited first
not visited first
bookmarked but not visited
Enable Javascript in your browser to see the papers page.
Beyond Accuracy: Latent Perturbations for Cognitive-Aware Diagnosis
Advancing LLM Reasoning with Natural Language and Numerical Feedback
Hedging on the frontier: Learning new tasks with few samples
Hyper-ICL: Attention Calibration with Hyperbolic Anchor Distillation for Multimodal In-Context Learning
Cutting LLM Evaluation Costs with SySRs: A Bandit Algorithm that Provably Exploits Model Similarity
KUMA: A Novel Framework with Koopman Separation and Efficient Multilevel Extraction in Time Series Forecasting
Fast Mixture of Curvature-Aware Experts for Diverse and Dynamic Graph Topologies
Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away
LIMSSR: LLM-Driven Sequence-to-Score Reasoning under Training-Time Incomplete Multimodal Observations
Positive Distribution Shift as a Framework for Understanding Tractable Learning
Estimating the Empowerment of Language Model Agents
Universal Algorithm-Implicit Learning
Statistically Undetectable Backdoors in Deep Neural Networks
Calibrated Test-Time Guidance for Bayesian Inference
A Systematic Study of Behavioral Cloning for Scientific Data Annotation
Investigating Memory in RL with POPGym Arcade
Faster Than Flash: Exploiting Attention Sparsity for Efficient Long-Context Decoding
Zero-Flow Encoders
Towards Foundation Models for Zero-Shot Time Series Anomaly Detection: Leveraging Synthetic Data and Relative Context Discrepancy
Beyond Correctness: Distance-Based Social Dynamics of Multi-Agent Debate
MAnchors: Memorization-Based Acceleration of Anchors via Rule Reuse and Transformation
Locally Coherent Parallel Decoding in Diffusion Language Models
Dynamics Reveals Structure: Challenging the Linear Propagation Assumption
Generative Augmented Inference
Steering at the Source: Style Modulation Heads for Robust Persona Control
Fractional is Better: Learnable Derivative Orders in Neural Operator Learning
Necessary Conditions for Compositional Generalization of Embedding Models
N2M: Bridging Navigation and Manipulation by Learning Pose Preference from Rollout
Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck
CREDIT: Certified Ownership Verification of Deep Neural Networks Against Model Extraction Attacks
Non-Monotonic Autoregressive Sequence Model
Improved Bounds for Private and Robust Alignment
Energy-based Compositional Diffusion Planning
Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning
When Do Diffusion Models learn to Generate Multiple Objects?
Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization
Find, Fix, Reason: Context Repair for Video Reasoning
Position: Quantum Kernel Machines Should Move Beyond Scalar-Valued Kernels to Realize Their Potential
Error Propagation in Dynamic Programming: From Stochastic Control to American Option Pricing
HGMem: Hypergraph-based Working Memory to Improve Multi-step RAG for Long-Context Complex Relational Modeling
Discovering Differences in Strategic Behavior between Humans and LLMs
SEPS: Semantic-Enhanced Patch Slimming Framework for Fine-Grained Cross-Modal Alignment
EDCO: Dynamic Curriculum Orchestration for Domain-specific Large Language Model Fine-tuning
ReLAM: Learning Anticipation Model for Rewarding Visual Robotic Manipulation
The Safety-Aware Denoiser for Text Diffusion Models
Provable Sample Efficiency of Curriculum Post-Training for Transformer Reasoning
IMPACT: Influence Modeling for Open-Set Time Series Anomaly Detection
AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions
Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
Confidence is Not Universal: Task-Dependent Calibration and Emergent Behavior in LLMs
DLM: Unified Decision Language Models for Offline Multi-Agent Sequential Decision Making
MMBench-Live: A Continuously Evolving Benchmark for Multimodal Models
ConFlux: Multivariate Time Series in Flux, One Unified Forecast in Confluence
One-step Latent-free Image Generation with Pixel Mean Flows
GEM: Geometric Entropy Mixing for Optimal LLM Data Curation
CausalX: A Unified and Causally-Interpretable Plug-and-Play Model for Multi-modal Spatio-Temporal Forecasting
MEC: Machine-Learning-Assisted Generalized Entropy Calibration for Semi-Supervised Mean Estimation
Thinking in Flow: A Dissipative Stabilization Operator for Robust Autoregressive Reasoning
EnsembleVLA: Ensemble Learning for Vision-Language Action Models
Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension
Nested birth-death processes are competitive with parameter-heavy neural networks as time-dependent models of protein evolution
Physics-Guided Motion Loss for Video Generation Model
SLAP: The Semantic Least Action Principle for Variational Video-Language Modeling
Immuno-VLM: Immunizing Large Vision-Language Models via Generative Semantic Antibodies for Open-World Trustworthiness
MemEvolve: Meta-Evolution of Agent Memory Systems
Domain Adaptive Object Detection via Dynamic Causal Refinement
DREAM: Dual-Standard Semantic Homogeneity with Dynamic Optimization for Graph Learning with Label Noise
Vision-aligned Latent Reasoning for Multi-Modal Large Language Model
Risk-Averse and Optimistic Advertiser Incentive Compatibility in Auto-bidding
Evolution of Benchmark: Black-Box Optimization Benchmark Design through Large Language Model
Memory as a Markov Matrix: Sample Efficient Knowledge Expansion via Token-to-Dictionary Mapping
MineDraft: A Framework for Batch Parallel Speculative Decoding
A recipe for scalable attention-based ML potentials: unlocking long-range accuracy with all-to-all node attention
Keep Everyone Happy: Online Fair Division of Numerous Items with Few Copies
Efficiently Training Time-to-First-Spike Spiking Neural Networks from Scratch
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
Lost in Context: Discovering Context Anxiety in Large Language Models
Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects
GFedCL: Graph-Based Federated Continual Learning with Spatial and Temporal Awareness
Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density
Embodiment-Conditioned Mixture of Experts Increases the Evolvability of Robots
Learning Compressed Shape-Aware Molecular Representations for Virtual Screening
An Exterior Method for Nonnegative Matrix Factorization
Learning to Remember, Learn, and Forget in Attention-Based Models
Learning Generalizable Skill Policy with Data-Efficient Unsupervised RL
RoboOmni: Actions Are Just Another Modality for Your Vision-Language Models
Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models
Saving Foundation Flow-Matching Priors for Inverse Problems
Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions
Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting
LoRe: Adaptive Interaction-Evaluation Routing with Per-step Interaction Budgets for Iterative Graph Solvers
AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing
Alignment-Guided Score Matching for Text-to-Image Alignment in Diffusion Models
MedMamba: Multi-View State Space Models with Adaptive Graph Learning for Medical Time Series Classification
Latent Diffusion Controller: Framework, Algorithms and Parameterization
MoRGEN: Mixture-of-Resolutions Generative Forecasting for Irregularly Sampled Medical Time-Series Data
When LLMs Encounter Open-world Graph Learning: A Fresh View on Unlabeled Data Uncertainty
LLMInertia: Adaptive Counter-Inertial Reasoning to Improve Evidence Faithfulness in Large Language Models
Protein Circuit Tracing via Cross-layer Transcoders
Regulating Anatomy-Aware Rewards via Trajectory-Integral Feedback for Volumetric Computed Tomography Analysis
FlashOptim: Memory Efficient Optimizers for Large-Scale Training
Position: Stop Preaching and Start Practising Data Frugality for Responsible Development of AI
Learning Cardiac Latent Representations in Vectorcardiogram Space
Kinematics-Driven Gaussian Shape Deformation for Blurry Monocular Dynamic Scenes
Scalable Traffic Signal Control with Shared Policy Framework
A Single Layer to Explain Them All: Understanding Massive Values in Large Language Models
Self-Distillation Enables Continual Learning
Diving into Kronecker Adapters: Component Design Matters
LLM4Cov: Execution-Grounded Agent Learning for High-Coverage Hardware Verification
Full-Batch Gradient Descent Outperforms One-Pass SGD: Sample Complexity Separation in Single-Index Learning
Rethinking Forgery Attacks on Semantic Watermarks in Black-Box Settings: A Geometric Distortion Perspective
WIND: Weather Inverse Diffusion for Zero-Shot Atmospheric Modeling
MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models
The Sign Estimator: Preference Modeling for LLM Alignment under Heterogeneity
Beyond Static Allocation: Dynamic Sensitivity-Aware Fine-Tuning for Vision Transformers
OptiFluence: Principled Design of Privacy Canaries
Toward Training Superintelligent Software Agents through Self-Play SWE-RL
Detecting Fluent Optimization Based Adversarial Prompts via Sequential Entropy Changes
TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation
Active Exploring like a Pigeon: Reinforcing Spatial Reasoning via Agentic Vision-Language Models
Trojan-Speak: Bypassing Constitutional Classifiers with No Jailbreak Tax via Adversarial Finetuning
Robust Cross-Modal Retrieval via Generative Semantic Refinement and Exclusion-Guided Adaptation
Riemannian Diffusion Models on General Manifolds via Physics-Informed Neural Networks
Show, Don't Tell: Morphing Latent Reasoning into Image Generation
Tree-Structured Orthonormal Decomposition of the Aitchison Simplex
LLM Priors for ERM over Programs
An Odd Estimator for Shapley Values
Multi-View Causal Discovery without Non-Gaussianity: Identifiability and Algorithms
Task-and-Model-Aware Fractal-Consistency for Efficient LLM Reasoning
Single-Rollout Hidden-State Dynamics for Training-Free RLVR Data Selection
FIDIA: Function-Informed Sequence Design via Inference-Aligned Policy Optimization
StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation
Credible Information Subset Decomposition: An End-to-End Multi-fidelity Learning Model by Modeling Label Information
MetaphorVU: Towards Metaphorical Video Understanding
RNA-FM: Flow-Matching Generative Model for Genome-wide RNA-Seq Prediction
Divisiveness-Consistent Label Distribution Learning
Capacitated Fair-Range Clustering: Hardness and Approximation Algorithms
Causal-aware Anomaly Detection for Tabular Data
Multi-view Consistent Latent Action Learning for World Modeling and Control
AppWorld-UL: Benchmarking Diverse Agent-User Interactions for Tool-Use
Functional Adjoint Sampler: Scalable Sampling on Infinite Dimensional Spaces
Phy-CoSF: Physics-Guided Continuous Spectral Fields Reconstruction and Spectral Super-Resolution for Snapshot Compressive Imaging
Functional Decomposition and Shapley Interactions for Interpreting Survival Models
SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond
Safe In-Context Reinforcement Learning
Hierarchical Image Tokenization for Multi-Scale Image Super Resolution
TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning
One-step Optimal Transport via Regularized Distribution Matching Distillation
Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution
Gradient Flow Through Diagram Expansions: Learning Regimes and Explicit Solutions
Attn-QAT: 4-Bit Attention With Quantization-Aware Training
Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models
Towards Parameter-Free Temporal Difference Learning
No More K-means: Single-Stage Sparse Coding for Efficient Multi-Vector Retrieval
PRAC: Principal-Random Subspace for LLM Activation Compression and Memory-Efficient Training
Safety Alignment of LMs via Non-cooperative Games
Improved Dynamic Algorithm for Non-monotone Submodular Maximization under Cardinality Constraint
Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models
Graph Neural Networks Are Not Continuous Across Graph Resolutions
Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs
Orthogonal Hierarchical Decomposition for Structure-Aware Table Understanding with Large Language Models
Chamaileon: Cross-Context Binder Design with Contextualized Modeling and Mixed Sampling
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs
HyperMLP: An Integrated Perspective for Sequence Modeling
A Theoretical Game of Attacks via Compositional Skills
Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models
Disentangling a Large Language Model’s Computation from its Chain-of-Thought
AMDP: Asynchronous Multi-Directional Pipeline Parallelism for Large-Scale Models Training
Understanding Multimodal Learning: A Loss Landscape Smoothness Perspective
Are Large Reasoning Models Interruptible?
Little By Little: Continual Learning via Incremental Mixture of Rank-1 Associative Memory Experts
The Implicit Bias of Depth: From Neural Collapse to Softmax Codes
Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games
DRIFT-BENCH: Diagnosing CoopeRative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction
Column Thresholding for Sparse Spiked Wigner Models: Improved Signal Strength Requirements
Optimizing Language Models for Crosslingual Knowledge Consistency
Seeing the Unseen: Physics-as-Representation for Generalizable Gaze Perception
Dispersion Loss Counteracts Embedding Condensation and Improves Generalization in Small Language Models
HieraScaffold: Learning Compact Hierarchical Representations for Scalable 4D LiDAR Generation
EqGINO: Equivariant Geometry-Informed Fourier Neural Operators for 3D Partial Differential Equations
Weakly Supervised Cross-Modal Learning for 4D Radar Scene Flow Estimation
Geometry-Preserving Unsupervised Alignment for Heterogeneous Foundation Models
Learning to Decode Against Compositional Hallucination in Video Multimodal Large Language Models
From Directions to Regions: Decomposing Activations in Language Models via Local Geometry
DataGuard: A Non-intrusive Dataset Auditing Framework via Differential Information Forensics
VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency
Towards Scalable and Consistent 3D Editing
Hierarchical Multi Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation
Statistical Consistency and Generalization of Contrastive Representation Learning
AdaHC: Accelerating Multi-Token Prediction with Adaptive Head Chunking with Pipeline Parallelism
PGC: Peak-Guided Calibration for Generalizable AI-Generated Image Detection
Position: Hallucinations Undermine Trust; Metacognition is a Way Forward
ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning
From Teacher Pathways to Invariant Manifolds: Consensus Subspace Distillation for TSFMs
Position: EU AI Act's Research Exemptions Can Break the Publication Norms of Major AI Conferences
SF-Mamba: Rethinking State Space Model for Vision
Position: Quantum Deep Learning Still Needs a Quantum Leap
HugRAG: Hierarchical Causal Knowledge Graph Design for RAG
SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?
Expo-GS: Exposure-Aware Signed Distance Function in Gaussian Splatting for High Dynamic Range
TokenSwap: Backdoor Attack on the Compositional Understanding of Large Vision-Language Models
Inference Time Concept Removal Guidance for Text-to-Image Diffusion Models
Towards the Explainability of Temporal Graph Networks via Memory Backtracking and Topological Attribution
Memory-Efficient LLM Pretraining via Minimalist Optimizer Design
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Predicting the Order of Upcoming Tokens Improves Language Modeling
Investigating Advanced Reasoning of Large Language Models via Black-Box Interaction
Efficient Public Verification of Private ML via Regularization
AURA: Visually Interpretable Affective Understanding via Robust Archetypes
scDEBART: Predicting in silico Single-Cell Perturbation Responses via Large-Scale Differential Expression Learning
PSG-Nav: Probabilistic Scene Graph Navigation via Multiverse Decision Making
Mechanistic Anomaly Detection via Functional Attribution
Density-Aware Translation of Spurious Correlations in Zero-Shot VLMs
TRACER: Persistent Regularization for Robust Multimodal Finetuning
Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning
AudioMosaic: Contrastive Masked Audio Representation Learning
Per-example Gradients: a New Frontier for Understanding and Improving Optimizers
Rare Event Analysis of Large Language Models
Narrowing the ANN–SNN Gap for 1D Signal Classification with Multi-Scale Temporal Encoding and Sparsity-Regularized Transform Encoding
RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models
STABLE: Simulation-Ready Tabletop Layout Generation via a Semantics–Physics Dual System
VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model
Mitigating the Modality Gap in Vision–Language Models with Fractal Spectral Geometry
MoDA: Modulation Adapter for Fine-Grained Visual Understanding in Instructional MLLMs
Fair-FedMOE: Group-Fair One-Shot Federated Learning via Prototype-Guided Experts for Medical Imaging Analysis
Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction
PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling
Zero Sum SVD: Balancing Loss Sensitivity for Low Rank LLM Compression
Rethinking the Hardness of PbRL: A Provable General Regret Bound
Towards Efficient and Expressive Offline RL via Flow-Anchored Noise-conditioned Q-Learning
Estimating Tail Risks in Language Model Output Distributions
The (Marginal) Value of a Search Ad: An Online Causal Framework for Repeated Second-price Auctions
Sharp Concentration Bounds for Vector Bundle-Valued Statistics on Manifolds
Plain Transformers are Surprisingly Powerful Link Predictors
Resting Neurons, Active Insights: Robustify Activation Sparsity for Large Language Models
Modelling Attention with Aitchison Geometry: Token Distinguishability and Temperature Scaling
Implicit Safety Alignment from Crowd Preferences
Sequential Kernel-based Conditional Independence Testing via Adaptive Betting
Beyond Euclidean Summaries: Online Change Point Detection for Distribution-Valued Data
HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel
MINIM: Privacy-Aware Minimal View for Agents via Trusted Local Sanitization
Tighter Regret Lower Bound for Gaussian Process Bandits with Squared Exponential Kernel in Hypersphere
On Regret Bounds of Thompson Sampling for Bayesian Optimization
BuildArena: A Physics‑Aligned Interactive Benchmark of LLMs for Engineering Construction
Multi-label learning with contrastive cluster self-supervision for 3D hierarchical semantic segmentation
Foreground-Aware Token Routing Vision Transformer for Real-Time Satellite Video Tracking
Scalable Event Cloud Network for Event-based Classification
Position: The Turing-Completeness of Real-World Autoregressive Transformers Relies Heavily on Context Management
Reflector: Internalizing Step-wise Reflection against Indirect Jailbreaks
Think Twice Before You Act: Protecting LLM Agents Against Tool Description Poisoning via Isolated Planning
Automatic Unsupervised Ensemble Outlier Model Selection
Causal Discovery for Irregularly Time Series with Consistency Guarantees
Chain-of-Thought Gradient Descent
Role-Level Inductive Bias for Cross-Task Generalization in Multi-Agent Reinforcement Learning
SwiftPFN: Revisiting Row-Wise Attention–Only Tabular Foundation Models with Adaptive Early Exit
Detecting Contextual Hallucinations in Large Language Models with Frequency-Aware Attention
FusionCell: Cross-Attentive Fusion of Layout Geometry and Netlist Topology for Standard-Cell Performance Prediction
Attribution-Guided and Coverage-Maximized Pruning for Structural MoE Compression
Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs
Recursive Monte-Carlo Tree Search
Testing For Distribution Shifts with Conditional Conformal Test Martingales
Monitoring Monitorability
THETA: Threshold-Based Exclusive Batching for Memory-Bandwidth-Constrained LLM Inference
Distributed Direct Preference Optimization
Learning Efficient Guardrails for Compliance
A proximal ADMM for multiblock problems with block anti-upper triangular constraints
NaviAgent: Graph‑Driven Bilevel Planning for Scalable Tool Orchestration
A Queueing-Theoretic Framework for Stability Analysis of LLM Inference with KV Cache Memory Constraints
VJEPA: Variational Joint Embedding Predictive Architectures as Probabilistic World Models
Plan, Decouple, Assimilate: Physics-Aware Object Insertion in Remote Sensing Imagery
Do Vision and Text Cues Exhibit Evidential Coupling? UFO: A Benchmark for Compositional Multimodal Reasoning in Unified Models
GO-PRE:Goal-Oriented Next-Best-View Selection via Predictive Rendering Entropy for Active 3D Reconstruction
Robust Causal Discovery in Real-World Time Series with Power-Laws
Hierarchical Retrieval at Scale: Bridging Transparency and Efficiency
Fixed Aggregation Features Can Rival GNNs
Artemis: Structured Visual Reasoning for Perception Policy Learning
LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs
Semi-Supervised Noise Adaptation: Transferring Knowledge from Noise Domain
Strategic Candidacy in Generative AI Arenas
Position: The Data Provenance–Parametric Divide in Large Language Models
Position: Explanation Stability Is a Property of the Model–Method Pair, Not the Model
Grounding Functional Similarity by Invariance-Aware Model Stitching
Hyperparameter Transfer with Mixture-of-Expert Layers
The Geometric Mechanics of Contrastive Representation Learning: Alignment Potentials, Entropic Dispersion, and Cross-Modal Divergence
CompleteP for RL: Maintaining Feature Learning When Scaling Deep Reinforcement Learning
Trajectory Seriation via Spectral Tangent Alignment and Global Embedding
A Fast and Soft Pattern Matcher for Trillion-Scale Corpus
Reward Shaping Control Variates for Off-Policy Evaluation Under Sparse Rewards
Online Linear Programming for Multi-Objective Routing in LLM Serving
Seizure-Semiology-Suite($S^3$): A Clinically Multimodal Dataset, Benchmark, and Models for Seizure Semiology Understanding
Mirror Descent Actor Critic via Bounded Advantage Learning
Geometrically Constrained Outlier Synthesis
When does predictive inverse dynamics outperform behavior cloning?
Q-Flow: Stable and Expressive Reinforcement Learning with Flow-based Policy
ML-Embed: Inclusive and Efficient Embeddings for a Multilingual World
MeshTok: Efficient Multi-Scale Tokenization for Scalable PDE Transformers
Towards Trustworthy and Identifiable Virtual Face Generation
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining
Adaptive Node Feature Selection for Graph Neural Networks
RAIGen: Rare Attribute Identification in Text-to-Image Generative Models
MOC: Multi-Order Communication in LLM-based Multi-Agent Systems
3DGS-HPC: Distractor-free 3D Gaussian Splatting with Hybrid Patch-wise Classification
Distinguishing Imitation Error from Intrinsic Motion Learning Difficulty
Branching Diffusion for Point Processes in Time and Space
DreamDojo: A Real-Time Robot World Model from Large-Scale Human Videos
Test-Time Detoxification without Training or Learning Anything
MMPD-Bench: Bridging Multimodal Fission with Multi-Polarimetric Modalities Decomposition
Population-Aware Imitation Learning in Mean-field Games with Common Noise
Safeguarded Stochastic Polyak Step Sizes for Non-smooth Optimization: Robust Performance Without Small (Sub)Gradients
Bayesian Meta-Learning with Expert Feedback for Task-Shift Adaptation through Causal Embeddings
Adaptive Sharpness-Aware Minimization with a Polyak-type Step size: A Theory-Grounded Scheduler
Sampled hard labels from sparse targets mislead rotation invariant algorithms
TRACER: Trajectory Risk Aggregation for Critical Episodes in Agentic Reasoning
When Diffusion Language Models Hesitate: Detecting and Correcting Visual Hallucinations via Confidence Fluctuation
ScaleMoE: Mixture-of-Experts for Scalable Continuous Control in Actor-Critic Reinforcement Learning
Position: LLM Serving Needs Mathematical Optimization and Algorithmic Foundations, Not Just Heuristics
Self-Guidance: Enhancing Neural Codecs via Decoder Manifold Alignment
Scale-Aware Domain Harmonization for Domain Adaptation Person Search
Learnability-Informed Fine-Tuning of Diffusion Language Models
Stabilizing In-Context Multi-Source Domain Adaptation for Biomedical Images Through Controls
RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies
On the Limits of LLM Adaptability: Impact of LLM Pre-Training on Annotation Task Performance
Discretely-Refined Multi-view Clustering via Aligned Anchor Learning
Training-free Composition of Pre-trained GFlowNets for Multi-Objective Generation
Modeling Covariate Transition for Efficient Estimation of Longitudinal Treatment Effects in Randomized Experiments
FlowNar: Scalable Streaming Narration for Long-Form Videos
Towards Effective Waste Segmentation for Automated Waste Recycling in Cluttered Background
Sharper Generalization Guarantees for Asynchronous SGD: Beyond Lipschitzness, Smoothness and Data Homogeneity
Rethinking Efficient Graph Coarsening via a Non-Selfishness Principle
SpaCeFormer: Space-Curve Transformer for Open-Vocabulary 3D Instance Segmentation without Proposals
NeUQI: Near-Optimal Uniform Quantization Parameter Initialization for Low-Bit LLMs
Less Precise Can Be More Reliable: A Systematic Evaluation of Quantization’s Impact on VLMs Beyond Accuracy
Asymmetric Contrastive Objectives for Efficient Phenotypic Screening
Improving Few-Shot Design Optimization By Exploiting Auxiliary Information
Masks Can Be Distracting: On Context Comprehension in Diffusion Language Models
LLM4Branch: Large Language Model for Discovering Efficient Branching Policies of Integer Programs
A Consensus Anchor-guided Hypergraph Framework For Incomplete Multi-view Clustering
Open Materials Generation with Inference-Time Reinforcement Learning
Arboreal Neural Network
Dual-stage Contrastive Learning-enhanced Multi-view Variational Clustering
PrivGate: Steering Contextual Integrity in LLMs via Latent Space Geometry
Transformers learn factored representations
StormInsight: Hierarchical Environmental Forcing and Vertical Coupling for Weather System Evolution
ManifoldKV: Training-Free KV Cache Compression via Euclidean Outlier Detection
Optimal Estimation of Continuous Treatment Effects with Kernel Ridge Regression
Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining
Structure-Aware Riemannian Flow Matching for Registration and Fusion of Hyperspectral and Multispectral Images
dgMARK: Decoding-Guided Watermarking for Diffusion Language Models
A Task-centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula
CSG: Cognitive Structure Generation for Intelligent Education
SimpleMem: Efficient Lifelong Memory for LLM Agents
Entropy-informed Decoding: Adaptive Information-Driven Branching
Spatially-Regularized Entropy for Discriminative Token Merging in Fine-Grained Re-Identification
Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation
SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads?
Vibe Checker: Aligning Code Evaluation with Human Preference
Time-Consistent Robust Multi-Objective Reinforcement Learning via a Bellman–Isaacs Weight-Adversary Recursion
M+Adam: Low-Precision Training via Mantissa–Exponent Optimization
Density-Guided Continuous Flow for Robust Counterfactual Explanations
MMKU-Bench: A Multimodal Update Benchmark for Diverse Visual Knowledge
Near-Minimax Multi-Objective RL under Predictable Adversarial Preferences and Preference-Free Exploration in Linear MDPs
TsLLM: Augmenting LLMs for General Time Series Understanding and Prediction
DirectEdit: Step-Level Accurate Inversion for Flow-Based Image Editing
AnyMod-LLVE: Low-Light Video Enhancement with Modality-Agnostic Inference
Identifying Partially Observed Causal Models from Heterogeneous/Nonstationary Data
Token-Free Hierarchical Indexing for RAG beyond LLM-based Summarization
Context-Driven Incremental Compression for Multi-Turn Dialogue Generation
Revisiting Padded Transformer Expressivity: Which Architectural Choices Matter and Which Don't
Convex Dataset Valuation for Post-Training
HumanLM: Simulating Users with State Alignment Beats Response Imitation
QPoint: End-to-End Lightweight Point Cloud Processing via Robust Quaternion Feature Learning
SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models
When Agents Go Rogue: Activation-Based Detection of Malicious Behaviors in Multi-Agent Systems
Investigating Component Contributions in Multi-Agent ML Systems
Position: Stop Reactively Patching Your Model Every Time and Start Proactive Test-Driven AI Development
AlphaRouter: Token-level Routing Between SLM and LLM with Reinforcement Learning and Tree Search
Semi-Supervised Neural Super-Resolution for Mesh-Based Simulations
SERA: Soft-Verified Efficient Repository Agents
ScaleErasure: Inference-Time Minimal Intervention for Precise Concept Erasure in Next-Scale Autoregressive Image Generation
Efficient Learning of Compositional Targets with Hierarchical Spectral Methods
A Hitchhiker's Guide to Poisson Gradient Estimation
Multimodal Function Vectors for Spatial Relations
Quantitative Estimation of Target Task Performance from Unsupervised Pretext Task in Semi/Self-Supervised Learning
GeoReward: Mitigating Contextual Variable Overestimation in Vision-Language Models for Cross-Market Preference Prediction
SynLaD: Latent Diffusion for Generating Synthesizable Molecules Conditioned on 3D Pharmacophore Profiles
Stabilizing Recurrent Dynamics for Test-Time Scalable Latent Reasoning in Looped Language Models
GenExam: A Multidisciplinary Text-to-Image Exam
Coupled Trigger Optimization and Vulnerable Parameter Alignment for Persistent Backdoor Attacks on Federated Learning
The Value of Variance: Mitigating Debate Collapse in Multi-Agent Systems via Uncertainty-Driven Policy Optimization
Toward Subspace-Perturbed Trajectory-Aware Backdoor Attacks in Deep Reinforcement Learning
Active Learning with Foundation Model Priors: Efficient Learning under Class Imbalance
Fingerprinting Pre-trained Encoders under Arbitrary Downstream Fine-Tuning via Adversarial Shifting
Learning in Bayesian Stackelberg Games With Unknown Follower's Types
Deep Incentive Design with Differentiable Equilibrium Blocks
OSF: On Pre-training and Scaling of Sleep Foundation Models
SleepLM: Natural-Language Intelligence for Human Sleep
Query-Based Asymmetric Modeling with Decoupled Input–Output Rates for Speech Restoration
Probing Cross-modal Information Hubs in Audio-Visual LLMs
HEARTS: Benchmarking LLM Reasoning on Health Time Series
Graph-Link: Bridging the Semantic-Structural Gap in Text-to-SQL via Constrained Subgraph Induction
Benign Overfitting in Adversarial Training for Vision Transformers
TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios
Understanding Private Learning From Feature Perspective
Toward Structural Multimodal Representations: Specialization, Selection, and Sparsification via Mixture-of-Experts
ReSpinQuant: Efficient Layer-Wise LLM Quantization via Subspace Residual Rotation Approximation
Joint Learning in the Gaussian Single Index Model
Frontier Models Can Take Actions at Low Probabilities
Profiling the Irrational Agent: Cognitive Modeling of LLM Behaviors in Sequential Jailbreaks
Algorithmic Recourse of In-Context Learning for Tabular Data
GKD-Recruiter: Jointly Modeling Social and Task Heterogeneity for Spatial Crowdsourcing via Graph Knowledge Distillation
Heterogeneity-Aware Knowledge Sharing for Graph Federated Learning
Mixture of Horizons in Action Chunking
A Cartesian-3j and nj Framework for Machine Learning Interatomic Potentials
Spatial Deconfounder: Interference-Aware Deconfounding for Spatial Causal Inference
DyCon: Dynamic Reasoning Control via Evolving Difficulty Modeling
ANCHOR: Abductive Network Construction with Hierarchical Orchestration for Reliable Probability Inference in Large Language Models
Post-Training with Policy Gradients: Optimality and the Base Model Barrier
FRIGID: Scaling Diffusion-Based Molecular Generation from Mass Spectra at Training and Inference Time
Efficient Prediction of SO(3)-Equivariant Hamiltonian Matrices via SO(2) Local Frames
Neural Logistic Bandits
Convolutional Learnable-Group Weightless Neural Network
LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws
Order Matters in Retrosynthesis: Structure-aware Generation via Reaction-Center-Guided Discrete Flow Matching
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
Convex Optimization for Alignment and Preference Learning on a Single GPU
TelecomTS: A Multi-Modal Observability Dataset for Time Series and Language Analysis
Optimal Fair Aggregation of Crowdsourced Noisy Labels using Demographic Parity Constraints
Where Detectors Fail: Probing Generative Space for Generalizable AI-Generated Image Detection
Temporal Context Reinstatement Drives Episodic-Like Order Memory in Long-Context Language Models
Covariance Volume Maximization for Embodied Latent Exploration in Deep Reinforcement Learning
When Preference Labels Fall Short: Aligning Diffusion Models from Real Data
FlatLand: Personalized Graph Federated Learning via Tailored Lorentz Space
When Random Saliency Looks Trained: Architectural Center Bias in CNN Interpretability
Certified Robustness under Heterogeneous Perturbations via Hybrid Randomized Smoothing
Conditional Equivalence of DPO and RLHF: Assumptions, Failure Modes, and Provable Alignment
DC-Leap: Training-Free Acceleration of dLLMs via Draft-Guided Contiguous Leaping Decoding
FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation
Bi-Anchor Interpolation Solver for Accelerating Generative Modeling
From Player to Master: Enhancing Test-Time Learning of LLM Agents via Reinforcement Learning over Memory
SAEs-BrainMap: Unveiling the Emergence of Specialized Concepts in Deep Models via Brain Alignment
LLM-Guided Diagnostic Evidence Alignment for Medical Vision–Language Pretraining under Limited Pairing
Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation
Compositional Behavioral Semantics and Metrics for State Abstraction in Reinforcement Learning
Solving the Offline and Online Min-Max Problem of Non-smooth Submodular-Concave Functions: A Zeroth-Order Approach
Benchmarking and Evolving Reason-Reflect-Rectify for Reflective Visual Generation
Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy
MemCast: Memory-Driven Time Series Forecasting with Experience-Conditioned Reasoning
Reward Modeling from Natural Language Human Feedback
Protein Language Model Embeddings Improve Generalization of Implicit Transfer Operators
Private and Stable Test-time Adaptation with Differential Privacy
CoGenCast: A Coupled Autoregressive–Flow Generative Framework for Time Series Forecasting
Planar Symmetric Pattern Generation
AgentSteerTTS: A Multi-Agent Closed-Loop Framework for Composite-Instruction Text-to-Speech
When Labelers Stay Silent: The Power of Ties in Cost-Effective Preference Learning
TT-Sparse: Learning Sparse Rule Models with Differentiable Truth Tables
FedPAT: Federated Test-Time Adaptation via Prototype Affinity Topology
RLSF-V: Mitigating Hallucinations in MLLMs via Fuzzy Semantic Self-Feedback
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories
Weights to Code: Extracting Interpretable Algorithms from the Discrete Transformer
AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions
pTNAS: Progressive Neural Architecture Search for Tabular Data
DOUBT: Decoupled Object-level Understanding and Bridging via vMF-based Trustworthiness for Hallucination Detection in MLLMs
Confidence and Difficulty-Adaptive Policy Optimization for LLM Reasoning
Conformal Prediction for Early Stopping in Mixed Integer Optimization
Breaking the Scale Barrier: One-Shot Knowledge Transfer via Frequency Transform
Weak Diffusion Priors Can Still Achieve Strong Inverse-Problem Performance
SLQ: Bridging Modalities via Shared Latent Queries for Retrieval with Frozen MLLMs
Mitigating Noise-Induced Layout Priors for Object Counting in Diffusion Models
SCOUT: Cyclic Causal Discovery Under Soft Interventions with Unknown Targets
GRASP: Graph Reasoning via Agentic Solving and Probing of LLMs
Learning to Reconfigure: Co-designing Reconfigurable robots for Heterogeneous Locomotion
Variational Learning for Insertion-based Generation
Interventional Processes For Causal Uncertainty Quantification
TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches
Outcome-Aware Spectral Feature Learning for Instrumental Variable Regression
On Uniform Error Bounds for Kernel Regression under Non-Gaussian Noise
An In-Depth Study on Deep Learning Model Cloning
Incremental Transformer Neural Processes
Sonar-TS: Search-Then-Verify Natural Language Querying for Time Series Databases
Concept Concentration for Faithful Representation Intervention
Quantum latent distributions in deep generative models
Connecting Independently Trained Modes via Layer-Wise Connectivity
Simple Unbiased Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures
Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design
FuseFSS: Efficient Secure LLM Inference with Function Secret Sharing
MetaDNS: Enhancing Exploration in Discrete Neural Samplers via Metadynamics
Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization
Generalized Schrödinger Bridge on Graphs
LynX: Token Interface Alignment for Video+X LLMs
QUATRO: Query-Adaptive Trust Region Policy Optimization for LLM Fine-tuning
HVAE: Hyperbolic Variational Autoencoder For Flexible Knowledge Transfer Across Multiple Domains
Discrete Adjoint Schrödinger Bridge Sampler
LieStoNet: Learning Lie Symmetries from Spatiotemporal Data for Stochastic Dynamical Systems
OneSearch: A Preliminary Exploration of the Unified End-to-End Generative Framework for E-commerce Search
Drift is a Sampling Error: SNR-Aware Power Distributions for Long-Horizon Robotic Planning
Beyond Euclidean Clipping: Overcoming Exploration Collapse in LLM RL via Riemannian Isometric Policy Optimization
BALLAST: Bayesian Active Learning with Look-ahead Amendment for Sea-drifter Trajectories under Spatio-Temporal Vector Fields
Toward Scalable and Valid Conditional Independence Testing with Spectral Representations
Geometric Flow Grounding: A Unified Manifold Decoupling Framework for Dynamics Discovery and Verification
MM-Snowball: Evaluating and Mitigating Hallucination Snowballing in Multimodal Multi-turn Dialogue
Why ReLU? A Bit-Model Dichotomy for Deep Network Training
HDTree: Generative Modeling of Cellular Hierarchies for Robust Lineage Inference
Position: Hippocampal Explicit Memory Is a Cornerstone to Human-Level AI
Hugging Carbon: Quantifying the Training Carbon Emissions of AI Models at Scale
Capability-Oriented Training Induced Alignment Risk
Position: Beyond Prediction: Toward Verifiable Physiological Waveform Reasoning with Foundation Models and Agentic LLMs
Revisiting OOD Generalization in Programmatic RL
Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models
Focusing Where Vision Matters: Selective Training for Large Vision Language Models via Visual Information Gain
Cross-Modal Semantic Decoupling and Transfer for Text-to-Visible-Infrared Person Re-Identification
NeuroCLUS: A Foundation Model with Functional Clustering for Intracranial Neural Decoding
Evaluating Parameter Efficient Methods for RLVR
KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Controls
Toward Calibrated Mixture-of-Experts Under Distribution Shift
TabMGP: Martingale Posterior with TabPFN
PolarDepth: Monocular Transparent Object Depth from Polar-Physics Priors
Looking Locally: Object-Centric Vision Transformers as Foundation Models for Efficient Segmentation
Must All Negatives Be Pushed Away Equally? Uncertainty-Aware Cross-View Geo-Localization via Normal Inverse Gamma Distribution
Reduction of Probabilistic Chemical Reaction Networks
No Retraining at Edge: Efficient Resource-Aware Mixed-Precision Quantization via Federated Supernet Learning
Differentially Private Geodesic Regression
Position: Scale is a False Promise for Endangered Languages
Return-Critic: Bridging Goal Discrepancy for Efficient Visual Reinforcement Learning
Position: Multi-Agent Explainability Needs Contracts Before Methods
Hyper-LLaVA: Hyperbolic Uncertainty-aware Modality-Balanced Routing for Multimodal Continual Instruction Tuning
ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns
The Cost of Learning under Multiple Change Points
Sentinel-VLA: A Metacognitive VLA Model with Active Status Monitoring for Dynamic Reasoning and Error Recovery
PSMix: Robust Point Cloud Recognition through Spectral Domain Mixing
Spectral Imbalance Causes Forgetting in Low-Rank Continual Adaptation
Sequential Group Composition: A Window into the Mechanics of Deep Learning
Divide and Conquer: Reliable Multi-View Evidential Learning for Deepfake Detection
Which LLM Multi-Agent Protocol to Choose?
From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas
Recursive Models for Long-Horizon Reasoning
Probabilistic Retrofitting of Learned Simulators
Riemannian Dueling Optimization
L-CUBE: Isolating Long-Context Capacity from Knowledge with Controllable Mutual Information Scaling
The Expressivity Limits of Transformers
A theory of learning data statistics in diffusion models, from easy to hard
Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs
MotionMAR: Multi-scale Auto-Regressive Human Motion Reconstruction from Sparse Observations
Position: Bridge the Gaps between AI Development and Regulation
World Guidance: World Modeling in Condition Space for Action Generation
IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning
Spectral Evolution Search: Efficient Inference-Time Scaling for Reward-Aligned Image Generation
DecAEvolve: Decompose, Adapt, and Evolve, or, Three Pillars of Effective LLM-based Scientific Equation Discovery
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
When RL Meets Adaptive Speculative Training: A Unified Training-Serving System
Grounded in Reality: Learning and Deploying Proactive LLM from Offline Logs
Position: Safe AI Should be Resistant and Resilient in an Evolving World
Disentangling meaning from language in LLM-based machine translation
Selecting Samples on Graphs: A Unified Dataset Pruning Framework for Lossless Training Acceleration
FedHPro: Federated Hyper-Prototype Learning via Gradient Matching
Beyond Tokens: Enhancing RTL Quality Estimation via Structural Graph Learning
FormAct: Agentic Source Editing for Rich-Format Document Generation
Rethinking Evaluation Paradigms in IBP-based Certified Training
Position: We need to re-think the concept of “real” images.
NExT-Guard: Training-Free Streaming Safeguard without Token-Level Labels
Abstraction Induces the Brain Alignment of Language and Speech Models
Quantifying Frontier LLM Capabilities for Container Sandbox Escape
PinTok: Tokenizers Deserve Dedicated Pinned CPU-Compute and Memory
SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification
Behavior-Invariant Task Representation Learning with Transformer-based World Models for Offline Meta-Reinforcement Learning
SetPO: Set-Level Policy Optimization for Diversity-Preserving LLM Reasoning
AnyCanvas: Potential Field Guidance for Training-Free Spatial Control in Text-to-Image Diffusion
Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models
Uncertainty-Constrained Trustworthiness for Graph Learning
REViT: Roto-reflection Equivariant Convolutional Vision Transformer
Efficient DP-SGD for LLMs with Randomized Clipping
Adaptive Physics Transformer with Fused Global-Local Attention for Subsurface Energy Systems
Fixed Budget is No Harder Than Fixed Confidence in Best-Arm Identification up to Logarithmic Factors
PSBench: Editing Image via GUI Agents in Photoshop
Optimality of FSQ tokens for continuous diffusion for categorical data with application to text-to-speech
On the origin of neural scaling laws: from random graphs to natural language
GenShield: Unified Detection and Artifact Correction for AI-Generated Images
Breaking the Capacity Bottleneck in Model-Heterogeneous Federated Learning via Gradual Model Restoration
Networked Information Aggregation for Binary Classification
Counterfactual Residual Data Augmentation for Regression
DynaTok: Token-Based 4D Reconstruction from Partial Point Clouds
UFO: Chain-of-Evaluation for Omni-Condition Alignment in Multi-Modal Image Generation
Zeus: Towards Tuning-Free Foundation Model for Time Series Analysis
Pull Requests as a Training Signal for Repo-Level Code Editing
Beyond Model Ranking: Predictability-Aligned Evaluation for Time Series Forecasting
Conf-Gen: Conformal Uncertainty Quantification for Generative Models
Memory Caching: RNNs with Growing Memory
ATLAS: Learning to Optimally Memorize the Context at Test Time
Learning Long Range Spatio-Temporal Representations over Continuous Time Dynamic Graphs with State Space Models
Q-SAM: Unlocking Sharpness-Aware Minimization for Generalization in Offline Reinforcement Learning
3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Toward Identifiable Sparse Autoencoders
OPTION: Optimal Transport–Guided Flow Matching for Incomplete and Unaligned Multi-View Clustering
On the Coordination of Value-Maximizing Bidders
FormalRx: Rectify and eXamine Semantic Failures in Autoformalization
CauSciBench: Evaluating LLM Causal Inference for Scientific Research
MoCo-EA: Exploiting Adversarial Mode Connectivity for Efficient Evolutionary Attacks
Spatially-Adaptive Gradient Re-parameterization for 3D Large Kernel Optimization
PLANTAIN: Plan-Answer Interleaved Reasoning
CADFit: Precise Mesh-to-CAD Program Generation with Hybrid Optimization
Provably Adaptive Linear Approximation for the Shapley Value and Beyond
Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents
Offline Preference Optimization for Rectified Flow with Noise-Tracked Pairs
Spherical Procrustes Alignment for Reliable Medical Audio Diagnosis
Sample Complexity Bounds for Robust Mean Estimation with Mean-Shift Contamination
Partial Identification under High-Dimensional Potential Outcomes and Confounders via Optimal Transport
Feasible Fusion: Constrained Joint Estimation under Structural Non-Overlap
MarketSim: Simulating Stock Markets with Large-Scale Generative Agents
Consistency Training Is Not Neutral to Alignment
Large Language Model Teaches Visual Students: Cross-Modality Transfer of Fine-Grained Conceptual Knowledge
DocHop: Benchmarking Out-of-domain Multi-hop Reasoning in Information-Dense Documents
LearniBridge: Learnable Calibration of Feature Caching for Diffusion Models Acceleration
Off-Policy Evaluation Beyond Overlap under Network Interference
Gated Relational Alignment via Confidence-based Distillation for Efficient VLMs
MetaBio: Learning from metadata for bioacoustics foundation models
Causal Representation Learning with Optimal Compression and Complex Treatments
Bioacoustic Geolocation: Species Sounds as Geographic Signals
Manifold-Optimal Guidance: A Unified Riemannian Control View of Diffusion Guidance
Position: The Term “Machine Unlearning” Is Overused in LLMs
Approximation Error Upper and Lower Bounds for Hölder Class with Transformers
Treatment Responder Classification with Abstention
MLUBench: A Benchmark for Lifelong Unlearning Evaluation in MLLMs
Budgeted Active Experimentation for Treatment Effect Estimation from Observational and Randomized Data
Polaris: Coupled Orbital Polar Embeddings for Hierarchical Concept Learning
Neuromem: A Granular Decomposition of the Streaming Lifecycle in External Memory for LLMs
What Makes a Strong Model? A Unified Spectral Analysis of Knowledge Transfer over High-dimensional Linear Regression
MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts
Is Fixing Schema Graphs Necessary? Full-Resolution Graph Structure Learning for Relational Deep Learning
Leveraging Evidence Priors for Robust Prompt Learning under Noisy Supervision in Vision-Language Models
Learning Reward–Cost Balance in Safe RL via Score-Based World Models
AOEB: Benchmarking Agent-Oriented Multimodal Embeddings
Towards Understanding Continual Factual Knowledge Acquisition of Language Models: From Theory to Algorithm
Zero-shot Active Mapping via Fused 360-BEV Representations and Vision–Language Models
Interactive Person Retrieval via Multi-Turn Multimodal Conversation
AdamO: A Collapse-Suppressed Optimizer for Offline RL
HypoSpace: A Diagnostic Benchmark for Set-Valued Hypothesis Generation under Underdetermination and Sublinear Coverage Bounds
Test-Time Guidance for Flow-Based Generative Models via Parallel Tempering on Source Distributions
MACD: Model-Aware Contrastive Decoding via Counterfactual Data for Video-LLMs
Reliable Neighborhood-Aware Multi-View Outlier Detection
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing
See First, Reason Later: Mutual Information-Guided Reinforcement Learning for Vision-Language Models
Episodic Memory-Guided Controllable Experience Synthesis for Reinforcement Learning
Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training
SIPO: Stabilized and Improved Preference Optimization for Aligning Diffusion Models
SJD-SV: Speculative Jacobi Decoding with Semantics Verification for Autoregressive Image Generation
Resolving Blind Inverse Problems under Dynamic Range Compression via Structured Forward Operator Modeling
Efficient Multi-round LLM Inference over Disaggregated Serving
From Volume to Value: Preference-Aligned Memory Construction for On-Device RAG
Neural Vector Lyapunov–Razumikhin Certificates for Delayed Interconnected Systems
Breaking Manifold Continuity: Vector Quantized Modeling for Real-Centric Deepfake Detection
Plug-and-Play Diffusion Meets ADMM: Dual-Variable Coupling for Robust Medical Image Reconstruction
Causal Effect Identifiability in the Presence of Latent Confounders Without Auxiliary Variables
Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models
mHC: Manifold-Constrained Hyper-Connections
SPARD: Defending Harmful Fine-Tuning Attack via Safety Projection with Relevance–Diversity Data Selection
Dynamics of neural scaling laws in random feature regression with powerlaw-distributed kernel eigenvalues
An Efficient Joint Learning Approach for Item Response Theory
Provably Protecting Fine-Tuned LLMs from Training Data Extraction
Steady-State Behavior of Constant-Stepsize Stochastic Approximation: Gaussian Approximation and Tail Bounds
PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching
MV-FGAD: Towards Efficient and Effective Federated Graph Anomaly Detection via Multi-view Learning
Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units
Any-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
CUARewardBench: Benchmark for Evaluating Reward Models on Computer-using Agent Trajectories
Ski Rental with Distributional Predictions of Unknown Quality
Fair Classification with Efficient and Post-hoc Controllable Fairness-Accuracy Trade-off
DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model
Context Forcing: Consistent Autoregressive Video Generation with Long Context
DAVE: Distribution-aware Attribution via ViT Gradient Decomposition
Knowledge Diversion for Efficient Morphology Control and Policy Transfer
Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR
LaST$_{0}$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model
OmniShow: Orchestrating Multimodal Conditions for Human-Object Interaction Video Generation
Kronecker Generative Networks: A General Neural Architecture for Parameter-Efficient Learning Across Classification Tasks
RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
Focus, Align, and Sustain: Counteracting Gradient Dilution in Incremental Object Detection
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning
Structured Multi-step Jailbreaking under a Hamiltonian Generative Formulation
Optimal Decision-Making Based on Prediction Sets
Statistical Early Stopping for Reasoning Models
Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion
Singular Vectors of Attention Heads Align with Features
Fast Non-Episodic Finite-Horizon RL with K-Step Lookahead Thresholding
Smooth Multi-Policy Causal Effect Estimation in Longitudinal Settings
ConTSG-Bench: A Unified Benchmark for Conditional Time Series Generation
ReCoG: Relational and Compact Context Graph Learning for Few-shot Molecular Property Prediction
What if Tomorrow is the World Cup Final? Counterfactual Time Series Forecasting with Textual Conditions
Vision Transformer Finetuning Benefits from Non-Smooth Components
TPGDiff : Hierarchical Triple-Prior Guided Diffusion for Image Restoration
SpanNorm: Reconciling Training Stability and Performance in Deep Transformers
Skip-It? Theoretical Conditions for Layer Skipping in Vision–Language Models
Breaking Multi-Task Curse: Reward-Weighted Evolution for Black-Box Many-Task Optimization
Beyond Looking Up, Try Looking Around: Harmonizing Global Structure and Local Consistency in Optimal Transport for Short Text Clustering
Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks
LEGO-FL: Learning Heterogeneous Federated Models as a LEGO Assembly Games
Position: Adopting AI in Practice Does Not Guarantee the Productivity Boost
Di-BiLPS: Denoising induced Bidirectional Latent-PDE-Solver under Sparse Observations
Revisiting Regularized Policy Optimization for Stable and Efficient Reinforcement Learning in Two-Player Games
Navigating the Energy Landscape of Collaboration: Multi-Agent Communication Graph Generation via Score-Based Diffusion
Distribution Alignment for One-Shot Federated Learning via Optimal Transport
Video-SVD: Efficient Video Diffusion via Orthogonal Basis Composition
Equilibrium Reasoners: Learning Attractors Enables Scalable Reasoning
Efficient Inference for Noisy LLM-as-a-Judge Evaluation
Path-Decoupled Hyperbolic Flow Matching for Few-Shot Adaptation
PRISM: Distribution-free Adaptive Computation of Matrix Functions for Accelerating Neural Network Training
Amodal Instance Segmentation with IRAIS Dataset for Sim-to-Real Transfer
Decoy for the Judge: Disrupting Multi-Turn Jailbreaks using Semantics-Preserving Output Rewriting
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
A Random Matrix Perspective on the Consistency of Diffusion Models
Fast k-means Seeding Under The Manifold Hypothesis
Convergence Rate of the Last Iterate of Stochastic Proximal Algorithms
WildCat: Near-Linear Attention in Theory and Practice
SD-MoE: Spectral Decomposition for Effective Expert Specialization
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents
Multi-Head Attention as a Source of Catastrophic Forgetting in MoE Transformers
Beyond Majority Voting: Self-Reflective Test-Time Reinforcement Learning for LLM Reasoning
FlowCloud: Learning Continuous Spatiotemporal Dynamics from Unpaired Sparse Point Cloud Snapshots
Spectra: Rethinking Optimizers for LLMs Under Spectral Anisotropy
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Continual GUI Agents
Mesh Based Simulations with Spatial and Temporal awareness
Same Question, Different Lies: Cross-Context Consistency (C³) for Black-Box Sandbagging Detection
SWING: Unlocking Implicit Graph Representations for Graph Random Features
Meerkat-VL: Implicit Risk Safety Alignment in Multimodal LLMs via Perceptual Reasoning and Self-Verification
Bridging the Gap Between Average and Discounted TD Learning
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
CPMöbius: Iterative Coach–Player Reasoning for Data-Free Reinforcement Learning
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
Normalized Energy Models for Linear Inverse Problems
UniRRM: Unified Reasoning Reward Models Across Languages and Evaluation Paradigms
Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
Position: Your VLM May Not Be Thinking with Interleaved Images
Scaling Beyond Masked Diffusion Language Models
Neural Honeytrace: Plug&Play Watermarking Framework against Model Extraction Attacks
Position: Preparing for AI Systems That Deceive Developers
Conditional Diffusion Sampling
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
Optimal Attention Temperature Improves the Robustness of In-Context Learning under Distribution Shift in High Dimensions
Correcting Overparameterization Effects in Fair Empirical Risk Minimization
Graph is a Substrate Across Data Modalities
Learning a Generative Meta-Model of LLM Activations
FairSSL: Fair Multimodal Self-Supervised Learning
Rethinking Loss Reweighting for Imbalance Learning as an Inverse Problem: A Neural Collapse Point of View
Fix the Loss, Not the Radius: Rethinking the Adversarial Perturbation of Sharpness-Aware Minimization
Time-PEFT: Temporal and Multichannel Complexity-Based Fine-Tuning for Time-Series Foundation Models
From Poisoned to Aware: Fostering Backdoor Self-Awareness in LLMs
Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning
One-Way Policy Optimization for Self-Evolving LLMs
Position: Digital Agents Require Unified Agent-Native Environments
Efficient Parallel Samplers for Recurrent-Depth Models
Stable Localized Conformal Prediction via Transduction
Alignment-Aware Decoding
Trust-Region Diffusion Policies for Massively Parallel On-Policy RL
PAWS: Preference Learning with Advantage-Weighted Segments
Position: Accountable Deployment of Agentic AI Demands Layered, System-Level Interpretability
Feed-Forward Taylor-Gaussians-Flow: Towards Non-uniform Motion for Novel View Synthesis from Monocular Video
A Minimax Approach for Optimal Intervention Policy Learning with Two-Stage Outcomes
Catch-22: On the Fundamental Tradeoff Between Detectability and Robustness in LLM Watermarking
Linguistic Properties and Model Scale in Brain Encoding: From Small to Compressed Language Models
Quantifying and Optimizing Simplicity via Polynomial Representations
Scaling Long-Horizon Agent via Context Folding
Efficient Multi-modal Dataset Distillation via Analytic Parameter Matching
Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations
Faster Query-Key Learning Sharpens Attention in Self-Attention Models
Full-Spectrum Graph Neural Network: Expressive and Scalable
Practical Mechanism for Fault-Tolerant Spiking Neural Networks via Simple Input Control Based on Learnable Fragmentation
Improving the Sensitivity of Backdoor Detectors via Class Subspace Orthogonalization
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
RL-SPH: Learning to Achieve Feasible Solutions for Integer Linear Programs
Implicit Action Chunking for Smooth Continuous Control
LiteVSR: Enabling Cross-Domain Fine-Grained Detail Generation in Light-Weight Transformers for Video Super-Resolution
STAR: Rethinking MoE Routing as Structure-Aware Subspace Learning
DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization
Similarity Is Not Logic: Factored Inference for Dual-Encoder Vision-Language Models
Locate then Correct: Debiasing Attention Heads in CLIP
STAR-KV: Low-Rank KV Cache Compression via Soft Thresholding for Adaptive Rank Control
WinQ: Accelerating Quantization-Aware Training of Large Language Models around Saddle Points
Adversarial Robustness of Implicit Neural Representation-Based Classifiers
Q-Delta: Beyond Key–Value Associative State Evolution
CURVE: Learning Causality-Inspired Invariant Representations for Robust Scene Understanding via Uncertainty-Guided Regularization
Keeping a Secret Requires a Good Memory: Space Lower-Bounds for Private Algorithms
Position: Assistive Agents Need Accessibility Alignment
Revisiting Uncertainty: On Evidential Learning for Partially Relevant Video Retrieval
Bottleneck-Guided Spectral Subgoals For Offline Goal-Conditioned RL
Adaptive Recurrent Message Passing for Test Time Computing on Graphs
InfoPO: Information-Driven Policy Optimization for User-Centric Agents
From Internal Diagnosis to External Auditing: A VLM-Driven Paradigm for Data-Free Online Backdoor Defense
Enhancing LLMs for Graph Tasks via Graph-aware LoRA Generation
ActiveScope: Actively Seeking and Correcting Perception for MLLMs
FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
Minimizing Upper Confidence Bounds: A Data-Driven Framework for Stochastic Programming
Calibrating Decision Robustness via Inverse Conformal Risk Control
Cascaded Flow Matching for Heterogeneous Tabular Data with Mixed-Type Features
Push, Pop, Parallelize: Stack-Augmented Linear Attention via the Delta Rule
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration
Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections
AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines
Mosaic: Unlocking Over 30$\times$ Context Length for Diffusion LLMs Inference via Global Memory Planning and Dynamic Peak Taming
InteractComp: Evaluating Search Agents With Ambiguous Queries
TexEditor: Structure-Preserving Text-Driven texture Editing
Robust Federated Learning Against Adaptive Compression
Expand Neurons, Not Parameters
MEDUSA: Motion Elimination in Diffusion Using Spectral Attack
DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning
Guided Star-Shaped Masked Diffusion
Contrastive Order Learning: A General Framework for Ordinal Regression
Explainable Federated Learning via Global–Local Attribution Alignment
Real-Time Aligned Reward Model beyond Semantics
Stochastic Order Learning: An Approach to Rank Estimation Using Noisy Data
Backjump-on-Graph: Empowering LLMs with Reinforced Retrospective Exploration for Agentic KG Reasoning
The Relative Instability of Model Comparison with Cross-validation
DistMatch: Adaptive Binning via Distribution Matching for Robust Sequential Conformal Prediction
Baguan-TS: dual in-context learning model for time series forecasting with covariates
Zeroth-Order Optimization at the Edge of Stability
Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation
SALAAD: Sparse And Low-Rank Adaptation via ADMM for Large Language Model Inference
Self-Prophetic Decoding to Unlock Visual Search in LVLMs
Diffusion Language Model Parallel Decoding via Product-of-Experts Bridge
Improving Diffusion Planners by Self-Supervised Action Gating with Energies
Neural QAOA$^2$: Differentiable Joint Graph Partitioning and Parameter Initialization for Quantum Combinatorial Optimization
GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning
Learning Unmasking Policies for Diffusion Language Models
ProAct: A Benchmark and Multimodal Framework for Structure-Aware Proactive Response
Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation
ALSO: Adversarial Online Strategy Optimization for Social Agents
GeoDM: Geometry-aware Distribution Matching for Dataset Distillation
Generalizing Multi-Scale Time-Series Modeling with a Single Operator
View Space: Learning Representation across Arbitrary Graphs
Towards the Training of Deeper Predictive Coding Neural Networks
From Absolute to Relative: Rethinking Reward Shaping in Group-Based Reinforcement Learning
AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction
EcoVLA: Environment-Aware Adaptive Pruning with Interleaved Inference Orchestration for Vision-Language-Action Models
Position: Agentic AI Is a Foreseeable Pathway to AGI
On the Computational Complexity of Performative Prediction
Generative Large Neighborhood Search: Scalable Set Cover Optimization via Discrete Diffusion
EgoTactile: Learning Grasp Pressure for Everyday Objects from Egocentric Video
UniScale: Adaptive Unified Inference Scaling via Online Joint Optimization of Model Routing and Test-Time Scaling
Learning Multi-Scale Hypergraph for High-Order Brain Connectivity Analysis
Derivative Informed Learning of Exchange-Correlation Functionals
Rethink the Role of Neural Decoders in Quantum Error Correction
Regularized Offline Policy Optimization with Posterior Hybrid Bayesian Belief
Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching
Noise as a Natural Regularizer in Markov Decision Processes: Connecting Environmental Stochasticity and Policy Simplicity
Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints
PLoRA: Efficient Concurrent LoRA Training for Large Language Models
Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning
Efficient and Uncertainty-Aware Diffusion Framework for Offline-to-Online Reinforcement Learning
LAVA: A Unified Framework for Finetuning Language and Vision Models
Doubly Outlier-Robust Online Infinite Hidden Markov Model
FiSeR: Fine-Grained Source Representations for Cross-Domain AI Image Detection
Unsupervised Mode Discovery for Fine-tuning Multimodal Generative Policies
Entropy-Aware Dynamic KV Cache Sparsification for Autoregressive Image Generation and Editing
Geometry-Misalignment in Distributional Learning
Multilingual Safety Alignment Via Sparse Weight Editing
TAG: Tangential Amplifying Guidance for Hallucination-Resistant Sampling
Adversarial Attack and Defense for Denoising Diffusion Sampling
Scaling Prompt Synthesis for Large Language Model Reasoning
Multi-agent imitation learning with function approximation: linear Markov games and beyond
SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning
IVQA-LD: Inclusive Multimodal Understanding for Population with Limb-Deficiency
On the Role of Batch Size in Stochastic Conditional Gradient Methods
Agent Primitives: Reuseable Latent Building Blocks for Multi-Agent Systems
Evaluating Agentic Optimization on Large Codebases
Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation
Learning Context-Conditioned Predicate Semantics via Prototype Feedback
Innovation: An Almost Characterization of Hallucination
RiboSphere: Learning Unified and Efficient Representations of RNA Structures
Geometry-Correct Diffusion Posterior Sampling with Denoiser-Pullback Curvature Guidance and Manifold-Aligned Damping
Stability and Generalization of Nonconvex Optimization with Heavy-Tailed Noise
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering
In-Context Generation with Regional Constraints for Instructional Video Editing
SLIM: Secure and Efficient Inference for Large Language Models on Untrusted Devices via TEEs
FairGB: A Fair Granular-Ball Generation Method for Data Classification
Caracal: Causal Architecture via Spectral Mixing
Position: Peer Review Should Be Calibrated via LLM Scoring
SAGE-NAS: Synergizing LLM-Based Semantic Agent with Graph-Based Evaluator for Neural Architecture Search
Inference-time optimization for experiment-grounded protein ensemble generation
NanoSpec: Accelerating Speculative Decoding using Minimalist In-Context Vocabularies
UltraHorizon: Benchmarking LLM-Agent Capabilities in Ultra Long-Horizon Scenarios
FAB: A First-Order AB-based Gradient Algorithm for Distributed Bilevel Optimization over Time-Varying Directed Graphs
“very likely” Means “uncertain”? How LLMs Diverge from Humans in Linguistic Uncertainty Quantification
Controlling the Risk of Corrupted Contexts for Language Models via Early-Exiting
Self-Refining Video Sampling
Mode Seeking meets Mean Seeking for Long Video Generation
CLEAR: Context-Aware Learning with End-to-End Mask-Free Inference for Adaptive Subtitle Removal
A Tale of Two Problems: Multi-Task Bilevel Learning Meets Equality Constrained Multi-Objective Optimization
Don't Overthink with Pixels: Efficient Reasoning for Segmentation
DART: Distribution-Aware Adaptive Relational Transfer for Adversarial Attacks against Closed-Source MLLMs
Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
Transform Trained Transformer for Accelerating Native 4K Video Generation
Spectral Gradient Descent Mitigates Anisotropy-Driven Misalignment: A Case Study in Phase Retrieval
Safety Generalization Under Distribution Shift in Safe Reinforcement Learning: A Diabetes Testbed
How does information access affect LLM monitors' ability to detect sabotage?
FreeRet: MLLMs as Training-Free Retrievers
MALICE: Memory-aware Loop Invariants Generation on Symbolic Execution Traces
Navigating the Flatlands: Dual Adaptive Sharpness-Aware Minimization for Domain Generalization
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training
CodeMamba: Shifting from Target Semantics to Self-Supervised Background Manifold Learning for Singularity Detection in Infrared Sequences
Learning Structured Reasoning via Tractable Trajectory Control
Non-Stationary Online Structured Prediction with Surrogate Losses
Distortion of AI Alignment Revisited: RLHF is a Decent Utilitarian Aligner
ThunderAgent: A Fast, Simple, and Program-Aware Agentic Inference System
Event2Vec: Processing neuromorphic events directly by representations in vector space
Frequency Matching in Spiking Neural Networks for mmWave Sensing
Quantum Algorithms for Triangle Cut Sparsification
SafeDec: Constrained Decoding for Safe Autoregressive Generalist Robot Navigation Policies
Mean Flow Distillation: Robust and Stable Distillation for Flow Matching Models
A Control-Theoretic View of Mamba on Stability and Robustness
Conformal Path Reasoning: Trustworthy Knowledge Graph Question Answering via Path-Level Calibration
Decoupling Universal Laws and Environmental Heterogeneity: A Physics-Inspired Framework for Robust Spatio-Temporal Forecasting
On the Theory of Continual Learning with Gradient Descent for Neural Networks
GEMQ: Global Expert-Level Mixed-Precision Quantization for MoE LLMs
WeatherSyn: An Instruction Tuning MLLM For Weather Forecasting Report Generation
Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge
Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution
RobuQ: Pushing DiTs to W1.58A2 via Robust Activation Quantization
The Truth Lies Somewhere in the Middle (of the Generated Tokens)
SAM Audio: Segment Anything in Audio
Unsupervised Camouflaged Object Detection with Dual-Eigenvector Spectral Pseudo-Labeling and Contrastive Refinement
(Be Cautious!) Bio-Foundation Models Are Not Yet Robust to Biologically Plausible Perturbations and ML Transformations
Pluralistic Leaderboards
RetrOrchestrator: A Multi-Step Retrosynthesis Agent Dynamically Orchestrating Single-Step Transition Models
Ripple Perturbations Through Structure: Likelihood-Constrained Adversarial Attacks on Heterogeneous Tabular Data
VSCD: Video-based Scene Change Detection in Unaligned Scenes
Flexibility-Aware Geometric Latent Diffusion for Full-Atom Peptide Design
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations
SC-FAGC: Size Constrained Fast Anchor-based Graph Clustering
L2G-NET: Local to Global Spectral Graph Neural Networks via Cauchy Factorizations
Can LLMs Reason Structurally? Benchmarking via the lens of Data Structures
The Role of Target Update Frequencies in Q-Learning
Invariant Representation Learning for Source-Free Time Series Forecasting with LLM-Centric Proxy Denoising
SpatialJB: How Text Distribution Art Becomes The "Jailbreak Key" for LLM Guardrails
Instruction Lens Score: Your Instruction Contributes a Powerful Object Hallucination Detector for Multimodal Large Language Models
Spike Camera Autofocus via Frequency-Domain Spectral-Centroid Migration
Bridging Time and Frequency: A Joint Modeling Framework for Irregular Multivariate Time Series Forecasting
Energy-Structured Low-Rank Adaptation for Continual Learning
TaskLoom: Weaving Knowledge Across Tasks in World Models
Bio-Vision-Inspired Spiking Neural Networks for Object Detection with Event Cameras
Efficient Reasoning with Hidden Thinking
Bridging the Gap in Autonomous Science: The Corpus and Benchmark for Biological Protocol Reasoning
Multi-timescale Reinforcement Learning by Value Reconstruction
Discovering Implicit Large Language Model Alignment Objectives
The Power of Power Law: Asymmetry Enables Compositional Reasoning
Hyperbolic Neural Operator
Solving Spatial-Spectral Fusion with Latent Spectral Operators
SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection
Var-JEPA: Variational Joint-Embedding Predictive Architecture – Bridging Predictive and Generative Self-Supervised Learning
Reliable Confidence Alignment for Generalized Category Discovery
EEG-Based Multimodal Learning via Hyperbolic Mixture-of-Curvature Experts
MTNL: A Unified Modeling Perspective for Enhancing Tensor Network Learning
Greedy Coordinate Diffusion: Effective and Semantically Coherent Adversarial Attacks via Diffusion Guidance
FOAM: Blocked State Folding for Memory-Efficient LLM Training
JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments
Remove the Ambiguity: Few-shot Multimodal Anomaly Detection Using Crossmodal Feature Replacers
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
Information Flow Reveals When to Trust Language Models
NeuronCtrl: Geometry-Aware Safe Closed-Loop Generative Control for Neuronal Microenvironment Dynamics
Learning Manifold and Itô Dynamics with Branched Neural Rough Differential Equations
Long Grounded Thoughts: Synthesizing Grounded Visual Problems and Distilling Reasoning Chains at Scale
Position: Child Safety Necessitates New Approaches to AI Safety
CGSVD: Cascaded Granular Singular Value Decomposition for Large Language Model Compression
Membership Inference Attacks for Unseen Classes
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem
SegPVSG: Panoptic Video Scene Graph Generation via Temporal Focusing and Generative Augmentation
Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key to Unification
Reliable Thinking with Images
NoiseSDF2NoiseSDF: Learning Clean Neural Fields from Noisy Supervision
Regularized Discriminative Alignment for Deep Representations under Label Shift
Questioning the Coverage-Length Metric in Conformal Prediction: When Shorter Intervals Are Not Better
Generalist Graph Anomaly Detection via Prototype-Based Distillation
Safe and Scalable Web Agent Learning via Recreated Websites
Multi-Integration of Labels across Categories for Component Identification (MILCCI)
TiMi: Empower Time Series Transformers with Multimodal Mixture of Experts
HERMES: Towards Efficient and Verifiable Mathematical Reasoning in LLMs
Attention Sinks in Diffusion Transformers: A Causal Analysis
Stabilizing Reinforcement Learning for Diffusion Language Models
GCIB: Graph Contrastive Information Bottleneck for Multi-Behavior Recommendation
Understanding MARS: When Scaling Momentum Provably Helps
Stabilizing PPO via Latent-Space Regularization and KDE-Driven Exploration
Real-Time Monitoring and Calibration of Chain-of-Thought Sycophancy in Large Reasoning Models
PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks
daVinci-Dev: Agent-native Mid-training for Software Engineering
Rashomon Sets of Falling Trees
SubspacePath Pruner: Inference-time Pruning via Probe-based Representation–Parameter Coupling
CLIP Tricks You: Training-free Token Pruning for Efficient Pixel Grounding in Large Vision-Language Models
Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse
Beyond Logits: Coherent Hallucination Mitigation via Attention Contrastive Decoding
Two Calm Ends and the Wild Middle: A Geometric Picture of Memorization in Diffusion Models
XDomainBench: Diagnosing Reasoning Collapse in High-Dimensional Scientific Knowledge Composition
A Direct Second-Order Method for Solving Two-Player Zero-Sum Games
Localizing Memorized Regions in Diffusion Models via Coordinate-Wise Curvature Differences
Learning Adaptive Topology with FiLM-Guided Distillation for Tertiary Structure-Based RNA Design
Learning with Admissibility: Robust Fuzzy Hashing for Cross-Modal Retrieval with Noisy Labels
Embedding Hybrid Systems into Continuous Latent Vector Fields
AIR: Post-training Data Selection for Reasoning via Attention Head Influence
DRIVE: Distributional and Retrieval-Augmented Bidding with Value Evaluation
Hyperbolic Hierarchical Alignment for Video-Based Visible-Infrared Person Re-Identification
Reinforcement Learning with Action-Triggered Observations
LUCID: Attention with Preconditioned Representations
Turning Drift into Constraint: Robust Reasoning Alignment in Non-Stationary Multi-Stream Environments
Correspondence Cognitive Learning for Multi-Modal Object Re-Identification
FS-I2P: A Hierarchical Focus–Sweep Registration Network with Dynamically Allocated Depth
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing
Hide&Seek: Learning to explain in an end-to-end differentiable network
Large Scale Manifold Balanced Clustering
Towards Steering without Sacrifice: Principled Training of Steering Vectors for Prompt-only Interventions
VR-Thinker: Boosting Multimodal Reward Models through Think with Image Reasoning
A Recursive Decomposition Framework for Causal Structure Learning in the Presence of Latent Variables
Negatives-Dominant Contrastive Learning for Generalization in Imbalanced Domains
dnaHNet: A Scalable and Hierarchical Foundation Model for Genomic Sequence Learning
EAKV: An Entropy-Driven Adaptive KV Compression Framework for Long Video Understanding
ParisKV: Fast and Drift-Robust KV-Cache Retrieval for Long-Context LLMs
AesFormer: Transform Everyday Photos into Beautiful Memories
CFPO : Counterfactual Policy Optimization For Multimodal Reasoning
FUSE: Full‑spectrum Unlearnable Examples via Spectral Equalization
Sobolev Regularized Score Difference Estimation in Diffusion Models
Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance
XRPO: Pushing the Limits of GRPO with Targeted Exploration and Exploitation
Position: We Need Practical AI Alignment Methods that Mirror Human Reasoning
Mitigating Manifold Departure: Uncertainty-aware Subspace Rectification for Trustworthy MLLM Decoding
When Single Answer Is Not Enough: Rethinking Single-Step Retrosynthesis Benchmarks for LLMs
Regret-Based Federated Causal Discovery with Unknown Interventions
G$^2$RPO: Geometric GRPO; Escaping LLM's Reasoning Rut to Break Accuracy--Entropy Trade-off
What Information Matters? Graph Out-of-Distribution Detection via Tri-Component Information Decomposition
GFMate: Empowering Graph Foundation Models with Pre-training-agnostic Test-time Prompt Tuning
Active Attacks: Red-teaming LLMs via Adaptive Environments
Break the Block: Dynamic-size Reasoning Blocks for Diffusion Large Language Models via Monotonic Entropy Descent with Reinforcement Learning
NanoFLUX: Distillation-Driven Compression of Large Text-to-Image Generation Models for Mobile Devices
CyberCycle: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities
Adaptive Symmetry Discovery for Dynamical System Identification
Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning
CodeTaste: Can LLMs Generate Human-Level Code Refactorings?
Causal Disentangled Anchor Learning for Scalable Fair Multi-view Clustering
ScalingAR: Scaling Confidence for Autoregressive Image Generation
EMFormer: Efficient Multi-Scale Transformer for Accumulative Context Weather Forecasting
In-Context Learning as Rate–Distortion Optimization
HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning
PointCHR: Point Cloud Analysis via Curvature-Aware Hyperbolic Rectification
ConEx: Human-Interpretable Saliency Maps via Concept-Aware Attribution
EpiCache: Episodic KV Cache Management for Long-Term Conversation on Resource-Constrained Environments
Stop the Flip-Flop: Context-Preserving Verification for Fast Revocable Diffusion Decoding
DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning
UnHype: CLIP-Guided Hypernetworks for Dynamic LoRA Unlearning
MotiMotion: Motion-Controlled Video Generation with Visual Reasoning
[CLS] is Not Enough: Multi-Label Recognition via Patch-Level Inference and Adaptive Aggregation
Capacity without Access: Reinterpreting the Mid-Depth Spectral Plateau in LLMs
What Makes a Good Representation for Single-Cell Perturbation Prediction?
Learning to Theorize the World from Observation
The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
The Truth Stays in the Family: Enhancing Contextual Truthfulness via Inherited Heads in Model Lineages
GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent
HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models
MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers
Fisher-Preserving Guidance: Training-Free Manifold Constraints for Safe Diffusion Control
Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
Diversity Matters: Revisiting Test-Time Compute in Vision-Language Models
scChord: A Probabilistic Manifold Rectification Framework for RNA-to-Protein Translation
Less is More: Geometric Unlearning for LLMs with Minimal Data Disclosure
Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning
Mirror Descent Under Generalized Smoothness
Layer-wise Gradient Disentanglement: Decoupling Semantics and Preferences in Direct Preference Optimization
Hierarchical Decision Making with Structured Policies: A Principled Design via Inverse Optimization
Adaptive Probe-based Steering for Robust LLM Jailbreaking
Mitigating Plasticity Loss through Architectural Design in Continual Learning
Leveraging Lineage Barcodes as Natural Augmentations for Contrastive Learning of Cell Fate in scRNA-seq Data
Best-of-Both-Worlds for Heavy-Tailed Markov Decision Processes
Kuramoto Oscillatory Phase Encoding: Neuro-inspired Synchronization for Improved Learning Efficiency
DPO Unchained: Your Training Algorithm is Secretly Disentangled in Human Choice Theory (and Its Loss' Convexity is Dispensable)
Stabilized Supralinear Networks Learn to Switch Coding Strategies Balancing Cost and Performance
ASTRA: Communication-Efficient Acceleration for Multi-Device Transformer Inference
Difference-Aware Decision Learning for Multimodal Image Fusion
ConsMSA: Semantic Distribution Consistency Learning for Multimodal Sentiment Analysis
Robust Vision-Language Models via Manifold-Adversarial Adapters
GeoMoLa: Geometry-Aware Motion Latents for Learning Robust Manipulation Policies
MultiBreak: A Scalable and Diverse Multi-turn Jailbreak Benchmark for Evaluating LLM Safety
TrustworthyQENN: A Quantum Evidential Neural Network Based on Complex-Valued Contrastive Learning for Uncertainty Pattern Classification
Sample Efficient Full-Finetuning of Generative Control Policies
Towards Execution-Grounded Automated AI Research
IBMA: Information Bottleneck-Based Multimodal Alignment
Dependency-Aware Parallel Decoding via Attention for Diffusion LLMs
GenCircuit-RL: Reinforcement Learning from Hierarchical Verification for Genetic Circuit Design
Reflex: Real-Time Vision-Language-Action Control through Streaming Inference
Preserve-Then-Quantize: Balancing Rank Budgets for Quantization Error Reconstruction in LLMs
STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits
Temporal Weighted Encoding: Towards Maximal-Capacity Spike Coding for ANN–SNN Conversion
When less gives more: bias from small dataset can speed up training
Emergent Communication Under Misinformation
Pessimistic Verification for Open-Ended Math Questions
Continual Learning through Control Minimization
EduMirror: Modeling Educational Social Dynamics with Value-driven Multi-agent Simulation
Dynamic Symmetric Point Tracking: Tackling Non-ideal Reference in Analog In-memory Training
Data-Source Adaptive Online Learning under Heteroscedastic Noise
Flatness-Aware Stochastic Gradient Langevin Dynamics
A Probabilistic Framework for LLM-Based Model Discovery
Tempora: Characterising the Time-Contingent Utility of Online Test-Time Adaptation
Rethinking Sparse Mixture of Experts from a Unified Perspective
Eigenvectors of Experts are Training-free Non-collapsing Routers
Finding the Minimal Parameter Budget for Implicit Reasoning: A Data Complexity Driven Scaling Law for Language Models
Probabilistic Robustness Certificates against Adversarial Attacks
PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception
Impact of Connectivity on Laplacian Representations in Reinforcement Learning
Interpreting and Enhancing Emotional Circuits in Large Vision-Language Models via Cross-Modal Information Flow
ADEPT: RL-Aligned Agentic Decoding of Emotion via Evidence Probing Tools — From Consensus Learning to Ambiguity-Driven Emotion Reasoning
Generalization and Scaling Laws for Mixture-of-ExpertsTransformers
Flow Equivariant World Models: Structured Memory for Dynamic Environments
Contrastive Flow Map Matching
Tackling Fake Forgetting through Uncertainty Quantification
Exploring Nonlinear Pathway in Parameter Space for Machine Unlearning
Distributional Alignment Games for Answer-Level Fine-Tuning
FunPhase: A Periodic Functional Autoencoder for Motion Generation via Phase Manifolds
Variational Adapter for Cross-modal Similarity Representation
EPSVec: Efficient and Private Synthetic Text Generation via Dataset Vectors
Local Constrained Bayesian Optimization
PortraitRL: Reinforcement Learning for Personalized Portrait Pose Transfer with Multi-Objective Reward Modeling
Contrastive Representation Regularization for Vision-Language-Action Models
Towards Uniformity and Alignment for Multimodal Representation Learning
Future Dynamic 3D Reconstruction: A 3D World Model with Disentangled Ego-Motion
Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control
Linear Regression with Unknown Truncation Beyond Gaussian Features
MORALISE: A Structured Benchmark for Moral Alignment in Visual Language Models
Adversarial Attacks and Robust Training for Hypergraph Neural Networks
Decouple and Cache: KV Cache Construction for Streaming Video Understanding
LightAVSeg: Lightweight Audio-Visual Segmentation
RAPNet: Accelerating Algebraic Multigrid with Learned Sparse Corrections
CryoACE: An Atom-centric Framework for Accurate and Automated Model Building in Cryo-EM
Assistive Prompt Mediation: Evaluating Language Models Under Accessibility Constraints
SIMoE: A Probabilistic Framework for Cardinality-Constrained Routing in Mixture-of-Experts
TimeSpot: Benchmarking Geo-Temporal Understanding in Vision–Language Models in Real-World Settings
ReflFlow: Learning Geometry-Guided Ray Tracing for Dynamic Specular Reconstruction
Keep It in Mind: User Centric Continual Spatial Intelligence Reasoning in Egocentric Video Streams
Sharp Inequalities between Total Variation and Hellinger Distances for Gaussian Mixtures
Learning Biophysical Models of Large-Scale Multineuronal Data To Enable Precise Neurostimulation
Systematic Failures in Collective Reasoning under Distributed Information in Multi-Agent LLMs
Strategy-Aware Optimization Modeling with Reasoning LLMs
RouterInterp: Understanding Superposed Specialisation in Mixture of Experts Routing
Beyond Sunk Costs: Boosting LLM Pre-training Efficiency via Orthogonal Growth of Mixture-of-Experts
Hierarchical Abstract Tree for Cross-Document Retrieval Augmented Generation
Does Reasoning Improve Seeing? Understanding When Vision-Language Models Benefit from Thinking
Scalable RF Simulation in Generative 4D Worlds
Less is More: Neuroscience-Motivated Probing for Efficient Concept Circuits Tracing
Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in Large Language Models
Calibrating Uncertainty for Zero-Shot Adversarial CLIP
Evolutionary Generation of Multi-Agent Systems
UI2Code^N: UI-to-Code Generation as Interactive Visual Optimization
Training-Free Bayesian Filtering with Generative Emulators
Towards a Unified Generative Model for Scarce Time Series with Domain Experts
Provable Benefits of RLVR over SFT for Reasoning Models: Learning to Backtrack Efficiently
VisionWebDev: A Hierarchical Benchmark for Visual Website Development with Agent Verification
Conformal Risk-Averse Decision Making with Action Conditional Guarantee
State-Dependent Safety Failures in Multi-Turn Language Model Interaction
When Search Goes Wrong: Red-Teaming Web-Augmented Large Language Models
OBJVanish: Prompt-Driven Generation of Physically Realizable 3D LiDAR-Invisible Objects
Steering Large Language Models through the DMTA Cycle: Structure-Based Drug Design via Knowledge-Driven Bi-Level Thompson Sampling
SPR: A Structured Prompt Refinement Network for Modality Missing
PGD-NO: A Neural Operator with Precomputed Geometry Decomposition for 3D Million-Scale physics simulations
Doc-to-LoRA: Learning to Instantly Internalize Contexts
Coupled Variational Reinforcement Learning for Language Model General Reasoning
CLINIC: Towards High-quality Graph Out-Of-Distribution Detection
Reinforcement Learning for Non-Verifiable Problems
Polishing-Only Policies in Peer Reviews are Currently Not Enforceable
UltraLIF: Fully Differentiable Spiking Neural Networks via Ultradiscretization and Max-Plus Algebra
Probabilistic Performance Guarantees for Multi-Task Reinforcement Learning
Towards Functional Correctness of Large Code Models with Selective Generation
Provably Data-driven Lagrangian Relaxation for Mixed Integer Linear Programming
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom
Discrete Survival Knowledge Distillation for Competing Risks Analysis
Twins: Learn to Predict Unified Representations with Focal Loss
Provably Data-driven Multiple Hyper-parameter Tuning with Structured Loss Function
Weasel: Out-of-Domain Generalization for Web Agents via Importance-Diversity Data Selection
ABCD: All Biases Come Disguised
T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning
Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions
Making Learner Weakness Actionable for Learning from Demonstration with Novice Teachers
Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Reasoning Models
EchoingPixels: Aliasing-Resistant Joint Token Reduction for Audio-Visual LLMs
CoMem: Context Management with A Decoupled Long-Context Model
Multilingual Safety Alignment via Representation-Space Separability
From Parameters to Feature Space: Task Arithmetic for Backdoor Mitigation in Model Merging
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Unsupervised Disentanglement Without Compromises : How Functional Orthogonality Enforces Identifiability
Maximum Likelihood Reinforcement Learning
Image-to-Brain Signal Generation for Visual Prosthesis with CLIP Guided Multimodal Diffusion Models
Logit Distance Bounds Representational Similarity
DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone
VDW-GNNs: Vector diffusion wavelets for geometric graph neural networks
On the Robustness of Langevin Dynamics to Score Function Error
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
Euclean: Automated Geometry Problem Formalization with Unified Verification in Lean
PhyScene3D: Physically Consistent 3D Interactive Tabletop Scene Generation
Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs
Speedup Patch: Learning a Plug-and-Play Policy to Accelerate Embodied Manipulation
Forget by Uncertainty: Orthogonal Entropy Unlearning for Quantized Neural Networks
Robust Signal Enhancement via Fractional Detail Views and Knowledge Guided Multi-view Fusion
Rethinking Contrastive Learning for Graph Collaborative Filtering: Limitations and A Simple Remedy
From Representation to Action: A Unified Laplacian Framework for Spatial Representation and Path Planning
Who Gets Credit or Blame? Attributing Accountability in Modern AI Systems
Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
LSGQuant: Layer-Sensitivity Guided Quantization for One-Step Diffusion Real-World Video Super-Resolution
A Kinetic-Energy Perspective of Flow Matching
SGERA: Stein-Guided ECG-Report Alignment for ECG Representation Learning
Social Hippocampus Memory Learning
The Efficiency Gap in Byte Modeling
Harnessing Spectrum Video for Subject-Level Few-Shot and Cross-Montage EEG Generalization
Fine-to-Coarse Fairness-Informed Multi-View Clustering
VT-Bench: A Unified Benchmark for Visual-Tabular Multi-Modal Learning
Reconstructing Template-Memorized Images from Natural Prompts
RADAR: Redundancy-Aware Diffusion for Multi-Agent Communication Structure Generation
FineFocus: Benchmarking and Improving Fine-Grained Text-to-Image Alignment via Paired Reinforcement Learning
Learning Coupled Continuous-Time Latent Dynamics from Irregular Events
Winformer: Transcending Pairwise Similarity for Time-series Generation
DistFlow: A Fully Distributed RL Framework for Scalable and Efficient LLM Post-Training
SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer
SpecPrune-VLA: Accelerating Vision-Language-Action Models via Action-Aware Self-Speculative Pruning
AI Engram: In Search of Memory Traces in Artificial Intelligence
Latent Collaboration in Multi-Agent Systems
AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning
Rethinking Low-Confidence Pseudo Labels: Influence-Aware Semi-Supervised Fine-Tuning for Hyperspectral Change Detection
Utility Boundary of Dataset Distillation: Scaling and Coverage Laws
Auto-regressive In-context Demonstration Selection
Regret Pre-training: Bridging Prior and Posterior Views for Enhanced Knowledge Grounding
Online Learning with Recency: Algorithms for Sliding-window Streaming Multi-armed Bandits
Interpretable Discovery of One-parameter Subgroups: A Modular Framework for Elliptical, Hyperbolic, and Parabolic Symmetries
Distilling Task-Level Coordination Policies for Generalizable Multi-Agent Cooperation
CLoVE: Personalized Federated Learning through Clustering of Loss Vector Embeddings
Mixing Expertise with Confidence: A Mixture of Expert Framework for Robust Multi-Modal Continual Learner
On the Infinite Width and Depth Limits of Predictive Coding Networks
Transferable Reinforcement Learning via Probabilistic Latent Embeddings and Dynamic Policy Adaptation for Sim-to-Real Deployment
xLSTM Distillation: Achieving Teacher-Student Parity Through Efficient Hybrid Architectures
A Generalist Pair-wise Progress Critic Model for Vision-Language-Action Robots
Beyond Independence: Learning Correlated Views for Variational Incomplete Multi-View Clustering
Black-Box Assisted Regression: Phase Transitions and Minimax Optimality
Bridging Scaling Laws to On-Policy Reinforcement Learning via Adaptive Batch Scaling
UniSVQ: 2-bit Unified Scalar-Vector Quantization
Semantic Editing with Coupled Stochastic Differential Equations
Turbo4DGen: Ultra-Fast Acceleration for 4D Generation
CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering
Active Learning with Low-Rank Structure for Data Selection
HVR-Met: A Hypothesis-Verification-Replaning Agentic System for Extreme Weather Diagnosis
FiRE: Fine-grained Ranking Evaluation for Machine Translation
UniRTL: Unifying Code and Graph for Robust RTL Representation Learning
ArcDAE: Asymmetric Rectified Contrastive Diffusion Autoencoder for Unified Representation Learning
Criterion-Conditional In-Context Learning: Evaluating Criterion-Shift Adaptation in Vision-Language Models
BrainJanus: A Foundation Model for Unified Understanding and Generation across Brain, Vision, and Language
Hierarchical Policy Learning via Spectral Decomposition
DRL-STAF: A DRL Framework for State-aware Forecasting of Complex Multivariate Hidden Markov Process
Diffeomorphism-Equivariant Neural Networks
MDN: Parallelizing Stepwise Momentum for Delta Linear Attention
RelaxFlow: Text-Driven Amodal 3D Generation
Lightning Unified Video Editing via In-Context Sparse Attention
Approximating Drift-Diffusion Models for User Decisions under Nudging and External Information
AReaL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models
Optimizing Few-Step Generation with Adaptive Matching Distillation
Towards Achieving Optimal Strong Regret and Constraint Violation via Computational Efficient Model-free RL
Exploring Data-Free LoRA Transferability for Video Diffusion Models
LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation
UB-SMoE: Universally Balanced Sparse Mixture-of-Experts for Resource-adaptive Federated Fine-tuning of Foundation Models
Dynamic Compression Flows for Neuroscience Data
The Stability of Singular Distribution: A Spectral Perspective on the Two-Phase Dynamics of Language Model Pre-training
Noisy-Space Policy Gradient for Diffusion Policies in Offline Reinforcement Learning
Mosaic: Runtime-Efficient Multi-Agent Embodied Planning
Meta-learning Structure-Preserving Dynamics
MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
Multi-Way Representation Alignment
Extra-Merge: Tracing the Rank-1 Subspace of Model Merging in Language Model Pre-Training
Breaking the Factorization Barrier in Diffusion Language Models
Position: Beyond Reasoning Zombies — AI Reasoning Requires Process Validity
Bounded Hyperbolic Tangent: A Stable and Efficient Alternative to Pre-Layer Normalization in Large Language Models
Budget-Feasible Mechanisms for Submodular Welfare Maximization in Procurement Auctions
Theoretical Characterization of Generalization in Knowledge Distillation
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch
You Need Better Attention Priors
Set-Coupled Guidance: Set-Level Coordination in Diffusion-Based Dataset Distillation
``Someone Hid It!'': Query-Agnostic Black-Box Attacks on LLM-Based Retrieval
From Lyapunov Analysis to Algorithm Design in two-sided PL Minimax Optimization
Accelerated and Stable Convergence with Anchored Generalized Optimistic Method
SafeCompass: Dynamic Chain-of-Thought Steering via Inference-Time Safety Signals
Revisiting Photometric Ambiguity for Accurate Gaussian-Splatting Surface Reconstruction
Exploration Hacking: LLMs Can Learn to Resist RL Training
Attentive Multi-Layer Fusion for Vision Transformers
LASER: Learning Active Sensing for Continuum Field Reconstruction
Diffuse to Detect: Bi-Level Sample Rebalancing with Pseudo-Label Diffusion for Point-Supervised Infrared Small-Target Detection
Code2Worlds: Empowering Coding LLMs for 4D World Generation
SPADA: A Verifiable Test-Driven Agent for Controllable Parametric CAD Assembly Generation
Alterbute: Editing Intrinsic Attributes of Objects in Images
Compute When Worth It: Risk Control for Reasoning on a Compute Budget
A Regret Minimization Framework on Preference Learning in Large Language Models
CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction
Unveiling the Role of Data Uncertainty in Tabular Deep Learning
Tilt Matching for Scalable Sampling and Fine-Tuning
Tackling Length Inflation Without Trade-offs: Group Relative Reward Rescaling for Reinforcement Learning
Nonparametric Data Attribution for Diffusion Models
Beyond Pixels: Mining Compressed Domain Artifacts for Efficient AI-Generated Video Detection
Vector Linking via Cross-Model Local Isometric Consistency
T-GINEE: A Tensor-Based Multi-Graph Representation Learning
Rethinking the Trust Region in LLM Reinforcement Learning
Predictive Prefetching for Retrieval-Augmented Generation
Meta Flow Maps enable scalable reward alignment
M-IDoL: Information Decomposition for Modality-Specific and Diverse Representation Learning in Medical Foundation Model
Discrete Tilt Matching
Supervised Graph Contrastive Learning for Gene Regulatory Networks
Tightening the Score Matching Gap for Diffusion Models
Advancing Analytic Class-Incremental Learning through Vision-Language Calibration
HONet: Data-Efficient Learning for Exact Cover Problems via Hypergraph Optimization
EvoCF: Multi-Agent Collaboration via Agentic Memory-Driven Evolutionary Counterfactual Planning
PCGS: Deblurring 3D Gaussian Splatting with Patch Comparison
Lifting Traces to Logic: Programmatic Skill Induction with Neuro-Symbolic Learning for Long-Horizon Agentic Tasks
Auditing Sybil: Explaining Deep Lung Cancer Risk Prediction Through Generative Interventional Attributions
Capacity-Agnostic Parameter Isolation for Continual Graph Learning
iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance
Divide and Contrast: Learning Robust Temporal Features without Augmentation
LaRA-Fusion: Latent-Robust Adaptation via Dual-Loop Constraints for Infrared and Visible Image Fusion
Inference-time Alignment with Rewards in Besov Spaces: Provable Advantages of Feature Learning and Multi-Step Policy Updates
SpecMD: A Comprehensive Study On Speculative Expert Prefetching
Toward Culturally Aligned LLMs through Ontology-Guided Multi-Agent Reasoning
$\texttt{Multi}^2$: Hierarchical Multi-Agent Decision-Making with LLM-Based Agents in Interactive Environments
*MemPot*: Defend Against Memory Extraction Attack with Optimized Honeypots
Graph-GRPO: Training Graph Flow Models with Reinforcement Learning
Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution
Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving
LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
SAOT: Self-Supervised Continual Graph Learning with Structure-Aware Optimal Transport
Discriminative Mixture-of-Experts on Graphs with Reliable Expert Fusion
Light Up Your Face: A Physically Consistent Dataset and Diffusion Model for Face Fill-Light Enhancement
Robustifying Vision-Language Models via Test-Time Prompt Adaptation
What Makes a Desired Graph for Relational Deep Learning?
FunCQNet: A Functional Censored Quantile Neural Network for Predicting Long-Term Post-Transplant Kidney Survival
Optimizing Agentic Reasoning with Retrieval via Synthetic Semantic Information Gain Reward
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention
Left–Right Symmetry Breaking in CLIP-style Vision-Language Models Trained on Synthetic Spatial-Relation Data
Echoes within the Reasoning: Stealth and Effective Watermarking via Chain of Thought
Anchor-guided Hypergraph Condensation with Dual-level Discrimination
AutoMat: Physics-Guided Agentic Reasoning for Solving Ill-Posed Inverse Microscopy Problems
Privately Fine-Tuned LLMs Preserve Temporal Dynamics in Tabular Data
Post-Hoc Merging is Not Enough: Many-Shot Model Merging with Loss-Gap Balancing
AliMark: Enhancing Robustness of Sentence-Level Watermarks Against Text Paraphrasing
GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation
Geometrically Constrained Stenosis Editing in Coronary Angiography via Entropic Optimal Transport
SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning
TriForces: Augmenting Atomistic GNNs for Transferable Representations
Towards One-for-All Anomaly Detection for Tabular Data
Compress then Merge: From Multiple LoRAs into One Low-Rank Adapter
Is Code Better Than Language for Algorithmic Reasoning?
Rethinking Feature Alignment in Generalist Graph Anomaly Detection: A Relational Fingerprint-based Approach
Diversity-Aware Recursive Feature Multiple Kernel Learning
SHERPA: Fine-tuning Segment Anything Models with Task-relevant Guidance
Compile to Compress: Boosting Formal Theorem Provers by Compiler Outputs
Semi-LAR: Semi-supervised Contrastive Learning with Linear Attention for Removal of Nighttime Flares
Nonconvex Low-Rank Tensor Representation with Deep Priors for Multiview Subspace Clustering
LLM Watermark Evasion via Bias Inversion
Submodular Optimization for Minimal Augmentation in Robust Language Model Alignment
Beyond the Bellman Recursion: A Pontryagin-Guided Framework for Non-Exponential Discounting
C$^{2}$R: Cross-sample Consistency Regularization Mitigates Feature Splitting and Absorption in Sparse Autoencoders
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Near-optimal and Efficient First-Order Algorithm for Multi-Task Learning with Shared Linear Representation
Learning Useful Supervision for Reinforcement Learning in Reasoning Models
Self-supervised Hierarchical Visual Reasoning with World Model
Towards Disentangled Preference Optimization Dynamics
Scene Graph Thinking: Reinforcing Structured Visual Reasoning for Multimodal Large Language Models
CausalArmor: Efficient Indirect Prompt Injection Guardrails via Causal Attribution
On the Generalization Gap in Self-Evolving Language Model Reasoning
Global Merger-Arbitrage Forecasting with Language Models
ARLArena: Demystifying Policy Gradient Stability in Agentic Reinforcement Learning
Finding DoRI: Discovery of Retained Images in Diffusion Models
Intrinsic Gradient Suppression for Label-Noise Prompt Tuning in Vision–Language Models
Riemannian Optimization for Fair Spectral Clustering
Coarse-Grained Boltzmann Generators
Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models
Posterior Concentration of Physics-Informed Neural Networks for Elliptic PDEs
Merge to Remember: Sharpness-Aware Isotropic Merging for Continual Learning
PolyFlow: Safe and Efficient Polytope-Constrained Flow Matching with Constraint Embedding and Projection-free Update
FUSE: FK-Steered Multi-Modal Flow Matching for Efficient Simulation-Based Posterior Estimation
Reusing Trajectories in Policy Gradients Enables Fast Convergence
Resolving the Timestep Scaling Paradox in Spiking Neural Networks with a Timestep-Scalable Neuron Model
FRISM: Fine-Grained Reasoning Injection via Subspace-Level Model Merging for Vision–Language Models
SpeedVFI: One-step Diffusion for Efficient Video Frame Interpolation
RedVisor: Reasoning-Aware Prompt Injection Defense via Zero-Copy KV Cache Reuse
Latent Guided Sampling for Combinatorial Optimization
Federated Multi-view Clustering for Remote Sensing Data
Gradient Smoothing: Coupling Layer-wise Updates for Improved Optimization
Language-based Trial and Error Falls Behind in the Era of Experience
SemBind: Binding Diffusion Watermarks to Semantics Against Black-Box Forgery Attacks
MindZero: Learning Online Mental Reasoning With Zero Annotations
BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models
Cross-View Lewis Weight Fusion Empowering Exemplar Replay for Federated Class-Incremental Learning
Efficient Bilevel Optimization for CKA-Guided MoE Upcycling
TQL: Scaling Q-Functions with Transformers by Preventing Attention Collapse
Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning
ExpWeaver: LLM Agents Learn from Experience via Latent RAG
GradientStabilizer: Fix the Norm, Not the Gradient
Ariadne's Thread of LipSync: Unraveling Forgeries via Inconsistency between Lip Motions and Head Poses
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
SCNS: Continual Personalization of Diffusion Models via Submodular Concept Neuron Selection
Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink
HTAC: Hierarchical Task-Aware Composition for Continual Offline Reinforcement Learning
CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
Learning Generalized Trackers with Elastic Token Budgets
MIST: Moment-Aligned Invariant Stability Transform for Robust Flow Matching
SCORE: A Unified Framework for Overshoot Refund in Online FDR Control
Scaling Laws of Global Weather Models
AdaGC: Enhancing LLM Pretraining Stability via Adaptive Gradient Clipping
Motion-Residual Conflict-Aware Time Reversal for Generative Inbetweening
Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing
LakeQA: A Benchmark for Complex Exploratory QA over a Million-Scale Data Lake
Sparks of Cooperative Reasoning: LLMs as Strategic Hanabi Agents
Deep neural networks divide and conquer dihedral multiplication
Beyond Explicit Edges: Robust Reasoning over Noisy and Sparse Knowledge Graphs
From Text to Forecasts: Bridging Modality Gap with Temporal Evolution Semantic Space
Beyond Perplexity: UTF-8 Validity in Byte-aware Language Models
New Wide-Net-Casting Jailbreak Attacks Risk Large Models
VecDesigner: Exploring Visual Guidance and Structural Consistency for Semantic Typography
AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent
Detecting Errors in AI-Generated Annotations: When and Why Semantic Neighbors Help
HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation
Are First-Order Diffusion Samplers Really Slower? A Fast Forward-Value Approach
R2R2: Robust Representation for Intensive Experience Reuse via Redundancy Reduction in Self-Predictive Learning
Plasticity Activation via Polar Operator: A Plug-in Method for Balancing Stability and Plasticity
Dynamic Thinking-Token Selection for Efficient Reasoning in Large Reasoning Models
WAVE: Window-Aware Vocabulary-Efficient Early-Exit for Training-Free LLM Acceleration
Refining Dual Spectral Sparsity in Transformed Tensor Singular Values
Two Modalities Are Better Than One: Efficient Adversarial Purification via Multimodal Diffusion Models
Efficient Stochastic Optimisation via Sequential Monte Carlo
Training AI Co-Scientists Using Rubric Rewards
Model Monotonicity in Autobidding Auctions: When Do Better Predictions Lead to Better Outcomes?
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision
The Quality-Utility Paradox: Why High-Reward Data Impairs Small Model Reasoning
Safe Autoregressive Image Generation with Iterative Self-Improving Codebooks
GaussTrace: Provenance Analysis of 3D Gaussian Splatting Models with Evidence-based LLM Reasoning
Curating the Future: A Scalable Recipe for Training Open-Ended Forecasters
RaGEP: Rank-aware Geometric Expert Pruning for Mixture-of-Experts Language Models
Intrinsic Credit Assignment for Long Horizon Interaction
Proxy Compression for Language Modeling
EvReflection: Event-Driven Micro-Dynamics for Reflection Removal
Scaling-Aware Adapter for Structure-Grounded LLM Reasoning
Cross-Chirality Generalization by Axial Vectors for Hetero-Chiral Protein-Peptide Interaction Design
ProphetKV: User-Query-Driven Selective Recomputation for Efficient KV Cache Reuse in Retrieval-Augmented Generation
LIF Recurrent Memory Enables Long-Horizon Spiking Computation
BOCLOAK: Optimal Transport-Guided Adversarial Attacks on Graph Neural Network-Based Bot Detection
Learning More from Less: Unlocking Internal Representations for Benchmark Compression
ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory Management
TGPO: Efficient Policy Optimization through Sequence Anchor and Information Gating
From Interaction Trajectories to Prompt Rules: Credit Assignment for Multi-Agent Prompt Optimization
Optimizing Network Simulation: Enhancing Performance Prediction Accuracy via Neural Architecture Search
Watermarking LLM Agent Trajectories
AdaRoPE: Not All Attention Heads Should Rotate and Scale Equally
Time-CoT: Hierarchical Reasoning with Temporal Semantic Codes for Multivariate Time Series Classification
OOVDet: Low-Density Prior Learning for Zero-Shot Out-of-Vocabulary Object Detection
See the Emotion: A Facial Emoji Proxy Modeling for EEG Emotion Recognition
Reasoning LLM Improves Speaker Recognition in Long-form TV Dramas
Know Thyself, Know Thy User: Intrinsic Dual-Perspective Reasoning for Role-Playing LLMs
On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression
Universal Redundancies in Time Series Foundation Models
FiGuRO - Intrinsic Dimension Estimation for Multi-Modal Data
BRIDGE: Triangular Fixed-Point Refinement for Long-Horizon Persona Consistency
GenAlign: Towards Unified Alignment Framework of MLLMs via Generative Reward Model
TimeMRA: LLM-Empowered Time Series Forecasting via Multi-Scale Retrieval-Augmented Representations
A Distributional View for Visual Mechanistic Interpretability: KL-Minimal Soft-Constraint Principle
DNACHUNKER: Learnable Tokenization for DNA Language Models
NeurVLA: Unleashing Failure-Handling Capability of Vision-Language-Action Models via Neural-Symbolic Reasoning
Particle-Guided Diffusion Models for Partial Differential Equations
On Testing Conditional Mean Independence for Manifold-Valued Data
Likelihood over Estimation: Robust Quadratic Discriminant Analysis for Heavy-Tailed Distributions with Theory and Evidence
Welfare-Optimal Classification with Accuracy Auctions
Reasoning on the Manifold: Bidirectional Consistency for Self-Verification in Diffusion Language Models
Multimodal Nested Learning for Decoupled and Coordinated Optimization
Text-Driven Fusion for Infrared and Visible Images: Achieving Image Scene Adaptation on Hyperbolic Space
Boundary Embedding Shaping with Adaptive Contrastive Learning for Graph Structural Disentanglement
DiScoFormer: Plug-In Density and Score Estimation with Transformers
Dimension-Independent Convergence of Underdamped Langevin Monte Carlo in KL Divergence
Terminal Dimension Reduction for Time Series with Applications
Foresee-to-Ground: From Predictive Temporal Perception to Evidence-Driven Reasoning for Video Temporal Grounding
Single-Head Attention in High Dimensions: A Theory of Generalization, Weights Spectra, and Scaling Laws
A Solvable High-Dimensional Model Where Nonlinear Autoencoders Learn Structure Invisible to PCA While Test Loss Misaligns With Generalization
L-SR1: Learned Symmetric-Rank-One Preconditioning
Multi-Label Test-Time Adaptation with Bayesian Conditional Priors
FLIPS: Instance-Fingerprinting for LLMs via Pseudo-random Sequences
DTS: Enhancing Large Reasoning Models via Decoding Tree Sketching
Position: Safety Must Precede the Deployment of Open-Ended AI Agents
Particles Don’t Care About Z: Towards Scaling Entropy Estimation of Unnormalized Densities
No Data? No Problem: Robust Vision-Tabular Learning with Missing Values
Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge
Many Experiments, Few Repetitions, Unpaired Data, and Sparse Effects: Is Causal Inference Possible?
Mitigating Mask Prior Drift and Positional Attention Collapse in Large Diffusion Vision-Language Models
PLATE: Plasticity-Tunable Efficient Adapters for Geometry-Aware Continual Learning
Real-Time Visual Attribution Streaming in Thinking Model
Position: AI Should Facilitate Democratic Deliberation at Scale
Physics in 2-Steps: Locking Motion Priors Before Visual Refinement Erases Them
Approximation Bounds for Transformer Networks with Application to Regression
Position: Trustworthy AI Suffers from Invariance Conflicts and Causality is The Solution
PersistBench: When Should Long-Term Memories Be Forgotten by LLMs?
Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control
Adaptive Contracts for Cost-Effective AI Delegation
Any-dimensional invariant universality
Understanding Dynamic Compute Allocation in Recurrent Transformers
Trading Complexity for Expressivity Through Structured Generalized Linear Token Mixing
AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation
SleepMaMi: A Universal Sleep Foundation Model for Integrating Macro- and Micro-structures
Beyond External Monitors: Enhancing Transparency of Large Language Models for Easier Monitoring
Mind the Gap: Mixtures of Gaussians in Approximate Differential Privacy
Positive-Unlabeled Learning with Extreme Scarcity of Labeled Positives
Beyond Structural Symmetries: Linear Mode Connectivity via Neuron Identifiability
Prototype-guided Bilateral Alignment Multimodal Federated Learning
Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning
A Coin Flip for Safety: LLM Judges Fail to Reliably Measure Adversarial Robustness
CoEvol-NO: State and Coordinate Co-Evolution with an Error-Driven Predictor-Corrector Paradigm for Neural Operator Transformer
Dual-Calibration Multi-View Clustering via Compact Anchor Learning
Hide and Seek in Embedding Space: Geometry-based Steganography and Detection in Large Language Models
Position: LLM-Safety Evaluations Lack Robustness
Revisiting Anisotropy in Language Transformers: The Geometry of Learning Dynamics
Improved Stochastic Optimization of LogSumExp
Measuring Meta-Cultural Competency: A Spectral Framework for LLM Knowledge Structures
Finding Most Influential Sets
FUSE: Frequency-domain Unification and Spectral Energy Alignment for Multi-modal Object Re-Identification
A Geometry-Based View of Mahalanobis OOD Detection
Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping
How much can language models memorize?
On the Convergence of Decentralized Stochastic Minimax Optimization Algorithm with Compressed Communication
Towards Fully Parameter-Free Stochastic Optimization: Grid Search with Self-Bounding Analysis
Stable Deep Reinforcement Learning via Isotropic Gaussian Representations
DISSOLVR: An Interpretable and Fast Framework for Aqueous and Organic Solubility Prediction
Demystifying the Optimal Fair Classifier in Multi-Class Classification
$\texttt{IDEAS}$: Interpretability Driven Evolutionary Approach for the Design of Biological Sequences
Decompose, Structure, and Repair: A Neuro-Symbolic Framework for Autoformalization via Operator Trees
FlowSeg: Dynamic Semantic Guidance for LLM-Conditioned Segmentation
Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Edit via In-Context Learning
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models
MAS-Architect: Declarative Multi-Agent System Design via Separation of Concerns
Learning Stochastic Bridges for Video Object Removal via Video-to-Video Translation
WebWorld: A Large-Scale World Model for Web Agent Training
Position Is All You Need: A Free Lunch Token Compression Strategy for MLLM-based Referring Expression Segmentation
Dissecting Causal Mechanism Shifts via FANS: Function And Noise Separation
Retriever Portfolios: A Principled Approach to Adaptive RAG
Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling
UCPO: Uncertainty-Aware Policy Optimization
This State Looks Like That: Self-Interpretable Reinforcement Learning Agents using Prototype Soft Actor-Critic
SCRWKV: Ultra-Compact Structure-Calibrated Vision-RWKV for Topological Crack Segmentation
Optimal conversion from Rényi Differential Privacy to $f$-Differential Privacy
DiasR: Dual-Modal Identity-Anchored Sparse Routing for Efficient Multi-Subject Video Generation
ShapCCS: Shapley-Driven Client Coreset Selection in Federated Learning
CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing
PixCLIP: Towards Fine-grained Vision-Language Understanding via Any-granularity Pixel-Text Alignment
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
The data manifold under the microscope
Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models
Revisiting Asymmetries in Black-box Link Stealing against Graph Neural Networks
Context-level Language Modeling by Learning Predictive Context Embeddings
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
Noise-corrected GRPO: From Noisy Rewards to Unbiased Gradients
Holonomy Grid Codes for Generalisation Under Directed Actions
Dual-Latent Memory Routing for Vision-Language Reasoning
On the Expressive Power of Permutation-Equivariant Weight-Space Networks
Bridging the Perceptual Gap: Residual-Enhanced Downscaling and Manifold-Aware Perception Alignment Adaptation for NR-IQA
Phase-Aware Mixture of Experts for Agentic Reinforcement Learning
VIBE: Disentangling Social Dynamics via Kinematics-Informed Variational Inference for Behavioral Emotion
From Coarse to Fine: Deep Prototype Refinement Network for Few-Shot Point Cloud Semantic Segmentation
Q-CLIP: Unleashing the Power of Vision-Language Models for Video Quality Assessment through Unified Cross-Modal Adaptation
Guidance: Sentence-Level Citation Enforcement via Prefix-Tail Guidance during LLM Decoding
SAMT: Generating Structured Avatar Meshes and Textures from a Single Image
Rational Neural Networks have Expressivity Advantages
TimeSeed: Effective Time Series Forecasting with Sparse Endogenous Variables
Hölder++: Improving Quality-Coherence Trade-off in Multimodal VAEs
VocSim A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio
RePo: Language Models with Context Re-Positioning
DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving
D$^3$: Dynamic Directional Graph-Constrained Data Scheduling for LLM Training
AutoRAS: Learning Robust Agentic Systems with Primitive Representations
From Optimization to Generalization under Heavy-Tailed Data: The Role of Gradient Clipping
NeurIPS: Neuro-anatomical Inductive Priors for Sphere-based Brain Decoding
Incremental Learning of Sparse Attention Patterns in Transformers
Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
WaveSSM: Multiscale State-Space Models for Non-stationary Signal Attention
Memory-Distilled Selection for Noise-Robust Anomaly Detection
MAC-NeRF: Motion-Aware Curriculum Learning for Dynamic LiDAR NeRFs
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs
Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift
Learning to Evict from Key-Value Cache
CSPLoRA: Confidence-Guided Structure Planning for Low-Rank Adaptation
Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models
Taming the Recent-Data Bias: Towards Robust Time Series Forecasting with Global Context
PRIM:Cooperative Dynamic Token Compression for Efficient Large Multimodal Models
Generalized Correctness Models: Learning Calibrated and Cross-Model Correctness Predictors from Historical Patterns
Controlled Dynamics Attractor Transformer
CoGeoAD: Hierarchical Color-Geometric Fusion with Multi-View Attention for Zero-Shot 3D Anomaly Detection
Scaling Small Agents Through Strategy Auctions
ImgCoT: Compressing Long Chain of Thought into Compact Visual Tokens for Efficient Reasoning of Large Language Model
ToaSt: Token Channel Selection and Structured Pruning for Efficient ViT
Differentiable Weightless Controllers: Learning Logic Circuits for Continuous Control
Random Scaling of Emergence Capabilities
On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs
GADA: Geometry-Aware Deformable Aggregation for Image-Based Gaussian Splatting
Floating-Point Networks with Automatic Differentiation Can Represent Almost All Floating-Point Functions and Their Gradients
Finite and Corruption-Robust Regret Bounds in Online Inverse Linear Optimization under M-Convex Action Sets
TraCeS: Learning Per-Timestep Constraint-Violation Credit from Sparse Trajectory-Level Labels
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning
Evaluating Sample Utility for Efficient Data Selection by Mimicking Model Weights
Balanced LoRA: Removing Parameter Invariance to Accelerate Convergence
Beyond Global Alignment: Fine-Grained Motion-Language Retrieval via Pyramidal Shapley-Taylor Learning
CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation
Analytic Bijections for Smooth and Interpretable Normalizing Flows
FedTreeLoRA: Reconciling Statistical and Functional Heterogeneity in Federated LoRA Fine-Tuning
MCCE: A Framework for Multi-LLM Collaborative Search in Discrete Spaces with Similarity-Filtered Preference Learning
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Improving CLIP Adaptation by Breaking Tail Alignment for Source-Free Cross-Domain Few-Shot Learning
No Need to Train Your RDB Foundation Model
Smooth Dynamic Cutoffs for Machine Learning Interatomic Potentials
Towards Understanding Steering Strength
Generalizable and Actionable Parts Pose Estimation with Symmetry Annotation-Free Learning Strategy
Bayesian-LoRA: Probabilistic Low-Rank Adaptation of Large Language Models
Discounted Beta-Bernoulli Reward Estimation for Sample-Efficient Reinforcement Learning with Verifiable Rewards
Decoupling Variance and Scale-Invariant Updates in Adaptive Gradient Descent for Unified Vector and Matrix Optimization
Robust Stochastic Gradient Posterior Sampling with Lattice Based Discretisation
Reasoning-VLA: An Efficient and Spatial-Guided General Vision-Language-Action Reasoning Model for Autonomous Driving
Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding
Asymmetric Perturbation in Solving Bilinear Saddle-Point Optimization
Graph of States: Solving Abductive Tasks with Large Language Models
Copula-SVI: Vine-Copula Variational Inference for Instance-Level Correlation Capturing
A Judge-Aware Ranking Framework for Evaluating Large Language Models without Ground Truth
Synthesizing Multimodal Geometry Datasets from Scratch and Enabling Visual Alignment via Plotting Code
Generative Modeling of Irregular Time Series via SDE-Induced Continuous-Discrete Variational Inference
ReViT: Rotational-equivariant Vision Transformers for Neural PDE Solvers
Corruption-Tolerant Asynchronous Q-Learning with Near-Optimal Rates
LOZO+: Provably Efficient Zeroth-Order Fine-Tuning via Greedy Low-Rank Subspace Selection
ReMoE: Boosting Expert Reuse through Router Fine-Tuning in Memory-Constrained MoE LLM Inference
MAFE: Enabling Equitable Algorithm Design in Multi-Agent Multi-Stage Decision-Making Systems
Trajectory-Aware Spiking DiTs Conversion via Membrane Potential Error-Feedback
IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking
Gram2Token: Enabling Run-time GPU-Native Grammar-Constrained Decoding for LLMs
Error Propagation and Model Collapse in Diffusion Models: A Theoretical Study
Spurious Correlation Learning in Preference Optimization: Mechanisms, Consequences, and Mitigation via Tie Training
VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph
Proactive Defense Benchmark against Deepfake Generation
Closing the Expression Gap in LLM Instructions via Socratic Questioning
Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives
$E^2$PO: Embedding-perturbed Exploration Preference Optimization for Flow Models
DocOS: A Benchmark for Proactive Document-Guided Actions in GUI Agents
Structure-Centric Graph Foundation Model via Geometric Bases
Branch Scaling Manifests as Implicit Architectural Regularization for Improving Generalization in Overparameterized ResNets
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos
What Linear Probes Miss: Multi-View Probing for Weight-Space Learning
Error-Driven Graph Augmentation for Mesh-Based PDE Surrogates
SAGE: Shaping Anchors for Guided Exploration in RLVR of LLMs
Evaluating AI Grading on Real-World Handwritten College Mathematics: A Large-Scale Study Toward a Benchmark
MixFP4: Extending NVFP4 to Mixed Micro-Format via Scale-Bit Reuse and Tensor Core Co-design
Squeezing More from the Stream : Learning Representation Online for Streaming Reinforcement Learning
Weaving Graph over Tokens: Contextualizing Structured Sequences for LLMs
Fix Before Search: Benchmarking Agentic Visual Query Pre-processing in Multimodal Retrieval-augmented Generation
RefChess: Monte-Carlo Move Selection for Zero-Shot Referring Image Segmentation
When More Experts Hurt: Underfitting in Multi-Expert Learning to Defer
Why Agentic Theorem Prover Works: A Statistical Provability Theory of Mathematical Reasoning Models
PromptPilot: Game-Theoretic Multi-Agent Prompt Optimization for Segment Anything
Interleaved Selective State Space Models for Efficient WiFi-Based 3D Multi-Person Pose Estimation
On the Effect of Misspecifying the Embedding Dimension in Low-rank Network Models
Reflect-then-Correct: Rebalancing Task Optimization for Generalizable Meta-Reinforcement Learning via Distributional Value Error Reduction
Next-Token Prediction and Regret Minimization
Towards Fine-grained Robustness: Attention-guided Test-time Prompt Tuning for Vision-Language Models
A²RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation
Mitigating Error Propagation in Low-Rank Approximation of Large Models via Distribution-Aware Whitening
Task-Aware Exploration via a Predictive Bisimulation Metric
Multimodal Crystal Flow: Any-to-Any Modality Generation for Unified Crystal Modeling
Resource-Efficient Reinforcement for Reasoning Large Language Models via Dynamic One-Shot Policy Refinement
Multi-Objective Protein Design via Memory-Aware Test-Time Scaling in Diffusion Models
Reward-free Alignment for Conflicting Objectives
SE(n)-Invariant Flow Matching: A General Framework with Application to Object Reassembly
ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models
Overcoming PINNs Failure Modes In High Dimension With Low-Rank Fourier Sum
From Kepler to Newton: Inductive Biases Guide Learned World Models in Transformers
A Mechanistic Understanding of Sim-and-Real Co-Training in Generative Policies
Incentivizing Truthfulness and Collaborative Fairness in Bayesian Learning
SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning
Universal One-third Time Scaling in Learning Peaked Distributions
Inverse Depth Scaling From Most Layers Being Similar
Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning
Is Data Shapley Not Better than Random in Data Selection? Ask NASH
A Very Big Video Reasoning Suite
Scalable and General Whole-Body Control for Cross-Humanoid Locomotion
LineageFlow: Flow Matching for High-Fidelity Family-Aware Protein Sequence Generation
De-attribute to Forget for LLM Unlearning
Pseudo-Mallows for Efficient Probabilistic Preference Learning
TopAdapter: Topology-Aware Prompt Tuning for Efficient Point Cloud Understanding
Demystifying When Pruning Works via Representation Hierarchies
Position: Vision encoders should be image size agnostic and task driven
LabBuilder: Protocol-Grounded 3D Layout Generation for Interactable and Safe Laboratory
ForesightKV: Optimizing KV Cache Eviction for Reasoning Models by Learning Long-Term Contribution
Stochastic Lifting for Generating Trajectories of Stochastic Physical Systems
Leveraging Gauge Freedom for Learning Non-Gradient Population Dynamics of Stochastic Systems
Two-Parameter Flows for Learning Population Dynamics of Physical Systems
Optimal Learning from Label Proportions with General Loss Functions
Beyond Static Endpoints: Tool Programs as an Interface for Flexible Agentic Web Services
FormalJudge: A Neuro-Symbolic Paradigm for Agentic Oversight
Memora: A Harmonic Memory Representation Balancing Abstraction and Specificity
SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion
Alethia: a Foundational Encoder for Voice Deepfakes
SVL: Empowering Spiking Neural Networks for Efficient 3D Open-World Understanding
A Perturbation Approach to Unconstrained Linear Bandits
Search Space Synthesis for Parametric Functions
DLLMQuant: A Post-Training Quantization Framework Tailored for Diffusion-Based Large Language Models
DevEvol: Benchmarking LLM Agents on Continuous Software Evolution
Normalization-equivariant Diffusion Models: Learning Posterior Samplers From Noisy And Partial Measurements
Parameter-free Dynamic Regret: Time-varying Movement Costs, Delayed Feedback, and Memory
FairMerging: Rethinking Model Merging through the Lens of Fairness
Stationary MMD Points
InteractBench: Benchmarking LLMs on Competitive Programming under Unrevealed Information
Security–Fidelity Tradeoffs: No Universal Defense Against Prompt Injection
Injecting Distributional Awareness into MLLMs via Reinforcement Learning for Deep Imbalanced Regression
$\tau$-Voice: Benchmarking Full-Duplex Voice Agents on Real-World Domains
Hyperbolic RQ-VAE enhanced Generative Recommendation with Differential-Length Codebook Strategy
PPT-Eval: A Benchmark for Computer-Use Agents on PowerPoint Tasks
Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs
Unsupervised Diffusion for Combinatorial Optimization via Adjoint Matching
Unsupervised Neural Langevin Sampler for Mixed Integer Linear Programming
$f$-Divergence Self-Play for Tabular Anomaly Detection via Large Language Models
gp2Scale: A Class of Compactly Supported Non-Stationary Kernels and Distributed Computing for Exact Gaussian Processes on 10 Million Data Points
RoboFlow4D: A Lightweight Flow World Model Toward Real-Time Flow-Guided Robotic Manipulation
Online Robust Reinforcement Learning with General Function Approximation
TokenRatio: Principled Token-Level Preference Optimization via Ratio Matching
HieRD: Hierarchical Relational Distillation for Vision-Language Embedding Models
From Outcomes to Actions: Leveraging Hindsight for Long-Horizon Language Agent Training
Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization
Debate with Images: Detecting Deceptive Behaviors in Multimodal Large Language Models
Doppler Prompting for Stable mmWave-based Human Pose Estimation
Hierarchical Representations for Cross-task Automated Heuristic Design using LLMs
Beyond Continuity: Simulation-free Reconstruction of Discrete Branching Dynamics from Single-cell Snapshots
Benchmarking at the Edge of Comprehension
Betting on Predictions
Expandable, Compressible, Mineable: Open-World Thermal Infrared Image Restoration
Prompt Tuning for CLIP on the Pretrained Manifold
Debiased Model-based Representations for Sample-efficient Continuous Control
Towards A Generative Protein Evolution Machine with DPLM-Evo
SilentWood: Efficient Private Inference Over Gradient Boosting Decision Forests
TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation
Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression
ePC: Fast and Deep Predictive Coding for Digital Hardware
MatchFixAgent: Language-Agnostic Autonomous Repository-Level Code Translation Validation and Repair
Rethinking Parameter Sharing as Graph Coloring for Structured Compression
Rank-guided Diffusion for Noise Few-Shot Learning
Discrete Diffusion with Physical Mass Constraints for \emph{De Novo} Peptide Sequencing
Activation with Intrinsic-Extrinsic Consensus
HybridOM: Hybrid Physics-Based and Data-Driven Global Ocean Modeling with Efficient Regional Downscaling
Distillation Models are Good Samplers for Diffusion Reinforcement Learning
LassoFlexNet: a Flexible Neural Architecture for Tabular Data
HyPOLE: Hyperproperty-Guided Multi-Agent Reinforcement Learning under Partial Observation
Scaling Law for Quantization-Aware Training
(Doubly) Exponential Lower Bounds for Follow the Regularized Leader in Potential Games
Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation
Tracking Drift: Variation-Aware Entropy Scheduling for Non-Stationary Reinforcement Learning
ANTiC: Adaptive Neural Temporal In Situ Compressor
Position: Significant impact of numerical precision in scientific machine learning
No Free Lunch: Non-Asymptotic Analysis of Prediction-Powered Inference
KAGE-Bench: Fast Known-Axis Visual Generalization Evaluation for Reinforcement Learning
Flowers: A Warp Drive for Neural PDE Solvers
Provable Bounds for the Learnability of Sample-Compressible Families from Noisy Samples
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants
Controllable Molecule Generation via Sparse Representation Editing: An Interpretability-Driven Perspective
Optimization, Generalization and Differential Privacy Bounds for Gradient Descent on Kolmogorov–Arnold Networks
TVDRNet: Text-driven Viewpoint Optimization via Differentiable Rendering for 3D Reasoning Segmentation
Landmark-Guided Policy Optimization for Multi-Objective Language Model Selection
Barriers to Counterfactual Credit Attribution for Autoregressive Models
Graph Neural Dynamics via Learned Energy and Tangential Flows
Learning Permutation Distributions via Reflected Diffusion on Ranks
Physics-Informed Residual Flows
Solving Imperfect-Recall Games via Sum-of-Squares Optimization
Heavy-tailed Physics-Informed Neural Networks
Formally Exploring Visual Anomaly Detection Evaluation Metrics
Alignment between Brains and AI: Evidence for Convergent Evolution across Modalities, Scales and Training Trajectories
Geometric Collapse: When Vision Models Fail to Verify Physical Causality
TEFormer: Structured Bidirectional Temporal Enhancement Modeling in Spiking Transformers
Flow-Based Density Ratio Estimation for Intractable Distributions with Applications in Genomics
Fast and Optimal Algorithms for Private Hypothesis Selection
VLA-Arena: An Open-Source Framework for Benchmarking Vision-Language-Action Models
Data Agent: Learning to Select Data via End-to-End Dynamic Optimization
Generative Adaptation of Dynamics to Environmental Shifts via Weight-space Diffusion
Preference Goal Tuning: Post-Training as Latent Control for Frozen Policies
Towards Understanding the Dynamics of Low-Rank Adaptation
Efficient Diffusion LLMs via Temporal-Spatial Parallel Decoding and Confidence Extrapolation
Active Budget Allocation for Efficient Scaling Law Estimation via Surrogate-Guided Pruning
Improving Backward Conformal Prediction via Non-Conformity Score Transformation
AgentScore: Autoformulation of Deployable Clinical Scoring Systems
Spectral-Progressive Thought Flow for Lightweight Multimodal Reasoning
Physics from Video: Identifiability of Time-Invariant Second-Order ODEs under Minimal Trajectory Conditions
MixQuant: Pushing the Limits of Block Rotations in Post-Training Quantization
DDP-WM: Disentangled Dynamics Prediction for Efficient World Models
Med-SegLens: Latent-Level Model Diffing for Interpretable Medical Image Segmentation
Unveiling the Potential of Quantization with MXFP4: Strategies for Quantization Error Reduction
EEmo-Logic: A Unified Dataset and Multi-Stage Framework for Comprehensive Image-Evoked Emotion Assessment
Symmetry Reveals the In-Context Classifier: Transformers Implement Mean-Shift Dynamics
Deep Trajectory Supervision: Deep Supervision Strikes Back
LERD: Latent Event-Relational Dynamics for Neurodegenerative Classification
Early Decisions Matter: Proximity Bias and Initial Trajectory Shaping in Non-Autoregressive Diffusion Language Models
PoMtVRS: Preference-Optimized Multi-Task Vehicle Routing Solver with Preference Gating
Adversarial Vulnerability from Interference Between Features in Superposition
FocalPolicy: Frequency-Optimized Chunking and Locally Anchored Flow Matching for Coherent Visuomotor Policy
iLoRA: Bayesian Low-Rank Adaptation with Latent Interaction Graphs for Microbiome Diagnosis
Training-Free Hashing-Based Attention via Binary Principal Components
Probing Newtonian Mechanics in Video Generative Models with Real Physical Systems
Reasoning about Reasoning: BAPO Bounds on Chain-of-Thought Token Complexity in LLMs
Head-in-Head in Linear Attention
TeamTR: Trust-Region Fine-Tuning for Multi-Agent LLM Coordination
Z-Erase: Enabling Concept Erasure in Single Stream Diffusion Transformers
AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning
Advantage Collapse in Group Relative Policy Optimization: Diagnosis and Mitigation
GraphPFN: A Prior-Data Fitted Network for Graph Node-Level Tasks
Zero-source LLM Hallucination Detection with Human-like Criteria Probing
Affine-Equivariant Kernel Space Encoding for NeRF Editing
Reasoning Cache: Learning to Extrapolate to Long Lengths via Short-Length RL
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification
Active Timepoint Selection for Learning Measure-Valued Trajectories
Understanding Truncated Positional Encodings for Graph Neural Networks
Neural Modular Physics for Elastic Simulation
LEGO: An LLM-Enabled Hierarchical Optimizer for Tensor Computation Graphs with Structure-Aware Search and Compositional Synthesis
CONTINUUM: Restoring the Contiguous Tensor Abstraction Efficiently for Dynamic AI Workloads via Hardware Virtualization
CellBRIDGE: Learning Cellular Trajectories via Interaction-Aware Alignment
Contrastive Spectral Rectification: Test-Time Defense towards Zero-shot Adversarial Robustness of CLIP
Align Your Trajectory Tangent: Training Better Consistency Models via Manifold-Aligned Tangents
RLCracker: Evaluating the Worst-Case Vulnerability of LLM Watermarks with Adaptive RL Attacks
Position: The Open Benchmark Paradox Must Be Resolved through Sovereign Medical Evaluation
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
MAGIC: A Co-Evolving Attacker–Defender Adversarial Game for Robust LLM Safety
Disentangling Intent from Role: Adversarial Self-Play for Persona-Invariant Safety Alignment
When Can You Poison Rewards? A Tight Characterization of Reward Poisoning in Linear MDPs
MC-HNN: Learning Latent Structural Semantics and High-Rank Representations for Hypergraph Neural Networks
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents
Unveiling the Entropy Dynamics of Chain-of-Thought Reasoning
Beyond Accuracy and Complexity: The Effective Information Criterion for Structurally Stable Symbolic Regression
Beyond Description: Federated Adaptation via Semantic-Visual Prototype Alignment
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
MetaStreet: Semi-Supervised Multimodal Learning for Street-Level Socioeconomic Prediction
What Makes Effective Supervision in Latent Chain-of-Thought: An Information-Theoretic Analysis
From Flat Facts to Sharp Hallucinations: Detecting Stubborn Errors via Gradient Sensitivity
PODiff: Latent Diffusion in Proper Orthogonal Decomposition Space for Scientific Super-Resolution
Motion Dynamics Learning for Few-Shot Embodied Adaptation
Bring Future Vision: Dynamic Computation Allocation Guided by Lightweight Feature Forecaster
Rel-MOSS: Towards Imbalanced Relational Deep Learning on Relational Databases
Agentic Model Predictive Questioning Control in Visual Design
From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment
Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates
Rethinking Serialization in Linear 3D Vision: Decoupling Anisotropic Geometry from Isotropic Semantics
DualCOIL: Offline Imitation Learning from Contrasting Demonstrations
Mixtures Closest To A Given Measure: A Semidefinite Programming Approach
STRIDE: Post-Training LLMs to Reason and Refine Bio-Sequences via Edit Trajectories
Glimpse: Geometry Learning of Multi-scale Structural Priors for 3D Pose Estimation
WISE: World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
EAPO: Enhancing Policy Optimization with On-Demand Expert Assistance
RealtimeTool: Parallel Decoding for Real-Time LLM Function Calling
SimGFM: Simplifying Discrete Flow Matching for Graph Generation
FIRE: Learning to Navigate and Act on Real-World Files via Stateful Reinforcement Learning
Divide-and-Denoise: A Game-Theoretic Method for Fairly Composing Diffusion Models
Spectral–Spatial Mixing with Morphology-Aware Adaptive Loss for Medical Image Segmentation.
Predicting Dynamic Stability Landscapes in Synchronization Networks
MORE: A Multilingual Document Parsing Benchmark and Evaluation
HyperPotter: Spell the Charm of High-Order Interactions in Audio Deepfake Detection
Comp-Attn: Present-and-Align Attention for Compositional Video Genneration
On the Sharp Input-Output Analysis of Nonlinear Systems under Adversarial Attacks
Furina: Fragmented Uncertainty-Driven Refusal Instability Attack
Adaptive Visual Autoregressive Acceleration via Dual-Linkage Entropy Analysis
3DGS$^2$-TR: A Scalable Second-Order Trust-Region Method for 3D Gaussian Splatting
VlogReward: Learning Multi-Dimensional Evaluation for Vlog Editing
Lie-Algebraic Neural Koopman Dynamics
Contribution Weights: A Geometrical Analysis of Self-Attention Transformers
REAL: Resolving Knowledge Conflicts in Knowledge-Intensive Visual Question Answering via Reasoning-Pivot Alignment
Front-Loaded Robust Conformal Prediction: Heavy Calibration, Minimal Test-Time Cost
Hermes: An Evidence-Driven Agentic Framework for Trustworthy and Explainable AI-Generated Video Detection
MVP-LAM: Learning Action-Centric Latent Action via Cross-Viewpoint Reconstruction
CORE-MTL: Rethinking Gradient Balancing via Causal Orthogonal Representations
Spatial Priors via Space Filling Curves for Small and Limited Data Vision Transformers
Position: Evaluation of ECG Representations Must Be Fixed
ImpText: A Benchmark and Tool-Augmented Framework for Implicit Text Reasoning
Factored Classifier-Free Guidance
PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning
Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models
Position: Why a Dynamical Systems Perspective is Needed to Advance Time Series Modeling
HybridFlow: Resource-Adaptive Subtask Routing for Efficient Edge-Cloud LLM Inference
Dynamic Programming for Epistemic Uncertainty in Markov Decision Processes
TabularBERT: Binning-Based Self-Supervised Learning for Tabular Representation
GLAD: Bidirectional Structure-Attribute Alignment via Latent Graph Diffusion Models
EEG-FM-Bench: A Comprehensive Benchmark for the Systematic Evaluation and Diagnostic Analyses of EEG Foundation Models
SoftMoE: Soft Differentiable Routing for Mixture-of-Experts in LLMs
Prediction-Powered Adaptive Inference with Pretrained AI Models for Contextual Bandits
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning
Safety Anchor: Defending Harmful Fine-tuning via Geometric Bottlenecks
LoRDO: Distributed Low-Rank Optimization with Infrequent Communication
Escaping the Likelihood Trap: Geometric Diversity Optimization for Long-Form Image Captioning
Prompt Optimization with Minimal Unlabeled Input via Meta-Reasoning
LEMUR: Learned Multi-Vector Retrieval
The Devil is in the Spectrum: Mitigating Representation Collapse in LLMs via Topologically Regularized Side-Path
RSTR: Reducing SpatioTemporal Redundancy in Diffusion Transformers
BioDynaSpec: Harmonic-Guided Spatio-Spectral Autoregressive Diffusion for Protein Dynamics Generation
MLLM-4D: Towards Visual-based Spatial-Temporal Intelligence
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors
Controllable and explainable personality sliders for LLMs at inference time
Textual Stochastic Gradient Descent: Discrete Optimization of External Memory for Reasoning Language Agents
NOMAD: Lifelong Trajectory Planning via Non-Parametric Bayesian Memory-Adaptive Diffusion Experts
AutoMS: Multi-Agent Evolutionary Search for Cross-Physics Inverse Microstructure Design
On the Collapse of Generative Paths: A Criterion and Correction for Diffusion Steering
Step-resolved data attribution for looped transformers
Latent-Guided Cooperative Energy-Based Models
ReNF: Rethinking the Principles of Neural Long-Term Time Series Forecasters
SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm
Efficient numeracy in language models through single-token number embeddings
Toward Safe Quantization-Aware Fine-tuning: Understanding and Mitigating Safety Alignment Degradation
Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE
Prototype-Based Test-Time Adaptation of Vision-Language Models
Learning Rate Scaling across LoRA Ranks and Transfer to Full Finetuning
Last-Iterate Convergence of Regularized Gradient Methods for Stochastic Monotone Variational Inequalities
Active Tabular Augmentation via Policy-Guided Diffusion Inpainting
Where Rectified Flows Leak: Characterizing Membership Signals Along the Interpolation Path
Multi-scale Explainer for Graph Neural Networks
A Geometric Analysis of Small-sized Language Model Hallucinations
Hybrid Policy Distillation for LLMs
Position: Bridge Human Interpretation and Machine Representation With Explicit Specification For Qualitative Data Analysis In LLM Era
Test-Time Debiasing with Probabilistic Prompts via Wasserstein Distance in Vision-Language Models
Same Graph Cross-Task Transfer in GNNs: Protocols and Predictors
MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks
CoCoQuant: Breaking the Bandwidth Wall via Co-Optimized Communication and Computation Quantization
Stronger Benchmarks for Prediction as a Service with Constraints
Emergent Alignment via Competition
$R^3$DAO: Reactive Recovery and Reconstruction for Long-horizon Data Agent Orchestration
DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
Enhancing LLM Training via Spectral Clipping
Privileged Information Distillation for Language Models
SPARC: Separating Perception And Reasoning Circuits for Test-time Scaling of VLMs
Training-Free Hierarchical Working Memory for Small Language Model Agents
Signal Strength Estimation in Logistic Regression Using Data Splitting
Feature Collapse Under Corruption: An Entropy Perspective on Robust Neural Networks
TaRO: Temporal-Aware Reasoning Optimization for Video Temporal Grounding
From Reward-Free Representations to Preferences: Rethinking Offline Preference-Based Reinforcement Learning
The Illusion of Generalization: Instruction-Following, Task Bias and Contamination in Tabular Language Model Evaluation
SlaClip: Gradient Norm Slacks can be Indicator for Adaptive Clipping in DP-SGD
DiffStyle3D: Consistent 3D Gaussian Stylization via Attention Optimization
From Talking to Singing: A New Challenge for Audio-Visual Deepfake Detection
Robust Sequential Experimental Design for A/B Testing
Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training
Federated Manifold Learning (FML): Tackling Domain Heterogeneity with Structural Knowledge Transfer
SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models
Creat3r: Confidence Reaggregation for Exploration-aware Active 3D Reconstruction
Offline Reinforcement Learning with Universal Horizon Models
AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation
Preference-based Antibody Expression Ranking: Scaling with Large-scale Weak Supervision
Vision in One Vector: Implicit Visual Compression with Diffusion Foundation Models
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved Reasoning
Do LLMs Signal When They’re Right? Evidence from Neuron Agreement
Representational Curvature Shapes Behavioral Uncertainty in Large Language Models
Iterative Robust Satisficing: Minimizing Performance Degradation Under Distribution Shift
$A_2$DEPT: Large Language Model–Driven Automated Algorithm Design via Evolutionary Program Trees
Controlled LLM Training on Spectral Sphere
Beyond Mode Collapse: Distribution Matching for Diverse Reasoning
Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits
Neuro-Fuzzy Concept Learning for Interpretable Large Multimodal Models
RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents
Cross-task Calibration for Asynchronous Federated Continual Learning
Lightweight Federated Incremental Learning via Decoupled Replay
Taming Stochastic Gradient Descent: Almost Sure Convergence and Saddle-Point Avoidance under $(L_{0},L_{1})$-Smoothness
Lookahead-GCG: Improving Multi-Model Gradient-Based Jailbreaking Attacks via Nesterov Momentum
Referring Multiple Regions with Large Multimodal Models via Contextual Latent Steering
Dive into the Scene: Breaking the Perceptual Bottleneck in Vision-Language Decision Making via Focus Plan Generation
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
PACT: Self-Evolving Physical Safety Alignment for Diffusion Policies in Embodied Manipulation
Enhanced Multi-Instance Partial Label Learning via Average Gradient Outer Product
Temporal Difference Calibration in Sequential Tasks: Application to Vision-Language-Action Models
DAL: A Practical Prior-Free Black-Box Framework for Piecewise Stationary Bandits
Seeing Without Understanding: Disentangling Perception, Reasoning, and Simulation in VLM Gameplay
Decoupled Low-Rank Adaptation for Robust Federated Fine-Tuning
ProactiveLLM: Learning Active Interaction for Streaming Large Language Models
Disentangling Latent Risk Pathways via Bayesian Hypergraph Inference
Weight Updates as Activation Shifts: A Principled Framework for Steering
Revealing Scaling Behavior in Large-scale Time Series Models: Implications for More Efficient and Accurate Forecasting
Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions
Efficient Online Influence Maximization under the Independent Cascade Model with Node-Level Feedback
When Drafts Evolve: Speculative Decoding Meets Online Learning
Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Vision-Language Models
Minimax-Optimal Policy Regret in Partially Observable Markov Games
Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents
Moving Beyond Sparse Grounding with Complete Screen Parsing Supervision
Hearing Without Noticing? Attention-Aware Stealthy Black-box Adversarial Audio Attacks
SoftJAX & SoftTorch: Empowering Automatic Differentiation Libraries with Informative Gradients
An Empirical Study on the Resilience of Partial Merging to Model Clone Attacks
On the Ability of Transformers to Verify Plans
InfoGeo: Information-Theoretic Object-Centric Learning for Cross-View Generalizable UAV Geo-Localization
HiDe: Rethinking The Zoom-IN method in High Resolution MLLMs via Hierarchical Decoupling
ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior
Learning Multi-Agent Coordination via Sheaf-ADMM
Sparser, Faster, Lighter Transformer Language Models
DeepHA: Scaling Action Chains Elicits Deep Hierarchical Agents
The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL
PAC-Bayesian Reinforcement Learning Trains Generalizable Policies
EntRAG: Entity-Centric Retrieval-Augmented Generation for Knowledge-based Visual Question Answering
Geometry-Guided Modeling of Foundation Features Enables Generalizable Object Shape Deformation Learning
Learning Coherent Representations: A Topological Approach to Interpretability
Hyperbolic Multimodal Continual Learning
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
Unfolding Generative Flows with Koopman Operators: Trajectory-Preserving Linearization
Beyond ReLU: Bifurcation, Oversmoothing, and Topological Priors
Convex Distance Operator Transport: Convex and Geometry-Preserving Information
Trust Region Masking for Long-Horizon LLM Reinforcement Learning
Graph Alignment via Dual-Pass Spectral Encoding and Latent Space Communication
Reinforcement Learning for Tool-Calling Agents in Fast Healthcare Interoperability Resources (FHIR)
Equivariant Covariance Tensors: Guaranteed SPD Uncertainty for Tensor-Valued Geometric Learning
Asymmetric Multi-View Clustering with Hyperbolic Uncertainty Modeling
FOCUS: DLLMs Know How to Tame Their Compute Bound
Rethinking Multimodal Time-Series Forecasting Evaluation
Softsignum: Smooth Your Signum For Better Heterogeneity Handling
MalTree: Tracing Malware Evolution using Embeddings at Scale
Autoregressive Boltzmann Generators
Optimal Transport Group Counterfactual Explanations
EchoAttention: Exploiting Token-Pair Redundancy and Frame-Block Similarity for Efficient Long Video Generation
Escaping the Mode: Multi-Answer Reinforcement Learning in LMs
Spatio-Temporal LLM: Reasoning about Environments and Actions
Conditional KRR: Injecting Unpenalized Features into Kernel Methods with Applications to Kernel Thresholding
MAPS: Memory-Aware Predictive Scheduling Framework for Large Language Models Serving
MOD-SR: Unifying Multimodal Learning and Direct Optimization with Gradient-Guided Diffusion Model for Symbolic Regression
Optimizing Visual Generative Models via Distribution-wise Rewards
Robust AI Evaluation through Maximal Lotteries
Order Matters: Unveiling the Hidden Impact of Macro Placement Sequences via Proxy-Guided LLM Evolution
Learning the Interaction Prior for Protein-Protein Interaction Prediction: A Model-Agnostic Approach
Steer Like the LLM: Activation Steering that Mimics Prompting
LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer
Geometric Embedding Alignment via Curvature Matching in Transfer Learning
Improved Dimension Dependence for Bandit Convex Optimization with Gradient Variations
Knowing Who, Not How Much: Learning-Augmented Mechanisms for Consumer Utility Maximization
Two-dimensional quantization for geometry-aware audio coding
Harmonized Dual Policy Improvement for Model-based Reinforcement Learning
LoPhyDA: Low-Rank Tensor and Physics Gradient Guided Diffusion for Atmospheric Data Assimilation
Bad Seeing or Bad Thinking? Rewarding Perception for Multimodal Reasoning
Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization
Learning to Refine: Spectral-Decoupled Iterative Refinement Framework for Precipitation Nowcasting
Power-Boosted Granger-Causal Discovery for Large Heterogeneous Panel Data
From Welfare to Utility: Generalized Objectives in Budget-Feasible Procurement
Scalable Kronecker-Factored Fisher Approximation for Neural Network Parameter Sensitivity
Improving LLM-Based Recommenders with Conservative Generative Flow Networks
Who can we trust? LLM-as-a-jury for Comparative Assessment
LDARNet: DNA Adaptive Representation Network with Learnable Tokenization for Genomic Modeling
Spatial Memory for Out-of-Vision Manipulation in Vision-Language-Action
Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning
Wasserstein Geometry-Aware Adaptive Control via Meta-Learning
Recognize Your Orchestrator: An Entropy Dynamics Perspective for LLM Multi-Agent Systems
Contractive Anchor Resolvent Diffusion for Incomplete Multi-View Clustering
Learning GUI Grounding with Spatial Reasoning from Visual Feedback
The Accumulation of Score Estimation Error in Diffusion Models
AugServe: Adaptive Request Scheduling for Augmented Large Language Model Inference Serving
Adaptive Memory Retention in Dynamic Graphs
CauchyNet: Compact and Data-Efficient Learning using Holomorphic Activation Functions
Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL
CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers
d3LLM: Ultra-Fast Diffusion LLM using Pseudo-Trajectory Distillation
Optimal Top-$k$ Identification from Pairwise Comparisons
Tracing the Persona Circuit: How Large Language Models Encode and Express Character Traits
Agentic Monte Carlo: Reinforcement Learning for Black-Box LLM Agents
Towards Long-Horizon Interpretability: Efficient and Faithful Multi-Token Attribution for Reasoning LLMs
Taming the Aleatoric Impulse in Off-Policy Reinforcement Learning
Langevin Rollout Optimization for Modelic Reinforcement Learning
Unlearning Isn’t Forgetting: Revealing Hidden Leakage in Class Unlearning Evaluations
Native Spatio-Temporal 4D Variational Autoencoder
TranX-Adapter: Bridging Artifacts and Semantics within MLLMs for Robust AI-generated Image Detection
Towards Universal Gene Regulatory Network Inference: Unlocking Generalizable Regulatory Knowledge in Single-cell Foundation Models
Understanding Reasoning Collapse in LLM Agent Reinforcement Learning
Dual-branch Robust Unlearnable Examples
LMCleaner: Efficient and Certified Online Unlearning via Influence Propagation Truncation
SMD: Multi-view Safety-Critical Driving Video Generation in the Real-world Domain
The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?
DDSVM: A Differentiable Framework for Deep Support Vector Machines with Iterative Geometry-Aware Optimization
Robust Strategic Classification under Decision-Dependent Cost Uncertainty
Theoretical Analysis of Sparse Optimization with Reparameterization, Weight Decay, and Adaptive Learning Rate
Dataset Distillation Efficiently Encodes Low-Dimensional Representations from Gradient-Based Learning of Non-Linear Tasks
Learning Treatment Representations for Downstream Instrumental Variable Regression
Process Reward Agents for Steering Knowledge-Intensive Reasoning
Pix2Key: Controllable Open-Vocabulary Retrieval with Semantic Decomposition and Self-Supervised Visual Dictionary Learning
Attention Hijacking: Backdooring Text Dataset Distillation via Semantic Anchors
InstEmb: Instruction-Following Embeddings through Glimpses of the Future
STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems
Well-Posed KL-Regularized Control via Wasserstein and Kalman–Wasserstein KL Divergences
GoodDiffusion: Proactive Copyright Protection for Diffusion Generative Models via Learnable Sample-specific Signatures
Why Dedicated Critics: Eliminating Target Drift in Multi-Constraint RL
CARE: Adaptive Calibration for Reliable Recommendations
Progressive Graph Structure Adjustment for Homophily Shift Adaptation
Who’s in Charge? Disempowerment Patterns in Real-World LLM Usage
Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality
Statistical Learning Theory in Lean 4: Empirical Processes from Scratch
ECO: Quantized Training without Full-Precision Master Weights
CORAL: Uncertainty-Aware Regulation of Exposure Concentration in Recommender Systems
Localize and Neutralize: Gradient-Guided Token Suppression Against Visual Prompt Injection Attack
The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works
Identifying Connectivity Distributions from Neural Dynamics Using Flows
SORA: Free Second Order Attacks in Fast Adversarial Training
Representation Drift Compensation: A Zero-Cost Enhancement for LLM Decomposition
ChaosNexus: A Foundation Model for ODE-based Chaotic System Forecasting with Hierarchical Multi-scale Awareness
Adaptive Time Series Reasoning via Segment Selection
Ellipsoidal Time Series Forecasting
When to Trust the Cheap Check: Weak and Strong Verification for Reasoning
BTSP-CAM: A Brain-Inspired Geometric Memory for Class-Incremental Learning
When Simple Problems Wear Complex Costumes: Improving Efficiency in LRM’s Adaptive Reasoning
APIC: Orthogonalized Neuro-Symbolic Modeling for Nonlinear Dissipative Dynamics
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards
The Velocity Deficit: Initial Energy Injection for Flow Matching
SOLAR: Self-supervised Joint Learning for Symmetric Multimodal Retrieval
Continual Segmentation under Joint Nonstationarity
TimeOmni-VL: Unified Models for Time Series Understanding and Generation
Dreaming in Code for Curriculum Learning in Open-Ended Worlds
Deep Residual Injection for Full-Spectrum Forensic Signal Perception in Multimodal Large Language Models
Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach
EpiTwin: Spatiotemporal Graph Transformers for Epileptic sEEG Signal Reconstruction
The Silent Thought: Modeling Internal Cognition in Full-Duplex Spoken Dialogue Models via Latent Reasoning
Low-cost Full Fine-tuning: Learning What to Update for LLMs
Minimum Distance Summaries for Robust Neural Posterior Estimation
Semantic Cache Distillation: Efficient State Transfer via Reuse and Selective Patching
SE-GA: Memory-Augmented Self-Evolution for GUI Agents
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
Where Signals Are Sparse, We Synthesize: Reinforcing Self-Corrective Reasoning in Vision–Language Models via Rollout Augmentation
PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency
Do You Want to Know if Two Distributions Are Close to Each Other?Testing the Closeness With Statistical Significance
Representation Learning for Equivariant Inference with Guarantees
Position: Modular Safety Guardrails Are Necessary for Foundation-Model-Enabled Robots in the Real World
R-Diverse: Mitigating Diversity Illusion in Self-Play LLM Training
Tractable Expected Information Gains for Exponential Family Posteriors
CacheEdit: Efficient Multi-round Image Editing via Adaptive Token-wise Reuse.
From Abstraction to Instantiation: Learning Behavioral Representation for Vision-Language-Action Model
Awakening Visual Reasoning: Mitigating Post-Training Failure in Vision-Text Compression
HypCL: Adapting CLIP in Hyperbolic Space for Continual Learning
Grouter: Decoupling Routing from Representation for Accelerated MoE Training
AREA: Attribute Extraction and Aggregation for CLIP-Based Class-Incremental Learning
SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning
Predictive variational inference: Learn the predictively optimal posterior distribution
Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention
MRPO: Magnitude-Regularized Policy Optimization via L1 Constraints
PRISM: Demystifying Retention and Interaction in Mid-Training
MacroGuide: Topological Guidance for Macrocycle Generation
Transporting Task Vectors across Different Architectures without Training
FlowMAP: Flow Matching for Generalizable Agent Planning
Identifying and Mitigating Errors in Gradient Aggregation of Distributed Data Parallel Training
LIMMT: Less is More for Motion Tracking
Fast Spectrally Sparse Signal Reconstruction via Jacobi-Preconditioned Gradient Descent
Optimal structure learning and conditional independence testing
Asymmetric conformal prediction with penalized kernel sum-of-squares
Localized, High-resolution Geographic Representations with Slepian Functions
Autoregressive, Yet Revisable: In Decoding Revision for Secure Code Generation
QuantWear: Quantum-scale Wear Particle Detection for Jet Engine Diagnosis
Nash Equilibria in Games with Playerwise Concave Coupling Constraints: Existence and Computation
Mirror Mean-Field Langevin Dynamics
Improving Graph Transformers via Global Structural Priors
PosterAgent: Agentic Poster Generation via Stage-Aware Reinforcement Learning
The Hidden Risk: Membership Inference Attacks on Multimodal Federated Learning via Modality Imbalance
Cognitive Fatigue in Autoregressive Transformers: Formalization and Measurement
Demystifying Mergeability: Interpretable Properties to Predict Model Merging Success
Practical and Optimal Algorithm for Linear Contextual Bandits with Rare Parameter Updates
Hardware-Aware Dynamic Sparse Training for Large Output Spaces
RA-VLA: Retrieval-Augmented VLA for Test-Time Adaptation
From geometry to dynamics: Learning overdamped Langevin dynamics from sparse observations with geometric constraints
LieWarper: Geometry-Aware Motion Transfer via Lie Algebra
Spik4lite: Refactoring Neuromorphic Sparsity for Efficient Spiking Neural Networks on Commodity Edge Devices
Inference Time Optimization with Confidence Dynamics
T-Edit: Triple-Branch Diffusion Anchoring for Consistent Editing
Differentially Private Continual Release with Relative Error
Sample from What You See: Visuomotor Policy Learning via Diffusion Bridge with Observation-Embedded Stochastic Differential Equation
Incentivized Exploration with Stochastic Covariates: A Two-Stage Mechanism Design for Recommender System
DecomPose: Disentangling Cross-Category Optimization Contention for Category-Level 6D Object Pose Estimation
The Structural Origin of Attention Sink: Variance Discrepancy, Super Neurons, and Dimension Disparity
Fine-Tuning Masked Diffusion for Provable Self-Correction
Optimal Design for Multinomial Logit Model with Applications to Best Assortment Identification
Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis
Discovering Ordinary Differential Equations with LLM-Based Qualitative and Quantitative Evaluation
Breaking the Lock-in: Diversifying Text-to-Image Generation via Representation Modulation
Towards Fair Sequential Decision-Making: A Causal Decomposition Approach
Semantic Granularity Navigation in Image Editing
ROAMM: A Benchmark Dataset for Multimodal Human Attention Decoding and EEG-to-Text Modeling During Naturalistic Reading
MSP: Probabilistically Consistent Multi-Scale Action Generation
Follow-the-Perturbed-Leader for Decoupled Bandits: Best-of-Both-Worlds and Practicality
Robust Multi-View Fusion via Prototype-Anchored Unbalanced Optimal Transport
COGNOS: Universal Enhancement for Time Series Anomaly Detection via Constrained Gaussian-Noise Optimization and Smoothing
Unveiling Prior-data Fitted Networks on Causal Effect Estimation: Pre-training or Finetuning?
Lavida-R1: Advancing Reasoning for Unified Multimodal Diffusion Language Models
Geometry-Aware Probabilistic Circuits via Voronoi Tessellations
Corrected Samplers for Discrete Flow Models
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing
Beyond Fixed Biases: Decoding the Role of Reasoning Uncertainty in MLLM Modality Conflicts
Decentralized Online Convex Optimization with Efficient Communication: Improved Algorithm and Lower Bounds
ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning
Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
Distribution Matching Variational AutoEncoder
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
LogicSAGE: Neuro-Symbolic Reasoning with Socratic-Guided Enhancement
Degradation-Aware Metric Prompting for Hyperspectral Image Restoration
Error Analysis of Discrete Flow with Generator Matching
Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
D$^2$O: A Dual Debiasing Operator for Training-Free Test-Time Adaptation of Vision–Language Models
Deep Learning for Bioimaging: What are we actually learning?
Topological Active Inference for Task Disambiguation
Beyond Rational Illusion: Behaviorally Realistic Strategic Classification
When Tabular Foundation Models Meet Strategic Tabular Data: A Prior Alignment Approach
SemRep: Code Transformation with Semantics-Preserving Representations
Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing
AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture -of-Transformers for End-to-End Autonomous Driving
A Pure Hierarchical Spectral Parcellation Network for Brain Network Analysis
DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection
OMP: One-step Meanflow Policy with Directional Alignment
Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection
Newton-coupled Dual-Teacher Semi-supervised Learning Framework
Efficient Continuous-Depth Modeling with GRU Equivalents
ReaForest: Fostering Generative Video Reasoning for Spatial Planning
Latent Laplace Diffusion for Irregular Multivariate Time Series
OmniDenseCap: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
End-to-end Graph-structured Brain Representation Learning
VideoSeeker: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
On Efficient Scaling of GNNs via IO-Aware Layers Implementations
A Formal Comparison Between Chain of Thought and Latent Thought
3D MeanFlow: One-Step Point Cloud Completion and Generation via Average-Velocity Transport
MASH: Modeling Abstention via Selective Help-Seeking
Toward Stable Value Alignment: Introducing Independent Modules for Consistent Value Guidance
A Conflict-aware Evidential Framework for Reliable Sleep Stage Classification
Beyond Soft Labels: Unifying Dataset Pruning and Distillation for Efficient Large-scale Compression
Rethinking Video Generation Model for the Embodied World
SaTeen: Learning Structural Alignment for Continual Test-Time Adaptation
Open-o3-Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence
STLA: Spatiotemporal Lookahead Alignment for Post-Training Quantization
Inducing LLM Workflows with Bilevel Optimization and Textual Gradients
Towards One-to-Many Temporal Grounding
Contrastive Weak-to-Strong Generalization
Bilinear Bandits with Partially Observable Features
WEVSR: Adapting Video Diffusion Generators to Real-World Video Super‑Resolution with Wavelet-Enhanced VAE Encoder
Dustin: Draft-Augmented Sparse Verification for Efficient Long-Context Generation with Speculative Decoding
Position: Solipsistic superintelligence is unlikely to be cooperative
Routing and Reasoned Evaluation with Large Language Models
Revisiting Pre-Propagation GNNs: Robust Diffusion Operators and Hidden-State Re-Propagation
Learning to Extrapolate to New Tasks: A Relational Approach to Task Extrapolation
PISA: Privacy-Preserving Split Adaptation with Model IP Protection
GEM-FI: Gated Evidential Mixtures with Fisher Modulation
On the Power of (Approximate) Reward Models for Inference-Time Scaling
Block Rotation is All You Need for MXFP4 Quantization
PatternKV: Flattening KV Representation Expands Quantization Headroom
Breaking the Self-Confirming Loop: Diagnosing and Mitigating Systemic Reward Bias in Self-Rewarding RL
Many Needles in a Haystack: Active Hit Discovery for Perturbation Experiments
Are Common Substructures Transferable? Understanding Transferability in Graph Pretraining under Riemannian Geometry
UniMedVL: Unifying Medical Multimodal Understanding and Generation through Observation-Knowledge-Analysis
Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution
Does a Hybrid Space-Aware Randomized Defense Improve Empirical and Certified Adversarial Robustness?
Scalable and Differentiable Point-Cloud Registration Using Maximum Mean Discrepancy
Position: Predictive Uncertainty Is Not Enough -- Joint Distribution for Full Uncertainty Representation
A Theoretical Framework for Statistical Evaluability of Generative Models
Omitted Variable Bias in Language Models Under Distribution Shift
Query Circuits: Explaining How Language Models Answer User Prompts
Information dynamics and Memory in Neural Networks through Fisher Information Diffusion
Exactly Computing do-Shapley Values
General Analysis of LMO-based Optimizers: Beyond Bounded Variance
Benchmarking and Enhancing VLM for Compressed Image Understanding
A Computational Framework for Evaluating Human-likeness in LLMs' Open-ended Human Behaviors
Contextualized Privacy Defense for LLM Agents
Self-Augmenting Retrieval for Diffusion Language Models
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions
EmWorld: Emotion World Model with Latent State Evolution for Scenario-Incremental Dynamic Facial Expression Recognition
CLAM-Bench: Benchmarking LLM Agents for Library-Scale Cross-Architecture Migration
Linear Bandits beyond Inner Product Spaces, the case of Bandit Optimal Transport
TSP with predictions
Physics-informed diffusion models in spectral space
Ophiuchus: Incentivizing Tool-augmented ''Think with Images'' for Joint Medical Segmentation, Understanding and Reasoning
Can Microcanonical Langevin Dynamics Leverage Mini-Batch Gradient Noise?
Position: The Time for Sampling Is Now! Charting a New Course for Bayesian Deep Learning
AtelierEval: Agentic Evaluation of Humans & LLMs as Text-to-Image Prompters
GeoFlow: Geo-Aware Modeling of Inter-Area Relationships in OD Flow Prediction and Generation
Position: Human-Centric Vision Requires Topological Generalization Beyond Fixed Skeletal Topologies
CausalGame: Benchmarking Causal Thinking of LLM Agents in Games
CentaurEval: Benchmarking Human-in-the-Loop Value in Agentic Coding
Selective Deferred Routing: Enabling Cost-Efficient Collaboration between Local SLMs and Remote LLMs
All Circuits Lead to Rome: Rethinking Functional Anisotropy in Circuit and Sheaf Discovery for LLMs
World-Model Inspired Emotion-aware Token Refinement for Training-Free Multimodal Emotion Recognition
One-shot Conditional Sampling: MMD meets Nearest Neighbors
An Empirical Study of Memory Poisoning Defenses for LLM Agents
Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language
On the Learnability of Test-Time Adaptation: A Recovery Complexity Perspective
STEP: Warm-Started Visuomotor Policies with Spatiotemporal Consistency Prediction
One Bias After Another: Mechanistic Reward Shaping and Persistent Biases in Language Reward Models
Principled SVD-based Delta Compression via Quantization Error Minimization
PhysHanDI: Physics-Based Reconstruction of Hand-Deformable Object Interactions
Semantic-Aware Motion Encoding for Topology-Agnostic Character Animation
Neural Low-Discrepancy Sequences
Language Bias in LVLMs: From In-Depth Analysis to Simple and Effective Mitigation
Modality-Decoupled Online Recursive Editing
Uncovering Latent Communication Patterns in Brain Networks via Adaptive Flow Routing
Monitorability as a Free Gift: How RLVR Spontaneously Aligns Reasoning
AgentVocab: Structure-Aware Vocabulary Adaptation for Efficient LLM Agents
Expanding the Chaos: Neural Operator for Stochastic (Partial) Differential Equations
Entropy-aware Span-Constrained Optimal Transport for Robust Cross-Tokenizer Knowledge Distillation
Secure Multi-agent Reinforcement Learning for Service Systems with Affinity and Byzantine Nodes: Stability Analysis and Protection Design
From Parameter Dynamics to Risk Scoring: Quantifying Sample-Level Safety Degradation in LLM Fine-tuning
LLM-based Embeddings: Attention Values Encode Sentence Semantics Better Than Hidden States
Possibilistic Predictive Uncertainty for Deep Learning
OmniFit: Bridging Modalities via Layer-Adaptive Token Compression for Omnimodal Large Language Models
DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents
Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning
RGMem: Renormalization Group–inspired Memory Evolution for Language Agents
DTKG: Dual-Track Knowledge Graph-Verified Reasoning Framework for Multi-Hop QA
Are Object-Centric Representations Better At Compositional Generalization?
Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration
MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training
Physiology as Language: Translating Nocturnal Breathing to EEG
KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering
Attention Projection Mixing with Exogenous Anchors
Decoupling Skeleton and Flesh: Efficient Multimodal Table Reasoning with Disentangled Alignment and Structure-aware Guidance
Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking
ASRU: Activation Steering Meets Reinforcement Unlearning for Multimodal Large Language Models
Optimizing Inference-Time Compute for Medical Reasoning via Uncertainty Quantification
Q-Tab: Quantized Tabular Data Generator
What Reward Structure Enables Efficient Sparse-Reward RL? A Proof-of-Concept with Policy-Aware Matrix Completion
Sheaf Neural Networks on SPD Manifolds: Second-Order Geometric Representation Learning
EvoMAS: Heuristics in the Loop—Evolving Smarter Agentic Workflows
Hard-Constrained Graph Generation with Discrete-Projection Diffusion
Removing Noise, not Finding Gold: Quality Filtering for Large-Scale Pretraining
Metric–-Phase Fields: Decoupling Distance and Sign for Thin-Structure Reconstruction from Unoriented Point Clouds
Beyond Buffer Limits: Energy-Based Data Reassembly for Continual Learning
A Risk Decomposition Framework for Pre-hoc Fine-tuning Prediction
TuneAhead: Predicting Fine-tuning Performance Before Training Begins
Gradient Flow Dynamics and Implicit Bias of Diagonal Linear Networks under Infinitesimal Initialization
Towards Understanding Adam Convergence on Highly Degenerate Polynomials
Nested Spatio-Temporal Time Series Forecasting
NITP: Next Implicit Token Prediction for LLM Pre-training
Adaptive Preconditioners Trigger Loss Spikes in Adam
Infinite-Precision Autoregressive Modeling for Vector Graphics and Layouts
Hyperbolic Associative Memory Networks
Credibility-Aware Weighting Federated Causal Discovery for Time Series
DisPPO: Quantile-Based Distributional Reinforcement Learning for Large Language Models
Towards Understanding Generalization of Federated Adversarial Learning: Perspective of Algorithmic Stability
NetDiff: Graph Diffusion with Improved Global Capabilities to Generate and Update Mobile Network Topologies
Label-Guided Representation Learning for Incomplete Multi-View Multi-Label Classification
A Tale of Two Graphs: Separating Knowledge Exploration from Outline Structure for Open-Ended Deep Research
COLLIE: Guiding Skill Discovery in Semantically Coherent Latent Space
What You Think is What You See: Driving Exploration in VLM Agents via Visual-Linguistic Curiosity
MedCRP-CL: Continual Medical Image Segmentation via Bayesian Nonparametric Semantic Modality Discovery
GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training
Video-BCI: Bayesian Cognitive Integration of Self-Prior Hypotheses for Video Understanding
MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks
FlexiFlow: decomposable flow matching for generation of flexible molecular ensemble
Spiked-CFR: Causal Representation Learning from LLMs via Wasserstein Projection Pursuit
Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation
Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion
Preference-Calibrated Optimization with Score-Level Distribution Alignment for Text-to-Image Diffusion Model Unlearning
On the Provable Suboptimality of Momentum SGD in Nonstationary Stochastic Optimization
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn
Improving Visual Token Reduction via Rectifying Distortions for Efficient Multimodal LLM Inference
NeuroMamba: A Universal Spatiotemporal Module for Robust Perception in Degraded Sensory Streams
LeakGFN: Robust Molecular Generation in Generative Flow Networks via Flow Decomposition
Breaking the Reference Bottleneck via Learning to Rewrite Conversational Queries without Gold Reference Passages
PromptRL: Prompt Matters in RL for Flow-Based Image Generation
DisPOSE: Projected Polystochastic Diffusion for Self-Supervised Multi-View 3D Human Pose Estimation
Shapley Regularized Neural Granger Causality
Offline Two-Player Zero-Sum Markov Games with KL Regularization
Online Compatible Reward Identification from Preference Feedback
From Shortcuts to Reasoning: Robust Post-Training of Theory of Mind with Reinforcement Learning
Accelerating Langevin Monte Carlo via Efficient Stochastic Runge-Kutta Methods beyond Log-Concavity
PINNfluence: Interpreting PINNs through Influence Functions
Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models
Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning
Position: Let’s Build a Trustworthy Model Context Protocol!
The Lie We Tell: Correcting the Euclidean Fallacy in Vision Language Action Policies via Score Matching on Tangent Space
JADAI: Jointly Amortizing Adaptive Design and Bayesian Inference
InfoLaw: Information Scaling Laws for Large Language Models with Quality-Weighted Mixture Data and Repetition
Efficient Transformer Attention for SNNs via Hadamard Simplification
BeaconKV: Key-Value Cache Compression Guided by Beacon Queries for Efficient Large Reasoning Model Inference
Gradient-Free Approaches is a Key to an Efficient Interaction with Markovian Stochasticity
3DPoV: Improving 3D understanding via Patch Ordering on Videos
Target-Oriented Pretraining Data Selection via Neuron-Activated Graph
Reinforcement Learning with Evolving Rubrics for Deep Research
Exact Functional ANOVA Decomposition for Categorical Inputs
VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos
Large-Scale Molecular Dynamics Simulations: Direct Interatomic Modeling with Dilated Message Passing
Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Compression
VCG-Bench: Towards A Unified Visual-Centric Benchmark for Structured Generation and Editing
Evaluating Robustness of Reasoning Models on Parameterized Logical Problems
New Algorithms for Fully-Dynamic k-center with Outliers
The Expert Strikes Back: Interpreting Mixture-of-Experts Language Models at Expert Level
Why Deep Jacobian Spectra Separate: Depth-Induced Scaling and Singular-Vector Alignment
Trustworthy Federated Label Distribution Learning under Annotation Quality Disparity
Singularity-aware Optimization via Randomized Geometric Probing: Towards Stable Non-smooth Optimization
Privacy-Aware Data Integration for Enhanced Quantile Inference under Heterogeneity
PEARL: Differentially Private and Entropy-Aware Regulated Language Generation
Trust Region Inverse Reinforcement Learning
Group-wise Data Ordering: Enhancing Instruction Tuning of Large Language Models via Embedding Proximity
LiveFigure: Generating Editable Scientific Illustration with VLM Agents
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Video Generation
Hallucination is a Consequence of Space-Optimality: A Rate-Distortion Theorem for Membership Testing
SciNet: Evaluating AI Agents in Relation-Aware Scientific Literature Retrieval
Functional Equivalence in Attention: A Comprehensive Study with Applications to Linear Mode Connectivity
Conservation Laws for Modern Neural Architectures
$G^2$-Reader: Dual Evolving Graphs for Multimodal Document QA
Geometric and Stochastic Analysis of Discontinuities in Sparse Mixture-of-Experts
OPT-Engine: Benchmarking the Limits of LLMs in Optimization Modeling via Complexity Scaling
Efficient Training of Boltzmann Generators Using Off-Policy Log-Dispersion Regularization
MCP-Persona: Benchmarking LLM Agents on Personalized MCP Tools and Tasks
Supervised Classification Heads as Semantic Prototypes: Unlocking Vision-Language Alignment via Weight Recycling
Ultrafast On-Chip Online Learning via Spline Locality in Kolmogorov–Arnold Networks
Unsupervised Partner Design Enables Robust Ad-hoc Teamwork
MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning
Fast Mixing Steady-State Control in Markov Decision Processes
Solving Stochastic Variational Inequalities without the Bounded Variance Assumption
Learning to Rank from Incomplete Rankings
Simultaneous Speech-to-Speech Translation Without Aligned Data
BLISS: A Lightweight Bilevel Influence Scoring Method for Data Selection in Language Model Pretraining
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
FlashSketch: Sketch-Kernel Co-Design for Fast Sparse Sketching on GPUs
PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios
AdaMeZO: Adam-style Zeroth-Order Optimizer for LLM Fine-tuning Without Maintaining the Moments
Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following
Think in Cloud, Look at Edges: Semantic-Driven Query Decomposition for Efficient Video Reasoning
RGGT: A Generative-Prior-Guided Transformer for Unified Rigid and Non-Rigid Point Cloud Registration
SL-VC: A Benchmark and Automated Framework for Separation Logic Verification Condition Proving
D²Evo: Dual Difficulty-Aware Self-Evolution for Data-Efficient Reinforcement Learning
Online Bayesian Experimental Design for Partially Observed Dynamical Systems
MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging
EVMbench: Evaluating AI Agents on Smart Contract Security
Flexible Kernels for Protein Property Prediction
Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models
Adaptive Utilization of Low-Rank Adaptation via Conditioned Gating
Eyes-on-Me: Scalable RAG Poisoning through Transferable Attention-Steering Attractors
DisjunctiveNet: Neural Symbolic Learning via Differentiable Convexified Optimization Layers
Decision Transformers As Zero-Shot Learners via Text-Behavior Alignment
DDIM Inversion as a Perturbation Amplifier: Breaking Mimicry Protection via Reconstruction Error Minimization
From Winning to Understanding: A Diagnostic Long-Horizon RTS Benchmark for LLMs
Path-conditioned training: a principled way to rescale ReLU neural networks
Functional Cache Grafting: Robust and Rapid Code-Policy Synthesis for Embodied Agents
Interpreting Genomic Language Models using Sparse Autoencoders
Training-Trajectory-Aware Token Selection
Efficient Skill Grounding via Code Refactoring with Small Language Models
Counterfactual Bootstrap for Robust Meta-Reinforcement Learning
RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry
LipoPU: Pocket-level Prediction of Lipid-Protein Interactions via Positive-Unlabeled Learning
Cello: A Universal Cell-wise Feature Aggregation framework for Reliable Pathology Images Analysis
BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics
Effective Model Pruning : Measuring the Redundancy of Model Components
Off-Policy Evaluation for Missingness-Aware Policies in MDPs with Rewards Missing Not at Random
Federated Data and Feature Selection by Generalized CUR Decomposition
Learn to Merge: Meta-Learning for Adaptive Multi-Task Model Merging
Position: AI/ML Deepfake Research is Misaligned with AI Generated Non-Consensual Intimate Imagery (AIG-NCII)
AdaSplash-2: Faster Differentiable Sparse Attention
DRPBench: Evaluating LLMs in Concurrent Code Comprehension via Fine-grained Data Race Prediction
Proteo-R1: Thinking Foundation Models for De Novo Protein Binder Design
Feature Bagging Provides Stability
CRPO: Character-centric Group Relative Policy Optimization for Role-aware Reasoning in Role-playing Agents
COBRA: Contribution-Based Bayesian Rank Allocation for Parameter-Efficient Fine-Tuning
FIBER: A Differentially Private Optimizer with Filter-Aware Innovation Bias Correction
Beyond Independent Genes: Learning Module-Inductive Representations for Gene Perturbation Prediction
Chebyshev Policies and the Mountain Car Problem: Reinforcement Learning for Low-dimensional Control Tasks
Scalable GANs with Transformers
Moving Out: Physically-grounded Human-AI Collaboration
Dimension-Free Multimodal Sampling via Preconditioned Annealed Langevin Dynamics
Overcoming the Incentive Collapse Paradox
From Noise to Intent: Anchoring Generative VLA Policies with Residual Bridges
From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing
DiscoverLLM: From Executing Intents to Discovering Them
Time-Conditioned Foreseeing: An EHR-Specific Foundation Model for Irregular Dynamics and Calendrical Time
Telescope: Improving Zero Shot Detection of LLM Generated Content By Measuring Token Repetition Probability
$f$-Trajectory Balance: A Loss Family for Tuning GFlowNets, Generative Models, and LLMs with Off- and On-Policy Data
Dynamic Relational Priming Improves Transformer in Multivariate Time Series
Plug-and-Play Label Map Diffusion for Universal Goal-Oriented Navigation
Anchor-Final Self-Supervision Drives Hallucination-Aware Optimization in Large Vision-Language Models
MonoScale: Scaling Multi-Agent System with Monotonic Improvement
Causal-EPIG: Causally Aligned Active CATE Estimation
PartCo: Part-Level Correspondence Priors Enhance Category Discovery
NeuralFLoC: Neural Flow-Based Joint Registration and Clustering of Functional Data
Identifiable Smooth Conjugacy Learning via Adversarial Orthogonality
4DPC$^2$hat: Towards Dynamic Point Cloud Understanding with Failure-Aware Bootstrapping
Enhancing Conformal Prediction via Class Similarity
Preserving Plasticity in Continual Learning via Dynamical Isometry
MOES-Pred: Molecular Structural Representation Learning by Adaptive Energy-Sentinel Vibration for Generalized Property Prediction
Priority-Aware Shapley Value
Collaborative Learning for Semi-Supervised LiDAR Semantic Segmentation
Unveiling And Addressing Dimensional Collapse In Vector Quantization Models Via Codebook Regularization
Future-Gain Guided Test-Time Learning for Large Language Models
Mixing Configurations for Downstream Prediction
Improving Adversarial Robustness of Attribution via Implicit Regularization
HIER: Human-in-the-Loop Imagination–Execution Refinement for General Real-World Vision-Language-Action Models
$\alpha$-PFN: Fast Entropy Search via In-Context Learning
Spectral-Informed Neural Networks Outperform Spectral methods in High-dimensional PDEs
Learning Hamiltonian Dynamics at Scale: A Differential-Geometric Approach
Segment Anything with Robust Uncertainty-Accuracy Correlation
Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment
Robust In-Context Reinforcement Learning Under Reward Poisoning Attacks
Pose-ICL: 3D-Aware In-Context Learning for Pose-Controllable Subject Customization
Reward Learning through Ranking Mean Squared Error
Position: Machine Learning Research Should Be Guided by Explicit, Pluralistic Models of Human Purpose
CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation
Orchestrating Spatial Semantics via a Zone-Graph Paradigm for Intricate Indoor Scene Generation
Diversity-Driven Offline Multi-Objective Optimization via Bi-Level Pareto Set Learning
Laplacian Representations for Decision-Time Planning
Rational Transductors
End-to-End Compression for Tabular Foundation Models
Universal Multiclass Transductive Online Learning
Realizable Bayes-Consistency for General Metric Losses
Not All Answers Are Contextually Persuadable: Inference Dynamics in Large Language Models under Contextual Influence
Quantifying the Effect of Noise in Language Generation
Learning Partial Concept Classes and Universal Rates Under Massart Noise
RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference
Antidistillation Fingerprinting
Scout Before You Attend: Sketch-and-Walk Sparse Attention for Efficient LLM Inference
MMClima: A Framework for Multimodal Climate Science Data and Evaluation
Text-Conditional JEPA for Learning Semantically Rich Visual Representations
Learning Human-Robot Collaboration via Heterogeneous-Agent Lyapunov Policy Optimization
InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context
Inference of Online Newton Methods with Nesterov's Accelerated Sketching
Probabilistically-routed Bayesian Additive Spanning Trees for Learning on Constrained Domains
Perceptrons and Localization of Attention’s Mean-Field Landscape
Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning
Revisiting the Bertrand Paradox via Equilibrium Analysis of No-regret Learners
One Bug, Hundreds Behind: LLMs for Large-Scale Bug Discovery
Accelerating Regression Tasks with Quantum Algorithms
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation
CL-GCL: Comprehensive and Lightweight Graph Contrastive Learning
SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes
Optimal Statistical Guarantees for Diffusion Models on Low-Dimensional, Multi-Modal Data
Infinite Mask Diffusion for Few-Step Distillation
DenseSteer: Steering Small Language Models towards Dense Math Reasoning
CodeClash: Benchmarking Goal-Oriented Software Engineering
The Forgetting-Retention Dilemma: Certified Unlearning Theory in Continual Learning
SkillNet: Hierarchical Skill Modeling for Compositional Generalization in Vision-Language Action Models
Theory of Continual Learning Against Data Poisoning Attacks
SCalDA: Semantics-Calibrated and Diffusion-Enhanced Data Augmentation
Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals
Insertion Based Sequence Generation with Learnable Order Dynamics
WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems
DTop-p MoE: Sparsity-Controlled Dynamic Top-p MoE for Foundation Model Pre-training
Adaptive Volumetric Mechanical Property Fields Invariant to Resolution
Entropy-Aware On-Policy Distillation of Language Models
Equivariant Neural Networks for General Linear Symmetries on Lie Algebras
On the Expressive Power of GNNs to Solve Linear SDPs
Spectral Guidance for Flexible and Efficient Control of Diffusion Models
Near-Universal Multiplicative Updates for Nonnegative Einsum Factorization
MAMBO-G: Magnitude-Aware Mitigation for Boosted Guidance
Transformed Latent Variable Multi-Output Gaussian Processes
Faster Activation Functions at the Edge for Post-Training Speedups
Language Models as Nodes: Constructing a High-Level Neural Network
From Noise to Control: Parameterized Diffusion Policies
AlignVid: Taming Visual Dominance via Training-Free Attention Modulation in Text-guided Image-to-Video Generation
LABO: LLM-Accelerated Bayesian Optimization through Broad Exploration and Selective Experimentation
Provably Valid Uncertainty Quantification for Deep Computed Tomography
Learning to Move Before Learning to Do: Task-Agnostic pretraining for VLAs
What Do Agents Learn from Trajectory-SFT: Semantics or Interfaces?
Dissect and Prune: Enhancing Robustness in AI-Generated Image Detection
Bottleneck Communication Delay Minimization for Communication-Efficient Decentralized Learning
Unfolded Laplacian Spectral Embedding: A Theoretically Grounded Approach to Dynamic Network Representation
GuidedBridge: Training-freely Improving Bridge Models with Prior Guidance
Black-Box Combinatorial Optimization with Order-Invariant Reinforcement Learning
FloorplanQA: A Benchmark for Spatial Reasoning in LLMs using Structured Representations
Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs
Olivia: Harmonizing Time Series Foundation Models with Power Spectral Density
Beyond Extrapolation: Knowledge Utilization Paradigm with Bidirectional Inspiration for Time Series Forecasting
An Exploration of Non-Euclidean Gradient Descent: Muon and its Many Variants
Non-Euclidean Gradient Descent Operates at the Edge of Stability
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
DropoutTS: Sample-Adaptive Dropout for Robust Time Series Forecasting
Stability-Aware Feature Design for Robust Watermark Detection in Machine-Generated Text
LUGS: Latent-aware Guidance for Efficient Unmasking in Diffusion Large Language Models
Segment-driven Structural Induction and Semantic Alignment for Heterogeneous Tabular Representation
DARTS: Distribution-Aware Active Rollout Trajectory Shaping for Accelerating LLM Reinforcement Learning
PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous Space
Dichotomy of Feature Learning and Unlearning: Fast-Slow Analysis on Neural Networks with Stochastic Gradient Descent
Let Language Constrain Geometry: Vision–Language Models as Semantic and Spatial Critics for 3D Generation
It's TIME: Towards the Next Generation of Time Series Forecasting Benchmarks
$\texttt{FlashSchNet}$: Fast and Accurate Coarse-Grained Neural Network Molecular Dynamics
Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation
TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward
MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games
SafeSeek: Universal Attribution of Safety Circuits in Language Models
Extracting alignment data in open models
When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems
Interpretability and Generalization Bounds for Learning Spatial Physics
Structured Multi-modal Graph Disentanglement for Psychiatric Diagnosis
Rotary Position Encodings for Graphs
PDFBench: A Benchmark for De Novo Protein Design from Function
Multivariate distributional reinforcement learning using sliced divergences
Optimal Self-Consistency for Efficient Reasoning with Large Language Models
LAST: Bridging Vision-Language and Action Manifolds via Gromov-Wasserstein Alignment
CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict
General Covariant Action Modeling: Constructing Generalized Manifolds via Spatio-Temporal Decoupling
Learning General Causal Structures with Hidden Dynamic Process for Climate Analysis
SpecExit: Accelerating Large Reasoning Model via Speculative Exit
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
Uncovering Hidden Triggers: Backdoor Attribution in Language Models
Forget to Know, Remember to Use: Context-Aware Unlearning for Large Language Models
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
Towards Complete Multi-Agent Coordination Policy Learning via Denoising Maximum Entropy Optimization
Real Data Lies: Unveiling and Closing the Quality Shortcut in Generalizable AI-Generated Video Detection
From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning
Beyond the Proxy: Trajectory-Distilled Guidance for Offline GFlowNet Training
ECG-R1: Protocol-Guided and Modality-Agnostic MLLM for Reliable ECG Interpretation
Boosting Video Diffusion Models via Masked Autoencoders as Tokenizers
Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification
MESA: Improving MoE Safety Alignment via Decentralized Expertise
One-shot Entropy Minimization for Language Model Reasoning
HOI-PAGE: Zero-Shot Human-Object Interaction Generation with Part Affordance Guidance
Revisiting Robustness for LLM Safety Alignment via Selective Geometry Control
IDLM: Inverse-distilled Diffusion Language Models
Variational Learning of Disentangled Representations
Building Reliable Long-Form Generation via Hallucination Rejection Sampling
E²I-VRWKV: Explicit EPI-Representation and Interaction-Aware Vision-RWKV for Light Field Semantic Segmentation
Mining Tensor/Neuron-Level Sparsity to Maximize Mixture-of-Experts Potential in Post-Training and Inference
Position: Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services
AgentTailor: A Semantic-Aware LLM-Based Multi-Agent System with Actor-Critic Structure
See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model
Sparse Bayesian Deep Functional Learning with Structured Region Selection
Revisiting Coding-Based Approaches to Overcome the Curse of Dimensionality in Learning-Based Watermarking
Elign: Equivariant Diffusion Model Alignment from Foundational Machine Learned Force Fields
SKETCH: Semantic Key-Point Conditioning for Long-Horizon Vessel Trajectory Prediction
Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation
From Basis to Basis: Gaussian Particle Representation for Interpretable PDE Operators
Understanding Performance Collapse in Layer-Pruned Large Language Models via Decision Representation Transitions
EvoEGF-Mol: Evolving Exponential Geodesic Flow for Structure-based Drug Design
CLINIC : Evaluating Multilingual Trustworthiness in Language Models for Healthcare
Mean-Shift PCA by Knockoff Mean
Two-Layer Linear Auto-Regressive Models Estimate Latent States
REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge
S-Quant: Rethinking Weight Quantization with Seed-Based Generation
Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence
Unveiling the Visual Counting Bottleneck in Vision-Language Models
Anti-Aliasing Matters: A Dynamic Network for Time Series Forecasting
Gromov-Wasserstein at Scale, Beyond Squared Norms
Exploiting weight-space symmetries for approximating curvature
Fast KV Compaction via Attention Matching
A Graphop Analysis of Graph Neural Networks on Sparse Graphs: Generalization and Universal Approximation
From Pairwise Affinities to Functional Correspondences: Rethinking Attention
BIT-LLM: Brain Instruction Tuned LLM with persistent Cross-Attention for fMRI-to-Text Decoding
Exploring 3D Dataset Pruning
FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models
Position: State-of-the-Art Claims Require State-of-the-Art Evidence
RAT+: Train Dense, Infer Sparse - Recurrence Augmented Attention for Dilated Inference
Taming the Loss Landscape of PINNs with Noisy Feynman–Kac Supervision: Operator Preconditioning and Non-Asymptotic Error Bounds
CARE: Class-Adaptive Expert Consensus for Reliable Learning with Long-Tailed Noisy Labels
High-Fidelity ANN-to-SNN Conversion via Closed-Loop CKA Distillation
DynaMem: Consistent Long Video Generation via Hierarchical Memory and Motion Priors
Faults in Our Formal Benchmarking: Dataset Defects and Evaluation Failures in Lean Theorem Proving
Not All Prefills Are Equal: PPD Disaggregation for Multi-turn LLM Serving
GFFMERGE: Efficient Merging of Graph Neural Force Fields and Beyond
MutAtlas: A PDB-Wide Energy-Guided Atlas of Protein Mutation Effects
Towards Multimodal Large Language Models with Both Training and Inference Efficiency
Incremental BPE Tokenization
Information Geometry Loss for Time Series Forecasting
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense
EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
ConServe: Fine-Grained GPU Harvesting for LLM Online and Offline Co-Serving
Large Language Model Agents Are Not Always Faithful Self-Evolvers
Veda: Scalable Video Diffusion via Distilled Sparse Attention
Mem-T: Densifying Rewards for Long-Horizon Memory Agents
CHB: A Diagnostic Toolkit for Hardness-Aware Clustering Evaluation
On the Plasticity and Stability for Post-Training Large Language Models
Calibrated Preference Learning: The Case of Label Ranking
When Do Graph Foundation Models Transfer? A Data-Centric Theory
Panini: Continual Learning in Token Space via Structured Memory
EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions
Language Model Augmented Semi-Supervised Statistical Inference
CAReDiO: Enhancing Cultural Alignment of LLM via Representativeness and Distinctiveness Guided Data Optimization
VENOMREC: Cross-Modal Interactive Poisoning for Targeted Promotion in Multimodal LLM Recommender Systems
Brep2Shape: Boundary and Shape Representation Alignment via Self-supervised Transformers
Transolver-3: Scaling Up Transformer Solvers to Industrial-Scale Geometries
Uncovering Competency Gaps in Large Language Models and Their Benchmarks
A Unified Sparse Attention via Multi-Granularity Compression
Speculative Coupled Decoding for Training-Free Lossless Acceleration of Autoregressive Visual Generation
When Does Adaptation Win? Scaling Laws for Meta-Learning in Quantum Control
Learning Sparse Visual Representations via Spatial-Semantic Factorization
Maximin Relative Improvement: Fair Learning as a Bargaining Problem
Hierarchical Successor Representation for Robust Transfer
Logical Guidance for the Exact Composition of Diffusion Models
Sharp empirical Bernstein inequalities for the variance of bounded random variables
Source-Free Open-World RF Fingerprint Identification
Distributionally Robust Markov Games with Average Reward
Model-Free Robust Average-Reward Reinforcement Learning with Sample Complexity Analysis
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Robust Linear Dueling Bandits with Post-serving Context under Unknown Delays and Adversarial Corruptions
FedVeer: Self-Adaptive Skew Estimation for Robust Federated Learning
UniCode: Augmenting Evaluation for Code Reasoning
KernelBand: Steering LLM-based Kernel Optimization via Hardware-Aware Multi-Armed Bandits
RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models
Controlled Collaboration Geometry for Personalized Federated Learning
dTRPO : Trajectory Reduction in Policy Optimization of Diffusion Large Language Models
On Stable Long-Form Generation: Benchmarking and Mitigating Length Volatility
Equalized Generative Treatment: Matching f-divergences for Fairness in Generative Models
SpecPL: Disentangling Spectral Granularity for Prompt Learning
Relative Entropy Estimation in Function Space: Theory and Applications to Trajectory Inference
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
Bayesian model selection and misspecification testing in imaging inverse problems only from noisy and partial measurements
BESplit: Bias-Compensated Split Federated Learning with Evidential Aggregation
Generalizable and Composable Multi-Model Embedding Translation
PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion
BioFormer: Rethinking Cross-Subject Generalization via Spectral Structural Alignment in Biomedical Time-Series
A Random Matrix Theory of Masked Self-Supervised Learning
HeraSys: Collaborative Serving of Multiple LLM Workflows via Fine-Grained End-to-End Optimization
Dimensionality Reduction with Point-distributions Similarity Invariant
From Holo Pockets to Electron Density: GPT-style Drug Design with Density
SHAP-Guided Kernel Actor-Critic for Explainable Reinforcement Learning
Dynamics Are Learned, Not Told: Semi-Supervised Discovery of Latent Dynamics Geometries For Zero-Shot Policy Adaptation
Multi-Objective Bayesian Optimization via Adaptive $\varepsilon$-Constraint Decomposition
Exposing Hidden Biases in Text-to-Image Models via Automated Prompt Search
REVIS: Sparse Latent Steering to Mitigate Object Hallucination in Large Vision-Language Models
From Out-of-Distribution Detection to Hallucination Detection: A Geometric View
RoCA: Robust Cross-Domain End-to-End Autonomous Driving
AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding
Beyond Pixel Context Windows: Neural World Simulators with Persistent 3D State
dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching
WFR-MFM: One-Step Inference for Dynamic Unbalanced OT
Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions
Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
Spurious Rewards: Rethinking Training Signals in RLVR
Forensic Prompting with Dual-Action Policy Optimization for Vision-Language Forgery Detection and Localization
cMoLLM at Scale: Horizontal Scaling Laws for Convolutionally-Gated Mixture-of-LLMs
Is the Last Layer Sufficient for Uncertainty Quantification?
FedPDG: Prediction Discrepancy–Guided Data Generation for Heterogeneous Federated Learning
Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts
Optimal Splitting of Language Models from Mixtures to Specialized Domains
SimpleGPT: Improving GPT via A Simple Normalization Strategy
Delving into Muon and Beyond: Deep Analysis and Extensions
Skipping the Zeros in Diffusion Models for Sparse Data Generation
Opt-Verifier: Unleashing the Power of LLMs for Optimization Modeling via Dual-Side Verification
Opt-Miner: Empowering Information-Seeking Agent with Tree-Guided Data Synthesis for Optimization Modeling
Learning High-Frequency Continuous Action Chunks in Latent Space
Cross-Modal Knowledge Distillation without Paired Data: Theoretical Foundations and Algorithms
No More, No Less: Least-Privilege Language Models
B-Spar: Bayesian Sparse-Reward Modeling for RL-based Image Editing
Shifting the Breaking Point of Flow Matching for Multi-Instance Editing
TG-RAG: A Retrieval-Augmented Framework for Reasoning Guidance in Specialized Domains
Theoretical Challenges in Learning for Branch-and-Cut
InteractScience: Programmatic and Visually-Grounded Evaluation of Interactive Scientific Demonstration Code Generation
Thinking in Structures: Evaluating Spatial Intelligence through Reasoning on Constrained Manifolds
Enhancing Protein-Protein Interaction Prediction with Hierarchical Motif-based Multimodal Protein Embedding
One Tool Is Enough: Reinforcement Learning of LLM Agents for Repository-Level Code Navigation
Is Vibe Coding Safe? Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks
Discovering Scaling Exponents with Physics-Informed Müntz-Szász Networks
Sem-Detect: Semantic Level Detection of AI Generated Peer-Reviews
Muon in Associative Memory Learning: Training Dynamics and Scaling Laws
Score-Repellent Monte Carlo: Toward Efficient Non-Markovian Sampler with Constant Memory in General State Spaces
Provable Training Data Identification for Large Language Models
From Static Constraints to Dynamic Adaptation: Sample-Level Constraint Release for Offline-to-Online Reinforcement Learning
Neural Control: Adjoint Learning Through Equilibrium Constraints
How does Bayesian Sampling help Membership Inference Attacks?
CofactGVR: Counterfactual Intervention for Grounded Visual Reasoning
HIAL: Towards Semantics-Aware Hypergraph Active Learning via Dual-Perspective Information Maximization
Revisiting the Volume Hypothesis
T-POP: Test-Time Personalization with Online Preference Feedback
CountsDiff: A diffusion model on the natural numbers for generation and imputation of count-based data
Deep Coupling Learning for Solving PDEs
TAGRPO: Boosting GRPO on Image-to-Video Generation with Direct Trajectory Alignment
StyleDistillation: A New Insight of Image Style Enables Personalized Aesthetic Manipulation
MotionGRPO: Overcoming Low Intra-Group Diversity in GRPO-Based Egocentric Motion Recovery
Position: The Systemic Lack of Agency in Visual Reasoning
Decomposition-Based Modular Conformal Prediction for Two-Stage Modeling
Closing the Loop: Universal Repository Representation with RPG-Encoder
Noisy Pairwise-Comparison Random Search for Smooth Nonconvex Optimization
Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation
Any2Any: Unified Arbitrary Modality Translation for Remote Sensing
VisionPulse: Dynamic Visual Sparsity for Efficient Multimodal Reasoning
Restoring Initial Noise Sensitivity in Text-to-Image Distillation through Geometric Alignment
Graph Alignment for Benchmarking Graph Neural Networks and Learning Positional Encodings
Rule2DRC: Benchmarking LLM Agents for DRC Script Synthesis with Execution-Guided Test Generation
The Ideal Expression Is Not a Local Optimum: A Revisit of EQL with Zero-Point Constraints
SALSA-V: Shortcut-Augmented Long-form Synchronized Audio from Videos
ECHO: Entropy-Confidence Hybrid Optimization for Test-Time Reinforcement Learning
Is Training Necessary for Anomaly Detection?
From Diagrams to Code: Multilingual Programming with Visual Design
AmbiRefer3D: 3D Visual Grounding with Referential Ambiguity
MixReasoning: Switching Modes to Think
SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis
A New Framework for Cybersecurity Refusals in AI Agents
Flex-Forcing: Towards a Unified Autoregressive and Bidirectional Video Diffusion Model
VERA-V: Variational Inference Framework for Jailbreaking Vision-Language Models
SPEED: Sharpened-Teacher Distillation for Parallel Decoding of Diffusion Language Models
Unifying Heterogeneous Degradations: Uncertainty-Aware Diffusion Bridge Model for All-in-One Image Restoration
Gradient Inversion Attacks Beyond SGD
Fast and Scalable Analytical Diffusion
CLASP: Online learning algorithms for Convex Losses And Squared Penalties
Bias in Zeroth-Order Normal Estimation for Decision-Based Attacks
Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis
Stronger Semantic Encoders Can Harm Relighting Performance: A Probe of Visual Priors via Augmented Latent Intrinsics
Residual Context Diffusion Language Models
Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments
Beyond Generative Priors: Minority Sampling with JEPA-Guided Diffusion
Dynamics Within Latent Chain-of-Thought: An Empirical Study of Causal Structure
Frequentist Consistency of Prior-Data Fitted Networks for Causal Estimation
PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs
Context-free Recognition with Transformers
DiffCrossGait: Trajectory-Level Alignment for 2D-3D Cross-Modal Gait Recognition via Latent Diffusion
DeepBlip: Estimating Conditional Average Treatment Effects Over Time
Forgetting Whenever You Want: A Decentralized Continual Learning Framework with On-Demand Unlearning
TwinQuant: Learnable Subspace Decomposition for 4-Bit LLM Quantization
Continual Learning of Domain-Invariant Representations
Complexity of Decentralized Optimization with Mixed Affine Constraints
DELTA4: Sparse Matrix-Vector Multiplication for Low Sparsity
A General Neural Backbone for Mixed-Integer Linear Optimization via Dual Attention
Revealing Long-context Potential of Attention Heads via Frequency Kernels
BlueCodeAgent: A Blue Teaming Agent Powered by Automated Red Teaming for CodeGen AI
Nonparametric LLM Evaluation from Preference Data
From Feasible to Practical: Pareto-Optimal Synthesis Planning
Beyond Sample-Level Forgetting: Improving Reliability in Multimodal Unlearning
OnePO: Direct One-stage Policy Optimization for SFT-free Domain Adaptation
*Rank-Learner*: Orthogonal Ranking of Treatment Effects
Self-Supervised Foundation Model for Calcium-imaging Population Dynamics
Regime-Adaptive Bayesian Optimization via Dirichlet Process Mixtures of Gaussian Processes
QEDBench: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs
Shortcut-Resistant CAM Distillation for Long-Tailed Recognition
Code2Video: A Code-centric Paradigm for Educational Video Creation
ProbeLLM: Automating Principled Diagnosis of LLM Failures
Obliviate: Efficient Unlearning in Recommender Systems
Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching
Foundation VAE for CT Reconstruction, Augmentation, and Generation
The Implicit Bias of Adam and Muon on Smooth Homogeneous Neural Networks
Cross-Subject Modeling for Widefield Calcium Imaging via Atlas-Aligned Spatiotemporal Tokenization
“Do Diffusion Models Dream of Electric Planes?” Discrete and Continuous Simulation-Based Inference for Aircraft Design
MER-DG: Modality-Entropy Regularization for Multimodal Domain Generalization
A robust PPG foundation model using multimodal physiological supervision
Beyond Unidirectional Bias: Reciprocal Perspective Calibration in Scene Graph Generation
Learning-Augmented Scalable Linear Assignment Problem Optimization via Neural Dual Warm-Starts
SOLAR for Offline MARL: Plateau-Triggered Potential Shaping under World-Model Uncertainty
Emergent Biological Realism in RL-Trained DNA Language Models
CausalProfiler: Generating Synthetic Benchmarks for Rigorous and Transparent Evaluation of Causal Machine Learning
Learning Tight Rejection Boundaries without Negatives for Strict One-Class Audio Deepfake Detection
Learning Reward Functions from Multiple Feedback Types with Amortized Variational Inference
EntroKV: Entropy-Guided Dynamic Budget Allocation for KV-Cache Compression
QTALE: Quantization-Robust Token-Adaptive Layer Execution for LLMs
Equivalence of Context and Parameter Updates in Modern Transformer Blocks
TurboGS: Accelerating 3D Gaussian Splatting via Error-Guided Sparse Pixel Sampling and Optimization
Courtroom Analogy: New Perspective on Uncertainty-Aware Classification
Cache Coherent Resampling for Efficient Test Time Scaling in LLM Reasoning via Adaptive Sequential Monte Carlo
Scalable Bayesian Semi-supervised Clustering with Feature Selection and Adaptive Constraint Weighting
Budget-Constrained Step-Leve Diffusion Caching
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
ProtoKV: Streaming Video Understanding under Delayed Evidence with Summary-State Memory
Neural Attention Search Linear: Towards Adaptive Token-Level Hybrid Attention Models
TokenDrop: Token-Level Importance-Aware Backward Propagation Skipping for Efficient LLM Fine-Tuning
RaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM Inference
CoarseBind: Fast and Accurate Binding Affinity Prediction through Coarse Structural Representations
Physics-informed Neural Operator Learning for Nonlinear Grad-Shafranov Equation
Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces
RePack then Refine: Efficient Diffusion Transformers with Vision Foundation Models
STORM: Segment, Track, and Object Re-Localization from a Single Image
Mitigating Reward Hacking in LLM-based Recommendation: A Preference Optimization Approach
Beyond Magnitude: Scale-Invariant Evidential Fusion for Multi-View Classification
Alignment-Sensitive Minimax Rates for Spectral Algorithms with Learned Kernels
Evidential Copula Concept Embedding Models
$\texttt{ShaplEIG}$: Bayesian Experimental Design for Shapley Value Estimation
Structured 4D Latent World Model for Robot Planning
Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
USE : A Unified Self-Ensembling Framework for Test-Time Prompt Tuning
Evolution Strategies at the Hyperscale
SEER: Transformer-based Robust Time Series Forecasting via Automated Patch Enhancement and Replacement
OCNR: Stabilizing Self-Play by Mitigating Iteration-Collapse With One-Class Novelty Rewards
DAG: A Dual Correlation Network for Time Series Forecasting with Exogenous Variables
TeamWork: Multivariate Time Series Anomaly Detection via Asymmetric Role-aware Channel Modeling
MIND: Decoupling Model-Induced Label Noise via Latent Manifold Disentanglement
MultiLoReFT: Decoupling Shared and Modality-Specific Subspaces in Multimodal Learning via Low-Rank Representation Fine-Tuning
S$^2$MAM: Semi-supervised Meta Additive Model for Robust Estimation and Variable Selection
SMM Transformer: Leveraging Spiking Neural Networks for Multimodal Tasks
From Evaluation to Design: Using Potential Energy Surface Smoothness Metrics to Guide ML Interatomic Potential Architectures
Reranker Helps, but Not Enough: Towards Strong Poisoning Attacks Against Retrieval-Augmented Generation
On the Identifiability of Poisson Branching Structural Causal Model Under Latent Confounding
Learning in Structured Stackelberg Games
Symbiosis-Inspired Knowledge Distillation for Incremental Object Detection
Boost the Identity-Preserving Embedding for Consistent Visual Generation
MolAlign3D: Enhancing Fixed-Dimensional E(3)-Equivariant Latent Space for High-Fidelity 3D Molecular Reconstruction and Editing
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
Mitigating the Safety–Utility Trade-off in LLM Alignment via Adaptive Safe Context Learning
SynerMedGen: Synergizing Medical Multimodal Understanding with Generation via Task Alignment
StableVLA: Towards Robust Vision-Language-Action Models without Extra Data
Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents
MADA-Attack: Transferable Multi-modal Attention Distraction Adversarial Attack against Vision Language Models
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
Quantifying Temperature Scaling in Discrete Sequence (Language) Models
Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases
DuetServe: Harmonizing Prefill and Decode for LLM Serving via Adaptive GPU Multiplexing
PerceptOS: Semantic-Aware Kernel Optimization for OS-Intensive Workloads via Hardware-Software Alignment
Black-Box Detection of LLM-Generated Text Using Generalized Jensen Shannon Divergence
TiME: Test-Time Mixture-of-Experts Routing via Asymmetric CO-Optimal Transport for Continual Test-Time Adaptation
DiffuReason: Enhancing Reasoning Ability for Diffusion Language Models via Monte Carlo Tree Search
The Geometry of Representational Failures in Vision Language Models
IDRBench: Understanding the Capability of Large Language Models on Interdisciplinary Research
DSENet: A Novel Dual-Stream Enhancement Network for Multi-Scale Non-Stationary Time Series Forecasting
TetraJet-v2: Accurate NVFP4 Training for Large Language Models with Oscillation Suppression and Outlier Control
How Does the Pretraining Distribution Shape In-Context Learning? A Fundamental Trade-Off
Klein Hyperbolic Metric Learning
Learning Fingerprints for Medical Time Series with Redundancy-Constrained Information Maximization
Position: AI Usage Policies Should Be Aligned with International Human Rights Law
AICrypto: Evaluating Cryptography Capabilities of Large Language Models
Orthogonal Model Merging
POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation
Escaping the Verifier: Learning to Reason via Demonstrations
XYZFlow: Scaling Multidimensional Shortcut Flows for Efficient Generative Modeling
Blocking the Leakage: Manifold-Aware Gradient Projection for Long-Horizon Test-Time Adaptation
Seeing to Generalize: How Visual Data Corrects Binding Shortcuts
Multi-Task Bayesian In-Context Learning
On the Power of Statistics in Class-Incremental Learning with Pretrained Models
Predicting evolutionary rate as a pretraining task improves genome language model representations
SPAR: Support-Preserving Action Rectification
Evaluating LLMs When They Do Not Know the Answer: Statistical Evaluation of Mathematical Reasoning via Comparative Signals
Online Learning and Inference for Cox Proportional Hazards Model Using Renewable Sieve Estimation
Clipping Makes Distributed and Federated Asynchronous SGD Robust to Stragglers
GeoAlign: Geometric Rollout Curation for Robust LLM Reinforcement Learning
Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models
Coupled Cluster con MoLe: Molecular Orbital Learning for Neural Wavefunctions
Neural Dispersion on Graphs
FedSSM: State Space Model-based Proactive Inference for Heterogeneous Multimodal Federated Learning
Smoothing Slot Attention Iterations and Recurrences
Neural Quantum States in Mixed Precision
CAMEL: Confidence-Gated Reflection for Reward Modeling
Beyond Procedure: Substantive Fairness in Conformal Prediction
Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox
EVOLVING ROLLOUTS: Harnessing Historical Experience for Web Agent Evolution in Reinforcement Learning
Unlocking Cross-Modal Biosignal Synthesis: A Temporally-Aware VAE-Diffusion Model
CaliDist: Calibrating Large Language Models via Behavioral Robustness to Distraction
Fast and Expressive Multi-Byte Prediction with Probabilistic Circuits
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
Evolving Interpretable Constitutions for Multi-Agent Coordination
Text Has Curvature
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning
Geodesic Flow Matching for Denoising High-Dimensional Structured Representations
Feedback Control for Multi-Objective Graph Self-Supervision
Improving Neural Topic Modeling with Semantically-Grounded Soft Label Distributions
Batch Normalization for Neural Networks on Complex Domains
Semantic Impact–Driven Visual Scheduling in Vision-Language Models
DAG-MoE: From Simple Mixture to Structural Aggregation in Mixture-of-Experts
Do LLMs “Feel”? Emotion Circuits Discovery and Control
TimeGuard: Channel-wise Pool Training for Backdoor Defense in Time Series Forecasting
Learning to Share: Selective Memory for Efficient Parallel Agentic Systems
Disentangling Geometry, Performance, and Training in Language Models
MedSIGHT: Towards Grounded Visual Comprehension in Medical Large Vision-Language Models
Geometric Control of Out-of-Distribution Shift in Safe Offline RL
Demystifying Multimodal Biomolecular Co-design With Intrinsic Geodesic Coupling
MME-Reasoning: A Broad-Spectrum Benchmark for Evaluating Logical Reasoning in MLLMs
LLM Self-Recognition: Steering and Retrieving Activation Signatures
Segment-Aligned Policy Optimization for Multi-Modal Reasoning
DR$^2$Seg: Decomposed Two-Stage Rollouts for Efficient Reasoning Segmentation in Multimodal Large Language Models
SCOPE: Evolving Symbolic World for Planning in Open-Ended Environments
Efficient Hallucination Detection for LLMs Using Uncertainty-Aware Attention Heads
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training
Beyond Point-wise Neural Collapse: A Topology-Aware Hierarchical Classifier for Class-Incremental Learning
ObjEmbed: Towards Universal Multimodal Object Embeddings
AnyBand-Diff: A Unified Remote Sensing Image Generation and Band Repair Framework with Spectral Priors
CUPID in the Model Zoo: Online Matchmaking for Selecting Your Dream LLM
Exposing Vulnerabilities in Explanation for Time Series Classifiers via Dual-Target Attacks
OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization
Why Do We Need Warm-up? A Theoretical Perspective
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios
Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality
When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop
What Makes Value Learning Efficient in Residual Reinforcement Learning?
TSMGen: Target-Specific Molecule Generation via Higher-Order Structural Dependencies and Context-Aware Bidirectional Fusion
Edit-Based Refinement for Parallel Masked Diffusion Language Models
ReAugment: Targeted Few-Shot Time Series Augmentation via Model Zoo-Guided Reinforcement Learning
FedSDR: Federated Self-Distillation with Rectification
Learning Situated Awareness in the Real World
RAST-MoE-RL: A Regime-Aware Spatio-Temporal MoE Framework for Deep Reinforcement Learning in Ride-Hailing
FLARE-AI: Flaw Reporting for AI
PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design
Harmful Overfitting in Sobolev Spaces
Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
T-measure: A Topology-Consistent Metric for Binary Segmentation
A Unified Framework for Deep Hypergraph Clustering Beyond Homophily
DOCKSMITH: Scaling Reliable Coding Environments via an Agentic Docker Builder
Expected Returns and Policy Inconsistency-Aware Offline Federated Deep Reinforcement Learning
Reasoning Compartmentalization: Bridging the Concretization Gap via Abstraction-based Routing
SG2Loc: Sequential Visual Localization on 3D Scene Graphs
Learning-To-Measure: In-Context Active Feature Acquisition
Likelihood Matching for Diffusion Models
Position: Anthropomorphic Misalignment Research Needs Stronger Evidence
Position: Stop evaluating AI with human tests, develop principled, AI-specific tests instead
Structure-aware Granular-Ball based Information Bottleneck for Multi-modal Clustering
Shape of Thought: Progressive Object Assembly via Visual Chain-of-Thought
Robust Filter Attention: Self-Attention as a Parallel State Estimator
Class-Conditional Distribution Balancing for Group Robust Classification
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
MedCoG: Maximizing LLM Inference Density in Medical Reasoning via Meta-Cognitive Regulation
Discrete Diffusion Samplers and Bridges: Off-Policy Algorithms and Applications in Latent Spaces
ConPress: Learning Efficient Reasoning from Multi-Question Contextual Pressure
Interpretable Self-Supervised Learning via Representer Landmarks and Nyström Approximation
LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding
Trajectory Stitching for Solving Inverse Problems with Flow-Based Models
Principled RL for Flow Matching Emerges From the Chunk-level Policy Optimization
Efficient Synthetic Network Generation via Latent Embedding Reconstruction
Attention's forward pass and Frank-Wolfe
Conditionally Site-Independent Neural Evolution of Antibody Sequences
Supervised Guidance Training for Infinite-Dimensional Diffusion Models
Solving Inverse Problems with Flow-based Models via Model Predictive Control
From LLM-Generated Conjectures to Lean Formalizations: Automated Polynomial Inequality Proving via Sum-of-Squares Certificates
The Two-Hump Problem: Bridging the Difficulty Gap in Mathematical Reinforcement Learning
Factored Gossip DiLoCo: Reducing Blocking Communication within DiLoCo
Learning the Minimum Action Distance
Hierarchical Reinforcement Learning for Sparse-Reward Search in Commutative Algebra
Positional Encoding for Spiking Transformers
Rethinking Human Intent to CAD: Parametric CAD Model Generation via Cooperative Multi-Task Alignment and Spatial-Aware Reinforcement Learning
Towards Understanding Modality Interaction in Multimodal Language Models via Partial Information Decomposition
AdaS: Adaptive Gradient Descent for Spiking Transformers
Adaptive Reinforcement Learning for Unobservable Random Delays
Efficient RL Training for LLMs with Experience Replay
Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning
Reviving Error Correction in Modern Deep Time-Series Forecasting
SmoothSpike: Spiking Transformer with Learnable Hadamard Transformation
Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering
Stability Analysis of Sharpness-Aware Minimization
ParamMem: Augmenting Language Agents with Parametric Reflective Memory
Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding
SpikingLM: Towards Fully Spiking Language Model
Position: AI Lock-In Is in Progress, and We Must Be Prepared
Cold-Start Personalization via Training-Free Priors from Structured World Models
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
TimeSAE: Sparse Decoding for Faithful Explanations of Black-Box Time Series Models
Dissecting Multimodal In-Context Learning: Modality Asymmetries and Circuit Dynamics in modern Transformers
The Differences Between Direct Alignment Algorithms are a Blur
When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation
The Latent Color Subspace: Emergent Order in High-Dimensional Chaos
PLASH: Provably Linear-Time Attention with Selective Higher-Order Feature Sketching
ProOPF: Benchmarking and Improving LLMs for Professional-Grade Power Systems Optimization Modeling
SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport
Mantis: Lightweight Foundation Model for Time Series Classification
Clustering as Reasoning: A $k$-Means Interpretation of Chain-of-Thought Graph Learning
CodeChemist: Test-Time Scaling for Low-Resource Code Generation via Functional Knowledge Transfer
Approximate Nearest Neighbor Search for Modern AI: A Projection-Augmented Graph Approach
Navigating the Pareto Frontier of Alignment:Spectrum-Adaptive Fine-Tuning for LLMs
VELR: Efficient Video Reward Feedback via Ensemble Latent Reward Models
AlgoVeri: An Aligned Benchmark for Verified Code Generation on Classical Algorithms
Global Policy-Space Response Oracles for Two-Player Zero-Sum Games
Revisiting Zeroth-Order Hessian Approximation: A Single-Step Policy Optimization Lens
Improved Bounds for Reward-Agnostic and Reward-Free Exploration
Position: Universal Aesthetic Alignment Narrows Artistic Expression
Poison with Style: A Practical Poisoning Attack on Code Large Language Models
DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs
Gradient Transformer: Learning to Generate Updates for LLMs
Factored Latent Action World Models
PRPO: Paragraph-level Policy Optimization for Vision-Language Deepfake Detection
Differential syntactic and semantic encoding in LLMs
Automated Formal Proofs of Combinatorial Identities via Wilf–Zeilberger Guidance and LLMs
Near-Optimal Private Linear Regression via Iterative Hessian Mixing
DecodeShare: Tracing the Shared Pathways of LLM Decode-Time Decisions
BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate
KineFlow: Kinematic Second-Order Flow Matching for Time-Series Forecasting
DroneDINO: Towards Heterogeneous Routed Mixture of Experts for Drone-based Unified Object Detection
Certifying Capabilities from Finite Tests: When Is It Possible?
Demystifying Action Space Design for Robotic Manipulation Policies
Seeing is Solving: Unlocking Efficient Multimodal RL via View Alignment
SE(3)-Equivariant Flow Matching with Gaussian Process Priors for Geometric Trajectory Prediction
Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers
The Fairness Hierarchy: A viewpoint from causal inference
SWE-ABS: Adversarial Benchmark Strengthening Exposes Inflated Success Rates on Test-based Benchmark
Position: Code Benchmarks Should Prioritize Rigor, Reliability, and Reproducibility
REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation
Towards Hierarchy–Uniformity Equilibrium: Recovering Semantic Depth in Hypergraph Contrastive Learning
Beyond Attention Imbalance: Mitigating Hallucinations via Spectral Surgery
Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers
Privacy-Aware Video Anomaly Detection: Guided Orthogonal Projection and a Comprehensive Evaluation Framework
Global Convergence of Adaptive Sensing for Principal Eigenvector Estimation
UAV$^2$: A Unified and Adaptive Scheduling Framework for UAV Autopilot System with Reinforcement Learning
Learning Graph Foundation Models on Riemannian Graph-of-Graphs
Position: Agentic AI systems should be making Bayes-consistent decisions
Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information
Step-Level Sparse Autoencoder for Reasoning Process Interpretation
Federated Graph Learning via Structure-Aware Fusion Using a Kalman Framework with Learnable Dynamics
Hybrid-Gym: Training Coding Agents to Generalize Across Tasks
Learning Unanimously Acceptable Lotteries via Queries
VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos
CORRECT: COndensed eRror RECognition via knowledge Transfer in multi-agent systems
Hard Labels In! Rethinking the Role of Hard Labels in Mitigating Local Semantic Drift
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation
Proximal-Based Generative Modeling for Bayesian Inverse Problems
Characterizing, Evaluating, and Optimizing Complex Reasoning
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
AffIn-Space: Learning Affine-Invariant Representations for 3D Spatial Understanding with MLLMs
From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG
S2GS: Streaming Semantic Gaussian Splatting for Online Scene Understanding and Reconstruction
How Good is Post-Hoc Watermarking With Language Model Rephrasing?
DecepChain: Inducing Deceptive Reasoning in Large Language Models
Riemannian Networks over Full-Rank Correlation Matrices
Diffusion differentiable resampling
Preference-Modulated Structural Attention for Multi-Objective Combinatorial Optimization
ACO-MoE-LoRA: Evolving-while-Training for Adapting Segment Anything Model 2 to Specialized Domains
Scheduling Thoughts: Learning the Order of Thought in Diffusion Language Models
Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation
VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics
From Memorization to Parameter Interference: How Overtraining Experts Harms Model Merging
Tuning the Implicit Regularizer of Masked Diffusion Language Models: Enhancing Generalization via Insights from $k$-Parity
Delegation and Verification under AI
How to Correctly Report LLM-as-a-Judge Evaluations
RADAR: Defending RAG Dynamically against Retrieval Corruption
Learning-to-Optimize via Deep Unfolded Flows
Evolutionary Multi-View Classification with Label Noise via Gradient and Feature Dual-Perception
A Linearly Convergent Proximal Subgradient Algorithm for Sparse Portfolio Optimization with Transaction Cost
Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performance
Position: Want Better ML Reviews? Stop Asking Nicely and Start Incentivizing with a Credit System
FAFO: Lossy KV Cache Compression for Lossless Inference Acceleration via Draftless Fumble Decoding
ST-Veto: Spatio-Temporal Token Veto for Diffusion MLLMs via Taylor Prediction and Visual Grounding
A Dirac-Frenkel-Onsager principle: Instantaneous residual minimization with gauge momentum for nonlinear parametrizations of PDE solutions
Shapley Neuron Values for Continual Learning: Which Neurons Matter Most?
Deep Neural Network Regression with Functional Covariates
Memory-Efficient LLMs Training with Dynamic Sparsity: From Stability to Practical Scaling
video-SALMONN S: Memory-Enhanced Streaming Audio-Visual LLM
Rethinking Code Complexity Through the Lens of Large Language Models
Context-Aware Reaonser : Enhancing Contextual Reasoning in Multimodal Large Language Models
PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts
ParalESN: Enabling parallel information processing in Reservoir Computing
When Data Is Scarce: Scaling Sparse Language Models with Repeated Training
Universality, Function Composition, and Algorithm Emulation All In-Context
Schema-Guided World Modeling for Understanding Hierarchical Visual Dynamics
CATArena: Evaluating Evolutionary Capabilities of Code Agents via Iterative Tournaments
DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA
Diffusion Models Preferentially Memorize Prototypical Examples or: Why Does My Diffusion Model Love Slop?
MechVQA: Benchmarking and Enhancing Multimodal LLMs on Comprehensive Mechanical Drawing Understanding
MulFCoder: Framework-conditioned Multi-agent for MLLM-based Multi-framework Front-end Code Generation
Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection
Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies
Generative Visual Code Mobile World Models
ForensicConcept:Transferable Forensic Concepts for AIGI Detection
AVI-Bench: Toward Human-like Audio-Visual Intelligence of Omni-MLLMs
Random Selection Reveals Implicit Knowledge Consensus in Code Generation
Bend the Basics: Degradation-Aware Deformable Tokenization for All-in-One Image Restoration
Can Muon Fine-tune Adam-Pretrained Models?
FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment
FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model
Aggregate Models, Not Explanations: Improving Feature Importance Estimation
Tvcache: A Tool-Value Cache for Post-Training LLM Agents
Beyond Text-to-SQL: Can LLMs Really Debug Enterprise ETL SQL?
MoSE: Mixture of Slimmable Experts for Efficient and Adaptive Language Models
A Flat Vocabulary or a Rich Hierarchy? Re-introducing Intrinsic Structure Transforms the Autoregressive Image Generation
On the "Induction Bias" in Sequence Models
CURE: Context-driven Diffusion with Progressive Expansion for Single Domain Generalization in Time Series Classification
Robust Harmful Features Under Jailbreak Attacks: Mechanistic Evidence from Attention Head Specialization in Large Language Models
Balancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspective
BabyVision: Visual Reasoning Beyond Language
JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG
On the Anisotropy of Score-Based Generative Models
Learning to Perceive the World Through Control: Empowerment-Based Representation Learning
Learning Discriminative and Generalizable Anomaly Detector for Dynamic Graph with Limited Supervision
Position: If open source is to win, it must go public
Dynamic Decision Learning: Test-Time Evolution for Abnormality Grounding in Rare Diseases
DF-LoGiT: Data-Free Logic-Gated Backdoor Attacks in Vision Transformers
Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining
SLIP-RS: Structured-Attribute Language-Image Pre-Training for Remote Sensing Object Detection
From Distribution to Geometry: Stable Graph Generalization via Invariant Barycenters
3D Scene Assertion Verification
Drop-in Circulant Structural Priors for Transformer Decoding of Cyclic Codes
GI-GCN: Global Interacted Graph Convolutional Networks via Dominant Sets for Graph Classification
SFCLTA: Spectral Fusion Contrastive Learning with Topology-Adaptive Graph Augmentation
Deep Flow Networks
Beyond Benchmarks: Toward Causally Faithful Evaluation of Large Language Models
A Deep Learning Model of Mental Rotation Informed by Interactive VR Experiments
Thinking with Geometry: Active Geometry Integration for Spatial Reasoning
FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching
Ratio-Variance Regularized Policy Optimization
UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
Continuous Diffusion Models Can Obey Formal Syntax
Identifying Latent Concepts and Structures for Generalized Category Discovery
Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing for Dense Predictions
CombinationTS: A Modular Framework for Understanding Time-Series Forecasting Models
ParaTool: Shifting Tool Representations from Context to Parameters
Mechanistic Interpretability as Statistical Estimation: A Variance Analysis
Score Based Error Correcting Code Decoder
Zero-Shot 3D Question Answering via Hierarchical View-to-Token Transportation
Plan for Speed: Dilated Scheduling for Masked Diffusion Language Models
MODUS: Decoder-only Any-to-Any Modeling of Diverse Modalities
The Double-Edged Nature of the Rashomon Set for Trustworthy Machine Learning
Neural Minimum Weight Perfect Matching for Quantum Error Codes
Conformal Calibration Transfer
Quadratically Regularized Optimal Transport: Localization Bounds and Affine Case Analysis
Collaborative Disagreement Resolution for Scalable Oversight
Position: Uncertainty is a Strategic Signal in Human–AI Decision Making
Sufficiency is Relative: Evaluating LLM Explanations under Model-Induced Input Distributions
GASS: Geometry-Aware Spherical Sampling for Disentangled Diversity Enhancement in Text-to-Image Generation
LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs
PyHealth 2.0: A Comprehensive Open-Source Toolkit for Accessible and Reproducible Clinical Deep Learning
Offline Reinforcement Learning of High-Quality Behaviors Under Robust Style Alignment
ManiSoft: Towards Vision-Language Manipulation for Soft Robotics
Escaping the Diversity Trap in Robotic Manipulation via Anchor-Centric Adaptation
GraphFLEx: Unsupervised Structure Learning $\underline{\text{F}}$ramework for $\underline{\text{L}}$arge $\underline{\text{Ex}}$panding $\underline{\text{Graph}}$s
MemDecoder: Enhancing Test-Time Compute for LLM Agents via Reinforced Memory Decoding
MA$^3$S: Model-Agnostic Active Annotation Strategy for Crowdsourcing
Efficient Diffusion Models under Nonconvex Equality and Inequality constraints via Landing
Adapting to Evolving Graphs: A Scalable Framework for Dynamic Coarsening
Grounding LLMs in Scientific Discovery via Embodied Actions
Learning Rewrite-Invariant Reasoning with Targeted Alternation Training
GRPO is Secretly a Process Reward Model
FairJudge : An Adaptive, Debiased, and Consistent LLM-as-a-Judge
Position: Deployed Reinforcement Learning should be Continual
An Approximation Algorithm for Graph Label Selection
Float8@2bits: Entropy Coding Enables Data-Free Model Compression
VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
Learning the Best Under Constraints: A Duality-Based Framework
Towards Whole-corpus Reconstruction of Heterogeneous RAG Knowledge Bases
Learning Hamiltonian Flow Maps: Mean Flow Consistency for Large-Timestep Molecular Dynamics
Aligning Datasets and Models for Weight Space Learning
How Should Transformers Represent Numeric Values in Electronic Health Records?
The Loss Is Not Enough: Sampling Conditions and Inductive Bias in Contrastive Representation Learning
Radial Scaling Voxelization for Accurate Small Object 3D Detection
Thinking in Scales: Accelerating Gigapixel Pathology Image Analysis via Adaptive Continuous Reasoning
$\texttt{MetaDistill}$: Unlocking the Performance Ceiling for Pretrained Optimizers
From Retrieval to Translation: Translating Query into Graph-level Clues for Retrieval-Augmented Generation
Large Language Models Explore by Latent Distilling
The Entropic Signature of Class Speciation in Diffusion Models
Position: Interpretability in Deep Time Series Models Demands Semantic Alignment
Exploring Relational Reasoning Capabilities in LLMs with REL
ECCO: Evidence-Driven Causal Reasoning for Compiler Optimization
Learn-to-learn on Arbitrary Textual Conditioning: A Hypernetwork-Driven Meta-gated LLM
Generalized Boundary FDR Control under Arbitrary Dependence: An Approach on Closure Principle
SwitchCraft: Programmatic Design of State-Switching Proteins
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling
Safety-Efficacy Trade Off: Robustness against Data-Poisoning
VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge
On the Power of Source Screening for Learning Shared Feature Extractors
OSNIP: Breaking the Privacy-Utility-Efficiency Trilemma in LLM Inference via Obfuscated Semantic Null Space
Can LLMs Reason Like Automated Theorem Provers for Rust Verification? VCoT-Bench: Evaluating via Verification Chain of Thought
LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning
Modeling Attributional Style at Scale: A Dataset and Analysis for Psychological Attribution Assessment and Reframing
Temporal-aware Flow Matching for Video Generation with Temporally Coherent Motion
FlatLab: A Unified Methodology Framework and Simulation-Based Benchmark for Robotic Manipulation of Flat Objects
Autobidding Auctions with LLM-Powered Creatives
DIYHealth Suite: Dataset, Model, and Benchmark for Health Management at Home
Weak-to-Strong Generalization via Bregman Bias–Variance Decomposition
Diffusion Flow Matching: Dimension-Improved KL Bounds and Wasserstein Guarantees
Visual Implicit Autoregressive Modeling
Foundation Inference Models for Ordinary Differential Equations
Speech-Audio Compositional Attacks on Multimodal LLMs and Their Defense with SALMONN-Guard
Self-Captioning Multimodal Interaction Tuning: Amplifying Exploitable Redundancies for Robust Vision Language Models
More Sail than Ballast: Addressing Harmful Knowledge Leakage in the Expansive Reasoning Space of LRMs
Knothe-Rosenblatt Quantile Regression for Risk-sensitive Multi-objective Reinforcement Learning
Through the Stealth Lens: Attention-Aware Defenses Against Poisoning in RAG
The benefits of full data shuffle, now with optimal I/O cost: $k$-wise independence and matrix transposition to the rescue
Are Your Agents Upward Deceivers?
Adaptive Bandit Algorithms for Contextual Matching Markets
Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
Block-wise Codeword Embedding for Reliable Multi-bit Text Watermarking
Optimal Stopping in Latent Diffusion Models
PACEAttention: Principled and Adaptive Feature Compression-Expansion Grounded in the Geometry of $\text{MCR}^2$
Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation
Multiple Choice Learning of Low-Rank Adapters for Language Modeling
Hyperspectral Image Fusion with Spectral-Band and Fusion-Scale Agnosticism
Decompose and Recompose: Reasoning New Skills from Existing Abilities for Cross-Task Robotic Manipulation
Self-CriTeach: LLM Self-Teaching and Self-Critiquing for Improving Robotic Planning via Automated Domain Generation
Correct looks better: Pairwise comparisons reveal accuracy rankings
PCA of Probability Measures: Sparse and Dense Sampling Regimes
Improving the Performance and Learning Stability of Parallelizable RNNs Designed for Ultra-Low Power Applications
R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model?
Fault Tolerant Multi-Agent Learning with Adversarial Budget Constraints
Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
AuTAgent: A Reinforcement Learning Framework for Tool-Augmented Audio Reasoning
Fourier Features Let Agents Learn High Precision Policies with Imitation Learning
Automata-Conditioned Cooperative Multi-Agent Reinforcement Learning
Probably Approximately Correct Labels
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
SpikeVLA: Vision-Language-Action Models with Spiking Neural Networks
Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control
Adversarially Robust Control of Conditional Value-at-Risk via Kelly Conformal Inference
iGRPO: Fast Online RL for Flow Matching Model with Dense Reward
Activation-Free Backbones for Image Recognition: Polynomial Alternatives for Spatial and Channel Mixing
DANCE: Dynamic, Available, Neighbor-gated Condensation for Federated Text-Attributed Graphs
Why DDIM Hallucinates More than DDPM: A Theoretical Analysis of Reverse Dynamics
Revisiting the Role of Pretrained Weights in Model Merging: On Near-Optimality within the Core Subspace
A Fully First-Order Layer for Differentiable Optimization
Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning
Bridging Functional Correctness and Runtime Efficiency Gaps in LLM-Based Code Translation
On the Accuracy of Newton Step and Influence Function Data Attributions
Spectral Heat Flow for Conservative Token Condensation in Vision-Language Models
SuCo: Sufficiency-guided Continuous Adaptive Reasoning
Beyond Blind Noising: Disentangled Visual Rectification for Hallucination Mitigation in MLLMs
TGV-KV: Text-Grounded KV Eviction for Vision-Language Models
Tuning-Free One-Class Discriminant Learning for Tabular Anomaly Detection
Rethinking Visual Intelligence: Insights from Video Pretraining
Robust Self-reflective Hashing for Cross-modal Retrieval with Noisy Label
Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging
Deep Discriminative Structure Proxy Hashing for Cross-modal Retrieval
Optimal Quantum Speedups for Repeatedly Nested Expectation Estimation
Time series saliency maps: Explaining models across multiple domains
Modeling Long-Tail Relations in the Operating Room via In-Context Multimodal Learning
On Path to Multimodal Historical Reasoning: HistBench and HistAgent
How Do Language Models Speak Languages? A Case Study on Unintended Code-Switching
Spatial-Aware Reduction Framework: Towards Efficient and Faithful Visual State Space Models
Position: Neural Approximation Is Rarely Justified for Hard Combinatorial Problems
Non-Parametric Optimization for Scalable Learning in Stochastic Decision Problems
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models
RSAgent: Learning to Reason and Act via Multi-Turn Tool Invocations for Text-Guided Segmentation
The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting
Learning to Watch: Active Video Anomaly Understanding via Interleaved Policy Optimization
Linguistic Relative Policy Optimization for Video Anomaly Reasoning
Beyond Detection: A Structure-Aware Framework for Scene Text Tracking
Structured Diffusion Bridges: Inductive Bias for Denoising Diffusion Bridges
SafeLab: An Interactive High-Fidelity Benchmark for Embodied Safety in Scientific Robotics
FedCDWA: Decoupled Federated Prototype Distillation with Hierarchical Wasserstein Aggregation
Stability beyond bounded differences: sharp generalization bounds under finite $L_p$ moments
SphericalDreamer: Generating Navigable Immersive 3D Worlds with Panorama Fusion
Salus: Strategic Diagnostic Testing for Complex Diagnosis via Multi-Agent Reinforcement Learning
Embodied-DETR: End-to-End Temporal 3D Object Detection in Egocentric Views
Active Continual Learning with Metaplastic Binary Bayesian Neural Networks
Concept-Guided Tokenization: Closing the Gap Between Reconstruction and Generation
UniMapping: Unified SLAM Framework for Map-Centric Embodied Perception
Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning
BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning
Rethinking LLM Ensembling from the Perspective of Mixture Models
Causal Dependency-Aware Unsupervised Routing for Large Reasoning Models
LEAP: Zone-Aware MCTS for LLM Self-Speculative Decoding
IndexMem: Learned KV-Cache Eviction with Latent Memory for Long-Context LLM Inference
Once-for-All: Scalable Simultaneous Forecasting via Equilibrium State Estimation
DeepSight: Long-Horizon World Modeling via Latent States Prediction for End-to-End Autonomous Driving
Sycophancy Towards Researchers Drives Performative Misalignment
FACT: Fuzzy Alignment with Comorbidity Topology for Reliable Multi-Label Medical Image Diagnosis
DriveWorld-VLA: Unified Latent-Space World Modeling with Vision–Language–Action for Autonomous Driving
DyGRO-VLA: Cross-Task Scaling of Vision–Language–Action Models via Dynamic Grouped Residual Optimization
Task-Awareness Improves LLM Generations and Uncertainty
Deep Reinforcement Learning Finds Bayes-Nash Equilibrium in Competitive Newsvendor Problems
Diagnosing and Correcting Concept Omission in Multimodal Diffusion Transformers
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents
D-FUSEr: Diverse Failure, Unified Success via Error-Distribution Shaping in LLM Reasoning
PsumQuant: In-line Post-training Partial Sum Quantizer for Energy Efficient NPU Inference
Universal Approximation with Softmax Attention
Distributionally Robust Causal Abstractions
DDGA: Dirichlet Distributional Gradient Aggregation for Transferable Vision-Language Adversarial Attacks
Verbalized Bayesian Persuasion
TAMPO: Task- and Model-Aware Automatic Prompt Optimization for Robust and Controllable Auto-Routing in LLM-based Systems
Offline Multi-Agent Reinforcement Learning via Sequential Score Decomposition
4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere
InfraRL: A Benchmark for Constrained Resource Allocation in Large-Scale Infrastructure Asset Management
Rethinking Federated Prompt Learning for Medical Images: From Textual Tuning to Visual Manifold Anchoring
The Invisible Lottery: How Subtle Cues Steer Algorithm Choice in LLM Code Generation
Causal Preference Elicitation
FedGain: Toward Negative-Gain-Free Client Collaboration in Federated Learning
VLANeXt: Recipes for Building Strong VLA Models
Towards Completeness in Causal Discovery from Soft Interventions with Known Targets
Talk, Judge, Cooperate: Gossip-Driven Indirect Reciprocity in Self-Interested LLM Agents
Root Cause Analysis of Failures in Microservices via Bayesian Root Cause Discovery
Learnability-Driven Knowledge Assimilation for Class-Incremental Semantic Segmentation
TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering
Dissecting Post-Training: Uncovering the Complementary Roles of SFT and RL for Document Parsing
OvisOCR: End-to-End Document Parsing via Aligning Specialized Perception with General Reasoning
Epistemic Gain, Aleatoric Cost: Uncertainty Decomposition in Multi-Agent Debate for Math Reasoning
Latent Representation Alignment for Offline Goal-Conditioned Reinforcement Learning
LithoDreamer: A Physics-Informed World Model for Multi-Stage Computational Lithography
Robust Inter-Series Dependency Modeling for Time Series Forecasting via Information-Theoretic Alignment
Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History
Configurable Reward Model for Balanced Safety Alignment
Toward Robust Multilingual Adaptation of LLMs for Low-Resource Languages
GraphP-FL: Personalized Federated Graph Learning via Dynamic Structure Awareness and Fisher Information Elastic Alignment
Suppress and Diversify: Refining Robust Pathways for Corruption Robustness
ElicitR: Unlocking Latent Reasoning in Dense Retrievers via Generative Regularization
nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding
ReQAT: Achieving Full-Precision Reasoning Accuracy with 4-bit Floating-Point Quantization-Aware Training
Ambiguous Strategic Classification
Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
Lottery Prior: Randomized Neural Compression for Zero-Shot Inverse Problems
Automatic Layer Selection for Hallucination Detection
Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization
CB-SLICE: Concept-Based Interpretable Error Slice Discovery
Rethinking Genomic Modeling Through Optical Character Recognition
Unlocking the Potential of Continual Model Merging: An ODE Perspective
Convex Low-resource Accent-Robust Language Detection in Speech Recognition
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space
On the Optimization Trajectory of DeepWalk Embeddings
Motion-Aware Caching for Efficient Autoregressive Video Generation
Position: Collusion Risks Among AI Reasoning Agents Justify Certification Requirements for Making Market Decisions
Training-Free Multimodal Large Language Model Orchestration
Evaluating LLM Uncertainty in Long-Form Generation Using Deterministic Ground Truth
Unbiased Alignment for Large Language Models with Noisy Preferences
Learning to Execute Graph Algorithms Exactly with Graph Neural Networks
Hermite-NGP: Gradient-Augmented Hash Encoding for Learning PDEs
StretchTime: Adaptive Time Series Forecasting via Symplectic Attention
Adaptive Querying with AI Persona Priors
Automatic Pruning Discovery for Large Language Models
Spiral RoPE: Rotate Your Rotary Positional Embeddings in the 2D Plane
Benchmarking the Limits of In-Context Reinforcement Learning for Ad-Hoc Teamwork
Bringing Code ALIVE: Optimizing Interactive Frontend Mini-Games via Automated Play and Reinforcement Learning at Scale
Von Mises-Fisher Mixture Model with Dynamic Shrinkage for Realistic Test-Time Transduction
HyMTRL: A Hybrid Multi-Task Reinforcement Learning Framework via Phased Policy Evolution
ORBIT: A Prognostic World Model for Ocular Reasoning Based on Imagined Trajectories
Scaling Agentic Verifier for Competitive Coding
Stochastic Sparse Attention for Memory-Bound Inference
Outrunning LLM Cutoffs: A Live Kernel Crash Resolution Benchmark for All
Dynamic Regret via Discounted-to-Dynamic Reduction with Applications to Curved Losses and Adam Optimizer
Fair Transit Stop Placement: A Clustering Perspective and Beyond
Trainable Nonexpansive Denoisers for Contractive Image Reconstruction
Continuous-Time Piecewise-Linear Recurrent Neural Networks
Understand and Accelerate Memory Processing Pipeline for Large Language Model Inference
TUR-DPO: Topology- and Uncertainty-Aware Direct Preference Optimization
Sparse by Design: Relevance-Driven Scaling for Recommender Systems
Hierarchical ODE: Learning Continuous-Time Physical Prototypes for Early Link Failure Detection
DP-KFC: Data-Free Preconditioning for Privacy-Preserving Deep Learning
AD-BTS: Adaptive Dual-Branch Token Sparsification via Spatial Information Density
FRACTAL: State Space Model with Fractional Recurrent Architecture for Computational Temporal Analysis of Long Sequences
Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding
Krause Synchronization Transformers
CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection
Privacy Amplification in Differentially Private Zeroth-Order Optimization with Hidden States
Position: The Machine Learning Community Must Treat Compute Inequality as a First-Class Research Problem
Quantile-Free Uncertainty Quantification in Graph Neural Networks
When Can We Trust Survival Model Evaluation ?
HPS: Hyperspherical Parameter Sharing for Efficient Multi-Agent Reinforcement Learning
Semi-Supervised Learning for Molecular Graphs via Ensemble Consensus
Unifying Deep Stochastic Processes for Image Enhancement
Credit-assigned Policy Gradient for Early Stage Retrieval in Two-stage Ranking
Self-Evolving LLM Agents under Offline Data Support
LUVE : Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts
PhaseCoder: Microphone Geometry-Agnostic Spatial Audio Understanding for Multimodal LLMs
Position: Certified Correctness in Neural Constraint Reasoning Requires Symbolic Integration
SorryDB: Can AI Provers Complete Real-World Lean Theorems?
MIDSTEER: Optimal Affine Framework for Steering Generative Models
Cluster-Aware Causal Mixer for Online Anomaly Detection in Multivariate Time Series
Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective
Risk Awareness Injection: Calibrating Vision-Language Models for Safety without Compromising Utility
Flat Minima and Generalization: Insights from Stochastic Convex Optimization
Learning Gaussian Mixture-distributed Prototypes for 3D Scene Graph Generation from RGB-D Sequences
The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy
Calibrating Conservatism for Scalable Oversight
Personalized Image Generation via Human-in-the-loop Bayesian Optimization
DualTimesField: Rethinking Time Series as Continuous-Time Trends and Events
Variational Entropic Optimal Transport
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception
Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation
Efficient Multi-Agent Reasoning via Confidence-Guided Adaptive Debate
MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks
Dual Optimal Transport for Multi-Concept Composition: Structural Alignment and Texture Injection in Diffusion Models
IVQ: Structured and Lightweight Vector Quantization via Binary Hierarchical Composition Inspired by $\textit{IChing}$
Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
Why Linear Recurrent Memory Works in Partially Observable Reinforcement Learning
A Short and Unified Convergence Analysis of the SAG, SAGA, and IAG Algorithms
Colorful Pinball: Density-Weighted Quantile Regression for Conditional Guarantee of Conformal Prediction
CODiff: One-Step Diffusion Model for Camouflaged Object Detection
Improving the Robustness-Utility Trade-off in Decentralized Learning over Sparse Networks
Draft-Conditioned Constrained Decoding for Structured Generation in LLMs
Can Computational Reducibility Lead to Transferable Models for Graph Combinatorial Optimization?
Scaling by Diversified Experience for Vision-Language-Action Models
Old Habits Die Hard: How Conversational History Geometrically Traps LLMs
ImpQuant: Fine-Grained Importance-Aware Quantization for Large Vision-Language Models
CRAMER: Control via Request-Aware Masking for Editing Recommenders
Names Don’t Matter: Symbol-Invariant Transformer for Open-Vocabulary Learning
TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance
DEER: A Benchmark for Evaluating Deep Research Agents on Expert Report Generation
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity
Contextualized Visual Personalization in Vision-Language Models
Trust3R: Unifying Feed-Forward Pointmap Prediction and Evidential Learning for Trust-Aware 3D Reconstruction
Automatically Finding Reward Model Biases
Breaking the Synthetic-Real Domain Shortcut for Training-Free Generative Replay-based Class Incremental Learning
Improving Explicit Dynamic Gaussian Splatting Optimization via Update Mixture
Class-Grouped-Normalized-Momentum and Faster Hyperparameter Exploration to Tackle Class Imbalance in Federated Learning
Enhanced Latent-Space Adversarial Training for Super-Resolution
Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO
Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL
Position: AI Evaluations Should be Grounded on a Theory of Capability
GRPO-based Cluster Decision Agent for Unknown-$\boldsymbol{K}$ Multi-view Clustering
PACER: Acyclic Causal Discovery from Large-scale Interventional Data
Mind Dreamer: Untethering Imagination via Active Counterfactual Reasoning on Latent Manifolds
Optimizing Diversity and Quality through Base-Aligned Model Collaboration
KnapSpec: Self-Speculative Decoding via Adaptive Layer Selection as a Knapsack Problem
$\textit{S}$-SPPO: Semantic-Calibrated Self-Play Preference Optimization
Efficient and Minimax-optimal In-context Nonparametric Regression with Transformers
Physics-Informed Self-Supervised Learning on Efficient Electron-Density Images for Organic Material Property Prediction
MedREK: Retrieval-Based Editing for Medical LLMs with Key-Aware Prompts
Fully Dynamic Coreset Spectral Clustering
STD-Former: Image-Conditioned Texture Dictionary Encoding with Sparse Topological Supervision for Texture Recognition
Tight Margin-Based Generalization Bounds for Voting Classifiers over Finite Hypothesis Sets
Needles in the Haystack: Addressing Signal Dilution Improves scRNA-seq Perturbation Response Modeling and Evaluation
$f$-Divergence Regularized RLHF: Two Tales of Sampling and Unified Analyses
Benchmarking Physics-Informed Time-Series Models for Operational Global Station Weather Forecasting
Dynamic Multimodal Evaluation via Knowledge-Enhanced Benchmark Evolution
Position: Quantum Program Generation Must Prioritize Validity Over Probabilistic Scaling
Distinguishable Deletion: Unifying Knowledge Erasure and Refusal for Large Language Model Unlearning
Memoria-Bench: A Comprehensive Benchmark for Evaluating Memory in Long-Horizon Autonomous Agents
Temporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via Past-Future
A Positive Case for Faithfulness: Explanations Help Predict Model Behavior
Transforming Weather Data from Pixel to Latent Space
Online Social Welfare Function-based Resource Allocation
SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass
Causal Fine-Tuning under Latent Confounded Shift
Box Thirding: Anytime Best Arm Identification under Insufficient Sampling
SHARP-Q: Spectral Hessian Alignment and Rectification for Post-training Quantization
Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals
Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling
Bits That Count: Quantifying and Predicting Capabilities of Language Models
Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate
Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training
Revisiting the Platonic Representation Hypothesis: An Aristotelian View
Data Reconstruction: Identifiability and Optimization with Sample Splitting
scCBGM: Single-Cell Editing via Concept Bottlenecks
TwinWeaver: An LLM-Based Foundation Model Framework for Pan-Cancer Digital Twins
Zero-Shot Text-to-Motion Evaluation using Video Language Models
Persona-Pruner: Sculpting Lightweight Models for Role-Playing
TIME: Tensor-Factorized Mixture-of-Experts with Intrinsic Routing for Lifelong Multimodal Knowledge Editing
Direct 3D-Aware Object Insertion via Decomposed Visual Proxies
Learning, Solving and Optimizing PDEs with TensorGalerkin: an efficient high-performance Galerkin assembly algorithm
Harnessing Non-Adversarial Robustness in Large Language Models
Understanding Self-Supervised Learning via Latent Distribution Matching
Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory
LAGEA: Language Guided Embodied Agents for Robotic Manipulation
HoloFair: Unified T2I Fairness Evaluation and Fair-GRPO Debiasing
HelioX: A GPU-Native Framework for Simulation and Training of Biophysically Detailed Networks
Textual Supervision Enhances Geospatial Representations in Vision-Language Models
Finding Differentially Private Second Order Stationary Points in Stochastic Minimax Optimization
PRISM: Gauge-Invariant Tangent-Space Differentially Private LoRA
3ViewSense: Spatial and Mental Perspective Reasoning from Orthographic Views in Vision-Language Models
Mining Useful General Data for Low-Resource Domain Adaptation
AutoBaxBuilder: Bootstrapping Code Security Benchmarking
From Reasoning Traces to Reusable Modules: Reinforcement Learning for Compositional Generalization in Language Model Reasoning
$\mathcal{O}(\log N)$ Latent Dimension Suffices for Universal Approximation of Permutation-invariant Function
Position: Text Embeddings Should Capture Implicit Semantics, Not Just Surface Meaning
Hom-PGD+: Fast Reparameterized Optimization over Non-convex Ball-Homeomorphic Set
A Language-Guided Bayesian Optimization for Efficient LoRA Hyperparameter Search
TimeLAVA: Learning-Agnostic Valuation for Time Series Data
Evaluating and Steering Modality Preferences in Multi-modal LLMs
Continuity-Regularized Flow Matching for Offline Reinforcement Learning
Fair Decisions from Calibrated Scores: Achieving Optimal Classification While Satisfying Sufficiency
OSCS: Online Selection with Provable FAR Control for LLM Safety
Rényi Diffusion Models
Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations
Proximal Splitting Methods for Hybrid Differentiable Models
RECAST: Model Reconstruction via Counterfactual-Aware Wasserstein Geometry under Limited Data
Don't Force the Fit: Bounded Log-Likelihood Loss for Enhanced Reasoning in Large Language Models
A Unified Density Operator View of Flow Control and Merging
Towards Seed-Robust Safety Alignment in Text-to-Image Models
Constrained Flow Optimization via Sequential Fine-Tuning for Molecular Design
SpaEF: Spatially Resolved Transcriptomics Data Element-Wise Denoising Framework Powered by Large Models
Deliberate Evolution for Sample-Efficient Symbolic Regression with LLM
CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving
From Prompts to Responses: Dual-Sided Data Leakage and Defense in Split Large Language Models
Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning
Continual Learning With Participation Privacy: An Auditable Buffering-Aggregation Recipe
Cardio-mmFlow: A Gaussian-Prior-Free Physics-Informed Flow Matching Framework for Electrocardiogram to mmWave Radar Synthesis.
Learning Credal Ensembles via Distributionally Robust Optimization
MAGIC: Multi-Granularity Language-Informed Image Clustering
AvAtar: Learning to Align via Active Optimal Transport
MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics
Task-Aware Mechanism: Hybrid MoE Vision Tower Towards Holistic Video Understanding
TritonGym: A Benchmark for Agentic LLM Workflows in Triton GPU Code Generation
SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training
EnerGS: Energy-Based Gaussian Splatting under Partial Geometric Observability
TIC-VLA: A Think-in-Control Vision-Language-Action Model for Robot Navigation in Dynamic Environments
Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization
Semi-knockoffs: a model-agnostic conditional independence testing method with finite-sample guarantees
Quaternion Self-Attention with Shared Scores
Cost-aware Stopping for Bayesian Optimization
Loss-aware distributionally robust optimization via trainable optimal transport ambiguity sets
Position: Medical AI Neglects Real Treatment Outcomes
Representational Similarity and Model Behavior in Multi-Agent Interaction
Peer-Preservation in Frontier Models
Linearizing Vision Transformer with Test-Time Training
Semantic-Enriched Latent Visual Reasoning
Efficient and Safe Molecular Assembly via Reinforcement Learning and Constraint Solving
Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities
LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning
D-ARL: A Distribution-Matched Asynchronous Reinforcement Learning Framework for Language Reasoning
SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations
DLO-Lab: Benchmarking Deformable Linear Object Manipulations with Differentiable Physics
Matrix-Free GPU Semidefinite Programming for Quantum Ordered Search at the k=6 Frontier
Solving Positive Linear Programs with Differential Privacy
Rethinking Visual Autoregressive Sampling with Information-Grounding Guidance
Decoupling The "What" and "Where" With Polar Coordinate Positional Embedding
Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space
Adversarial Latent Embedding Repair for LLM Continual Learning
One-Shot Weighted Ensemble Estimation for Federated Quantile Regression: Optimal Statistical Guarantees under Heterogeneous Structured Data
On Effectiveness and Efficiency of Agentic Tool-calling and RL Training
Domain Adaptation with Adaptive $f$-Divergence: Tighter Variational Representation and Generalization Bounds
Low-Rank and Sparsity Are All You Need: Exploring Robust Hierarchical Latent Subspaces for Transferable Adversarial Attack
Scalable Simulation-Based Model Inference with Test-Time Complexity Control
The Geometry of Projection Heads: Conditioning, Invariance, and Collapse
Improving Sampling for Masked Diffusion Models via Information Gain
UOTIP: Unbalanced Optimal Transport Map for Unpaired Inverse Problems
CorrSteer: Generation-Time LLM Steering via Correlated Sparse Autoencoder Features
Position: Evaluating LLMs in Finance Requires Explicit Bias Consideration
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Reflective Hamiltonian Monte Carlo: Mixing Analysis and Application to Sampling on Stiefel Manifold
Decision-Focused Learning via Tangent-Space Projection of Prediction Error
Position: Reliable AI Needs to Externalize Implicit Knowledge: A Human–AI Collaboration Perspective
VEQ: Modality-Adaptive Quantization for MoE Vision-Language Models
The Optimal Sample Complexity of Linear Contracts
Condition Number Based Low-Bit Quantization for Image Super-Resolution
The Interplay Between Interpolation and Aggregation in Regression: Optimal Sample Complexity
Maximizing mutual information between prompt and response improves LLM performance with no additional data
ExpAlign: Expectation-Guided Vision–Language Alignment for Open-Vocabulary Grounding
Beyond Literal Translation: Evaluating Cultural Effectiveness in Social Media UGC
SCOPE: Selective Conformal Optimized Pairwise LLM Judging
Anchoring Self-Play for Code Repair
Training Data Efficiency in Multimodal Process Reward Models
Darwinian Memory: A Training-Free Self-Regulating Memory System for GUI Agent Evolution
Topology-Preserving Neural Operator Learning via Hodge Decomposition
OVLR: Efficient, Scalable, and Robust Training via Output-Level Variance-Reduced Likelihood Ratio
Length Generalization Bounds for Transformers
Interpreting and Steering State-Space Models via Activation Subspace Bottlenecks
Refining Context-Entangled Content Segmentation via Curriculum Selection and Anti-Curriculum Promotion
At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization
A Unifying View of Variational Generative Wasserstein Flows
FedScar: Correcting Geometric Bias for Flatness-Consistent Federated Learning
Quantifying LLM Attention-Head Stability: Implications for Circuit Universality
Faithful Mobile GUI Agents with Guided Advantage Estimator
HSMAD: Heterophily-Driven Spectral and Manifold Learning for Graph Anomaly Detection
PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents
How RLHF Amplifies Sycophancy
RelayCaching: Accelerating LLM Collaboration via Decoding KV Cache Reuse
Federated Causal Inference on Multi-Site Observational Data via Propensity Score Aggregation
Modeling Spectral Energy Shifts in Spatio-Temporal Graph Anomaly Detection
SteeringSafety: Benchmarking Representation Steering in LLMs Across Safety Perspectives
Adaptive Batch Sizes Using Non-Euclidean Gradient Noise Scales for Stochastic Sign and Spectral Descent
OmniVL-Guard: Towards Unified Vision-Language Forgery Detection and Grounding via Balanced RL
Adaptive Personalized Federated Learning via Multi-task Averaging of Kernel Mean Embeddings
KFStego: Key-Free Secure Image Distribution via Bipartite Structural Invariants
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
Tight Stability Bounds for Robust Distributed Learning: Byzantine Failures Hurt Generalization More than Data Poisoning
Scaling Multi-Agent Environment Co-Design with Diffusion Models
Cheap2Rich: A Multi-Fidelity Framework for Data Assimilation and System Identification of Multiscale Physics - Rotating Detonation Engines
Multimarginal flow matching with optimal transport potentials
SENDAI: A Hierarchical Sparse-measurement, EfficieNt Data AssImilation Framework
Reinforced Sequential Monte Carlo for Amortised Sampling
Position: RL Should Be Used to Adjust Foundation Models, NOT Abused
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
SS‑TPT: Stability and Suitability-Guided Test-Time Prompt Tuning for Adversarially Robust Vision-Language Models
Transformers with RL or SFT Provably Learn Sparse Boolean Functions, But Differently
Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
How Does the Lagrangian Guide Safe Reinforcement Learning through Diffusion Models?
Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
History-Bootstrapped Flow Matching for Inverse Boiling Reconstruction
Conflicting Biases at the Edge of Stability: Norm versus Sharpness Regularization
KAST-BAR: Knowledge-Anchored Semantically-Dynamic Topology Brain Autoregressive Modeling for Universal Neural Interpretation
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in Unified Multimodal Models via Decompositional Verifiable Reward
Utonia: Toward One Encoder for All Point Clouds
Contrastive Geometric Learning Unlocks Unified Structure- and Ligand-Based Drug Design
Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds
SIMPC: Learning Self-Induced Mirror-Point Consistency for Unsupervised Point Cloud Denoising
Information-Geometric Adaptive Sampling for Graph Diffusion
Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture
Unifying Stacking and Cascading for Efficient Ensemble Inference
WorldCompass: Reinforcement Learning for Long-Horizon World Models
Latent Diffusion Pretraining for Crystal Property Prediction
Risk-Bounded Distribution Reconstruction: Stable Statistic Calibration for Long-Tailed Recognition
BRIDGE: Predicting Human Task Completion Time From Model Performance
FedReLa: Imbalanced Federated Learning via Re-Labeling
UniCoD: Enhancing Robot Policy via Unified Continuous and Discrete Representation Learning
Dissecting Embodied Abilities in Multimodal Language Models through Skill-level Evaluation and Diagnosis
Don't Walk the Line: Boundary Guidance for Filtered Generation
ThetaEvolve: Test-time Learning on Open Problems
Asymptotically Optimal Sequential Testing with Markovian Data
Reading Between the Tokens: Improving Preference Predictions through Mechanistic Forecasting
Position: The Alignment Community is Unintentionally Building a Censor’s Toolkit
FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning
GOTabPFN: From Feature Ordering to Compact Tokenization for Tabular Foundation Models on High-Dimensional Data
Variational Flow Maps: Make Some Noise for One-Step Conditional Generation
Do Transformers Need Three Projections? Systematic Study of QKV Variants
Bootstrapped Exploration with Causal Reasoning: A Training Paradigm for Adaptive Forecasting Agent
Falsifying Sparse Autoencoder Reasoning Features in Language Models
MM-Spectrum: Multimodal Multi-spectral Molecular Structural Elucidation with a Stable MoE Framework
A Noise Sensitivity Exponent Controls Large Statistical-to-Computational Gaps in Single- and Multi-Index Models
How Hard Can It Be? Hardness-Aware Multi-Objective Unlearning
Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
Bridging Your Imagination with Audio-Video Generation via a Unified Director
Reinforcing Real-world Service Agents: Balancing Utility and Cost in Task-oriented Dialogue
TF-FACE: Time-Frequency Fusion Learning via Frequency-Domain Adaptive and Controllable Enhancement for Trajectory Prediction
SIKA-GP: Accelerating Gaussian Process Inference with Sparse Inducing Kernel Approximations for Bayesian Deep Learning
Boosting World Models Learning via Latent-Space Value Alignment
FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents
Position: Mechanisms for Aggregated Individual Reporting Should be Established for Post-Deployment Evaluation
How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs
Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning
Attention Illuminates LLM Reasoning: The Uncovered Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
TD3B: Transition-Directed Discrete Diffusion for Allosteric Binder Generation
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior
Teaching Agents to Ask Effective Clarification Questions
SeisMark: A Large-Scale Open Benchmark for Robust 3D Seismic Fault Detection
PolySAE: Modeling Feature Interactions in Sparse Autoencoders via Polynomial Decoding
SAD-Flower: Flow Matching for Safe, Admissible, and Dynamically Consistent Planning
Bridging Spherical Black-Box Optimizers
CauScale: Neural Causal Discovery at Scale
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning
InfVSR: Toward Consistency-Driven Streaming Generative Video Super-Resolution
Chain-of-Glimpse: Search-Guided Progressive Object-Grounded Reasoning for Video Understanding
AlignedNorm: Prompting Vision–Language Models via Coupled Prompt Field
Efficient Adaptive Testing via Gradient Path Matching Subset Selection for AI Education
AugMask: Score-Based Generative Modeling of Incomplete Tabular Data via Augmentation and Masking
SpikeCLR: Self-Supervised Contrastive Learning for Visual Representations with Spiking Neural Networks
scDataset: Scalable Data Loading for Deep Learning on Large-Scale Single-Cell Omics
Giving Sensors a Voice: Multimodal JEPA for Semantic Time-Series Embeddings
STT-LLM: Structural-Temporal Tokenization for Adapting LLMs to Longitudinal Clinical Profiles
Concept Removal for Frontier Image Generative Models
FeRA: Frequency-Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
Multi-Level Strategic Classification: Incentivizing Improvement through Promotion and Relegation Dynamics
GEPC: Group-Equivariant Posterior Consistency for Out-of-Distribution Detection in Diffusion Models
Are We Overconfident in Models and Results for Semi-Supervised 3D Medical Image Segmentation?
Dual Latent Memory for Visual Multi-agent System
Target-Driven Policy Optimization for Sequential Counterfactual Outcome Control
VPD-100K: Towards Generalizable and Fine-grained Visual Privacy Protection
Chunk-Guided Q-Learning
Active Policy Optimization for Individualized Dosing via Gradient Variance Minimization
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
Optimal Rates for Feasible Payoff Set Estimation in Games
StableI2I: Spotting Unintended Changes in Image-to-Image Transition
Self-Supervised Weight Templates for Scalable Vision Model Initialization
Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation
MICE-Bench: A Challenging and Comprehensive Benchmark for Multi-Reference Image Creation and Editing
Positive–Unlabeled Reinforcement Learning Distillation for On-Premise Small Models
SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling
Joint Model and Data Sparsification via the Marginal Likelihood
Don't Drop Dropout: Optimizing Layer Sparsity for Efficient LLM Training and Inference
How (Not) to Hybridize Neural and Mechanistic Models for Epidemiological Forecasting
Agentic Framework for Epidemiological Modeling
Reparameterization Flow Policy Optimization
Beyond Instance-Level Self-Supervision in 3D Multi-Modal Medical Imaging
AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications
MINIF2F-DAFNY: LLM-Guided Mathematical Theorem Proving via Auto-Active Verification
When Shared Knowledge Hurts: Spectral Over-Accumulation in Model Merging
Annotations Mitigate Post-Training Mode Collapse
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Identifying Common Hubs in Multiple Gaussian Graphical Models
ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment
Momentum Further Constrains Sharpness at the Edge of Stochastic Stability
Position: Use Sparse Autoencoders to Discover Unknowns
AnalogVerifier: A Neuro-Symbolic Framework for Analog Circuit Verification
Ramba: Selective State-Space Models for Relational Deep Learning
LFQ: Logit-aware Final-block Quantization for Boosting the Generation Quality of Low-Bit Quantized LLMs
RevealLayer: Disentangling Hidden and Visible Layers via Occlusion-Aware Image Decomposition
Self-correcting for Debiasing Large Language Models
Optimal Transport under Group Fairness Constraints
From Observations to States: Latent Time Series Forecasting
Scalable Topology-Preserving Graph Coarsening: Concepts and Algorithms
Towards Professional-Grade Financial Agents: Benchmarking, Tooling, and Structured Reasoning
FiX: Introducing Fine-grained Forget Gate into Softmax Attention
OTora: A Unified Red Teaming Framework for Reasoning-Level Denial-of-Service in LLM Agents
Sparse Autoencoders are Topic Models
Rh-3DGS: Robust Open-Vocabulary Scene Understanding via Riemannian Huber Distillation and Manifold-Aware Sampling
How to Avoid Debate: Scalable AI Safety via Doubly-Efficient Interactive Proofs
Margin-Adaptive Confidence Ranking for Reliable LLM Judgement
Unsupervised Hierarchical Skill Discovery
DOT-MoE: Differentiable Optimal Transport for MoEfication
Induction Meets Biology: Mechanisms of Repeat Detection in Protein Language Models
(1D) Ordered Tokens Enable Efficient Test-Time Search
Physics-informed coarsening for multigrid graph neural networks surrogates
On The Variability Of Concept Activation Vectors
CACR: Reinforcing Temporal Answer Grounding in Instructional Video via Candidate-Aware Causal Reasoning
FEDEMOE: IMPROVING PERSONALIZATION ON HET- EROGENEOUS FEDERATED LEARNING VIA ELASTIC MIXTURE OF EXPERTS ARCHITECTURE
PlugGuard: A Streaming Safeguard for Large Models via Latent Dynamics-Guided Risk Detection
Rethinking Attention in Spiking Transformers: Overcoming Density Bias with Set Similarity
Position: Regulating Algorithms Is Not Enough. A Study of Content Discovery in Online Platforms
Scaling the Prior: Size-Consistent Geometric Diffusion for 3D Molecular Generation
Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning
GenUnfold: Rapidly Predict Protein Mechanical Unfolding Trajectory via a Physics-Guided Diffusion Model
RBCBF: Decoding Time Safety Alignment via Risk Guided Rollback and Barrier Control
Enhancing Neural Theorem Proving via High-Quality Proof Selection and Verifier Feedback
Quantifying the Generalization Gap in Seizure Detection: A Large-Scale Empirical Benchmark via the SzCORE Challenge
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
What is Missing? Explaining Neurons Activated by Absent Concepts
Addressing Semantic Blind Spots in Text-to-SQL via Component Pre-generation and AST Matching Rewards
Information-Theoretic Disentangled Latent Modeling with Conditional Diffusion for Incomplete Multi-View Clustering
OPIC: Enhancing Language Model Merging via Optimizing In-Context Capability
HexGen-3: A Fully Disaggregated LLM Serving Framework with Fine-Grained Heterogeneous Resource Autoscaling
Compressed Sensing for Capability Localization in Large Language Models
Deterministic Differentiable Structured Pruning for Large Language Models
Persistent Semantic Entities in Tool-Augmented LLM Systems
Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations
Anchored Policy Optimization: Mitigating Exploration Collapse via Support-Constrained Rectification
RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Probabilistic Bisection Algorithm Provably Achieves Exponential Convergence
xKV: Cross-Layer KV-Cache Compression via Aligned Singular Vector Extraction
The Extra Tokens Matter: Disentangled Representation Learning with Vision Transformers
SliceFine: The Universal Winning-Slice Hypothesis for Pretrained Networks
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
A Call to Lagrangian Action: Learning Population Mechanics from Temporal Snapshots
Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access
STFlow: Data-Coupled Flow Matching for Geometric Trajectory Simulation
You Don’t Need All That Attention: Surgical Memorization Mitigation in Text-to-Image Diffusion Models
Can Recommender Systems Teach Themselves? A Recursive Self-Improving Framework with Fidelity Control
CatFlow: Co-generation of Slab-Adsorbate Systems via Flow Matching
Context Tuning for In-Context Optimization
On the Intrinsic Limits of Transformer Image Embeddings in Non-Solvable Spatial Reasoning
Component-Wise Composite Likelihood Distillation for Censored Time-to-Event Data
A Linear Expectation Constraint for Selective Prediction and Routing with False-Discovery Control
Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks
Designing Observation and Action Models for Efficient Reinforcement Learning with LLMs
GeoEvo: Identity-Aware Potential Game with Geometric Evolution for Personalized Multimodal Federated Learning
EchoRL: Reinforcement Learning via Rollout Echoing
QuArch: A Benchmark for Evaluating LLM Reasoning in Computer Architecture
High-Probability Convergence Guarantees of Decentralized SGD
Conflict-Aware Additive Guidance for Flow Models under Compositional Rewards
More Capable, Less Cooperative? When LLMs Fail at Zero-Cost Collaboration
Enhancing Multi-Modal LLMs Reasoning via Difficulty-Aware Group Normalization
WBMM: Windowed Batch Matrix Multiplication for Efficient Large Receptive Field Convolution
Cardinality-Invariant Neural Operator Policies for Scalable PDE Control
Position: Modular Memory is the Key to Continual Learning Agents
Conversation for Non-verifiable Learning: Self-Evolving Large Language Models through Meta-Evaluation
Timestep Rescheduling in Diffusion Inversion
SoftBinary Coding: A New Information-Theoretic Paradigm for Neural Compression via Fast Channel Simulation
ProSAR: Prototype-Guided Semantic Augmentation and Refinement for Time Series Contrastive Learning
Unlocking Noise-Resistant Vision: Key Architectural Secrets for Robust Models Against Gaussian Noise
Token-Sparse Medical Multimodal Reasoning via Dual-Stream Reinforcement Learning
Closing the Sim-to-Real Gap in Non-Markovian Spreading Processes via GPU-Accelerated Distributional RL
Inference-Time Conformal Reasoning with Valid Factuality Control for Large Language Models
Geometric Pocket-Centric Protein Encoding for Polypharmacology-Guided Multi-Target Drug Design
InfoDLM: an Information-Adaptive Framework for Discrete Diffusion Language Model Pretraining
Is Your Diffusion Sampler Actually Correct? A Sampler-Centric Evaluation of Discrete Diffusion Language Models
Solving Time-Dependent Differential Equations with Physical Dynamical Systems
Interpretable Functional Koopman Learning with Non-Markovian Closure for Spatiotemporal Systems
Experience Augmented Policy Optimization for LLM Reasoning
Deep networks learn to parse uniform-depth context-free languages from local statistics
Deep Ensemble Clustering for Visual Representation Learning
XPERT: Expert Knowledge Transfer for Effective Training of Language Models
Co-Evolving Latent Action World Models
PACE: Post-Causal Entropy Modeling for Learned LiDAR Point Cloud Compression
Towards Understanding Massive Activations in Attention Sink Mechanism
Decentralized and Disentangled Task–Role Representation Learning for Generalizable Offline Multi-Agent Meta Reinforcement Learning
DEGAP: Dynamic Entropy-Guided Attention Perturbation for Contrastive Decoding in Large Vision-Language Models
Fast Byte Latent Transformer
A Spiking Heterogeneous Harmonic Resonate-and-Fire State Space Model for Time Series
Off-Policy Learning in Large Action Spaces: Optimization Matters More Than Estimation
Operator Splitting with Hamilton-Jacobi-based Proximals
Condition-Aware Graph Flow Matching for Modeling the Distributions of Complex Physical Systems
Optimal Regret for Policy Optimization in Contextual Bandits
Beyond Binary: Continuous State Optimization with Graph-Structured Objectives
Stochastic Linear Bandits with Parameter Noise
Understanding Generalization from Embedding Dimension and Distributional Convergence
Near-Optimal Regret for Policy Optimization in Contextual MDPs with General Offline Function Approximation
Byte Pair Encoding for Efficient Time Series Forecasting
Local Intrinsic Dimension of Representations Predicts Alignment and Generalization in AI Models and Human Brain
OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild
Interpretability Transfer from Language to Vision via Sparse Autoencoders
Relational Structural Causal Models
VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding
Fast Inverse Lithography via GRPO Reinforced Flow Matching
A$^2$SG: Adaptive and Asymmetric Surrogate Gradients for Training Deep Spiking Neural Network
Learning Randomized Reductions
Group Distributionally Robust Optimization-Driven RL for LLM Reasoning
MapUQ: Map with Uncertainty Quantification for Robust BEV Vectorized Construction
The Geometry of Reasoning: Self-Evaluation via Layerwise Trajectory Evolution
PostTrainBench: Can LLM Agents Automate LLM Post-Training?
Identifying Learnwares via Reduced Neural Conditional Mean Embedding
Lookahead Sample Reward Guidance for Test-Time Scaling of Diffusion Models
Amortized Simulation-Based Inference in Generalized Bayes via Neural Posterior Estimation
WorldComp2D: Spatio-semantic Representations of Object Identity and Location from Local Views
Position: Time to Close The Validation Gap in LLM Social Simulations
Learning Self-Interpretation from Interpretability Artifacts: Training Lightweight Adapters on Vector-Label Pairs
Rethinking the Flow-based Gradual Domain Adaption: A Semi-Dual Optimal Transport Perspective
Endogenous Resistance to Activation Steering in Language Models: Evidence for Internal Consistency Monitoring in Llama-3.3-70B
CONTEXTOR: Contextualized High-order Contrastive Learning
Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training
Deep Progressive Training: scaling up depth capacity of zero/one-layer models
Exact and Approximate Algorithms for Polytree Learning
Decoupling Regularization and Privacy in Differentially Private Ridge Regression and ERM
Are VLMs Seeing or Just Saying? Uncovering the Illusion of Visual Re-examination
Markov Chain Monte Carlo without Evaluating the Target: an Auxiliary Variable Approach
Do Text Edits Generalize to Visual Generation? Benchmarking Cross-Modal Knowledge Editing in UMMs
Geometric Reciprocity: Unlocking Self-Supervision for Stereoscopic Video Generation
iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning
URS: A Unified Neural Routing Solver for Cross-Problem Zero-Shot Generalization
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models
SDM: A Powerful Tool for Evaluating Model Robustness
MetaMoE: Diversity-Aware Proxy Selection for Privacy-Preserving Mixture-of-Experts Unification
SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes
Modular Pretraining Enables Access Control
JANUS-LORA: A Balanced Low-Rank Adaptation for Continual Learning
SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
Twice Sequential Monte Carlo for Tree Search
SpreadsheetArena: Decomposing Preference in LLM Generation of Spreadsheet Workbooks
Cross-Tactile Sensor Representation Learning
TIMI: Training-Free Image-to-3D Multi-Instance Generation with Spatial Fidelity
WorldTravel: A Realistic Multimodal Travel-Planning Benchmark with Tightly Coupled Constraints
Tiny Brains, Giant Impact: Uncovering the Keystone Neurons of LLM with Just a Few Prompts
Variance-Reduced Zeroth-Order Langevin Dynamics for Non-Log-Concave Black-Box Sampling and Inverse Problems
Trajectory-Aware Heuristic Learning for Combinatorial Search
Reasoning Can Be Restored by Correcting a Few Decision Tokens
Internalizing Safety Understanding in Large Reasoning Models via Verification
Rethinking Calibration for Early-Exit Neural Networks
VeriSimpl: Robust Optimization Modeling from Natural Language using Simplification-based Verification
Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
SRPO: Self-Reflective Policy Optimization for Long-Horizon Reasoning
AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition
Spectral Bridge Variational Inference: Dynamic LoRA via Bures-Wasserstein Gradient Flows
Differentially Private Synthetic Tabular Data via Private Evolution
Position: Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment
The Unlearnability Phenomenon in RLVR for Language Models
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
Learning the ESG Geometry with Domain Aware Language Models
MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation
Unsat Core Prediction through Polarity-Aware Representation Learning over Clause-Literal Hypergraphs
Learning to Self-Verify Makes Language Models Better Reasoners
PADA-Coder: Improving Plan-Following Code Generation via Perturbation-Verified Attention Distillation and Dynamic Alignment
Accurate Large-scale Uncertainty Quantification using Stochastic Gradient Markov Chain Monte Carlo
Mind Your Margin and Boundary: Are Your Distilled Datasets Truly Robust?
Prioritized Model Experience Replay
3D-DLP: Self-supervised 3D Object-centric Scene Representation Learning
Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!
Discriminative Attribute Graph Clustering Through Topology-Guided Contrastive Learning
EPS3D: End-to-End Feed-Forward 3D Panoptic Segmentation
CauseCollab: Causal Unified and Modality-Agnostic Network for Heterogeneous Collaborative Perception
One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception
Rethinking 3D Shape Generation: Diffusion over Superquadrics
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark
Approximation Theory for Lipschitz Continuous Transformers
Theoretical Guarantees for One-Shot Magnitude Pruning and Compute-Adaptive Early Exit
Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding
Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning
Origo: Physically Interpretable Multi-Physics PDE Pre-training through Neural Operator Splitting
Latent Space Robust Optimization of Neural Processes with Aligned Stratified Order-Statistic Loss Reduction
MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering
Multimodal Meta-Verifier with Explicit Structured Recalibration
Temporal Preference Optimization for Unsupervised Retrieval
Evaluating and Explaining Prompt Sensitivity of LLMs Using Interactions
SP-Mind: An Autonomous Reasoning Agent for Spatial Proteomics Analysis
DualOptim+: Bridging Shared and Decoupled Optimizer States for Better Machine Unlearning in Large Language Models
Zero-Shot Off-Policy Learning
CSOR: Coreset Selection for Object Re-identification via Class Pruning
Path-dependent Discrete Amortized Inference
A Unified Approach to Interpreting Knowledge Distillation for Large Language Models via Interactions
CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM Agents in Social Dilemmas
Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory
U$^3$CF: Unbiased, Unconfounding, and Unified Causal Framework for Multi-Target Domain Adaptation
Semantic Robustness Certification for Vision-Language Models
Bayesian Rain Field Reconstruction using Commercial Microwave Links and Diffusion Model Priors
Unified Time Series Explanations via Semi-Amortized Optimization and Instance-level Multi-Expert Knowledge Distillation
CoRe: Collaborative Reasoning via Cross Teaching
Diversity Over Frequency: Rethinking Tool Use in Visual Chain-of-Thought Agents
MARS: Modular Agent with Reflective Search for Automated AI Research
d2p: Fast and Scalable Structured Attention with Differentiable Dynamic Programming
Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents
MEDA: Medical-Oriented Activation Editing for Hallucination Mitigation in Medical Large Vision-Language Model
Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization
FlashSinkhorn: IO-Aware Entropic Optimal Transport on GPU
Unified Safe In-context Image Generation in Multimodal Diffusion Transformers
Achieving Logarithmic Regret in KL-Regularized Zero-Sum Markov Games
Factored Causal Representation Learning for Robust Reward Modeling in RLHF
OptProver: Bridging Olympiad and Optimization through Continual Training in Formal Theorem Proving
Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO
PADS-TAL: Padding-Annealed Diffusion Sampling in Text-Aware Latent Space for Robust and Diverse Text-to-Music Generation
You Don't Protect if You Don't Expect: Breaking the Key Assumption behind CLIP's Test-Time Defenses
Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model
QPKO: Differentiable QP-Embedded Deep Koopman Framework for Modeling Nonlinear Systems
Group Cognition Learning: Making Everything Better Through Controlled Two-Stage Agents Collaboration
Trajectory-Level Speculative Decoding for Diffusion Language Models
Position: Stop Chasing the C-index when Evaluating Survival Analysis Models
Unified Episodic and Semantic Memory via Modulating Transformer FeedForward Layers
OrchJail: Jailbreaking Tool-Calling Text-to-Image Agents by Orchestration-Guided Fuzzing
Towards Sub-second Biological Foundation Model Infrastructure: A Quantized Consistency Diffusion Framework for Molecular Docking
Position: The Privacy-Auditability Paradox in Federated Learning: Why We Need Controllable Secure Aggregation
Realistic Adaptive Merging
Multi-Distribution Robust Conformal Prediction
A Statistical Framework for Analyzing Specification Resistance to Learnware-Inversion Risks
BizFinBench.v2: Towards Reliable LLMs in Finance via Real-User Data and Offline/Online Bilingual Evaluation
Conditional Coverage Diagnostics for Conformal Prediction
L-Drive: Beyond a Single Mapping—Latent Context Drives Time Series Forecasting
Detecting and Filtering Unsafe Training Data via Data Attribution with Denoised Representation
PESD-TSF: A Period-Aware and Explicit Structured Decomposition Framework for Long-Term Time Series Forecasting
ArcVQ-VAE: A Spherical Vector Quantization Framework with ArcCosine Additive Margin
GENEB: Why Genomic Models Are Hard to Compare
Learning from Comparison: Constrained Projection Policy Optimization for Pareto-Front Improvement
Vision-Language-Action Pretraining from Large-Scale Human Videos
Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models
Training-Free Adaptation of Diffusion Models via Doob's $h$-Transform
Fast Reconstruction of Mixtures of Bernoulli Product Distributions
A Two-Layer Framework for Joint Online Configuration Selection and Admission Control
Tail Annealing for Heavy-Tailed Flow Matching
Improving Zero-Shot Offline RL via Behavioral Task Sampling
Responsible Text-to-Image Diffusion: Interpretable and Linearly Controllable Semantics for Fair and Safe Generation
Think Less, Act Early: Reinforced Latent Reasoning with Early Exit in Vision-Language-Action Models
The Surprising Difficulty of Search in Model-Based Reinforcement Learning
DiL: Discrete-anchored Representation Alignment for Semi-Supervised Continual Learning
Editable Proof Sketch for Automated Theorem Proving
Emergent Visual Representations through Unsupervised Spiking Networks with Synaptic Pruning
ProConMV: Provenance-Enabled Conceptual Framework for Interpretable Multi-View Diabetic Retinopathy Diagnosis
Intentional Updates for Streaming Reinforcement Learning
Decoupled Training with Local Reinforcement Fine-Tuning in Federated Learning
Wait, Wait, Wait... Why Do Reasoning Models Loop?
Time-series forecasting through the lens of dynamics
Mitigating Bias in Locally Constrained Decoding via Tractable Proposals
When Iteration Helps and Hurts in Self-Training: Denoising vs. Signal Forgetting
Learning to Rank by Directly Optimizing Full-Order Probabilities
Fleet: Few-Shots Lead Effective AIGI Detection
A Unifying Relational Perspective on Expressive Lottery Tickets
Attributed Network Alignment: Statistical Limits and Efficient Algorithm
A Foundation-style Model for Zero-Shot Statistical Dependency Measurement
From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models
Tokenised Flow Matching for Hierarchical Simulation Based Inference
Verifying Meta-Awareness via Predictive Rewards in Reasoning Models
MODEL SOUPS NEED ONLY ONE INGREDIENT
Constrained Adaptive Rejection Sampling
Adversarial Flow Models
Nonparametric Distribution Regression Re-calibration
Long-term Fairness with Selective Labels
Autoregressive Direct Preference Optimization
MODEL MERGING SCALING LAWS IN LARGE LANGUAGE MODELS
TileSparse: Arithmetic-Intensity-Aware Sparse Attention for Compute-Bound LLM Decoding
Rethinking Pretraining Data Detection for LLMs: From Local to Global
Learn from Your Mistakes: Tree-like Self-Play on Vulnerability Nodes for Secure Code LLMs
An Evidential Route to Asymptotic Bayes Optimality under Sparsity
Uncovering the Latent Potential of Deep Intermediate Representations
Hidden in Plain Sight -- Class Competition Focuses Attribution Maps
Pareto-Guided Optimal Transport for Multi-Reward Alignment
GUI-Spotlight: Adaptive Iterative Focus Refinement for Enhanced GUI Visual Grounding
Transitive Representation Learning Enhances Histopathology Annotation
Rapid Poison: Practical Poisoning Attacks Against the Rapid Response Framework
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning
Unifying Adversarial Robustness and Training Across Text Scoring Models
FrameOracle: Learning What to See and How Much to See in Videos
Mitigating Visual Hallucinations via Semantic Curriculum Preference Optimization in MLLMs
Gaussian Mean Field Variational Inference can Overestimate Predictive Variance
Causal Structure Learning for Sparse Matrix Fill-in Reduction
DRIVE: Best Data Scheduling Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation
Data Provenance Auditing of Fine-Tuned Large Language Models with a Text-Preserving Technique
The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs
Zeroth-Order Forward-Only SNN Training Inspiring Neuromorphic On-Chip Learning
SuperHype: Hypergraph Generation via Graph-Superposition Decomposition
Training LLM Agents to Empower Humans
Learning Discrete Diffusion on Graphs via Free-Energy Gradient Flows
Joint-Space Empowerment as a Theory of Dexterous Motor Coordination
Maximum-Likelihood Learning of Latent Dynamics Without Reconstruction
Unbiased Principles, Robust Rewards
Local-Minima-Preserving Polynomial Relaxation of Ising Problems
NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs
A Progressive Evidence Localization Framework Based on Wasserstein Gradient Flows for Document Visual Question Answering
When Replanning Becomes the Bottleneck: Budgeted Replanning for Embodied Agents
FoundObj: Self-supervised Foundation Models as Rewards for Label-free 3D Object Segmentation
HO-SFL: Hybrid-Order Split Federated Learning with Backprop-Free Clients and Dimension-Free Aggregation
$V_0$: A Generalist Value Model for Any Policy at State Zero
Listening Through the Noise: Cauchy-Driven Diffusion Bridges for Robust Gastrointestinal Auscultation and Clinical Benchmarking
LMM4-IC4K: A Large Multimodal Model Powered Integrated Circuit Footprint Geometry Understanding
Open-Text Aerial Detection: A Unified Framework For Aerial Visual Grounding And Detection
Learning Multi-Timescale Abstractions for Hierarchical Combinatorial Planning
RAG without Forgetting: Continual Query-Infused Key Memory
Can VLMs Diagnose and Recover from VLA Manipulation Faults?
Toward More Reliable Agent Evaluation: A Component-Based Benchmark Auditing Pipeline
Generative Neural Operators through Diffusion Last Layer
LIFT: A Novel Framework for Enhancing Long-Context Understanding of LLMs via Long Input Fine-Tuning
Learning Junta Distributions, Quantum Junta States, and QAC$^0$ Circuits
Forget-It-All: Multi-Concept Machine Unlearning via Concept-Aware Neuron Masking
Rethinking Temporal Consistency in Video Object-Centric Learning: From Prediction to Correspondence
GeoLoom: High-quality Geometric Diagram Generation from Textual Input
Frictional Q-Learning
World-Shaper: A Unified Framework for 360° Panoramic Editing
Identifying and Correcting Label Noise for Robust GNNs via Influence Contradiction
Synthesizable Molecular Generation via Soft-constrained GFlowNets with Rich Chemical Priors
Neural Feature Geometry Evolves as Discrete Ricci Flow
Differential Smoothing Mitigates Sharpening and Improves LLM Reasoning
E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory
Leveraging Machine Unlearning for Cost-Efficient Preference Alignment
OLion: Approaching the Hadamard Ideal by Intersecting Spectral and L inf Implicit Biases
ACTG-ARL: Differentially Private Conditional Text Generation with RL-Boosted Control
Structurally Aligned Subtask-Level Memory for Software Engineering Agents
From Intent to Solver Code: Semantic Alignment in Optimization Modeling
From 2D Grids to 1D Tokens: Reforming Shared Representations for Multimodal Image Fusion
VIA-SD: Verification via Intra-Model Routing for Speculative Decoding
Less Is More: Elevating RAG via Performance-Driven Context Compression
Meta-iLaD: Identifiable Latent Dynamics via Meta-Learning of Dynamics Environments
LATO: 3D Mesh Flow Matching with Structured TOpology Preserving LAtents
Plan in Sandbox, Navigate in Open Worlds: Learning Physics-Grounded Abstracted Experience for Embodied Navigation
Mind the budget: Accelerating Deep Reinforcement Learning using Early Exit Neural Networks
Self-Calibrated Consistency can Fight Back for Adversarial Robustness in Vision-Language Models
Evolving Quantitative Reasoning through Self-Play in Digital Twin Markets
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators
Layer-Centric Factors of Variation Disentanglement for Task- and Model-Agnostic Generalization
Compositional Planning with Jumpy World Models
Fine-Tuning of Transformer models with Frames
DIVER: Diving Deeper into Distilled Data via Expressive Semantic Recovery
Factor-Wise Homogeneity of Slot-Attention for Continual Object-Centric Learning
Geometry-Guided Generative Representation for Functional Brain Graphs
Deep sequence models tend to memorize geometrically; it is unclear why.
Learning Realistic Depth via Physics-Grounded Noise Disentanglement with Semantic-Geometric Collaboration
CHESS: Chebyshev Spectral Synthesis for Trajectory Condensation
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning
Anti-causal domain generalization: Leveraging unlabeled data
CSD: Content-aware Speculative Decoding for Efficient Image Generation
Demystifying Entropy Control in LLM RL Training: Theoretical Analysis and Dynamic Scheduling
Preference-Enhanced Reinforcement Learning for Pluralistic Image Inpainting
Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
Log-Normal Multiplicative Dynamics for Stable Low-Precision Deep Learning
Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning
PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models
From Pixels to Tokens: A Systematic Study of Latent Action Supervision for Vision-Language-Action Models
Can Simple Denoising Improve Uniform State Diffusion Models?
Mind the Gap: Catching Hallucinations via Evidence Drop on the Reasoning Manifold
DRFusion: Drift-Resilient Temporally Consistent Infrared–Visible Video Fusion
Nonlinear Covariate Balance in Experimental Design
Singular Bayesian Neural Networks
Entangled No More: Multi-Domain Decoupling for Robust Dynamic Graph Neural Networks
A Regime-Aware Trajectory Prediction Framework for 1000+ Systems Biology Models
InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training
Ego3S: Select, Strengthen, and Synchronize for Efficient Egocentric Reasoning
FOAM: Frequency and Operator-Error Based Adaptive Damping Method for Reducing Staleness-Oriented Error for Shampoo
A Geometry-Aware Efficient Algorithm for Compositional Entropic Risk Minimization
ProjQ: Project-and-Quantize for Adapter-Aware LLM Compression
Distilling Geometry Priors for 3D-Consistent Video Generation
KernelCraft: Benchmarking for Agentic Close-to-Metal Kernel Generation on Emerging Hardware
ITSPACE: Monotone Gaussian Optimal Transport Updates
Ranking Time Series using a Time Warping Ideal Point Model
Global Geometry Is Not Enough for Vision Representations
CoPE: Continual Probe-guided Expansion for Large Vision-Language Models
MiVE: Multiscale Vision-language features for reference-guided video Editing
Different Usage of Shared Components Explains Behavioral Variance in LLMs
Orthogonal Concept Erasure for Diffusion Models
Rethinking Thinking Tokens: LLMs as Improvement Operators
How to guide your flow: Steering flow maps for rapid test-time alignment
WaterSIC: information-theoretically (near) optimal linear layer quantization
Norm$\times$Direction: Restoring the Missing Query Norm in Vision Linear Attention
A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation
Exploration-free Algorithms for Multi-group Mean Estimation
$\tau$-Knowledge: Evaluating Conversational Agents over Unstructured Knowledge
AD-MIR: Bridging the Gap from Perception to Persuasion in Advertising Video Understanding via Structured Reasoning
Population-Free Pareto Tracking for Sample-Efficient Multi-Policy MORL
Bridging Structure and Semantics: Uncertainty-Modulated Dual-Path Diffusion for Robust Text-Attributed Graph Learning
An Asymmetric Latent Factorization-of-Tensors Model for Relation Extraction
Large Language Models as Topological Thinkers: A Benchmark on Graph Persistent Homology
Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems
TextResNet: Decoupling and Routing Optimization Signals in Compound AI Systems via Deep Residual Tuning
TEAM: Temporal–Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration
MentisOculi: Revealing the Limits of Reasoning with Mental Imagery
Learning to Watermark in the Latent Space of Generative Models
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
Doubly Robust Distributionally Robust Offline Contextual Pricing
LARA: Latent Action Representation Alignment for Vision-Language-Action Models
Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping
Beyond Prediction: Tail-Aware Scheduling for LLM Inference
MaMa: A Game-Theoretic Approach for Designing Safe Agentic Systems
Distributionally Robust Reinforcement Learning with Human Feedback
RELO: Reinforcement Learning to Localize for Visual Object Tracking
Language Generation with Replay: A Learning-Theoretic View of Model Collapse
SARL: Structure-Aligned Reinforcement Learning for Bridging the Perception-Action Gap in Airspace
Budget-Efficient Attacks and Robustness Training for Cooperative MARL
Differentiable Optimization Layers for Guaranteed Fairness in Deep Learning
Manifold-Aligned Guided Integrated Gradients for Reliable Feature Attribution
Exact Unlearning in Reinforcement Learning
Self-Supervised Learning as Discrete Communication
Escaping Whack-a-Mole: Code Documentation Optimization via Dependency-Guided Bi-level Search
Size Transferability of Graph Convolutional Networks across Sparsity: A Generalized Graphon Perspective
Differentially Private Submodular Maximization with a Knapsack Constraint
Variational Inference for Uncertain Optimal Transport via Sinkhorn Parametrization
Attend to Anything: Foundation Model for Unified Human Attention Modeling
Privacy Risks of Agentic Inferential Capabilities in Data Linkage Attacks
Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization
WET: Mitigating World-Conditioned Knowledge Conflicts via World Entropy Tethering
Multi-Objective Preference Optimization: Improving Human Alignment of Generative Models
Value-as-Return: A Two-Stage Framework to Align on the Optimal Score Function
Measurement-Consistent Langevin Corrector for Stabilizing Latent Diffusion Inverse Problem Solvers
Test-Time Reinforcement Learning for Flow Matching
Zero-Shot Rankability: Revealing Latent Ordinal Structure in Multimodal Large Language Models via Language
Quantifying Biases in LLM-as-a-Judge Evaluations
Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning
VALUEFLOW: Toward Pluralistic and Steerable Value-based Alignment in Large Language Models
ReasonEdit: Editing Vision--Language Models using Human Reasoning
Building Social World Model with Large Language Models
Use What You Know: Causal Foundation Models with Partial Graphs
Modeling temporal scRNA-seq data with latent Gaussian process and optimal transport
The Deterministic Horizon: When Extended Reasoning Fails and Tool Delegation Becomes Necessary
Platonic Transformers: A Solid Choice For Equivariance
FedUSD: Unbiased Synthetic Data for Federated Learning
RSF-GLLM: Bridging the Semantic Gap in Multi-Hop Knowledge Graph QA via Recurrent Soft-Flow and Decoupled LLM Generation
Recurrent Structural Policy Gradient for Partially Observable Mean Field Games
HiPPO Zoo: Making Implicit State Space Memory Explicit
Avoid What You Know: Divergent Trajectory Balance for GFlowNets
Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning
Extending Prediction-Powered Inference through Conformal Prediction
Adapting Noise to Data: Generative Flows from Learned 1D Processes
RefineEvo: Planning-Guided Heuristic Evolution with Bidirectional Experience
Torus Graphs for Large Scale Neural Phase Analysis
Federated Sketching LoRA: A Flexible Framework for Heterogeneous Collaborative Fine-Tuning of LLMs
Multi-Scale Wavelet Transformers for Operator Learning of Dynamical Systems
DISCO: Mitigating Bias in Deep Learning with Conditional Distance Correlation
MuCO: Generative Peptide Cyclization Empowered by Multi-stage Conformation Optimization
DecoVer: A Decompose-and-Verify Neuro-Symbolic Framework for Embodied Task Planning with BC+
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention
Making Models Unmergeable via Scaling-Sensitive Loss Landscape
FedHera: Towards Drift-Resilient Federated Fine-tuning with Heterogeneous Resources
Random Process Flow Matching: Generative Implicit Representations of Multivariate Random Fields
Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences
Procedural Generation Of Algorithm Discovery Tasks in Machine Learning
Advancing SVD-based LLM Compression via Layer-Wise Error Model Search
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding
Message Tuning Outshines Graph Prompt Tuning: A Prismatic Space Perspective
Learning Interpretable Options by Identifying Reward Diffusion Bottlenecks in Reinforcement Learning
Convergence Rate Analysis of the AdamW-Style Shampoo: Unifying One-sided and Two-Sided Preconditioning
Token Sample Complexity of Attention
Learning to Memorize with Attributive and Associative Memory for Online Test-Time Adaptation of Vision-Language Models
Scalable and Stable Estimation of Amari $\alpha$-Divergence using Random Fourier Features
Diffusion-based learning framework for Constrained Nonconvex Optimization with Weighted Bootstrapped Refinement
OpenIKLR: Bridging the Reasoning Gap in Open-World Scenarios via Iterative Premise Completion
Unbiased Reward Modeling from Implicit Preference
LAPRAS : Learning-Augmented PRivate Answering for linear query Streams.
Sample-Efficient Diffusion-based Reinforcement Learning with Critic Guidance
Rethinking Personalization in Large Language Models at the Token Level
PRISM: Sequence Modeling as Parallel Residual Iteration
CollabBench: Benchmarking and Unleashing Collaborative Ability of LLMs with Diverse Players via Proactive Engagement
HInT: Hypergraph Infusion at the Structural Layers Improves Table Understanding
Hista and Numca: Estimate State Value Effectively for Large Language Model Reinforcement Learning
Robust Bayes-Assisted Conformal Prediction
LitReview Arena: Evaluating Literature Review Agents with Battle-style Peer Review Platform
How Powerful are LLMs in Generating Program Specifications?
Image Restoration via Diffusion Models with Dynamic Resolution
Lower Bounds for Frank-Wolfe on Strongly Convex Sets
Causally Evaluating the Learnability of Formal Language Tasks
One Batch Is Enough: A Unified Dataset Condensation Framework for General Time Series Analysis
Architecture Matters for Multi-Agent Security
The Latent Guardian: Defending Collaborative Perception via Feature-Level Consistency Verification
MAS-ProVe: Understanding the Process Verification of Multi-Agent Systems
HiST: A Hierarchical Sparse Transformer for Cross-Modal Spatial Transcriptomics Modeling
The Realignment Problem: When Right becomes Wrong in LLMs
Towards Spectroscopy: Susceptibility Clusters in Language Models
When Softmax Fails at the Top: Extreme‑Value Corrections for InfoNCE
A Narrowing Geometry in Contaminated Reasoning
CAOS: Conformal Aggregation of One-Shot Predictors
Towards Atoms of Large Language Models
Implicit Intelligence - Evaluating Agents on What Users Don’t Say
Escaping the Subspace Trap: The Role of Optimizer Geometry in Model Width Expansion
A KL-regularization framework for learning to plan with adaptive priors
COFT: Counterfactual–Conformal Decoding for Fair Chain‑of‑Thought Reasoning in Large Language Models
Unveiling the Structure of Do-Calculus Reasoning via Derivation Graphs
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
SEMA: a Scalable and Efficient Mamba like Attention via Token Localization and Averaging
DECOR: Learning to Decompose and Collaborate in Deep Search via Multi-Agent Reinforcement Learning
Do Language Models Track Entities Across State Changes?
SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs
CooT: Learning to Coordinate In-Context with Coordination Transformers
ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution
Trajectory-Aware Certified Decentralized Unlearning via SGD Stability
Deep Pre-Alignment for VLMs
MIMOMamba: From Scalar Duality to Matrix-Valued Attention
Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation
Position: There are futures that benchmark-driven AI cannot see
Learning Dynamics of Zeroth-Order Optimization: A Kernel Perspective
Understanding and Mitigating Token-Pruning-Induced Vulnerabilities in VLMs
Exploring Motif-based Heterogeneous Graph Learning for ReDoS Detection
Efficient Code Analysis via Graph-Guided Large Language Models
From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs
DADP: Domain Adaptive Diffusion Policy
The Flexibility Trap: Rethinking the Value of Arbitrary Order in Diffusion Language Models
StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars
Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models
Uncovering Grounding IDs: How External Cues Shape Multi-Modal Binding
Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding
WorldCache: Accelerating World Models for Free via Heterogeneous Token Caching
Fast-SAM3D: 3Dfy Anything in Images but Faster
Scaling Unsupervised Multi-Source Federated Domain Adaptation through Group-Wise Discrepancy Minimization
ECA: Efficient Continual Alignment for Open-Ended Image-to-Text Generation.
Conflict-Aware Adaptive Alignment for LLM Hallucination Mitigation
Watch Your Step: Information Injection in Diffusion Models via Shadow Timestep Embedding
Reliability-Aware LLM Alignment from Inconsistent Human Feedback
Smoothness Errors in Dynamics Models and How to Avoid Them
Finite-Width Neural Tangent Kernels from Feynman Diagrams
FLIP2: Expanding Protein Fitness Landscape Benchmarks for Real-World Machine Learning Applications
Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
Rubric Curriculum RL: Exploiting the Generation-Verification Gap in Creative Writing
LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning
Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions
Towards Generative Graph Matching for Graph Edit Distance Computation
Topology-Aware Contrastive Learning: Regulating Representation Connectivity via Persistent Homology
Low Kruskal-Rank Adaptation
EMBGUARD: Constructing Hazard-Aware Guardrails for Safe Planning in Embodied Agents
On Training Large Language Models for Long-Horizon Tasks: An Empirical Study of Horizon Length
h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning
Symmetries in language statistics shape the geometry of model representations
A Graph Foundation Model with Cross-Modal Alignment and Modality-Aware Expert Fusion for Multi-Modal Graphs
GP2F: Cross-Domain Graph Prompting with Adaptive Fusion of Pre-trained Graph Neural Networks
Large Vision–Language Models Get Lost in Attention
Compass-RoPE: Isotropic Rotary Position Embeddings for Vision Transformers
Revisiting Positive Samples in Graph Contrastive Learning: From the Perspective of Message Passing
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Coverage Improvement and Fast Convergence of On-policy Preference Learning
Domain-Shift-Aware Conformal Prediction for Large Language Models
Two-Stage Unit Tying for Simplifying Differentiable Logic Gate Networks
DF-ExpEnse: Diffusion Filtered Exploration for Sample Efficient Finetuning
GSRQ: Gain-Shape Residual Quantization for Sub-1-bit KV Cache
Think Twice Before You Act: Enhancing Agent Behavioral Safety with Thought Correction
Primal-Spectral Generative Modeling: Fast Analytical Generation via Pseudoinverse Lévy Inversion
Efficient Neural Controlled Differential Equations via Attentive Kernel Smoothing
VectorWorld: Efficient Streaming World Model via Diffusion Flow on Vector Graphs
Refined Analysis of Entropy-Regularized Actor-Critic
Learning Attribute–Affordance Hierarchies in Hyperbolic Space for Open-Vocabulary 3D Object Affordance Grounding
DLM-Scope: Mechanistic Interpretability of Diffusion Language Models via Sparse Autoencoders
Unbiased and Second-Order-Free Training for High-Dimensional PDEs
Lookahead Unmasking Elicits Reliable Decoding in Diffusion Language Models
Approximate Proportionality in Online Fair Division
RACER: Risk-Aware Calibrated Efficient Routing for Large Language Models
HSGG: Training-Free Hierarchical Scene Graph Generation with Geometry-Guided Relation Reasoning
FlowState: Sampling-Rate‑Equivariant Time‑Series Forecasting
VeRO: An Evaluation Harness for Agents to Optimize Agents
Cure-SFT: Diagnostic-Guided Data Curation for Instruction Tuning
Motion Attribution for Video Generation
Anytime Safe PAC Efficient Reasoning
Position: Interpretability Can Be Actionable
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math
Bullet Trains: Parallelizing Training of Temporally Precise Spiking Neural Networks
Memory as Dynamics: Learning Reliability-Guided Predictive Models for Online Video Perception
Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers
Coverage, Not Averages: Semantic Stratification for Trustworthy Retrieval Evaluation
Causal Matrix Completion under Multiple Treatments via Mixed Synthetic Nearest Neighbors
Turning Bias into Bugs: Bandit-Guided Style Manipulation Attacks on LLM Judges
Position: AI Governance Needs ISO-like Interoperability Protocols, Not Just Laws
DIVA: Harnessing the Representation Divergence in Unified Multimodal Models for Mutual Reinforcement
Learning to Route Languages for Multilingual Preference Optimization
On the Theoretical Limitations of Embedding-based Link Prediction
Provable Accuracy Collapse of Embedding-Based Representations under Dimensionality Mismatch
A Game-Theoretic Framework for Measuring and Explaining Metric Compatibility in Fair Machine Learning
Large-capacity and Receiver Authenticable Generative Image Steganography
Truthfulness Does Not Scale Like Reasoning: Why Polling Fails as a Proxy Verifier
From Moments to Models: Graphon-Mixture Learning for Mixup and Contrastive Learning
LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents
On Local Policies for Graph-Structured Markov Decision Processes
Olaf-World: Orienting Latent Actions for Video World Modeling
Improved Analysis of the Accelerated Noisy Power Method with Applications to Decentralized PCA
Evaluating bivariate causal statements based on mutual compatibility
Toward Understanding Adversarial Distillation: Why Robust Teachers Fail
Scientific logicality enriched methodology for LLM reasoning: A practice in physics
Speculative Safety Honeypot: Toward Proactive Defense Against Multi-turn Agent Attacks
SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-Body Manipulation
From Drift to Coherence: Stabilizing Beliefs in LLMs
Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models
Training-Free Guided Diffusion for Planning: A Unified Framework via Doob’s h-Transform with Safety Guarantees
Learning to Search and Searching to Learn for Generalization in Planning
Discretized Density-Guided Source-Free Adaptation for Continuous Targets
Exploring More to Solve More: Boosting Diversity in Text Diffusion Models via Entropy-Based Guidance
Manifold-Aware Perturbations for Constrained Generative Modeling
OpenDeception: Learning Deception and Trust in Human–AI Interaction via Multi-Agent Simulation
Reverse-Engineering Model Editing on Language Models
Pruning at Initialisation through the lens of Graphon Limit: Convergence, Expressivity, and Generalisation
MemoryBench: A Benchmark for Memory and Continual Learning in LLM Systems
Stable Spectral Copula Alignment for Robust Multimodal Learning
The Implicit Bias of Steepest Descent with Mini-batch Stochastic Gradient
ViTok-v2: Scaling Native-Resolution Autoencoders to 5B
MAST: Motif-Augmented Diffusion with Search Tree for Spectroscopic Molecular Structure Elucidation
MindFlow: Mind Supernet Powered Thinking Flows for Research Idea Innovation
Position: Zeroth-Order Optimization in Deep Learning Is Underexplored, Not Underpowered
Granularity-Aware Adaptive Classifier Expansion via Zero-Shot Learning
A Robust Optimization Guided Pruning Framework for Vision and Large Language Models
High-accuracy sampling for diffusion models and log-concave distributions
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models
MN-Diff: Diffusion Parameterized MoE-NCDE for Continuous Time Series Generation with Irregular Observations
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling
LIVE: Long-horizon Interactive Video World Modeling
Experience-Evolving Multi-Turn Tool-Use Agent with Hybrid Episodic–Procedural Memory
Channel Adapter for Time Series Foundation Models in Zero-Shot Multivariate Forecasting
Real-Time and Lightweight Diffusion Image Compression
TabPack: Efficient Hyperparameter Ensembles for Tabular Deep Learning
Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling
SemanticNVS: Improving Semantic Scene Understanding in Generative Novel View Synthesis
DGG-HMR: Multi-Person Human Mesh Recovery with Depth-Guided Geometric Anchoring
A model of errors in transformers
Position: It is Time to Virtualize Foundation Models with a Self-evolving Operating System Layer
Generalized Linear Bandits with Memory
3DMedAgent: Unified Perception-to-Understanding for 3D Medical Analysis
Coupled Training with Privileged Features and Unlabeled Data
Parametric Prior Mapping Framework for Non-stationary Probabilistic Time Series Forecasting
iWorld-Bench: A Benchmark for Interactive World Models with a Unified Action Generation Framework
A Framework for Understanding Learnability in Transformers
Deterministic Inference across Tensor Parallel Sizes That Eliminates Training-Inference Mismatch
Task-Driven Subspace Decomposition for Knowledge Sharing and Isolation in LoRA-based Continual Learning
Hierarchical Procedural Meta-Reasoning for Generalizable Multimodal Agents
Signature-Informed Transformer for Asset Allocation
Unifying Value Alignment and Assignment in Cross-Domain Offline Reinforcement Learning with Heterogeneous Datasets
Server-Proximal Aggregation for Federated Domain-Incremental Learning under Partial Participation: Task-Uniform Convergence and Backward Transfer
Gradient-Based Causal Tree Ensembles: A Backbone Architecture for Heterogeneous Treatment Effects
Focusing: View-Consistent Sparse Voxels for Efficient 3D VAE
Off-Policy Evaluation with Strategic Agents via Local Disclosure
SynGR: Unleashing the Potential of Cross-Modal Synergy for Generative Recommendation
Milestone-Guided Policy Learning for Long-Horizon Language Agents
Let the Prototype Guide You: Robust Aggregation of Sparse Multi-Class Annotations via Annotator Prototype Learning
Power-Calibrated LLM Watermarking: A Statistical Framework
Focus and Dilution: The Multi-stage Learning Process of Attention
Dyn-VPP: Video Prediction Policy Optimization for Improved Visual Dynamics
Just Noticeable Difference Modeling for Deep Visual Features
Powerful and Theoretically Guaranteed Independence Testing on Heterogeneous Federated Clients
Consistency Deep Equilibrium Models
Streaming Covariate Balancing via Discrepancy-Based Feature Coresets
Reasoning Is Not Free: Robust Adaptive Cost-Efficient Router for LLM-as-a-Judge
Concept Heterogeneity-aware Representation Steering
Reference-Free Meta-Learning for Generalized Implicit Neural Representation in Efficient MRI Reconstruction
Op-CAD: Benchmarking and Investigating Operation-oriented CAD Generation
Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Model
CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training
EvoC2F: Compiling Tool Orchestration for Efficient and Evolvable LLM Agents
JADE: Expert-Grounded Dynamic Evaluation for Open-Ended Professional Tasks
Adaptive Protein Tokenization
The Geometric Reasoner: Manifold-Informed Latent Foresight Search for Long-Context Reasoning
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
Mitigating Label Shift in Tabular In-Context Learning via Test-Time Posterior Adjustment
V1: Unifying Generation and Self-Verification for Parallel Reasoners
Reverse Flow Matching: A Unified Framework for Online Reinforcement Learning with Diffusion and Flow Policies
Interpretable Embeddings with Sparse Autoencoders: A Data Analysis Toolkit
Lookahead Path Likelihood Optimization for Diffusion LLMs
Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents
DSGCR: Decomposed Spectral Geometry-Aware Cross-Modal Semantic Representation for 3D Visual Grounding
Which Algorithms Can Graph Neural Networks Learn?
DyLLM: Efficient Diffusion LLM inference via saliency-based token selection and partial attention
$\mu$pscaling small models: Principled warm starts and hyperparameter transfer
Hierarchical Anchor Graph Learning for Multi-View Clustering
FedPissa: Towards Federated Personalized Adaptation of Foundation Models via LoRA Subspace Mapping
Revisiting ML Training under Fully Homomorphic Encryption: Convergence Guarantees, Differential Privacy, and Efficient Algorithms
Physically-Guided Data-Space Rectified Flow for Precipitation Nowcasting
Emergence of Hierarchical Emotion Organization in Large Language Models
Preconditioned DeltaNet: Curvature-aware Sequence Modeling for Linear Recurrences
Emergence of Biased Consensus in Multi-Agent LLM Debates
Position: AI Welfare Is Bullshit
Adaptive DNA Sequence Modeling via Synergistic Plasticity Units
Learning Task-Sufficient World Models by Synergizing Agentic Exploration and Structured Modeling
Excited Pfaffians: Generalized Neural Wave Functions Across Structure and State
Eliminating Solution Bias in Differentially Private Optimization
Position: We Need Large Language Models Optimized For Our Well-Being
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
From Generalist to Specialist Representation
Constrained Multi-Objective Reinforcement Learning with Max-Min Criterion
Aitchison Embeddings for Learning Compositional Graph Representations
De-Linearizing Agent Traces: Bayesian Inference of Latent Partial Orders for Efficient Execution
OXE-AugE: A Large-Scale Robot Augmentation of OXE for Scaling Cross-Embodiment Policy Learning
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents
Learning to Reason for Factuality
CoRe: Context-Robust Remasking for Diffusion Language Models
CiteGuard: Conformal False-Discovery Control for Faithful Retrieval-Augmented Generation
SafeSearch: Automated Red-Teaming of LLM-Based Search Agents
SOPE: Situation-Aware and Statistically Indistinguishable Privacy Exfiltration for MCP-enabled Agents
TPV: Parameter Perturbations Through the Lens of Test Prediction Variance
Transport and Merge: Cross-Architecture Merging for Large Language Models
Triadic Dynamics Aware Diffusion Posterior Sampling for Inverse Problems: Optimizing Guidance and Stochasticity Schedules
Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning
Semantic-level Backdoor Attack against Text-to-Image Diffusion Models
CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models
WildActor: Unconstrained Identity-Preserving Video Generation
Benchmarking Dense and Indiscernible Object Counting with Blueberries
WatchLog: Efficient and Interpretable Event Reasoning for Endpoint Detection and Response Logs with Multimodal LLMs
On Information Self-Locking in Reinforcement Learning for Active Reasoning
TraceRouter: Robust Safety for Large Foundation Models via Path-Level Intervention
TWLA: Breaking the Barrier to W1.58A4 Post-Training Quantization for LLMs
Explainable Forensics of Manipulated Segments in Untrimmed Long Videos
Logit-Attention Divergence: Mitigating Position Bias in Multi-Image Retrieval via Attention-Guided Calibration
Threshold-Guided Optimization for Visual Generative Models
DNA: Uncovering Universal Latent Forgery Knowledge
Finding Stationary Points by Comparisons
Who Transfers Safety? Identifying and Targeting Cross-Lingual Shared Safety Neurons
PACE: Proactive Agent-Level Admission Control for Efficient Agentic Batch Inference
Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook
SPUR: Scale-Partitioned Uncertainty Rectification for Robust UAV-on-UAV Interception
PICACO: Pluralistic In-Context Value Alignment via Total Correlation Optimization
Efficient Mismatch-Tolerant Coding for Model-Driven Compression
Disentangling Consensus and Value-Specific Representations for Controllable Pluralistic Value Alignment of LLMs
Sparse ActionGen: Accelerating Diffusion Policy with Real-time Pruning
GRASP: Awakening Latent Spatial Reasoning in LVLMs via Training-free Geometric Rectification
Time-Series Decomposition as a standalone Task: A Mechanism-Driven Diagnostic Benchmark
Step-Size Stability in Stochastic Optimization: A Theoretical Perspective
One Intervention per Component is Enough: Towards Identifiability in Linear Stochastic Dynamics from Steady State
SPR-RAFT: Parameter-Efficient Regression-Aware Fine-Tuning for Biomedical LLM Regression
AIR-VLA: Vision-Language-Action Systems for Aerial Manipulation
T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
Dual Quaternion SE(3) Synchronization with Recovery Guarantees
Distributionally Robust Set Representation Learning Under Inference-Time Element Corruption
Safety Game: Inference-Time Alignment of Black-Box LLMs via Constrained Optimization
Training Diffusion Language Models for Black-Box Optimization
AdaSCALE: Adaptive Scaling for OOD Detection
Is Spurious Correlation Removal Always Learnable?
Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration
Bayesian Gated Non-Negative Contrastive Learning
Sparse Relaxed-Lasso Steering: Automatic Sparse-Autoencoder Feature Selection for Precise Image Editing
Test-Time Anchoring for Discrete Diffusion Posterior Sampling
Turning Adaptation into Assets: Cross-Domain Bridging for Online Vision-Language Navigation
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
Efficient Preference Poisoning Attack on Offline RLHF
Fedfit: Federated dynamic pruning via Fisher Information scoring
Diffract: Spectral View of LLM Domain Adaptation
How Far Can LLM Agents Reason with Tables? Benchmarking Multi-Turn Agentic Table Question Answering in the Wild
Data Augmentation of Contrastive Learning is Estimating Positive-incentive Noise
Position: Unlabeled ≠ No Human Supervision in Visual Learning
Support-Proximity Augmented Diffusion Estimation for Offline Black-Box Optimization
PointDiT: Pixel-Space Diffusion for Monocular Geometry Estimation
Stable Velocity: A Variance Perspective on Flow Matching
ARC-Decode: Accelerated Decoding with Risk-Bounded Acceptance
Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
Skill Neologisms: Towards Skill-based Continual Learning
BOOSTAPR: Boosting Automated Program Repair via Execution-Grounded Reinforcement Learning with Dual Reward Models
A Solver-Free Training Method for Predict-then-Optimize
Decentralized Bandits without Global Clock for Dynamic Matching Market
Test-time Generalization for Physics through Neural Operator Splitting
Intervene When It Doubts: Conjunction-Guided Interactive Reasoning
Dissecting the Safety Circuit: Neuronal Intervention for Transferable Adversarial Attacks on VLMs
LoKiFormer: Locality-aware Attention with Decoupled Knowledge Memory for Efficient Large Language Model Pretraining
From Zero to Hero: Advancing Zero-Shot Foundation Models for Tabular Outlier Detection
Walrus: A Cross-domain Foundation Model for Continuum Dynamics
Geometry-based Schrödinger Bridges for Trustworthy Multimodal Fusion
ModernVBERT: Towards Smaller Visual Document Retrievers
Multimodal Fusion via Self-Consistent Task-Gradient Fields
PathwayLLM: Explainable Clinical Trajectory Modeling with Structured Pathways for Sepsis Prediction
Revisiting Neural Processes via Fourier Transform and Volterra Series
TCAP: Tri-Component Attention Profiling for Unsupervised Backdoor Detection in MLLM Fine-Tuning
JanusPipe: Efficient Pipeline Parallel Training for Machine Learning Interatomic Potentials
Efficient Generative Modeling beyond Memoryless Diffusion via Adjoint Schrödinger Bridge Matching
Unlearning’s Blind Spots: Over‑Unlearning and Prototypical Relearning Attack
Being More Lightweight and Practical: Mini-sized Contrastive Learning Pre-trained Models for Fine-grained Traffic Task
Equivariant Latent Alignment via Flow Matching under Group Symmetries
Beyond Distribution Estimation: Simplex Anchored Structural Inference Towards Universal Semi-supervised Learning
HEXST: Hexagonal Shifted-Window Transformer for Spatial Transcriptomics Gene Expression Prediction
Balancing Understanding and Generation in Discrete Diffusion Models
What Makes Synthetic Data Effective in Image Segmentation
Bridging On-Device and Cloud LLMs for Collaborative Reasoning: A Unified Methodology for Local Routing and Post-Training
PDAgent: An LLM-Driven Autonomous Agent Framework Towards *In Silico* Protein Design via Directed Mutation
GR-LoRA: Gradient-Recycling Low-Rank Adaptation for Class-Incremental Learning
LoCoT2V-Bench: Benchmarking Long-Form and Complex Text-to-Video Generation
Approximating f -Divergences with Rank Statistics
Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO
Easier to Judge than to Find: Predicting In-Context Learning Success for Demonstration Selection
Progressive Cramming: Reliable Token Compression and What It Reveals
CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability
Scaling Vision Transformers for Functional MRI with Flat Maps
AlienLM: Alienization of Language for API-Boundary Privacy in Black-Box LLMs
Optimal Domain-Aware Privacy Mechanisms for Synthetic Data Generation
Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models
How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data
Training-Free Adversarial Robustness in Deep Learning MRI Reconstruction
Scaling Continual Learning with Bi-Level Routing Mixture-of-Experts
Item Response Scaling Laws: A Measurement Theory Approach for Efficient and Generalizable Neural Scaling Estimation
Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation
Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity
Functional building blocks of neural networks: from network motifs to collective dynamics
Noise Tectonics: Measuring the Stability of AI Benchmark Ecosystems
On Multi-Step Theorem Prediction via Non-Parametric Structural Priors
Beyond Temperature: Hyperfitting as a Late-Stage Geometric Expansion
Structure Enables Effective Self-Localization of Errors in LLMs
Categorical Flow Maps
Monotonic Variational Gaussian Process for Efficient Data Collection
Reasoning-preserved Efficient Distillation of Large Language Models via Activation-aware Initialization
Depth over Fidelity in Fixed-Budget Noisy Evolution Strategies
ProMiSE: Protein Multi-state Structure Evaluation Benchmark in Biological Contexts
Target-Aware Bandit Allocation for Scalable Surrogate Optimization in Chemical Space
What If We Allocate Test-Time Compute Adaptively?
Improving ML attacks on LWE with data repetition and stepwise regression
Monitoring LLM-based Multi-Agent Systems Against Corruptions via Node Evaluation
Stage-wise Distortion–Perception Traversal in Zero-shot Inverse Problems with Diffusion Models
Towards Unified Multimodal Pretraining
A Theoretical Framework for Modular Learning of Robust Generative Models
Inconsistency-Aware Minimization: Improving Generalization with Unlabeled Data
Slash the Sink: Sharpening Structural Attention Inside LLMs
MIRA: A Score for Conditional Distribution Accuracy and Model Comparison
TextME: Bridging Unseen Modalities Through Text Descriptions
Bimodal masked language modeling for bulk RNA-seq and DNA methylation representation learning
RADIO1D: Elastic Representations for Condensed Vision Modeling
Instance-Specific Approximation Ratios for Correlation Clustering and Max-Cut
Self-Supervised Dynamical System Representations for Physiological Time-Series
Stabilizing the Q-Gradient Field for Policy Smoothness in Actor-Critic Methods
Addressing Instrument-Outcome Confounding in Mendelian Randomization through Representation Learning
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA
Schur-A*: Layer-wise Optimal Expert Pruning for Sparse MoEs via Schur-Complement Guided A* Search
Towards Rule-Based Knowledge Sharing in Federated Learning
Swift-SVD: Theoretical Optimality Meets Practical Efficiency in Low-Rank LLM Compression
Learning Disentangled Multi-Agent World Model for Decentralized Control
OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data
OptMaster: A DAG-Based Framework for Formulation and Heuristic Discovery in Optimization
From Extrinsic to Intrinsic: Geodesic-Guided Representation Learning for 3D Geometric Data
Position: Graph Condensation Needs a Reset—Move Beyond Full-dataset Training and Model-Dependence
RC-FCL: Combating Asynchronous Concept Drift in Federated Continual Learning via Retrospective Calibration
Bio-Inspired Self-Supervised Learning for Wrist-worn IMU Signals
Reparameterization Proximal Policy Optimization
Scalable Sampling via Generalized Fixed-Point Diffusion Matching
Beyond Single Embedding: Modeling User Preferences as Distribution in Federated Recommendation
Neural Concept Verifier: Scaling Prover-Verifier Games via Concept Encodings
Leveraging Low-Rank Structures for High-Dimensional Score-Based Sampling
Sampling from Your Language Model One Byte at a Time
Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs
Proximal Decoding: Provably Reducing Copyright Risk for Any Language Model
ScoreMix: Synthetic Data Generation by Score Composition in Diffusion Models Improves Face Recognition
Benchmarking LLM-Assisted Blue Teaming via Standardized Threat Hunting
Why Specialist Models Still Matter: A Heterogeneous Multi-Agent Paradigm for Medical Artificial Intelligence
AnyEdit++: Adaptive Long-Form Knowledge Editing via Bayesian Surprise
DeCoDe: Decoupling Binding Position and Molecular Conformation in 3D Ligand Diffusion for Structure-Based Drug Design
RedDebate: Safer Responses Through Multi-Agent Red Teaming Debates
Learning Gaussian Graphical Models from a Glauber Trajectory Without Mixing
Transformers Can Learn Posterior Predictive Distributions In-Context
Towards Resource-Efficient LLMs: End-to-End Energy Accounting of Distillation Pipelines
Dual-View Predictive Diffusion: Lightweight Speech Enhancement via Spectrogram-Image Synergy
Asymptotically Fast Clebsch-Gordan Tensor Products with Vector Spherical Harmonics
How can embedding models bind concepts?
Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity
VideoSEAL: Separating Planning from Answer Authority for Agentic Long Video Understanding
MoVie: Multimodal Video Compression with Text Guidance
VBA: Vector Bundle Attention for Intrinsically Geometric Representation Learning
PASO: Step Parallel Stochastic Optimization
APEX: Approximate-but-exhaustive search for ultra-large combinatorial synthesis libraries
Geometry-Aware Image Flow Matching
AGZO: Activation-Guided Zeroth-Order Optimization for LLM Fine-Tuning
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
Allocating Variance to Maximize Expectation
Letting Trajectories Spread: Quality-Preserving Control for Diverse Flow Matching
Uni-DocRobust: Universal Plug-and-Play Robustness Enhancement for Multi-modal LLMs via Feature Restoration
Depth-Progressive Monotonic Learning without Backpropagation
SpeedCP: Fast Kernel-based Conditional Conformal Prediction
Sparse Regression with $\ell_0$ Constraints for $\alpha$-Mixing Time Series: Algorithms and Guarantees
Periodic Bayesian Flow Networks with Additive Accuracy
TodoEvolve: Learning to Architect Agent Planning Systems
On Group Relative Policy Optimization Collapse in Agent Search: The Lazy Likelihood-Displacement
A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents
On the Convergence Rate of LoRA Gradient Descent
When RAG Hurts: Diagnosing and Mitigating Attention Distraction in Retrieval-Augmented LVLMs
TapSampling: Inference-Time Sampling with a Task-Progress-Understanding Verifier for Robotic Manipulation
Reconstruction Outcomes Look Similar but Processes Differ: Improving Context Consistency and Coverage in Graph Masked Auto-Encoder
Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks
Token-Level LLM Collaboration via FusionRoute
Position: LLMs Should Incorporate Explicit Mechanisms for Human Empathy
Geometry-Aware Neural Optimizer for Shape Optimization and Inversion
Introspection Adapters: Training LLMs to Report Their Learned Behaviors
Second-Order Bilevel Optimization with Accelerated Convergence Rates
Target-Agnostic Calibration under Distribution Shift with Frequency-Aware Gradient Rectification
Initialization is Half the Battle: Generating Diverse Images from a Guidance Potential Posterior
Evaluating Object-Centric Models beyond Object Discovery
Deep Multi-view Graph Clustering via Attribute-aware Bidirectional Structural Refinement and Pseudo-label Guided Multi-level Fusion
Dual-channel Dynamic Graph Neural Networks with Adaptive Adjacency Learning and Multi-scale Representation Fusion
SAGE: A Dataflow-Native Framework for Modular, Controllable, and Transparent LLM-Augmented Reasoning
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
PromptDyG: Test-Time Prompt Adaptation on Dynamic Graphs
Normality Calibration in Semi-supervised Graph Anomaly Detection
LILO: Bayesian Optimization with Natural Language Feedback
Uncovering Bias Mechanisms in Observational Studies
Test-time Offline Reinforcement Learning on Goal-related Experience
Policy Search via Bayesian Optimization with Temporal Difference Gaussian Processes
Reinforcement Learning via Self-Distillation
Selective Coupling of Decoupled Informative Regions: Masked Attention Alignment for Data-Free Quantization of Vision Transformers
From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping
CoFrGeNet: Continued Fraction Architectures for Language Generation
Achieving Structurally Robust Gromov Wasserstein Distance via Adaptive Dual-Mask
Controlled SDEs for Long-Horizon Motion Generation under Latent Decision Uncertainty
A Direct Approach for Handling Contextual Bandits with Latent State Dynamics
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
LoBCD-GW: A Fast and Data-Dependent Algorithm for Computing Gromov-Wasserstein Distance via Localized Block Coordinate Descent
PRISM: Training-Free Video Anomaly Detection via Intrinsic Statistical Modeling
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
GIFT: Bootstrapping Image-to-CAD Program Synthesis via Geometric Feedback
Complexity bounds for Dirichlet process slice samplers
Bridging Local–Global Dissonance: Learning from Compressive Measurements for Hyperspectral Reconstruction
Parameter Manifold Purification
When to Think, When to Speak: Learning Disclosure Policies for Large Language Model Reasoning
AgentWebBench: Benchmarking Multi-Agent Coordination in Agentic Web
Relational In-Context Learning via Synthetic Pre-training with Structural Prior
Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner
Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization
Mechanisms of Introspective Awareness
Stop When Further Reasoning Won’t Help: Attention-State Adaptive Generation in Reasoning Models
Don't Reinvent the Wheel, Just Realign the Spokes: Resource-Efficient Federated Fine-Tuning via Rank-Wise Expert Assembly
Divergence Decoding: Targeted Unlearning via Auxiliary Models
Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces
Generative Online Reinforcement Learning
Flow Inverse Reinforcement Learning
Generalized Discrete Diffusion with Self-Correction
Let EEG Models Learn EEG
ViEEG: Hierarchical Visual Neural Representation for EEG Brain Decoding
Autoregression with Self-Token Prediction
Rays as Pixels: Learning A Joint Distribution of Video and Camera Trajectories
Operationalizing the Superficial Alignment Hypothesis via Task Complexity
Scaling Transformers for End-to-End Discrete Audio Tokenization
HiMe: Hierarchical Embodied Memory for Long-Horizon Vision-Language-Action Control
LangPrecip: Language-Aware Multimodal Precipitation Nowcasting
Machine Learning Hamiltonians are Accurate Energy-Force Predictors
Global Credit Assignment via Dynamical Criticality
Beyond Problem Solving: UOJ-Bench for Evaluating Code Generation, Hacking, and Repair in Competitive Programming
Weight Decay Improves Language Model Plasticity
Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training
Building Better Deception Probes Using Targeted Instruction Pairs
IRIS: Implicit Reward-Guided Internal Sifting for Mitigating Multimodal Hallucination
TransLight: Image-Guided Customized Lighting Control with Generative Decoupling
Local Minima in Quadratic-Penalty Relaxations of Binary Linear Programs
From Seeing to Thinking: Decoupling Perception and Reasoning Improves Post-Training of Vision-Language Models
PhotoAgent: Exploratory Visual Aesthetic Planning with Large Vision Models
The Double Dilemma in Multi-Task Radiology Report Generation: A Gradient Dynamics Analysis and Solution
FairRARI: A Plug and Play Framework for Fairness-Aware PageRank
Turning Back Without Forgetting: Selective Backward Refinement for Parameter-Efficient Continual Learning
Precision-Induced Miscalibration: Understanding and Correcting Confidence Distortion in Quantized Neural Networks
A Diffusive Classification Loss for Learning Energy-based Generative Models
On Densest $k$-Subgraph Mining and Diagonal Loading: Optimization Landscape and Finite-Step Exact Convergence Analysis
Discriminative Visual Process Rewards for Scaling Thinking at Test-Time with Images
Randomized Feasibility Methods for Constrained Optimization with Adaptive Step Sizes
MUSE: Resolving Manifold Misalignment in Visual Tokenization via Topological Orthogonality
On the Learning Dynamics of RLVR at the Edge of Competence
Divide and Learn: Multi-Objective Combinatorial Optimization at Scale
Accuracy-First Rényi Differential Privacy and Post-Processing Immunity
Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment
Video-Based Optimal Transport for Feedback-Efficient Offline Preference-Based Reinforcement Learning
ConvexBench: Can LLMs Recognize Convex Functions?
ReJump: A Tree-Jump Representation for Analyzing and Improving LLM Reasoning
LoSA: Locality Aware Sparse Attention in Diffusion Language Models
Attacks on Machine-Text Detectors Retain Stylistic Fingerprints
Reinforcement Learning with Verifiable Rewards: GRPO's Loss, Dynamics, and Success Amplification
Expert-level Leaf Cell Layout Generation via Preference-Optimized LLM
Position: LLM Benchmark Datasets should be Contamination-Resistant
StructMamPose: From Sequential Perception to Structural Reasoning for 3D Human Pose Estimation
The ACE Protocol: Operationalizing Language Model Activations for Better Calibration and Utility
With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots
TFRBench: A Reasoning Benchmark for Evaluating Forecasting Systems
Position: From Crowdsourcing to Crowd-LLM-Sourcing and LLM-Sourcing
Dream-MPC: Gradient-Based Model Predictive Control with Latent Imagination
KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices
Conformal Policy Control
Influence-Guided Symbolic Regression: Scientific Discovery via LLM-Driven Equation Search with Granular Feedback
Frequency-Aware Perceptual Optimization for Low-Complexity Implicit Image Compression
Chain-of-Goals Hierarchical Policy for Long-Horizon Offline Goal-Conditioned RL
IO-Adam: Rethinking Memory-Efficient Adaptive Optimizers from Gradient Computation
Position: The AI Imperative: Scaling High-Quality Peer Review in Machine Learning
Failure is Feedback: History-Aware Backtracking for Agentic Traversal in Multimodal Graphs
DecoderTCR: Compositional Pretraining and Entropy-Guided Decoding for TCR-pMHC Interactions
FedQueue: Queue-Aware Federated Learning for Cross-Facility HPC Training
SDiD:Shared diffusion prior for efficient distributed stereo image compression
DomED: Redesigning Ensemble Distillation for Domain Generalization
General Synthetic-Powered Inference
Scam2Prompt: A Scalable Framework for Auditing Malicious Scam Endpoints in Production LLMs
Sketch-Based Low-Rank Model Merging with Shared Circulant Transforms
FasterVAR: Plug-and-Play Acceleration for Visual Autoregressive Models
FUSE: Ensembling Verifiers with Zero Labeled Data
Seeing Realism from Simulation: Efficient Video Transfer for Vision-Language-Action Data Augmentation
Contextual Slate GLM Bandits with Limited Adaptivity
Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model
CrossQ: Task-Aligned Cross-Token Conditional Quantization for Late Interaction Retrieval
Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
Revisiting Efficiency–Accuracy Scaling in Mixture-of-Experts Architectures
Vegas: Self-Speculative Decoding with Verification-Guided Sparse Attention
LAMP: Data-Efficient Linear Affine Weight-Space Models for Parameter-Controlled 3D Shape Generation and Extrapolation
VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning
MFCL Audio: An Audio Function Calling Evaluation for Large Language Models
AGoQ: Activation and Gradient Quantization for Memory-Efficient Distributed Training of LLMs
FrontierCS: Evolving Challenges for Evolving Intelligence
Can Large Language Models Generalize Procedures Across Representations?
Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics
Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs
LLM-Guided Loop Bound Generation for Program Termination Verification
Selling Data as a Digital Good with Scaling Valuations
Characterizing Agents in Production
FakeWorld 1.0: An Omni modal Benchmark for Fake Media and Content
A Bayesian Approach to Quantify the Uncertainty of Human Ratings in a Single-Instance Multimodal Framework
Gauge-Equivariant Graph Networks via Self-Interference Cancellation
Correcting Split Selection in Online Decision Trees via Anytime-Valid Inference
From Human Labels to Literature: Semi-Supervised Learning of NMR Chemical Shifts at Scale
CAMP: Coherent Alignment of Multimodal Prototypes for Explainable Complementary Learning
Mitigating Conversational Inertia in Multi-Turn Agents
MapDream: Task-Driven Map Learning for Vision-Language Navigation
The Trojan Knowledge: Bypassing Commercial LLM Guardrails via Harmless Prompt Weaving and Adaptive Tree Search
Equilibrium Pricing in Oligopolistic Data Markets
CG-MLLM: Captioning and Generating 3D content via Multi-modal Large Language Models
Enhancing Affine Maximizer Auctions with Correlation-Aware Payment
HilbertA: Hilbert-Curve–Aligned Sparse Attention for 2D Structured Data
Interpretable Neural ODEs for Gene Regulatory Network Discovery under Perturbations
Formalizing and Falsifying Causal Pathways of Rare Events
IQA-Spider: Unifying Reasoning, Grounding, and Referring for Multi-Granularity Image Quality Assessment
ReGen: Hierarchical Multi-Prompt Representation Generation for Efficient Waveform Diffusion Models
Training-Free Sparse Attention for Fast Video Generation via Offline Layer-Wise Sparsity Profiling and Online Bidirectional Co-Clustering
Pair2Scene: Learning Local Object Relations for Procedural Scene Generation
Gaming Consensus: Coordinated Manipulation in Crowdsourced Fact-Checking
VisualScore: Learning Holistic Visual Quality Scores via Multi-Task Reasoning
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation
Gradient Regularization Prevents Reward Hacking in Reinforcement Learning from Human Feedback and Verifiable Rewards
The impact of LoRA on Oversmoothing $\colon$ Understanding Catastrophic Forgetting in Mean-Field Attention Dynamics
It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents
AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation
AdaEraser: Training-Free Object Removal via Adaptive Attention Suppression
Principle-Evolvable Scientific Discovery via Uncertainty Minimization
EARL: Towards a Unified Analysis-Guided Reinforcement Learning Framework for Egocentric Interaction Reasoning and Pixel Grounding
Riemannian stochastic optimization for sufficient dimension reduction
FLAG: Foundation model representation with Latent diffusion Alignment via Graph for spatial gene expression prediction
A3: an Analytical Low-Rank Approximation Framework for Attention
Revenue Efficiency of Correlated Equilibria in First Price Auctions
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations
ST-TGExplainer: Disentangling Stability and Transition Patterns for Temporal GNN Interpretability
SyMerge: From Non-Interference to Synergistic Merging via Single-Layer Adaptation
Prism: Spectral-Aware Block-Sparse Attention
Implicit Preference Alignment for Human Image Animation
Noise-Guided Transport: Imitation Learning from Random Priors
ScDiVa: Masked Discrete Diffusion for Joint Modeling of Single-Cell Identity and Expression
ScaLoRA: Optimally Scaled Low-Rank Adaptation for Efficient High-Rank Fine-Tuning
Breaking the Block: Preserving Data Continuity to Train Superior SAEs for Instruct Models
CE$^4$L: Continual Ego, Exo, and Ego-Exo Learning
ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards
Mitigating Error Accumulation in Continuous Navigation via Memory-Augmented Kalman Filtering
How Transformers Represent Hierarchies: A Local-to-Global Mechanism
Abductive Reasoning with Probabilistic Commonsense
Training with Honeypots: Reshaping How LLMs Fail
Recontextualization Mitigates Specification Gaming Without Modifying the Specification
The Cost of Information: Phase Transitions in Contextual Bandits with Paid Observations
Model Fusion via Retrofitting
CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning
SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning
Reasoning Structure of Large Language Models
LARFT: Closing the Cognition-Action Gap for Length Instruction Following in Large Language Models
Adaptive Quasimetric Mapping : Principled Topological Abstraction for Robust Offline Goal-Conditioned Navigation
From Token to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction
GHOST: Unmasking Phantom States in Mamba2 via Grouped Hidden-state Output-aware Selection & Truncation
Three Years of r/ChatGPT: Societal Impact Evaluations from Social Media Data
MoLF: Mixture-of-Latent-Flow for Pan-Cancer Spatial Gene Expression Prediction from Histology
Proteus: Lookup-Free Trellis-Coded Quantization by Lattice-Breaking Compute Codes for 2-Bit LLMs
The Cylindrical Representation Hypothesis for Language Model Steering
NaviCache: Test-Time Self-Calibration Caching for Video Generation
Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens
Private Learning with Public Feature Conditioning
Your Latent Reasoning is Secretly Policy Improvement Operator
$\texttt{PRISM}$:A 3D Probabilistic Neural Representation for Interpretable Shape Modeling
BEDTime: A Unified Benchmark for Automatically Describing Time Series
Rotation-Invariant Spherical Watermarking via Third-Order SO(3) Representation Coupling
Beyond Hamming: Query-Aware Decoding of Binary Cosine Sketches
TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models
Adaptively Grouped Contextual Bandits for Heterogeneous Human-AI Decision Making with Conformal Prediction Sets
Noise-Robust Density Estimation for Tabular Data Anomaly Detection
R2-Router: A New Paradigm for LLM Routing with Reasoning
Cross-Embodiment Robot Foundation World Models with Latent Actions
How Reasoning Evolves from Post-Training Data: An Empirical Study Using Chess
IAPO: Information-Aware Policy Optimization for Token-Efficient Reasoning
Spherical SO(3) Equivariant Local Attention
Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured Prediction
AdaMEM: Test-Time Adaptive Memory for Language Agents
On the Difficulty of Learning a Meta-network for Training Data Selection
Credit Assignment via Neural Manifold Noise Correlation
Unifying Low Dimensional Spectra in Deep Learning
Structure-Preserving Learning Improves Geometry Generalization in Neural PDEs
Probabilistic Modeling of Latent Agentic Substructures in Deep Neural Networks
FourTune: Towards Fully 4-Bit Efficient Post-Training for Diffusion Models
Rationality Measurement and Theory for Reinforcement Learning Agents
Minibatch selection for Language Models via Partition Matroid Constrained Gradient Matching
SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding
TileQ: Efficient Low-Rank Quantization of Mixture-of-Experts with 2D Tiling
Stochastic Minimum-Cost Reach-Avoid Reinforcement Learning
Direct Flow Q-Learning
SmartThinker: Progressive Chain-of-Thought Length Calibration for Efficient Large Language Model Reasoning
AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions
A Decision-Theoretic View of Test-Time Training: When, How Far, and Which Directions to Adapt
ContrastiveCFG: Guiding Diffusion Sampling by Contrasting Positive and Negative Concepts
Uncovering the Gradient Geometry of Long CoT: A Spectral-guided Approach to Reasoning Distillation
Structure-Aware Consistency Priors for Shape from Polarization in Complex Media
Transfer Learning in Nonparametric Regression with Deep ReLU Networks
A Fourier perspective on the learning dynamics of neural networks: from sample complexities to mechanistic insights
DB-KSVD: Scalable Alternating Optimization for Disentangling High-Dimensional Embedding Spaces
DPsurv: Dual-Prototype Evidential Fusion for Uncertainty-Aware and Interpretable Whole Slide Image Survival Prediction
Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces
HiPER: Hierarchical Plan–Execute RL for Multi-Turn LLM Agents
Accelerated Multiple Wasserstein Gradient Flows for Multi-objective Distributional Optimization
SMILE: Extended Deep Submodular Function-Based Instruction and In-context Learning Demonstration Selection
Incorporating Importance Weighting in Optimal Transport Based Domain Alignment
Dismantling the Illusion of Vision-Language-Action Models Competence via Explicit Distributional Shifts
Knowing When to Quit: A Principled Framework for Dynamic Abstention in LLM Reasoning
A General Framework for Fair and Robust Regression
HOBIT: Hardness Optimized Batch Sampling for InfoNCE Training
EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization
Towards Realistic Lifelong Re-identification: Identity Recurrence with Changing Clothes
WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference
VAnim: Rendering-Aware Sparse State Modeling for Structure-Preserving Vector Animation
BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback
Multi-Adapter Representation Interventions via Energy Calibration
Hierarchical Goal Abstractions via Learned Subset Relations
BubbleSpec: Turning Long-Tail Bubbles into Speculative Rollout Drafts for Synchronous Reinforcement Learning
Differentially Private Cross-Silo Recommendation from Implicit Feedback
Discovering Interpretable Algorithms by Decompiling Transformers to RASP
Chiral Symmetry Breaking in Transformers: A Group-Equivariant Framework for Solving the Reversal Curse via Adjoint Manifold Mappings
SafeSpec: Fast and Safe LLM via Dynamic Reflective Sampling
FIPN: Forward Self-Organizing Interpretable Polynomial Networks for Time Series Forecasting
S3Audio: Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
See More, Forecast Better and Faster: Enhancing Time Series Foundation Models via Inference-Time Plug-and-Play Downsampling
Fast kernel methods: Sobolev, physics-informed, and additive models
On the Separability of Information in Diffusion Models
Enhancing Membership Inference Attacks on Diffusion Models from a Frequency-Domain Perspective
Differentially Private Range Subgraph Counting
Geometric Decoupling: Diagnosing the Structural Instability of Latent
Enhancing Numerical Prediction in LLMs via Smooth MMD Alignment
ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling
WhisperSplat: Lossless Steganography in 3D Gaussian Splatting
Memory Savings at What Cost? A Study of Alternatives to Backpropagation
Gradient Flow Sampler-based Distributionally Robust Optimization
User-Aware Active Knowledge Acquisition for Emotional Support Dialogue
Bulk-Calibrated Credal Ambiguity Sets: Fast, Tractable Decision Making under Out-of-Sample Contamination
DecFus: Decentralized Layer-wise Fusion with Dynamic Exploration and Exploitation
Phase-Type Variational Autoencoders for Heavy-Tailed Data
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings
Subspace-Aware Feature Reshaping for Open-Set Graph Class-Incremental Learning
ConFu: Contemplate the Future for Better Speculative Sampling
Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction
Learning Generalized Label Distributions
LS$^{2}$MC-GDA: A Smoothed Algorithm for Federated Stochastic Compositional Minimax Optimization
ASAP: Exploiting the Satisficing Generalization Edge in Neural Combinatorial Optimization
A Minimal Agent for Automated Theorem Proving
A Geometric Lens on Physics-Aligned Data Compression
A unified theory of feature learning in RNNs and DNNs
Intrinsic Task Symmetry Drives Generalization in Algorithmic Tasks
GIPO: Gaussian Importance Sampling Policy Optimization
GameVerse: Can Vision-Language Models Learn from Video-based Reflection?
Distilling Neuro-Symbolic Programs into 3D Multi-modal LLMs
Hyperparameter Transfer Laws for Non-Recurrent Multi-Path Neural Networks
Continuous Viewpoint Adaptation for Single View 3D Object Reconstruction
MaMi-HOI: Harmonizing Global Kinematics and Local Geometry for Human-Object Interaction Generation
Logarithmic Switching Regret for Online Convex Optimization
Evaluating Contextual Illegality: AI Compliance in Corporate Law Scenarios
High-Dimensional Sensitivity Analysis for Genomic Studies: An Adversarial Framework for Learning Worst-Case Latent Confounders
Tri-Scale Neural ODEs for Continuous Multi-Omics Disease Modeling
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control
Path-Coupled Bellman Flows for Distributional Reinforcement Learning
Symmetries in PAC-Bayesian Learning
Training–Inference Consistent Segmented Execution for Long-Context LLMs
Multimodal Latent Language Modeling with Next-Token Diffusion
On the Interaction of Batch Noise, Adaptivity, and Compression, under $(L_0,L_1)$-Smoothness: An SDE Approach
SLAT: Segment-Level Adaptive Trimming for Efficient CoT Reasoning
Benchmarking the Scientific Mind: Toward Evaluation of Complex-Reasoning Biomedical VQA
Efficient Online Variational Estimation via Monte Carlo Sampling
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
On the existence of consistent adversarial attacks in high-dimensional linear classification
Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens
SC$^{2}$-WM: A Self-Correcting World Model with Closed-Loop Feedback for Vision-and-Language Navigation in Continuous Environments
Multilingual Unlearning in LLMs: Transfer, Dynamics, and Reversibility
Balancing plasticity and stability with Fast and Slow Successor Features
Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic
SIGMA-PPG: Statistical-prior Informed Generative Masking Architecture for PPG Foundation Model
Model-Preserving Adaptive Rounding
BYORn: Bootstrap Your Own Responses to Defend Large Vision-Language Models Against Backdoor Attacks
ICR-RL: Deep Reinforcement Learning via In-Context-Regression
Parsimonious Learning-Augmented Online Metric Matching
Efficient Diffusion Models via Time Step Optimization with Consistent Training and Inference Constraints
Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation
Taking the GP Out of the Loop
Optimal Pricing for Data-Augmented AutoML Marketplaces
Understanding Data Temporality Impact on Large Language Models Pre-training
Geometry-Aware Dataset Condensation for Diffusion Model Training
Learning in the Fisher Subspace: A Guided Initialization for LoRA Fine-Tuning
Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models
Prompt Reinjection: Alleviating Prompt Forgetting in Multimodal Diffusion Transformers
Meta Context Engineering via Agentic Skill Evolution
TFTF: Training-Free Targeted Flow for Conditional Sampling
DFlash: Block Diffusion for Flash Speculative Decoding
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
Don't Forget Why You Started: Tackling Dual Forgetting in Vision-Language Continual Learning
Language Model Circuits Are Sparse in the Neuron Basis
UniDrag: Unified Multi-Field Prediction and Robust Shape Optimization for Vehicle Aerodynamics
Erased but Not Forgotten: How Backdoors Compromise Concept Erasure
Understanding the Gaps in Satisficing Bandits
Do Neural Operators Forget Geometry? The Forgetting Hypothesis in Deep Operator Learning
How can we assess human-agent interactions? Case studies in software agent design
Non-Uniform Noise-to-Signal Ratio in the REINFORCE Policy-Gradient Estimator
PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards
APE-Bench: Evaluating Automated Proof Engineering for Formal Math Libraries
Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning
Selective Disclosure Watermarking for Large Language Models
Discontinuous Galerkin Neural Operator for Pathology Defocus Deblurring
Problem Distributions as Tasks: Repurposing Meta Learning for Generative Combinatorial Optimization towards Multi-task Pretrain and Adaptation
The Expressive Power of Low Precision Softmax Transformers with (Summarized) Chain-of-Thought
Persuasive Privacy
ClimateAR: Multi-Scale Autoregressive Generative Modeling for Seasonal-to-Interannual Climate Forecasting
Coordinated Disentanglement with Iterative Mode Discovery Under Hidden Correlations
More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing
Recursive Binding on a Budget: Subspace Carving in Order-$p$ Tensor Memories
DAISI: Data Assimilation with Inverse Sampling using Stochastic Interpolants
Denoising without Diffusion: Fixed-Noise Denoiser Anomaly Detection in Tabular Data
Mitigating the Contractivity Trap in Diffusion ODEs via Stein Stabilization
Multimodal Scaling Laws for Task & Data-Optimized Models of Visual Cortex
MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling
Toward Effective Multimodal Graph Foundation Model: A Divide-and-Conquer Based Approach
MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio
Learning Protein Structure-Function Relationships through Knowledge-guided Representation Decomposition
Seeing Symbols, Missing Structure: A Real-World Handwritten Mathematical Expression Recognition Benchmark for Large Models
Context Distillation Retains Post-Training Capabilities in Continually Trained LMs
Moment Matching Q-Learning
The Fisher Dimension: Instance-Dependent Complexity for Causal Discovery
Identifying dependent components from multi-domain linear mixtures
Beyond Rewards in RL for Cyber Defence
Robust Learning via Nested Distributionally Robust Optimization
$\tau^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment
MuLoCo: Muon is a Practical Inner Optimizer for DiLoCo
NAACA: Training-Free NeuroAuditory Attentive Cognitive Architecture with Oscillatory Working Memory for Salience-Driven Attention Gating
Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains
Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners
Spectral Collapse Drives Loss of Plasticity in Deep Continual Learning
Prediction-Powered Risk Monitoring of Deployed Models for Detecting Harmful Distribution Shifts
TVI-CoT: Text-Visual Interleaved Chain-of-Thought Reasoning for Multimodal Understanding
Co-Generative De Novo Functional Protein Design
NNiT: Width-Agnostic Neural Network Generation with Structurally Aligned Weight Spaces
Goal-Oriented Lower-Tail Calibration of Gaussian Processes for Bayesian Optimization
Jailbreak to Protect: Buffering Harmful Fine-Tuning via Temporary Jailbreaking LoRA in Large Language Models
Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation
Principled Zero-shot Ranking Agents with Tournament Graphs
Benchmarking World-Model Learning with Environment-Level Queries
Physiology-Aware Masked Cross-Modal Reconstruction for Biosignal Representation Learning
Cycle-of-Science: Reliable Reasoning through Counterfactual Verification for Agent Decision Making
A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets
Computationally-efficient Graph Modeling with Refined Graph Random Features
BEST: Benchmarking Efficiency in Space and Time for LLM-Generated Code
LLawCo: Learning Laws of Cooperation for Modeling Embodied Multi-Agent Behavior
Provably Convergent Actor-Critic in Risk-averse MARL
Flatland: The Adventures of Gradient Descent with Large Step Sizes
Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models
CamGeo: Sparse Camera-Conditioned Image-to-Video Generation with 3D Geometry Priors
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention
Identifiable Nonlinear Differentiable Causal Discovery via Independence and Adaptive Group Sparsity
Efficient Bayesian Inference from Noisy Pairwise Comparisons
Words Towards Explainability: Caption Label-Free Learning via Dual Loop Agentic Time Series Captioning
PATCHCODE: Discrete Latent Predictive Learning for EEG Foundation Model
API: Adaptive Prototype Imputation for Incomplete Multimodal Sentiment Analysis
From Denoising to De-Channeling: Integrating Physical Channel Priors into Diffusion Models for Radio Signal Understanding
INFER: Learning Implicit Neural Frequency Response Fields for Confined Acoustic Environments
Learning High-Dimensional Parity Functions with Product Networks using Gradient Descent
Overcoming the Modality Gap in Context-Aided Forecasting
Variational Bayesian Flow Network for Graph Generation
GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models
Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
STAND: Self-Aware Precondition Induction for Interactive Task Learning
Quantifying the noise sensitivity of the Wasserstein metric for images
Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism
Upper-Linearizability of Online Non-Monotone DR-Submodular Maximization over Down-Closed Convex Sets
GEM: Geometric Erasure by Contrastive Velocity Matching in Rectified Flows
SURGE: Surrogate Gradient Adaptation in Binary Neural Networks
TransNormal: Dense Visual Semantics for Diffusion-based Transparent Object Normal Estimation
Modeling Hierarchical Thinking in Large Reasoning Models
Agent Learning via Early Experience
Censoring with Plausible Deniability: Asymmetric Local Privacy for Multi-Category CDF Estimation
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier
SABER: Continual Learning with Representation Conflict Management
Syntax vs. Semantics: How Transformers Learn Deep Dependencies
Attention Implements the Fisher Geometry of Exponential Families
Last-iterate Convergence of ADMM on Multi-affine Quadratic Equality Constrained Problem
Softmax as Linear Attention in the Large-Prompt Regime: a Measure-based Perspective
MobileFusion: Mobile-Friendly Infrared and Visible Image Fusion via Structural Re-parameterization
SCoA: Revisiting Domain Generalized Object Detection with Style-Conditioned Adaptation
The Value Function Semi-Algebraic Set in Partially Observable Markov Decision Processes
Threat2Traffic: Multi-Agent Environment Synthesis for Malware Traffic Generation from Threat Intelligence
RED-HDP-HMM: Observation-Dependent Durations for Bayesian Nonparametric Sequential Models
Better, Faster: Harnessing Self-Improvement in Large Reasoning Models
SCOUT: Active Information Foraging for Long-Text Understanding with Decoupled Epistemic States
Test-Time Training Is Secretly Linear Attention
PRM-PBE: Process Reward Model for Reinforcement Learning in Programming-by-Example
From Patches to Plans: Reasoning Distillation for Repository-Level Program Repair
Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling
Hair-Trigger Alignment: Black-Box Evaluation Cannot Guarantee Post-Update Alignment
Envy-Free Allocation of Indivisible Goods via Noisy Queries
Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models
CGRiC: Compositional Risk Certification for Structured LLM Outputs
KANFIS: A Neuro-Symbolic Framework for Interpretable and Uncertainty-Aware Learning
Unlocking Zero-Shot Geospatial Reasoning via Indirect Rewards
The Secret Engine Behind RLHF: It's Contarstive Learning All Along
Guaranteed Optimal Compositional Explanations for Neurons
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
QHyer: Q-conditioned Hybrid Attention-mamba Transformer for Offline Goal-conditioned RL
Adaptive Residual-Update Steering for Low-Overhead Hallucination Mitigation in Large Vision-Language Models
SafeHarbor: Defining Precise Decision Boundaries via Hierarchical Memory-Augmented Guardrail for LLM Agent Safety
Linguistic Nepotism: Trading-off Quality for Language Preference in Multilingual RAG
When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models
LazyAttention: Efficient Retrieval-Augmented Generation with Deferred Positional Encoding
Online Fair Division with Additional Information
Semantic Tube Prediction: Beating LLM Data Efficiency with JEPA
Protein Design with Agent Rosetta: A Case Study for Specialized Scientific Agents
A Stronger Benchmark for Online Bilateral Trade: From Fixed Prices to Distributions
TN-SHAP-G: Graph-Structured Tensor Network Surrogates for Shapley Values and Interactions
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning
All ERMs Can Fail in Stochastic Convex Optimization Lower Bounds in Linear Dimension
HEDP: A Hybrid Energy-Distance Prompt-based Framework for Domain Incremental Learning
Scaling Laws for Precision in High-Dimensional Linear Regression
R$^3$L: Reasoning 3D Layouts from Relative Spatial Relations
PINE: Pruning Boosted Tree Ensembles with Conformal In-Distribution Prediction Equivalence
Stein Diffusion Guidance: Training-Free Posterior Correction for Sampling Beyond High-Density Regions
Rate or Fate? RLV$^{\varepsilon}$R: Reinforcement Learning with Verifiable Noisy Rewards
LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis
Dynamic Optimizations of LLM Ensembles with Two-Stage Reinforcement Learning Agents
Second-Order Smooth Planning with Optimal-Transport Bellman Smoothing
On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
Offline Reinforcement Learning with Generative Trajectory Policies
Estimating Correlation Clustering Cost in Node-Arrival Stream
FlowPET: Physics-Informed Symplectic Flow Matching for Low-Count PET Reconstruction
Multi-marginal temporal Schrödinger Bridge Matching from unpaired data
Prompt Injection as Role Confusion
One Coin Has Two Sides: Single Poistive Multi Label Learning from Salient Annotations
Accurate Evaluation of Quickest Changepoint Detectors via Non-parametric Survival Analysis
STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack
Disease-Centric Vision-Language Pretraining with Hybrid Visual Encoding for 3D Computed Tomography
On Learnability and Disambiguation of Multiclass Partial Concept Classes
Posterior Mismatch Matters: Adversarial Training for Long-Tailed Robustness
FastSESR: Fast Scene-level Explicit Surface Reconstruction
MePo: Meta Post-Refinement for Rehearsal-Free General Continual Learning
When Is Rank-1 Enough? Geometry-Guided Initialization for Parameter-Efficient Fine-Tuning
Deep Scientific Reasoning under Physical Constraints: Structure-Aware Spectrum Prediction for Electronic Density of States
A Hypertoroidal Covering for Perfect Color Equivariance
InfoGlobe: Local-and-Global Information-Preserving Statistical Manifold Learning for Single-Cell Transcriptomics
Normalizing Diffusion Kernels with Optimal Transport
Mitigating Hallucinations in Large Vision-Language Models via Causal Route Gating
Parameter Decorrelation via Transition-Variance Alignment for Multivariate Time-series Forecasting
Design Linear Constrained Neural Layers with Implicit Convex Optimization
OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph
Function-Valued Causal Influence in Nonlinear Time Series
Dynamic Stratified Contrastive Learning with Upstream Augmentation for MILP Branching
Data Difficulty and the Generalization–Extrapolation Tradeoff in LLM Fine-Tuning
NavOL: Navigation Policy with Online Imitation Learning
Dynamic Fractal Mamba: A Neural Renormalization Group Flow for Scale-Invariant Sequence Modeling
Polyphonia: Training-Free Context-Aware Music Editing with Acoustic-Informed Attention Calibration
CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning
ECHO: Elastic Speculative Decoding with Sparse Gating for High-Concurrency Scenarios
Less Diverse, Less Safe: The Indirect But Pervasive Risk of Test-Time Scaling in Large Language Models
VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization
Why Tree-Style Branching Matters for Thought Advantage Estimation in GRPO
Towards Generalizable EEG-to-fMRI Synthesis via a Unified, Context-Aware Prompting Framework
WS-GRPO: Weakly-Supervised Group-Relative Policy Optimization for Rollout-Efficient Reasoning
Test-Time Learning of Causal Structure from Interventional Data
BroRL: Scaling Reinforcement Learning via Broadened Exploration
Probing RLVR Training Instability through the Lens of Objective-Level Hacking
Combinatorial Sparse PCA Beyond the Spiked Identity Model
Fully Zero-Shot Image Dehazing
FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
Exploring Accurate and Transparent Domain Adaptation in Predictive Healthcare via Concept-Grounded Orthogonal Inference
Recovering Hidden Reward in Diffusion-Based Policies
FHAIM: Fully Homomorphic AIM for Private Synthetic Data Generation
Flow Matching Calibration for Simulation-Based Inference under Model Misspecification
CARD: Coarse-to-fine Autoregressive Modeling with Radix-based Decomposition for Transferable Free Energy Estimation
Causal Detection of Multi-Step LLM Agent Attacks
SFedPO: Streaming Federated Learning with a Prediction Oracle under Temporal Shifts
Student-Centered Distillation Narrows the Agentic Gap Between Small and Large LLMs
Riemannian Metric Matching for Scalable Geometric Modeling of Distributions
Decision-focused Sparse Tangent Portfolio Optimization
Action Manifold Smoothing: A Lipschitz Pathway Perspective on High-Dimensional Reinforcement Learning
Edge-colored Clustering in Hypergraphs: A MaxECC Approximation
LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning
Amortized Variational Inference for Partial-Label Learning: A Probabilistic Approach to Label Disambiguation
TadABench-1M: A Large-Scale Wet-Lab Protein Benchmark For Rigorous OOD Evaluation
The Geometry of Sequential Learning: Lie-Bracket Prediction of Transfer Order
How2Everything: Mining the Web for How-to Procedures to Evaluate and Improve LLMs
Compositional Generative Modeling from Decentralized Data
When More Data Doesn't Help: Limits of Adaptation in Multitask Learning
CausalXRL: Explainable Reinforcement Learning through Causal Graph Reasoning
TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution
Is Graph Mixup Beneficial? Investigating Interpolation And Empirical Performance of Graph Mixup Methods
Neural-HSS: Hierarchical Semi-Separable Neural PDE Solver
Dimensional Collapse in Transformer Attention Outputs: A Challenge for Sparse Dictionary Learning
Scalable Training of 3D Gaussian Splatting via Out-of-Core Optimization
One-Step Graph-Structured Neural Flows for Irregular Multivariate Time Series Classification
GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry
HARD-KV: Head-Adaptive Regularization for Decoding-time KV Compression
FIRE: Multi-fidelity Regression with Distribution-conditioned In-context Learning using Tabular Foundation Models
Unsupervised Process-Aware Coreset Selection for In-Context Learning
Envisioning Beyond the Few: Disentangled Semantics and Primitives for Few-Shot Atypical Layout-to-Image Generation
Instruction Decomposition and Action Alignment for Vision-Language Navigation
SONAR: Spectral‑Contrastive Audio Residuals for Generalizable Deepfake Detection
Can LLM Agents Stick to the Script? Modeling Commitment in Interactive Narratives
IPMark: A Sentence-Level Watermark for LLMs with Hierarchical Personalization and Efficient Detection
Mixture of Concept Bottleneck Experts
Neuro-Symbolic AI for Analytical Solutions of Differential Equations
DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion
Dynamic TMoE: A Drift-Aware Dynamic Mixture of Experts Framework for Non-Stationary Time Series Forecasting
Less Is More: Fast and Accurate Reasoning with Cross-Head Unified Sparse Attention
Scaling Laws and Architectural Frontiers in Metagenomic Foundation Models
Adaptive Momentum and Nonlinear Damping for Neural Network Training
How RL Unlocks the Aha Moment in Geometric Interleaved Reasoning
PPI Candidate Ranking: Large-Scale Evaluation of a Domain Knowledge–Guided Pipeline
SURGE:Unbiased Data Assimilation for Diffusion Model via Particle Filtering
Principled Synthetic Data Enables the First Scaling Laws for LLMs in Recommendation
A Provable Expressiveness Hierarchy in Hybrid Linear-Full Attention
Parameter-Masked Decoupled Optimization for Cross-Domain Class-Incremental Learning
PULSE: Generative Phase Evolution for Non-Stationary Time Series Forecasting
Teaching Molecular Dynamics to a Non-Autoregressive Ionic Transport Predictor
Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning
Online Change Point Detection for Multivariate Inhomogeneous Poisson Processes Time Series
Out-of-Distribution Evaluation of Rule-Based and Strategic Reasoning in Chess Transformers
Parallel Stochastic Gradient-Based Planning for World Models
FoeGlass: When Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors
Mind the Gap: Structure-Aware Consistency in Preference Learning
Uncertainty-Guided Exploration and Stable Planning for Sparse-Reward Manipulation from Limited Demonstrations
Beyond Heuristics: Learnable Density Control for 3D Gaussian Splatting
Theoretical Investigation on Inductive Bias of Isolation Forest
XTransfer: Modality-Agnostic Few-Shot Model Transfer for Human Sensing at the Edge
Is One Layer Enough? Understanding Inference Dynamics in Tabular Foundation Models
Blending Neural Control Density Functions for Stabilization and Safety
Infinite-dimensional generative diffusions via Doob's h-transform
Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks
SCOPE and SCION: Benchmark and Method for Ontology Induction and Fusion from Text
DMCO: Budget-Aware Co-Optimization of Data Cleaning and AutoML
Biased Generalization in Diffusion Models
TextMesh4D: Zero-shot Text-to-4D Mesh Generation
Revealing Differences in Multi-Modal Embeddings via Constrained Kernel Analysis
Optimal Transport for Reward Modeling from Noisy Feedback
Knowing the Unknown: Interpretable Open-World Object Detection via Concept Decomposition Model
ALAS: Additive Learnable Alpha-Stable Kernels for Flexible Bayesian Optimization
Prescriptive Scaling Reveals the Evolution of Language Model Capabilities
Sample Margin-Aware Recalibration of Temperature Scaling
Identifiable Markov Switching Models with Instantaneous Effects and Exponential Families
E-VAds: An E-commerce Short Videos Understanding Benchmark for MLLMs
GameDevBench: Evaluating Agentic Capabilities Through Game Development
From Muon to Gluon: Bridging Theory and Practice of LMO-based Optimizers for LLMs
Efficient LLM Moderation with Multi-Layer Latent Prototypes
StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning
Learning 3D-Gaussian Simulators from RGB Videos
ADHD Disease Detection Based on Short- and Long-Term Brain Function Encoding and Memory Graph Network
Agora: Toward Autonomous Bug Detection in Production-Level Consensus Protocols with LLM Agents
Certified Circuits: Stability Guarantees for Mechanistic Circuits
Continual Model Routing in Evolving Model Hubs
Watermarking Graph Neural Networks via Explanations for Ownership Protection
Batched Contextual Reinforcement
Scalable Single-Cell Gene Expression Generation with Latent Diffusion Models
Variational inference via Gaussian interacting particles in the Bures-Wasserstein geometry
CURE: Consistency-under-Unified Semantic Regularization for Generalized Category Discovery
Solver-in-the-Loop: MDP-Based Benchmarks for Self-Correction and Behavioral Rationality in Operations Research
Improving Classifier-Free Guidance of Flow Matching via Manifold Projection
Understanding the Ability of LLMs to Handle Character-Level Perturbation
TopoDistill: Distilling Global System Topology for Causal Discovery in Multivariate Time Series
Explicit representation of germline and non-germline residues improves antibody language modeling
Saliency-Aware Model Merging
DITING: A Weak Degradation Listener for Battery Lifetime Early Prediction
QuITE: Query-based Irregular Time-series Embedding
A Refined Generalization Analysis for Extreme Multi-class Supervised Contrastive Representation Learning
Absorbing Quantization Error by Deformable Noise Scheduler for Diffusion Models
AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection
Do Natural Language Interpretability Methods Convey Privileged Information?
Training-Free Coverless Multi-Image Steganography with Access Control
Verifiable Multimodal Reasoning: Fact-level Attribution with Multimodal Sources
Bridging the Grounding Gap in VideoQA via Typed Memory for Language-based Belief-State Reasoning
GradPower: Powering Gradients for Faster Language Model Pre-Training
Conditional Quantile Adjusted Conformal Prediction for Time Series
SSDCN: Spatial-Spectral Dual-Clustering-based Network for Hyperspectral Image Super-resolution
Domain Transfer Becomes Identifiable via a Single Alignment
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?
Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement
Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
Towards Theoretical Understanding of Transformer Test-Time Computing: Investigation on In-Context Linear Regression
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning
Critique-Guided Distillation for Robust Reasoning via Refinement
PVDepth: Panoramic Video Depth Estimation via Geometry-Aware Spatiotemporal Adaptation
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
Neuro-evolutionary Continual Reinforcement Learning
Bilevel Optimization over Saddle Points of Zero-Sum Markov Games
Lagrangian Perturbation Diffusion Steering: Latent Reinforcement Learning for Generative Policies
The Hippocampal Place Field Gradient: A Bio-inspired Framework Building Multiscale Representation for Better Sample Efficiency
Counterfactual Occlusion-Aware Learning via Visibility Intervention for LiDAR Anomaly Detection
Adaptive Multi-Round Allocation with Stochastic Arrivals
How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning
TOM-SWE: User Mental Modeling For Software Engineering Agents
Sparse Autoencoders for Interpretable Emotion Control in Text-to-Speech
RECOVER:Reliable Detection of Unauthorized Data Usage in Text-to-Image Diffusion Models via Inversion Robustness
A General Framework for Dynamic Consistent Submodular Maximization
Geometry-Aware Decoding with Wasserstein-Regularized Truncation and Mass Penalties for Large Language Models
Near-Optimal Dynamic Matching via Coarsening with Application to Heart Transplantation
SPARe: Stacked Parallelism with Adaptive Reordering for Fault-Tolerant LLM Pretraining Systems with 100k+ GPUs
Beyond First-order Asymptotics in Sequential Mean Testing
Dimension-free convergence of diffusion models for approximate Gaussian mixtures
On Computation and Reinforcement Learning
Adaptive Token Refinement in Long-Tailed Large Vision-Language Models Fine-Tuning
WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
MARS-SQL: A Multi-Agent Reinforcement Learning Framework For Text-To-SQL
PepCompass: Navigating Peptide Embedding Spaces Using Riemannian Geometry
Efficient privacy loss accounting for subsampling and random allocation
rePIRL: Learn PRM with Inverse RL for LLM Reasoning
AES: Curing Optimizer Blindness in Long-Tailed Recognition via State-Aware Correction
Perceptual Flow Network for Visually Grounded Reasoning
Real-World Unsupervised Models Generalize to Predict Brain Responses to Out-of-Distribution Stimuli
Seg-ReSearch: Segmentation with Interleaved Reasoning and External Search
Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search
Local MAP Sampling for Diffusion Models
Gecko: A Simulation Environment with Stateful Feedback for Refining Agent Tool Calls
Task-Aware Preference Calibration for Direct Preference Optimization
Emergent Analogical Reasoning in Transformers
Entropic Mirror Monte Carlo
When Attributes Disagree: Gradient Conflict in Image Aesthetic Assessment
The First Drop of Ink: Nonlinear Impact of Misleading Information in Long-Context Reasoning
Scalable Medical Multimodal Fusion via Symmetric Consistency Modeling
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
MuonSSM: Orthogonalizing State Space Models for Sequence Modeling
ProMeCD: Unifying Long-Tailed and Noisy Label Learning via White-Box Control
Constitutional Black-Box Monitoring for Scheming in LLM Agents
MusicDET: Zero-Shot AI-Generated Music Detection
Learning to Bet for Horizon-Aware Anytime-Valid Testing
DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
Interactive Segmentation with Elaborate Focus Prior
Euler–Poincaré Neural Dynamics: A Geometric-Mechanics Framework for Scientific Simulation
Desirable Effort Fairness and Optimality Trade-offs in Strategic Learning
An analytic theory of convolutional neural network inverse problems solvers
Learning Transferable Interaction Primitives from Game Videos for Humanoids
GRAPE: Let GRPO Supervise Query Rewriting by Ranking for Retrieval
VideoTrace-R1: Long Video-based Retrieval-Augmented Generation via Temporal Path Graph Understanding
Failure-Driven Workflow Refinement
Train Once, Reuse Everywhere: Generalizable Implicit ICL by Routing Attention
Probabilistic Salient Object Ranking
PrivAct: Internalizing Contextual Privacy Preservation via Multi-Agent Preference Training
InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity
Federated Distillation for Whole Slide Image via Gaussian-Mixture Feature Alignment and Curriculum Integration
DFSAttn: Dynamic Fine-grained Sparse Attention for Efficient Video Generation
Butterworth as Attention: Anisotropic Spectral Gating for Pansharpening
The Geometric Origin of Grokking: Accelerating Generalization via Active Structural Reorganization
Inverting Data Transformations via Diffusion Sampling
A Theory of Contrastive Learning with Natural Images
MiniMax Learning of Interpretable Factored Stochastic Policies from Conjoint Data, with Uncertainty Quantification
Unifying and Optimizing Data Values for Selection via Sequential Decision-Making
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
An Information-Theoretic Criterion for Efficient Data Synthesis
Persistent Backdoor Attacks in Class-Incremental Learning via Structural Invariant Anchoring
Row-stochastic matrices can provably outperform doubly stochastic matrices in decentralized learning
DREAM: A Unified Framework for Drift-Corrected Federated Multi-Objective Learning
GTPO and GRPO-S: Token and Sequence-Level Reward Shaping with Policy Entropy
Optimal Regularization for Performative Learning
Scalable and Interpretable Representation Alignment with Ordinal Similarity
Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
Transport or Discard: Robust Unbalanced Optimal Transport for Cross-Domain Policy Adaptation
HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction
Multicalibration Yields Better Matchings
Learnable Kernel Density Estimation for Graphs and Its Application to Graph-Level Anomaly Detection
VIP: Visual-guided Prompt Evolution for Efficient Dense Vision-Language Inference
EAGer: Entropy-Aware GEneRation for Adaptive Inference-Time Scaling
A Theory of Data Acquisition and Pricing at Scale
Active Regression for Single-Index Models with Unknown Link Functions
Convex Basins in Single-Index Model Loss Landscapes: Applications to Robust Recovery under Strong Adversarial Corruption
Understanding SAM through Minimax Perspective
IEC: When Information-Driven Exploration Meets Spectral Consensus via Primal–Dual Reward Regularization in Decentralized Multi-Agent RL
From Growing to Looping: A Unified View of Iterative Computation in LLMs
HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-bench
Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos
PaperBanana: Automating Academic Illustration for AI Scientists
SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems
Semantic Router: On the Feasibility of Hijacking MLLMs via a Single Adversarial Perturbation
Temporal Difference Learning for Diffusion Models
Romberg-Extrapolated Zeroth-Order Gradient Estimator: Higher-Order Bias Reduction with Preserved Leading Directional Variance
Mitigating Gradient Pathology in PINNs through Aligned Constraint
LightningRL: Breaking the Accuracy–Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
Characterizing Vision-Language-Action Models across XPUs: Constraints and Acceleration for On-Robot Deployment
TMD-Bench: A Multi-Level Evaluation Paradigm for Music–Dance Co-Generation
Belief Propagation Converges to Gaussian Distributions in Sparsely-Connected Factor Graphs
A geometric relation of the error introduced by sampling a language model's output distribution to its internal state
VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding
Efficient Distributionally Robust Assortment Optimization in MNL Bandits
KITE: Knowledge-Guided Probabilistic Modeling for Time Series Forecasting with Exogenous Variables
Variable Clustering via Distributionally Robust Nodewise Regression
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning
Predicting Future KV Utility: Global Combinatorial Optimization for Task-Agnostic KV Cache Eviction
Star Elastic: Many-in-One Reasoning LLMs with Efficient Budget Control
EXVERUS: Verus Proof Repair via Counterexample Reasoning
HELIX: Hybrid Encoding with Learnable Identity and Cross-dimensional Synthesis for Time Series Imputation
Steering Beyond the Support: Adversarial Training on Unsupervised Jailbroken Activation Simulation
EquiCAD: A Geometric Equivariant Neural Network for 3D Shape Classification
StructMAR: Structure-Aware Masked Autoregression for Explicit Layout Alignment in Text-to-Image Generation
When Sample Selection Bias Precipitates Model Collapse
Evidential Reasoning Advances Interpretable Real-World Disease Screening
CoRe: Combined Rewards with Vision-Language Model Feedback for Preference-Aligned Reinforcement Learning
MFH-NAS:A Hybrid Neural Architecture Search Framework for Multimodal Fusion Object Detection
E-mem: Multi-Agent Based Episodic Context Reconstruction for LLM Agent Memory
Around the World in Eighty Ratings? Quantifying the Salience of Geo-Cultural Values for Pluralistic Alignment
DenseMLLM: Standard Multimodal LLMs are Intrinsic Dense Predictors
Revisiting Distribution Correction Estimation for Offline Imitation Learning with Suboptimal Dataset
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Neutral-Reference Prompting for Vision–Language Models
HIVE-3D: Hierarchical Voxel Enhancement for High-Quality 3D Scene Generation
Intra-Modal Neighbors Never Lie: Rectifying Inter-Modal Noisy Correspondence via Graph-Based Intra-Modal Reasoning
Bridging Tokens and Geometry: Token-wise 3D Supervision for CAD Generation
Projection-Free Algorithms for Minimax Problems
Backward SDE–Based Diffusion for Physics-Constrained Generation
Who Said Neural Networks Aren't Linear?
Efficient, Validation-Free Intrinsic Quality Estimation for Large-Scale Face Recognition Datasets
SPLIT-VLM: Salience-Guided Partitioning towards Local Coverage for Importance-Aware Token Dropping in Vision-Language Models
Unleashing the Representational Power of Fourier Shapes for Attacking Infrared Object Detection
Unified Multimodal Visual Tracking with Dual Mixture-of-Experts
Crowd4D: Scene-Aware Monocular 4D Crowd Reconstruction
SkelHCC: A Hyperbolic CLIP-Driven Cache Adaptation Framework for Skeleton-based One-Shot Action Recognition
Bipartite Graph Attention-based Clustering for Large-scale scRNA-seq Data
CoverPruneGS: Coverage-Preserving Structured Pruning for Hierarchical 3D Gaussian Splatting from Sparse-View Monocular Videos
Towards Optimal Robustness in Learning-Augmented Paging
CocoRNA: Collective RNA Design with Cooperative Multi-agent Reinforcement Learning
RESIDUAL-GUIDED MULTI-RESOLUTION REFINEMENT OF FOUNDATION MODELS - A CASE STUDY IN DROUGHT FORECASTING
Global Plane Waves From Local Gaussians: Periodic Charge Densities in a Blink
Bandit Social Leaning Dynamics with Exploration Episodes
Prototype Transformer: Towards Language Model Architectures Interpretable by Design
Online Conformal Prediction via Universal Portfolio Algorithms
Spectral Reach: Understanding Neural Scaling through Kernel Alignment Dynamics
Flow for Future: Geometric SE(3)-Equivariant Flow Matching for 3D Trajectory Prediction
Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation
Control Consistency Losses for Diffusion Bridges
Learning from Pairwise Preferences in Long-Term Decision Problems
AIR: Improving Agent Safety through Incident Response
GHOST: Geometry-Guided Hallucination of Opaque Surface Textures
Epistemic Uncertainty Quantification for Pre-trained VLMs via Riemannian Flow Matching
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
Rethinking GNNs and Missing Features: Challenges, Evaluation and a Robust Solution
MultiPriv: Benchmarking Individual-Level Privacy Reasoning in Vision-Language Models
Semi-Supervised Gaze Estimation via Disentangled Subspace Contrastive Learning
Efficient Test-time Inference for Generative Planning Models with OCL Search
Adversarially Robust Approximate Furthest Neighbor
Seeking Commonality, Preserving Specificity: A Spectral-Aware Hierarchical Framework for Cross-City Road Representation Learning
Practical and Scalable Hamiltonian Monte Carlo Without the Metropolis Test
Probing How Scalable Table Data Enhances General Long-Context Reasoning
Training-Free Rate-Distortion-Perception Traversal With Diffusion
Message Passing on the Edge: Towards Scalable and Expressive GNNs
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
RADE: Unbiased Random Add-Drop Edge as a Regularizer
Tracing the Dynamics of Refusal: Exploiting Latent Refusal Trajectories for Robust Jailbreak Detection
Graph-Preference Learning: Debiasing Network-Sampled Human Feedback for Target Welfare Estimation
On the Salience of Low-Probability Tokens for AI-Generated Text Detection: A Multiscale Uncertainty Perspective
Mitigating Reward Hacking in RLHF via Bayesian Non-negative Reward Modeling
Riemannian MeanFlow for One-Step Generation on Manifolds
Generative Inverse Design with Abstention via Diagonal Flow Matching
Commit to the Bit: Reactive Reinforcement Learning Done Right
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers
$\phi$-Balancing for Mixture-of-Experts Training
Imposing Boundary Conditions on Neural Operators via Learned Function Extensions
WarmServe: Enabling One-for-Many GPU Prewarming for Multi-LLM Serving
LangForce: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries
Simple Algorithms for Bad Triangle Transversals with Applications to Correlation Clustering
FAIL: Flow Matching Adversarial Imitation Learning for Image Generation
UniFast-HGR: Scalable and Efficient Maximal Correlation for Multimodal Models
On Structured State-Space Duality
Geometry-Aware Contrastive Learning for Few-Shot Automatic Modulation Recognition
Revisiting Spectral Representations in Generative Diffusion Models
Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts
PrivCode++ : Latent-Conditioned Differentially Private Code Generation for Comprehensive Guarantees
DiP-G: Discrete Prompting for Graph Neural Networks
Knowing Bias, Doing Better: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement
Mixture of Distributions Matters: Dynamic Sparse Attention for Efficient Video Diffusion Transformers
PuzzleMoE: Efficient Compression of Large Mixture-of-Experts Models via Sparse Expert Merging and Bit-packed inference
Multi-task Linear Regression without Eigenvalue Lower Bounds: Adaptivity, Robustness and Safety
Move-Then-Operate: Behavioral Phasing for Human-Like Robotic Manipulation
CIRBench: Evaluating Large Language Models as LLVM IR Optimizers
Clustered Influence Functions
Sparse and Faithful Local Explanations with Piecewise Linear Surrogates
Trajectory Consistency for One-Step Generation on Euler Mean Flows
Differentially Private Preference Data Synthesis for Large Language Model Alignment
Towards Solving the Gilbert-Pollak Conjecture via Large Language Models
Clustering in Deep Stochastic Transformers
Beyond Softmax: A Natural Parameterization for Categorical Random Variables
Escaping Mode Collapse in LLM Generation
Optimized Deferral for Imbalanced Settings
Fairness in Aggregation: Optimal Top-$k$ and Improved Full Ranking
Data- and Variance-dependent Regret Bounds for Online Tabular MDPs
Noisy-Channel Minimum Bayes Risk Decoding
Clover: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation
Hallucination Detection from Structural Reasoning Model
Spatiotemporal Imputation with Graph-Informed Flow Matching
Beyond Point Predictions: Manifold Expansion and Dual Alignment for Robust Time Series Distillation
LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization
Efficient Distributed MLLM Training with ModalGlue
Predicting What Matters: Robust Generalist Robot Policy Learning via Future Semantic Mask
Decision Tree Learning on Product Spaces
Gradient Descent with Large Step Size Restores Symmetry in Deep Linear Networks with Multi-Pathway
From Bits to Rounds: Parallel Decoding with Exploration for Diffusion Language Models
Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis
Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs
H$^2$CL: Heterogeneity-Aware Hypergraph Contrastive Learning for Robust Representation
QuantumBoost: A lazy, yet fast, quantum algorithm for learning with weak hypotheses
OSM+: Billion-Level Open Street Map Dataset for City-wide Experiments
MAD: Manifold Attracted Diffusion
Preconditioning Neural Tangent Kernel for Adaptive Optimization
MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference for Robotic Manipulation
Normalization Equivariance for Arbitrary Backbones, with Application to Image Denoising
Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting
SpatioLM: Towards General Physical Spatial Intelligence in Vision-Language Models
A Tight Theory of Error Feedback Algorithms in Distributed Optimization
Patterning: The Dual of Interpretability
Geometric Conformal Prediction with Spatial Ranks and Multivariate Quantiles
Shared Semantics, Divergent Mechanisms: Unsupervised Feature Discovery by Aligning Semantics and Mechanisms
A Machine-Learned Comorbidity Index
Optimal Transport with Symmetry Groups
How Can Mamba Learn In Context with Outliers and Generalize Provably?
UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations
Where Concept Erasure Should Occur: Concept–Layer Alignment in Text-to-Video Diffusion Models
RaBiT: Residual Aware Binarization Training for Accurate and Efficient LLMs
Kernel-based Maximum-of-difference Test for Two-sample Comparison
Learning $U$-Statistics with Active Inference
The Information Geometry of Softmax: Probing and Steering
Prism-MoE: Efficient Dense-to-MoE Conversion for Visual Autoregressive Generation
UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory
$\text{DT}^\text{2}$: Decision-Targeted Digital Twins
Online Rubrics Elicitation from Pairwise Comparisons
EigenCache: Rethinking Diffusion Acceleration as Covariance-Optimal Forecasting and Submodular Information Allocation
Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing
Active Curriculum Refinement for Reinforcement Learning
A World in Pieces: Structural Certification of General Agents
CONGA:Confidence-and-Gradient-Aware Learning Rate Schedule for Test Time Adaptation
Fine-grained Analysis of Brain-LLM Alignment through Input Attribution
AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks
From Interactions to Principles: Experience-Driven Self-Distillation for Evolving LLM Agents
Joint-Embedding Predictive Learning of Latent Market States in U.S. Equities
SEMIR: Semantic Minor-Induced Representation Learning on Graphs for Visual Segmentation
Synergistic Space-Vision Processing for Predicate Inference
On Revisiting Entropy for Identifying Mislabeled Medical Images
In-Context Learning Is Provably Bayesian Inference: A Generalization Theory for Meta-Learning
Induction Heads Interpolate N-Grams
DiLA: Disentangled Latent Action World Models
From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching
Learning Permutation-invariant Macroscopic Dynamics
Sparse Tokens Suffice: Jailbreaking Audio Language Models via Token-Aware Gradient Optimization
EmBrace: A Collective Knowledge Fusion Framework Toward Unified EEG Foundation Models
RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
Certificates for Complex-Compatible Learned Cochain Laplacians
When Generalized Zero-Shot Learning Meets PU Learning: A Plug-and-Play Framework for Seen-Class Bias Mitigation
See, Act, Adapt: Active Perception for Unsupervised Cross-Domain Visual Adaptation via Personalized VLM-Guided Agent
Agentic Confidence Calibration
How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
UrbanMLLM: Joint Learning of Cross-view Imagery for Urban Understanding
Expert-guided Clinical Text Augmentation via Query-Based Model Collaboration
From Content to Knowledge: Lightning Fast Long-Video Understanding with Neural Knowledge Representations
Over-Alignment vs Over-Fitting: The Role of Feature Learning Strength in Generalization
CELL: A Causal Perspective for Fairness-aware Graph Adaptation
Transformers Provably Learn Algorithmic Solutions for Graph Connectivity, But Only with the Right Data
BAS: Bridging Adam and SignSGD for Memory-Efficient LLM Training
GPan-LoRA: Gaussian Process Amortized Networks for Bayesian Low-Rank Adaptation in Large Language Models
How to Price Data: A Market Equilibrium Based Approach
AVTrack: Audio-Visual Speaker Tracking in Complex Scenes
General Quantification of Covariate and Concept Shifts
Structure Abstraction and Generalization in a Hippocampus-Entorhinal Inspired World Model
Bridging Dynamics and Data: A Unified Diffusion Framework for Mechanistically-Informed Epidemic Forecasting
UGround: Towards Unified Visual Grounding with Unrolled Transformers
UHR-BAT: Budget-Aware Token Compression Vision-Language model for Ultra-High-Resolution Remote Sensing
Respecting Modality Gap in Post-hoc Out-of-distribution Detection with Pre-trained Vision-Language Models
CoDA-Bench: Can Code Agents Handle Data-Intensive Tasks?
Demystifying LLM-as-a-Judge: Analytically Tractable Model for Inference-Time Scaling
Weight-Space Learning for Certifiable Few-shot Transfer Learning
IACW: Intent-Aware Controllable Watermarking for Scalable Authorial Intent Attribution
Small Agent Group is the Future of Digital Health
Formal Concept Lattices are Good Semantic Scaffolds for Concept-Based Learning
Calibrated Knowledge Aggregation in Bayesian Mixture-of-Experts for Continual VQA
RDT2: Exploring the Scaling Limit of UMI Data Towards Zero-Shot Cross-Embodiment Generalization
IdEst: Assessing Self-Supervised Learning Representations via Intrinsic Dimension
$L^3$: Large Lookup Layers
GemDepth: Geometry-Embedded Features for 3D-Consistent Video Depth
Does Reinforcement Fine-Tuning Improve Generalization of LLM Agents? An Empirical Study
Batched First-Order Methods for Parallel LP Solving in MIP
Provably Label-Efficient Conformal Prediction
BIOARC: Discovering Optimal Neural Architectures for Biological Foundation Models
Regularization in the Axiomatic Approach to Learning from Human Preferences
Resilient Coresets and Consistent Clustering
Coverage ≠ Exposure: Auditable Control of Same-Support Tail Failures under Multimodal Missingness
CAT-Q: Cost-efficient and Accurate Ternary Quantization for LLMs
Understanding Behavior Cloning with Action Quantization
Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR
Towards Pareto-Optimal Tool-Integrated Agents with Pareto Ranking Policy Optimization
OSAQ: Outlier Self-Absorption for Accurate Low-bit LLM Quantization
Learning Manifold Data with Flow Matching
ProcMEM: Learning Reusable Procedural Memory from Experience via Non-Parametric PPO for LLM Agents
The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind
Kalman Linear Attention: Parallel Bayesian Filtering For Efficient Language Modeling and State Tracking
Feature Resemblance: Towards a Theoretical Understanding of Analogical Reasoning in Transformers
Making Foundation Models Probabilistic via Singular Value Ensembles
Approximation Preserving Coresets
ScenePilot: Controllable Boundary-Driven Critical Scenario Generation for Autonomous Driving
GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation
Search for Truth from Reasoning: A Dynamic Representation Editing Framework for Steering LLM Trajectories
Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment
S$^3$GNN: Efficient Global Mixing and Local Message Passing for Long-Range Graph Learning
Spherical Steering: Geometry-Aware Activation Rotation for Language Models
GOCM: Single-Step Graph Outlier Synthesis via Origin Consistency Model
New Bounds for Kernel Sums via Fast Spherical Embeddings
Correcting in Hindsight: Editing Past Key-Value States for Robust LLM Reasoning
Marrying Generative Model of Healthcare Events with Digital Twin of Human-Environment Interaction for Disease Reasoning
BAT: Better Audio Transformer Guided by Convex Gated Probing
Beyond Drift: Stabilizing Subjective LLM Evaluation with Information-Theoretic Rubrics
Certificate-Guided Pruning for Stochastic Lipschitz Optimization
Select to Think: Unlocking SLM Potential with Local Sufficiency
Statistically Optimal Scaling for Token Merging in Transformers
Gateways to Tractability for Satisfiability in Pearl’s Causal Hierarchy
Unlearning in Diffusion Models: A Unified Framework with KL Divergence and Likelihood Constraints
Learning to Discover at Test Time
Geometric Rate–Distortion Invariance for Domain Generalization
Beyond Model Base Retrieval: Weaving Knowledge to Master Fine-grained Neural Network Design
RVAS: Referring Video Active Exploration and Segmentation
How Out-of-Distribution Detection Learning Theory Enhances Transformer: Learnability and Reliability
Scaling Real-World Robot Policy Evaluation via Discrete Diffusion World Model
AG-REPA: Causal Layer Selection for Representation Alignment in Audio Flow Matching
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
Anti-Backdoor Coreset Selection via Cumulative Entropy
Mixture Prototype Flow Matching for Open-Set Supervised Anomaly Detection
RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress
Sparse Topology-Aware Pairwise Scoring for Large-Scale Multi-Agent Reinforcement Learning
DuRP: Dual-Stage Physics-Embedded Learning for Joint Radiance and Polarization Restoration
Class-Prior Perturbation-Robust Regularization for Imbalanced Unreliable Partial Label Learning
PLSemanticsBench: A Formal Semantics Reasoning Benchmark for Code
Information-Theoretic Generalization Bounds for VAEs: A Role of Encoder and Latent Variable
The Geometry of Updates: Fisher Alignment at Vocabulary Scale
Bayes-inspired Integration of Pretrained Priors and Few-Shot Evidence for Few-Shot Classification
Forward-Chaining Temporal Point Process
Olmix: A Framework for Data Mixing Throughout LM Development
General and Efficient Steering of Unconditional Diffusion Models
Strategy Executability in Mathematical Reasoning: Leveraging Human–Model Differences for Effective Guidance
TD-VAD: Breaking Visual Dependence in Video Anomaly Detection with Text-Driven Learning
EGG: An Expert-Guided Agent Framework for Kernel Generation
Executable Agentic Memory for GUI Agent
SlerpFlow: Spherical Trajectory Correction for Rectified Flow Inversion
A Strictly Proper Scoring Rule and a Calibration Metric for Interval-Censored Data Analysis
Retrieval-Aware Distillation for Transformer-SSM Hybrids
LiveNewsBench: Evaluating LLM Web Search Capabilities with Freshly Curated News
Directly Optimizing Natural Language Explanations for Behavioral Faithfulness: Simulatability and Recoverability
Judgment Operators: A Composition-Invariant Substrate for Multi-Agent Action Spaces
RTInfer: Exploiting Concurrency for Multiple Real-Time DNN Inference on Edge GPUs
Rethinking Depth Pruning for Vision Transformers: A Heterogeneity-Aware Perspective
CVSearch: Empowering Multimodal LLMs with Cognitive Visual Search for High-Resolution Image Perception
PAMD: Structured Adaptive Distances for Bisimulation Representations in Visual Reinforcement Learning
BLOCK-EM: Preventing Emergent Misalignment via Latent Blocking
Required Spine Optional Limbs: Heterogeneous Federated Learning via Backbone-sharing and Activation-guided Selection
In-Training Defenses Against Emergent Misalignment in Language Models
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
Large-Scale Notification Dispatch with Bundle Treatments and Multi-Outcome Uplift Optimization
WF-Bench: A Benchmark for Neural-Network WaveFunction Expressivity and Scaling Laws
How Language Models Process Negation
CoLA: Cross-Modal Low-rank Adaptation for Multimodal Downstream Tasks
Simple Policy Gradients for Reasoning with Diffusion Language Models
PCRNet: Phase-aware Complex Refinement Network for EEG-based Auditory Attention Decoding
Data-driven Mixed Integer Optimization through Probabilistic Multi-variable Branching
Width Independent Bounds for the Local Lipschitz Constant of Deep Neural Networks at Random Initialization and after Lazy Training
MDGMIX: Boundary-Aware Subgraph Mixing for Multi-Domain Graph Pre-Training
Matroid Algorithms Under Size-Sensitive Independence Oracles
Copyright-Bench: Agentic Evaluation of Copyright Law Compliance
Sharpness-Aware Minimization Can Hallucinate Minimizers
SEDRAS: Symbolically Evaluated Deep Research And Science
Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance
SVD as a Fast Interpretability Method for Transformers
ChartE$^{3}$: A Comprehensive Benchmark for End-to-End Chart Editing
FaPS: A General and Fast Training Method for Diffusion Models
PHALAR: Phasors for Learned Musical Audio Representations
Tucker Attention: A generalization of approximate attention mechanisms
Reading the Cell, Designing the Cure: Perturbation-Conditioned Molecular Diffusion for Function-Oriented Drug Design
An Algebraic View of the Expressivity of Recurrent Language Models
Rex: A Family of Reversible Exponential (Stochastic) Runge-Kutta Solvers
NAVIGATE: Evaluating Visual-Guided Search Decision-Making on the Open Web
When Do Hallucinations Arise? A Graph Perspective on the Evolution of Path Reuse and Path Compression
Calibrated Multimodal Representation Learning with Missing Modalities
When Does Sparsity Mitigate the Curse of Depth in LLMs
Mixtures of geodesic factor analyzers on Riemannian homogeneous spaces
From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors
Anytime-Valid Inference for Online Ranking of Large Language Models
De4D-SLAM: Gradient-Isolated Static-Dynamic Decoupling for Monocular SLAM in Dynamic Environments
Federated Bilevel Performative Prediction
Towards Reliable Marking and Verification of AI-Generated Text via Geometry-aware Sentence-level Watermarking
$\mathbb{R}^{2k}$ is Theoretically Large Enough for Embedding-based Top-$k$ Retrieval
V-ABS: Action-Observer Driven Beam Search for Dynamic Visual Reasoning
Temper-Then-Tilt: Principled Unlearning for Generative Models through Tempering and Classifier Guidance
Agent JIT Compilation for Latency-Optimizing Computer-Use Agent Planning and Scheduling
Uncertainty-Aware Clarification in LLM Agents with Information Gain
POLIA: Policy Optimization with Visual-Object-Level Intrinsic Advantage for Multimodal Reasoning
ASIR: Steganography for Diffusion Models via Antipodal Sampling and Iterative Recovery
Transitivity Meets Cyclicity: Explicit Preference Decomposition for Dynamic Large Language Model Alignment
Breaking the Echo Chamber: A Dynamic Ensemble Pruning Perspective on MoE
Optimal Anytime Algorithms for Online Convex Optimization with Adversarial Constraints
Towards Context-Invariant Safety Alignment for Large Language Models
What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
SpikeNet: Sparse Spike-Driven Mask Vector Transformer for Energy-Efficient and Stable Spiking Point Cloud Processing
Partial Fusion of Neural Networks: Efficient Tradeoffs Between Ensembles and Weight Aggregation
Learn from A Rationalist: Distilling Intermediate Interpretable Rationales
A Diagnostic Study of Multi-Agent LLMs for Real-World Debates
One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models
From Imagined Futures to Executable Actions: Mixture of Latent Actions for Robot Manipulation
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs
Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking
Attention with Routed-Memory for Learnable Sparse Control
Identifiable Token Correspondence for World Models
Quantum Robust Inner Minimization for Reinforcement Learning with Quadratic Speed-Up in Query Complexity
RulePlanner: All-in-One Reinforcement Learner for Unifying Design Rules in 3D Floorplanning
OpenSage: Self-programming Agent Generation Engine
A Unified Framework for Diffusion Model Unlearning with f-Divergence
Dynamic High-Dimensional Facility Location with Low Recourse
GenDA: Generative Data Assimilation on Complex Urban Areas via Classifier-Free Diffusion Guidance
Trust Functions: Near Lossless Weak-to-Strong Generalization by Learning to Trust the Weak Teacher
Fix the Mind, Not the Move: Interpretable AI Assistance via Knowledge-Gap Localization
INDEXGUARD: Index-only Backdoor Vetting for Secure Federated PEFT of Large Language Models
Variance-Reduced $(\varepsilon, \delta)-$Unlearning using Forget Set Gradients
Transformers Learn the Optimal DDPM Denoiser for Multi-Token GMMs
FedARC: Anchor-Guided Residual Compensation for Data and Model Heterogeneous Federated Learning
Learning Global Representation from Queries for Vectorized HD Map Construction
Tailoring the Training: Difficulty-Aware Learning Strategy Allocation for Large Language Models
Variable-Length Tokenization via Learnable Global Merging for Diffusion Transformers
1-Bit Wonder: Improving QAT Performance in the Low-Bit Regime through K-Means Quantization
Reward-Preserving Counterfactual State Editing for Offline Reinforcement Learning
OMAC: A Holistic Optimization Framework for LLM-Based Multi-Agent Collaboration
Reasoning as an Attack Surface: Adaptive Evolutionary CoT Jailbreaks for LLMs
Personalized Policy Learning through Discrete Experimentation
Understanding Generalization and Forgetting in In-Context Continual Learning
Are Tools Always Beneficial? Learning to Invoke Tools Adaptively for Dual-Mode Multimodal LLM Reasoning
LagLLM: LLM-empowered lead–lag dependency learning for spatial-temporal time series forecasting
UNIVERSAL REPRESENTATION OF GENERALIZED CONVEX FUNCTIONS AND THEIR GRADIENTS
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?
Calibrating Generative Models to Distributional Constraints
On Expressive Power of Floating-Point Transformers
Theory of Minimal Weight Perturbations in Deep Networks and its Applications for Low-Rank Activated Backdoor Attacks
TextAtlas5M: A Large-Scale Dataset for Long Text Image Generation
Parametrized Power-Iteration Clustering for Directed Graphs
Sparser Block-Sparse Attention via Token Permutation
BiCrossNet with Decoupled Dual Generators: A Parameter‑Efficient and Generalizable Few‑Shot Custom Gesture Recognition Framework
Delving into Non-Exchangeability for Conformal Prediction in Graph-Structured Multivariate Time Series
Rethinking generative image pretraining: How far are we from scaling up next-pixel prediction?
Causes and Consequences of Representational Similarity in Machine Learning Models
To Grok Grokking: Provable Grokking in Ridge Regression
How Chain of Thought Decomposes Complex Tasks
Offline Multi-agent Continual Cooperation via Skill Partition and Reuse
TSFAdv: Frequency-Guided Black-Box Adversarial Attacks on Time Series Forecasting
Ideal Attribution and Faithful Watermarks for Language Models
Adaptive Estimation and Inference in Semi-parametric Heterogeneous Clustered Multitask Learning via Neyman Orthogonality
Universal Skeleton Understanding via Differentiable Rendering and MLLMs
Compositional Perception and Generalizing Induction: Latent Compositional Manifold Assumption on Generalized Category Discovery
ERAlign: Energy-based Representation Alignment of GNNs and LLMs on Text-attributed Graphs
Designing noise schedules for diffusion models with spectral analysis
Learning Locally, Revising Globally: Global Reviser for Federated Learning with Noisy Labels
ProtoVAR: Efficient Dataset Distillation via Prototype-Guided Visual Autoregressive Modeling
What Preferences Can—and Cannot—Predict in Multi-Agent Online Learning
Adaptively Robust Resettable Streaming
Domain Restriction via SAE Multi-Layer Transitions
DREAM-R: Multimodal Speculative Reasoning with RL-Based Refined Drafting, Precise Verification, and Fully Parallel Execution
Midtraining Bridges Pretraining and Posttraining Distributions
Partitioning for Intrinsic Model Inversion Resistance in Collaborative Inference
Biologically plausible heavy-tailed connectivity enhances generalizations on cognitive tasks in recurrent neural networks
The Heterogeneous Safety Impacts of Benign Multilingual Fine-Tuning
Towards Diverse Scientific Hypothesis Search with Large Language Models
MIMO-LP: A Multi-Input Multi-Output Framework for Subgraph-based Link Prediction
Geodesic Calculus on Implicitly Defined Latent Manifolds
Thinking in Latent Space: Progressive Multimodal Simplification for Visual Reasoning
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Regret Minimization With a Crowd of Awakening Experts
Neural-Inspired Modeling of Auditory Selection and Compensation for Audio-Visual Speech Separation
AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Grokking Finite-Dimensional Algebra
Structured Expert Routing with Multi-View Task Priors for Offline Meta-Reinforcement Learning
Active Reasoning Vision-Language Model via Sequential Experimental Design
Equilibrium Propagation for Non-Conservative Systems
Stop Training for the Worst: Progressive Unmasking Accelerates Masked Diffusion Training
REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations
Focus-Then-Contact: Speeding Up Robotic Contact-Rich Task Learning with Affordance-Guided Real-World Residual Reinforcement Learning
Tracing the Emergence of Symbol Grounding in Multimodal Language Models
VIRUS: Injecting Persistent Cognitive Pathogens into Stateful Zero-Shot Object Navigation Agents
Rethinking Time-Series Imputation as Conditional Inference along Temporal Evolution
KernelFoundry: Hardware-Aware Evolutionary GPU Kernel Optimization
Balancing Learning Rates Across Layers: Exact Two-Step Dynamics and Optimal Scaling in Linear Neural Networks
Homophily-Heterogeneity Gradient Surgery for Federated Graph Learning
RSA-CP: Efficient Conformal Prediction in Small-Sample Regimes via Random Score Alignment
ExCyTIn-Bench: Evaluating LLM agents on Cyber Threat Investigation
FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion
Less Token, More Signal: MoE Expert Pruning via Critical Token Selection
SpaceVista: All-Scale Visual Spatial Reasoning from mm to km
Improving Video Sparse Attention with Fine-grained Router and Sparse Rebasing
Beyond Single-View Indexing: Structure-Aware Multi-View Retrieval for Knowledge-Based VQA
MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy
Omni-Perception Policy Optimization for Multimodal Emotion Reasoning
MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models
Attacking Gray-Box Large Vision-Language Models with Adaptive SVD-Structured Adversarial Alignment
Adaptive Multiscale Binary Expansion Tests for Independence
Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion Models
Jailbreaking Vision-Language Models Through the Visual Modality
Formalizing the Binding Problem
How do Human Processes AI-generated Hallucination Contents: a Neuroimaging Study
MGAL: A Multilingual Granularity-Aware Long-Context Benchmark
Efficient Equivariant High-Order Crystal Tensor Prediction via Cartesian Local-Environment Many-Body Coupling
Value Aggregation with Uncertainty in Online Decentralized MARL
Efficient, Property-Aligned Fan-Out Retrieval via RL-Amortized Diffusion
Constrained Meta Reinforcement Learning with Provable Test-Time Safety
DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter
Which Reasoning Traces Are Worth Generating Further? Data Curation for Training Reasoning Models
Understanding Dynamics of Adam in Zero-Sum Games: An ODE Approach
AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation
Singular Proxies for Adaptive Caching in Diffusion Language Models
Towards a Science of AI Agent Reliability
REAR: Test-time Preference Realignment through Reward Decomposition
VecMol: Vector-Field Representations for 3D Molecule Generation
Natural Hypergradient Descent: Algorithm Design, Convergence Analysis, and Parallel Implementation
CaP-X: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation
Stochastic Gradient Variational Inference with Price's Gradient Estimator from Bures-Wasserstein to Parameter Space
ERGeoBench: A Comprehensive Benchmark for Embodied Reasoning and Geo-localization in Multimodal Large Language Models
Resolution as a Direction: Vector-Panning Feature Alignment for Cross-Resolution Re-Identification
Discovering Symmetry Groups with Flow Matching
Toward Cybersecurity-Expert Small Language Models
Boosting CVaR Policy Optimization with Quantile Gradients
The Catastrophic Failure of *the* k-Means Algorithm in High Dimensions, and How Hartigan's Algorithm Avoids It
Linear Causal Representation Learning by Topological Ordering, Pruning, and Disentanglement
Esoteric Language Models
From Extraction to Deduction: Resolving Functional Misalignment in RAG via a Collaborative Critic-Reasoner Framework
Deformba: Vision State Space Model with Adaptive State Fusion
Evolving Interdependent Operators with Large Language Models for Multi-Objective Combinatorial Optimization
Elastic Diffusion Transformer
Computing Provable Bounds for Exact Shapley Values of Neural Networks
Just Y-Prediction: Enabling Historical Cumulative Inconsistency in Label Diffusion for Learning with Noisy Label
Constrained Bayesian Experimental Design via Online Planning
$\sigma$: Sigmoid Modulation for Ultra High Resolution Diffusion
Conformal Reliability: A New Evaluation Metric for Conditional Generation
Independent Component Discovery in Temporal Count Data
Adversarial Training for Process Reward Models
CSPO: Constraint-Sensitive Policy Optimization for Safe Reinforcement Learning
Curriculum Reinforcement Learning for Black-Box Prompt Tuning via Large Language Models
PRISM: Synergizing Vision Foundation Models via Self-organized Expert Specialization
ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox
Genome-Factory: A Library for Tuning, Deploying, and Interpreting Genomic Foundation Models
Draft-and-Audit Reinforcement Learning for Optimization Modeling
GraphFlow: A Graph-Based Workflow Management for Efficient LLM-Agent Serving
Bias-Spectrum Neural Processes for Parametric PDEs: Architecture Priors Meet PDE Constraints
Accurate, private, secure, federated U-statistics with higher degree
Learning to Label: A Reinforced Self-Evolving Framework for Semi-supervised Referring Expression Segmentation
Reasoning Models Struggle to Control their Chains of Thought
LocalV: Exploiting Information Locality for IP-level Verilog Generation
HDFlow: Hierarchical Diffusion-Flow Planning for Long-horizon Tasks
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation
Beyond Reactivity: Proactive Adaptive Conformal Inference for Online LLM Factuality
Spike-HTR: Spiking Neural Transformer for Handwritten Text Recognition
Pressure Reveals Character: Behavioural Alignment Evaluation at Depth
Rectifying Gradient Trajectories: A Hierarchical Geometric Framework with Structural Constraints for Few-Shot EEG Adaptation
Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models
Meta-Black-Box Optimization Can Do Search Guidance for Expensive Constrained Multi-Objective Optimization
Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration
Fine-Tune Once, Reuse Across Models: Bayesian Task-Update Factors and Approximations
CalPro: Prior-Aware Evidential Conformal Prediction with Structure-Aware Sensitivity Bounds for Protein Structures
Inner-layer Token Self-Modulation as Another Scaling Axis for LLMs
AutoVSR: Automatic Visual-to-Symbolic Reasoning for Symbolic Expression Generation from Circuit Schematic
The cost of commitment in option-based hierarchical RL
Language Generation in the Limit: Complexity Barriers and Implications for Learning
Distilling Linearized Behavior into Non-linear Fine-Tuning for Effective Task Arithmetic
Split Group Knockoffs: Controlling False Discovery Rate in Transformational Group Sparsity
Towards a Holistic Understanding of Selection Bias for Causal Effect Identification
Robust Contextual Optimization with Missing Covariates
Unleashing Implicit Rewards: Prefix-Value Learning for Distribution-Level Optimization
REG: In-Sample RL via Regularizing the Evaluation Gap
Ekka: Automated Diagnosis of Silent Errors in LLM Inference
Variance Driven Exploration: A Provable and Efficient Methodology for Pure Exploration in Highly Stochastic Environments
Exploring and Exploiting Stability in Latent Flow Matching
Identifiable Equivariant Networks are Layerwise Equivariant
Convergence Analysis of the Lion Optimizer in Centralized and Distributed Settings
ECSEL: Explainable Classification via Signomial Equation Learning
Weaving in the Clouds: Achieving Synergistic Collaboration among LLM Agents via Federated Learning
Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles
Sampling and Identity-Testing Without Approximate Tensorization of Entropy
Transformer Circuits Can Realize Clustering Algorithms
The Abstraction Gap in Vision-Language Causal Reasoning
Heterogeneous Customizable Personalized Federated Fine-Tuning Approach for Large Language Models
A Penalty Approach For Differentiation Through Black-box Quadratic Programming Solvers
Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models
DR-MMSearchAgent: Deepening Reasoning in Multimodal Search Agents
A Sketch-and-Project Analysis of Subsampled Natural Gradient Algorithms
SARSteer: Safeguarding Large Audio Language Models via Safe-Ablated Refusal Steering
Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum
LALM-as-a-Judge: Benchmarking Large Audio-Language Models for Safety Evaluation in Multi-Turn Spoken Dialogues
Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective
SMART: Scalable Mesh‑free Aerodynamic Simulations from Raw Geometries using a Transformer‑based Surrogate Model
AC-ODM: Actor–Critic Online Data Mixing for Sample-Efficient LLM Pretraining
AutoSizer: Automatic Sizing of Analog and Mixed-Signal Circuits via Large Language Model (LLM) Agents
Optimizing Rank for High-Fidelity Implicit Neural Representations
Efficient Learning of Deep State Space Models via Importance Smoothing
Beyond Logits: Metastable Latent Dynamics for Sample-Efficient Best-of-N Selection in LLMs
From Per-Image Low-Rank to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers
Learn to Think: Improving Multimodal Reasoning through Vision-Aware Self-Improvement Training
Towards Efficient LLMs Annealing with Principled Sample Selection
SAEmnesia: Erasing Concepts in Diffusion Models with Supervised Sparse Autoencoders
Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling
TINNs: Time-Induced Neural Networks for Solving Time-Dependent PDEs
Causal discovery for time series with endogenous context variables
CRAG: Can 3D Generative Models Help 3D Assembly?
DiscoForcing: A Unified Framework for Real-Time Audio-Driven Character Control with Diffusion Forcing
Fair Dataset Distillation via Cross-Group Barycenter Alignment
Robust Bayesian Optimisation with Unbounded Corruptions
Vector Quantization using Gaussian Variational Autoencoder
Prototype-Grounded Concept Models for Verifiable Concept Alignment
GXPO: Group Cross-Lingual Relative Policy Optimization for Code Generation
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
Anomaly-Preference Image Generation
LOTTERY: Learning from Reference-Only Samples in Two-Sample Testing under Size Asymmetry
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
MoCL: Metabolic Optimization for Curvature-Aware Continual Learning
Hierarchical Causal Abduction: A Foundation Framework for Explainable Model Predictive Control
UniFLoW: Universal Multi-Modal Federated LoRA Fine-Tuning Framework with Analytical Aggregation
Model-Dowser: Data-Free Importance Probing to Mitigate Catastrophic Forgetting in Multimodal Large Language Models
CAffNet: Hard Constraint-Affine Neural Networks
Task-Aware Structured Memory for Dynamic Multi-modal In-Context Learning
Vision Language Models Cannot Reason About Physical Transformation
RQ-MoE: Residual Quantization via Mixture of Experts for Efficient Input-Dependent Vector Compression
ACTIVE-o3 : Empowering MLLMs with Active Perception via Pure Reinforcement Learning
On the Fragility of Data Attribution When Learning Is Distributed
Reason with Thumbnails, Answer with Focus: An Efficient and Effective Paradigm for Multimodal Grounded Visual Reasoning
MUSA-PINN: Multi-scale Weak-form Physics-Informed Neural Networks for Fluid Flow in Complex Geometries
PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding
On the Generalization in Topology Optimization via Sensitivity-Conditioned Bernoulli Flow Matching
An Interactive Paradigm for Deep Research
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Autonomous Machine Learning Engineering
MemIncept: Steering LLM Agents via Cooperative Stealthy Memory Injections
Symbol-Equivariant Recurrent Reasoning Models
VideoBrain: Learning Adaptive Frame Sampling for Long Video Understanding
OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft
When Model Merging Breaks Routing: Training-Free Calibration for MoE
NEMO: Execution-Aware Optimization Modeling via Autonomous Coding Agents
FedRGL: Robust Federated Graph Learning under Label Noise
FluxNet: Learning Capacity-Constrained Local Transport Operators for Conservative and Bounded PDE Surrogates
DC-LA: Difference-of-Convex Langevin Algorithm
Less Is More in Federated Continual Learning: RieSelect for Conflict-Aware Layer Selection in LLMs
Optimal Classical and Quantum Algorithms for Gradient Testing and Estimation by Comparisons
LiftQuant: Continuous Bit-Width Control for Pareto-Optimal LLM Deployment
PhaseAlign: Complex Phase Alignment for Stable Open-Vocabulary Semantic Segmentation
INT vs. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
Object-level Semantic and Spatial Distillation for Open Vocabulary Detection
Simultaneous Confidence Bounds for Aggregated Effects via Exact Subset Optimization
Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization
TACTIC: Task-Aware Sparse Coordination Graphs for Multi-Task Multi-agent Reinforcement Learning
Feature-aware (Hyper)graph Generation via Next-Scale Prediction
Structure-Induced Information for Rerooting Levin Tree Search
ClinTutor-R1: Advancing Scalable and Robust One-to-Many Alignment in Clinical Socratic Education
Directional Neural Collapse for Self-Supervised Visual Representation Learning
Broadening the Backdoor Basin: Understanding LLM Backdoors Collapse and Making Backdoors Persistent
Safe Reinforcement Learning with Preference-based Constraint Inference
Computational Arbitrage in AI Model Markets
Explicitly Modeling Censoring Produces Superior Survival Predictors
Shift-Dependent Asymmetry: Orthogonal Inverse Low-Rank Adaptation for Federated Medical Segmentation
MetaOthello: A Controlled Study of Multiple World Models in Transformers
VideoSEG-O3: A Multi-turn Reinforcement Learning Framework for Reasoning Video Object Segmentation
Fast Estimation for Forest Matrix of Signed Graphs
HECTOR: Hybrid Editable Compositional Object References for Video Generation
Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization
Robustness of Mixtures of Experts to Feature Noise
Incomplete Multi-View Clustering via Neighborhood-Conditioned Diffusion
Efficiently Solving Discounted MDPs via Predictions with Unknown Prediction Errors
Learning to Approximate Uniform Facility Location via Graph Neural Networks
SceneDirector: Bridging Explicit Geometry and Generative Priors for Unified Driving Scene Editing
FOCA: Future-Oriented Conditioning for Data-Efficient Vision-Language-Action Adaptation
WinDeskGround: A Benchmark for Robust GUI Grounding in Complex Multi-Window Desktop Environments
EpiCoCo: De Novo Epitope Generation via MHC-Context Co-Modeling and Contrastive Affinity Guidance
Towards Docking-oriented De Novo Ligand Design via Gradient Inversion
Learning Molecular Semantic Invariant Representation with Prototype Constraint
Can local learning match self-supervised backpropagation?
Constructing Industrial-Scale Optimization Modeling Benchmark
MPFM: Cross Multi-Domain Prototype Flow Matching for Log Anomaly Detection
SciAgentGym: Benchmarking Multi-Step Scientific Tool-Use in LLM Agents
Understanding LoRA as Knowledge Memory: An Empirical Analysis
AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search
FreeText: Training-Free Text Rendering via Attention Localization and Spectral Glyph Injection
Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data
Online Contract Design With Unknown Technology
ForceForget: Reinforcement Concept Removal for Enhancing Safety in Text-to-Image Models
Generalization Bounds for Out-of-distribution Generalization
CAST: Modeling Visual State Transitions for Consistent Video Retrieval
Can We Build a Monolithic Model for Fake Image Detection? SICA: Semantic-Induced Constrained Adaptation for Unified-Yet-Discriminative Artifact Feature Space Reconstruction
Faithful Relational Reasoning with Region-based Embeddings: Expressivity of Convex Coordinate-wise Models
Protein Autoregressive Modeling via Multiscale Structure Generation
The Consistency Trap in LLMs: Generator-Evaluator Agreement and Vulnerability to Mistakes
Learning Permutation from Structure Without Supervision
Generative Modeling of Discrete Latent Structures via Dynamic Policy Gradients
The Tell-Tale Norm: $\ell_2$ Magnitude as a Signal for Reasoning Dynamics in Large Language Models
Low-Compute Watermark Removal via Dual-Domain Natural Projection
UnMaskFork: Test-Time Scaling for Masked Diffusion via Deterministic Action Branching
HiCI: Hierarchical Construction–Integration for Long-Context Attention
SI-IGCL: Subject Invariance-aware Inverse Graph Contrastive Learning for Psychiatric Disorder Identification
Position: Self-Play Only Evolves When Self-Synthetic Pipeline Ensures Learnable Information Gain
Position: In Defense of Information Leakage in Concept-based Models
Position: Unplugging a Seemingly Sentient Machine Is the Rational Choice — A Metaphysical Perspective
Position: Evidence and Implications of Texture Bias in Deep Neural Networks
Position: Token Taxes Can Mitigate AI's Economic Risks
HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Position: Prompts for Public-Sector LLMs Should Be Governed as Commons
Position: Weight Space Should Be a First-Class Generative AI Modality
Position: Federated Learning is a Lens towards a Democratized Future for the Scaling Law Era
Position: We Need A Unified Definition of Hallucination (It’s The World Model, Stupid!)
Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering
Position: Responsible AI for AI companions must actively combat violence toward intimate partners
Position: Verifiable Data Minimization is a Prerequisite for Responsible, Privacy-Preserving Industrial Vision
Position: Interestingness is an Inductive Heuristic for Future Compression Progress
Position: Breaking the Dual Curse of Multilingual AI Requires Socio-Technical Guardrails, Not Post-Hoc Alignment
Position: Multiplicity is an Inevitable and Inherent Challenge in Multimodal Learning
Position: Profiling Game Worlds by Transition Complexity
Position: Multi-Agent Systems Should Prioritize Concurrency Control
Position: Benchmarks for Vision–Language Models in Urban Perception Should Be Reliability-Aware and Negotiated
Position: *Beyond Text* The Text-Centric Bias in Foundation Models Must Be Revisited for a Speech-First Future
Position: LLM Agents Are the Antidote to Walled Gardens
GAE: Unleashing Physical Potential of VLM with Generalizable Action Expert
Position: Peer Review in ML/AI Conferences Should Separate Publication from Presentation and Offer Non-Anonymous Review Tracks
Position: Irresponsible AI: big tech’s influence on AI research and associated impacts
Position: LLM for Physics Research Requires Domain-Specialized Training and Tooling
Position: Temporal Measurement Interval Determines Computational and Model Complexity in Single-Cell Perturbation Analysis
Position: LLMs can't jump
Position: Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences
Position: Behavioral Systems Require Behavioral Tests
Position: Towards Responsible Evaluation for Text-to-Speech
LECTOR: Joint Learning of Scientific Reasoning Graphs and Introduction Generation
Position: Assistive AI requires Personalized Specialists, not Generalists
Position: ICML Should Treat Hosted LLM APIs as Versioned Dependencies and Require Drift-Audit Artifacts
Position: When AI Decides Who Gets an Organ: Multi-Agentic AI Systems in Transplant Medicine Risk Amplifying Disparities Without Targeted Explainability and Deployment Strategies
Position: Current Benchmarking Hinders Real Progress in Deep Learning for Time Series Forecasting
Position: Benchmarks Do Not Measure Deployment Readiness in Clinical AI
Position: Preregister Experiments with AI Agents
Position: Epistemic uncertainty estimation methods are fundamentally incomplete
Position: Every Ground Truth is a Human Construction, not an Objective Truth
Visual Persuasion: What Influences Decisions of Vision-Language Models?
Position: AI Evaluation Should Work With Humans
Why Are Linear RNNs More Parallelizable?
Position: Prioritize Identifying Structure, Not Complex Models, for Scientific Discovery
Position: Neglecting the Sustainability of AI is Fuelling a Global AI Arms Race
Position: AI Researchers Must Lead Arms Control to Mitigate Military AI Risks
Position: Multiple Definitions & Unrealistic Assumptions of Model Collapse Distract from Real World Threats
Position: Topological Machine Learning Cannot Progress without Experimental Standards
Position: Current Model Cards Are Insufficient for Downstream Governance of Open-Weight Foundation Models
Position: Carbon Footprint Reporting Should Be Routine in Machine Learning Research
Position: Beyond Sensitive Attributes, ML Fairness Should Quantify Structural Injustice via Social Determinants
Position: Video LLMs Must Not Ignore the Pixel Dynamics in Plain Sight
Position: Creating High-Fidelity Synthetic Training Data Should Employ Multi-level Optimization
Position: Stop Using Culturally Biased Human Cognitive Benchmarks to Evaluate LLMs
Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities
PMSPO: Progressive Matching and Semantic-Aware Policy Optimization for Camouflaged Object Detection
Position: Agentic Systems Should be General
Position: Time-Series Foundation Models Require Explicit Domain-Level Benchmarks
Position: Reframing Hallucination: Latent Space Geodesics as a Pathway for Generative Discovery
Unraveling Syntax: Language Modeling and the Substructure of Grammars
Position: Don't Just "Fix it in Post'': A Science of AI Must Study Learning Dynamics
Position: Comprehensive AI governance requires addressing non-model capability gains
Position: Ideas Should be the Center of Machine Learning Research
Position: Age Estimation Models Do Not Process Biometric Data
Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory
Position: Web Agents Should Use Typed Actions Instead of Click-Based Browsing
Position: The Age of AI Agents Demands A New Scientific Paradigm To Sustain Trustworthy Science
Position: Model identity in machine learning is a convention, not a property
Position: Spatial Fairness: Foundations, Pitfalls, and a Path Forward
Position: To Defend Against Cyber Attacks, We Must Teach AI Agents to Hack
Position: Sycophancy is an Educational Safety Risk: Why LLM Tutors Need Sycophancy Benchmarks
Position: Genomic Model Research Must Move Beyond Anecdotal Evaluation of Interpretability Methods
Position: Knowing Isn’t Understanding: Re-grounding Generative Proactivity with Epistemic and Behavioral Insight
Position: Embodied AI Requires a Privacy-Utility Tradeoff
Position: Virtual Cells Need Context, Not Just Scale
Position: Safe Models Do Not Guarantee Safe Societies: The Case for Sociopolitical Risk
Position: VLM Causal Reasoning Benchmarks Should Probe Temporal Understanding, Not Presume It
Position: Sustainable Open-Source AI Requires Tracking the Cumulative Footprint of Derivatives
Position: The Inevitable Transition to Machine Learning in Quantum Chemistry
Position: Robust AI Personalization Will Require a Human Context Protocol
Position: Natural Language Should Not Fully Replace Formal Languages
Position: Predicting AI’s Impact on Labor Is a Core Machine Learning Problem
Position: Artificial Intelligence Needs Meta Intelligence - the Case for Metacognitive AI
Position: Privacy Is a Claim, Not a Property of Synthetic Data
Position: AI Must Become Planet-Centered, Not Human-Centered
Position: AI Leaderboards Are Underserving the Global South: A Case Study from India
Position: Retire the "Positive Backdoor" Label—Secret Alignment Requires Strict and Systematic Evaluation
Position: RL Researchers Need to Distinguish Between Solving Simulators and Using Simulators as a Proxy
Position: Generative Engine Optimization Creates Underexamined Risks, Governance Must Target Concentration, Disclosure, and Academic Blind Spots
Position: `AI Alignment' Encompasses Competing Technical Priorities
Position: Adversarial ML for LLMs Is Not Making Any Progress
Position: Reasoning After Perception Means Reasoning Without Vision
Position: Prompting Intent Should Be Audited in LLM-Assisted Peer Review
Position: Explainability Research Must Prioritize Foundations over Ad-hoc Methods
Position: Responsible Practices and Model Performance are Not Competing Goals
Position: Agent Security Needs Redefinition through a Holistic Framework
Position: Express Your Doubts — Probabilistic World Modeling Should not be Based on Token *logprobs*
Position: We Need AI Efficiency Incentives for Accessibility and Sustainability
Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models
Position: Generative Models Erode Temporal Learning Through Market Selection
Position: AI Capabilities Are Not Increasing Exponentially
Position: Enabling Fair Revenue Sharing for Data Providers in GenAI Systems
Distributional Active Inference
Position: Agent Evaluation Should Be Agentified for Openness, Standardization, and Reproducibility
Performative Learning Theory
Position: It’s Time to Optimize for Self-Consistency
Position: The Case for Theory-Level Autoformalization
Position: AGI Requires a Coordination Layer on Top of Pattern Repositories
Position: No Retroactive Cure for Infringement during Training
LiMuon: Light and Fast Muon Optimizer for Large Models
Position: Collaborative Agentic AI Needs Interoperability Across Ecosystems
Position: Early-Stage Quality Assurance in Annotation Pipelines Is More Cost-Effective Than Late-Stage Validation
Position: Causality is Key for Interpretability Claims to Generalise
Position: Machine Learning for Heart Transplant Allocation Policy Optimization Should Account for Incentives
Position: Metaphysical Concepts in AI Should Be Judged by Their Consequences
Position: Fairness Failure in Generative Models is an Evaluation Problem
Position: LLM-Based Social Simulations Require a Boundary
Position: Deciphering the Functions of DNAs, RNAs, and Proteins Should Consider Multi-Modal Large Language Models
Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents
Position: Agentic Safety is an Epistemic Property, Not a Behavioral One
Position: Current XAI Methods Cannot Satisfy Financial AI Explainability Requirements
Position: World Models as an Intermediary between Agents and the Real World
Position: Make Planning Research Rigorous Again!
Position: Improved Documentation is Necessary for Benchmarking AI Systems in Geometry
Position: Stop Automating Peer Review Without Rigorous Evaluation
Position: Good Embodied Reward Models Need Bad Behavior Data
Position: Generative Distributional Integrity against Backdoor Attacks
Semi-Supervised Learning with Noisy Covariates: Generalization Bounds and Distribution Regression
Action-Sufficient Goal Representations
Riemannian Neural Optimal Transport
Learning to Emulate Chaos: Adversarial Optimal Transport Regularization
CLIMB: Taming the LoRA Residency Cliff in Multi-LoRA Serving
Prefix cache aware data reordering for LLM augmented database analytics
Optimal and Scalable MAPF via Multi-Marginal Optimal Transport and Schrödinger Bridges
What Language is This? Ask Your Tokenizer.
PFT: Phonon Fine-tuning for Machine Learned Interatomic Potentials
Look on Demand: A Cognitive Scheduling Framework for Visual Evidence Acquisition in Multimodal Reasoning
Position: AI for Science Should Treat Measurement-to-Dataset Pipelines as Inference Components
Forward-KL Convergence of Time-Inhomogeneous Langevin Diffusions
Representation Unlearning: Forgetting through Information Compression
Speculative Sampling For Faster Molecular Dynamics
AOEPT: Breaking the Implicit Modality-Reduction Bottleneck in Modality Missing Prompt Tuning
Understanding Transfer Learning of RNA Foundation Models on Downstream Tasks
Unpaired Visual Editing with Self-Consistent Flow Matching
AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Material Structures
Training Language Model Agents to Find Vulnerabilities with CTF-Dojo
Categorical Reparameterization with Denoising Diffusion models
Multiview Self-Representation Learning across Heterogeneous Views
Deterministic Component Mining for Multi-framework UI2Code Generation
Debate2Create: Robot Co-design via Multi-Agent LLM Debate
Game-Theoretic Co-Evolution for LLM-Based Heuristic Discovery
Reinforcement-aware Knowledge Distillation for LLM Reasoning
Provably Learning Attention with Queries
Position: Measuring Human Preferences in RLHF is a Social Science Problem
PS-PPO : Prefix-Sampling PPO for Critic-Free RLHF
Causal Feature Learning via Generalized Rayleigh Quotients
Backward Oversmoothing: why is it hard to train deep Graph Neural Networks?
Overthinking: Amplifying Reasoning Weights to Extract Learned Secrets
Bridging RGB and RAW: Single-step Deterministic Flow with Homogeneous Aligned Guidance
TarGATE: Target-Aware Data Selection via Token-Attenuation Gates
Skewness-Robust Causal Discovery in Location-Scale Noise Models
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
Token-Efficient Change Detection in LLM APIs
MADE: Benchmark Environments for Closed-Loop Materials Discovery
BioToken and BioFM – Biologically-Informed Tokenization Enables Accurate and Efficient Genomic Foundation Models
Transfer Learning in High-dimensional Ising Models
What If We Let Forecasting Forget? A Sparse Bottleneck for Cross-Variable Dependencies
Non-Parametric Probabilistic Robustness: A Conservative Risk Estimator under Unknown Perturbation Distributions
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process
Lower Complexity Bounds for Nonconvex-Strongly-Convex Bilevel Optimization with First-Order Oracles
Expressive Graph Neural Networks via Equivariant Use of Noise
Adversarial Reinforcement Learning for Robust Diffusion Large Language Model Unlearning
Richer Bayesian Last Layers with Subsampled NTK Features
Gradient-Aware Scheduling: Coupling Curriculum and Staleness for Async Reinforcement Learning
Beyond Trajectory-Level Attribution: Graph-Based Credit Assignment for Agentic Reinforcement Learning
G$^2$TAM: Geometry Grounded Track Anything Model
Sinkhorn Treatment Effects
Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs
Self-Soupervision: Cooking Model Soups without Labels
Ask Less, See More: Communication-Conditioned Token Pruning for Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models
Adaptive Group Elicitation via Multi-Turn LLM Interactions
Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning
Rethinking Convergence in MoE Training: The Role of Routing Sparsity
The surprising strength of weak classifiers for validating neural posterior estimates
Causal Modeling of Selection in Evolution
Learning Latent Action World Models In The Wild
Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization
Low-dimensional topology of deep neural networks
Predicting Large Model Test Losses with a Noisy Quadratic System
Lightweight and Interpretable Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale
Causal Identification from Counterfactual Data: Completeness and Bounding Results
Scaling Laws in Model Fine-tuning for Audio DeepFake Detection
LayerT2V: A Unified Multi-Layer Video Generation Framework
Improved Scaling Laws via Weak-to-Strong Generalization in Random Features Ridge Regression
The Geometry of Narrow Fine-Tuning Degradation: Trajectory Lock-in and Spectral Bifurcation
TMS: Trajectory-Mixed Supervision for Reward-Free, On-Policy SFT
NorMuon: Making Muon more efficient and scalable
Distributional Inverse Reinforcement Learning
Reward Hacking Benchmark: Measuring Exploits in LLM Agents with Tool Use
Stem: Rethinking Causal Information Flow in Sparse Attention
ProEval: Proactive Failure Discovery and Efficient Performance Estimation for Generative AI Evaluation
(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models
d2: Improved Techniques for Training Reasoning Diffusion Language Models
Prior Diffusiveness and Regret in the Linear-Gaussian Bandit
Mitigating Per-Sample Harm in Stochastic Optimization
The Signal is in the Steps: Local Scoring for Reasoning Data Selection
Re-FORC: Adaptive Reward Prediction for Efficient Chain-of-Thought Reasoning
Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering
Normalized Rewards for Preference Optimization
On Minimum Depth and Width of Floating-Point Neural Networks for Representing Floating-Point Functions
BVS: Bayesian Visual Search with Multimodal Large Language Model for Fine-grained Perception
Rethinking Neural Network Learning Rates: A Stackelberg Perspective
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers
Does AI Reviewer See the Full Picture? Attacking and Defending Multimodal Peer Review
Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models
A Capacity-Based Rationale for Multi-Head Attention
Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression
Improved Distribution Estimation in $\ell_\infty$
Securing Multimodal AI through Internal Information Decomposition
Think in Latent, Explain in Language: Self-Explainable Latent Reasoning
Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning
The Perception–Physics Paradox: Probing Scientific Alignment with TC-Atlas
RiskZero: Plan More to Risk Less with a Learned Model
Curated Synthetic Data Doesn’t Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences
Explaining Concept Shift with Interpretable Feature Attribution
Mean Flow Policy Optimization
From Guessing to Placeholding: A Cost-Theoretic Framework for Uncertainty-Aware Code Completion
Expanding the AI Evaluation Toolbox with Statistical Models
Asymptotic Optimality of the High-Dimensional Gaussian Mechanism and Improved Low-Dimensional Mechanisms for Differential Privacy
INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic
Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models
ScoreMatchingRiesz: Score Matching for Debiased Machine Learning and Policy Path Estimation
Human-AI Collaborative Uncertainty Quantification
ProtDBench: A Unified Benchmark of Protein Binder Design and Evaluation
Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models
SURF: Separation via Unsupervised Remixing Flow
Axiomatic Atlas: A Prescriptive Framework for Neural Architecture Design
Great Minds Think Alike: Contextual Tacit Communication for Decentralized LLM-Agent Cooperation
Stabilizing Native Low-Rank LLM Pretraining
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Accelerating Q-learning through Efficient Value-sharing across Actions
TABX: A High-Throughput Sandbox Battle Simulator for Multi-Agent Reinforcement Learning
Towards Trustworthy Video Anomaly Understanding: A Class-Guided Chain-of-Evaluation Metric and An Anomaly-focused Meta-Benchmark
Certifying Graph Neural Networks Against Label and Structure Poisoning
Agentic Proposing: Enhancing Large language Model Reasoning via Compositional Skill Synthesis
TabICooL: A better, faster, scalable, and open tabular foundation model
High-Dimensional Learning Dynamics of Quantized Models with Straight-Through Estimator
Dynamics and representation structure of local approximations to gradient-based learning in linear recurrent neural networks
Artificial Hippocampus Networks for Efficient Long-Context Modeling
Alleviating Observation Bias via Causal-Invariant Meta-Learning for Unbalanced Incomplete Multi-view Clustering
RubricRobustness: A Simple Framework for Evaluating the Robustness of Rubrics-Based Benchmarks
A Bi-metric Framework for Efficient Nearest Neighbor Search
On the Convergence of Steepest Descent and Adaptive Gradient Methods under Non-Uniform Smoothness
Anytime-Valid Inference Under Outcome Delay: A Design-Based Approach
GAAVI: Global Asymptotic Anytime Valid Inference for the Conditional Mean Function
Language Generation with Feedback: Queries and Mistakes
Crisp: A Spectral-Based Interaction Strategy for Multivariate Time Series Forecasting
Efficient Rashomon Set Approximation for Decision Trees
A Theory of How Pretraining Shapes Inductive Bias in Fine-Tuning
Learning-Augmented Online Covering Problems
Learning on Higher-Order Structures with Effective Operators
Explaining Data Mixing Scaling Laws
Accuracy and Normalized Accuracy under Length Bias: Analysis, Guidelines, and a Bayesian Alternative
Probing the Inductive Bias of Neural Networks through Learning Random Cellular Automata
On the Sample Efficiency of Inverse Dynamics Models for Semi-Supervised Imitation Learning
OC-space: a Unifying Perspective on Verification of Tree Ensembles
Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models
Interpreting Physics in Video World Models
Trajectory-Stabilized Inference for Diffusion-Based Video Inpainting
Universal Learning of Nonlinear Dynamics
From Generative to Episodic: Sample-Efficient Replicable RL
Natural Language Actor–Critic Is Bilevel: Learning to Reason with Textual Feedback
Optimizing KV Cache Eviction from an Output Perturbation Perspective
Recurrent Equivariant Constraint Modulation: Learning Per-Layer Symmetry Relaxation from Data
Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders
A Constrained Optimization Perspective of Unrolled Transformers
Symbal: Detecting Systematic Misalignments in Model-Generated Captions
Success-Conditioning as Policy Improvement: The Optimization Problem Solved by Imitating Success
Effective Reasoning Chains Reduce Intrinsic Dimensionality
Protein Fold Classification at Scale: Benchmarking and Pretraining
Set-Preserving Calibration from Conformal P-Values to E-Values
Generative Modeling with Probabilistic Constraints
The Theory and Practice of MAP Inference over Non-Convex Constraints
NBCG: Nash-Bargained Causal Game for Long-Tailed Multi-Label NLP
Adaptive Policy Backbone via Shared Network
Learning Adaptive Perturbation-Conditioned Contexts for Robust Transcriptional Response Prediction
Autoregressive Image Generation with Masked Bit Modeling
Foundations of Equivariant Deep Learning: Unifying Graph and Sheaf Neural Networks
STAR-VAE: Structured Topology-Aware Regularization for Audio Reconstruction and Generation
You Can Learn Tokenization End-to-End with Reinforcement Learning
Unlocking Speech–Text Compositional Powers: Instruction-Following Speech Language Models without Instruction Tuning
When Embedding-Based Defenses Fail: Rethinking Safety in LLM-Based Multi-Agent Systems
Deep Single-Index Fréchet Regression
Regression Language Models for Code
Learning Rate Annealing Improves Tuning Robustness in Stochastic Optimization
Breaking the Computational Barrier: Provably Efficient Actor–Critic for Low-Rank MDPs
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers
Synergistic Intra- and Cross-Layer Regularization Losses for MoE Expert Specialization
G-RANS: Generalizable Residual-Aware Neural Solvers for Sparse Systems
Local Redundancy: An Information-Theoretic Measure of Plasticity from Synthetic Memorization
Even Faster Kernel Matrix Linear Algebra via Density Estimation
Proximal-IMH: Proximal Posterior Proposals for Independent Metropolis–Hastings with Approximate Operators
Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes
The Labyrinth and the Thread: Rethinking Regularizations in Sequential Knowledge Editing for Large Language Models
Collaborative Threshold Watermarking
Learning Anisotropic Value Geometry with Finsler Reinforcement Learning
Influence-Disentangled Federated Training: Learning Models That Are Easy to Unlearn
Multi-Round Human–AI Collaboration with User-Specified Requirements
Dense associative memory for Gaussian distributions
Geometric Coherence Learning for Structuring Value Functions in Plain MDPs
RECTOR: Masked Region-Channel-Temporal Modeling for Affective and Cognitive Representation Learning
Taming I2V models for Image HOI Editing: A Cognitive Benchmark and Agentic Self-Correcting Framework
From Individual Calibration to Reliable Classifiers: ALD Parameterization with mPAIC Guarantees
SecCodePRM: A Process Reward Model for Code Security
Higher-Order Certified Robustness for Regression
Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression
SurrogateSHAP: Training-Free Contributor Attribution for Text-to-Image (T2I) Models
MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems
Omni-fMRI: A Universal Atlas-Free fMRI Foundation Model
Collaborative and Efficient Fine-tuning: Leveraging Task Similarity
Dissecting Quantization Error: A Concentration-Alignment Perspective
Align Forward, Adapt Backward: Closing the Discretization Gap in Logic Gate Networks
Learning-Guided Integration Contours Construction for Fast Large-Scale Generalized Eigensolvers
Thinned Mean Field Langevin Dynamics
Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents
The Pareto-optimal Trade-off between Regret and Statistical Inference in Linear Stochastic Bandits under Safety Constraints
Probability of Matching for Batch Multi-Objective Bayesian Optimization
BLIPs: Bayesian Learned Interatomic Potentials
PlotCraft: Pushing the Limits of LLMs for Complex and Interactive Data Visualization
Beyond Gemini-3-Pro: Revisiting LLM Routing and Aggregation at Scale
RapTB: Rooted Absorbed Trajectory Balance with Submodular Replay for Stable Autoregressive GFlowNet Training
MaxSAT-Based Compression for Tsetlin Machines
Continuous Variable Hamiltonian Learning at Heisenberg Limit via Displacement-Random Unitary Transformation
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning
Temporal-Emerged Prompting for Segment Anything in Multiframe Infrared Small Target Detection
GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
GAUSS: Graph-Assisted Uncertainty Quantification using Structure and Semantics for Long-Form Generation in LLMs
Dynamic Linear Attention
PyVision-RL: Forging Open Agentic Vision Models via RL
Beyond the Trade-off: Unifying Fairness and Performance in Federated Learning
Ensembling Sparse Autoencoders
Towards Robust Human-AI Complementarity under Uncertainty
PGS: Effective LLM Code Refinement via Property-Oriented and Structurally Minimal Feedback
Set Diffusion: Interpolating Token Orderings between Autoregression and Diffusion for Fast and Flexible Decoding
Multi-Agent Reinforcement Learning with Submodular Reward
Biases in the Blind Spot: Detecting What LLMs Fail to Mention
Approximate Equivariance via Projection-Based Regularisation
Streaming Sliced Optimal Transport
A Factorized Low-Rank RNN Framework for Uncovering Independent Neural Latent Dynamics and Connectivity
Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps
Convergent World Representations and Divergent Tasks
A Studentized Spherical Harmonics–Based Nonparametric Two-Sample Test for Compositional and Directional Data
BFTS: Thompson Sampling with Bayesian Additive Regression Trees
How do LLMs Compute Verbal Confidence?
Online Continual Learning with Dynamic Label Hierarchies
Decomposing Query-Key Feature Interactions Using Contrastive Covariances
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn’t)
Learning-augmented Rent-or-Buy with a Sample
Return of Frustratingly Easy Unsupervised Video Domain Adaptation
Clipping Low-Probability Tokens in SFT Yields a Generalizable Initialization for RL
ANCHOR: Automated Alignment Auditing for CLI Agents on Real-World Harm
SymSpectra: Symmetric Information Bottleneck Framework for Molecular Structure Recognition under Imbalanced Settings
On the Relationship Between Activation Outliers and Feature Death in Sparse Autoencoders
From Prior to Pro: Efficient Skill Mastering via Distribution Contractive RL Finetuning
A Time-Reparameterized Cumulative Intensity Extrapolation Sampler for Discrete Flow Matching
Flow Sampling : Learning to Sample from Unnormalized Densities via Denoising Conditional Processes
MiniX: Mitigating Low-Rank Collapse and Attention Bottlenecks in Tabular Foundation Models
Betting on Equilibrium: Monitoring Strategic Behavior in Multi-Agent Systems
EasyBalance: Cross-Layer Load Balancing in Distributed MoE Inference
Efficient Learned Image Compression without Entropy Coding
Inference-Time Forward-Process Alignment in Diffusion Models
PRISM: Perception Reasoning Interleaved for Sequential Decision Making.
Unitary Convolutions for Message-passing and Positional Encodings on Directed Graphs
Can vision language models learn intuitive physics from interaction?
Evaluating and Rewarding LALMs for Expressive Role-Play TTS via Mean Continuation Log-Probability
What Does Thompson Sampling Optimize?
SAQNN: Spectral Adaptive Quantum Neural Network as a Universal Approximator
Deep Networks Learn Deep Hierarchical Models
RN-D: Discretized Categorical Actors with Regularized Networks for On-Policy Reinforcement Learning
Steer Where It Matters: Token-Level Visual-Sensitivity Steering for LVLMs Hallucination Mitigation
DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation
Spectral Flow Matching: Stabilizing Stochastic GFlowNets via Frequency-Domain Regularization
Hidden in Plain Tokens: Simply Robust, Gradient-Free Watermark for Synthetic Audio
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs
Contrastive Symbolic Regression: Aligned Representations, Adaptive Prediction, and Diverse Ensembles
Curriculum-Guided Layer Scaling for Language Model Pretraining
Beyond Policy Training: Recursive Solution Search from Unannotated Videos
Lions and Muons: Optimization via Stochastic Frank-Wolfe
When AI Agents Compete for Jobs: Strategic Capabilities and Economic Dynamics of AI Labour Markets
High-accuracy and dimension-free sampling with diffusions
Error Amplification Limits ANN-to-SNN Conversion in Continuous Control
CircuitPrint: Mechanistic Circuit Fingerprints for Large Language Models
OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
Detecting Perspective Shifts in Multi-Agent Systems
Turning Stale Gradients into Stable Gradients: Coherent Coordinate Descent with Implicit Landscape Smoothing for Lightweight Zeroth-Order Optimization
An Embarrasingly Simple Way to Optimize Orthogonal Matrices at Scale
Bregman meets Lévy: Stochastic Mirror Descent with Heavy-Tailed Noise in Continuous and Discrete Time
Robust and Consistent Ski Rental with Distributional Advice
Probing the Knowledge Boundary: An Interactive Agentic Framework for Deep Knowledge Extraction
Hybrid Reinforcement Learning in Adversarial Markov Decision Processes
Local Mechanisms of Compositional Generalization
Detecting the Semantic Fixed Point: A Geometric Framework for Efficient Inference
BEAT: Tokenizing and Generating Symbolic Music by Uniform Temporal Steps
Short Chains, Deep Thoughts: Balancing Reasoning Efficiency and Intra-Segment Capability via Split-Merge Optimization
Geometry-Preserving Orthonormal Initialization for Low-Rank Adaptation in Reinforcement Learning
Multi-Agent Teams Hold Experts Back
Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning
Stochastic Gradient Methods under Heavy-Tailed Noises in Weakly Convex Optimization
CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill
Don't Ignore the Tail: Decoupling top-K Probabilities for Efficient Language Model Distillation
RSPO: Regularized Self-Play Alignment of Large Language Models
Online Packet Scheduling with Deadlines and Learning
AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation
Consistent Diffusion Language Models
X-EviProbe: Post-hoc Parameter-free Evidential Uncertainty Quantification for Frozen Graph Neural Networks
Iterated Population Based Training with Task-Agnostic Restarts
Spatial Conformal Inference through Localized Quantile Regression
Transport Clustering: Solving Low-Rank Optimal Transport via Clustering
Scalable Bayesian Inference for Nonlinear Conservation Laws
Differentiable Conformal Training for LLM Reasoning Factuality
PGT: Procedurally Generated Tasks for improving fine-grained understanding in MLLMs
Personalized Additive Modeling for Multi-level Federated Learning
Sponge Tool Attack: Stealthy Denial-of-Efficiency against Tool-Augmented Agentic Reasoning
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
NeurOCNN: A Neural-Operator-Based Model for Physiological Time Series
Goal-Conditioned Agents that Learn Everything All at Once
Simple yet Effective: Low-Rank Spatial Attention for Neural Operators
Adaptive Generation of Bias-Eliciting Questions for LLMs
MM-DeepResearch: A Simple and Effective Multimodal Agentic Search Baseline
Optimal Transport–Guided Stochastic Control for Graph Combinatorial Optimization
Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime
IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL
One LR Doesn’t Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs
Trifuse: Enhancing Attention-Based GUI Grounding via Multimodal Fusion
Can Adaptive Gradient Methods Converge under Heavy-Tailed Noise? A Case Study of AdaGrad
Leaderboard Incentives: Model Rankings under Strategic Post-Training
Overclocking Electrostatic Generative Models
Bring My Cup! Personalizing Vision-Language-Action Models with Visual Attentive Prompting
MoLoRA: Composable Specialization via Per-Token Adapter Routing
Asymptotic Theory of Iterated Empirical Risk Minimization, with Applications to Active Learning
Procedural Pretraining: Warming Up Language Models with Abstract Data
cuRegOT: A GPU-Accelerated Solver for Entropic-Regularized Optimal Transport
U-Cast: A Surprisingly Simple Frontier Probabilistic AI Weather Forecaster
SpecForge: A Flexible and Efficient Open-Source Training Framework for Speculative Decoding
Estimation of Treatment Effects Under Nonstationarity via the Truncated Policy Gradient Estimator
Evaluating Language Models in Realistic Conversational Contexts
CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Learning
Convergence Analysis of Decentralized Hessian-/Jacobian-Free Algorithm for Nonconvex Stochastic Bilevel Optimization
Global Directional Priors with Local Statistical Validation for Scalable Causal Discovery
Explanations are a Means to an End: A Value of Information Framework for Validating Explanations
Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations
Training Deep Spiking Neural Networks without Normalization
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer
NanoQuant: Efficient Sub-1-bit Quantization of Large Language Models
Ubiquity of Homeostatic Hebbian Dynamics in Regularized Learning
Measuring Intent Comprehension in LLMs
Transformers Efficiently Perform In-Context Logistic Regression via Normalized Gradient Descent
MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models
Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning
Stochastic Neural Ray Tracing for Radio Frequency Channel Modeling
Instance-Level Costs for Nuanced Classifier Evaluation
CCLRec: Consensus-driven Contrastive Learning for LLM-enhanced Graph Recommendation
How high is ‘high’? Rethinking the roles of dimensionality in topological data analysis and manifold learning
Motion Planning in Compressed Representation Spaces
SVL: Goal-Conditioned Reinforcement Learning as Survival Learning
Particle Flow for Learning from Label Proportions
Bayesian Tensor Decomposition with Diffusion Model Prior
Reason, Then Re-reason: Cross-view Revisiting Improves Spatial Reasoning
Riemannian MeanFlow
Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated Negotiation
Empirical Gaussian Processes
RL4RLA: Teaching ML to Discover Randomized Linear Algebra Algorithms through Curriculum Design and Graph-based Search
What Does Preference Learning Recover from Pairwise Comparison Data?
Content-Style Identification via Differential Independence
Generalization Bounds for Discrete Diffusion: Statistical Advantage of Masking
A Close Look at Negative Label Guided Out-of-distribution Detection in Pre-trained Vision-Language Models
Improved Algorithms for Nash Welfare in Linear Bandits
Hyperbolic neural population geometry benefits computation
Shuffling-Aware Optimization for Private Vector Mean Estimation
PolicyGuard: Towards Test-time and Step-level Backdoor Defense for Reinforcement Learning Agent
effGen: Enabling Small Language Models as Capable Autonomous Agents
PACE: Parameter Change for Unsupervised Environment Design
Training-Free Distribution Adaptation for Diffusion Models via Maximum Mean Discrepancy Guidance
FUSE: Quantifying Uncertainty in Multimodal LLMs by Bayesian Fusing Epistemic and Aleatoric Uncertainty
Negative Sampling From the Ground Up: A Redesign for Graph-based Recommendations
Doubly Regularized Markov Decision Processes for Robust Reinforcement Learning
State Space Model with Continuous Limit of HiPPO Matrix: Eigenvalue Analysis and Explicit Solution Formula
Causal Attention with Lookahead Keys
Beyond Additive Decompositions: Interpretability Through Separability
LLM-MatLogic: Executable Exchange Contracts for Knowledge-Graph Query Answering with Scoped Negation
Pushing Forward Pareto Frontiers of Proactive Agents with Behavioral Agentic Optimization
No Global Plan in Sight: Uncover the Myopic Planning Horizon of LLMs
ArborKV: Structure-Aware KV Cache Management for Scaling Tree-based LLM Reasoning
Improved Convergence Analysis of Topology Dependence in Decentralized SGD
Neural–Evolutionary Symbolic Regression with Global Constraints: Constraint-Aware Decoding and Reward Shaping
An Exponential Separation Between Quantum and Quantum-Inspired Classical Algorithms for Linear Systems
Robust Parallel Diffusion Sampling via Dynamic Jacobian Bandwidth
FOVI: A biologically-inspired foveated interface for deep vision models
Turbo Connection: Reasoning as Information Flow from Higher to Lower Layers
VFMF: Dense Forecasting by Generating Foundation Model Features
What Does Flow-Matching Bring to TD-Learning?
HieraMAS: Optimizing Intra-Node LLM Mixtures and Inter-Node Topology for Multi-Agent Systems
Learning syntax without semantics: Disentangled tiny language models
MINT: Minimal Information Neuro-Symbolic Tree for Objective-Driven Knowledge-Gap Reasoning and Active Elicitation
Scalable Option Learning in High-Throughput Environments
ABSINT-AI: Agentic Heap Abstractions for Abstract Interpretation
Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection
Weight-sparse transformers have interpretable circuits
Propose, Solve, Verify: Self-Play Through Formal Verification
DeltaEvolve: Accelerating Scientific Discovery through Momentum-Driven Evolution
When LLMs Develop Languages: Symbolic Communication for Efficient Multi-Agent Reasoning
LORD-GoF: A Robust Online Detection Approach for LLM Watermarks in Sparse and Mixed Streams
Decomposing Out-of-Distribution Error in Conditional Flow Matching via Wasserstein Geometry
Outcome-Based Rewards Do Not Guarantee Faithful and Verifiable Reasoning
Expected Return Causes Outcome-Level Mode Collapse in Reinforcement Learning and How to Fix It with Inverse Probability Scaling
When Is Symbolic Regression Tractable?
InvGNN: Learning Invertible Node Representations on Graphs
On the Epistemic Uncertainty of Overparametrized Neural Networks
Learning the Neighborhood: Contrast-Free Multimodal Self-Supervised Molecular Graph Pretraining
SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
Separating representation from reconstruction enables scalable text encoders
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
Deriving Neural Scaling Laws from the Statistics of Natural Language
Probing the Geometry of Diffusion Models with the String Method
Anatomy of Massive Activations and Attention Sinks
Clipped Q-Learning: Your Value Clipping Is Secretly A Robust Operator
From Prompts to Tokens: Internalizing Causal Supervision in Vision-Language Model for Multi-Image Causal Reasoning
Joint Navigation and Manipulation Planning with 3D Interaction Chains
Learning Flexible Generalization in Video Quality Assessment by Bringing Device and Viewing Condition Distributions
BPL: Generalizable Deepfake Detection via Bias-only Pair-aware Learning
Normalizing Flows with Iterative Denoising
Neural Collapse by Design: Learning Class Prototypes on the Hypersphere
Expectation Alignment of Language Models for Real-World User Expectations
Eating for a Sustainable Planet: Personalized Sustainable Diet Recommendation via Constraint-Aware Decision-Making Modeling
Causal-JEPA: Learning World Models through Object-Level Latent Interventions
Cooperative variance estimation and Bayesian neural networks disentangle aleatoric and epistemic uncertainties
Boosting Monocular Metric Depth Estimation via Bokeh Rendering
Quantized Maximum Likelihood Estimation under Normal Mean-Variance Mixture Model
Dependence-Aware Label Aggregation for LLM-as-a-Judge via Ising Models
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind
OServe: Accelerating LLM Serving via Spatial-Temporal Workload Orchestration
Long-Context Modeling with Dynamic Hierarchical Sparse Attention for Memory-Constrained LLM Inference
LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
Learning What to Generate: A Reinforcement Learning-based Closed-Loop Augmentation Framework for Person Re-identification
TreePO: Enhancing Policy Efficacy and Inference Efficiency with Tree Modeling
UniSparse: Combining Weight Pruning and Spike Sparsification in Spiking Neural Networks
ExSkill: Continual Learning from Experience and Skills in Multimodal Agents
Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards
Mesh Field Theory: Port–Hamiltonian Formulation of Mesh-Based Physics
Optimal Unconstrained Self-Distillation in Ridge Regression: Strict Improvements, Precise Asymptotics, and One-Shot Tuning
Covariance estimation using Markov chain Monte Carlo
Corrigibility Transformation: Constructing Goals That Accept Updates
Do-Prompt: Causal Interventions Meet Variational Prompt Bottlenecks
Unifying Masked Diffusion Models with Various Generation Orders and Beyond
Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism
Trajectory-Level Data Augmentation for Offline Reinforcement Learning
Steal the Patch Size: Adversarially Manipulate Vision Language Models
Magnitude Distance: A Geometric Measure of Dataset Similarity
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
Formalizing Learning from Language Feedback with Provable Guarantees
Ambient Dataloops: Generative Models for Dataset Refinement
Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards
Query-efficient model evaluation using cached responses
Conditional Clifford-Steerable CNNs for PDE Modeling
Latent Forcing: Reordering the Diffusion Trajectory for Pixel-Space Image Generation
Near-Optimal Convergence of Accelerated Gradient Methods under Generalized and $(L_0,L_1)$-Smoothness
Compact Conformal Subgraphs
Inference-Aware Meta-Alignment of LLMs via Non-Linear GRPO
Geometry-Aware Tabular Diffusion
Performative Policy Gradient: Optimality in Performative Reinforcement Learning
PPDL: LLM-Based Flows as Probabilistic Programs
Very Efficient Listwise Multimodal Reranking for Long Documents
Learning Treatment Allocations with Risk Control Under Partial Identifiability
Accelerated Dual Method for Distributed Optimization: An Inexact-Gradient View of Local Updates
X-MoGe: A Cross-Modal Adaptation Framework with Mixture-of-Experts and Geometry Guidance for Heterogeneous Collaborative Perception
Distributed Stochastic $K$-Level Optimization Over Networks
Delayed Momentum Aggregation: Communication-efficient Byzantine-robust Federated Learning with Partial Participation
CoPE: A Framework for Optimizing Coordination between Planning and Execution in LLM-based Agents
Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation
Subliminal Effects in Your Data: A General Mechanism via Log-Linearity
PhenoBrain: Phenotype-Conditioned Long-Range Communication for Multi-Modal Brain Network Analysis
Predictable Compression Failures: Order Sensitivity and Information Budgeting for Evidence-Grounded Binary Adjudication
CLARITree: Cholesky and Lookahead Accelerations for Regression with Interpretable Piecewise Linear Trees
NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search
Optimal Bayesian Stopping for Efficient Inference of Consistent LLM Answers
Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
Tailoring Strictly Proper Scoring Rules for Downstream Tasks: An Application to Causal Inference
Diversity-aware Weight Perturbation Promotes Robust Adaptation
Graph Rewiring based on Flow Alignment for Improving Fluid Simulation
Learning-Augmented Online Minimization with Dual Predictions
The Crowded Embedding Space: A Mean-Field Mechanism for Emergent Marginalization in Retrieval-Augmented Agents
Removing Sandbagging in LLMs by Training with Weak Supervision
Convergence of Two-Timescale Stochastic Approximation with Markovian Samples and Applications in Reinforcement Learning
Reasoning Models Are Test Exploiters: Rethinking Multiple Choice
Hamiltonian Asymmetric Fusion: One-Way Safe Directed Refinement under Modality Imbalance
PretrainZero: Reinforcement Active Pretraining
Geometric Convergence of Gauss–Newton for Neural Networks: Riemannian Geometry and Adaptive Damping
ACON: Optimizing Context Compression for Long-horizon LLM Agents
Model-Based Diffusion Sampling for Predictive Control in Offline Decision Making
Factorized Scheduling Principle: Learning Interpretable and Transferable Policies via Structured Additive Functions
ZeroDiff: Zero-Shot Time Series Reconstruction via Informed-Prior Diffusion
Temporal Straightening for Latent Planning
Learn to change the world: Multi-level reinforcement learning with model-changing actions
Causal Flow Q-Learning for Robust Offline Reinforcement Learning
PnP-Corrector: A Universal Correction Framework for Coupled Spatiotemporal Forecasting
Plug-and-Play Guidance for Discrete Diffusion Models via Gradient-Informed Logit Correction
Predicting the Emergence of Induction Heads in Language Model Pretraining
Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation
Adaptive Code Watermarking Through Reinforcement Learning
On Contraction of Sequential and Offset Rademacher Complexities
The Viscosity of Logic: Phase Transitions and Hysteresis in DPO Alignment
Not All Frequencies Are Equal: Energy-Adaptive Diffusion for Time Series Forecasting
SLAE: Strictly Local All-atom Environment for Protein Representation
Text Generation as Continuous Latent Dynamics via Reinforcement Learning
Efficiently Learning Drifting Halfspaces with Massart Noise
Beyond Scalar Rewards: Learning from Text Feedback in LLM Post-Training
Cerebellar-Inspired Residual Control for Fault Recovery: From Inference-Time Adaptation to Structural Consolidation
The Generalization Spectrum: A Chromatographic Approach to Evaluating Learning Algorithms
Subgroup Discovery with the Cox Model
MoSSP: A Momentum-Based Single-Loop Stochastic Penalty Method for Nonconvex Constrained DC Optimization
HodgeFlow Policy Search by Topologically Dissecting Temporal-Difference Signals in Non-Markovian Environments
Enhancing Cross-subject Emotion Recognition via Heterogeneous Distribution Augmentation and Collaborative Learning
SVRG and Beyond via Posterior Correction
Native Active Perception as Reasoning for Omni-Modal Understanding
GLARE: Scalable Neuro-Symbolic Reward Shaping for LLM Agents via Group-Level Automata
FOCUS & RePAIR: Mitigating Text Degeneration via Token-Level Guidance For Pruned Large Language Models
Partial Ring Scan: Revisiting Scan Order in Vision State Space Models
BiRQA: Bidirectional Robust Quality Assessment for Images
Multipole Semantic Attention: A Fast Approximation of Softmax Attention for Pretraining
FPTQuant: Function-Preserving Transforms for LLM Quantization
A Two-Tier Perspective on Inference-Time Parallelism in Multi-Agent LLM Systems
Learning to Correct: Reinforcement Learning for Multi-Attempt Chain-of-Thought
XSpecMesh: Quality-Preserving Auto-Regressive Mesh Generation Acceleration via Multi-Head Speculative Decoding
Extending Fair Null-Space Projections for Continuous Attributes to Kernel Methods
DSGym: A Standardized and Holistic Framework for Advancing Data Science Agents
POLCA: Stochastic Generative Optimization with LLM
COPF: An Online Framework for Deployment-Stable Counterfactual Fairness in Evolving Graphs
A Fine-Grained Understanding of Uniform Convergence for Halfspaces
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
Base Models Know How to Reason, Thinking Models Learn When
The Convergent Representation of Vision-Language Contrastive Learning: Geometry, Modality Gap and Shared Space Alignment
SparseInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation
Generation is Required for Data-Efficient Perception
Clarify Before You Draw: Proactive Agents for Robust Text-to-CAD Generation
Self Optimizing Language Models
Bridging Functional and Representational Similarity via Usable Information
Learning Taxonomic Trees with Hierarchical Representation Regularization for Large Multimodal Models
CANDI: Hybrid Discrete-Continuous Diffusion Models
The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models
Markovian Projection of Star-Shaped Diffusion for Exponential Family Distributions
StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
AudioChat: Unified Audio Storytelling, Editing, and Understanding with Transfusion Forcing
Amortized Maximum Inner Product Search with Learned Support Functions
Relevance-Based Embeddings: Lightweight Candidate Selection via Heavy Ranker Calls
Masked Multi-path Contrast with Confidence-Gated Semantic Imputation for Incomplete Multi-view Clustering
We use cookies to store which papers have been visited.
I agree
Successful Page Load
ICML uses cookies for essential functions only. We do not sell your personal information.
Our Privacy Policy »
Accept
We use cookies to store which papers have been visited.
I agree