Skip to yearly menu bar
Skip to main content
Main Navigation
ICML
Help/FAQ
Contact ICML
Create Profile
Code of Conduct
Privacy Policy
Press
Journal To Conference Track
Careers
Downloads
Inclusion
Future Meetings
My Stuff
Login
Select Year: (2026)
2026
2025
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2002
1996
IMLS Archives
Getting Started
Schedule
Tutorials
Main Conference
Invited Talks
Test of Time Award
Papers
Orals
Awards
Spotlight Posters
Position Paper Posters
Journal Track Posters
Workshops
Community
Socials
Exhibitors
Expo
Exhibitors
Organizers
Help
RocketChat Help
RocketChat Desktop Client
FAQ
Browse
Visualization
Layout:
mini
compact
topic
detail
×
No topics available
No sessions available
title
author
topic
session
shuffle
by
serendipity
bookmarked first
visited first
not visited first
bookmarked but not visited
Enable Javascript in your browser to see the papers page.
Annotations Mitigate Post-Training Mode Collapse
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Scalable Topology-Preserving Graph Coarsening: Concepts and Algorithms
Robust Bayesian Optimisation with Unbounded Corruptions
Dimension-Free Multimodal Sampling via Preconditioned Annealed Langevin Dynamics
Active Exploring like a Pigeon: Reinforcing Spatial Reasoning via Agentic Vision-Language Models
Timestep Rescheduling in Diffusion Inversion
FiX: Introducing Fine-grained Forget Gate into Softmax Attention
Breaking Manifold Continuity: Vector Quantized Modeling for Real-Centric Deepfake Detection
SpikeNet: Sparse Spike-Driven Mask Vector Transformer for Energy-Efficient and Stable Spiking Point Cloud Processing
Time-Consistent Robust Multi-Objective Reinforcement Learning via a Bellman–Isaacs Weight-Adversary Recursion
Hyper-ICL: Attention Calibration with Hyperbolic Anchor Distillation for Multimodal In-Context Learning
DOT-MoE: Differentiable Optimal Transport for MoEfication
LiteVSR: Enabling Cross-Domain Fine-Grained Detail Generation in Light-Weight Transformers for Video Super-Resolution
Magnitude Distance: A Geometric Measure of Dataset Similarity
Flow for Future: Geometric SE(3)-Equivariant Flow Matching for 3D Trajectory Prediction
QPoint: End-to-End Lightweight Point Cloud Processing via Robust Quaternion Feature Learning
Towards Understanding Steering Strength
AGZO: Activation-Guided Zeroth-Order Optimization for LLM Fine-Tuning
MAMBO-G: Magnitude-Aware Mitigation for Boosted Guidance
Single-Rollout Hidden-State Dynamics for Training-Free RLVR Data Selection
On Information Self-Locking in Reinforcement Learning for Active Reasoning
Position: We Need A Unified Definition of Hallucination (It’s The World Model, Stupid!)
FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment
Pair2Scene: Learning Local Object Relations for Procedural Scene Generation
What Does Preference Learning Recover from Pairwise Comparison Data?
Multi-Task Bayesian In-Context Learning
Two-dimensional quantization for geometry-aware audio coding
DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
Unlocking Noise-Resistant Vision: Key Architectural Secrets for Robust Models Against Gaussian Noise
Heavy-tailed Physics-Informed Neural Networks
WEVSR: Adapting Video Diffusion Generators to Real-World Video Super‑Resolution with Wavelet-Enhanced VAE Encoder
CodeChemist: Test-Time Scaling for Low-Resource Code Generation via Functional Knowledge Transfer
SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm
CAReDiO: Enhancing Cultural Alignment of LLM via Representativeness and Distinctiveness Guided Data Optimization
FedTreeLoRA: Reconciling Statistical and Functional Heterogeneity in Federated LoRA Fine-Tuning
Sparse Autoencoders for Interpretable Emotion Control in Text-to-Speech
A Machine-Learned Comorbidity Index
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
Towards Understanding Modality Interaction in Multimodal Language Models via Partial Information Decomposition
Optimal Bayesian Stopping for Efficient Inference of Consistent LLM Answers
The Silent Thought: Modeling Internal Cognition in Full-Duplex Spoken Dialogue Models via Latent Reasoning
Training-Free Distribution Adaptation for Diffusion Models via Maximum Mean Discrepancy Guidance
A Fully First-Order Layer for Differentiable Optimization
Learning Stochastic Bridges for Video Object Removal via Video-to-Video Translation
Deep Progressive Training: scaling up depth capacity of zero/one-layer models
Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook
CausalGame: Benchmarking Causal Thinking of LLM Agents in Games
UniDrag: Unified Multi-Field Prediction and Robust Shape Optimization for Vehicle Aerodynamics
Beyond Gemini-3-Pro: Revisiting LLM Routing and Aggregation at Scale
MetaMoE: Diversity-Aware Proxy Selection for Privacy-Preserving Mixture-of-Experts Unification
NeurOCNN: A Neural-Operator-Based Model for Physiological Time Series
Training-Free Hierarchical Working Memory for Small Language Model Agents
PsumQuant: In-line Post-training Partial Sum Quantizer for Energy Efficient NPU Inference
Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging
Mechanistic Anomaly Detection via Functional Attribution
Large Language Models Explore by Latent Distilling
Submodular Optimization for Minimal Augmentation in Robust Language Model Alignment
Differentially Private Synthetic Tabular Data via Private Evolution
Regulating Anatomy-Aware Rewards via Trajectory-Integral Feedback for Volumetric Computed Tomography Analysis
Shortcut-Resistant CAM Distillation for Long-Tailed Recognition
TwinQuant: Learnable Subspace Decomposition for 4-Bit LLM Quantization
Why Linear Recurrent Memory Works in Partially Observable Reinforcement Learning
TF-FACE: Time-Frequency Fusion Learning via Frequency-Domain Adaptive and Controllable Enhancement for Trajectory Prediction
Identifying Common Hubs in Multiple Gaussian Graphical Models
The Realignment Problem: When Right becomes Wrong in LLMs
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark
Weak-to-Strong Generalization via Bregman Bias–Variance Decomposition
AgentSteerTTS: A Multi-Agent Closed-Loop Framework for Composite-Instruction Text-to-Speech
Safeguarded Stochastic Polyak Step Sizes for Non-smooth Optimization: Robust Performance Without Small (Sub)Gradients
Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling
Latent Space Robust Optimization of Neural Processes with Aligned Stratified Order-Statistic Loss Reduction
Temporal Preference Optimization for Unsupervised Retrieval
TG-RAG: A Retrieval-Augmented Framework for Reasoning Guidance in Specialized Domains
PnP-Corrector: A Universal Correction Framework for Coupled Spatiotemporal Forecasting
Operationalizing the Superficial Alignment Hypothesis via Task Complexity
HilbertA: Hilbert-Curve–Aligned Sparse Attention for 2D Structured Data
QPKO: Differentiable QP-Embedded Deep Koopman Framework for Modeling Nonlinear Systems
Weight-sparse transformers have interpretable circuits
Do LLMs “Feel”? Emotion Circuits Discovery and Control
Multi-Distribution Robust Conformal Prediction
Low-dimensional topology of deep neural networks
DC-Leap: Training-Free Acceleration of dLLMs via Draft-Guided Contiguous Leaping Decoding
Outcome-Based Rewards Do Not Guarantee Faithful and Verifiable Reasoning
Belief Propagation Converges to Gaussian Distributions in Sparsely-Connected Factor Graphs
FlatLand: Personalized Graph Federated Learning via Tailored Lorentz Space
Two Calm Ends and the Wild Middle: A Geometric Picture of Memorization in Diffusion Models
Disentangling Consensus and Value-Specific Representations for Controllable Pluralistic Value Alignment of LLMs
ArcDAE: Asymmetric Rectified Contrastive Diffusion Autoencoder for Unified Representation Learning
Faster Query-Key Learning Sharpens Attention in Self-Attention Models
Towards Understanding the Dynamics of Low-Rank Adaptation
DTS: Enhancing Large Reasoning Models via Decoding Tree Sketching
RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models
Noise Tectonics: Measuring the Stability of AI Benchmark Ecosystems
RGMem: Renormalization Group–inspired Memory Evolution for Language Agents
DIVA: Harnessing the Representation Divergence in Unified Multimodal Models for Mutual Reinforcement
Improving Zero-Shot Offline RL via Behavioral Task Sampling
Envy-Free Allocation of Indivisible Goods via Noisy Queries
Efficient Rashomon Set Approximation for Decision Trees
When Is Symbolic Regression Tractable?
MedSIGHT: Towards Grounded Visual Comprehension in Medical Large Vision-Language Models
$\sigma$: Sigmoid Modulation for Ultra High Resolution Diffusion
Towards Achieving Optimal Strong Regret and Constraint Violation via Computational Efficient Model-free RL
Guaranteed Optimal Compositional Explanations for Neurons
A Risk Decomposition Framework for Pre-hoc Fine-tuning Prediction
TuneAhead: Predicting Fine-tuning Performance Before Training Begins
Overcoming the Incentive Collapse Paradox
Delving into Muon and Beyond: Deep Analysis and Extensions
The Cost of Learning under Multiple Change Points
GENEB: Why Genomic Models Are Hard to Compare
Adaptive Memory Retention in Dynamic Graphs
Agora: Toward Autonomous Bug Detection in Production-Level Consensus Protocols with LLM Agents
Learning Gaussian Mixture-distributed Prototypes for 3D Scene Graph Generation from RGB-D Sequences
SuperHype: Hypergraph Generation via Graph-Superposition Decomposition
Learning Structured Reasoning via Tractable Trajectory Control
Deriving Neural Scaling Laws from the Statistics of Natural Language
Localizing Memorized Regions in Diffusion Models via Coordinate-Wise Curvature Differences
Group Distributionally Robust Optimization-Driven RL for LLM Reasoning
Detecting Errors in AI-Generated Annotations: When and Why Semantic Neighbors Help
Motion Attribution for Video Generation
Continuous Diffusion Models Can Obey Formal Syntax
Learning Flexible Generalization in Video Quality Assessment by Bringing Device and Viewing Condition Distributions
A Solvable High-Dimensional Model Where Nonlinear Autoencoders Learn Structure Invisible to PCA While Test Loss Misaligns With Generalization
Markov Chain Monte Carlo without Evaluating the Target: an Auxiliary Variable Approach
Near-Minimax Multi-Objective RL under Predictable Adversarial Preferences and Preference-Free Exploration in Linear MDPs
Preference-Enhanced Reinforcement Learning for Pluralistic Image Inpainting
Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory
Assistive Prompt Mediation: Evaluating Language Models Under Accessibility Constraints
Neural Collapse by Design: Learning Class Prototypes on the Hypersphere
Causal-JEPA: Learning World Models through Object-Level Latent Interventions
Antidistillation Fingerprinting
Optimal conversion from Rényi Differential Privacy to $f$-Differential Privacy
Parameter Manifold Purification
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Improved Stochastic Optimization of LogSumExp
Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions
Dichotomy of Feature Learning and Unlearning: Fast-Slow Analysis on Neural Networks with Stochastic Gradient Descent
Preserving Expert-Level Privacy in Offline Reinforcement Learning
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling
Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation
Rethinking Code Complexity Through the Lens of Large Language Models
Optimal Decision-Making Based on Prediction Sets
LAGEA: Language Guided Embodied Agents for Robotic Manipulation
Learning to Watermark in the Latent Space of Generative Models
Attentive Multi-Layer Fusion for Vision Transformers
NeurVLA: Unleashing Failure-Handling Capability of Vision-Language-Action Models via Neural-Symbolic Reasoning
When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems
Understanding Truncated Positional Encodings for Graph Neural Networks
Rethinking 3D Shape Generation: Diffusion over Superquadrics
Improved Scaling Laws via Weak-to-Strong Generalization in Random Features Ridge Regression
ProOPF: Benchmarking and Improving LLMs for Professional-Grade Power Systems Optimization Modeling
Transformers with RL or SFT Provably Learn Sparse Boolean Functions, But Differently
AnyMod-LLVE: Low-Light Video Enhancement with Modality-Agnostic Inference
VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge
Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces
ProtoKV: Streaming Video Understanding under Delayed Evidence with Summary-State Memory
SmoothSpike: Spiking Transformer with Learnable Hadamard Transformation
MAS-ProVe: Understanding the Process Verification of Multi-Agent Systems
QuArch: A Benchmark for Evaluating LLM Reasoning in Computer Architecture
Learning to Remember, Learn, and Forget in Attention-Based Models
CoMem: Context Management with A Decoupled Long-Context Model
Position: Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment
Tailoring Strictly Proper Scoring Rules for Downstream Tasks: An Application to Causal Inference
Variance-Reduced Zeroth-Order Langevin Dynamics for Non-Log-Concave Black-Box Sampling and Inverse Problems
AnyCanvas: Potential Field Guidance for Training-Free Spatial Control in Text-to-Image Diffusion
Training Language Model Agents to Find Vulnerabilities with CTF-Dojo
StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars
Distributionally Robust Markov Games with Average Reward
Achieving Structurally Robust Gromov Wasserstein Distance via Adaptive Dual-Mask
With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots
FLIP2: Expanding Protein Fitness Landscape Benchmarks for Real-World Machine Learning Applications
Entropy-Aware Dynamic KV Cache Sparsification for Autoregressive Image Generation and Editing
OBJVanish: Prompt-Driven Generation of Physically Realizable 3D LiDAR-Invisible Objects
Boundary Embedding Shaping with Adaptive Contrastive Learning for Graph Structural Disentanglement
SVL: Goal-Conditioned Reinforcement Learning as Survival Learning
Decompose, Structure, and Repair: A Neuro-Symbolic Framework for Autoformalization via Operator Trees
SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-Body Manipulation
Manifold-Aware Perturbations for Constrained Generative Modeling
On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
FormalRx: Rectify and eXamine Semantic Failures in Autoformalization
ImgCoT: Compressing Long Chain of Thought into Compact Visual Tokens for Efficient Reasoning of Large Language Model
From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models
From Lyapunov Analysis to Algorithm Design in two-sided PL Minimax Optimization
DualTimesField: Rethinking Time Series as Continuous-Time Trends and Events
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
Bi-Anchor Interpolation Solver for Accelerating Generative Modeling
Physically-Guided Data-Space Rectified Flow for Precipitation Nowcasting
Eliminating Solution Bias in Differentially Private Optimization
Allocating Variance to Maximize Expectation
XDomainBench: Diagnosing Reasoning Collapse in High-Dimensional Scientific Knowledge Composition
Data Augmentation of Contrastive Learning is Estimating Positive-incentive Noise
Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation
FlowSeg: Dynamic Semantic Guidance for LLM-Conditioned Segmentation
Coverage, Not Averages: Semantic Stratification for Trustworthy Retrieval Evaluation
Off-Policy Evaluation for Missingness-Aware Policies in MDPs with Rewards Missing Not at Random
Where Rectified Flows Leak: Characterizing Membership Signals Along the Interpolation Path
What Linear Probes Miss: Multi-View Probing for Weight-Space Learning
PyHealth 2.0: A Comprehensive Open-Source Toolkit for Accessible and Reproducible Clinical Deep Learning
Towards Reliable Marking and Verification of AI-Generated Text via Geometry-aware Sentence-level Watermarking
DroneDINO: Towards Heterogeneous Routed Mixture of Experts for Drone-based Unified Object Detection
GI-GCN: Global Interacted Graph Convolutional Networks via Dominant Sets for Graph Classification
DISSOLVR: An Interpretable and Fast Framework for Aqueous and Organic Solubility Prediction
ChartE$^{3}$: A Comprehensive Benchmark for End-to-End Chart Editing
Efficient Continuous-Depth Modeling with GRU Equivalents
AlienLM: Alienization of Language for API-Boundary Privacy in Black-Box LLMs
Exploring Motif-based Heterogeneous Graph Learning for ReDoS Detection
A Geometric Lens on Physics-Aligned Data Compression
Learning Self-Interpretation from Interpretability Artifacts: Training Lightweight Adapters on Vector-Label Pairs
KUMA: A Novel Framework with Koopman Separation and Efficient Multilevel Extraction in Time Series Forecasting
HyperPotter: Spell the Charm of High-Order Interactions in Audio Deepfake Detection
Factored Classifier-Free Guidance
Memory-Efficient LLMs Training with Dynamic Sparsity: From Stability to Practical Scaling
Equivariant Neural Networks for General Linear Symmetries on Lie Algebras
Near-optimal and Efficient First-Order Algorithm for Multi-Task Learning with Shared Linear Representation
Local Redundancy: An Information-Theoretic Measure of Plasticity from Synthetic Memorization
Dynamic Optimizations of LLM Ensembles with Two-Stage Reinforcement Learning Agents
Beyond Scalar Rewards: Learning from Text Feedback in LLM Post-Training
Selective Concept Bottleneck Models Without Predefined Concepts
Rethinking Pretraining Data Detection for LLMs: From Local to Global
Conformal Calibration Transfer
Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs
Reward Learning through Ranking Mean Squared Error
Benchmarking LLM-Assisted Blue Teaming via Standardized Threat Hunting
Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos
RSAgent: Learning to Reason and Act via Multi-Turn Tool Invocations for Text-Guided Segmentation
Symbal: Detecting Systematic Misalignments in Model-Generated Captions
Unlearning Isn’t Forgetting: Revealing Hidden Leakage in Class Unlearning Evaluations
Hair-Trigger Alignment: Black-Box Evaluation Cannot Guarantee Post-Update Alignment
Evaluating Object-Centric Models beyond Object Discovery
SpreadsheetArena: Decomposing Preference in LLM Generation of Spreadsheet Workbooks
SORA: Free Second Order Attacks in Fast Adversarial Training
Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
Ambiguous Strategic Classification
Position: Uncertainty is a Strategic Signal in Human–AI Decision Making
Position: In Defense of Information Leakage in Concept-based Models
Flowers: A Warp Drive for Neural PDE Solvers
Robustifying Vision-Language Models via Test-Time Prompt Adaptation
Cold-Start Personalization via Training-Free Priors from Structured World Models
Self-Calibrated Consistency can Fight Back for Adversarial Robustness in Vision-Language Models
Mixture of Concept Bottleneck Experts
Welfare-Optimal Classification with Accuracy Auctions
Diffuse to Detect: Bi-Level Sample Rebalancing with Pseudo-Label Diffusion for Point-Supervised Infrared Small-Target Detection
Toward Cybersecurity-Expert Small Language Models
Temporal Difference Calibration in Sequential Tasks: Application to Vision-Language-Action Models
4DPC$^2$hat: Towards Dynamic Point Cloud Understanding with Failure-Aware Bootstrapping
Transformers learn factored representations
Learning to Decode Against Compositional Hallucination in Video Multimodal Large Language Models
From Individual Calibration to Reliable Classifiers: ALD Parameterization with mPAIC Guarantees
Higher-Order Certified Robustness for Regression
Adaptive Volumetric Mechanical Property Fields Invariant to Resolution
Continual Segmentation under Joint Nonstationarity
Mitigating Conversational Inertia in Multi-Turn Agents
Improving Diffusion Planners by Self-Supervised Action Gating with Energies
PRPO: Paragraph-level Policy Optimization for Vision-Language Deepfake Detection
Gradient Regularization Prevents Reward Hacking in Reinforcement Learning from Human Feedback and Verifiable Rewards
U-Cast: A Surprisingly Simple Frontier Probabilistic AI Weather Forecaster
Can LLMs Reason Like Automated Theorem Provers for Rust Verification? VCoT-Bench: Evaluating via Verification Chain of Thought
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
MODUS: Decoder-only Any-to-Any Modeling of Diverse Modalities
Latent Laplace Diffusion for Irregular Multivariate Time Series
CaliDist: Calibrating Large Language Models via Behavioral Robustness to Distraction
FACT: Fuzzy Alignment with Comorbidity Topology for Reliable Multi-Label Medical Image Diagnosis
Model Fusion via Retrofitting
A Hitchhiker's Guide to Poisson Gradient Estimation
Fault Tolerant Multi-Agent Learning with Adversarial Budget Constraints
DistMatch: Adaptive Binning via Distribution Matching for Robust Sequential Conformal Prediction
$f$-Divergence Self-Play for Tabular Anomaly Detection via Large Language Models
It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents
Can Microcanonical Langevin Dynamics Leverage Mini-Batch Gradient Noise?
Teaching Agents to Ask Effective Clarification Questions
How Do Language Models Speak Languages? A Case Study on Unintended Code-Switching
Do Vision and Text Cues Exhibit Evidential Coupling? UFO: A Benchmark for Compositional Multimodal Reasoning in Unified Models
cMoLLM at Scale: Horizontal Scaling Laws for Convolutionally-Gated Mixture-of-LLMs
Performative Policy Gradient: Optimality in Performative Reinforcement Learning
Position: When AI Decides Who Gets an Organ: Multi-Agentic AI Systems in Transplant Medicine Risk Amplifying Disparities Without Targeted Explainability and Deployment Strategies
Approximate Nearest Neighbor Search for Modern AI: A Projection-Augmented Graph Approach
Weak Diffusion Priors Can Still Achieve Strong Inverse-Problem Performance
ObjEmbed: Towards Universal Multimodal Object Embeddings
A Unifying Relational Perspective on Expressive Lottery Tickets
Language Generation with Replay: A Learning-Theoretic View of Model Collapse
Causal Detection of Multi-Step LLM Agent Attacks
InfoPO: Information-Driven Policy Optimization for User-Centric Agents
AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration
Exploring Nonlinear Pathway in Parameter Space for Machine Unlearning
MindFlow: Mind Supernet Powered Thinking Flows for Research Idea Innovation
Training with Honeypots: Reshaping How LLMs Fail
AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines
InteractComp: Evaluating Search Agents With Ambiguous Queries
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
Factored Gossip DiLoCo: Reducing Blocking Communication within DiLoCo
Adaptive Preconditioners Trigger Loss Spikes in Adam
LumiNet: Perception-Driven Knowledge Distillation via Statistical Logit Calibration
SLIM: Secure and Efficient Inference for Large Language Models on Untrusted Devices via TEEs
What Do Agents Learn from Trajectory-SFT: Semantics or Interfaces?
Unraveling Syntax: Language Modeling and the Substructure of Grammars
LoSA: Locality Aware Sparse Attention in Diffusion Language Models
DADP: Domain Adaptive Diffusion Policy
Anatomy of Massive Activations and Attention Sinks
Fair Transit Stop Placement: A Clustering Perspective and Beyond
FedReLa: Imbalanced Federated Learning via Re-Labeling
Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence
Zeroth-Order Optimization at the Edge of Stability
Reverse-Engineering Model Editing on Language Models
UB-SMoE: Universally Balanced Sparse Mixture-of-Experts for Resource-adaptive Federated Fine-tuning of Foundation Models
Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
Universal Learning of Nonlinear Dynamics
The Optimal Sample Complexity of Linear Contracts
Population-Aware Imitation Learning in Mean-field Games with Common Noise
Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens
Unlocking Speech–Text Compositional Powers: Instruction-Following Speech Language Models without Instruction Tuning
Boosting CVaR Policy Optimization with Quantile Gradients
Towards Complete Multi-Agent Coordination Policy Learning via Denoising Maximum Entropy Optimization
Scaling-Aware Adapter for Structure-Grounded LLM Reasoning
BEDTime: A Unified Benchmark for Automatically Describing Time Series
Confidence is Not Universal: Task-Dependent Calibration and Emergent Behavior in LLMs
Debate2Create: Robot Co-design via Multi-Agent LLM Debate
Federated Graph Learning via Structure-Aware Fusion Using a Kalman Framework with Learnable Dynamics
LEMUR: Learned Multi-Vector Retrieval
See First, Reason Later: Mutual Information-Guided Reinforcement Learning for Vision-Language Models
Beyond Rewards in RL for Cyber Defence
Learning More from Less: Unlocking Internal Representations for Benchmark Compression
IAPO: Information-Aware Policy Optimization for Token-Efficient Reasoning
MedMamba: Multi-View State Space Models with Adaptive Graph Learning for Medical Time Series Classification
Exact Functional ANOVA Decomposition for Categorical Inputs
Caracal: Causal Architecture via Spectral Mixing
Unifying Low Dimensional Spectra in Deep Learning
Structure-Preserving Learning Improves Geometry Generalization in Neural PDEs
Taming I2V models for Image HOI Editing: A Cognitive Benchmark and Agentic Self-Correcting Framework
Text-Conditional JEPA for Learning Semantically Rich Visual Representations
Quantifying the noise sensitivity of the Wasserstein metric for images
SoftBinary Coding: A New Information-Theoretic Paradigm for Neural Compression via Fast Channel Simulation
MEC: Machine-Learning-Assisted Generalized Entropy Calibration for Semi-Supervised Mean Estimation
TVDRNet: Text-driven Viewpoint Optimization via Differentiable Rendering for 3D Reasoning Segmentation
Rationality Measurement and Theory for Reinforcement Learning Agents
Proteo-R1: Thinking Foundation Models for De Novo Protein Binder Design
Unifying Stacking and Cascading for Efficient Ensemble Inference
Privacy Risks of Agentic Inferential Capabilities in Data Linkage Attacks
Configurable Reward Model for Balanced Safety Alignment
Dynamic Compression Flows for Neuroscience Data
DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning
Dismantling the Illusion of Vision-Language-Action Models Competence via Explicit Distributional Shifts
How does information access affect LLM monitors' ability to detect sabotage?
SpikingLM: Towards Fully Spiking Language Model
FUSE: Full‑spectrum Unlearnable Examples via Spectral Equalization
Balancing Understanding and Generation in Discrete Diffusion Models
OXE-AugE: A Large-Scale Robot Augmentation of OXE for Scaling Cross-Embodiment Policy Learning
FlashSinkhorn: IO-Aware Entropic Optimal Transport on GPU
Turbo Connection: Reasoning as Information Flow from Higher to Lower Layers
Probabilistic Bisection Algorithm Provably Achieves Exponential Convergence
FunPhase: A Periodic Functional Autoencoder for Motion Generation via Phase Manifolds
Emergence of Hierarchical Emotion Organization in Large Language Models
VENOMREC: Cross-Modal Interactive Poisoning for Targeted Promotion in Multimodal LLM Recommender Systems
CSD: Content-aware Speculative Decoding for Efficient Image Generation
ECA: Efficient Continual Alignment for Open-Ended Image-to-Text Generation.
Towards Efficient and Expressive Offline RL via Flow-Anchored Noise-conditioned Q-Learning
Probability of Matching for Batch Multi-Objective Bayesian Optimization
Accelerated and Stable Convergence with Anchored Generalized Optimistic Method
Reasoning Compartmentalization: Bridging the Concretization Gap via Abstraction-based Routing
Robust Contextual Optimization with Missing Covariates
Smooth Dynamic Cutoffs for Machine Learning Interatomic Potentials
Bottleneck Communication Delay Minimization for Communication-Efficient Decentralized Learning
STORM: Segment, Track, and Object Re-Localization from a Single Image
GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning
Discontinuous Galerkin Neural Operator for Pathology Defocus Deblurring
LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents
Position: Robust AI Personalization Will Require a Human Context Protocol
Sampling from Your Language Model One Byte at a Time
T-Edit: Triple-Branch Diffusion Anchoring for Consistent Editing
Newton-coupled Dual-Teacher Semi-supervised Learning Framework
Hidden in Plain Tokens: Simply Robust, Gradient-Free Watermark for Synthetic Audio
Self-CriTeach: LLM Self-Teaching and Self-Critiquing for Improving Robotic Planning via Automated Domain Generation
Learning in the Fisher Subspace: A Guided Initialization for LoRA Fine-Tuning
Beyond the Trade-off: Unifying Fairness and Performance in Federated Learning
Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation
Knowing When to Quit: A Principled Framework for Dynamic Abstention in LLM Reasoning
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
Position: AI Usage Policies Should Be Aligned with International Human Rights Law
Principled Synthetic Data Enables the First Scaling Laws for LLMs in Recommendation
Towards Foundation Models for Zero-Shot Time Series Anomaly Detection: Leveraging Synthetic Data and Relative Context Discrepancy
From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG
MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio
Optimal Pricing for Data-Augmented AutoML Marketplaces
MetaDNS: Enhancing Exploration in Discrete Neural Samplers via Metadynamics
BuildArena: A Physics‑Aligned Interactive Benchmark of LLMs for Engineering Construction
A Factorized Low-Rank RNN Framework for Uncovering Independent Neural Latent Dynamics and Connectivity
RaGEP: Rank-aware Geometric Expert Pruning for Mixture-of-Experts Language Models
InteractScience: Programmatic and Visually-Grounded Evaluation of Interactive Scientific Demonstration Code Generation
Formalizing and Falsifying Causal Pathways of Rare Events
Noisy Pairwise-Comparison Random Search for Smooth Nonconvex Optimization
Fast and Expressive Multi-Byte Prediction with Probabilistic Circuits
Is Your Diffusion Sampler Actually Correct? A Sampler-Centric Evaluation of Discrete Diffusion Language Models
Do Text Edits Generalize to Visual Generation? Benchmarking Cross-Modal Knowledge Editing in UMMs
Budgeted Active Experimentation for Treatment Effect Estimation from Observational and Randomized Data
Attention Sinks in Diffusion Transformers: A Causal Analysis
Dynamic Programming for Epistemic Uncertainty in Markov Decision Processes
S3Audio: Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
TRACER: Trajectory Risk Aggregation for Critical Episodes in Agentic Reasoning
Temporal Weighted Encoding: Towards Maximal-Capacity Spike Coding for ANN–SNN Conversion
How Does the Pretraining Distribution Shape In-Context Learning? A Fundamental Trade-Off
Sheaf Neural Networks on SPD Manifolds: Second-Order Geometric Representation Learning
Automatic Pruning Discovery for Large Language Models
SAQNN: Spectral Adaptive Quantum Neural Network as a Universal Approximator
Faster Than Flash: Exploiting Attention Sparsity for Efficient Long-Context Decoding
Semantic Impact–Driven Visual Scheduling in Vision-Language Models
$\alpha$-PFN: Fast Entropy Search via In-Context Learning
Differentially Private Range Subgraph Counting
Rubric Curriculum RL: Exploiting the Generation-Verification Gap in Creative Writing
Proximal Decoding: Provably Reducing Copyright Risk for Any Language Model
PRISM: Distribution-free Adaptive Computation of Matrix Functions for Accelerating Neural Network Training
ANTiC: Adaptive Neural Temporal In Situ Compressor
Enhancing Numerical Prediction in LLMs via Smooth MMD Alignment
WhisperSplat: Lossless Steganography in 3D Gaussian Splatting
Conditional Coverage Diagnostics for Conformal Prediction
Reward Hacking Benchmark: Measuring Exploits in LLM Agents with Tool Use
Let EEG Models Learn EEG
Spectral Flow Matching: Stabilizing Stochastic GFlowNets via Frequency-Domain Regularization
User-Aware Active Knowledge Acquisition for Emotional Support Dialogue
Trust Region Masking for Long-Horizon LLM Reinforcement Learning
A Tale of Two Problems: Multi-Task Bilevel Learning Meets Equality Constrained Multi-Objective Optimization
SpaEF: Spatially Resolved Transcriptomics Data Element-Wise Denoising Framework Powered by Large Models
Bridging Structure and Semantics: Uncertainty-Modulated Dual-Path Diffusion for Robust Text-Attributed Graph Learning
Phase-Type Variational Autoencoders for Heavy-Tailed Data
Scaling Prompt Synthesis for Large Language Model Reasoning
IndexMem: Learned KV-Cache Eviction with Latent Memory for Long-Context LLM Inference
Epistemic Gain, Aleatoric Cost: Uncertainty Decomposition in Multi-Agent Debate for Math Reasoning
Learning Generalized Label Distributions
Full-Batch Gradient Descent Outperforms One-Pass SGD: Sample Complexity Separation in Single-Index Learning
The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL
LS$^{2}$MC-GDA: A Smoothed Algorithm for Federated Stochastic Compositional Minimax Optimization
ASAP: Exploiting the Satisficing Generalization Edge in Neural Combinatorial Optimization
Optimizing Few-Step Generation with Adaptive Matching Distillation
Sequential Group Composition: A Window into the Mechanics of Deep Learning
TIME: Tensor-Factorized Mixture-of-Experts with Intrinsic Routing for Lifelong Multimodal Knowledge Editing
Tuning-Free One-Class Discriminant Learning for Tabular Anomaly Detection
Beyond Buffer Limits: Energy-Based Data Reassembly for Continual Learning
You Need Better Attention Priors
Unveiling the Potential of Quantization with MXFP4: Strategies for Quantization Error Reduction
Maximizing mutual information between prompt and response improves LLM performance with no additional data
Offline Multi-Agent Reinforcement Learning via Sequential Score Decomposition
Training Data Efficiency in Multimodal Process Reward Models
Density-Guided Continuous Flow for Robust Counterfactual Explanations
Distilling Neuro-Symbolic Programs into 3D Multi-modal LLMs
Hierarchical Representations for Cross-task Automated Heuristic Design using LLMs
Short Chains, Deep Thoughts: Balancing Reasoning Efficiency and Intra-Segment Capability via Split-Merge Optimization
Hyperparameter Transfer Laws for Non-Recurrent Multi-Path Neural Networks
Can VLMs Diagnose and Recover from VLA Manipulation Faults?
Towards Fair Sequential Decision-Making: A Causal Decomposition Approach
Temporal Context Reinstatement Drives Episodic-Like Order Memory in Long-Context Language Models
Evaluating Contextual Illegality: AI Compliance in Corporate Law Scenarios
High-Dimensional Sensitivity Analysis for Genomic Studies: An Adversarial Framework for Learning Worst-Case Latent Confounders
InfoLaw: Information Scaling Laws for Large Language Models with Quality-Weighted Mixture Data and Repetition
Path-Coupled Bellman Flows for Distributional Reinforcement Learning
Beyond Pixels: Mining Compressed Domain Artifacts for Efficient AI-Generated Video Detection
Target-Oriented Pretraining Data Selection via Neuron-Activated Graph
Personalized Additive Modeling for Multi-level Federated Learning
CAMP: Coherent Alignment of Multimodal Prototypes for Explainable Complementary Learning
On the Interaction of Batch Noise, Adaptivity, and Compression, under $(L_0,L_1)$-Smoothness: An SDE Approach
Simple yet Effective: Low-Rank Spatial Attention for Neural Operators
From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping
Optimal Transport–Guided Stochastic Control for Graph Combinatorial Optimization
Differential Smoothing Mitigates Sharpening and Improves LLM Reasoning
Learning Interpretable Options by Identifying Reward Diffusion Bottlenecks in Reinforcement Learning
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos
Optimal Fair Aggregation of Crowdsourced Noisy Labels using Demographic Parity Constraints
SPR-RAFT: Parameter-Efficient Regression-Aware Fine-Tuning for Biomedical LLM Regression
From Pixels to Tokens: A Systematic Study of Latent Action Supervision for Vision-Language-Action Models
Light Up Your Face: A Physically Consistent Dataset and Diffusion Model for Face Fill-Light Enhancement
Model-Preserving Adaptive Rounding
Seeing Without Understanding: Disentangling Perception, Reasoning, and Simulation in VLM Gameplay
SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs
Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games
Optimal Splitting of Language Models from Mixtures to Specialized Domains
Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models
Taking the GP Out of the Loop
Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning
Certified Robustness under Heterogeneous Perturbations via Hybrid Randomized Smoothing
Gromov-Wasserstein at Scale, Beyond Squared Norms
When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation
Speedup Patch: Learning a Plug-and-Play Policy to Accelerate Embodied Manipulation
Planar Symmetric Pattern Generation
Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Can We Build a Monolithic Model for Fake Image Detection? SICA: Semantic-Induced Constrained Adaptation for Unified-Yet-Discriminative Artifact Feature Space Reconstruction
TrustworthyQENN: A Quantum Evidential Neural Network Based on Complex-Valued Contrastive Learning for Uncertainty Pattern Classification
Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Abductive Reasoning with Probabilistic Commonsense
Demystifying Entropy Control in LLM RL Training: Theoretical Analysis and Dynamic Scheduling
AugMask: Score-Based Generative Modeling of Incomplete Tabular Data via Augmentation and Masking
Robust Vision-Language Models via Manifold-Adversarial Adapters
Language Model Circuits Are Sparse in the Neuron Basis
MADE: Benchmark Environments for Closed-Loop Materials Discovery
VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency
Near-Universal Multiplicative Updates for Nonnegative Einsum Factorization
Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models
Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation
Do Neural Operators Forget Geometry? The Forgetting Hypothesis in Deep Operator Learning
Insertion Based Sequence Generation with Learnable Order Dynamics
Structured Multi-modal Graph Disentanglement for Psychiatric Diagnosis
Spurious Correlation Learning in Preference Optimization: Mechanisms, Consequences, and Mitigation via Tie Training
Selective Disclosure Watermarking for Large Language Models
Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps
Persuasive Privacy
SMILE: Extended Deep Submodular Function-Based Instruction and In-context Learning Demonstration Selection
More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing
Post-Hoc Merging is Not Enough: Many-Shot Model Merging with Loss-Gap Balancing
Semantic Robustness Certification for Vision-Language Models
Monotonic Variational Gaussian Process for Efficient Data Collection
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior
DAISI: Data Assimilation with Inverse Sampling using Stochastic Interpolants
Resolution as a Direction: Vector-Panning Feature Alignment for Cross-Resolution Re-Identification
AliMark: Enhancing Robustness of Sentence-Level Watermarks Against Text Paraphrasing
PACE: Parameter Change for Unsupervised Environment Design
Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
Betting on Predictions
Are Large Reasoning Models Interruptible?
SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents
Momentum Further Constrains Sharpness at the Edge of Stochastic Stability
Bayes-inspired Integration of Pretrained Priors and Few-Shot Evidence for Few-Shot Classification
Video-SVD: Efficient Video Diffusion via Orthogonal Basis Composition
Scaling Unsupervised Multi-Source Federated Domain Adaptation through Group-Wise Discrepancy Minimization
Time-series forecasting through the lens of dynamics
Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control
EnerGS: Energy-Based Gaussian Splatting under Partial Geometric Observability
CARE: Adaptive Calibration for Reliable Recommendations
On the origin of neural scaling laws: from random graphs to natural language
SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer
Causal Attention with Lookahead Keys
Midtraining Bridges Pretraining and Posttraining Distributions
Improving ML attacks on LWE with data repetition and stepwise regression
Model Monotonicity in Autobidding Auctions: When Do Better Predictions Lead to Better Outcomes?
The Trojan Knowledge: Bypassing Commercial LLM Guardrails via Harmless Prompt Weaving and Adaptive Tree Search
WF-Bench: A Benchmark for Neural-Network WaveFunction Expressivity and Scaling Laws
Learning Hamiltonian Dynamics at Scale: A Differential-Geometric Approach
Learning Attribute–Affordance Hierarchies in Hyperbolic Space for Open-Vocabulary 3D Object Affordance Grounding
Robust Learning via Nested Distributionally Robust Optimization
Scalable RF Simulation in Generative 4D Worlds
Explicit representation of germline and non-germline residues improves antibody language modeling
Tree-Structured Orthonormal Decomposition of the Aitchison Simplex
BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization
Domain-Shift-Aware Conformal Prediction for Large Language Models
Hyperbolic Associative Memory Networks
Gateways to Tractability for Satisfiability in Pearl’s Causal Hierarchy
Revisiting ML Training under Fully Homomorphic Encryption: Convergence Guarantees, Differential Privacy, and Efficient Algorithms
MAFE: Enabling Equitable Algorithm Design in Multi-Agent Multi-Stage Decision-Making Systems
Self-Guidance: Enhancing Neural Codecs via Decoder Manifold Alignment
Structured Multi-step Jailbreaking under a Hamiltonian Generative Formulation
Provably Adaptive Linear Approximation for the Shapley Value and Beyond
Radial Scaling Voxelization for Accurate Small Object 3D Detection
Probabilistic Retrofitting of Learned Simulators
Calibrating Decision Robustness via Inverse Conformal Risk Control
Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning
FormalJudge: A Neuro-Symbolic Paradigm for Agentic Oversight
Position: Causality is Key for Interpretability Claims to Generalise
Pareto-Guided Optimal Transport for Multi-Reward Alignment
On the Limits of LLM Adaptability: Impact of LLM Pre-Training on Annotation Task Performance
Unleashing Implicit Rewards: Prefix-Value Learning for Distribution-Level Optimization
TriForces: Augmenting Atomistic GNNs for Transferable Representations
Beyond Extrapolation: Knowledge Utilization Paradigm with Bidirectional Inspiration for Time Series Forecasting
$V_0$: A Generalist Value Model for Any Policy at State Zero
Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains
MOD-SR: Unifying Multimodal Learning and Direct Optimization with Gradient-Guided Diffusion Model for Symbolic Regression
Curating the Future: A Scalable Recipe for Training Open-Ended Forecasters
Learning Randomized Reductions
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs
Aligning Datasets and Models for Weight Space Learning
SIGMA-PPG: Statistical-prior Informed Generative Masking Architecture for PPG Foundation Model
Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents
Building Better Deception Probes Using Targeted Instruction Pairs
Chain-of-Goals Hierarchical Policy for Long-Horizon Offline Goal-Conditioned RL
Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality
Discounted Beta-Bernoulli Reward Estimation for Sample-Efficient Reinforcement Learning with Verifiable Rewards
Rethinking Evaluation Paradigms in IBP-based Certified Training
Equilibrium Reasoners: Learning Attractors Enables Scalable Reasoning
Cerebellar-Inspired Residual Control for Fault Recovery: From Inference-Time Adaptation to Structural Consolidation
SAGE-NAS: Synergizing LLM-Based Semantic Agent with Graph-Based Evaluator for Neural Architecture Search
INFER: Learning Implicit Neural Frequency Response Fields for Confined Acoustic Environments
QuITE: Query-based Irregular Time-series Embedding
From Content to Knowledge: Lightning Fast Long-Video Understanding with Neural Knowledge Representations
LIMMT: Less is More for Motion Tracking
FOCA: Future-Oriented Conditioning for Data-Efficient Vision-Language-Action Adaptation
Evaluating bivariate causal statements based on mutual compatibility
WinQ: Accelerating Quantization-Aware Training of Large Language Models around Saddle Points
CAffNet: Hard Constraint-Affine Neural Networks
Dynamics Are Learned, Not Told: Semi-Supervised Discovery of Latent Dynamics Geometries For Zero-Shot Policy Adaptation
BeaconKV: Key-Value Cache Compression Guided by Beacon Queries for Efficient Large Reasoning Model Inference
How much can language models memorize?
RLSF-V: Mitigating Hallucinations in MLLMs via Fuzzy Semantic Self-Feedback
Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations
Semantic Editing with Coupled Stochastic Differential Equations
Adversarial Flow Models
Adaptive DNA Sequence Modeling via Synergistic Plasticity Units
Elign: Equivariant Diffusion Model Alignment from Foundational Machine Learned Force Fields
Rethinking Temporal Consistency in Video Object-Centric Learning: From Prediction to Correspondence
Align Forward, Adapt Backward: Closing the Discretization Gap in Logic Gate Networks
Tri-Scale Neural ODEs for Continuous Multi-Omics Disease Modeling
GIFT: Bootstrapping Image-to-CAD Program Synthesis via Geometric Feedback
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured Prediction
Mind the Gap: Structure-Aware Consistency in Preference Learning
A Cartesian-3j and nj Framework for Machine Learning Interatomic Potentials
Neural Honeytrace: Plug&Play Watermarking Framework against Model Extraction Attacks
Universal One-third Time Scaling in Learning Peaked Distributions
A Theoretical Framework for Modular Learning of Robust Generative Models
Optimized Deferral for Imbalanced Settings
Well-Posed KL-Regularized Control via Wasserstein and Kalman–Wasserstein KL Divergences
STABLE: Simulation-Ready Tabletop Layout Generation via a Semantics–Physics Dual System
CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation
Responsible Text-to-Image Diffusion: Interpretable and Linearly Controllable Semantics for Fair and Safe Generation
AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation
Online Learning with Recency: Algorithms for Sliding-window Streaming Multi-armed Bandits
PLANTAIN: Plan-Answer Interleaved Reasoning
On the Computational Complexity of Performative Prediction
Iterated Population Based Training with Task-Agnostic Restarts
Inverse Depth Scaling From Most Layers Being Similar
Rethinking the Flow-based Gradual Domain Adaption: A Semi-Dual Optimal Transport Perspective
TOM-SWE: User Mental Modeling For Software Engineering Agents
Quadratically Regularized Optimal Transport: Localization Bounds and Affine Case Analysis
You Don’t Need All That Attention: Surgical Memorization Mitigation in Text-to-Image Diffusion Models
Categorical Reparameterization with Denoising Diffusion models
Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models
Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts
Singularity-aware Optimization via Randomized Geometric Probing: Towards Stable Non-smooth Optimization
Learning in Structured Stackelberg Games
DTKG: Dual-Track Knowledge Graph-Verified Reasoning Framework for Multi-Hop QA
Zeroth-Order Forward-Only SNN Training Inspiring Neuromorphic On-Chip Learning
A Narrowing Geometry in Contaminated Reasoning
Less Diverse, Less Safe: The Indirect But Pervasive Risk of Test-Time Scaling in Large Language Models
Variational Entropic Optimal Transport
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards
Tackling Length Inflation Without Trade-offs: Group Relative Reward Rescaling for Reinforcement Learning
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation
iGRPO: Fast Online RL for Flow Matching Model with Dense Reward
MetaphorVU: Towards Metaphorical Video Understanding
Beyond Point Predictions: Manifold Expansion and Dual Alignment for Robust Time Series Distillation
Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation
Reasoning Can Be Restored by Correcting a Few Decision Tokens
Degradation-Aware Metric Prompting for Hyperspectral Image Restoration
Tiny Brains, Giant Impact: Uncovering the Keystone Neurons of LLM with Just a Few Prompts
Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion Models
Mitigating Reward Hacking in LLM-based Recommendation: A Preference Optimization Approach
Contrastive Weak-to-Strong Generalization
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
NExT-Guard: Training-Free Streaming Safeguard without Token-Level Labels
Enhancing Multi-Modal LLMs Reasoning via Difficulty-Aware Group Normalization
A Regret Minimization Framework on Preference Learning in Large Language Models
ConsMSA: Semantic Distribution Consistency Learning for Multimodal Sentiment Analysis
Experience Augmented Policy Optimization for LLM Reasoning
Evolving Quantitative Reasoning through Self-Play in Digital Twin Markets
Breaking the Capacity Bottleneck in Model-Heterogeneous Federated Learning via Gradual Model Restoration
Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
Adaptive Time Series Reasoning via Segment Selection
Butterworth as Attention: Anisotropic Spectral Gating for Pansharpening
Wikipedia in the Era of LLMs: Evolution and Risks
MoCo-EA: Exploiting Adversarial Mode Connectivity for Efficient Evolutionary Attacks
Reasoning Models Are Test Exploiters: Rethinking Multiple Choice
QuantWear: Quantum-scale Wear Particle Detection for Jet Engine Diagnosis
Sparse Bayesian Deep Functional Learning with Structured Region Selection
Statistically Optimal Scaling for Token Merging in Transformers
Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks
Leveraging Lineage Barcodes as Natural Augmentations for Contrastive Learning of Cell Fate in scRNA-seq Data
Systematic Failures in Collective Reasoning under Distributed Information in Multi-Agent LLMs
Mitigating Manifold Departure: Uncertainty-aware Subspace Rectification for Trustworthy MLLM Decoding
Normality Calibration in Semi-supervised Graph Anomaly Detection
Geometrically Constrained Stenosis Editing in Coronary Angiography via Entropic Optimal Transport
Reasoning-preserved Efficient Distillation of Large Language Models via Activation-aware Initialization
Probing the Geometry of Diffusion Models with the String Method
DAL: A Practical Prior-Free Black-Box Framework for Piecewise Stationary Bandits
Solving Stochastic Variational Inequalities without the Bounded Variance Assumption
Convergence Rate of the Last Iterate of Stochastic Proximal Algorithms
BFTS: Thompson Sampling with Bayesian Additive Regression Trees
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Position: Preparing for AI Systems That Deceive Developers
WarmServe: Enabling One-for-Many GPU Prewarming for Multi-LLM Serving
Unveiling the Structure of Do-Calculus Reasoning via Derivation Graphs
TABX: A High-Throughput Sandbox Battle Simulator for Multi-Agent Reinforcement Learning
Learning Coherent Representations: A Topological Approach to Interpretability
Generative Modeling of Irregular Time Series via SDE-Induced Continuous-Discrete Variational Inference
The Deterministic Horizon: When Extended Reasoning Fails and Tool Delegation Becomes Necessary
Precise Asymptotics of Bagging Regularized M-estimators
Offline Preference Optimization for Rectified Flow with Noise-Tracked Pairs
Causal Modeling of Selection in Evolution
Optimal Quantum Speedups for Repeatedly Nested Expectation Estimation
Foresee-to-Ground: From Predictive Temporal Perception to Evidence-Driven Reasoning for Video Temporal Grounding
Non-Monotonic Autoregressive Sequence Model
Lifting Traces to Logic: Programmatic Skill Induction with Neuro-Symbolic Learning for Long-Horizon Agentic Tasks
Posterior Mismatch Matters: Adversarial Training for Long-Tailed Robustness
Budget-Feasible Mechanisms for Submodular Welfare Maximization in Procurement Auctions
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
Poison with Style: A Practical Poisoning Attack on Code Large Language Models
Meta-Learning with Generalized Ridge Regression: High-dimensional Asymptotics, Optimality and Hyper-covariance Estimation
Statistical-Computational Trade-offs for Recursive Adaptive Partitioning Estimators
Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies
SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning
Discretized Density-Guided Source-Free Adaptation for Continuous Targets
Efficient Synthetic Network Generation via Latent Embedding Reconstruction
An Approximation Algorithm for Graph Label Selection
Distributionally Robust Reinforcement Learning with Human Feedback
MVP-LAM: Learning Action-Centric Latent Action via Cross-Viewpoint Reconstruction
Designing a Conditional Prior Distribution for Flow-Based Generative Models
StormInsight: Hierarchical Environmental Forcing and Vertical Coupling for Weather System Evolution
Easier to Judge than to Find: Predicting In-Context Learning Success for Demonstration Selection
EduMirror: Modeling Educational Social Dynamics with Value-driven Multi-agent Simulation
Hierarchical Filtering and Refinement Classification for Few-Shot Class-Incremental Learning
Preconditioned DeltaNet: Curvature-aware Sequence Modeling for Linear Recurrences
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Efficient and Unbiased Sampling from Boltzmann Distributions via Variance-Tuned Diffusion Models
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Counterfactual Bootstrap for Robust Meta-Reinforcement Learning
StethoLM: Audio Language Model for Cardiopulmonary Analysis Across Clinical Tasks
HypCL: Adapting CLIP in Hyperbolic Space for Continual Learning
Bad Seeing or Bad Thinking? Rewarding Perception for Multimodal Reasoning
Reasoning-Driven Synthetic Data Generation and Evaluation
Geometry-Preserving Unsupervised Alignment for Heterogeneous Foundation Models
Learning with Admissibility: Robust Fuzzy Hashing for Cross-Modal Retrieval with Noisy Labels
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
TimeMRA: LLM-Empowered Time Series Forecasting via Multi-Scale Retrieval-Augmented Representations
Correspondence Cognitive Learning for Multi-Modal Object Re-Identification
Advancing LLM Reasoning with Natural Language and Numerical Feedback
Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers
Beyond Accuracy: What Matters in Designing Well-Behaved Image Classification Models?
Inference-time optimization for experiment-grounded protein ensemble generation
Inference of Online Newton Methods with Nesterov's Accelerated Sketching
Information-Geometric Adaptive Sampling for Graph Diffusion
G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving
Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense
Prompt Reinjection: Alleviating Prompt Forgetting in Multimodal Diffusion Transformers
SAM Audio: Segment Anything in Audio
Your Latent Reasoning is Secretly Policy Improvement Operator
Identifiable Markov Switching Models with Instantaneous Effects and Exponential Families
Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization
No Data? No Problem: Robust Vision-Tabular Learning with Missing Values
Position: The Time for Sampling Is Now! Charting a New Course for Bayesian Deep Learning
mSOP-765k: A Benchmark For Multi-Modal Structured Output Predictions
FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion
RePo: Language Models with Context Re-Positioning
Characterizing Vision-Language-Action Models across XPUs: Constraints and Acceleration for On-Robot Deployment
Optimization Dynamics of Equivariant and Augmented Neural Networks
ParamMem: Augmenting Language Agents with Parametric Reflective Memory
Geometry-Aware Contrastive Learning for Few-Shot Automatic Modulation Recognition
TimeAutoDiff: A Unified Framework for Generation, Imputation, Forecasting, and Time-Varying Metadata Conditioning of Heterogeneous Time Series Tabular Data
Neural-Inspired Modeling of Auditory Selection and Compensation for Audio-Visual Speech Separation
Neural Modular Physics for Elastic Simulation
LO-BCQ: Locally Optimal Block Clustered Quantization for 4-bit (W4A4) LLM Inference
Towards Sub-second Biological Foundation Model Infrastructure: A Quantized Consistency Diffusion Framework for Molecular Docking
The Choice of Normalization Influences Shrinkage in Regularized Regression
Infinite-Precision Autoregressive Modeling for Vector Graphics and Layouts
Robust Reinforcement Learning in a Sample-Efficient Setting
Fixed Budget is No Harder Than Fixed Confidence in Best-Arm Identification up to Logarithmic Factors
Comparing Deterministic and Soft Policy Gradients for Optimizing Gaussian Mixture Actors
WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models
Exploiting Hankel-Toeplitz Structures for Fast Computation of Kernel Precision Matrices
Position: Multi-Agent Explainability Needs Contracts Before Methods
A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets
Optimization with Access to Auxiliary Information
Computationally-efficient Graph Modeling with Refined Graph Random Features
MOES-Pred: Molecular Structural Representation Learning by Adaptive Energy-Sentinel Vibration for Generalized Property Prediction
Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives
ParalESN: Enabling parallel information processing in Reservoir Computing
Concept Concentration for Faithful Representation Intervention
effGen: Enabling Small Language Models as Capable Autonomous Agents
Component-Wise Composite Likelihood Distillation for Censored Time-to-Event Data
Squeezing More from the Stream : Learning Representation Online for Streaming Reinforcement Learning
Toward More Reliable Agent Evaluation: A Component-Based Benchmark Auditing Pipeline
BM^2: Coupled Schrödinger Bridge Matching
Stabilized Supralinear Networks Learn to Switch Coding Strategies Balancing Cost and Performance
Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
Expert-guided Clinical Text Augmentation via Query-Based Model Collaboration
Explainable Federated Learning via Global–Local Attribution Alignment
Multi-Accurate CATE is Robust to Unknown Covariate Shifts
Trust-Region Diffusion Policies for Massively Parallel On-Policy RL
One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models
Identifiability of Causal Graphs under Non-Additive Conditionally Parametric Causal Models
ManifoldKV: Training-Free KV Cache Compression via Euclidean Outlier Detection
Rethinking Neural Network Learning Rates: A Stackelberg Perspective
Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
Less is More: Neuroscience-Motivated Probing for Efficient Concept Circuits Tracing
Extending Mean-Field Variational Inference via Entropic Regularization: Theory and Computation
Efficient Bayesian Inference from Noisy Pairwise Comparisons
Ekka: Automated Diagnosis of Silent Errors in LLM Inference
The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy
Probing Cross-modal Information Hubs in Audio-Visual LLMs
Collaborative likelihood-ratio estimation over graphs
[CLS] is Not Enough: Multi-Label Recognition via Patch-Level Inference and Adaptive Aggregation
(De)-regularized Maximum Mean Discrepancy Gradient Flow
Factor-Wise Homogeneity of Slot-Attention for Continual Object-Centric Learning
PhaseCoder: Microphone Geometry-Agnostic Spatial Audio Understanding for Multimodal LLMs
Online Robust Reinforcement Learning with General Function Approximation
SERA: Soft-Verified Efficient Repository Agents
PyPop7: A Pure-Python Library for Population-Based Black-Box Optimization
A model of errors in transformers
LieStoNet: Learning Lie Symmetries from Spatiotemporal Data for Stochastic Dynamical Systems
How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
CaP-X: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation
MedCoG: Maximizing LLM Inference Density in Medical Reasoning via Meta-Cognitive Regulation
Asymmetric Perturbation in Solving Bilinear Saddle-Point Optimization
MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models
GraphFLEx: Unsupervised Structure Learning $\underline{\text{F}}$ramework for $\underline{\text{L}}$arge $\underline{\text{Ex}}$panding $\underline{\text{Graph}}$s
Unified Multimodal Visual Tracking with Dual Mixture-of-Experts
Making Models Unmergeable via Scaling-Sensitive Loss Landscape
Beyond Model Base Retrieval: Weaving Knowledge to Master Fine-grained Neural Network Design
Are Your Agents Upward Deceivers?
Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences
Memory as Dynamics: Learning Reliability-Guided Predictive Models for Online Video Perception
Symmetry Reveals the In-Context Classifier: Transformers Implement Mean-Shift Dynamics
Physiology-Aware Masked Cross-Modal Reconstruction for Biosignal Representation Learning
A Direct Second-Order Method for Solving Two-Player Zero-Sum Games
Zero-Shot Text-to-Motion Evaluation using Video Language Models
What Makes Value Learning Efficient in Residual Reinforcement Learning?
Theoretical Analysis of Sparse Optimization with Reparameterization, Weight Decay, and Adaptive Learning Rate
DocHop: Benchmarking Out-of-domain Multi-hop Reasoning in Information-Dense Documents
The Illusion of Generalization: Instruction-Following, Task Bias and Contamination in Tabular Language Model Evaluation
Position: LLM-Based Social Simulations Require a Boundary
ScaleErasure: Inference-Time Minimal Intervention for Precise Concept Erasure in Next-Scale Autoregressive Image Generation
Multi-Label Test-Time Adaptation with Bayesian Conditional Priors
FakeWorld 1.0: An Omni modal Benchmark for Fake Media and Content
FUSE: Frequency-domain Unification and Spectral Energy Alignment for Multi-modal Object Re-Identification
Position: Measuring Human Preferences in RLHF is a Social Science Problem
The Expressivity Limits of Transformers
Adaptive Probe-based Steering for Robust LLM Jailbreaking
TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios
TGV-KV: Text-Grounded KV Eviction for Vision-Language Models
SleepMaMi: A Universal Sleep Foundation Model for Integrating Macro- and Micro-structures
Position: Explanation Stability Is a Property of the Model–Method Pair, Not the Model
Backward Oversmoothing: why is it hard to train deep Graph Neural Networks?
The Signal is in the Steps: Local Scoring for Reasoning Data Selection
Position: Mechanisms for Aggregated Individual Reporting Should be Established for Post-Deployment Evaluation
LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth
CauchyNet: Compact and Data-Efficient Learning using Holomorphic Activation Functions
Position: Agent Evaluation Should Be Agentified for Openness, Standardization, and Reproducibility
Controllable and explainable personality sliders for LLMs at inference time
Position: Enabling Fair Revenue Sharing for Data Providers in GenAI Systems
Confidence and Difficulty-Adaptive Policy Optimization for LLM Reasoning
VectorWorld: Efficient Streaming World Model via Diffusion Flow on Vector Graphs
Graph-Link: Bridging the Semantic-Structural Gap in Text-to-SQL via Constrained Subgraph Induction
Restoring Initial Noise Sensitivity in Text-to-Image Distillation through Geometric Alignment
Position: Generative Models Erode Temporal Learning Through Market Selection
Attend to Anything: Foundation Model for Unified Human Attention Modeling
SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot
Scaling the Prior: Size-Consistent Geometric Diffusion for 3D Molecular Generation
When Model Merging Breaks Routing: Training-Free Calibration for MoE
Position: Express Your Doubts — Probabilistic World Modeling Should not be Based on Token *logprobs*
Decoupling Variance and Scale-Invariant Updates in Adaptive Gradient Descent for Unified Vector and Matrix Optimization
Robust Strategic Classification under Decision-Dependent Cost Uncertainty
Offline Reinforcement Learning with Generative Trajectory Policies
LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs
Position: Prompting Intent Should Be Audited in LLM-Assisted Peer Review
Data Difficulty and the Generalization–Extrapolation Tradeoff in LLM Fine-Tuning
Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning
Is Vibe Coding Safe? Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks
Position: Adversarial ML for LLMs Is Not Making Any Progress
VideoSEAL: Separating Planning from Answer Authority for Agentic Long Video Understanding
PASO: Step Parallel Stochastic Optimization
Estimating the Empowerment of Language Model Agents
When Attributes Disagree: Gradient Conflict in Image Aesthetic Assessment
Efficient Diffusion Models via Time Step Optimization with Consistent Training and Inference Constraints
Adaptive Contracts for Cost-Effective AI Delegation
Do-Prompt: Causal Interventions Meet Variational Prompt Bottlenecks
Learning a Generative Meta-Model of LLM Activations
PEARL: Differentially Private and Entropy-Aware Regulated Language Generation
PostTrainBench: Can LLM Agents Automate LLM Post-Training?
Position: Predicting AI’s Impact on Labor Is a Core Machine Learning Problem
When Does Sparsity Mitigate the Curse of Depth in LLMs
Zero-source LLM Hallucination Detection with Human-like Criteria Probing
LangPrecip: Language-Aware Multimodal Precipitation Nowcasting
Predicting Future KV Utility: Global Combinatorial Optimization for Task-Agnostic KV Cache Eviction
The Structural Origin of Attention Sink: Variance Discrepancy, Super Neurons, and Dimension Disparity
PSBench: Editing Image via GUI Agents in Photoshop
Position: To Defend Against Cyber Attacks, We Must Teach AI Agents to Hack
EvReflection: Event-Driven Micro-Dynamics for Reflection Removal
Debiased Model-based Representations for Sample-efficient Continuous Control
NanoQuant: Efficient Sub-1-bit Quantization of Large Language Models
Geometry-Aware Probabilistic Circuits via Voronoi Tessellations
PULSE: Generative Phase Evolution for Non-Stationary Time Series Forecasting
RLCracker: Evaluating the Worst-Case Vulnerability of LLM Watermarks with Adaptive RL Attacks
Minibatch selection for Language Models via Partition Matroid Constrained Gradient Matching
CountsDiff: A diffusion model on the natural numbers for generation and imputation of count-based data
Relative Entropy Estimation in Function Space: Theory and Applications to Trajectory Inference
ToaSt: Token Channel Selection and Structured Pruning for Efficient ViT
CoLA: Cross-Modal Low-rank Adaptation for Multimodal Downstream Tasks
Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation
Repositioning the Subject within Image
SDiD:Shared diffusion prior for efficient distributed stereo image compression
TabICooL: A better, faster, scalable, and open tabular foundation model
Ophiuchus: Incentivizing Tool-augmented ''Think with Images'' for Joint Medical Segmentation, Understanding and Reasoning
Factored Latent Action World Models
Exact and Approximate Algorithms for Polytree Learning
DOUBT: Decoupled Object-level Understanding and Bridging via vMF-based Trustworthiness for Hallucination Detection in MLLMs
Semantic-Aware Motion Encoding for Topology-Agnostic Character Animation
LUGS: Latent-aware Guidance for Efficient Unmasking in Diffusion Large Language Models
Uncovering Latent Communication Patterns in Brain Networks via Adaptive Flow Routing
RAIGen: Rare Attribute Identification in Text-to-Image Generative Models
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training
Position: Want Better ML Reviews? Stop Asking Nicely and Start Incentivizing with a Credit System
HOI-PAGE: Zero-Shot Human-Object Interaction Generation with Part Affordance Guidance
Pix2Key: Controllable Open-Vocabulary Retrieval with Semantic Decomposition and Self-Supervised Visual Dictionary Learning
ReViT: Rotational-equivariant Vision Transformers for Neural PDE Solvers
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
CoRe: Context-Robust Remasking for Diffusion Language Models
Process Reward Models That Think
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering
Breaking the Echo Chamber: A Dynamic Ensemble Pruning Perspective on MoE
GRAPE: Let GRPO Supervise Query Rewriting by Ranking for Retrieval
MCP-Persona: Benchmarking LLM Agents on Personalized MCP Tools and Tasks
Training AI Co-Scientists Using Rubric Rewards
Position: Carbon Footprint Reporting Should Be Routine in Machine Learning Research
SimpleGPT: Improving GPT via A Simple Normalization Strategy
Does Reinforcement Fine-Tuning Improve Generalization of LLM Agents? An Empirical Study
CSOR: Coreset Selection for Object Re-identification via Class Pruning
PolarDepth: Monocular Transparent Object Depth from Polar-Physics Priors
LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Entangled No More: Multi-Domain Decoupling for Robust Dynamic Graph Neural Networks
Improving Visual Token Reduction via Rectifying Distortions for Efficient Multimodal LLM Inference
Regime-Adaptive Bayesian Optimization via Dirichlet Process Mixtures of Gaussian Processes
AdaMEM: Test-Time Adaptive Memory for Language Agents
Position: Multiple Definitions & Unrealistic Assumptions of Model Collapse Distract from Real World Threats
Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models
Hierarchical Policy Learning via Spectral Decomposition
GeoLoom: High-quality Geometric Diagram Generation from Textual Input
Mixing Configurations for Downstream Prediction
Dream-MPC: Gradient-Based Model Predictive Control with Latent Imagination
More Sail than Ballast: Addressing Harmful Knowledge Leakage in the Expansive Reasoning Space of LRMs
Position: Epistemic uncertainty estimation methods are fundamentally incomplete
Position: Benchmarks Do Not Measure Deployment Readiness in Clinical AI
Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions
Emergence of Biased Consensus in Multi-Agent LLM Debates
Improving Adversarial Robustness of Attribution via Implicit Regularization
Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis
Position: Solipsistic superintelligence is unlikely to be cooperative
When Labelers Stay Silent: The Power of Ties in Cost-Effective Preference Learning
Uni-DocRobust: Universal Plug-and-Play Robustness Enhancement for Multi-modal LLMs via Feature Restoration
Approximate Equivariance via Projection-Based Regularisation
Watch Your Step: Information Injection in Diffusion Models via Shadow Timestep Embedding
Linear Causal Representation Learning by Topological Ordering, Pruning, and Disentanglement
Motion Dynamics Learning for Few-Shot Embodied Adaptation
Learning to Emulate Chaos: Adversarial Optimal Transport Regularization
Equivalence of Context and Parameter Updates in Modern Transformer Blocks
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
From Parameter Dynamics to Risk Scoring: Quantifying Sample-Level Safety Degradation in LLM Fine-tuning
Position: Neural Approximation Is Rarely Justified for Hard Combinatorial Problems
Position: LLMs can't jump
Learn-to-learn on Arbitrary Textual Conditioning: A Hypernetwork-Driven Meta-gated LLM
Beyond Procedure: Substantive Fairness in Conformal Prediction
Edit-Based Refinement for Parallel Masked Diffusion Language Models
Benchmarking the Scientific Mind: Toward Evaluation of Complex-Reasoning Biomedical VQA
Solving the Offline and Online Min-Max Problem of Non-smooth Submodular-Concave Functions: A Zeroth-Order Approach
D²Evo: Dual Difficulty-Aware Self-Evolution for Data-Efficient Reinforcement Learning
Exploring Accurate and Transparent Domain Adaptation in Predictive Healthcare via Concept-Grounded Orthogonal Inference
RapTB: Rooted Absorbed Trajectory Balance with Submodular Replay for Stable Autoregressive GFlowNet Training
FOAM: Frequency and Operator-Error Based Adaptive Damping Method for Reducing Staleness-Oriented Error for Shampoo
RoboFlow4D: A Lightweight Flow World Model Toward Real-Time Flow-Guided Robotic Manipulation
Position: Verifiable Data Minimization is a Prerequisite for Responsible, Privacy-Preserving Industrial Vision
HEDP: A Hybrid Energy-Distance Prompt-based Framework for Domain Incremental Learning
Position: Machine Learning Research Should Be Guided by Explicit, Pluralistic Models of Human Purpose
SaTeen: Learning Structural Alignment for Continual Test-Time Adaptation
ERGeoBench: A Comprehensive Benchmark for Embodied Reasoning and Geo-localization in Multimodal Large Language Models
Adversarial Attack and Defense for Denoising Diffusion Sampling
d2p: Fast and Scalable Structured Attention with Differentiable Dynamic Programming
SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass
CircuitPrint: Mechanistic Circuit Fingerprints for Large Language Models
Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling
Zero Sum SVD: Balancing Loss Sensitivity for Low Rank LLM Compression
DP-KFC: Data-Free Preconditioning for Privacy-Preserving Deep Learning
Learning the Best Under Constraints: A Duality-Based Framework
SynLaD: Latent Diffusion for Generating Synthesizable Molecules Conditioned on 3D Pharmacophore Profiles
Optimizing Diversity and Quality through Base-Aligned Model Collaboration
OSCS: Online Selection with Provable FAR Control for LLM Safety
Dynamics Reveals Structure: Challenging the Linear Propagation Assumption
One-Way Policy Optimization for Self-Evolving LLMs
Delving into Non-Exchangeability for Conformal Prediction in Graph-Structured Multivariate Time Series
PATCHCODE: Discrete Latent Predictive Learning for EEG Foundation Model
Entropy-aware Span-Constrained Optimal Transport for Robust Cross-Tokenizer Knowledge Distillation
Position: Hippocampal Explicit Memory Is a Cornerstone to Human-Level AI
UniScale: Adaptive Unified Inference Scaling via Online Joint Optimization of Model Routing and Test-Time Scaling
DLM-Scope: Mechanistic Interpretability of Diffusion Language Models via Sparse Autoencoders
CauScale: Neural Causal Discovery at Scale
From Prompts to Tokens: Internalizing Causal Supervision in Vision-Language Model for Multi-Image Causal Reasoning
DiffuReason: Enhancing Reasoning Ability for Diffusion Language Models via Monte Carlo Tree Search
FIPN: Forward Self-Organizing Interpretable Polynomial Networks for Time Series Forecasting
DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA
Quantum Robust Inner Minimization for Reinforcement Learning with Quadratic Speed-Up in Query Complexity
The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
TextResNet: Decoupling and Routing Optimization Signals in Compound AI Systems via Deep Residual Tuning
Physics-Guided Motion Loss for Video Generation Model
Deterministic Differentiable Structured Pruning for Large Language Models
Dynamic Multimodal Evaluation via Knowledge-Enhanced Benchmark Evolution
Injecting Distributional Awareness into MLLMs via Reinforcement Learning for Deep Imbalanced Regression
Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals
MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging
Position: Zeroth-Order Optimization in Deep Learning Is Underexplored, Not Underpowered
EvoC2F: Compiling Tool Orchestration for Efficient and Evolvable LLM Agents
Optimizing Rank for High-Fidelity Implicit Neural Representations
Less is More: Geometric Unlearning for LLMs with Minimal Data Disclosure
Unbiased Principles, Robust Rewards
CACR: Reinforcing Temporal Answer Grounding in Instructional Video via Candidate-Aware Causal Reasoning
Baguan-TS: dual in-context learning model for time series forecasting with covariates
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors
Overcoming the Modality Gap in Context-Aided Forecasting
CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
One-Shot Weighted Ensemble Estimation for Federated Quantile Regression: Optimal Statistical Guarantees under Heterogeneous Structured Data
Controllable Molecule Generation via Sparse Representation Editing: An Interpretability-Driven Perspective
Training-Free Adversarial Robustness in Deep Learning MRI Reconstruction
TimeSpot: Benchmarking Geo-Temporal Understanding in Vision–Language Models in Real-World Settings
EmWorld: Emotion World Model with Latent State Evolution for Scenario-Incremental Dynamic Facial Expression Recognition
Spatial Deconfounder: Interference-Aware Deconfounding for Spatial Causal Inference
Symmetries in language statistics shape the geometry of model representations
Drift is a Sampling Error: SNR-Aware Power Distributions for Long-Horizon Robotic Planning
World-Model Inspired Emotion-aware Token Refinement for Training-Free Multimodal Emotion Recognition
FedScar: Correcting Geometric Bias for Flatness-Consistent Federated Learning
MUSA-PINN: Multi-scale Weak-form Physics-Informed Neural Networks for Fluid Flow in Complex Geometries
Beyond Global Alignment: Fine-Grained Motion-Language Retrieval via Pyramidal Shapley-Taylor Learning
Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models
From Flat Facts to Sharp Hallucinations: Detecting Stubborn Errors via Gradient Sensitivity
LSGQuant: Layer-Sensitivity Guided Quantization for One-Step Diffusion Real-World Video Super-Resolution
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
UnHype: CLIP-Guided Hypernetworks for Dynamic LoRA Unlearning
X-EviProbe: Post-hoc Parameter-free Evidential Uncertainty Quantification for Frozen Graph Neural Networks
GUI-Spotlight: Adaptive Iterative Focus Refinement for Enhanced GUI Visual Grounding
AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing
video-SALMONN S: Memory-Enhanced Streaming Audio-Visual LLM
Reparameterization Flow Policy Optimization
Reparameterization Proximal Policy Optimization
CVSearch: Empowering Multimodal LLMs with Cognitive Visual Search for High-Resolution Image Perception
LaST$_{0}$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model
Rethinking Loss Reweighting for Imbalance Learning as an Inverse Problem: A Neural Collapse Point of View
AIR: Post-training Data Selection for Reasoning via Attention Head Influence
FAFO: Lossy KV Cache Compression for Lossless Inference Acceleration via Draftless Fumble Decoding
Scout Before You Attend: Sketch-and-Walk Sparse Attention for Efficient LLM Inference
Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated Negotiation
Fix the Loss, Not the Radius: Rethinking the Adversarial Perturbation of Sharpness-Aware Minimization
SIKA-GP: Accelerating Gaussian Process Inference with Sparse Inducing Kernel Approximations for Bayesian Deep Learning
VCG-Bench: Towards A Unified Visual-Centric Benchmark for Structured Generation and Editing
Theoretical Challenges in Learning for Branch-and-Cut
Reliability-Aware LLM Alignment from Inconsistent Human Feedback
EchoAttention: Exploiting Token-Pair Redundancy and Frame-Block Similarity for Efficient Long Video Generation
The Two-Hump Problem: Bridging the Difficulty Gap in Mathematical Reinforcement Learning
GCIB: Graph Contrastive Information Bottleneck for Multi-Behavior Recommendation
Looking Locally: Object-Centric Vision Transformers as Foundation Models for Efficient Segmentation
Verifying Meta-Awareness via Predictive Rewards in Reasoning Models
Turning Drift into Constraint: Robust Reasoning Alignment in Non-Stationary Multi-Stream Environments
Position: Prompts for Public-Sector LLMs Should Be Governed as Commons
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox
Guided Star-Shaped Masked Diffusion
Differentially Private Submodular Maximization with a Knapsack Constraint
From Static Constraints to Dynamic Adaptation: Sample-Level Constraint Release for Offline-to-Online Reinforcement Learning
Beyond Static Endpoints: Tool Programs as an Interface for Flexible Agentic Web Services
Towards Docking-oriented De Novo Ligand Design via Gradient Inversion
One Tool Is Enough: Reinforcement Learning of LLM Agents for Repository-Level Code Navigation
Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Position: Benchmarks for Vision–Language Models in Urban Perception Should Be Reliability-Aware and Negotiated
WildCat: Near-Linear Attention in Theory and Practice
Solving Time-Dependent Differential Equations with Physical Dynamical Systems
Trajectory-Aware Heuristic Learning for Combinatorial Search
Hamiltonian Asymmetric Fusion: One-Way Safe Directed Refinement under Modality Imbalance
Unbiased Reward Modeling from Implicit Preference
VLANeXt: Recipes for Building Strong VLA Models
Linguistic Properties and Model Scale in Brain Encoding: From Small to Compressed Language Models
Invariant Representation Learning for Source-Free Time Series Forecasting with LLM-Centric Proxy Denoising
Theoretical Guarantees for One-Shot Magnitude Pruning and Compute-Adaptive Early Exit
SceneDirector: Bridging Explicit Geometry and Generative Priors for Unified Driving Scene Editing
RSPO: Regularized Self-Play Alignment of Large Language Models
LEAP: Zone-Aware MCTS for LLM Self-Speculative Decoding
Attributed Network Alignment: Statistical Limits and Efficient Algorithm
Quantized Maximum Likelihood Estimation under Normal Mean-Variance Mixture Model
PhyScene3D: Physically Consistent 3D Interactive Tabletop Scene Generation
DDP-WM: Disentangled Dynamics Prediction for Efficient World Models
PAC-Bayesian Reinforcement Learning Trains Generalizable Policies
Learning the Neighborhood: Contrast-Free Multimodal Self-Supervised Molecular Graph Pretraining
Investigating Memory in RL with POPGym Arcade
OVLR: Efficient, Scalable, and Robust Training via Output-Level Variance-Reduced Likelihood Ratio
LDARNet: DNA Adaptive Representation Network with Learnable Tokenization for Genomic Modeling
How to Correctly Report LLM-as-a-Judge Evaluations
Toward Training Superintelligent Software Agents through Self-Play SWE-RL
Generative Neural Operators through Diffusion Last Layer
DriveWorld-VLA: Unified Latent-Space World Modeling with Vision–Language–Action for Autonomous Driving
You Can Learn Tokenization End-to-End with Reinforcement Learning
Detecting Contextual Hallucinations in Large Language Models with Frequency-Aware Attention
Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing
Toward Understanding Adversarial Distillation: Why Robust Teachers Fail
De-Linearizing Agent Traces: Bayesian Inference of Latent Partial Orders for Efficient Execution
On the Provable Suboptimality of Momentum SGD in Nonstationary Stochastic Optimization
Latent-Guided Cooperative Energy-Based Models
BLOCK-EM: Preventing Emergent Misalignment via Latent Blocking
MetaOthello: A Controlled Study of Multiple World Models in Transformers
MePo: Meta Post-Refinement for Rehearsal-Free General Continual Learning
iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning
Strategy-Aware Optimization Modeling with Reasoning LLMs
StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning
ModernVBERT: Towards Smaller Visual Document Retrievers
$A_2$DEPT: Large Language Model–Driven Automated Algorithm Design via Evolutionary Program Trees
Removing Sandbagging in LLMs by Training with Weak Supervision
PDAgent: An LLM-Driven Autonomous Agent Framework Towards *In Silico* Protein Design via Directed Mutation
Unified Safe In-context Image Generation in Multimodal Diffusion Transformers
SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport
MemoryBench: A Benchmark for Memory and Continual Learning in LLM Systems
SALAAD: Sparse And Low-Rank Adaptation via ADMM for Large Language Model Inference
Forward-Chaining Temporal Point Process
Simple Unbiased Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures
Divide and Learn: Multi-Objective Combinatorial Optimization at Scale
Understanding Data Temporality Impact on Large Language Models Pre-training
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
Branching Diffusion for Point Processes in Time and Space
From Drift to Coherence: Stabilizing Beliefs in LLMs
Adapting to Evolving Graphs: A Scalable Framework for Dynamic Coarsening
AdaGC: Enhancing LLM Pretraining Stability via Adaptive Gradient Clipping
AffIn-Space: Learning Affine-Invariant Representations for 3D Spatial Understanding with MLLMs
HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
GenExam: A Multidisciplinary Text-to-Image Exam
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
Breaking the Block: Preserving Data Continuity to Train Superior SAEs for Instruct Models
Efficient Generative Modeling beyond Memoryless Diffusion via Adjoint Schrödinger Bridge Matching
PRISM: Gauge-Invariant Tangent-Space Differentially Private LoRA
GPan-LoRA: Gaussian Process Amortized Networks for Bayesian Low-Rank Adaptation in Large Language Models
Calibrated Preference Learning: The Case of Label Ranking
Learning to Watch: Active Video Anomaly Understanding via Interleaved Policy Optimization
Why Do We Need Warm-up? A Theoretical Perspective
Simultaneous Confidence Bounds for Aggregated Effects via Exact Subset Optimization
FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning
GHOST: Unmasking Phantom States in Mamba2 via Grouped Hidden-state Output-aware Selection & Truncation
The Cylindrical Representation Hypothesis for Language Model Steering
Formal Concept Lattices are Good Semantic Scaffolds for Concept-Based Learning
Coupled Trigger Optimization and Vulnerable Parameter Alignment for Persistent Backdoor Attacks on Federated Learning
SPARe: Stacked Parallelism with Adaptive Reordering for Fault-Tolerant LLM Pretraining Systems with 100k+ GPUs
On the Robustness of Langevin Dynamics to Score Function Error
AesFormer: Transform Everyday Photos into Beautiful Memories
FedRGL: Robust Federated Graph Learning under Label Noise
SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding
DAG-MoE: From Simple Mixture to Structural Aggregation in Mixture-of-Experts
Critique-Guided Distillation for Robust Reasoning via Refinement
Robust Linear Dueling Bandits with Post-serving Context under Unknown Delays and Adversarial Corruptions
$G^2$-Reader: Dual Evolving Graphs for Multimodal Document QA
Why Tree-Style Branching Matters for Thought Advantage Estimation in GRPO
From Holo Pockets to Electron Density: GPT-style Drug Design with Density
Structure-Aware Consistency Priors for Shape from Polarization in Complex Media
VideoBrain: Learning Adaptive Frame Sampling for Long Video Understanding
Less Is More: Fast and Accurate Reasoning with Cross-Head Unified Sparse Attention
Gaussian Mean Field Variational Inference can Overestimate Predictive Variance
Discovering Interpretable Algorithms by Decompiling Transformers to RASP
Global Credit Assignment via Dynamical Criticality
From Zero to Hero: Advancing Zero-Shot Foundation Models for Tabular Outlier Detection
Clover: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation
On the Separability of Information in Diffusion Models
Gradient Flow Sampler-based Distributionally Robust Optimization
Breaking the Reference Bottleneck via Learning to Rewrite Conversational Queries without Gold Reference Passages
A Minimal Agent for Automated Theorem Proving
Position: Current Benchmarking Hinders Real Progress in Deep Learning for Time Series Forecasting
RECAST: Model Reconstruction via Counterfactual-Aware Wasserstein Geometry under Limited Data
Regularized Offline Policy Optimization with Posterior Hybrid Bayesian Belief
Plug-and-Play Guidance for Discrete Diffusion Models via Gradient-Informed Logit Correction
SpeedCP: Fast Kernel-based Conditional Conformal Prediction
Learnability-Informed Fine-Tuning of Diffusion Language Models
Reconstructing Template-Memorized Images from Natural Prompts
OPTION: Optimal Transport–Guided Flow Matching for Incomplete and Unaligned Multi-View Clustering
When Diffusion Language Models Hesitate: Detecting and Correcting Visual Hallucinations via Confidence Fluctuation
CoRe: Combined Rewards with Vision-Language Model Feedback for Preference-Aligned Reinforcement Learning
Latent Diffusion Pretraining for Crystal Property Prediction
Motion-Residual Conflict-Aware Time Reversal for Generative Inbetweening
Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO
TexEditor: Structure-Preserving Text-Driven texture Editing
Alignment-Sensitive Minimax Rates for Spectral Algorithms with Learned Kernels
An In-Depth Study on Deep Learning Model Cloning
RelayCaching: Accelerating LLM Collaboration via Decoding KV Cache Reuse
PrivGate: Steering Contextual Integrity in LLMs via Latent Space Geometry
MemIncept: Steering LLM Agents via Cooperative Stealthy Memory Injections
Do Language Models Track Entities Across State Changes?
SCoA: Revisiting Domain Generalized Object Detection with Style-Conditioned Adaptation
Symmetries in PAC-Bayesian Learning
The Hidden Risk: Membership Inference Attacks on Multimodal Federated Learning via Modality Imbalance
Multimodal Latent Language Modeling with Next-Token Diffusion
SAEmnesia: Erasing Concepts in Diffusion Models with Supervised Sparse Autoencoders
Multi-Level Strategic Classification: Incentivizing Improvement through Promotion and Relegation Dynamics
ICR-RL: Deep Reinforcement Learning via In-Context-Regression
Structured Expert Routing with Multi-View Task Priors for Offline Meta-Reinforcement Learning
TransNormal: Dense Visual Semantics for Diffusion-based Transparent Object Normal Estimation
From 2D Grids to 1D Tokens: Reforming Shared Representations for Multimodal Image Fusion
LoRe: Adaptive Interaction-Evaluation Routing with Per-step Interaction Budgets for Iterative Graph Solvers
EARL: Towards a Unified Analysis-Guided Reinforcement Learning Framework for Egocentric Interaction Reasoning and Pixel Grounding
Denoising without Diffusion: Fixed-Noise Denoiser Anomaly Detection in Tabular Data
MuLoCo: Muon is a Practical Inner Optimizer for DiLoCo
Cross-task Calibration for Asynchronous Federated Continual Learning
CRAG: Can 3D Generative Models Help 3D Assembly?
MindZero: Learning Online Mental Reasoning With Zero Annotations
Lightweight Federated Incremental Learning via Decoupled Replay
PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models
Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning
Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
Sycophancy Towards Researchers Drives Performative Misalignment
Rethinking Human Intent to CAD: Parametric CAD Model Generation via Cooperative Multi-Task Alignment and Spatial-Aware Reinforcement Learning
Bridging Dynamics and Data: A Unified Diffusion Framework for Mechanistically-Informed Epidemic Forecasting
Stable Velocity: A Variance Perspective on Flow Matching
Dissecting Quantization Error: A Concentration-Alignment Perspective
Balancing Learning Rates Across Layers: Exact Two-Step Dynamics and Optimal Scaling in Linear Neural Networks
Federated Causal Inference on Multi-Site Observational Data via Propensity Score Aggregation
LEGO-FL: Learning Heterogeneous Federated Models as a LEGO Assembly Games
Fair Dataset Distillation via Cross-Group Barycenter Alignment
One-shot Entropy Minimization for Language Model Reasoning
Uncertainty-Aware Clarification in LLM Agents with Information Gain
Autobidding Auctions with LLM-Powered Creatives
SEPS: Semantic-Enhanced Patch Slimming Framework for Fine-Grained Cross-Modal Alignment
Quantifying Biases in LLM-as-a-Judge Evaluations
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn’t)
From Abstraction to Instantiation: Learning Behavioral Representation for Vision-Language-Action Model
Harnessing Non-Adversarial Robustness in Large Language Models
Statistical Early Stopping for Reasoning Models
AnalogVerifier: A Neuro-Symbolic Framework for Analog Circuit Verification
LabBuilder: Protocol-Grounded 3D Layout Generation for Interactable and Safe Laboratory
Unifying and Optimizing Data Values for Selection via Sequential Decision-Making
MIND: Decoupling Model-Induced Label Noise via Latent Manifold Disentanglement
Zero-Shot Off-Policy Learning
SLAP: The Semantic Least Action Principle for Variational Video-Language Modeling
FOCUS & RePAIR: Mitigating Text Degeneration via Token-Level Guidance For Pruned Large Language Models
NoiseSDF2NoiseSDF: Learning Clean Neural Fields from Noisy Supervision
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
Jailbreak to Protect: Buffering Harmful Fine-Tuning via Temporary Jailbreaking LoRA in Large Language Models
Benchmarking and Enhancing VLM for Compressed Image Understanding
No Retraining at Edge: Efficient Resource-Aware Mixed-Precision Quantization via Federated Supernet Learning
LLawCo: Learning Laws of Cooperation for Modeling Embodied Multi-Agent Behavior
From Per-Image Low-Rank to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers
GADA: Geometry-Aware Deformable Aggregation for Image-Based Gaussian Splatting
scDataset: Scalable Data Loading for Deep Learning on Large-Scale Single-Cell Omics
Mitigating Noise-Induced Layout Priors for Object Counting in Diffusion Models
Learnable Kernel Density Estimation for Graphs and Its Application to Graph-Level Anomaly Detection
Knowing Bias, Doing Better: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement
Immuno-VLM: Immunizing Large Vision-Language Models via Generative Semantic Antibodies for Open-World Trustworthiness
Concept Removal for Frontier Image Generative Models
Video-Based Optimal Transport for Feedback-Efficient Offline Preference-Based Reinforcement Learning
Causal Fine-Tuning under Latent Confounded Shift
OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization
SARSteer: Safeguarding Large Audio Language Models via Safe-Ablated Refusal Steering
Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding
RED-HDP-HMM: Observation-Dependent Durations for Bayesian Nonparametric Sequential Models
Regret Pre-training: Bridging Prior and Posterior Views for Enhanced Knowledge Grounding
Towards Trustworthy and Identifiable Virtual Face Generation
Distilling Linearized Behavior into Non-linear Fine-Tuning for Effective Task Arithmetic
From Patches to Plans: Reasoning Distillation for Repository-Level Program Repair
Learning General Causal Structures with Hidden Dynamic Process for Climate Analysis
Heterogeneous Customizable Personalized Federated Fine-Tuning Approach for Large Language Models
VR-Thinker: Boosting Multimodal Reward Models through Think with Image Reasoning
Offline Reinforcement Learning of High-Quality Behaviors Under Robust Style Alignment
Geometry-Preserving Orthonormal Initialization for Low-Rank Adaptation in Reinforcement Learning
MixQuant: Pushing the Limits of Block Rotations in Post-Training Quantization
T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
RePack then Refine: Efficient Diffusion Transformers with Vision Foundation Models
Linguistic Nepotism: Trading-off Quality for Language Preference in Multilingual RAG
Trustworthy Federated Label Distribution Learning under Annotation Quality Disparity
Learning in Bayesian Stackelberg Games With Unknown Follower's Types
Noisy-Channel Minimum Bayes Risk Decoding
Grounded in Reality: Learning and Deploying Proactive LLM from Offline Logs
Tracing the Emergence of Symbol Grounding in Multimodal Language Models
Chamaileon: Cross-Context Binder Design with Contextualized Modeling and Mixed Sampling
What Makes a Good Representation for Single-Cell Perturbation Prediction?
Dependency-Aware Parallel Decoding via Attention for Diffusion LLMs
Tracking Drift: Variation-Aware Entropy Scheduling for Non-Stationary Reinforcement Learning
Positive–Unlabeled Reinforcement Learning Distillation for On-Premise Small Models
Absorbing Quantization Error by Deformable Noise Scheduler for Diffusion Models
Corruption-Tolerant Asynchronous Q-Learning with Near-Optimal Rates
Function-Valued Causal Influence in Nonlinear Time Series
The Lie We Tell: Correcting the Euclidean Fallacy in Vision Language Action Policies via Score Matching on Tangent Space
On the Ability of Transformers to Verify Plans
Position: Self-Play Only Evolves When Self-Synthetic Pipeline Ensures Learnable Information Gain
ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling
Sampling and Identity-Testing Without Approximate Tensorization of Entropy
Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models
Closing the Expression Gap in LLM Instructions via Socratic Questioning
Creat3r: Confidence Reaggregation for Exploration-aware Active 3D Reconstruction
Doppler Prompting for Stable mmWave-based Human Pose Estimation
Fedfit: Federated dynamic pruning via Fisher Information scoring
Dissecting Embodied Abilities in Multimodal Language Models through Skill-level Evaluation and Diagnosis
Learning Generalizable Skill Policy with Data-Efficient Unsupervised RL
A$^2$SG: Adaptive and Asymmetric Surrogate Gradients for Training Deep Spiking Neural Network
AutoSizer: Automatic Sizing of Analog and Mixed-Signal Circuits via Large Language Model (LLM) Agents
Causal Structure Learning for Sparse Matrix Fill-in Reduction
SABER: Continual Learning with Representation Conflict Management
Position: Why a Dynamical Systems Perspective is Needed to Advance Time Series Modeling
Understanding the Gaps in Satisficing Bandits
Ensembling Sparse Autoencoders
SurrogateSHAP: Training-Free Contributor Attribution for Text-to-Image (T2I) Models
Revisiting Photometric Ambiguity for Accurate Gaussian-Splatting Surface Reconstruction
Convergence Analysis of the Lion Optimizer in Centralized and Distributed Settings
MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts
Rethinking Parameter Sharing as Graph Coloring for Structured Compression
A Foundation-style Model for Zero-Shot Statistical Dependency Measurement
Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models
Protein Circuit Tracing via Cross-layer Transcoders
FlowNar: Scalable Streaming Narration for Long-Form Videos
Stochastic Linear Bandits with Parameter Noise
High-Fidelity ANN-to-SNN Conversion via Closed-Loop CKA Distillation
TFRBench: A Reasoning Benchmark for Evaluating Forecasting Systems
Softmax as Linear Attention in the Large-Prompt Regime: a Measure-based Perspective
Learning to Rank by Directly Optimizing Full-Order Probabilities
Flow Matching Calibration for Simulation-Based Inference under Model Misspecification
Tailoring the Training: Difficulty-Aware Learning Strategy Allocation for Large Language Models
Intra-Modal Neighbors Never Lie: Rectifying Inter-Modal Noisy Correspondence via Graph-Based Intra-Modal Reasoning
h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning
Credibility-Aware Weighting Federated Causal Discovery for Time Series
3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Simultaneous Speech-to-Speech Translation Without Aligned Data
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
Data-Source Adaptive Online Learning under Heteroscedastic Noise
WISE: World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
Bridging the Grounding Gap in VideoQA via Typed Memory for Language-based Belief-State Reasoning
Position: The Open Benchmark Paradox Must Be Resolved through Sovereign Medical Evaluation
MARS: Modular Agent with Reflective Search for Automated AI Research
Thinking in Structures: Evaluating Spatial Intelligence through Reasoning on Constrained Manifolds
Lower Bounds for Frank-Wolfe on Strongly Convex Sets
PhysHanDI: Physics-Based Reconstruction of Hand-Deformable Object Interactions
Likelihood Matching for Diffusion Models
On the Power of (Approximate) Reward Models for Inference-Time Scaling
Better, Faster: Harnessing Self-Improvement in Large Reasoning Models
Stabilizing the Q-Gradient Field for Policy Smoothness in Actor-Critic Methods
Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
Provably Convergent Actor-Critic in Risk-averse MARL
LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning
Causal Preference Elicitation
Variational inference via Gaussian interacting particles in the Bures-Wasserstein geometry
Geometry-Aware Decoding with Wasserstein-Regularized Truncation and Mass Penalties for Large Language Models
AudioMosaic: Contrastive Masked Audio Representation Learning
Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents
NITP: Next Implicit Token Prediction for LLM Pre-training
LieWarper: Geometry-Aware Motion Transfer via Lie Algebra
DNA: Uncovering Universal Latent Forgery Knowledge
Pressure Reveals Character: Behavioural Alignment Evaluation at Depth
From Directions to Regions: Decomposing Activations in Language Models via Local Geometry
ARC-Decode: Accelerated Decoding with Risk-Bounded Acceptance
Accelerated Dual Method for Distributed Optimization: An Inexact-Gradient View of Local Updates
PaperBanana: Automating Academic Illustration for AI Scientists
Position: Interpretability Can Be Actionable
Beyond Reactivity: Proactive Adaptive Conformal Inference for Online LLM Factuality
DAG: A Dual Correlation Network for Time Series Forecasting with Exogenous Variables
Inference-time Alignment with Rewards in Besov Spaces: Provable Advantages of Feature Learning and Multi-Step Policy Updates
Efficient Multi-round LLM Inference over Disaggregated Serving
TVI-CoT: Text-Visual Interleaved Chain-of-Thought Reasoning for Multimodal Understanding
TraceRouter: Robust Safety for Large Foundation Models via Path-Level Intervention
OServe: Accelerating LLM Serving via Spatial-Temporal Workload Orchestration
SCOUT: Active Information Foraging for Long-Text Understanding with Decoupled Epistemic States
PolySAE: Modeling Feature Interactions in Sparse Autoencoders via Polynomial Decoding
An Asymmetric Latent Factorization-of-Tensors Model for Relation Extraction
Spatially-Regularized Entropy for Discriminative Token Merging in Fine-Grained Re-Identification
TAG: Tangential Amplifying Guidance for Hallucination-Resistant Sampling
Offline Multi-agent Continual Cooperation via Skill Partition and Reuse
Trainable Nonexpansive Denoisers for Contractive Image Reconstruction
Delayed Momentum Aggregation: Communication-efficient Byzantine-robust Federated Learning with Partial Participation
TAMPO: Task- and Model-Aware Automatic Prompt Optimization for Robust and Controllable Auto-Routing in LLM-based Systems
Accurate, private, secure, federated U-statistics with higher degree
Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs
Loss-aware distributionally robust optimization via trainable optimal transport ambiguity sets
MORE: A Multilingual Document Parsing Benchmark and Evaluation
MechVQA: Benchmarking and Enhancing Multimodal LLMs on Comprehensive Mechanical Drawing Understanding
UniSparse: Combining Weight Pruning and Spike Sparsification in Spiking Neural Networks
The Invisible Lottery: How Subtle Cues Steer Algorithm Choice in LLM Code Generation
Rare Event Analysis of Large Language Models
Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Model
Online Change Point Detection for Multivariate Inhomogeneous Poisson Processes Time Series
Rethinking Memory in Continual Learning: Beyond a Monolithic Store of the Past
Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds
Off-Policy Learning in Large Action Spaces: Optimization Matters More Than Estimation
Position: The AI Imperative: Scaling High-Quality Peer Review in Machine Learning
G$^2$TAM: Geometry Grounded Track Anything Model
VecDesigner: Exploring Visual Guidance and Structural Consistency for Semantic Typography
Hedging on the frontier: Learning new tasks with few samples
Blending Neural Control Density Functions for Stabilization and Safety
GradPower: Powering Gradients for Faster Language Model Pre-Training
Position: Modular Safety Guardrails Are Necessary for Foundation-Model-Enabled Robots in the Real World
Task-Aware Mechanism: Hybrid MoE Vision Tower Towards Holistic Video Understanding
FlatLab: A Unified Methodology Framework and Simulation-Based Benchmark for Robotic Manipulation of Flat Objects
All Circuits Lead to Rome: Rethinking Functional Anisotropy in Circuit and Sheaf Discovery for LLMs
Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks
PRISM: Synergizing Vision Foundation Models via Self-organized Expert Specialization
Multiview Self-Representation Learning across Heterogeneous Views
Probing How Scalable Table Data Enhances General Long-Context Reasoning
Why Deep Jacobian Spectra Separate: Depth-Induced Scaling and Singular-Vector Alignment
Fisher-Preserving Guidance: Training-Free Manifold Constraints for Safe Diffusion Control
Position: Spatial Fairness: Foundations, Pitfalls, and a Path Forward
Identifying and Mitigating Errors in Gradient Aggregation of Distributed Data Parallel Training
$\tau$-Voice: Benchmarking Full-Duplex Voice Agents on Real-World Domains
Controlled Dynamics Attractor Transformer
Time-Conditioned Foreseeing: An EHR-Specific Foundation Model for Irregular Dynamics and Calendrical Time
The Accumulation of Score Estimation Error in Diffusion Models
SGERA: Stein-Guided ECG-Report Alignment for ECG Representation Learning
Differentially Private Cross-Silo Recommendation from Implicit Feedback
Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models
Continual Model Routing in Evolving Model Hubs
Scalable Single-Cell Gene Expression Generation with Latent Diffusion Models
Parameter-free Dynamic Regret: Time-varying Movement Costs, Delayed Feedback, and Memory
Optimal Classical and Quantum Algorithms for Gradient Testing and Estimation by Comparisons
Finding Stationary Points by Comparisons
Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens
SpikeVLA: Vision-Language-Action Models with Spiking Neural Networks
Towards Understanding Continual Factual Knowledge Acquisition of Language Models: From Theory to Algorithm
Omni-fMRI: A Universal Atlas-Free fMRI Foundation Model
Kernel-based Maximum-of-difference Test for Two-sample Comparison
Robust Signal Enhancement via Fractional Detail Views and Knowledge Guided Multi-view Fusion
Task-Aware Structured Memory for Dynamic Multi-modal In-Context Learning
Lookahead-GCG: Improving Multi-Model Gradient-Based Jailbreaking Attacks via Nesterov Momentum
Meerkat-VL: Implicit Risk Safety Alignment in Multimodal LLMs via Perceptual Reasoning and Self-Verification
UFO: Chain-of-Evaluation for Omni-Condition Alignment in Multi-Modal Image Generation
A Diffusive Classification Loss for Learning Energy-based Generative Models
Deformba: Vision State Space Model with Adaptive State Fusion
DualOptim+: Bridging Shared and Decoupled Optimizer States for Better Machine Unlearning in Large Language Models
The Forgetting-Retention Dilemma: Certified Unlearning Theory in Continual Learning
Spik4lite: Refactoring Neuromorphic Sparsity for Efficient Spiking Neural Networks on Commodity Edge Devices
Latent Diffusion Controller: Framework, Algorithms and Parameterization
Adaptive Bandit Algorithms for Contextual Matching Markets
PRM-PBE: Process Reward Model for Reinforcement Learning in Programming-by-Example
Questioning the Coverage-Length Metric in Conformal Prediction: When Shorter Intervals Are Not Better
GSRQ: Gain-Shape Residual Quantization for Sub-1-bit KV Cache
E-VAds: An E-commerce Short Videos Understanding Benchmark for MLLMs
Stochastic Gradient Variational Inference with Price's Gradient Estimator from Bures-Wasserstein to Parameter Space
How Powerful are LLMs in Generating Program Specifications?
Beyond Single Embedding: Modeling User Preferences as Distribution in Federated Recommendation
Credit-assigned Policy Gradient for Early Stage Retrieval in Two-stage Ranking
From Seeing to Thinking: Decoupling Perception and Reasoning Improves Post-Training of Vision-Language Models
From Basis to Basis: Gaussian Particle Representation for Interpretable PDE Operators
S-Quant: Rethinking Weight Quantization with Seed-Based Generation
Beyond Independent Genes: Learning Module-Inductive Representations for Gene Perturbation Prediction
Probing the Knowledge Boundary: An Interactive Agentic Framework for Deep Knowledge Extraction
Predicting Large Model Test Losses with a Noisy Quadratic System
Decompose and Recompose: Reasoning New Skills from Existing Abilities for Cross-Task Robotic Manipulation
Domain Adaptive Object Detection via Dynamic Causal Refinement
Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives
Understanding Dynamics of Adam in Zero-Sum Games: An ODE Approach
Continual Learning With Participation Privacy: An Auditable Buffering-Aggregation Recipe
Feature-aware (Hyper)graph Generation via Next-Scale Prediction
Prototype Transformer: Towards Language Model Architectures Interpretable by Design
See More, Forecast Better and Faster: Enhancing Time Series Foundation Models via Inference-Time Plug-and-Play Downsampling
$\texttt{Multi}^2$: Hierarchical Multi-Agent Decision-Making with LLM-Based Agents in Interactive Environments
Taming the Recent-Data Bias: Towards Robust Time Series Forecasting with Global Context
Furina: Fragmented Uncertainty-Driven Refusal Instability Attack
HieRD: Hierarchical Relational Distillation for Vision-Language Embedding Models
Beyond Description: Federated Adaptation via Semantic-Visual Prototype Alignment
JanusPipe: Efficient Pipeline Parallel Training for Machine Learning Interatomic Potentials
Finding Differentially Private Second Order Stationary Points in Stochastic Minimax Optimization
Are Tools Always Beneficial? Learning to Invoke Tools Adaptively for Dual-Mode Multimodal LLM Reasoning
Robust Filter Attention: Self-Attention as a Parallel State Estimator
Reliable Thinking with Images
Trajectory-Aware Certified Decentralized Unlearning via SGD Stability
Gradient Transformer: Learning to Generate Updates for LLMs
Optimal Design for Multinomial Logit Model with Applications to Best Assortment Identification
GraphP-FL: Personalized Federated Graph Learning via Dynamic Structure Awareness and Fisher Information Elastic Alignment
AugServe: Adaptive Request Scheduling for Augmented Large Language Model Inference Serving
Mining Tensor/Neuron-Level Sparsity to Maximize Mixture-of-Experts Potential in Post-Training and Inference
VisionWebDev: A Hierarchical Benchmark for Visual Website Development with Agent Verification
StableVLA: Towards Robust Vision-Language-Action Models without Extra Data
PCGS: Deblurring 3D Gaussian Splatting with Patch Comparison
Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink
Interpretable Discovery of One-parameter Subgroups: A Modular Framework for Elliptical, Hyperbolic, and Parabolic Symmetries
Preserve-Then-Quantize: Balancing Rank Budgets for Quantization Error Reconstruction in LLMs
Incremental Learning of Sparse Attention Patterns in Transformers
Veda: Scalable Video Diffusion via Distilled Sparse Attention
XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
Complexity of Decentralized Optimization with Mixed Affine Constraints
Induction Heads Interpolate N-Grams
SJD-SV: Speculative Jacobi Decoding with Semantics Verification for Autoregressive Image Generation
Position: Child Safety Necessitates New Approaches to AI Safety
Finding DoRI: Discovery of Retained Images in Diffusion Models
ECHO: Elastic Speculative Decoding with Sparse Gating for High-Concurrency Scenarios
When Shared Knowledge Hurts: Spectral Over-Accumulation in Model Merging
Geometry-Guided Modeling of Foundation Features Enables Generalizable Object Shape Deformation Learning
The Labyrinth and the Thread: Rethinking Regularizations in Sequential Knowledge Editing for Large Language Models
Quantum Algorithms for Triangle Cut Sparsification
Cross-Modal Semantic Decoupling and Transfer for Text-to-Visible-Infrared Person Re-Identification
Root Cause Analysis of Failures in Microservices via Bayesian Root Cause Discovery
Autoregressive Direct Preference Optimization
The Efficiency Gap in Byte Modeling
Sample-Efficient Diffusion-based Reinforcement Learning with Critic Guidance
Position: Stop Using Culturally Biased Human Cognitive Benchmarks to Evaluate LLMs
Active Budget Allocation for Efficient Scaling Law Estimation via Surrogate-Guided Pruning
TokenDrop: Token-Level Importance-Aware Backward Propagation Skipping for Efficient LLM Fine-Tuning
Beyond Softmax: A Natural Parameterization for Categorical Random Variables
Design Linear Constrained Neural Layers with Implicit Convex Optimization
Dual-Calibration Multi-View Clustering via Compact Anchor Learning
When Can You Poison Rewards? A Tight Characterization of Reward Poisoning in Linear MDPs
DocOS: A Benchmark for Proactive Document-Guided Actions in GUI Agents
LightningRL: Breaking the Accuracy–Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
Mesh Field Theory: Port–Hamiltonian Formulation of Mesh-Based Physics
Contextual Slate GLM Bandits with Limited Adaptivity
Fairness in Aggregation: Optimal Top-$k$ and Improved Full Ranking
Transformed Latent Variable Multi-Output Gaussian Processes
From Bits to Rounds: Parallel Decoding with Exploration for Diffusion Language Models
CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection
Incomplete Multi-View Clustering via Neighborhood-Conditioned Diffusion
Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting
Revisiting Anisotropy in Language Transformers: The Geometry of Learning Dynamics
Preference-based Antibody Expression Ranking: Scaling with Large-scale Weak Supervision
Policy Search via Bayesian Optimization with Temporal Difference Gaussian Processes
Beyond Text-to-SQL: Can LLMs Really Debug Enterprise ETL SQL?
Discretely-Refined Multi-view Clustering via Aligned Anchor Learning
A World in Pieces: Structural Certification of General Agents
The Value of Variance: Mitigating Debate Collapse in Multi-Agent Systems via Uncertainty-Driven Policy Optimization
Fine-grained Analysis of Brain-LLM Alignment through Input Attribution
Efficiently Training Time-to-First-Spike Spiking Neural Networks from Scratch
Geometric and Stochastic Analysis of Discontinuities in Sparse Mixture-of-Experts
Geometry-Aware Tabular Diffusion
LiMuon: Light and Fast Muon Optimizer for Large Models
Beyond the Proxy: Trajectory-Distilled Guidance for Offline GFlowNet Training
From Distribution to Geometry: Stable Graph Generalization via Invariant Barycenters
The Flexibility Trap: Rethinking the Value of Arbitrary Order in Diffusion Language Models
UCPO: Uncertainty-Aware Policy Optimization
Position: The Alignment Community is Unintentionally Building a Censor’s Toolkit
Does a Hybrid Space-Aware Randomized Defense Improve Empirical and Certified Adversarial Robustness?
Small Agent Group is the Future of Digital Health
CAT-Q: Cost-efficient and Accurate Ternary Quantization for LLMs
SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification
MDN: Parallelizing Stepwise Momentum for Delta Linear Attention
STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits
CausalXRL: Explainable Reinforcement Learning through Causal Graph Reasoning
FusionCell: Cross-Attentive Fusion of Layout Geometry and Netlist Topology for Standard-Cell Performance Prediction
Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
Quantile-Free Uncertainty Quantification in Graph Neural Networks
Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training
COBRA: Contribution-Based Bayesian Rank Allocation for Parameter-Efficient Fine-Tuning
Information-Theoretic Generalization Bounds for VAEs: A Role of Encoder and Latent Variable
Conditional KRR: Injecting Unpenalized Features into Kernel Methods with Applications to Kernel Thresholding
RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents
BizFinBench.v2: Towards Reliable LLMs in Finance via Real-User Data and Offline/Online Bilingual Evaluation
Training-Free Coverless Multi-Image Steganography with Access Control
Required Spine Optional Limbs: Heterogeneous Federated Learning via Backbone-sharing and Activation-guided Selection
SIMoE: A Probabilistic Framework for Cardinality-Constrained Routing in Mixture-of-Experts
EDCO: Dynamic Curriculum Orchestration for Domain-specific Large Language Model Fine-tuning
ImpQuant: Fine-Grained Importance-Aware Quantization for Large Vision-Language Models
Smoothing Slot Attention Iterations and Recurrences
UOTIP: Unbalanced Optimal Transport Map for Unpaired Inverse Problems
Beyond Accuracy and Complexity: The Effective Information Criterion for Structurally Stable Symbolic Regression
Evolution of Benchmark: Black-Box Optimization Benchmark Design through Large Language Model
Exploring Data-Free LoRA Transferability for Video Diffusion Models
FrameOracle: Learning What to See and How Much to See in Videos
Domain Restriction via SAE Multi-Layer Transitions
Copyright-Bench: Agentic Evaluation of Copyright Law Compliance
The Loss Is Not Enough: Sampling Conditions and Inductive Bias in Contrastive Representation Learning
Turning Bias into Bugs: Bandit-Guided Style Manipulation Attacks on LLM Judges
Infinite Mask Diffusion for Few-Step Distillation
AuTAgent: A Reinforcement Learning Framework for Tool-Augmented Audio Reasoning
ChaosNexus: A Foundation Model for ODE-based Chaotic System Forecasting with Hierarchical Multi-scale Awareness
Are Common Substructures Transferable? Understanding Transferability in Graph Pretraining under Riemannian Geometry
Dual Latent Memory for Visual Multi-agent System
Stage-wise Distortion–Perception Traversal in Zero-shot Inverse Problems with Diffusion Models
Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking
Identifiable Token Correspondence for World Models
MuCO: Generative Peptide Cyclization Empowered by Multi-stage Conformation Optimization
MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks
Learning to Extrapolate to New Tasks: A Relational Approach to Task Extrapolation
Calibrating Generative Models to Distributional Constraints
CLINIC: Towards High-quality Graph Out-Of-Distribution Detection
FeRA: Frequency-Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
Ideal Attribution and Faithful Watermarks for Language Models
Test-Time Guidance for Flow-Based Generative Models via Parallel Tempering on Source Distributions
MSP: Probabilistically Consistent Multi-Scale Action Generation
APEX: Approximate-but-exhaustive search for ultra-large combinatorial synthesis libraries
MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling
Ranking Time Series using a Time Warping Ideal Point Model
Distributional Alignment Games for Answer-Level Fine-Tuning
Return-Critic: Bridging Goal Discrepancy for Efficient Visual Reinforcement Learning
DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning
Task-Driven Subspace Decomposition for Knowledge Sharing and Isolation in LoRA-based Continual Learning
How Chain of Thought Decomposes Complex Tasks
Parametrized Power-Iteration Clustering for Directed Graphs
Enhanced Multi-Instance Partial Label Learning via Average Gradient Outer Product
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs
Geometry-Aware Dataset Condensation for Diffusion Model Training
Large Language Model Teaches Visual Students: Cross-Modality Transfer of Fine-Grained Conceptual Knowledge
OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild
PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous Space
PyVision-RL: Forging Open Agentic Vision Models via RL
WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems
TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models
ACON: Optimizing Context Compression for Long-horizon LLM Agents
PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts
Understanding Generalization and Forgetting in In-Context Continual Learning
SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond
Reward-Preserving Counterfactual State Editing for Offline Reinforcement Learning
1-Bit Wonder: Improving QAT Performance in the Low-Bit Regime through K-Means Quantization
XTransfer: Modality-Agnostic Few-Shot Model Transfer for Human Sensing at the Edge
Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium
ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning
Constructing Industrial-Scale Optimization Modeling Benchmark
Interpreting and Steering State-Space Models via Activation Subspace Bottlenecks
What Makes Synthetic Data Effective in Image Segmentation
Nonconvex Low-Rank Tensor Representation with Deep Priors for Multiview Subspace Clustering
BioFormer: Rethinking Cross-Subject Generalization via Spectral Structural Alignment in Biomedical Time-Series
Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation
On the Epistemic Uncertainty of Overparametrized Neural Networks
Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models
Distillation Models are Good Samplers for Diffusion Reinforcement Learning
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
FIBER: A Differentially Private Optimizer with Filter-Aware Innovation Bias Correction
Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning
TAGRPO: Boosting GRPO on Image-to-Video Generation with Direct Trajectory Alignment
ForensicConcept:Transferable Forensic Concepts for AIGI Detection
Modeling Covariate Transition for Efficient Estimation of Longitudinal Treatment Effects in Randomized Experiments
MAGIC: Multi-Granularity Language-Informed Image Clustering
Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models
One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception
A Tale of Two Graphs: Separating Knowledge Exploration from Outline Structure for Open-Ended Deep Research
CauseCollab: Causal Unified and Modality-Agnostic Network for Heterogeneous Collaborative Perception
A Unified Framework for Diffusion Model Unlearning with f-Divergence
Calibrated Test-Time Guidance for Bayesian Inference
OpenSage: Self-programming Agent Generation Engine
Neural Quantum States in Mixed Precision
PRISM: Training-Free Video Anomaly Detection via Intrinsic Statistical Modeling
DisPOSE: Projected Polystochastic Diffusion for Self-Supervised Multi-View 3D Human Pose Estimation
Weaving Graph over Tokens: Contextualizing Structured Sequences for LLMs
Towards Robust Human-AI Complementarity under Uncertainty
GRPO-based Cluster Decision Agent for Unknown-$\boldsymbol{K}$ Multi-view Clustering
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Controlling the Risk of Corrupted Contexts for Language Models via Early-Exiting
Position: Anthropomorphic Misalignment Research Needs Stronger Evidence
Comp-Attn: Present-and-Align Attention for Compositional Video Genneration
Attention with Routed-Memory for Learnable Sparse Control
CGRiC: Compositional Risk Certification for Structured LLM Outputs
Position: Beyond Reasoning Zombies — AI Reasoning Requires Process Validity
FairRARI: A Plug and Play Framework for Fairness-Aware PageRank
VideoSEG-O3: A Multi-turn Reinforcement Learning Framework for Reasoning Video Object Segmentation
CoPE: A Framework for Optimizing Coordination between Planning and Execution in LLM-based Agents
SafeSearch: Automated Red-Teaming of LLM-Based Search Agents
Activation with Intrinsic-Extrinsic Consensus
AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture -of-Transformers for End-to-End Autonomous Driving
Discovering Symmetry Groups with Flow Matching
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs
SlaClip: Gradient Norm Slacks can be Indicator for Adaptive Clipping in DP-SGD
PromptDyG: Test-Time Prompt Adaptation on Dynamic Graphs
OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft
Towards Context-Invariant Safety Alignment for Large Language Models
AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation
Bias-Spectrum Neural Processes for Parametric PDEs: Architecture Priors Meet PDE Constraints
DeepHA: Scaling Action Chains Elicits Deep Hierarchical Agents
Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access
Measuring Meta-Cultural Competency: A Spectral Framework for LLM Knowledge Structures
VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph
Masked Multi-path Contrast with Confidence-Gated Semantic Imputation for Incomplete Multi-view Clustering
DARTS: Distribution-Aware Active Rollout Trajectory Shaping for Accelerating LLM Reinforcement Learning
Spectral Evolution Search: Efficient Inference-Time Scaling for Reward-Aligned Image Generation
ConTSG-Bench: A Unified Benchmark for Conditional Time Series Generation
Investigating Continual Pretraining in Large Language Models: Insights and Implications
Controlled SDEs for Long-Horizon Motion Generation under Latent Decision Uncertainty
The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
MA$^3$S: Model-Agnostic Active Annotation Strategy for Crowdsourcing
Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
Gaming Consensus: Coordinated Manipulation in Crowdsourced Fact-Checking
CocoRNA: Collective RNA Design with Cooperative Multi-agent Reinforcement Learning
TimeSAE: Sparse Decoding for Faithful Explanations of Black-Box Time Series Models
DecomPose: Disentangling Cross-Category Optimization Contention for Category-Level 6D Object Pose Estimation
MemCast: Memory-Driven Time Series Forecasting with Experience-Conditioned Reasoning
CoGenCast: A Coupled Autoregressive–Flow Generative Framework for Time Series Forecasting
Anytime-Valid Inference for Online Ranking of Large Language Models
PADA-Coder: Improving Plan-Following Code Generation via Perturbation-Verified Attention Distillation and Dynamic Alignment
AutoVSR: Automatic Visual-to-Symbolic Reasoning for Symbolic Expression Generation from Circuit Schematic
Can Large Language Models Generalize Procedures Across Representations?
Navigating the Flatlands: Dual Adaptive Sharpness-Aware Minimization for Domain Generalization
Rex: A Family of Reversible Exponential (Stochastic) Runge-Kutta Solvers
StableI2I: Spotting Unintended Changes in Image-to-Image Transition
GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry
Calibrated Multimodal Representation Learning with Missing Modalities
FedQueue: Queue-Aware Federated Learning for Cross-Facility HPC Training
Principled Zero-shot Ranking Agents with Tournament Graphs
ThetaEvolve: Test-time Learning on Open Problems
CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM Agents in Social Dilemmas
The Abstraction Gap in Vision-Language Causal Reasoning
Differential syntactic and semantic encoding in LLMs
Position: Current Model Cards Are Insufficient for Downstream Governance of Open-Weight Foundation Models
Generative Visual Code Mobile World Models
PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents
KANFIS: A Neuro-Symbolic Framework for Interpretable and Uncertainty-Aware Learning
DLM: Unified Decision Language Models for Offline Multi-Agent Sequential Decision Making
ECG-R1: Protocol-Guided and Modality-Agnostic MLLM for Reliable ECG Interpretation
Reading the Cell, Designing the Cure: Perturbation-Conditioned Molecular Diffusion for Function-Oriented Drug Design
CorrSteer: Generation-Time LLM Steering via Correlated Sparse Autoencoder Features
PHALAR: Phasors for Learned Musical Audio Representations
Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping
One Bias After Another: Mechanistic Reward Shaping and Persistent Biases in Language Reward Models
Position: Responsible Practices and Model Performance are Not Competing Goals
Learning Adaptive Perturbation-Conditioned Contexts for Robust Transcriptional Response Prediction
Modelling Attention with Aitchison Geometry: Token Distinguishability and Temperature Scaling
WorldCache: Accelerating World Models for Free via Heterogeneous Token Caching
3DGS$^2$-TR: A Scalable Second-Order Trust-Region Method for 3D Gaussian Splatting
QEDBench: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs
Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning
One-shot Conditional Sampling: MMD meets Nearest Neighbors
Overclocking Electrostatic Generative Models
Towards Trustworthy Video Anomaly Understanding: A Class-Guided Chain-of-Evaluation Metric and An Anomaly-focused Meta-Benchmark
DC-LA: Difference-of-Convex Langevin Algorithm
Frontier Models Can Take Actions at Low Probabilities
Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning
Structure-Induced Information for Rerooting Levin Tree Search
Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance
Breaking the Synthetic-Real Domain Shortcut for Training-Free Generative Replay-based Class Incremental Learning
Learning to Route Languages for Multilingual Preference Optimization
SuCo: Sufficiency-guided Continuous Adaptive Reasoning
Bridging Functional Correctness and Runtime Efficiency Gaps in LLM-Based Code Translation
Matroid Algorithms Under Size-Sensitive Independence Oracles
Position: Modular Memory is the Key to Continual Learning Agents
STAR-VAE: Structured Topology-Aware Regularization for Audio Reconstruction and Generation
Logit Distance Bounds Representational Similarity
MDGMIX: Boundary-Aware Subgraph Mixing for Multi-Domain Graph Pre-Training
AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition
Inference-Time Conformal Reasoning with Valid Factuality Control for Large Language Models
The Velocity Deficit: Initial Energy Injection for Flow Matching
Anchor-Final Self-Supervision Drives Hallucination-Aware Optimization in Large Vision-Language Models
Escaping Whack-a-Mole: Code Documentation Optimization via Dependency-Guided Bi-level Search
An Empirical Study of Memory Poisoning Defenses for LLM Agents
Broadening the Backdoor Basin: Understanding LLM Backdoors Collapse and Making Backdoors Persistent
Explicitly Modeling Censoring Produces Superior Survival Predictors
Beyond Binary: Continuous State Optimization with Graph-Structured Objectives
Optimality of FSQ tokens for continuous diffusion for categorical data with application to text-to-speech
Efficiently Solving Discounted MDPs via Predictions with Unknown Prediction Errors
Large-Scale Notification Dispatch with Bundle Treatments and Multi-Outcome Uplift Optimization
On Minimum Depth and Width of Floating-Point Neural Networks for Representing Floating-Point Functions
From Absolute to Relative: Rethinking Reward Shaping in Group-Based Reinforcement Learning
Mitigating Label Shift in Tabular In-Context Learning via Test-Time Posterior Adjustment
Weakly Supervised Cross-Modal Learning for 4D Radar Scene Flow Estimation
AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions
Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
Propose, Solve, Verify: Self-Play Through Formal Verification
“Do Diffusion Models Dream of Electric Planes?” Discrete and Continuous Simulation-Based Inference for Aircraft Design
ContrastiveCFG: Guiding Diffusion Sampling by Contrasting Positive and Negative Concepts
Batched Contextual Reinforcement
GradientStabilizer: Fix the Norm, Not the Gradient
CLIP Tricks You: Training-free Token Pruning for Efficient Pixel Grounding in Large Vision-Language Models
SpanNorm: Reconciling Training Stability and Performance in Deep Transformers
Future-Gain Guided Test-Time Learning for Large Language Models
MemEvolve: Meta-Evolution of Agent Memory Systems
On the Theoretical Limitations of Embedding-based Link Prediction
Mem-T: Densifying Rewards for Long-Horizon Memory Agents
TodoEvolve: Learning to Architect Agent Planning Systems
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
Memoria-Bench: A Comprehensive Benchmark for Evaluating Memory in Long-Horizon Autonomous Agents
Visual Persuasion: What Influences Decisions of Vision-Language Models?
RECOVER:Reliable Detection of Unauthorized Data Usage in Text-to-Image Diffusion Models via Inversion Robustness
Position: Behavioral Systems Require Behavioral Tests
Keep It in Mind: User Centric Continual Spatial Intelligence Reasoning in Egocentric Video Streams
EAPO: Enhancing Policy Optimization with On-Demand Expert Assistance
ArcVQ-VAE: A Spherical Vector Quantization Framework with ArcCosine Additive Margin
FIRE: Learning to Navigate and Act on Real-World Files via Stateful Reinforcement Learning
Riemannian MeanFlow
DyCon: Dynamic Reasoning Control via Evolving Difficulty Modeling
Position: Unplugging a Seemingly Sentient Machine Is the Rational Choice — A Metaphysical Perspective
Any-dimensional invariant universality
ST-Veto: Spatio-Temporal Token Veto for Diffusion MLLMs via Taylor Prediction and Visual Grounding
Position: AI Welfare Is Bullshit
Med-SegLens: Latent-Level Model Diffing for Interpretable Medical Image Segmentation
Causal Effect Identifiability in the Presence of Latent Confounders Without Auxiliary Variables
LiveNewsBench: Evaluating LLM Web Search Capabilities with Freshly Curated News
Online Fair Division with Additional Information
SlerpFlow: Spherical Trajectory Correction for Rectified Flow Inversion
Learning Situated Awareness in the Real World
Position: Stop evaluating AI with human tests, develop principled, AI-specific tests instead
BioToken and BioFM – Biologically-Informed Tokenization Enables Accurate and Efficient Genomic Foundation Models
Margin-Adaptive Confidence Ranking for Reliable LLM Judgement
On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression
Position: Profiling Game Worlds by Transition Complexity
$\texttt{MetaDistill}$: Unlocking the Performance Ceiling for Pretrained Optimizers
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Order Matters: Unveiling the Hidden Impact of Macro Placement Sequences via Proxy-Guided LLM Evolution
Mitigating the Contractivity Trap in Diffusion ODEs via Stein Stabilization
TritonGym: A Benchmark for Agentic LLM Workflows in Triton GPU Code Generation
MM-Snowball: Evaluating and Mitigating Hallucination Snowballing in Multimodal Multi-turn Dialogue
Improving the Robustness-Utility Trade-off in Decentralized Learning over Sparse Networks
VeriSimpl: Robust Optimization Modeling from Natural Language using Simplification-based Verification
Position: Multi-Agent Systems Should Prioritize Concurrency Control
Generalist Graph Anomaly Detection via Prototype-Based Distillation
Position: Evaluation of ECG Representations Must Be Fixed
No More, No Less: Least-Privilege Language Models
Beyond External Monitors: Enhancing Transparency of Large Language Models for Easier Monitoring
Learning Manifold Data with Flow Matching
Triadic Dynamics Aware Diffusion Posterior Sampling for Inverse Problems: Optimizing Guidance and Stochasticity Schedules
Position: Neglecting the Sustainability of AI is Fuelling a Global AI Arms Race
VecMol: Vector-Field Representations for 3D Molecule Generation
RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
DynaMem: Consistent Long Video Generation via Hierarchical Memory and Motion Priors
AG-REPA: Causal Layer Selection for Representation Alignment in Audio Flow Matching
Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration
Semantic-Enriched Latent Visual Reasoning
Position: Stop Preaching and Start Practising Data Frugality for Responsible Development of AI
Calibrating Conservatism for Scalable Oversight
Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism
DREAM-R: Multimodal Speculative Reasoning with RL-Based Refined Drafting, Precise Verification, and Fully Parallel Execution
Position: Genomic Model Research Must Move Beyond Anecdotal Evaluation of Interpretability Methods
Position: Time-Series Foundation Models Require Explicit Domain-Level Benchmarks
Position: AI Governance Needs ISO-like Interoperability Protocols, Not Just Laws
Towards Spectroscopy: Susceptibility Clusters in Language Models
GRASP: Awakening Latent Spatial Reasoning in LVLMs via Training-free Geometric Rectification
Is Training Necessary for Anomaly Detection?
AMDP: Asynchronous Multi-Directional Pipeline Parallelism for Large-Scale Models Training
Context Forcing: Consistent Autoregressive Video Generation with Long Context
PathwayLLM: Explainable Clinical Trajectory Modeling with Structured Pathways for Sepsis Prediction
In-Context Learning as Rate–Distortion Optimization
PhaseAlign: Complex Phase Alignment for Stable Open-Vocabulary Semantic Segmentation
TN-SHAP-G: Graph-Structured Tensor Network Surrogates for Shapley Values and Interactions
Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO
Position: Safe AI Should be Resistant and Resilient in an Evolving World
Position: Machine Learning for Heart Transplant Allocation Policy Optimization Should Account for Incentives
Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox
Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
Diffusion differentiable resampling
Certificate-Guided Pruning for Stochastic Lipschitz Optimization
LLMInertia: Adaptive Counter-Inertial Reasoning to Improve Evidence Faithfulness in Large Language Models
On the Convergence Rate of LoRA Gradient Descent
Think in Cloud, Look at Edges: Semantic-Driven Query Decomposition for Efficient Video Reasoning
Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents
Aitchison Embeddings for Learning Compositional Graph Representations
Multi-Adapter Representation Interventions via Energy Calibration
Position: AI Should Facilitate Democratic Deliberation at Scale
Bring Future Vision: Dynamic Computation Allocation Guided by Lightweight Feature Forecaster
HiMe: Hierarchical Embodied Memory for Long-Horizon Vision-Language-Action Control
Learning to Move Before Learning to Do: Task-Agnostic pretraining for VLAs
Iterative Robust Satisficing: Minimizing Performance Degradation Under Distribution Shift
Bayesian-LoRA: Probabilistic Low-Rank Adaptation of Large Language Models
Treatment Responder Classification with Abstention
A novel statistical approach to analyze image classification
SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity
PlotCraft: Pushing the Limits of LLMs for Complex and Interactive Data Visualization
MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering
Bringing Code ALIVE: Optimizing Interactive Frontend Mini-Games via Automated Play and Reinforcement Learning at Scale
Seeing to Generalize: How Visual Data Corrects Binding Shortcuts
SC-FAGC: Size Constrained Fast Anchor-based Graph Clustering
Benchmarking the Limits of In-Context Reinforcement Learning for Ad-Hoc Teamwork
Feasible Fusion: Constrained Joint Estimation under Structural Non-Overlap
Left–Right Symmetry Breaking in CLIP-style Vision-Language Models Trained on Synthetic Spatial-Relation Data
Synthesizing Multimodal Geometry Datasets from Scratch and Enabling Visual Alignment via Plotting Code
FedVeer: Self-Adaptive Skew Estimation for Robust Federated Learning
Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs
Scaling Agentic Verifier for Competitive Coding
ReLAM: Learning Anticipation Model for Rewarding Visual Robotic Manipulation
EntRAG: Entity-Centric Retrieval-Augmented Generation for Knowledge-based Visual Question Answering
Beyond Perplexity: UTF-8 Validity in Byte-aware Language Models
InfoGlobe: Local-and-Global Information-Preserving Statistical Manifold Learning for Single-Cell Transcriptomics
Heterogeneity-Aware Knowledge Sharing for Graph Federated Learning
$\texttt{PRISM}$:A 3D Probabilistic Neural Representation for Interpretable Shape Modeling
DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion
Training-Trajectory-Aware Token Selection
Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
THETA: Threshold-Based Exclusive Batching for Memory-Bandwidth-Constrained LLM Inference
S2GS: Streaming Semantic Gaussian Splatting for Online Scene Understanding and Reconstruction
Periodic Bayesian Flow Networks with Additive Accuracy
OpenIKLR: Bridging the Reasoning Gap in Open-World Scenarios via Iterative Premise Completion
Adaptive Physics Transformer with Fused Global-Local Attention for Subsurface Energy Systems
Hard-Constrained Graph Generation with Discrete-Projection Diffusion
Norm$\times$Direction: Restoring the Missing Query Norm in Vision Linear Attention
Covariance Volume Maximization for Embodied Latent Exploration in Deep Reinforcement Learning
Colorful Pinball: Density-Weighted Quantile Regression for Conditional Guarantee of Conformal Prediction
Use What You Know: Causal Foundation Models with Partial Graphs
From Out-of-Distribution Detection to Hallucination Detection: A Geometric View
WFR-MFM: One-Step Inference for Dynamic Unbalanced OT
Provably Label-Efficient Conformal Prediction
T-GINEE: A Tensor-Based Multi-Graph Representation Learning
RoCA: Robust Cross-Domain End-to-End Autonomous Driving
Return-Aligned Decision Transformer
$\phi$-Balancing for Mixture-of-Experts Training
Winformer: Transcending Pairwise Similarity for Time-series Generation
Unveiling the Visual Counting Bottleneck in Vision-Language Models
Cure-SFT: Diagnostic-Guided Data Curation for Instruction Tuning
Context-level Language Modeling by Learning Predictive Context Embeddings
Adversarial Robustness of Implicit Neural Representation-Based Classifiers
One Coin Has Two Sides: Single Poistive Multi Label Learning from Salient Annotations
Agentic Monte Carlo: Reinforcement Learning for Black-Box LLM Agents
Self-supervised Hierarchical Visual Reasoning with World Model
PFT: Phonon Fine-tuning for Machine Learned Interatomic Potentials
PINE: Pruning Boosted Tree Ensembles with Conformal In-Distribution Prediction Equivalence
Towards Rule-Based Knowledge Sharing in Federated Learning
Model-Based Diffusion Sampling for Predictive Control in Offline Decision Making
Deep Pre-Alignment for VLMs
LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer
ALSO: Adversarial Online Strategy Optimization for Social Agents
Fix Before Search: Benchmarking Agentic Visual Query Pre-processing in Multimodal Retrieval-augmented Generation
CamGeo: Sparse Camera-Conditioned Image-to-Video Generation with 3D Geometry Priors
Reflect-then-Correct: Rebalancing Task Optimization for Generalizable Meta-Reinforcement Learning via Distributional Value Error Reduction
IdEst: Assessing Self-Supervised Learning Representations via Intrinsic Dimension
When Random Saliency Looks Trained: Architectural Center Bias in CNN Interpretability
Non-Parametric Probabilistic Robustness: A Conservative Risk Estimator under Unknown Perturbation Distributions
LUVE : Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts
Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models
Dynamic Thinking-Token Selection for Efficient Reasoning in Large Reasoning Models
Diversity-Driven Offline Multi-Objective Optimization via Bi-Level Pareto Set Learning
AD-MIR: Bridging the Gap from Perception to Persuasion in Advertising Video Understanding via Structured Reasoning
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
Trajectory-Level Data Augmentation for Offline Reinforcement Learning
Large Scale Manifold Balanced Clustering
Reliable Confidence Alignment for Generalized Category Discovery
Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity
EPS3D: End-to-End Feed-Forward 3D Panoptic Segmentation
Diffusion Flow Matching: Dimension-Improved KL Bounds and Wasserstein Guarantees
Optimal Estimation of Continuous Treatment Effects with Kernel Ridge Regression
The Tell-Tale Norm: $\ell_2$ Magnitude as a Signal for Reasoning Dynamics in Large Language Models
AVTrack: Audio-Visual Speaker Tracking in Complex Scenes
NOMAD: Lifelong Trajectory Planning via Non-Parametric Bayesian Memory-Adaptive Diffusion Experts
Condition-Aware Graph Flow Matching for Modeling the Distributions of Complex Physical Systems
Initialization is Half the Battle: Generating Diverse Images from a Guidance Potential Posterior
What You Think is What You See: Driving Exploration in VLM Agents via Visual-Linguistic Curiosity
Intervene When It Doubts: Conjunction-Guided Interactive Reasoning
Difference-Aware Decision Learning for Multimodal Image Fusion
Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities
A KL-regularization framework for learning to plan with adaptive priors
BAS: Bridging Adam and SignSGD for Memory-Efficient LLM Training
Semi-LAR: Semi-supervised Contrastive Learning with Linear Attention for Removal of Nighttime Flares
CELL: A Causal Perspective for Fairness-aware Graph Adaptation
SimpleMem: Efficient Lifelong Memory for LLM Agents
Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning
Weight Decay Improves Language Model Plasticity
TransLight: Image-Guided Customized Lighting Control with Generative Decoupling
A Two-Layer Framework for Joint Online Configuration Selection and Admission Control
From Generalist to Specialist Representation
WET: Mitigating World-Conditioned Knowledge Conflicts via World Entropy Tethering
Expert-level Leaf Cell Layout Generation via Preference-Optimized LLM
4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere
DecoderTCR: Compositional Pretraining and Entropy-Guided Decoding for TCR-pMHC Interactions
Training-Free Hashing-Based Attention via Binary Principal Components
Harmonized Dual Policy Improvement for Model-based Reinforcement Learning
SCORE: A Unified Framework for Overshoot Refund in Online FDR Control
Optimal Anytime Algorithms for Online Convex Optimization with Adversarial Constraints
BIT-LLM: Brain Instruction Tuned LLM with persistent Cross-Attention for fMRI-to-Text Decoding
PoMtVRS: Preference-Optimized Multi-Task Vehicle Routing Solver with Preference Gating
Adaptive Symmetry Discovery for Dynamical System Identification
Physics-informed Neural Operator Learning for Nonlinear Grad-Shafranov Equation
Learning Sparse Visual Representations via Spatial-Semantic Factorization
Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning
Steering Large Language Models through the DMTA Cycle: Structure-Based Drug Design via Knowledge-Driven Bi-Level Thompson Sampling
When Simple Problems Wear Complex Costumes: Improving Efficiency in LRM’s Adaptive Reasoning
When Generalized Zero-Shot Learning Meets PU Learning: A Plug-and-Play Framework for Seen-Class Bias Mitigation
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression
Temporal-Emerged Prompting for Segment Anything in Multiframe Infrared Small Target Detection
Spatially-Adaptive Gradient Re-parameterization for 3D Large Kernel Optimization
Knowing the Unknown: Interpretable Open-World Object Detection via Concept Decomposition Model
On the Role of Batch Size in Stochastic Conditional Gradient Methods
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
SwiftPFN: Revisiting Row-Wise Attention–Only Tabular Foundation Models with Adaptive Early Exit
Multi-View Causal Discovery without Non-Gaussianity: Identifiability and Algorithms
AdaHC: Accelerating Multi-Token Prediction with Adaptive Head Chunking with Pipeline Parallelism
MV-FGAD: Towards Efficient and Effective Federated Graph Anomaly Detection via Multi-view Learning
Threshold-Guided Optimization for Visual Generative Models
Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis
AReaL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models
ECO: Quantized Training without Full-Precision Master Weights
The Geometry of Projection Heads: Conditioning, Invariance, and Collapse
A Fast and Soft Pattern Matcher for Trillion-Scale Corpus
Mining Useful General Data for Low-Resource Domain Adaptation
AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions
Neural Low-Discrepancy Sequences
Random Selection Reveals Implicit Knowledge Consensus in Code Generation
Joint Learning in the Gaussian Single Index Model
Toward Culturally Aligned LLMs through Ontology-Guided Multi-Agent Reasoning
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
C$^{2}$R: Cross-sample Consistency Regularization Mitigates Feature Splitting and Absorption in Sparse Autoencoders
Revisiting the Volume Hypothesis
AdaRoPE: Not All Attention Heads Should Rotate and Scale Equally
*Rank-Learner*: Orthogonal Ranking of Treatment Effects
Gated Relational Alignment via Confidence-based Distillation for Efficient VLMs
From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching
gp2Scale: A Class of Compactly Supported Non-Stationary Kernels and Distributed Computing for Exact Gaussian Processes on 10 Million Data Points
Scaling Beyond Masked Diffusion Language Models
Feed-Forward Taylor-Gaussians-Flow: Towards Non-uniform Motion for Novel View Synthesis from Monocular Video
PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion
LogicSAGE: Neuro-Symbolic Reasoning with Socratic-Guided Enhancement
E-mem: Multi-Agent Based Episodic Context Reconstruction for LLM Agent Memory
PODiff: Latent Diffusion in Proper Orthogonal Decomposition Space for Scientific Super-Resolution
EgoTactile: Learning Grasp Pressure for Everyday Objects from Egocentric Video
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
TopoDistill: Distilling Global System Topology for Causal Discovery in Multivariate Time Series
Attacking Gray-Box Large Vision-Language Models with Adaptive SVD-Structured Adversarial Alignment
DynaTok: Token-Based 4D Reconstruction from Partial Point Clouds
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
Characterizing, Evaluating, and Optimizing Complex Reasoning
From Interactions to Principles: Experience-Driven Self-Distillation for Evolving LLM Agents
Forget to Know, Remember to Use: Context-Aware Unlearning for Large Language Models
Rethinking Visual Autoregressive Sampling with Information-Grounding Guidance
Deep Multi-view Graph Clustering via Attribute-aware Bidirectional Structural Refinement and Pseudo-label Guided Multi-level Fusion
A Refined Generalization Analysis for Extreme Multi-class Supervised Contrastive Representation Learning
dnaHNet: A Scalable and Hierarchical Foundation Model for Genomic Sequence Learning
Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents
Can Computational Reducibility Lead to Transferable Models for Graph Combinatorial Optimization?
AgentTailor: A Semantic-Aware LLM-Based Multi-Agent System with Actor-Critic Structure
Evaluating Agentic Optimization on Large Codebases
Towards Completeness in Causal Discovery from Soft Interventions with Known Targets
EigenCache: Rethinking Diffusion Acceleration as Covariance-Optimal Forecasting and Submodular Information Allocation
LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis
NanoFLUX: Distillation-Driven Compression of Large Text-to-Image Generation Models for Mobile Devices
Online Rubrics Elicitation from Pairwise Comparisons
MapDream: Task-Driven Map Learning for Vision-Language Navigation
Variational Adapter for Cross-modal Similarity Representation
Causal Disentangled Anchor Learning for Scalable Fair Multi-view Clustering
Dual-channel Dynamic Graph Neural Networks with Adaptive Adjacency Learning and Multi-scale Representation Fusion
HSMAD: Heterophily-Driven Spectral and Manifold Learning for Graph Anomaly Detection
PointCHR: Point Cloud Analysis via Curvature-Aware Hyperbolic Rectification
Efficient and Uncertainty-Aware Diffusion Framework for Offline-to-Online Reinforcement Learning
Capacity without Access: Reinterpreting the Mid-Depth Spectral Plateau in LLMs
Predicting the Order of Upcoming Tokens Improves Language Modeling
Block-wise Codeword Embedding for Reliable Multi-bit Text Watermarking
GoodDiffusion: Proactive Copyright Protection for Diffusion Generative Models via Learnable Sample-specific Signatures
Transformers Learn the Optimal DDPM Denoiser for Multi-Token GMMs
Crisp: A Spectral-Based Interaction Strategy for Multivariate Time Series Forecasting
Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions
Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling
Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers
REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control
SemBind: Binding Diffusion Watermarks to Semantics Against Black-Box Forgery Attacks
Beyond Continuity: Simulation-free Reconstruction of Discrete Branching Dynamics from Single-cell Snapshots
Optimal Transport with Symmetry Groups
Deterministic Component Mining for Multi-framework UI2Code Generation
Deep Flow Networks
Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs
Decentralized and Disentangled Task–Role Representation Learning for Generalizable Offline Multi-Agent Meta Reinforcement Learning
GR-LoRA: Gradient-Recycling Low-Rank Adaptation for Class-Incremental Learning
GenShield: Unified Detection and Artifact Correction for AI-Generated Images
Sparks of Cooperative Reasoning: LLMs as Strategic Hanabi Agents
HERMES: Towards Efficient and Verifiable Mathematical Reasoning in LLMs
HexGen-3: A Fully Disaggregated LLM Serving Framework with Fine-Grained Heterogeneous Resource Autoscaling
Flex-Forcing: Towards a Unified Autoregressive and Bidirectional Video Diffusion Model
Optimizing Return Distributions with Distributional Dynamic Programming
L-SR1: Learned Symmetric-Rank-One Preconditioning
Residual Context Diffusion Language Models
Understanding Dynamic Compute Allocation in Recurrent Transformers
ACTG-ARL: Differentially Private Conditional Text Generation with RL-Boosted Control
One-step Latent-free Image Generation with Pixel Mean Flows
Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
DecepChain: Inducing Deceptive Reasoning in Large Language Models
H$^2$CL: Heterogeneity-Aware Hypergraph Contrastive Learning for Robust Representation
Polaris: Coupled Orbital Polar Embeddings for Hierarchical Concept Learning
Tempora: Characterising the Time-Contingent Utility of Online Test-Time Adaptation
AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction
Information Flow Reveals When to Trust Language Models
Hyperbolic Neural Operator
KITE: Knowledge-Guided Probabilistic Modeling for Time Series Forecasting with Exogenous Variables
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Glimpse: Geometry Learning of Multi-scale Structural Priors for 3D Pose Estimation
Neural Logistic Bandits
Provable Sample Efficiency of Curriculum Post-Training for Transformer Reasoning
Rethinking the Trust Region in LLM Reinforcement Learning
Reward-free Alignment for Conflicting Objectives
On Path to Multimodal Historical Reasoning: HistBench and HistAgent
Representational Similarity and Model Behavior in Multi-Agent Interaction
Wasserstein Geometry-Aware Adaptive Control via Meta-Learning
ePC: Fast and Deep Predictive Coding for Digital Hardware
Random Scaling of Emergence Capabilities
Towards Fully Parameter-Free Stochastic Optimization: Grid Search with Self-Bounding Analysis
Prompt Injection as Role Confusion
WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
A Geometry-Aware Efficient Algorithm for Compositional Entropic Risk Minimization
Prototype-Based Test-Time Adaptation of Vision-Language Models
Visual Implicit Autoregressive Modeling
SoftMoE: Soft Differentiable Routing for Mixture-of-Experts in LLMs
ProAct: A Benchmark and Multimodal Framework for Structure-Aware Proactive Response
Expandable, Compressible, Mineable: Open-World Thermal Infrared Image Restoration
Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration
Escaping Mode Collapse in LLM Generation
Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning
Taming Stochastic Gradient Descent: Almost Sure Convergence and Saddle-Point Avoidance under $(L_{0},L_{1})$-Smoothness
Cheap2Rich: A Multi-Fidelity Framework for Data Assimilation and System Identification of Multiscale Physics - Rotating Detonation Engines
RNA-FM: Flow-Matching Generative Model for Genome-wide RNA-Seq Prediction
MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference for Robotic Manipulation
Statistically Undetectable Backdoors in Deep Neural Networks
SEER: Transformer-based Robust Time Series Forecasting via Automated Patch Enhancement and Replacement
Differentially Private Preference Data Synthesis for Large Language Model Alignment
Hierarchical Procedural Meta-Reasoning for Generalizable Multimodal Agents
CombinationTS: A Modular Framework for Understanding Time-Series Forecasting Models
3DPoV: Improving 3D understanding via Patch Ordering on Videos
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification
Agent Primitives: Reuseable Latent Building Blocks for Multi-Agent Systems
A Recursive Decomposition Framework for Causal Structure Learning in the Presence of Latent Variables
Concept-Guided Tokenization: Closing the Gap Between Reconstruction and Generation
Move-Then-Operate: Behavioral Phasing for Human-Like Robotic Manipulation
Process Reward Agents for Steering Knowledge-Intensive Reasoning
PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency
Emergent Biological Realism in RL-Trained DNA Language Models
Rethinking Convergence in MoE Training: The Role of Routing Sparsity
Linear Bandits beyond Inner Product Spaces, the case of Bandit Optimal Transport
SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning
Physics-informed diffusion models in spectral space
Provable Accuracy Collapse of Embedding-Based Representations under Dimensionality Mismatch
VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos
TEFormer: Structured Bidirectional Temporal Enhancement Modeling in Spiking Transformers
Emergent Communication Under Misinformation
MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy
OTora: A Unified Red Teaming Framework for Reasoning-Level Denial-of-Service in LLM Agents
MME-Reasoning: A Broad-Spectrum Benchmark for Evaluating Logical Reasoning in MLLMs
OPT-Engine: Benchmarking the Limits of LLMs in Optimization Modeling via Complexity Scaling
Boosting Video Diffusion Models via Masked Autoencoders as Tokenizers
LLM Watermark Evasion via Bias Inversion
Don't Forget Why You Started: Tackling Dual Forgetting in Vision-Language Continual Learning
Vector Quantization using Gaussian Variational Autoencoder
Stability-Aware Feature Design for Robust Watermark Detection in Machine-Generated Text
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
Beyond Temperature: Hyperfitting as a Late-Stage Geometric Expansion
Rotary Position Encodings for Graphs
Delegation and Verification under AI
SeisMark: A Large-Scale Open Benchmark for Robust 3D Seismic Fault Detection
Grokking Finite-Dimensional Algebra
Beyond Rational Illusion: Behaviorally Realistic Strategic Classification
Optimal Transport under Group Fairness Constraints
Agentic Confidence Calibration
Reinforced Sequential Monte Carlo for Amortised Sampling
SHERPA: Fine-tuning Segment Anything Models with Task-relevant Guidance
SAMT: Generating Structured Avatar Meshes and Textures from a Single Image
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers
Conditionally Site-Independent Neural Evolution of Antibody Sequences
DOCKSMITH: Scaling Reliable Coding Environments via an Agentic Docker Builder
Role-Level Inductive Bias for Cross-Task Generalization in Multi-Agent Reinforcement Learning
Causal Discovery for Irregularly Time Series with Consistency Guarantees
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
Stability Analysis of Sharpness-Aware Minimization
How Out-of-Distribution Detection Learning Theory Enhances Transformer: Learnability and Reliability
MEDA: Medical-Oriented Activation Editing for Hallucination Mitigation in Medical Large Vision-Language Model
Asymmetric conformal prediction with penalized kernel sum-of-squares
REAR: Test-time Preference Realignment through Reward Decomposition
ViEEG: Hierarchical Visual Neural Representation for EEG Brain Decoding
Discrete Diffusion with Physical Mass Constraints for \emph{De Novo} Peptide Sequencing
Sparse ActionGen: Accelerating Diffusion Policy with Real-time Pruning
Aggregate Models, Not Explanations: Improving Feature Importance Estimation
Langevin Rollout Optimization for Modelic Reinforcement Learning
From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas
Group Cognition Learning: Making Everything Better Through Controlled Two-Stage Agents Collaboration
Prefix cache aware data reordering for LLM augmented database analytics
From Prior to Pro: Efficient Skill Mastering via Distribution Contractive RL Finetuning
Scalable Event Cloud Network for Event-based Classification
LIMSSR: LLM-Driven Sequence-to-Score Reasoning under Training-Time Incomplete Multimodal Observations
Evolving Interpretable Constitutions for Multi-Agent Coordination
Safety-Efficacy Trade Off: Robustness against Data-Poisoning
A Linear Expectation Constraint for Selective Prediction and Routing with False-Discovery Control
Evaluating LLM Uncertainty in Long-Form Generation Using Deterministic Ground Truth
EasyBalance: Cross-Layer Load Balancing in Distributed MoE Inference
PRISM: Perception Reasoning Interleaved for Sequential Decision Making.
Efficient Learned Image Compression without Entropy Coding
Rule2DRC: Benchmarking LLM Agents for DRC Script Synthesis with Execution-Guided Test Generation
EpiCache: Episodic KV Cache Management for Long-Term Conversation on Resource-Constrained Environments
Seeking Commonality, Preserving Specificity: A Spectral-Aware Hierarchical Framework for Cross-City Road Representation Learning
Utility Boundary of Dataset Distillation: Scaling and Coverage Laws
Partial Ring Scan: Revisiting Scan Order in Vision State Space Models
RN-D: Discretized Categorical Actors with Regularized Networks for On-Policy Reinforcement Learning
Rethinking Efficient Graph Coarsening via a Non-Selfishness Principle
Black-Box Detection of LLM-Generated Text Using Generalized Jensen Shannon Divergence
Don't Overthink with Pixels: Efficient Reasoning for Segmentation
Slash the Sink: Sharpening Structural Attention Inside LLMs
Through the Stealth Lens: Attention-Aware Defenses Against Poisoning in RAG
Identifying dependent components from multi-domain linear mixtures
RealtimeTool: Parallel Decoding for Real-Time LLM Function Calling
LARA: Latent Action Representation Alignment for Vision-Language-Action Models
Topology-Preserving Neural Operator Learning via Hodge Decomposition
Scaling Continual Learning with Bi-Level Routing Mixture-of-Experts
KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Controls
Graph is a Substrate Across Data Modalities
From Reward-Free Representations to Preferences: Rethinking Offline Preference-Based Reinforcement Learning
Sparse Relaxed-Lasso Steering: Automatic Sparse-Autoencoder Feature Selection for Precise Image Editing
Kalman Linear Attention: Parallel Bayesian Filtering For Efficient Language Modeling and State Tracking
Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture
Evidential Reasoning Advances Interpretable Real-World Disease Screening
Learning Human-Robot Collaboration via Heterogeneous-Agent Lyapunov Policy Optimization
$f$-Trajectory Balance: A Loss Family for Tuning GFlowNets, Generative Models, and LLMs with Off- and On-Policy Data
LARFT: Closing the Cognition-Action Gap for Length Instruction Following in Large Language Models
VPD-100K: Towards Generalizable and Fine-grained Visual Privacy Protection
Scaling Vision Transformers for Functional MRI with Flat Maps
NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs
LynX: Token Interface Alignment for Video+X LLMs
INT vs. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
Saliency-Aware Model Merging
HO-SFL: Hybrid-Order Split Federated Learning with Backprop-Free Clients and Dimension-Free Aggregation
Coverage Improvement and Fast Convergence of On-policy Preference Learning
MixReasoning: Switching Modes to Think
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
Generalizable and Actionable Parts Pose Estimation with Symmetry Annotation-Free Learning Strategy
Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
Beyond Drift: Stabilizing Subjective LLM Evaluation with Information-Theoretic Rubrics
Ramba: Selective State-Space Models for Relational Deep Learning
Verbalized Bayesian Persuasion
Detecting Perspective Shifts in Multi-Agent Systems
TWLA: Breaking the Barrier to W1.58A4 Post-Training Quantization for LLMs
Turning Stale Gradients into Stable Gradients: Coherent Coordinate Descent with Implicit Landscape Smoothing for Lightweight Zeroth-Order Optimization
Position: Text Embeddings Should Capture Implicit Semantics, Not Just Surface Meaning
Krause Synchronization Transformers
Hybrid Reinforcement Learning in Adversarial Markov Decision Processes
PMSPO: Progressive Matching and Semantic-Aware Policy Optimization for Camouflaged Object Detection
Real-Time Monitoring and Calibration of Chain-of-Thought Sycophancy in Large Reasoning Models
ReSpinQuant: Efficient Layer-Wise LLM Quantization via Subspace Residual Rotation Approximation
Scalable and General Whole-Body Control for Cross-Humanoid Locomotion
Goal-Conditioned Agents that Learn Everything All at Once
Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing for Dense Predictions
Position: Stop Automating Peer Review Without Rigorous Evaluation
Adaptive Multiscale Binary Expansion Tests for Independence
Provably Protecting Fine-Tuned LLMs from Training Data Extraction
Singular Bayesian Neural Networks
Corrected Samplers for Discrete Flow Models
PLSemanticsBench: A Formal Semantics Reasoning Benchmark for Code
DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization
See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model
CoverPruneGS: Coverage-Preserving Structured Pruning for Hierarchical 3D Gaussian Splatting from Sparse-View Monocular Videos
Lie-Algebraic Neural Koopman Dynamics
Nonparametric LLM Evaluation from Preference Data
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
Error Analysis of Discrete Flow with Generator Matching
FlowPET: Physics-Informed Symplectic Flow Matching for Low-Count PET Reconstruction
Crowd4D: Scene-Aware Monocular 4D Crowd Reconstruction
Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation
Precision-Induced Miscalibration: Understanding and Correcting Confidence Distortion in Quantized Neural Networks
Hard Labels In! Rethinking the Role of Hard Labels in Mitigating Local Semantic Drift
Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away
Geometric Coherence Learning for Structuring Value Functions in Plain MDPs
VDW-GNNs: Vector diffusion wavelets for geometric graph neural networks
Harmful Overfitting in Sobolev Spaces
E²I-VRWKV: Explicit EPI-Representation and Interaction-Aware Vision-RWKV for Light Field Semantic Segmentation
MCCE: A Framework for Multi-LLM Collaborative Search in Discrete Spaces with Similarity-Filtered Preference Learning
Rethink the Role of Neural Decoders in Quantum Error Correction
Steering at the Source: Style Modulation Heads for Robust Persona Control
Approximating Drift-Diffusion Models for User Decisions under Nudging and External Information
DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving
ParisKV: Fast and Drift-Robust KV-Cache Retrieval for Long-Context LLMs
On the Convergence of Decentralized Stochastic Minimax Optimization Algorithm with Compressed Communication
Principled SVD-based Delta Compression via Quantization Error Minimization
Energy-based Compositional Diffusion Planning
Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation
Self-Captioning Multimodal Interaction Tuning: Amplifying Exploitable Redundancies for Robust Vision Language Models
Beyond Accuracy: Latent Perturbations for Cognitive-Aware Diagnosis
Position: AI Lock-In Is in Progress, and We Must Be Prepared
Quantifying and Optimizing Simplicity via Polynomial Representations
Fast Spectrally Sparse Signal Reconstruction via Jacobi-Preconditioned Gradient Descent
Column Thresholding for Sparse Spiked Wigner Models: Improved Signal Strength Requirements
Improving Classifier-Free Guidance of Flow Matching via Manifold Projection
Non-Uniform Noise-to-Signal Ratio in the REINFORCE Policy-Gradient Estimator
GeoFlow: Geo-Aware Modeling of Inter-Area Relationships in OD Flow Prediction and Generation
MMBench-Live: A Continuously Evolving Benchmark for Multimodal Models
Population-Free Pareto Tracking for Sample-Efficient Multi-Policy MORL
CoGeoAD: Hierarchical Color-Geometric Fusion with Multi-View Attention for Zero-Shot 3D Anomaly Detection
InfoGeo: Information-Theoretic Object-Centric Learning for Cross-View Generalizable UAV Geo-Localization
Thinking in Scales: Accelerating Gigapixel Pathology Image Analysis via Adaptive Continuous Reasoning
Two Modalities Are Better Than One: Efficient Adversarial Purification via Multimodal Diffusion Models
Speculative Safety Honeypot: Toward Proactive Defense Against Multi-turn Agent Attacks
Batch Normalization for Neural Networks on Complex Domains
From Feasible to Practical: Pareto-Optimal Synthesis Planning
Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Compression
SymSpectra: Symmetric Information Bottleneck Framework for Molecular Structure Recognition under Imbalanced Settings
Learning to Rank from Incomplete Rankings
FiSeR: Fine-Grained Source Representations for Cross-Domain AI Image Detection
From Human Labels to Literature: Semi-Supervised Learning of NMR Chemical Shifts at Scale
Scaling Laws for Precision in High-Dimensional Linear Regression
Measuring Intent Comprehension in LLMs
DenseMLLM: Standard Multimodal LLMs are Intrinsic Dense Predictors
Around the World in Eighty Ratings? Quantifying the Salience of Geo-Cultural Values for Pluralistic Alignment
RoboOmni: Actions Are Just Another Modality for Your Vision-Language Models
Sparse and Faithful Local Explanations with Piecewise Linear Surrogates
The Extra Tokens Matter: Disentangled Representation Learning with Vision Transformers
Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting
Beyond Tokens: Enhancing RTL Quality Estimation via Structural Graph Learning
RDT2: Exploring the Scaling Limit of UMI Data Towards Zero-Shot Cross-Embodiment Generalization
Error Propagation in Dynamic Programming: From Stochastic Control to American Option Pricing
Curriculum Reinforcement Learning for Black-Box Prompt Tuning via Large Language Models
From Optimization to Generalization under Heavy-Tailed Data: The Role of Gradient Clipping
GLAD: Bidirectional Structure-Attribute Alignment via Latent Graph Diffusion Models
Towards Steering without Sacrifice: Principled Training of Steering Vectors for Prompt-only Interventions
Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases
Divergence Decoding: Targeted Unlearning via Auxiliary Models
Beyond Structural Symmetries: Linear Mode Connectivity via Neuron Identifiability
Lightning Unified Video Editing via In-Context Sparse Attention
Dual-Latent Memory Routing for Vision-Language Reasoning
Federated Data and Feature Selection by Generalized CUR Decomposition
UltraLIF: Fully Differentiable Spiking Neural Networks via Ultradiscretization and Max-Plus Algebra
Riemannian Diffusion Models on General Manifolds via Physics-Informed Neural Networks
HELIX: Hybrid Encoding with Learnable Identity and Cross-dimensional Synthesis for Time Series Imputation
Head-in-Head in Linear Attention
SE(n)-Invariant Flow Matching: A General Framework with Application to Object Reassembly
OMP: One-step Meanflow Policy with Directional Alignment
Federated Sketching LoRA: A Flexible Framework for Heterogeneous Collaborative Fine-Tuning of LLMs
Flatness-Aware Stochastic Gradient Langevin Dynamics
BALLAST: Bayesian Active Learning with Look-ahead Amendment for Sea-drifter Trajectories under Spatio-Temporal Vector Fields
Improving LLM-Based Recommenders with Conservative Generative Flow Networks
A Single Layer to Explain Them All: Understanding Massive Values in Large Language Models
Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling
Graph Alignment via Dual-Pass Spectral Encoding and Latent Space Communication
Diving into Kronecker Adapters: Component Design Matters
Operator Splitting with Hamilton-Jacobi-based Proximals
Depth-Progressive Monotonic Learning without Backpropagation
Outcome-Aware Spectral Feature Learning for Instrumental Variable Regression
Romberg-Extrapolated Zeroth-Order Gradient Estimator: Higher-Order Bias Reduction with Preserved Leading Directional Variance
Set Diffusion: Interpolating Token Orderings between Autoregression and Diffusion for Fast and Flexible Decoding
Follow-the-Perturbed-Leader for Decoupled Bandits: Best-of-Both-Worlds and Practicality
From Growing to Looping: A Unified View of Iterative Computation in LLMs
IEC: When Information-Driven Exploration Meets Spectral Consensus via Primal–Dual Reward Regularization in Decentralized Multi-Agent RL
TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation
DreamDojo: A Real-Time Robot World Model from Large-Scale Human Videos
From Denoising to De-Channeling: Integrating Physical Channel Priors into Diffusion Models for Radio Signal Understanding
VIP: Visual-guided Prompt Evolution for Efficient Dense Vision-Language Inference
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants
Stochastic Sparse Attention for Memory-Bound Inference
Reusing Trajectories in Policy Gradients Enables Fast Convergence
d2: Improved Techniques for Training Reasoning Diffusion Language Models
Scalable and Interpretable Representation Alignment with Ordinal Similarity
Divisiveness-Consistent Label Distribution Learning
VLA-Arena: An Open-Source Framework for Benchmarking Vision-Language-Action Models
DREAM: A Unified Framework for Drift-Corrected Federated Multi-Objective Learning
Accurate Evaluation of Quickest Changepoint Detectors via Non-parametric Survival Analysis
Stability beyond bounded differences: sharp generalization bounds under finite $L_p$ moments
What Makes a Desired Graph for Relational Deep Learning?
Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models
No More K-means: Single-Stage Sparse Coding for Efficient Multi-Vector Retrieval
LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning
Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation
ThunderAgent: A Fast, Simple, and Program-Aware Agentic Inference System
Graph Neural Networks Are Not Continuous Across Graph Resolutions
DFSAttn: Dynamic Fine-grained Sparse Attention for Efficient Video Generation
Negative Sampling From the Ground Up: A Redesign for Graph-based Recommendations
Achieving Logarithmic Regret in KL-Regularized Zero-Sum Markov Games
Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing
Pushing Forward Pareto Frontiers of Proactive Agents with Behavioral Agentic Optimization
Cross-Subject Modeling for Widefield Calcium Imaging via Atlas-Aligned Spatiotemporal Tokenization
AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications
Platonic Transformers: A Solid Choice For Equivariance
Learning to Bet for Horizon-Aware Anytime-Valid Testing
MusicDET: Zero-Shot AI-Generated Music Detection
DiLA: Disentangled Latent Action World Models
ProMeCD: Unifying Long-Tailed and Noisy Label Learning via White-Box Control
MuonSSM: Orthogonalizing State Space Models for Sequence Modeling
PACE: Post-Causal Entropy Modeling for Learned LiDAR Point Cloud Compression
Which Algorithms Can Graph Neural Networks Learn?
Structure Abstraction and Generalization in a Hippocampus-Entorhinal Inspired World Model
From Interaction Trajectories to Prompt Rules: Credit Assignment for Multi-Agent Prompt Optimization
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
ProcMEM: Learning Reusable Procedural Memory from Experience via Non-Parametric PPO for LLM Agents
Spatio-Temporal LLM: Reasoning about Environments and Actions
Improving CLIP Adaptation by Breaking Tail Alignment for Source-Free Cross-Domain Few-Shot Learning
SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?
Scalable Kronecker-Factored Fisher Approximation for Neural Network Parameter Sensitivity
TarGATE: Target-Aware Data Selection via Token-Attenuation Gates
Learning to Memorize with Attributive and Associative Memory for Online Test-Time Adaptation of Vision-Language Models
POLIA: Policy Optimization with Visual-Object-Level Intrinsic Advantage for Multimodal Reasoning
GRASP: Graph Reasoning via Agentic Solving and Probing of LLMs
Mitigating the Modality Gap in Vision–Language Models with Fractal Spectral Geometry
Rethinking Personalization in Large Language Models at the Token Level
Local MAP Sampling for Diffusion Models
Learning Compressed Shape-Aware Molecular Representations for Virtual Screening
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
Towards Universal Gene Regulatory Network Inference: Unlocking Generalizable Regulatory Knowledge in Single-cell Foundation Models
HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel
SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning
HieraMAS: Optimizing Intra-Node LLM Mixtures and Inter-Node Topology for Multi-Agent Systems
Learning syntax without semantics: Disentangled tiny language models
RetrOrchestrator: A Multi-Step Retrosynthesis Agent Dynamically Orchestrating Single-Step Transition Models
Interpretable Functional Koopman Learning with Non-Markovian Closure for Spatiotemporal Systems
Rethinking Video Generation Model for the Embodied World
Compress then Merge: From Multiple LoRAs into One Low-Rank Adapter
Scalable Option Learning in High-Throughput Environments
Adaptive Group Elicitation via Multi-Turn LLM Interactions
Frequentist Consistency of Prior-Data Fitted Networks for Causal Estimation
Adaptive Token Refinement in Long-Tailed Large Vision-Language Models Fine-Tuning
Scalable Bayesian Semi-supervised Clustering with Feature Selection and Adaptive Constraint Weighting
Quantum latent distributions in deep generative models
PatternKV: Flattening KV Representation Expands Quantization Headroom
CoDA-Bench: Can Code Agents Handle Data-Intensive Tasks?
Benchmarking Physics-Informed Time-Series Models for Operational Global Station Weather Forecasting
Ask Less, See More: Communication-Conditioned Token Pruning for Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models
DeltaEvolve: Accelerating Scientific Discovery through Momentum-Driven Evolution
When LLMs Develop Languages: Symbolic Communication for Efficient Multi-Agent Reasoning
Convex Low-resource Accent-Robust Language Detection in Speech Recognition
Improving Explicit Dynamic Gaussian Splatting Optimization via Update Mixture
WinDeskGround: A Benchmark for Robust GUI Grounding in Complex Multi-Window Desktop Environments
AdaEraser: Training-Free Object Removal via Adaptive Attention Suppression
How (Not) to Hybridize Neural and Mechanistic Models for Epidemiological Forecasting
Benchmarking World-Model Learning with Environment-Level Queries
From Shortcuts to Reasoning: Robust Post-Training of Theory of Mind with Reinforcement Learning
Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection
Attention Implements the Fisher Geometry of Exponential Families
Efficient Reasoning with Hidden Thinking
Decomposing Out-of-Distribution Error in Conditional Flow Matching via Wasserstein Geometry
Accelerating Regression Tasks with Quantum Algorithms
Clustering in Deep Stochastic Transformers
Neuro-evolutionary Continual Reinforcement Learning
Extending Fair Null-Space Projections for Continuous Attributes to Kernel Methods
Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation
Towards Theoretical Understanding of Transformer Test-Time Computing: Investigation on In-Context Linear Regression
Learning Cardiac Latent Representations in Vectorcardiogram Space
One Bug, Hundreds Behind: LLMs for Large-Scale Bug Discovery
Depth over Fidelity in Fixed-Budget Noisy Evolution Strategies
Latent Representation Alignment for Offline Goal-Conditioned Reinforcement Learning
Speculative Sampling For Faster Molecular Dynamics
Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models
Feature Resemblance: Towards a Theoretical Understanding of Analogical Reasoning in Transformers
Geometry-Aware Neural Optimizer for Shape Optimization and Inversion
Domain Transfer Becomes Identifiable via a Single Alignment
Hermite-NGP: Gradient-Augmented Hash Encoding for Learning PDEs
Conditional Quantile Adjusted Conformal Prediction for Time Series
When Preference Labels Fall Short: Aligning Diffusion Models from Real Data
TwinWeaver: An LLM-Based Foundation Model Framework for Pan-Cancer Digital Twins
Multimodal Scaling Laws for Task & Data-Optimized Models of Visual Cortex
Context-Aware Reaonser : Enhancing Contextual Reasoning in Multimodal Large Language Models
Decision-focused Sparse Tangent Portfolio Optimization
Data Reconstruction: Identifiability and Optimization with Sample Splitting
Joint Navigation and Manipulation Planning with 3D Interaction Chains
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
MER-DG: Modality-Entropy Regularization for Multimodal Domain Generalization
Understanding Private Learning From Feature Perspective
Position: Topological Machine Learning Cannot Progress without Experimental Standards
BPL: Generalizable Deepfake Detection via Bias-only Pair-aware Learning
CURE: Consistency-under-Unified Semantic Regularization for Generalized Category Discovery
AutoMS: Multi-Agent Evolutionary Search for Cross-Physics Inverse Microstructure Design
Expectation Alignment of Language Models for Real-World User Expectations
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
CiteGuard: Conformal False-Discovery Control for Faithful Retrieval-Augmented Generation
Eating for a Sustainable Planet: Personalized Sustainable Diet Recommendation via Constraint-Aware Decision-Making Modeling
Possibilistic Predictive Uncertainty for Deep Learning
Geometric Flow Grounding: A Unified Manifold Decoupling Framework for Dynamics Discovery and Verification
WorldCompass: Reinforcement Learning for Long-Horizon World Models
Asymmetric Contrastive Objectives for Efficient Phenotypic Screening
Adaptive Recurrent Message Passing for Test Time Computing on Graphs
Position: AI for Science Should Treat Measurement-to-Dataset Pipelines as Inference Components
Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift
DeepBlip: Estimating Conditional Average Treatment Effects Over Time
New Wide-Net-Casting Jailbreak Attacks Risk Large Models
Bottleneck-Guided Spectral Subgoals For Offline Goal-Conditioned RL
Nonparametric Data Attribution for Diffusion Models
SPUR: Scale-Partitioned Uncertainty Rectification for Robust UAV-on-UAV Interception
STT-LLM: Structural-Temporal Tokenization for Adapting LLMs to Longitudinal Clinical Profiles
Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models
Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization
Revisiting Asymmetries in Black-box Link Stealing against Graph Neural Networks
FoundObj: Self-supervised Foundation Models as Rewards for Label-free 3D Object Segmentation
Reasoning Cache: Learning to Extrapolate to Long Lengths via Short-Length RL
Beyond Model Ranking: Predictability-Aligned Evaluation for Time Series Forecasting
Position: Don't Just "Fix it in Post'': A Science of AI Must Study Learning Dynamics
Infinite-dimensional generative diffusions via Doob's h-transform
Breaking the Self-Confirming Loop: Diagnosing and Mitigating Systemic Reward Bias in Self-Rewarding RL
CryoACE: An Atom-centric Framework for Accurate and Automated Model Building in Cryo-EM
Scale-Aware Domain Harmonization for Domain Adaptation Person Search
Theoretical Investigation on Inductive Bias of Isolation Forest
ConEx: Human-Interpretable Saliency Maps via Concept-Aware Attribution
Beyond Heuristics: Learnable Density Control for 3D Gaussian Splatting
What Makes Effective Supervision in Latent Chain-of-Thought: An Information-Theoretic Analysis
Amodal Instance Segmentation with IRAIS Dataset for Sim-to-Real Transfer
Vision in One Vector: Implicit Visual Compression with Diffusion Foundation Models
InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context
From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment
Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning
ClinTutor-R1: Advancing Scalable and Robust One-to-Many Alignment in Clinical Socratic Education
Trajectory-Aware Spiking DiTs Conversion via Membrane Potential Error-Feedback
Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning
On Stable Long-Form Generation: Benchmarking and Mitigating Length Volatility
OneSearch: A Preliminary Exploration of the Unified End-to-End Generative Framework for E-commerce Search
SE(3)-Equivariant Flow Matching with Gaussian Process Priors for Geometric Trajectory Prediction
Compute When Worth It: Risk Control for Reasoning on a Compute Budget
UltraHorizon: Benchmarking LLM-Agent Capabilities in Ultra Long-Horizon Scenarios
Sample Complexity Bounds for Robust Mean Estimation with Mean-Shift Contamination
ConvexBench: Can LLMs Recognize Convex Functions?
A Positive Case for Faithfulness: Explanations Help Predict Model Behavior
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
On Training Large Language Models for Long-Horizon Tasks: An Empirical Study of Horizon Length
Leveraging Evidence Priors for Robust Prompt Learning under Noisy Supervision in Vision-Language Models
Forget by Uncertainty: Orthogonal Entropy Unlearning for Quantized Neural Networks
MotiMotion: Motion-Controlled Video Generation with Visual Reasoning
Reward Modeling from Natural Language Human Feedback
TT-Sparse: Learning Sparse Rule Models with Differentiable Truth Tables
Muon in Associative Memory Learning: Training Dynamics and Scaling Laws
Covariance estimation using Markov chain Monte Carlo
Why ReLU? A Bit-Model Dichotomy for Deep Network Training
Estimating Tail Risks in Language Model Output Distributions
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
Instruction Decomposition and Action Alignment for Vision-Language Navigation
Improved Dimension Dependence for Bandit Convex Optimization with Gradient Variations
Online Conformal Prediction via Universal Portfolio Algorithms
Condition Number Based Low-Bit Quantization for Image Super-Resolution
FedPAT: Federated Test-Time Adaptation via Prototype Affinity Topology
Bayesian Rain Field Reconstruction using Commercial Microwave Links and Diffusion Model Priors
Interpretable Neural ODEs for Gene Regulatory Network Discovery under Perturbations
Is Graph Mixup Beneficial? Investigating Interpolation And Empirical Performance of Graph Mixup Methods
TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution
Revisiting Zeroth-Order Hessian Approximation: A Single-Step Policy Optimization Lens
ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution
G$^2$RPO: Geometric GRPO; Escaping LLM's Reasoning Rut to Break Accuracy--Entropy Trade-off
LLM-based Embeddings: Attention Values Encode Sentence Semantics Better Than Hidden States
Supervised Graph Contrastive Learning for Gene Regulatory Networks
Position: The Privacy-Auditability Paradox in Federated Learning: Why We Need Controllable Secure Aggregation
WorldComp2D: Spatio-semantic Representations of Object Identity and Location from Local Views
Abstraction Induces the Brain Alignment of Language and Speech Models
VocSim A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio
Star Elastic: Many-in-One Reasoning LLMs with Efficient Budget Control
Counterfactual Residual Data Augmentation for Regression
Anytime Safe PAC Efficient Reasoning
MIDSTEER: Optimal Affine Framework for Steering Generative Models
SafeCompass: Dynamic Chain-of-Thought Steering via Inference-Time Safety Signals
RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry
AlgoVeri: An Aligned Benchmark for Verified Code Generation on Classical Algorithms
Identifiable Smooth Conjugacy Learning via Adversarial Orthogonality
Noise-corrected GRPO: From Noisy Rewards to Unbiased Gradients
PixCLIP: Towards Fine-grained Vision-Language Understanding via Any-granularity Pixel-Text Alignment
On Uniform Error Bounds for Kernel Regression under Non-Gaussian Noise
Learning Multi-Agent Coordination via Sheaf-ADMM
VideoSeeker: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning
ReasonEdit: Editing Vision--Language Models using Human Reasoning
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Dataset Distillation Efficiently Encodes Low-Dimensional Representations from Gradient-Based Learning of Non-Linear Tasks
Dynamic Stratified Contrastive Learning with Upstream Augmentation for MILP Branching
Bridging On-Device and Cloud LLMs for Collaborative Reasoning: A Unified Methodology for Local Routing and Post-Training
Distinguishing Imitation Error from Intrinsic Motion Learning Difficulty
Towards Understanding Generalization of Federated Adversarial Learning: Perspective of Algorithmic Stability
LAMP: Data-Efficient Linear Affine Weight-Space Models for Parameter-Controlled 3D Shape Generation and Extrapolation
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents
TQL: Scaling Q-Functions with Transformers by Preventing Attention Collapse
Entropy-informed Decoding: Adaptive Information-Driven Branching
FuseFSS: Efficient Secure LLM Inference with Function Secret Sharing
Learning Fingerprints for Medical Time Series with Redundancy-Constrained Information Maximization
COLLIE: Guiding Skill Discovery in Semantically Coherent Latent Space
Dive into the Scene: Breaking the Perceptual Bottleneck in Vision-Language Decision Making via Focus Plan Generation
CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict
STEP: Warm-Started Visuomotor Policies with Spatiotemporal Consistency Prediction
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
Seeing is Solving: Unlocking Efficient Multimodal RL via View Alignment
On Learnability and Disambiguation of Multiclass Partial Concept Classes
Same Question, Different Lies: Cross-Context Consistency (C³) for Black-Box Sandbagging Detection
A Deep Learning Model of Mental Rotation Informed by Interactive VR Experiments
Taming the Aleatoric Impulse in Off-Policy Reinforcement Learning
Variable Clustering via Distributionally Robust Nodewise Regression
TraCeS: Learning Per-Timestep Constraint-Violation Credit from Sparse Trajectory-Level Labels
Coverage ≠ Exposure: Auditable Control of Same-Support Tail Failures under Multimodal Missingness
Tight Stability Bounds for Robust Distributed Learning: Byzantine Failures Hurt Generalization More than Data Poisoning
Learning Junta Distributions, Quantum Junta States, and QAC$^0$ Circuits
Disease-Centric Vision-Language Pretraining with Hybrid Visual Encoding for 3D Computed Tomography
Estimating Correlation Clustering Cost in Node-Arrival Stream
AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning
(Doubly) Exponential Lower Bounds for Follow the Regularized Leader in Potential Games
MMKU-Bench: A Multimodal Update Benchmark for Diverse Visual Knowledge
TaRO: Temporal-Aware Reasoning Optimization for Video Temporal Grounding
On the Theory of Continual Learning with Gradient Descent for Neural Networks
Language Models as Nodes: Constructing a High-Level Neural Network
Sample Efficient Full-Finetuning of Generative Control Policies
Backjump-on-Graph: Empowering LLMs with Reinforced Retrospective Exploration for Agentic KG Reasoning
Spectral Imbalance Causes Forgetting in Low-Rank Continual Adaptation
Can Recommender Systems Teach Themselves? A Recursive Self-Improving Framework with Fidelity Control
Relational In-Context Learning via Synthetic Pre-training with Structural Prior
Protein Design with Agent Rosetta: A Case Study for Specialized Scientific Agents
Proactive Defense Benchmark against Deepfake Generation
Detecting and Filtering Unsafe Training Data via Data Attribution with Denoised Representation
Learning Rate Annealing Improves Tuning Robustness in Stochastic Optimization
LazyAttention: Efficient Retrieval-Augmented Generation with Deferred Positional Encoding
Enhancing Neural Theorem Proving via High-Quality Proof Selection and Verifier Feedback
MetaStreet: Semi-Supervised Multimodal Learning for Street-Level Socioeconomic Prediction
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
Stabilizing Native Low-Rank LLM Pretraining
NanoSpec: Accelerating Speculative Decoding using Minimalist In-Context Vocabularies
Advantage Collapse in Group Relative Policy Optimization: Diagnosis and Mitigation
Coordinated Disentanglement with Iterative Mode Discovery Under Hidden Correlations
Implicit Safety Alignment from Crowd Preferences
Pluralistic Leaderboards
Federated Multi-view Clustering for Remote Sensing Data
Position: If open source is to win, it must go public
L2G-NET: Local to Global Spectral Graph Neural Networks via Cauchy Factorizations
Expanding the AI Evaluation Toolbox with Statistical Models
LMM4-IC4K: A Large Multimodal Model Powered Integrated Circuit Footprint Geometry Understanding
Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic
DECOR: Learning to Decompose and Collaborate in Deep Search via Multi-Agent Reinforcement Learning
Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression
KnapSpec: Self-Speculative Decoding via Adaptive Layer Selection as a Knapsack Problem
FedCDWA: Decoupled Federated Prototype Distillation with Hierarchical Wasserstein Aggregation
Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key to Unification
SpecPrune-VLA: Accelerating Vision-Language-Action Models via Action-Aware Self-Speculative Pruning
Temporal Difference Learning for Diffusion Models
When Tabular Foundation Models Meet Strategic Tabular Data: A Prior Alignment Approach
Test-Time Training Is Secretly Linear Attention
SLIP-RS: Structured-Attribute Language-Image Pre-Training for Remote Sensing Object Detection
Flow Equivariant World Models: Structured Memory for Dynamic Environments
Tractable Expected Information Gains for Exponential Family Posteriors
Talk, Judge, Cooperate: Gossip-Driven Indirect Reciprocity in Self-Interested LLM Agents
Seizure-Semiology-Suite($S^3$): A Clinically Multimodal Dataset, Benchmark, and Models for Seizure Semiology Understanding
Censoring with Plausible Deniability: Asymmetric Local Privacy for Multi-Category CDF Estimation
FOCUS: DLLMs Know How to Tame Their Compute Bound
Rethinking Calibration for Early-Exit Neural Networks
Lightweight and Interpretable Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast
Position: RL Should Be Used to Adjust Foundation Models, NOT Abused
Hallucination Detection from Structural Reasoning Model
Recursive Models for Long-Horizon Reasoning
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity
Upper-Linearizability of Online Non-Monotone DR-Submodular Maximization over Down-Closed Convex Sets
TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation
Layer-wise Gradient Disentanglement: Decoupling Semantics and Preferences in Direct Preference Optimization
PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks
FedSDR: Federated Self-Distillation with Rectification
STAND: Self-Aware Precondition Induction for Interactive Task Learning
Problem Distributions as Tasks: Repurposing Meta Learning for Generative Combinatorial Optimization towards Multi-task Pretrain and Adaptation
Test-time Offline Reinforcement Learning on Goal-related Experience
Learning High-Dimensional Parity Functions with Product Networks using Gradient Descent
Is Spurious Correlation Removal Always Learnable?
$R^3$DAO: Reactive Recovery and Reconstruction for Long-horizon Data Agent Orchestration
Local Constrained Bayesian Optimization
Sharp Inequalities between Total Variation and Hellinger Distances for Gaussian Mixtures
Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion
Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation
Ariadne's Thread of LipSync: Unraveling Forgeries via Inconsistency between Lip Motions and Head Poses
Towards a Unified Generative Model for Scarce Time Series with Domain Experts
MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems
Reason with Thumbnails, Answer with Focus: An Efficient and Effective Paradigm for Multimodal Grounded Visual Reasoning
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention
MAnchors: Memorization-Based Acceleration of Anchors via Rule Reuse and Transformation
Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality
Bridging Local–Global Dissonance: Learning from Compressive Measurements for Hyperspectral Reconstruction
Efficient Public Verification of Private ML via Regularization
Quaternion Self-Attention with Shared Scores
Multi-view Consistent Latent Action Learning for World Modeling and Control
Beyond Independence: Learning Correlated Views for Variational Incomplete Multi-View Clustering
Active Learning with Low-Rank Structure for Data Selection
SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning
Functional Decomposition and Shapley Interactions for Interpreting Survival Models
Position: Fairness Failure in Generative Models is an Evaluation Problem
DRL-STAF: A DRL Framework for State-aware Forecasting of Complex Multivariate Hidden Markov Process
Diffeomorphism-Equivariant Neural Networks
Generalizing Multi-Scale Time-Series Modeling with a Single Operator
Noisy-Space Policy Gradient for Diffusion Policies in Offline Reinforcement Learning
Agentic Model Predictive Questioning Control in Visual Design
Rapid Poison: Practical Poisoning Attacks Against the Rapid Response Framework
Prediction-Powered Risk Monitoring of Deployed Models for Detecting Harmful Distribution Shifts
Q-Delta: Beyond Key–Value Associative State Evolution
Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners
Structure Enables Effective Self-Localization of Errors in LLMs
Locally Coherent Parallel Decoding in Diffusion Language Models
Functional Cache Grafting: Robust and Rapid Code-Policy Synthesis for Embodied Agents
Partial Fusion of Neural Networks: Efficient Tradeoffs Between Ensembles and Weight Aggregation
SorryDB: Can AI Provers Complete Real-World Lean Theorems?
VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding
CodeMamba: Shifting from Target Semantics to Self-Supervised Background Manifold Learning for Singularity Detection in Infrared Sequences
Draft-Conditioned Constrained Decoding for Structured Generation in LLMs
CAOS: Conformal Aggregation of One-Shot Predictors
Supervised Guidance Training for Infinite-Dimensional Diffusion Models
BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models
Gauge-Equivariant Graph Networks via Self-Interference Cancellation
MoVie: Multimodal Video Compression with Text Guidance
Are First-Order Diffusion Samplers Really Slower? A Fast Forward-Value Approach
Steer Where It Matters: Token-Level Visual-Sensitivity Steering for LVLMs Hallucination Mitigation
Reinforcing Real-world Service Agents: Balancing Utility and Cost in Task-oriented Dialogue
Why Agentic Theorem Prover Works: A Statistical Provability Theory of Mathematical Reasoning Models
GaussTrace: Provenance Analysis of 3D Gaussian Splatting Models with Evidence-based LLM Reasoning
Efficient Parallel Samplers for Recurrent-Depth Models
BOCLOAK: Optimal Transport-Guided Adversarial Attacks on Graph Neural Network-Based Bot Detection
PGD-NO: A Neural Operator with Precomputed Geometry Decomposition for 3D Million-Scale physics simulations
Watermarking LLM Agent Trajectories
Branch Scaling Manifests as Implicit Architectural Regularization for Improving Generalization in Overparameterized ResNets
Conservation Laws for Modern Neural Architectures
Beyond Distribution Estimation: Simplex Anchored Structural Inference Towards Universal Semi-supervised Learning
Boosting World Models Learning via Latent-Space Value Alignment
MoDA: Modulation Adapter for Fine-Grained Visual Understanding in Instructional MLLMs
Relational Structural Causal Models
Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
FRACTAL: State Space Model with Fractional Recurrent Architecture for Computational Temporal Analysis of Long Sequences
Contrastive Symbolic Regression: Aligned Representations, Adaptive Prediction, and Diverse Ensembles
Dimension-Independent Convergence of Underdamped Langevin Monte Carlo in KL Divergence
dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching
Meta Context Engineering via Agentic Skill Evolution
Enhancing Cross-subject Emotion Recognition via Heterogeneous Distribution Augmentation and Collaborative Learning
Provable Benefits of RLVR over SFT for Reasoning Models: Learning to Backtrack Efficiently
SPEED: Sharpened-Teacher Distillation for Parallel Decoding of Diffusion Language Models
Echoes within the Reasoning: Stealth and Effective Watermarking via Chain of Thought
Reinforcement Learning for Tool-Calling Agents in Fast Healthcare Interoperability Resources (FHIR)
Error Amplification Limits ANN-to-SNN Conversion in Continuous Control
Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation
The Geometric Reasoner: Manifold-Informed Latent Foresight Search for Long-Context Reasoning
Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing
*MemPot*: Defend Against Memory Extraction Attack with Optimized Honeypots
Equivariant Latent Alignment via Flow Matching under Group Symmetries
Cross-Modal Knowledge Distillation without Paired Data: Theoretical Foundations and Algorithms
Particle Flow for Learning from Label Proportions
Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units
Evaluating Robustness of Reasoning Models on Parameterized Logical Problems
Variational Learning of Disentangled Representations
PRISM: Demystifying Retention and Interaction in Mid-Training
Finding Most Influential Sets
Causal Dependency-Aware Unsupervised Routing for Large Reasoning Models
AOEPT: Breaking the Implicit Modality-Reduction Bottleneck in Modality Missing Prompt Tuning
SpaCeFormer: Space-Curve Transformer for Open-Vocabulary 3D Instance Segmentation without Proposals
COGNOS: Universal Enhancement for Time Series Anomaly Detection via Constrained Gaussian-Noise Optimization and Smoothing
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
Protein Language Model Embeddings Improve Generalization of Implicit Transfer Operators
Normalized Energy Models for Linear Inverse Problems
On the Expressive Power of Permutation-Equivariant Weight-Space Networks
Learning Molecular Semantic Invariant Representation with Prototype Constraint
Constrained Adaptive Rejection Sampling
What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
V1: Unifying Generation and Self-Verification for Parallel Reasoners
Knothe-Rosenblatt Quantile Regression for Risk-sensitive Multi-objective Reinforcement Learning
Spatial-Aware Reduction Framework: Towards Efficient and Faithful Visual State Space Models
SARL: Structure-Aligned Reinforcement Learning for Bridging the Perception-Action Gap in Airspace
Where Detectors Fail: Probing Generative Space for Generalizable AI-Generated Image Detection
OSM+: Billion-Level Open Street Map Dataset for City-wide Experiments
TimeLAVA: Learning-Agnostic Valuation for Time Series Data
Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding
Measurement-Consistent Langevin Corrector for Stabilizing Latent Diffusion Inverse Problem Solvers
Intrinsic Task Symmetry Drives Generalization in Algorithmic Tasks
Advancing Analytic Class-Incremental Learning through Vision-Language Calibration
InstEmb: Instruction-Following Embeddings through Glimpses of the Future
SubspacePath Pruner: Inference-time Pruning via Probe-based Representation–Parameter Coupling
From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning
Granularity-Aware Adaptive Classifier Expansion via Zero-Shot Learning
One LR Doesn’t Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs
Subspace-Aware Feature Reshaping for Open-Set Graph Class-Incremental Learning
Automatic Unsupervised Ensemble Outlier Model Selection
Controlled LLM Training on Spectral Sphere
LOZO+: Provably Efficient Zeroth-Order Fine-Tuning via Greedy Low-Rank Subspace Selection
CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning
Bring My Cup! Personalizing Vision-Language-Action Models with Visual Attentive Prompting
Milestone-Guided Policy Learning for Long-Horizon Language Agents
Can Adaptive Gradient Methods Converge under Heavy-Tailed Noise? A Case Study of AdaGrad
Stationary MMD Points
Enhancing Membership Inference Attacks on Diffusion Models from a Frequency-Domain Perspective
cuRegOT: A GPU-Accelerated Solver for Entropic-Regularized Optimal Transport
DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model
Fast kernel methods: Sobolev, physics-informed, and additive models
SafeSpec: Fast and Safe LLM via Dynamic Reflective Sampling
Chiral Symmetry Breaking in Transformers: A Group-Equivariant Framework for Solving the Reversal Curse via Adjoint Manifold Mappings
SilentWood: Efficient Private Inference Over Gradient Boosting Decision Forests
Endogenous Resistance to Activation Steering in Language Models: Evidence for Internal Consistency Monitoring in Llama-3.3-70B
Explanations are a Means to an End: A Value of Information Framework for Validating Explanations
KAGE-Bench: Fast Known-Axis Visual Generalization Evaluation for Reinforcement Learning
Nonparametric Distribution Regression Re-calibration
RADAR: Defending RAG Dynamically against Retrieval Corruption
Adaptive Reinforcement Learning for Unobservable Random Delays
UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer
Training Deep Spiking Neural Networks without Normalization
Graph Neural Dynamics via Learned Energy and Tangential Flows
VAnim: Rendering-Aware Sparse State Modeling for Structure-Preserving Vector Animation
Transfer Learning in High-dimensional Ising Models
Scalable and Differentiable Point-Cloud Registration Using Maximum Mean Discrepancy
Masks Can Be Distracting: On Context Comprehension in Diffusion Language Models
Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control
Stochastic Neural Ray Tracing for Radio Frequency Channel Modeling
Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning
A General Framework for Fair and Robust Regression
The First Drop of Ink: Nonlinear Impact of Misleading Information in Long-Context Reasoning
Collaborative Disagreement Resolution for Scalable Oversight
CausalArmor: Efficient Indirect Prompt Injection Guardrails via Causal Attribution
Implicit Action Chunking for Smooth Continuous Control
Beyond Sunk Costs: Boosting LLM Pre-training Efficiency via Orthogonal Growth of Mixture-of-Experts
Mesh Based Simulations with Spatial and Temporal awareness
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
Efficient Prediction of SO(3)-Equivariant Hamiltonian Matrices via SO(2) Local Frames
Unveiling the Entropy Dynamics of Chain-of-Thought Reasoning
Bootstrapped Exploration with Causal Reasoning: A Training Paradigm for Adaptive Forecasting Agent
A Control-Theoretic View of Mamba on Stability and Robustness
RulePlanner: All-in-One Reinforcement Learner for Unifying Design Rules in 3D Floorplanning
Sketch-Based Low-Rank Model Merging with Shared Circulant Transforms
Dynamic Linear Attention
Causal Feature Learning via Generalized Rayleigh Quotients
Probing Newtonian Mechanics in Video Generative Models with Real Physical Systems
Rel-MOSS: Towards Imbalanced Relational Deep Learning on Relational Databases
HiPER: Hierarchical Plan–Execute RL for Multi-Turn LLM Agents
Enhancing LLM Training via Spectral Clipping
Breaking the Factorization Barrier in Diffusion Language Models
Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces
Prompt Tuning for CLIP on the Pretrained Manifold
GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation
Robust Bayes-Assisted Conformal Prediction
Bayesian model selection and misspecification testing in imaging inverse problems only from noisy and partial measurements
Shuffling-Aware Optimization for Private Vector Mean Estimation
Preserving Plasticity in Continual Learning via Dynamical Isometry
Not All Prefills Are Equal: PPD Disaggregation for Multi-turn LLM Serving
RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
A Decision-Theoretic View of Test-Time Training: When, How Far, and Which Directions to Adapt
SOPE: Situation-Aware and Statistically Indistinguishable Privacy Exfiltration for MCP-enabled Agents
IDLM: Inverse-distilled Diffusion Language Models
Last-Iterate Convergence of Regularized Gradient Methods for Stochastic Monotone Variational Inequalities
Capability-Oriented Training Induced Alignment Risk
Spectral–Spatial Mixing with Morphology-Aware Adaptive Loss for Medical Image Segmentation.
TileQ: Efficient Low-Rank Quantization of Mixture-of-Experts with 2D Tiling
Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization
Signal Strength Estimation in Logistic Regression Using Data Splitting
FourTune: Towards Fully 4-Bit Efficient Post-Training for Diffusion Models
APIC: Orthogonalized Neuro-Symbolic Modeling for Nonlinear Dissipative Dynamics
Localized, High-resolution Geographic Representations with Slepian Functions
Optimal Stopping in Latent Diffusion Models
Improving Neural Topic Modeling with Semantically-Grounded Soft Label Distributions
Deep Residual Injection for Full-Spectrum Forensic Signal Perception in Multimodal Large Language Models
Noise-Robust Density Estimation for Tabular Data Anomaly Detection
Real-Time Visual Attribution Streaming in Thinking Model
Convolutional Learnable-Group Weightless Neural Network
Kinematics-Driven Gaussian Shape Deformation for Blurry Monocular Dynamic Scenes
Manifold-Optimal Guidance: A Unified Riemannian Control View of Diffusion Guidance
Unbiased Alignment for Large Language Models with Noisy Preferences
Weight Updates as Activation Shifts: A Principled Framework for Steering
From Kepler to Newton: Inductive Biases Guide Learned World Models in Transformers
Minimax-Optimal Policy Regret in Partially Observable Markov Games
Hybrid Policy Distillation for LLMs
Beyond Generative Priors: Minority Sampling with JEPA-Guided Diffusion
Prioritized Model Experience Replay
HEXST: Hexagonal Shifted-Window Transformer for Spatial Transcriptomics Gene Expression Prediction
EchoingPixels: Aliasing-Resistant Joint Token Reduction for Audio-Visual LLMs
Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution
Size Transferability of Graph Convolutional Networks across Sparsity: A Generalized Graphon Perspective
HoloFair: Unified T2I Fairness Evaluation and Fair-GRPO Debiasing
Finite-Width Neural Tangent Kernels from Feynman Diagrams
How Far Can LLM Agents Reason with Tables? Benchmarking Multi-Turn Agentic Table Question Answering in the Wild
To Grok Grokking: Provable Grokking in Ridge Regression
Geometric Decoupling: Diagnosing the Structural Instability of Latent
From Token to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction
Identifying Latent Concepts and Structures for Generalized Category Discovery
Meta Flow Maps enable scalable reward alignment
Revisiting Uncertainty: On Evidential Learning for Partially Relevant Video Retrieval
Unified Time Series Explanations via Semi-Amortized Optimization and Instance-level Multi-Expert Knowledge Distillation
Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning
CodeClash: Benchmarking Goal-Oriented Software Engineering
Learning GUI Grounding with Spatial Reasoning from Visual Feedback
Recontextualization Mitigates Specification Gaming Without Modifying the Specification
How Transformers Represent Hierarchies: A Local-to-Global Mechanism
From Winning to Understanding: A Diagnostic Long-Horizon RTS Benchmark for LLMs
Leveraging Low-Rank Structures for High-Dimensional Score-Based Sampling
On the Infinite Width and Depth Limits of Predictive Coding Networks
RAST-MoE-RL: A Regime-Aware Spatio-Temporal MoE Framework for Deep Reinforcement Learning in Ride-Hailing
iWorld-Bench: A Benchmark for Interactive World Models with a Unified Action Generation Framework
A Geometry-Based View of Mahalanobis OOD Detection
Scaling by Diversified Experience for Vision-Language-Action Models
Noise-Guided Transport: Imitation Learning from Random Priors
Statistical Learning Theory in Lean 4: Empirical Processes from Scratch
Prism: Spectral-Aware Block-Sparse Attention
MAGIC: A Co-Evolving Attacker–Defender Adversarial Game for Robust LLM Safety
Tokenised Flow Matching for Hierarchical Simulation Based Inference
Attacks on Machine-Text Detectors Retain Stylistic Fingerprints
ProphetKV: User-Query-Driven Selective Recomputation for Efficient KV Cache Reuse in Retrieval-Augmented Generation
The Implicit Bias of Steepest Descent with Mini-batch Stochastic Gradient
The Sign Estimator: Preference Modeling for LLM Alignment under Heterogeneity
Trajectory Stitching for Solving Inverse Problems with Flow-Based Models
On the Power of Source Screening for Learning Shared Feature Extractors
MixFP4: Extending NVFP4 to Mixed Micro-Format via Scale-Bit Reuse and Tensor Core Co-design
SAD-Flower: Flow Matching for Safe, Admissible, and Dynamically Consistent Planning
Tilt Matching for Scalable Sampling and Fine-Tuning
ReGen: Hierarchical Multi-Prompt Representation Generation for Efficient Waveform Diffusion Models
DSGCR: Decomposed Spectral Geometry-Aware Cross-Modal Semantic Representation for 3D Visual Grounding
Mirror Mean-Field Langevin Dynamics
Great Minds Think Alike: Contextual Tacit Communication for Decentralized LLM-Agent Cooperation
A Spiking Heterogeneous Harmonic Resonate-and-Fire State Space Model for Time Series
Falsifying Sparse Autoencoder Reasoning Features in Language Models
Plasticity Activation via Polar Operator: A Plug-in Method for Balancing Stability and Plasticity
Selling Data as a Digital Good with Scaling Valuations
Spherical Procrustes Alignment for Reliable Medical Audio Diagnosis
Information dynamics and Memory in Neural Networks through Fisher Information Diffusion
Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios
Position: Agentic AI systems should be making Bayes-consistent decisions
On the Coordination of Value-Maximizing Bidders
ASIR: Steganography for Diffusion Models via Antipodal Sampling and Iterative Recovery
Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models
Text Has Curvature
FLIPS: Instance-Fingerprinting for LLMs via Pseudo-random Sequences
Structure-aware Granular-Ball based Information Bottleneck for Multi-modal Clustering
Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces
Towards Atoms of Large Language Models
ProEval: Proactive Failure Discovery and Efficient Performance Estimation for Generative AI Evaluation
scCBGM: Single-Cell Editing via Concept Bottlenecks
AppWorld-UL: Benchmarking Diverse Agent-User Interactions for Tool-Use
Power-Boosted Granger-Causal Discovery for Large Heterogeneous Panel Data
On the Salience of Low-Probability Tokens for AI-Generated Text Detection: A Multiscale Uncertainty Perspective
Bilinear Bandits with Partially Observable Features
Variational Bayesian Flow Network for Graph Generation
DisPPO: Quantile-Based Distributional Reinforcement Learning for Large Language Models
ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns
Learning Credal Ensembles via Distributionally Robust Optimization
Representation Drift Compensation: A Zero-Cost Enhancement for LLM Decomposition
Group-wise Data Ordering: Enhancing Instruction Tuning of Large Language Models via Embedding Proximity
ScenePilot: Controllable Boundary-Driven Critical Scenario Generation for Autonomous Driving
PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios
Adaptive Utilization of Low-Rank Adaptation via Conditioned Gating
OC-space: a Unifying Perspective on Verification of Tree Ensembles
Localize and Neutralize: Gradient-Guided Token Suppression Against Visual Prompt Injection Attack
Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
CSPLoRA: Confidence-Guided Structure Planning for Low-Rank Adaptation
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Video Generation
SPARC: Separating Perception And Reasoning Circuits for Test-time Scaling of VLMs
PADS-TAL: Padding-Annealed Diffusion Sampling in Text-Aware Latent Space for Robust and Diverse Text-to-Music Generation
``Someone Hid It!'': Query-Agnostic Black-Box Attacks on LLM-Based Retrieval
Dynamic Relational Priming Improves Transformer in Multivariate Time Series
Skip-It? Theoretical Conditions for Layer Skipping in Vision–Language Models
$\texttt{FlashSchNet}$: Fast and Accurate Coarse-Grained Neural Network Molecular Dynamics
Think in Latent, Explain in Language: Self-Explainable Latent Reasoning
SynerMedGen: Synergizing Medical Multimodal Understanding with Generation via Task Alignment
OmniDenseCap: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
GTPO and GRPO-S: Token and Sequence-Level Reward Shaping with Policy Entropy
DenseSteer: Steering Small Language Models towards Dense Math Reasoning
On the Expressive Power of GNNs to Solve Linear SDPs
L-CUBE: Isolating Long-Context Capacity from Knowledge with Controllable Mutual Information Scaling
Resolving the Timestep Scaling Paradox in Spiking Neural Networks with a Timestep-Scalable Neuron Model
AutoRAS: Learning Robust Agentic Systems with Primitive Representations
The Geometry of Narrow Fine-Tuning Degradation: Trajectory Lock-in and Spectral Bifurcation
PACE: Proactive Agent-Level Admission Control for Efficient Agentic Batch Inference
OmniVL-Guard: Towards Unified Vision-Language Forgery Detection and Grounding via Balanced RL
Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR
Context Tuning for In-Context Optimization
Towards Generative Graph Matching for Graph Edit Distance Computation
Log-Normal Multiplicative Dynamics for Stable Low-Precision Deep Learning
Code2Worlds: Empowering Coding LLMs for 4D World Generation
Understanding Performance Collapse in Layer-Pruned Large Language Models via Decision Representation Transitions
Two-Layer Linear Auto-Regressive Models Estimate Latent States
Adaptive Protein Tokenization
FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models
RAT+: Train Dense, Infer Sparse - Recurrence Augmented Attention for Dilated Inference
Faults in Our Formal Benchmarking: Dataset Defects and Evaluation Failures in Lean Theorem Proving
Transfer Learning in Nonparametric Regression with Deep ReLU Networks
Quantifying Temperature Scaling in Discrete Sequence (Language) Models
SP-Mind: An Autonomous Reasoning Agent for Spatial Proteomics Analysis
Functional Equivalence in Attention: A Comprehensive Study with Applications to Linear Mode Connectivity
Real-Time and Lightweight Diffusion Image Compression
Monitoring Monitorability
ExSkill: Continual Learning from Experience and Skills in Multimodal Agents
SHAP-Guided Kernel Actor-Critic for Explainable Reinforcement Learning
A Game-Theoretic Framework for Measuring and Explaining Metric Compatibility in Fair Machine Learning
Test-Time Debiasing with Probabilistic Prompts via Wasserstein Distance in Vision-Language Models
Continuous-Time Piecewise-Linear Recurrent Neural Networks
Reconstruction Outcomes Look Similar but Processes Differ: Improving Context Consistency and Coverage in Graph Masked Auto-Encoder
RelaxFlow: Text-Driven Amodal 3D Generation
Learning on Higher-Order Structures with Effective Operators
A Systematic Study of Behavioral Cloning for Scientific Data Annotation
Graph Alignment for Benchmarking Graph Neural Networks and Learning Positional Encodings
The (Marginal) Value of a Search Ad: An Online Causal Framework for Repeated Second-price Auctions
Alignment-Guided Score Matching for Text-to-Image Alignment in Diffusion Models
Reasoning as an Attack Surface: Adaptive Evolutionary CoT Jailbreaks for LLMs
Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation
Structured 4D Latent World Model for Robot Planning
Cardinality-Invariant Neural Operator Policies for Scalable PDE Control
Position: Irresponsible AI: big tech’s influence on AI research and associated impacts
Lagrangian Perturbation Diffusion Steering: Latent Reinforcement Learning for Generative Policies
NavOL: Navigation Policy with Online Imitation Learning
Chebyshev Policies and the Mountain Car Problem: Reinforcement Learning for Low-dimensional Control Tasks
DecoVer: A Decompose-and-Verify Neuro-Symbolic Framework for Embodied Task Planning with BC+
AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation
Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention
Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis
GP2F: Cross-Domain Graph Prompting with Adaptive Fusion of Pre-trained Graph Neural Networks
FedSSM: State Space Model-based Proactive Inference for Heterogeneous Multimodal Federated Learning
Procedural Generation Of Algorithm Discovery Tasks in Machine Learning
DNACHUNKER: Learnable Tokenization for DNA Language Models
TACTIC: Task-Aware Sparse Coordination Graphs for Multi-Task Multi-agent Reinforcement Learning
Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models
Generative Adaptation of Dynamics to Environmental Shifts via Weight-space Diffusion
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
FrontierCS: Evolving Challenges for Evolving Intelligence
Beyond Additive Decompositions: Interpretability Through Separability
Shapley Neuron Values for Continual Learning: Which Neurons Matter Most?
Learning Generalized Trackers with Elastic Token Budgets
Learning the Minimum Action Distance
LitReview Arena: Evaluating Literature Review Agents with Battle-style Peer Review Platform
Characterizing Agents in Production
Transitive Representation Learning Enhances Histopathology Annotation
Incremental BPE Tokenization
Learning Graph Foundation Models on Riemannian Graph-of-Graphs
Proximal-Based Generative Modeling for Bayesian Inverse Problems
PuzzleMoE: Efficient Compression of Large Mixture-of-Experts Models via Sparse Expert Merging and Bit-packed inference
How Good is Post-Hoc Watermarking With Language Model Rephrasing?
Preference-Modulated Structural Attention for Multi-Objective Combinatorial Optimization
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
On the "Induction Bias" in Sequence Models
Revisiting the Bertrand Paradox via Equilibrium Analysis of No-regret Learners
Embodied-DETR: End-to-End Temporal 3D Object Detection in Egocentric Views
When Replanning Becomes the Bottleneck: Budgeted Replanning for Embodied Agents
Foundation Inference Models for Ordinary Differential Equations
TabularBERT: Binning-Based Self-Supervised Learning for Tabular Representation
Learning Discrete Diffusion on Graphs via Free-Energy Gradient Flows
DF-LoGiT: Data-Free Logic-Gated Backdoor Attacks in Vision Transformers
PersistBench: When Should Long-Term Memories Be Forgotten by LLMs?
UniCoD: Enhancing Robot Policy via Unified Continuous and Discrete Representation Learning
Mechanistic Interpretability as Statistical Estimation: A Variance Analysis
OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data
SimGFM: Simplifying Discrete Flow Matching for Graph Generation
Orchestrating Spatial Semantics via a Zone-Graph Paradigm for Intricate Indoor Scene Generation
Online Continual Learning with Dynamic Label Hierarchies
When Iteration Helps and Hurts in Self-Training: Denoising vs. Signal Forgetting
SwitchCraft: Programmatic Design of State-Switching Proteins
Look on Demand: A Cognitive Scheduling Framework for Visual Evidence Acquisition in Multimodal Reasoning
DR-MMSearchAgent: Deepening Reasoning in Multimodal Search Agents
ANCHOR: Automated Alignment Auditing for CLI Agents on Real-World Harm
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
Revealing Scaling Behavior in Large-scale Time Series Models: Implications for More Efficient and Accurate Forecasting
RaBiT: Residual Aware Binarization Training for Accurate and Efficient LLMs
Unveiling Prior-data Fitted Networks on Causal Effect Estimation: Pre-training or Finetuning?
ECSEL: Explainable Classification via Signomial Equation Learning
On Computation and Reinforcement Learning
DLO-Lab: Benchmarking Deformable Linear Object Manipulations with Differentiable Physics
Efficient Multi-modal Dataset Distillation via Analytic Parameter Matching
Think Less, Act Early: Reinforced Latent Reasoning with Early Exit in Vision-Language-Action Models
Plan in Sandbox, Navigate in Open Worlds: Learning Physics-Grounded Abstracted Experience for Embodied Navigation
Dynamic Decision Learning: Test-Time Evolution for Abnormality Grounding in Rare Diseases
Sparser, Faster, Lighter Transformer Language Models
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents
CellBRIDGE: Learning Cellular Trajectories via Interaction-Aware Alignment
Decoding Safety Feedback from Diverse Raters: A Data-driven Lens on Responsiveness to Severity
Hearing Without Noticing? Attention-Aware Stealthy Black-box Adversarial Audio Attacks
Deep Reinforcement Learning Finds Bayes-Nash Equilibrium in Competitive Newsvendor Problems
UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
A Unified Density Operator View of Flow Control and Merging
PromptRL: Prompt Matters in RL for Flow-Based Image Generation
On the Optimization Trajectory of DeepWalk Embeddings
Adaptive Querying with AI Persona Priors
Curriculum-Guided Layer Scaling for Language Model Pretraining
Beyond Policy Training: Recursive Solution Search from Unannotated Videos
Inference-Time Forward-Process Alignment in Diffusion Models
MM-Spectrum: Multimodal Multi-spectral Molecular Structural Elucidation with a Stable MoE Framework
Outrunning LLM Cutoffs: A Live Kernel Crash Resolution Benchmark for All
Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy
Asymptotically Optimal Sequential Testing with Markovian Data
High-accuracy and dimension-free sampling with diffusions
Discovering Differences in Strategic Behavior between Humans and LLMs
FairGB: A Fair Granular-Ball Generation Method for Data Classification
Beyond Logits: Coherent Hallucination Mitigation via Attention Contrastive Decoding
Feature Bagging Provides Stability
Fully Dynamic Coreset Spectral Clustering
BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate
Box Thirding: Anytime Best Arm Identification under Insufficient Sampling
Addressing Semantic Blind Spots in Text-to-SQL via Component Pre-generation and AST Matching Rewards
SHARP-Q: Spectral Hessian Alignment and Rectification for Post-training Quantization
Enhancing Affine Maximizer Auctions with Correlation-Aware Payment
SONAR: Spectral‑Contrastive Audio Residuals for Generalizable Deepfake Detection
HelioX: A GPU-Native Framework for Simulation and Training of Biophysically Detailed Networks
BubbleSpec: Turning Long-Tail Bubbles into Speculative Rollout Drafts for Synchronous Reinforcement Learning
SmartThinker: Progressive Chain-of-Thought Length Calibration for Efficient Large Language Model Reasoning
Generalization and Scaling Laws for Mixture-of-ExpertsTransformers
SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling
Functional building blocks of neural networks: from network motifs to collective dynamics
Debate with Images: Detecting Deceptive Behaviors in Multimodal Large Language Models
From Prompts to Responses: Dual-Sided Data Leakage and Defense in Split Large Language Models
Scalable Bayesian Inference for Nonlinear Conservation Laws
FedHPro: Federated Hyper-Prototype Learning via Gradient Matching
Differentiable Conformal Training for LLM Reasoning Factuality
Topology-Aware Contrastive Learning: Regulating Representation Connectivity via Persistent Homology
Lookahead Path Likelihood Optimization for Diffusion LLMs
Mitigating the Safety–Utility Trade-off in LLM Alignment via Adaptive Safe Context Learning
Unsupervised Camouflaged Object Detection with Dual-Eigenvector Spectral Pseudo-Labeling and Contrastive Refinement
Persona-Pruner: Sculpting Lightweight Models for Role-Playing
Language-based Trial and Error Falls Behind in the Era of Experience
Geometry-based Schrödinger Bridges for Trustworthy Multimodal Fusion
AdaMeZO: Adam-style Zeroth-Order Optimizer for LLM Fine-tuning Without Maintaining the Moments
TRACER: Persistent Regularization for Robust Multimodal Finetuning
Coupled Cluster con MoLe: Molecular Orbital Learning for Neural Wavefunctions
Length Generalization Bounds for Transformers
On Multi-Step Theorem Prediction via Non-Parametric Structural Priors
Optimal Domain-Aware Privacy Mechanisms for Synthetic Data Generation
SS‑TPT: Stability and Suitability-Guided Test-Time Prompt Tuning for Adversarially Robust Vision-Language Models
Global Convergence of Adaptive Sensing for Principal Eigenvector Estimation
Active Reasoning Vision-Language Model via Sequential Experimental Design
In-Context Generation with Regional Constraints for Instructional Video Editing
JANUS-LORA: A Balanced Low-Rank Adaptation for Continual Learning
Do Transformers Need Three Projections? Systematic Study of QKV Variants
SafeSeek: Universal Attribution of Safety Circuits in Language Models
SPAR: Support-Preserving Action Rectification
Estimation of Treatment Effects Under Nonstationarity via the Truncated Policy Gradient Estimator
CUARewardBench: Benchmark for Evaluating Reward Models on Computer-using Agent Trajectories
xKV: Cross-Layer KV-Cache Compression via Aligned Singular Vector Extraction
Self-Refining Video Sampling
Multi-Agent Teams Hold Experts Back
Perceptual Flow Network for Visually Grounded Reasoning
CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Learning
Artemis: Structured Visual Reasoning for Perception Policy Learning
Beyond Benchmarks: Toward Causally Faithful Evaluation of Large Language Models
RubricRobustness: A Simple Framework for Evaluating the Robustness of Rubrics-Based Benchmarks
Multimarginal flow matching with optimal transport potentials
TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance
T-POP: Test-Time Personalization with Online Preference Feedback
Dual Optimal Transport for Multi-Concept Composition: Structural Alignment and Texture Injection in Diffusion Models
Bridging Spherical Black-Box Optimizers
Safety Anchor: Defending Harmful Fine-tuning via Geometric Bottlenecks
The Geometric Mechanics of Contrastive Representation Learning: Alignment Potentials, Entropic Dispersion, and Cross-Modal Divergence
Neural Concept Verifier: Scaling Prover-Verifier Games via Concept Encodings
Resolving Blind Inverse Problems under Dynamic Range Compression via Structured Forward Operator Modeling
Scalable Traffic Signal Control with Shared Policy Framework
Beyond Fixed Biases: Decoding the Role of Reasoning Uncertainty in MLLM Modality Conflicts
Compositional Perception and Generalizing Induction: Latent Compositional Manifold Assumption on Generalized Category Discovery
PAWS: Preference Learning with Advantage-Weighted Segments
Test-Time Reinforcement Learning for Flow Matching
Position: Multiplicity is an Inevitable and Inherent Challenge in Multimodal Learning
Hölder++: Improving Quality-Coherence Trade-off in Multimodal VAEs
nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding
Regret-Based Federated Causal Discovery with Unknown Interventions
Toward Robust Multilingual Adaptation of LLMs for Low-Resource Languages
AvAtar: Learning to Align via Active Optimal Transport
ReCoG: Relational and Compact Context Graph Learning for Few-shot Molecular Property Prediction
SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes
Context-free Recognition with Transformers
Resource-Efficient Reinforcement for Reasoning Large Language Models via Dynamic One-Shot Policy Refinement
Search for Truth from Reasoning: A Dynamic Representation Editing Framework for Steering LLM Trajectories
Query-efficient model evaluation using cached responses
STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems
Normalizing Diffusion Kernels with Optimal Transport
Fine-Tune Once, Reuse Across Models: Bayesian Task-Update Factors and Approximations
Automata-Conditioned Cooperative Multi-Agent Reinforcement Learning
Conditional Clifford-Steerable CNNs for PDE Modeling
Kronecker Generative Networks: A General Neural Architecture for Parameter-Efficient Learning Across Classification Tasks
HONet: Data-Efficient Learning for Exact Cover Problems via Hypergraph Optimization
Optimal Regularization for Performative Learning
Autoregression with Self-Token Prediction
PartCo: Part-Level Correspondence Priors Enhance Category Discovery
Distribution Alignment for One-Shot Federated Learning via Optimal Transport
MoLF: Mixture-of-Latent-Flow for Pan-Cancer Spatial Gene Expression Prediction from Histology
RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning
CONTEXTOR: Contextualized High-order Contrastive Learning
CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning
How Should Transformers Represent Numeric Values in Electronic Health Records?
BioDynaSpec: Harmonic-Guided Spatio-Spectral Autoregressive Diffusion for Protein Dynamics Generation
Convergent World Representations and Divergent Tasks
TaskLoom: Weaving Knowledge Across Tasks in World Models
On the Generalization Gap in Self-Evolving Language Model Reasoning
Towards Generalizable EEG-to-fMRI Synthesis via a Unified, Context-Aware Prompting Framework
GOTabPFN: From Feature Ordering to Compact Tokenization for Tabular Foundation Models on High-Dimensional Data
Neuro-Symbolic AI for Analytical Solutions of Differential Equations
RAG without Forgetting: Continual Query-Infused Key Memory
ReaForest: Fostering Generative Video Reasoning for Spatial Planning
Toward Calibrated Mixture-of-Experts Under Distribution Shift
Conformal Policy Control
Omni-Perception Policy Optimization for Multimodal Emotion Reasoning
DDGA: Dirichlet Distributional Gradient Aggregation for Transferable Vision-Language Adversarial Attacks
TadABench-1M: A Large-Scale Wet-Lab Protein Benchmark For Rigorous OOD Evaluation
Universality, Function Composition, and Algorithm Emulation All In-Context
NAVIGATE: Evaluating Visual-Guided Search Decision-Making on the Open Web
The Quality-Utility Paradox: Why High-Reward Data Impairs Small Model Reasoning
One-step Optimal Transport via Regularized Distribution Matching Distillation
Solver-in-the-Loop: MDP-Based Benchmarks for Self-Correction and Behavioral Rationality in Operations Research
Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information
ImpText: A Benchmark and Tool-Augmented Framework for Implicit Text Reasoning
Mixing Expertise with Confidence: A Mixture of Expert Framework for Robust Multi-Modal Continual Learner
Test-Time Learning of Causal Structure from Interventional Data
Transport and Merge: Cross-Architecture Merging for Large Language Models
WildActor: Unconstrained Identity-Preserving Video Generation
Do LLMs Signal When They’re Right? Evidence from Neuron Agreement
Action-Sufficient Goal Representations
Combinatorial Sparse PCA Beyond the Spiked Identity Model
SFCLTA: Spectral Fusion Contrastive Learning with Topology-Adaptive Graph Augmentation
Fully Zero-Shot Image Dehazing
Explainable Forensics of Manipulated Segments in Untrimmed Long Videos
Reasoning-VLA: An Efficient and Spatial-Guided General Vision-Language-Action Reasoning Model for Autonomous Driving
MutAtlas: A PDB-Wide Energy-Guided Atlas of Protein Mutation Effects
Distributional Inverse Reinforcement Learning
Long Grounded Thoughts: Synthesizing Grounded Visual Problems and Distilling Reasoning Chains at Scale
DRIVE: Distributional and Retrieval-Augmented Bidding with Value Evaluation
Who Transfers Safety? Identifying and Targeting Cross-Lingual Shared Safety Neurons
AD-BTS: Adaptive Dual-Branch Token Sparsification via Spatial Information Density
GEPC: Group-Equivariant Posterior Consistency for Out-of-Distribution Detection in Diffusion Models
Regularized Discriminative Alignment for Deep Representations under Label Shift
Scientific logicality enriched methodology for LLM reasoning: A practice in physics
From Pairwise Affinities to Functional Correspondences: Rethinking Attention
ReAugment: Targeted Few-Shot Time Series Augmentation via Model Zoo-Guided Reinforcement Learning
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models
Online Linear Programming for Multi-Objective Routing in LLM Serving
Inference Time Concept Removal Guidance for Text-to-Image Diffusion Models
It's TIME: Towards the Next Generation of Time Series Forecasting Benchmarks
daVinci-Dev: Agent-native Mid-training for Software Engineering
Action Manifold Smoothing: A Lipschitz Pathway Perspective on High-Dimensional Reinforcement Learning
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning
Overthinking: Amplifying Reasoning Weights to Extract Learned Secrets
A Short and Unified Convergence Analysis of the SAG, SAGA, and IAG Algorithms
AICrypto: Evaluating Cryptography Capabilities of Large Language Models
On the Generalization in Topology Optimization via Sensitivity-Conditioned Bernoulli Flow Matching
Salus: Strategic Diagnostic Testing for Complex Diagnosis via Multi-Agent Reinforcement Learning
Less Token, More Signal: MoE Expert Pruning via Critical Token Selection
DRIVE: Best Data Scheduling Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation
PerceptOS: Semantic-Aware Kernel Optimization for OS-Intensive Workloads via Hardware-Software Alignment
Amortized Variational Inference for Partial-Label Learning: A Probabilistic Approach to Label Disambiguation
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
MultiLoReFT: Decoupling Shared and Modality-Specific Subspaces in Multimodal Learning via Low-Rank Representation Fine-Tuning
Differentially Private Geodesic Regression
The Geometry of Sequential Learning: Lie-Bracket Prediction of Transfer Order
DiffCrossGait: Trajectory-Level Alignment for 2D-3D Cross-Modal Gait Recognition via Latent Diffusion
Let the Prototype Guide You: Robust Aggregation of Sparse Multi-Class Annotations via Annotator Prototype Learning
Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders
RGGT: A Generative-Prior-Guided Transformer for Unified Rigid and Non-Rigid Point Cloud Registration
How2Everything: Mining the Web for How-to Procedures to Evaluate and Improve LLMs
PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs
Multimodal Fusion via Self-Consistent Task-Gradient Fields
Position: Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences
TopAdapter: Topology-Aware Prompt Tuning for Efficient Point Cloud Understanding
Mind Dreamer: Untethering Imagination via Active Counterfactual Reasoning on Latent Manifolds
Constrained Flow Optimization via Sequential Fine-Tuning for Molecular Design
B-Spar: Bayesian Sparse-Reward Modeling for RL-based Image Editing
Spectral-Informed Neural Networks Outperform Spectral methods in High-dimensional PDEs
Task-and-Model-Aware Fractal-Consistency for Efficient LLM Reasoning
When Drafts Evolve: Speculative Decoding Meets Online Learning
Dimensional Collapse in Transformer Attention Outputs: A Challenge for Sparse Dictionary Learning
Who Said Neural Networks Aren't Linear?
AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding
Intentional Updates for Streaming Reinforcement Learning
Token Sample Complexity of Attention
Tracing the Dynamics of Refusal: Exploiting Latent Refusal Trajectories for Robust Jailbreak Detection
Generalized Schrödinger Bridge on Graphs
One-Step Graph-Structured Neural Flows for Irregular Multivariate Time Series Classification
Q-CLIP: Unleashing the Power of Vision-Language Models for Video Quality Assessment through Unified Cross-Modal Adaptation
Agentic Proposing: Enhancing Large language Model Reasoning via Compositional Skill Synthesis
Towards Resource-Efficient LLMs: End-to-End Energy Accounting of Distillation Pipelines
Accelerating Q-learning through Efficient Value-sharing across Actions
BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics
Learning Reward Functions from Multiple Feedback Types with Amortized Variational Inference
LAPRAS : Learning-Augmented PRivate Answering for linear query Streams.
Decentralized Bandits without Global Clock for Dynamic Matching Market
IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models
Large-capacity and Receiver Authenticable Generative Image Steganography
Discrete Diffusion Samplers and Bridges: Off-Policy Algorithms and Applications in Latent Spaces
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
Black-Box Combinatorial Optimization with Order-Invariant Reinforcement Learning
Provably Data-driven Multiple Hyper-parameter Tuning with Structured Loss Function
MultiBreak: A Scalable and Diverse Multi-turn Jailbreak Benchmark for Evaluating LLM Safety
From Representation to Action: A Unified Laplacian Framework for Spatial Representation and Path Planning
Bridging the Gap in Autonomous Science: The Corpus and Benchmark for Biological Protocol Reasoning
PICACO: Pluralistic In-Context Value Alignment via Total Correlation Optimization
Pose-ICL: 3D-Aware In-Context Learning for Pose-Controllable Subject Customization
SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis
NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search
Correct looks better: Pairwise comparisons reveal accuracy rankings
Semantic-level Backdoor Attack against Text-to-Image Diffusion Models
Learning Gaussian Graphical Models from a Glauber Trajectory Without Mixing
Selective Coupling of Decoupled Informative Regions: Masked Attention Alignment for Data-Free Quantization of Vision Transformers
A Language-Guided Bayesian Optimization for Efficient LoRA Hyperparameter Search
From Noise to Intent: Anchoring Generative VLA Policies with Residual Bridges
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention
Effective Model Pruning : Measuring the Redundancy of Model Components
Complexity bounds for Dirichlet process slice samplers
Server-Proximal Aggregation for Federated Domain-Incremental Learning under Partial Participation: Task-Uniform Convergence and Backward Transfer
Geometric Reciprocity: Unlocking Self-Supervision for Stereoscopic Video Generation
Teaching Molecular Dynamics to a Non-Autoregressive Ionic Transport Predictor
Trust Functions: Near Lossless Weak-to-Strong Generalization by Learning to Trust the Weak Teacher
Adaptive Momentum and Nonlinear Damping for Neural Network Training
Dynamic Regret via Discounted-to-Dynamic Reduction with Applications to Curved Losses and Adam Optimizer
Efficient Training of Boltzmann Generators Using Off-Policy Log-Dispersion Regularization
ScalingAR: Scaling Confidence for Autoregressive Image Generation
NaviCache: Test-Time Self-Calibration Caching for Video Generation
Large-Scale Molecular Dynamics Simulations: Direct Interatomic Modeling with Dilated Message Passing
Chain-of-Glimpse: Search-Guided Progressive Object-Grounded Reasoning for Video Understanding
Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models
The Crowded Embedding Space: A Mean-Field Mechanism for Emergent Marginalization in Retrieval-Augmented Agents
Mitigating Per-Sample Harm in Stochastic Optimization
Uncertainty-Guided Exploration and Stable Planning for Sparse-Reward Manipulation from Limited Demonstrations
G-RANS: Generalizable Residual-Aware Neural Solvers for Sparse Systems
Byte Pair Encoding for Efficient Time Series Forecasting
Random Erasing vs. Model Inversion: A Promising Defense or a False Hope?
TranX-Adapter: Bridging Artifacts and Semantics within MLLMs for Robust AI-generated Image Detection
Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation
Label-Guided Representation Learning for Incomplete Multi-View Multi-Label Classification
Convergence of Two-Timescale Stochastic Approximation with Markovian Samples and Applications in Reinforcement Learning
MMPD-Bench: Bridging Multimodal Fission with Multi-Polarimetric Modalities Decomposition
Uncertainty-Constrained Trustworthiness for Graph Learning
How Does the Lagrangian Guide Safe Reinforcement Learning through Diffusion Models?
EvoMAS: Heuristics in the Loop—Evolving Smarter Agentic Workflows
IDRBench: Understanding the Capability of Large Language Models on Interdisciplinary Research
What Reward Structure Enables Efficient Sparse-Reward RL? A Proof-of-Concept with Policy-Aware Matrix Completion
Physiology as Language: Translating Nocturnal Breathing to EEG
Secure Multi-agent Reinforcement Learning for Service Systems with Affinity and Byzantine Nodes: Stability Analysis and Protection Design
PretrainZero: Reinforcement Active Pretraining
Causal Matrix Completion under Multiple Treatments via Mixed Synthetic Nearest Neighbors
EEG-FM-Bench: A Comprehensive Benchmark for the Systematic Evaluation and Diagnostic Analyses of EEG Foundation Models
RELO: Reinforcement Learning to Localize for Visual Object Tracking
Biased Generalization in Diffusion Models
How can we assess human-agent interactions? Case studies in software agent design
Layer-Centric Factors of Variation Disentanglement for Task- and Model-Agnostic Generalization
From Extraction to Deduction: Resolving Functional Misalignment in RAG via a Collaborative Critic-Reasoner Framework
FAB: A First-Order AB-based Gradient Algorithm for Distributed Bilevel Optimization over Time-Varying Directed Graphs
Revealing Differences in Multi-Modal Embeddings via Constrained Kernel Analysis
GASS: Geometry-Aware Spherical Sampling for Disentangled Diversity Enhancement in Text-to-Image Generation
Optimal Transport for Reward Modeling from Noisy Feedback
ALAS: Additive Learnable Alpha-Stable Kernels for Flexible Bayesian Optimization
Contrastive Representation Regularization for Vision-Language-Action Models
Prescriptive Scaling Reveals the Evolution of Language Model Capabilities
Sample Margin-Aware Recalibration of Temperature Scaling
Vision-Language-Action Pretraining from Large-Scale Human Videos
Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in Large Language Models
The Unlearnability Phenomenon in RLVR for Language Models
A Formal Comparison Between Chain of Thought and Latent Thought
A Kinetic-Energy Perspective of Flow Matching
Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing
Topological Active Inference for Task Disambiguation
Contrastive Flow Map Matching
From Muon to Gluon: Bridging Theory and Practice of LMO-based Optimizers for LLMs
Decoupling Universal Laws and Environmental Heterogeneity: A Physics-Inspired Framework for Robust Spatio-Temporal Forecasting
Vibe Checker: Aligning Code Evaluation with Human Preference
ADHD Disease Detection Based on Short- and Long-Term Brain Function Encoding and Memory Graph Network
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
V-ABS: Action-Observer Driven Beam Search for Dynamic Visual Reasoning
Neural Dispersion on Graphs
Sem-Detect: Semantic Level Detection of AI Generated Peer-Reviews
Courtroom Analogy: New Perspective on Uncertainty-Aware Classification
Off-Policy Evaluation with Strategic Agents via Local Disclosure
HGMem: Hypergraph-based Working Memory to Improve Multi-step RAG for Long-Context Complex Relational Modeling
Self-Supervised Dynamical System Representations for Physiological Time-Series
How to guide your flow: Steering flow maps for rapid test-time alignment
Shapley Regularized Neural Granger Causality
Remove the Ambiguity: Few-shot Multimodal Anomaly Detection Using Crossmodal Feature Replacers
Beyond Soft Labels: Unifying Dataset Pruning and Distillation for Efficient Large-scale Compression
Flow Sampling : Learning to Sample from Unnormalized Densities via Denoising Conditional Processes
LECTOR: Joint Learning of Scientific Reasoning Graphs and Introduction Generation
Bio-Inspired Self-Supervised Learning for Wrist-worn IMU Signals
Multimodal Meta-Verifier with Explicit Structured Recalibration
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?
MRPO: Magnitude-Regularized Policy Optimization via L1 Constraints
ForceForget: Reinforcement Concept Removal for Enhancing Safety in Text-to-Image Models
GOCM: Single-Step Graph Outlier Synthesis via Origin Consistency Model
Sparse Autoencoders are Topic Models
Hyperbolic Multimodal Continual Learning
CSG: Cognitive Structure Generation for Intelligent Education
Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
HypoSpace: A Diagnostic Benchmark for Set-Valued Hypothesis Generation under Underdetermination and Sublinear Coverage Bounds
Position: RL Researchers Need to Distinguish Between Solving Simulators and Using Simulators as a Proxy
Efficient DP-SGD for LLMs with Randomized Clipping
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding
iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance
SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models
ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models
Multi-scale Explainer for Graph Neural Networks
QTALE: Quantization-Robust Token-Adaptive Layer Execution for LLMs
Test-Time Detoxification without Training or Learning Anything
Position: Creating High-Fidelity Synthetic Training Data Should Employ Multi-level Optimization
Position: Deciphering the Functions of DNAs, RNAs, and Proteins Should Consider Multi-Modal Large Language Models
Token-Sparse Medical Multimodal Reasoning via Dual-Stream Reinforcement Learning
CLEAR: Context-Aware Learning with End-to-End Mask-Free Inference for Adaptive Subtitle Removal
TPV: Parameter Perturbations Through the Lens of Test Prediction Variance
Sobolev Regularized Score Difference Estimation in Diffusion Models
RSTR: Reducing SpatioTemporal Redundancy in Diffusion Transformers
The Hippocampal Place Field Gradient: A Bio-inspired Framework Building Multiscale Representation for Better Sample Efficiency
HybridFlow: Resource-Adaptive Subtask Routing for Efficient Edge-Cloud LLM Inference
Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
Contribution Weights: A Geometrical Analysis of Self-Attention Transformers
Hierarchical Successor Representation for Robust Transfer
SCRWKV: Ultra-Compact Structure-Calibrated Vision-RWKV for Topological Crack Segmentation
3ViewSense: Spatial and Mental Perspective Reasoning from Orthographic Views in Vision-Language Models
Learning Coupled Continuous-Time Latent Dynamics from Irregular Events
Learning Partial Concept Classes and Universal Rates Under Massart Noise
High-Probability Convergence Guarantees of Decentralized SGD
Generative Online Reinforcement Learning
MTNL: A Unified Modeling Perspective for Enhancing Tensor Network Learning
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
Don't Reinvent the Wheel, Just Realign the Spokes: Resource-Efficient Federated Fine-Tuning via Rank-Wise Expert Assembly
Sinkhorn Treatment Effects
Z-Erase: Enabling Concept Erasure in Single Stream Diffusion Transformers
Dispersion Loss Counteracts Embedding Condensation and Improves Generalization in Small Language Models
Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization
State-Dependent Safety Failures in Multi-Turn Language Model Interaction
Quantifying the Effect of Noise in Language Generation
Self-Soupervision: Cooking Model Soups without Labels
CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
Low-cost Full Fine-tuning: Learning What to Update for LLMs
Resting Neurons, Active Insights: Robustify Activation Sparsity for Large Language Models
Deep Discriminative Structure Proxy Hashing for Cross-modal Retrieval
Transformers Efficiently Perform In-Context Logistic Regression via Normalized Gradient Descent
CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training
Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs
Near-Optimal Dynamic Matching via Coarsening with Application to Heart Transplantation
Unveiling And Addressing Dimensional Collapse In Vector Quantization Models Via Codebook Regularization
Dynamics Within Latent Chain-of-Thought: An Empirical Study of Causal Structure
Differentially Private and Scalable Estimation of the Network Principal Component
Unsupervised Diffusion for Combinatorial Optimization via Adjoint Matching
Code2Video: A Code-centric Paradigm for Educational Video Creation
Transporting Task Vectors across Different Architectures without Training
GEMQ: Global Expert-Level Mixed-Precision Quantization for MoE LLMs
Decomposition-Based Modular Conformal Prediction for Two-Stage Modeling
ProMiSE: Protein Multi-state Structure Evaluation Benchmark in Biological Contexts
FlexiFlow: decomposable flow matching for generation of flexible molecular ensemble
DLLMQuant: A Post-Training Quantization Framework Tailored for Diffusion-Based Large Language Models
Predicting Dynamic Stability Landscapes in Synchronization Networks
Pseudo-Mallows for Efficient Probabilistic Preference Learning
Investigating Component Contributions in Multi-Agent ML Systems
Learning Permutation from Structure Without Supervision
RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models
Overcoming PINNs Failure Modes In High Dimension With Low-Rank Fourier Sum
GameVerse: Can Vision-Language Models Learn from Video-based Reflection?
RobuQ: Pushing DiTs to W1.58A2 via Robust Activation Quantization
Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments
A Conflict-aware Evidential Framework for Reliable Sleep Stage Classification
Real-World Unsupervised Models Generalize to Predict Brain Responses to Out-of-Distribution Stimuli
TreePO: Enhancing Policy Efficacy and Inference Efficiency with Tree Modeling
Long-Context Modeling with Dynamic Hierarchical Sparse Attention for Memory-Constrained LLM Inference
Schema-Guided World Modeling for Understanding Hierarchical Visual Dynamics
MC-HNN: Learning Latent Structural Semantics and High-Rank Representations for Hypergraph Neural Networks
APE-Bench: Evaluating Automated Proof Engineering for Formal Math Libraries
Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance
When Does Adaptation Win? Scaling Laws for Meta-Learning in Quantum Control
Attention Hijacking: Backdooring Text Dataset Distillation via Semantic Anchors
Learning-Guided Integration Contours Construction for Fast Large-Scale Generalized Eigensolvers
ArborKV: Structure-Aware KV Cache Management for Scaling Tree-based LLM Reasoning
CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering
Floating-Point Networks with Automatic Differentiation Can Represent Almost All Floating-Point Functions and Their Gradients
Inference Time Optimization with Confidence Dynamics
MedCRP-CL: Continual Medical Image Segmentation via Bayesian Nonparametric Semantic Modality Discovery
Are We Overconfident in Models and Results for Semi-Supervised 3D Medical Image Segmentation?
DevEvol: Benchmarking LLM Agents on Continuous Software Evolution
Generalized Correctness Models: Learning Calibrated and Cross-Model Correctness Predictors from Historical Patterns
RADAR: Redundancy-Aware Diffusion for Multi-Agent Communication Structure Generation
AdaSCALE: Adaptive Scaling for OOD Detection
Rethinking generative image pretraining: How far are we from scaling up next-pixel prediction?
E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory
(1D) Ordered Tokens Enable Efficient Test-Time Search
LAST: Bridging Vision-Language and Action Manifolds via Gromov-Wasserstein Alignment
Revisiting Padded Transformer Expressivity: Which Architectural Choices Matter and Which Don't
Active Curriculum Refinement for Reinforcement Learning
Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning
Rational Neural Networks have Expressivity Advantages
Q-SAM: Unlocking Sharpness-Aware Minimization for Generalization in Offline Reinforcement Learning
Dissecting Post-Training: Uncovering the Complementary Roles of SFT and RL for Document Parsing
Unsupervised Neural Langevin Sampler for Mixed Integer Linear Programming
The Surprising Difficulty of Search in Model-Based Reinforcement Learning
GFMate: Empowering Graph Foundation Models with Pre-training-agnostic Test-time Prompt Tuning
Generalized Boundary FDR Control under Arbitrary Dependence: An Approach on Closure Principle
The Stability of Singular Distribution: A Spectral Perspective on the Two-Phase Dynamics of Language Model Pre-training
Conflict-Aware Adaptive Alignment for LLM Hallucination Mitigation
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
NeurIPS: Neuro-anatomical Inductive Priors for Sphere-based Brain Decoding
STLA: Spatiotemporal Lookahead Alignment for Post-Training Quantization
Efficient Skill Grounding via Code Refactoring with Small Language Models
On Structured State-Space Duality
Position: Virtual Cells Need Context, Not Just Scale
$\texttt{IDEAS}$: Interpretability Driven Evolutionary Approach for the Design of Biological Sequences
Fractional is Better: Learnable Derivative Orders in Neural Operator Learning
Learning Disentangled Multi-Agent World Model for Decentralized Control
Efficient Code Analysis via Graph-Guided Large Language Models
LABO: LLM-Accelerated Bayesian Optimization through Broad Exploration and Selective Experimentation
Temper-Then-Tilt: Principled Unlearning for Generative Models through Tempering and Classifier Guidance
FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
Doubly Regularized Markov Decision Processes for Robust Reinforcement Learning
Modeling Long-Tail Relations in the Operating Room via In-Context Multimodal Learning
REAL: Resolving Knowledge Conflicts in Knowledge-Intensive Visual Question Answering via Reasoning-Pivot Alignment
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
Model-Free Robust Average-Reward Reinforcement Learning with Sample Complexity Analysis
Bridging Time and Frequency: A Joint Modeling Framework for Irregular Multivariate Time Series Forecasting
Multi-Way Representation Alignment
Enhancing Protein-Protein Interaction Prediction with Hierarchical Motif-based Multimodal Protein Embedding
Parallel Stochastic Gradient-Based Planning for World Models
Origo: Physically Interpretable Multi-Physics PDE Pre-training through Neural Operator Splitting
Learning Latent Action World Models In The Wild
Optimal Learning from Label Proportions with General Loss Functions
AtelierEval: Agentic Evaluation of Humans & LLMs as Text-to-Image Prompters
Decoupled Low-Rank Adaptation for Robust Federated Fine-Tuning
Inverting Data Transformations via Diffusion Sampling
CentaurEval: Benchmarking Human-in-the-Loop Value in Agentic Coding
Interpreting Physics in Video World Models
Convergence Rate Analysis of the AdamW-Style Shampoo: Unifying One-sided and Two-Sided Preconditioning
Cross-Embodiment Robot Foundation World Models with Latent Actions
Terminal Dimension Reduction for Time Series with Applications
Hista and Numca: Estimate State Value Effectively for Large Language Model Reinforcement Learning
Towards Unified Multimodal Pretraining
Scalable and Stable Estimation of Amari $\alpha$-Divergence using Random Fourier Features
Native Active Perception as Reasoning for Omni-Modal Understanding
CofactGVR: Counterfactual Intervention for Grounded Visual Reasoning
Rank-guided Diffusion for Noise Few-Shot Learning
Unsupervised Disentanglement Without Compromises : How Functional Orthogonality Enforces Identifiability
Persistent Backdoor Attacks in Class-Incremental Learning via Structural Invariant Anchoring
Event2Vec: Processing neuromorphic events directly by representations in vector space
DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection
Gram2Token: Enabling Run-time GPU-Native Grammar-Constrained Decoding for LLMs
Towards Seed-Robust Safety Alignment in Text-to-Image Models
Demystifying the Optimal Fair Classifier in Multi-Class Classification
MESA: Improving MoE Safety Alignment via Decentralized Expertise
Ski Rental with Distributional Predictions of Unknown Quality
TurboGS: Accelerating 3D Gaussian Splatting via Error-Guided Sparse Pixel Sampling and Optimization
The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting
Variance Driven Exploration: A Provable and Efficient Methodology for Pure Exploration in Highly Stochastic Environments
SKETCH: Semantic Key-Point Conditioning for Long-Horizon Vessel Trajectory Prediction
Efficient Stochastic Optimisation via Sequential Monte Carlo
Hyperspectral Image Fusion with Spectral-Band and Fusion-Scale Agnosticism
Understanding SAM through Minimax Perspective
EvoCF: Multi-Agent Collaboration via Agentic Memory-Driven Evolutionary Counterfactual Planning
DTop-p MoE: Sparsity-Controlled Dynamic Top-p MoE for Foundation Model Pre-training
Scalable Simulation-Based Model Inference with Test-Time Complexity Control
Privileged Information Distillation for Language Models
Powerful and Theoretically Guaranteed Independence Testing on Heterogeneous Federated Clients
FiRE: Fine-grained Ranking Evaluation for Machine Translation
AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent
Motion-Aware Caching for Efficient Autoregressive Video Generation
FedGain: Toward Negative-Gain-Free Client Collaboration in Federated Learning
Reasoning LLM Improves Speaker Recognition in Long-form TV Dramas
Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks
Retrospective Feature Estimation for Continual Learning
What Makes a Strong Model? A Unified Spectral Analysis of Knowledge Transfer over High-dimensional Linear Regression
Temporal-aware Flow Matching for Video Generation with Temporally Coherent Motion
PointDiT: Pixel-Space Diffusion for Monocular Geometry Estimation
Semantic Router: On the Feasibility of Hijacking MLLMs via a Single Adversarial Perturbation
Scene Graph Thinking: Reinforcing Structured Visual Reasoning for Multimodal Large Language Models
ProSAR: Prototype-Guided Semantic Augmentation and Refinement for Time Series Contrastive Learning
A Task-centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula
TMD-Bench: A Multi-Level Evaluation Paradigm for Music–Dance Co-Generation
Is Code Better Than Language for Algorithmic Reasoning?
World-Shaper: A Unified Framework for 360° Panoramic Editing
OSNIP: Breaking the Privacy-Utility-Efficiency Trilemma in LLM Inference via Obfuscated Semantic Null Space
Push, Pop, Parallelize: Stack-Augmented Linear Attention via the Delta Rule
Anchor-guided Hypergraph Condensation with Dual-level Discrimination
Safety Generalization Under Distribution Shift in Safe Reinforcement Learning: A Diabetes Testbed
Safe In-Context Reinforcement Learning
Axiomatic Atlas: A Prescriptive Framework for Neural Architecture Design
EXVERUS: Verus Proof Repair via Counterexample Reasoning
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention
Multi-label learning with contrastive cluster self-supervision for 3D hierarchical semantic segmentation
VIBE: Disentangling Social Dynamics via Kinematics-Informed Variational Inference for Behavioral Emotion
TCAP: Tri-Component Attention Profiling for Unsupervised Backdoor Detection in MLLM Fine-Tuning
EquiCAD: A Geometric Equivariant Neural Network for 3D Shape Classification
StructMAR: Structure-Aware Masked Autoregression for Explicit Layout Alignment in Text-to-Image Generation
Return of Frustratingly Easy Unsupervised Video Domain Adaptation
Olmix: A Framework for Data Mixing Throughout LM Development
Capacity-Agnostic Parameter Isolation for Continual Graph Learning
Learning the Interaction Prior for Protein-Protein Interaction Prediction: A Model-Agnostic Approach
SPADA: A Verifiable Test-Driven Agent for Controllable Parametric CAD Assembly Generation
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
DR$^2$Seg: Decomposed Two-Stage Rollouts for Efficient Reasoning Segmentation in Multimodal Large Language Models
Distinguishable Deletion: Unifying Knowledge Erasure and Refusal for Large Language Model Unlearning
Learning Dynamics of Zeroth-Order Optimization: A Kernel Perspective
SVL: Empowering Spiking Neural Networks for Efficient 3D Open-World Understanding
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning
Revisiting Distribution Correction Estimation for Offline Imitation Learning with Suboptimal Dataset
Olivia: Harmonizing Time Series Foundation Models with Power Spectral Density
HIVE-3D: Hierarchical Voxel Enhancement for High-Quality 3D Scene Generation
Navigating the Energy Landscape of Collaboration: Multi-Agent Communication Graph Generation via Score-Based Diffusion
Geometric Collapse: When Vision Models Fail to Verify Physical Causality
Mosaic: Runtime-Efficient Multi-Agent Embodied Planning
FRISM: Fine-Grained Reasoning Injection via Subspace-Level Model Merging for Vision–Language Models
Convex Dataset Valuation for Post-Training
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
How Hard Can It Be? Hardness-Aware Multi-Objective Unlearning
Turbo4DGen: Ultra-Fast Acceleration for 4D Generation
Coarse-Grained Boltzmann Generators
Tucker Attention: A generalization of approximate attention mechanisms
Gradient Inversion Attacks Beyond SGD
One Batch Is Enough: A Unified Dataset Condensation Framework for General Time Series Analysis
Probabilistic Performance Guarantees for Multi-Task Reinforcement Learning
CollabBench: Benchmarking and Unleashing Collaborative Ability of LLMs with Diverse Players via Proactive Engagement
Discriminative Mixture-of-Experts on Graphs with Reliable Expert Fusion
SPLIT-VLM: Salience-Guided Partitioning towards Local Coverage for Importance-Aware Token Dropping in Vision-Language Models
CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction
Physics-Informed Self-Supervised Learning on Efficient Electron-Density Images for Organic Material Property Prediction
Emergent Alignment via Competition
Improving Graph Transformers via Global Structural Priors
Nonlinear Covariate Balance in Experimental Design
Riemannian Generative Decoder
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models
Clustered Influence Functions
SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training
Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents
SkelHCC: A Hyperbolic CLIP-Driven Cache Adaptation Framework for Skeleton-based One-Shot Action Recognition
Preference Goal Tuning: Post-Training as Latent Control for Frozen Policies
From Coarse to Fine: Deep Prototype Refinement Network for Few-Shot Point Cloud Semantic Segmentation
Listening Through the Noise: Cauchy-Driven Diffusion Bridges for Robust Gastrointestinal Auscultation and Clinical Benchmarking
Structure-Centric Graph Foundation Model via Geometric Bases
Image-to-Brain Signal Generation for Visual Prosthesis with CLIP Guided Multimodal Diffusion Models
HyPOLE: Hyperproperty-Guided Multi-Agent Reinforcement Learning under Partial Observation
CL-GCL: Comprehensive and Lightweight Graph Contrastive Learning
Obliviate: Efficient Unlearning in Recommender Systems
The Pareto-optimal Trade-off between Regret and Statistical Inference in Linear Stochastic Bandits under Safety Constraints
AdaS: Adaptive Gradient Descent for Spiking Transformers
Innovation: An Almost Characterization of Hallucination
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
Global Plane Waves From Local Gaussians: Periodic Charge Densities in a Blink
Anchored Policy Optimization: Mitigating Exploration Collapse via Support-Constrained Rectification
RC-FCL: Combating Asynchronous Concept Drift in Federated Continual Learning via Retrospective Calibration
The data manifold under the microscope
Multi-Head Attention as a Source of Catastrophic Forgetting in MoE Transformers
SliceFine: The Universal Winning-Slice Hypothesis for Pretrained Networks
Scaling Transformers for End-to-End Discrete Audio Tokenization
Continual Learning of Domain-Invariant Representations
Inconsistency-Aware Minimization: Improving Generalization with Unlabeled Data
Geometry-Guided Generative Representation for Functional Brain Graphs
Learning Hamiltonian Flow Maps: Mean Flow Consistency for Large-Timestep Molecular Dynamics
Control Consistency Losses for Diffusion Bridges
A Flat Vocabulary or a Rich Hierarchy? Re-introducing Intrinsic Structure Transforms the Autoregressive Image Generation
From Intent to Solver Code: Semantic Alignment in Optimization Modeling
A Theoretical Game of Attacks via Compositional Skills
D$^2$O: A Dual Debiasing Operator for Training-Free Test-Time Adaptation of Vision–Language Models
MORALISE: A Structured Benchmark for Moral Alignment in Visual Language Models
Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Reasoning Models
Probabilistic Robustness Certificates against Adversarial Attacks
Finding the Minimal Parameter Budget for Implicit Reasoning: A Data Complexity Driven Scaling Law for Language Models
MultiPriv: Benchmarking Individual-Level Privacy Reasoning in Vision-Language Models
SphericalDreamer: Generating Navigable Immersive 3D Worlds with Panorama Fusion
Universal Redundancies in Time Series Foundation Models
GeoMoLa: Geometry-Aware Motion Latents for Learning Robust Manipulation Policies
Different Usage of Shared Components Explains Behavioral Variance in LLMs
Scheduling Thoughts: Learning the Order of Thought in Diffusion Language Models
Learning Multi-Scale Hypergraph for High-Order Brain Connectivity Analysis
Intrinsic Credit Assignment for Long Horizon Interaction
Emergent Visual Representations through Unsupervised Spiking Networks with Synaptic Pruning
Best-of-Both-Worlds for Heavy-Tailed Markov Decision Processes
Message Passing on the Edge: Towards Scalable and Expressive GNNs
Mirror Descent Under Generalized Smoothness
Autoregressive Image Generation with Masked Bit Modeling
Linear Regression with Unknown Truncation Beyond Gaussian Features
Scam2Prompt: A Scalable Framework for Auditing Malicious Scam Endpoints in Production LLMs
Similarity Is Not Logic: Factored Inference for Dual-Encoder Vision-Language Models
The Theory and Practice of MAP Inference over Non-Convex Constraints
SURGE:Unbiased Data Assimilation for Diffusion Model via Particle Filtering
Generative Modeling with Probabilistic Constraints
RADE: Unbiased Random Add-Drop Edge as a Regularizer
Reference-Free Meta-Learning for Generalized Implicit Neural Representation in Efficient MRI Reconstruction
SCOPE: Evolving Symbolic World for Planning in Open-Ended Environments
Open-o3-Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence
STAR: Rethinking MoE Routing as Structure-Aware Subspace Learning
LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation
CyberCycle: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities
Single-Head Attention in High Dimensions: A Theory of Generalization, Weights Spectra, and Scaling Laws
Seeing Symbols, Missing Structure: A Real-World Handwritten Mathematical Expression Recognition Benchmark for Large Models
FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model
When to Think, When to Speak: Learning Disclosure Policies for Large Language Model Reasoning
Spike Camera Autofocus via Frequency-Domain Spectral-Centroid Migration
JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG
Causal Identification from Counterfactual Data: Completeness and Bounding Results
Just Noticeable Difference Modeling for Deep Visual Features
Transformers Can Learn Posterior Predictive Distributions In-Context
Think Twice Before You Act: Protecting LLM Agents Against Tool Description Poisoning via Isolated Planning
Embedding Hybrid Systems into Continuous Latent Vector Fields
Robustness of Mixtures of Experts to Feature Noise
Trajectory-Stabilized Inference for Diffusion-Based Video Inpainting
Interactive Person Retrieval via Multi-Turn Multimodal Conversation
Full-Spectrum Graph Neural Network: Expressive and Scalable
Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse
Unfolded Laplacian Spectral Embedding: A Theoretically Grounded Approach to Dynamic Network Representation
Probabilistically-routed Bayesian Additive Spanning Trees for Learning on Constrained Domains
Explaining Data Mixing Scaling Laws
GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent
Cycle-of-Science: Reliable Reasoning through Counterfactual Verification for Agent Decision Making
Stabilizing PPO via Latent-Space Regularization and KDE-Driven Exploration
Decision Transformers As Zero-Shot Learners via Text-Behavior Alignment
STD-Former: Image-Conditioned Texture Dictionary Encoding with Sparse Topological Supervision for Texture Recognition
Mitigating Reward Hacking in RLHF via Bayesian Non-negative Reward Modeling
Generalized Linear Bandits with Memory
CrossQ: Task-Aligned Cross-Token Conditional Quantization for Late Interaction Retrieval
Robust Self-reflective Hashing for Cross-modal Retrieval with Noisy Label
Disentangling Latent Risk Pathways via Bayesian Hypergraph Inference
Commit to the Bit: Reactive Reinforcement Learning Done Right
FLARE-AI: Flaw Reporting for AI
Envisioning Beyond the Few: Disentangled Semantics and Primitives for Few-Shot Atypical Layout-to-Image Generation
Learning Manifold and Itô Dynamics with Branched Neural Rough Differential Equations
Position Is All You Need: A Free Lunch Token Compression Strategy for MLLM-based Referring Expression Segmentation
Latent Guided Sampling for Combinatorial Optimization
Bayesian Meta-Learning with Expert Feedback for Task-Shift Adaptation through Causal Embeddings
Plan for Speed: Dilated Scheduling for Masked Diffusion Language Models
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math
FOAM: Blocked State Folding for Memory-Efficient LLM Training
Discovering Implicit Large Language Model Alignment Objectives
Zero-shot Active Mapping via Fused 360-BEV Representations and Vision–Language Models
Instruction Lens Score: Your Instruction Contributes a Powerful Object Hallucination Detector for Multimodal Large Language Models
Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning
HugRAG: Hierarchical Causal Knowledge Graph Design for RAG
Are Object-Centric Representations Better At Compositional Generalization?
DEGAP: Dynamic Entropy-Guided Attention Perturbation for Contrastive Decoding in Large Vision-Language Models
Hidden in Plain Sight -- Class Competition Focuses Attribution Maps
Riemannian MeanFlow for One-Step Generation on Manifolds
Learning Tight Rejection Boundaries without Negatives for Strict One-Class Audio Deepfake Detection
CREDIT: Certified Ownership Verification of Deep Neural Networks Against Model Extraction Attacks
(Be Cautious!) Bio-Foundation Models Are Not Yet Robust to Biologically Plausible Perturbations and ML Transformations
From Text to Forecasts: Bridging Modality Gap with Temporal Evolution Semantic Space
Understanding Generalization from Embedding Dimension and Distributional Convergence
FAIL: Flow Matching Adversarial Imitation Learning for Image Generation
Budget-Constrained Step-Leve Diffusion Caching
GenDA: Generative Data Assimilation on Complex Urban Areas via Classifier-Free Diffusion Guidance
Revealing Long-context Potential of Attention Heads via Frequency Kernels
Frequency Matching in Spiking Neural Networks for mmWave Sensing
Certifying Graph Neural Networks Against Label and Structure Poisoning
Global Policy-Space Response Oracles for Two-Player Zero-Sum Games
Optimal Transport Group Counterfactual Explanations
Revisiting Spectral Representations in Generative Diffusion Models
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
Beyond Majority Voting: Self-Reflective Test-Time Reinforcement Learning for LLM Reasoning
UniSVQ: 2-bit Unified Scalar-Vector Quantization
Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization
Mode Seeking meets Mean Seeking for Long Video Generation
Trajectory Seriation via Spectral Tangent Alignment and Global Embedding
Detecting the Semantic Fixed Point: A Geometric Framework for Efficient Inference
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
“very likely” Means “uncertain”? How LLMs Diverge from Humans in Linguistic Uncertainty Quantification
Agent Learning via Early Experience
WatchLog: Efficient and Interpretable Event Reasoning for Endpoint Detection and Response Logs with Multimodal LLMs
EntroKV: Entropy-Guided Dynamic Budget Allocation for KV-Cache Compression
Evolution Strategies at the Hyperscale
Find, Fix, Reason: Context Repair for Video Reasoning
Scalable GANs with Transformers
Multi-task Linear Regression without Eigenvalue Lower Bounds: Adaptivity, Robustness and Safety
ParaTool: Shifting Tool Representations from Context to Parameters
De4D-SLAM: Gradient-Isolated Static-Dynamic Decoupling for Monocular SLAM in Dynamic Environments
Beyond Attention Imbalance: Mitigating Hallucinations via Spectral Surgery
PLoRA: Efficient Concurrent LoRA Training for Large Language Models
Ellipsoidal Time Series Forecasting
Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
Negatives-Dominant Contrastive Learning for Generalization in Imbalanced Domains
Position: Bridge the Gaps between AI Development and Regulation
Neural QAOA$^2$: Differentiable Joint Graph Partitioning and Parameter Initialization for Quantum Combinatorial Optimization
LoPhyDA: Low-Rank Tensor and Physics Gradient Guided Diffusion for Atmospheric Data Assimilation
Posterior Concentration of Physics-Informed Neural Networks for Elliptic PDEs
ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory Management
The Relative Instability of Model Comparison with Cross-validation
Rh-3DGS: Robust Open-Vocabulary Scene Understanding via Riemannian Huber Distillation and Manifold-Aware Sampling
Scaling Multi-Agent Environment Co-Design with Diffusion Models
Curated Synthetic Data Doesn’t Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences
FS-I2P: A Hierarchical Focus–Sweep Registration Network with Dynamically Allocated Depth
TokenSwap: Backdoor Attack on the Compositional Understanding of Large Vision-Language Models
Beyond Trajectory-Level Attribution: Graph-Based Credit Assignment for Agentic Reinforcement Learning
Incorporating Importance Weighting in Optimal Transport Based Domain Alignment
Phase-Aware Mixture of Experts for Agentic Reinforcement Learning
Hyperbolic Hierarchical Alignment for Video-Based Visible-Infrared Person Re-Identification
Twins: Learn to Predict Unified Representations with Focal Loss
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights
Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers
How do Human Processes AI-generated Hallucination Contents: a Neuroimaging Study
Mind the budget: Accelerating Deep Reinforcement Learning using Early Exit Neural Networks
Positive-Unlabeled Learning with Extreme Scarcity of Labeled Positives
Deep Neural Network Regression with Functional Covariates
Improving the Sensitivity of Backdoor Detectors via Class Subspace Orthogonalization
Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning
Gradient Descent with Large Step Size Restores Symmetry in Deep Linear Networks with Multi-Pathway
TileSparse: Arithmetic-Intensity-Aware Sparse Attention for Compute-Bound LLM Decoding
Reinforcement Learning from Bagged Reward
Unbiased and Second-Order-Free Training for High-Dimensional PDEs
Accuracy-First Rényi Differential Privacy and Post-Processing Immunity
Multimodal Crystal Flow: Any-to-Any Modality Generation for Unified Crystal Modeling
Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation
Discovering Ordinary Differential Equations with LLM-Based Qualitative and Quantitative Evaluation
On the Plasticity and Stability for Post-Training Large Language Models
Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion
Reasoning on the Manifold: Bidirectional Consistency for Self-Verification in Diffusion Language Models
BVS: Bayesian Visual Search with Multimodal Large Language Model for Fine-grained Perception
From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors
FairSSL: Fair Multimodal Self-Supervised Learning
When Search Goes Wrong: Red-Teaming Web-Augmented Large Language Models
Distributionally Robust Set Representation Learning Under Inference-Time Element Corruption
CIRBench: Evaluating Large Language Models as LLVM IR Optimizers
UniRRM: Unified Reasoning Reward Models Across Languages and Evaluation Paradigms
VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning
SpecForge: A Flexible and Efficient Open-Source Training Framework for Speculative Decoding
SkillNet: Hierarchical Skill Modeling for Compositional Generalization in Vision-Language Action Models
Prompt Optimization with Minimal Unlabeled Input via Meta-Reasoning
Attention Illuminates LLM Reasoning: The Uncovered Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
Optimal Regret for Policy Optimization in Contextual Bandits
Video-BCI: Bayesian Cognitive Integration of Self-Prior Hypotheses for Video Understanding
Open-Text Aerial Detection: A Unified Framework For Aerial Visual Grounding And Detection
MAD: Manifold Attracted Diffusion
BabyVision: Visual Reasoning Beyond Language
Near-Optimal Regret for Policy Optimization in Contextual MDPs with General Offline Function Approximation
Distributed Direct Preference Optimization
Learning Rewrite-Invariant Reasoning with Targeted Alternation Training
TPGDiff : Hierarchical Triple-Prior Guided Diffusion for Image Restoration
SEDRAS: Symbolically Evaluated Deep Research And Science
Shared Semantics, Divergent Mechanisms: Unsupervised Feature Discovery by Aligning Semantics and Mechanisms
Advancing SVD-based LLM Compression via Layer-Wise Error Model Search
Active Regression for Single-Index Models with Unknown Link Functions
Progressive Graph Structure Adjustment for Homophily Shift Adaptation
RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
How Can Mamba Learn In Context with Outliers and Generalize Provably?
Uncovering the Gradient Geometry of Long CoT: A Spectral-guided Approach to Reasoning Distillation
Enhancing Conformal Prediction via Class Similarity
LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding
VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos
CUPID in the Model Zoo: Online Matchmaking for Selecting Your Dream LLM
Efficient, Property-Aligned Fan-Out Retrieval via RL-Amortized Diffusion
Universal Approximation with Softmax Attention
Leaderboard Incentives: Model Rankings under Strategic Post-Training
NorMuon: Making Muon more efficient and scalable
Geometry-Aware Image Flow Matching
Motion Planning in Compressed Representation Spaces
UHR-BAT: Budget-Aware Token Compression Vision-Language model for Ultra-High-Resolution Remote Sensing
Mechanisms of Introspective Awareness
Think Twice Before You Act: Enhancing Agent Behavioral Safety with Thought Correction
Safety Alignment of LMs via Non-cooperative Games
Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression
A²RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation
mHC: Manifold-Constrained Hyper-Connections
On the Accuracy of Newton Step and Influence Function Data Attributions
The Information Geometry of Softmax: Probing and Steering
Bayesian Gated Non-Negative Contrastive Learning
DGG-HMR: Multi-Person Human Mesh Recovery with Depth-Guided Geometric Anchoring
MACD: Model-Aware Contrastive Decoding via Counterfactual Data for Video-LLMs
AOEB: Benchmarking Agent-Oriented Multimodal Embeddings
Deep Single-Index Fréchet Regression
Efficient numeracy in language models through single-token number embeddings
CURE: Context-driven Diffusion with Progressive Expansion for Single Domain Generalization in Time Series Classification
Prototype-guided Bilateral Alignment Multimodal Federated Learning
DB-KSVD: Scalable Alternating Optimization for Disentangling High-Dimensional Embedding Spaces
HDTree: Generative Modeling of Cellular Hierarchies for Robust Lineage Inference
Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering
Metric–-Phase Fields: Decoupling Distance and Sign for Thin-Structure Reconstruction from Unoriented Point Clouds
Speculative Coupled Decoding for Training-Free Lossless Acceleration of Autoregressive Visual Generation
ADEPT: RL-Aligned Agentic Decoding of Emotion via Evidence Probing Tools — From Consensus Learning to Ambiguity-Driven Emotion Reasoning
CFPO : Counterfactual Policy Optimization For Multimodal Reasoning
Time series saliency maps: Explaining models across multiple domains
Robust Causal Discovery in Real-World Time Series with Power-Laws
CooT: Learning to Coordinate In-Context with Coordination Transformers
PSG-Nav: Probabilistic Scene Graph Navigation via Multiverse Decision Making
Efficient Learning of Deep State Space Models via Importance Smoothing
SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
Hermes: An Evidence-Driven Agentic Framework for Trustworthy and Explainable AI-Generated Video Detection
Episodic Memory-Guided Controllable Experience Synthesis for Reinforcement Learning
Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection
Learnability-Driven Knowledge Assimilation for Class-Incremental Semantic Segmentation
VFMF: Dense Forecasting by Generating Foundation Model Features
In-Context Learning Is Provably Bayesian Inference: A Generalization Theory for Meta-Learning
Local Minima in Quadratic-Penalty Relaxations of Binary Linear Programs
MiniX: Mitigating Low-Rank Collapse and Attention Bottlenecks in Tabular Foundation Models
Networked Information Aggregation for Binary Classification
Revisiting Coding-Based Approaches to Overcome the Curse of Dimensionality in Learning-Based Watermarking
Sonar-TS: Search-Then-Verify Natural Language Querying for Time Series Databases
MN-Diff: Diffusion Parameterized MoE-NCDE for Continuous Time Series Generation with Irregular Observations
Panini: Continual Learning in Token Space via Structured Memory
Federated Distillation for Whole Slide Image via Gaussian-Mixture Feature Alignment and Curriculum Integration
Moment Matching Q-Learning
Beyond Problem Solving: UOJ-Bench for Evaluating Code Generation, Hacking, and Repair in Competitive Programming
Social Hippocampus Memory Learning
Towards Solving the Gilbert-Pollak Conjecture via Large Language Models
Neural Minimum Weight Perfect Matching for Quantum Error Codes
Beyond Instance-Level Self-Supervision in 3D Multi-Modal Medical Imaging
SpikeCLR: Self-Supervised Contrastive Learning for Visual Representations with Spiking Neural Networks
Testing For Distribution Shifts with Conditional Conformal Test Martingales
Behavior-Invariant Task Representation Learning with Transformer-based World Models for Offline Meta-Reinforcement Learning
PinTok: Tokenizers Deserve Dedicated Pinned CPU-Compute and Memory
What if Tomorrow is the World Cup Final? Counterfactual Time Series Forecasting with Textual Conditions
Swift-SVD: Theoretical Optimality Meets Practical Efficiency in Low-Rank LLM Compression
Memory Caching: RNNs with Growing Memory
NaviAgent: Graph‑Driven Bilevel Planning for Scalable Tool Orchestration
Rethinking Genomic Modeling Through Optical Character Recognition
Speech-Audio Compositional Attacks on Multimodal LLMs and Their Defense with SALMONN-Guard
Boosting Monocular Metric Depth Estimation via Bokeh Rendering
HIAL: Towards Semantics-Aware Hypergraph Active Learning via Dual-Perspective Information Maximization
FineFocus: Benchmarking and Improving Fine-Grained Text-to-Image Alignment via Paired Reinforcement Learning
Position: The Systemic Lack of Agency in Visual Reasoning
Riemannian Dueling Optimization
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind
Decoy for the Judge: Disrupting Multi-Turn Jailbreaks using Semantics-Preserving Output Rewriting
LearniBridge: Learnable Calibration of Feature Caching for Diffusion Models Acceleration
Any2Any: Unified Arbitrary Modality Translation for Remote Sensing
Revisiting the Platonic Representation Hypothesis: An Aristotelian View
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
Var-JEPA: Variational Joint-Embedding Predictive Architecture – Bridging Predictive and Generative Self-Supervised Learning
TabMGP: Martingale Posterior with TabPFN
Learning What to Generate: A Reinforcement Learning-based Closed-Loop Augmentation Framework for Person Re-identification
AdamO: A Collapse-Suppressed Optimizer for Offline RL
Support-Proximity Augmented Diffusion Estimation for Offline Black-Box Optimization
Stop the Flip-Flop: Context-Preserving Verification for Fast Revocable Diffusion Decoding
GRPO is Secretly a Process Reward Model
Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization
NeUQI: Near-Optimal Uniform Quantization Parameter Initialization for Low-Bit LLMs
Executable Agentic Memory for GUI Agent
Convex Distance Operator Transport: Convex and Geometry-Preserving Information
From Reasoning Traces to Reusable Modules: Reinforcement Learning for Compositional Generalization in Language Model Reasoning
Toward Scalable and Valid Conditional Independence Testing with Spectral Representations
Causal Representation Learning with Optimal Compression and Complex Treatments
Beyond Euclidean Clipping: Overcoming Exploration Collapse in LLM RL via Riemannian Isometric Policy Optimization
SemanticNVS: Improving Semantic Scene Understanding in Generative Novel View Synthesis
SAEs-BrainMap: Unveiling the Emergence of Specialized Concepts in Deep Models via Brain Alignment
Learning to Reconfigure: Co-designing Reconfigurable robots for Heterogeneous Locomotion
Nested Spatio-Temporal Time Series Forecasting
Position: Peer Review Should Be Calibrated via LLM Scoring
See the Emotion: A Facial Emoji Proxy Modeling for EEG Emotion Recognition
How to Price Data: A Market Equilibrium Based Approach
Finite and Corruption-Robust Regret Bounds in Online Inverse Linear Optimization under M-Convex Action Sets
Weights to Code: Extracting Interpretable Algorithms from the Discrete Transformer
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories
Learning Long Range Spatio-Temporal Representations over Continuous Time Dynamic Graphs with State Space Models
CHESS: Chebyshev Spectral Synthesis for Trajectory Condensation
EEmo-Logic: A Unified Dataset and Multi-Stage Framework for Comprehensive Image-Evoked Emotion Assessment
Unsupervised Mode Discovery for Fine-tuning Multimodal Generative Policies
Towards Fine-grained Robustness: Attention-guided Test-time Prompt Tuning for Vision-Language Models
On Revisiting Entropy for Identifying Mislabeled Medical Images
Distilling Geometry Priors for 3D-Consistent Video Generation
A Unified Framework for Deep Hypergraph Clustering Beyond Homophily
SURF: Separation via Unsupervised Remixing Flow
Adaptive Batch Sizes Using Non-Euclidean Gradient Noise Scales for Stochastic Sign and Spectral Descent
TelecomTS: A Multi-Modal Observability Dataset for Time Series and Language Analysis
A unified theory of feature learning in RNNs and DNNs
SegPVSG: Panoptic Video Scene Graph Generation via Temporal Focusing and Generative Augmentation
UGround: Towards Unified Visual Grounding with Unrolled Transformers
ANCHOR: Abductive Network Construction with Hierarchical Orchestration for Reliable Probability Inference in Large Language Models
Unifying Masked Diffusion Models with Various Generation Orders and Beyond
Towards A Generative Protein Evolution Machine with DPLM-Evo
Beyond Unidirectional Bias: Reciprocal Perspective Calibration in Scene Graph Generation
REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation
RevealLayer: Disentangling Hidden and Visible Layers via Occlusion-Aware Image Decomposition
GKD-Recruiter: Jointly Modeling Social and Task Heterogeneity for Spatial Crowdsourcing via Graph Knowledge Distillation
The Truth Stays in the Family: Enhancing Contextual Truthfulness via Inherited Heads in Model Lineages
A Constrained Optimization Perspective of Unrolled Transformers
Demystifying LLM-as-a-Judge: Analytically Tractable Model for Inference-Time Scaling
Efficient Adaptive Testing via Gradient Path Matching Subset Selection for AI Education
GeoDM: Geometry-aware Distribution Matching for Dataset Distillation
Contextualized Privacy Defense for LLM Agents
Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation
Diffusion posterior sampling for simulation-based inference in tall data settings
Efficient Learning of Compositional Targets with Hierarchical Spectral Methods
How RLHF Amplifies Sycophancy
Robust AI Evaluation through Maximal Lotteries
IACW: Intent-Aware Controllable Watermarking for Scalable Authorial Intent Attribution
Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training
Memory-Distilled Selection for Noise-Robust Anomaly Detection
Adaptive Personalized Federated Learning via Multi-task Averaging of Kernel Mean Embeddings
Optimizing Inference-Time Compute for Medical Reasoning via Uncertainty Quantification
Don't Drop Dropout: Optimizing Layer Sparsity for Efficient LLM Training and Inference
Cardio-mmFlow: A Gaussian-Prior-Free Physics-Informed Flow Matching Framework for Electrocardiogram to mmWave Radar Synthesis.
REViT: Roto-reflection Equivariant Convolutional Vision Transformer
Random Process Flow Matching: Generative Implicit Representations of Multivariate Random Fields
The benefits of full data shuffle, now with optimal I/O cost: $k$-wise independence and matrix transposition to the rescue
SpatialJB: How Text Distribution Art Becomes The "Jailbreak Key" for LLM Guardrails
The Double Dilemma in Multi-Task Radiology Report Generation: A Gradient Dynamics Analysis and Solution
ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning
When Single Answer Is Not Enough: Rethinking Single-Step Retrosynthesis Benchmarks for LLMs
A Theoretical Framework for Statistical Evaluability of Generative Models
Beyond Hamming: Query-Aware Decoding of Binary Cosine Sketches
CHB: A Diagnostic Toolkit for Hardness-Aware Clustering Evaluation
Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits
FlowState: Sampling-Rate‑Equivariant Time‑Series Forecasting
Towards Effective Waste Segmentation for Automated Waste Recycling in Cluttered Background
Position: World Models as an Intermediary between Agents and the Real World
Navigating the Pareto Frontier of Alignment:Spectrum-Adaptive Fine-Tuning for LLMs
CoPE: Continual Probe-guided Expansion for Large Vision-Language Models
Divide-and-Denoise: A Game-Theoretic Method for Fairly Composing Diffusion Models
Step-Level Sparse Autoencoder for Reasoning Process Interpretation
Training-Free Guided Diffusion for Planning: A Unified Framework via Doob’s h-Transform with Safety Guarantees
SetPO: Set-Level Policy Optimization for Diversity-Preserving LLM Reasoning
ExCyTIn-Bench: Evaluating LLM agents on Cyber Threat Investigation
Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision
CG-MLLM: Captioning and Generating 3D content via Multi-modal Large Language Models
Position: Digital Agents Require Unified Agent-Native Environments
Towards Pareto-Optimal Tool-Integrated Agents with Pareto Ranking Policy Optimization
HieraScaffold: Learning Compact Hierarchical Representations for Scalable 4D LiDAR Generation
OSAQ: Outlier Self-Absorption for Accurate Low-bit LLM Quantization
Active Tabular Augmentation via Policy-Guided Diffusion Inpainting
Klein Hyperbolic Metric Learning
Mirror Descent Actor Critic via Bounded Advantage Learning
MalTree: Tracing Malware Evolution using Embeddings at Scale
Plan, Decouple, Assimilate: Physics-Aware Object Insertion in Remote Sensing Imagery
View Space: Learning Representation across Arbitrary Graphs
Learning Treatment Allocations with Risk Control Under Partial Identifiability
ConPress: Learning Efficient Reasoning from Multi-Question Contextual Pressure
Bridging the Gap Between Average and Discounted TD Learning
Set-Coupled Guidance: Set-Level Coordination in Diffusion-Based Dataset Distillation
Sharp empirical Bernstein inequalities for the variance of bounded random variables
Spiked-CFR: Causal Representation Learning from LLMs via Wasserstein Projection Pursuit
Making Foundation Models Probabilistic via Singular Value Ensembles
Preference-Calibrated Optimization with Score-Level Distribution Alignment for Text-to-Image Diffusion Model Unlearning
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
Efficient Mismatch-Tolerant Coding for Model-Driven Compression
Enhanced Latent-Space Adversarial Training for Super-Resolution
GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning
Risk-Bounded Distribution Reconstruction: Stable Statistic Calibration for Long-Tailed Recognition
Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation
The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models
Fine-Tuning of Transformer models with Frames
From Memorization to Parameter Interference: How Overtraining Experts Harms Model Merging
GAE: Unleashing Physical Potential of VLM with Generalizable Action Expert
Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training
IBMA: Information Bottleneck-Based Multimodal Alignment
Demystifying When Pruning Works via Representation Hierarchies
ACTIVE-o3 : Empowering MLLMs with Active Perception via Pure Reinforcement Learning
The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs
Hierarchical Multi Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation
Mitigating Plasticity Loss through Architectural Design in Continual Learning
Decoupling Skeleton and Flesh: Efficient Multimodal Table Reasoning with Disentangled Alignment and Structure-aware Guidance
Differentiable Optimization Layers for Guaranteed Fairness in Deep Learning
URS: A Unified Neural Routing Solver for Cross-Problem Zero-Shot Generalization
CoEvol-NO: State and Coordinate Co-Evolution with an Error-Driven Predictor-Corrector Paradigm for Neural Operator Transformer
SE-GA: Memory-Augmented Self-Evolution for GUI Agents
A Close Look at Negative Label Guided Out-of-distribution Detection in Pre-trained Vision-Language Models
Trajectory-Level Speculative Decoding for Diffusion Language Models
Deliberate Evolution for Sample-Efficient Symbolic Regression with LLM
On the Anisotropy of Score-Based Generative Models
PhenoBrain: Phenotype-Conditioned Long-Range Communication for Multi-Modal Brain Network Analysis
Robust Parallel Diffusion Sampling via Dynamic Jacobian Bandwidth
Forgetting Whenever You Want: A Decentralized Continual Learning Framework with On-Demand Unlearning
Lavida-R1: Advancing Reasoning for Unified Multimodal Diffusion Language Models
New Bounds for Kernel Sums via Fast Spherical Embeddings
Bounded Hyperbolic Tangent: A Stable and Efficient Alternative to Pre-Layer Normalization in Large Language Models
Predictable Compression Failures: Order Sensitivity and Information Budgeting for Evidence-Grounded Binary Adjudication
VideoTrace-R1: Long Video-based Retrieval-Augmented Generation via Temporal Path Graph Understanding
IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning
DAVE: Distribution-aware Attribution via ViT Gradient Decomposition
DiL: Discrete-anchored Representation Alignment for Semi-Supervised Continual Learning
BAT: Better Audio Transformer Guided by Convex Gated Probing
Robust Federated Learning Against Adaptive Compression
What Preferences Can—and Cannot—Predict in Multi-Agent Online Learning
MiVE: Multiscale Vision-language features for reference-guided video Editing
MFCL Audio: An Audio Function Calling Evaluation for Large Language Models
Reinforcement Learning with Evolving Rubrics for Deep Research
Geometric Rate–Distortion Invariance for Domain Generalization
From Parameters to Feature Space: Task Arithmetic for Backdoor Mitigation in Model Merging
Mixture Prototype Flow Matching for Open-Set Supervised Anomaly Detection
Sparse Topology-Aware Pairwise Scoring for Large-Scale Multi-Agent Reinforcement Learning
CATArena: Evaluating Evolutionary Capabilities of Code Agents via Iterative Tournaments
Towards Parameter-Free Temporal Difference Learning
An Empirical Study on the Resilience of Partial Merging to Model Clone Attacks
Class-Prior Perturbation-Robust Regularization for Imbalanced Unreliable Partial Label Learning
Mean Flow Policy Optimization
Hierarchical Image Tokenization for Multi-Scale Image Super Resolution
The Geometry of Updates: Fisher Alignment at Vocabulary Scale
A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation
Credible Information Subset Decomposition: An End-to-End Multi-fidelity Learning Model by Modeling Label Information
StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation
FIDIA: Function-Informed Sequence Design via Inference-Aligned Policy Optimization
HVAE: Hyperbolic Variational Autoencoder For Flexible Knowledge Transfer Across Multiple Domains
Strategy Executability in Mathematical Reasoning: Leveraging Human–Model Differences for Effective Guidance
EGG: An Expert-Guided Agent Framework for Kernel Generation
Bridging RGB and RAW: Single-step Deterministic Flow with Homogeneous Aligned Guidance
Show, Don't Tell: Morphing Latent Reasoning into Image Generation
CCLRec: Consensus-driven Contrastive Learning for LLM-enhanced Graph Recommendation
Detecting Fluent Optimization Based Adversarial Prompts via Sequential Entropy Changes
RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies
Approximate Proportionality in Online Fair Division
Retrieval-Aware Distillation for Transformer-SSM Hybrids
Bayesian Tensor Decomposition with Diffusion Model Prior
Multimodal Nested Learning for Decoupled and Coordinated Optimization
Joint-Space Empowerment as a Theory of Dexterous Motor Coordination
RTInfer: Exploiting Concurrency for Multiple Real-Time DNN Inference on Edge GPUs
Rethinking Depth Pruning for Vision Transformers: A Heterogeneity-Aware Perspective
OmniFit: Bridging Modalities via Layer-Adaptive Token Compression for Omnimodal Large Language Models
Diversity-aware Weight Perturbation Promotes Robust Adaptation
Re-FORC: Adaptive Reward Prediction for Efficient Chain-of-Thought Reasoning
PAMD: Structured Adaptive Distances for Bisimulation Representations in Visual Reinforcement Learning
Online Social Welfare Function-based Resource Allocation
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions
VisionPulse: Dynamic Visual Sparsity for Efficient Multimodal Reasoning
MapUQ: Map with Uncertainty Quantification for Robust BEV Vectorized Construction
Controlled Collaboration Geometry for Personalized Federated Learning
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning
Optimizing Language Models for Crosslingual Knowledge Consistency
Modeling Attributional Style at Scale: A Dataset and Analysis for Psychological Attribution Assessment and Reframing
MADA-Attack: Transferable Multi-modal Attention Distraction Adversarial Attack against Vision Language Models
Generalizable and Composable Multi-Model Embedding Translation
SVD as a Fast Interpretability Method for Transformers
How do LLMs Compute Verbal Confidence?
GEM: Geometric Entropy Mixing for Optimal LLM Data Curation
Automatically Finding Reward Model Biases
Fixed Aggregation Features Can Rival GNNs
Base Models Know How to Reason, Thinking Models Learn When
Conformal Path Reasoning: Trustworthy Knowledge Graph Question Answering via Path-Level Calibration
Fast Non-Episodic Finite-Horizon RL with K-Step Lookahead Thresholding
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
DropoutTS: Sample-Adaptive Dropout for Robust Time Series Forecasting
Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning
Anti-Aliasing Matters: A Dynamic Network for Time Series Forecasting
Doc-to-LoRA: Learning to Instantly Internalize Contexts
Alethia: a Foundational Encoder for Voice Deepfakes
EcoVLA: Environment-Aware Adaptive Pruning with Interleaved Inference Orchestration for Vision-Language-Action Models
Position: Universal Aesthetic Alignment Narrows Artistic Expression
Plug-and-Play Label Map Diffusion for Universal Goal-Oriented Navigation
On the Sample Efficiency of Inverse Dynamics Models for Semi-Supervised Imitation Learning
When Do Hallucinations Arise? A Graph Perspective on the Evolution of Path Reuse and Path Compression
Alignment-Aware Decoding
SALSA-V: Shortcut-Augmented Long-form Synchronized Audio from Videos
A Queueing-Theoretic Framework for Stability Analysis of LLM Inference with KV Cache Memory Constraints
A Capacity-Based Rationale for Multi-Head Attention
Reasoning Structure of Large Language Models
UniMedVL: Unifying Medical Multimodal Understanding and Generation through Observation-Knowledge-Analysis
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators
Frictional Q-Learning
A Framework for Understanding Learnability in Transformers
Contrastive Order Learning: A General Framework for Ordinal Regression
A Progressive Evidence Localization Framework Based on Wasserstein Gradient Flows for Document Visual Question Answering
Online Packet Scheduling with Deadlines and Learning
Collaborative Threshold Watermarking
Beyond Euclidean Summaries: Online Change Point Detection for Distribution-Valued Data
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
Fingerprinting Pre-trained Encoders under Arbitrary Downstream Fine-Tuning via Adversarial Shifting
Transitivity Meets Cyclicity: Explicit Preference Decomposition for Dynamic Large Language Model Alignment
ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior
Query Circuits: Explaining How Language Models Answer User Prompts
Search Space Synthesis for Parametric Functions
Unified Episodic and Semantic Memory via Modulating Transformer FeedForward Layers
Position: Safe Models Do Not Guarantee Safe Societies: The Case for Sociopolitical Risk
Revisiting Positive Samples in Graph Contrastive Learning: From the Perspective of Message Passing
Tackling Fake Forgetting through Uncertainty Quantification
Internalizing Safety Understanding in Large Reasoning Models via Verification
The Entropic Signature of Class Speciation in Diffusion Models
Learning to Self-Verify Makes Language Models Better Reasoners
Position: Deployed Reinforcement Learning should be Continual
On the Identifiability of Poisson Branching Structural Causal Model Under Latent Confounding
Provable Training Data Identification for Large Language Models
Towards Understanding Massive Activations in Attention Sink Mechanism
Simple Algorithms for Bad Triangle Transversals with Applications to Correlation Clustering
Private Learning with Public Feature Conditioning
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
VALUEFLOW: Toward Pluralistic and Steerable Value-based Alignment in Large Language Models
IQA-Spider: Unifying Reasoning, Grounding, and Referring for Multi-Granularity Image Quality Assessment
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn
Training-Free Sparse Attention for Fast Video Generation via Offline Layer-Wise Sparsity Profiling and Online Bidirectional Co-Clustering
AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation
Coupled Training with Privileged Features and Unlabeled Data
General Analysis of LMO-based Optimizers: Beyond Bounded Variance
High-Dimensional Learning Dynamics of Quantized Models with Straight-Through Estimator
FlowCloud: Learning Continuous Spatiotemporal Dynamics from Unpaired Sparse Point Cloud Snapshots
Must All Negatives Be Pushed Away Equally? Uncertainty-Aware Cross-View Geo-Localization via Normal Inverse Gamma Distribution
NetDiff: Graph Diffusion with Improved Global Capabilities to Generate and Update Mobile Network Topologies
Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects
Structure-Aware Riemannian Flow Matching for Registration and Fusion of Hyperspectral and Multispectral Images
The Devil is in the Spectrum: Mitigating Representation Collapse in LLMs via Topologically Regularized Side-Path
Saving Foundation Flow-Matching Priors for Inverse Problems
Conflicting Biases at the Edge of Stability: Norm versus Sharpness Regularization
Physics-informed coarsening for multigrid graph neural networks surrogates
ScDiVa: Masked Discrete Diffusion for Joint Modeling of Single-Cell Identity and Expression
Mitigating Mask Prior Drift and Positional Attention Collapse in Large Diffusion Vision-Language Models
Fix the Mind, Not the Move: Interpretable AI Assistance via Knowledge-Gap Localization
Local-Minima-Preserving Polynomial Relaxation of Ising Problems
Rényi Diffusion Models
Probabilistic Modeling of Latent Agentic Substructures in Deep Neural Networks
Efficient Diffusion Models under Nonconvex Equality and Inequality constraints via Landing
Beyond Prediction: Tail-Aware Scheduling for LLM Inference
Real Data Lies: Unveiling and Closing the Quality Shortcut in Generalizable AI-Generated Video Detection
Accurate Large-scale Uncertainty Quantification using Stochastic Gradient Markov Chain Monte Carlo
Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved Reasoning
Laplacian Representations for Decision-Time Planning
Deep Scientific Reasoning under Physical Constraints: Structure-Aware Spectrum Prediction for Electronic Density of States
What Does Thompson Sampling Optimize?
Unifying Heterogeneous Degradations: Uncertainty-Aware Diffusion Bridge Model for All-in-One Image Restoration
Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO
Rays as Pixels: Learning A Joint Distribution of Video and Camera Trajectories
Tuning the Implicit Regularizer of Masked Diffusion Language Models: Enhancing Generalization via Insights from $k$-Parity
CacheEdit: Efficient Multi-round Image Editing via Adaptive Token-wise Reuse.
Protein Autoregressive Modeling via Multiscale Structure Generation
Regularization in the Axiomatic Approach to Learning from Human Preferences
Fast Inverse Lithography via GRPO Reinforced Flow Matching
Certified Circuits: Stability Guarantees for Mechanistic Circuits
OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale
Variable-Length Tokenization via Learnable Global Merging for Diffusion Transformers
General Synthetic-Powered Inference
MentisOculi: Revealing the Limits of Reasoning with Mental Imagery
EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
An Algebraic View of the Expressivity of Recurrent Language Models
Attention Projection Mixing with Exogenous Anchors
Extending Prediction-Powered Inference through Conformal Prediction
VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model
The ACE Protocol: Operationalizing Language Model Activations for Better Calibration and Utility
Decomposing Query-Key Feature Interactions Using Contrastive Covariances
DRPBench: Evaluating LLMs in Concurrent Code Comprehension via Fine-grained Data Race Prediction
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?
Flow-Based Density Ratio Estimation for Intractable Distributions with Applications in Genomics
MLLM-4D: Towards Visual-based Spatial-Temporal Intelligence
DyGRO-VLA: Cross-Task Scaling of Vision–Language–Action Models via Dynamic Grouped Residual Optimization
A Generalist Pair-wise Progress Critic Model for Vision-Language-Action Robots
KineFlow: Kinematic Second-Order Flow Matching for Time-Series Forecasting
Biases in the Blind Spot: Detecting What LLMs Fail to Mention
Divide and Contrast: Learning Robust Temporal Features without Augmentation
Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language
Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs
Multi-Agent Reinforcement Learning with Submodular Reward
Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning
Deterministic Inference across Tensor Parallel Sizes That Eliminates Training-Inference Mismatch
ERAlign: Energy-based Representation Alignment of GNNs and LLMs on Text-attributed Graphs
T-measure: A Topology-Consistent Metric for Binary Segmentation
Predictive Prefetching for Retrieval-Augmented Generation
Gradient-Based Causal Tree Ensembles: A Backbone Architecture for Heterogeneous Treatment Effects
A Geometric Analysis of Small-sized Language Model Hallucinations
Interpretability Transfer from Language to Vision via Sparse Autoencoders
Particle-Guided Diffusion Models for Partial Differential Equations
UI2Code^N: UI-to-Code Generation as Interactive Visual Optimization
Path-conditioned training: a principled way to rescale ReLU neural networks
UAV$^2$: A Unified and Adaptive Scheduling Framework for UAV Autopilot System with Reinforcement Learning
Stabilizing Recurrent Dynamics for Test-Time Scalable Latent Reasoning in Looped Language Models
Dissecting the Safety Circuit: Neuronal Intervention for Transferable Adversarial Attacks on VLMs
LoKiFormer: Locality-aware Attention with Decoupled Knowledge Memory for Efficient Large Language Model Pretraining
Focusing: View-Consistent Sparse Voxels for Efficient 3D VAE
Uncovering Competency Gaps in Large Language Models and Their Benchmarks
ZeroDiff: Zero-Shot Time Series Reconstruction via Informed-Prior Diffusion
TsLLM: Augmenting LLMs for General Time Series Understanding and Prediction
AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Revisiting Neural Processes via Fourier Transform and Volterra Series
Trojan-Speak: Bypassing Constitutional Classifiers with No Jailbreak Tax via Adversarial Finetuning
A New Framework for Cybersecurity Refusals in AI Agents
Solving Inverse Problems with Flow-based Models via Model Predictive Control
AmbiRefer3D: 3D Visual Grounding with Referential Ambiguity
Multilingual Safety Alignment Via Sparse Weight Editing
Calibrating Uncertainty for Zero-Shot Adversarial CLIP
Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations
Stop Training for the Worst: Progressive Unmasking Accelerates Masked Diffusion Training
Spurious Rewards: Rethinking Training Signals in RLVR
AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Material Structures
TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward
Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution
MoSE: Mixture of Slimmable Experts for Efficient and Adaptive Language Models
DecodeShare: Tracing the Shared Pathways of LLM Decode-Time Decisions
Homophily-Heterogeneity Gradient Surgery for Federated Graph Learning
RSA-CP: Efficient Conformal Prediction in Small-Sample Regimes via Random Score Alignment
Deep Coupling Learning for Solving PDEs
Flat Minima and Generalization: Insights from Stochastic Convex Optimization
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
RiskZero: Plan More to Risk Less with a Learned Model
Exploring Relational Reasoning Capabilities in LLMs with REL
Approximation Bounds for Transformer Networks with Application to Regression
Adversarially Robust Approximate Furthest Neighbor
Unsupervised Hierarchical Skill Discovery
Rational Transductors
SpaceVista: All-Scale Visual Spatial Reasoning from mm to km
GEM: Geometric Erasure by Contrastive Velocity Matching in Rectified Flows
A Solver-Free Training Method for Predict-then-Optimize
Learning Biophysical Models of Large-Scale Multineuronal Data To Enable Precise Neurostimulation
HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models
Erased but Not Forgotten: How Backdoors Compromise Concept Erasure
Jailbreaking Vision-Language Models Through the Visual Modality
Dense associative memory for Gaussian distributions
Value Aggregation with Uncertainty in Online Decentralized MARL
Score Based Error Correcting Code Decoder
OmniShow: Orchestrating Multimodal Conditions for Human-Object Interaction Video Generation
Energy-Structured Low-Rank Adaptation for Continual Learning
Constrained Meta Reinforcement Learning with Provable Test-Time Safety
DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter
Even Faster Kernel Matrix Linear Algebra via Density Estimation
Singular Proxies for Adaptive Caching in Diffusion Language Models
PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning
Decoupling Regularization and Privacy in Differentially Private Ridge Regression and ERM
Towards a Science of AI Agent Reliability
CLAM-Bench: Benchmarking LLM Agents for Library-Scale Cross-Architecture Migration
Native Spatio-Temporal 4D Variational Autoencoder
Efficient and Safe Molecular Assembly via Reinforcement Learning and Constraint Solving
SLAE: Strictly Local All-atom Environment for Protein Representation
Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution
ProConMV: Provenance-Enabled Conceptual Framework for Interpretable Multi-View Diabetic Retinopathy Diagnosis
Influence-Guided Symbolic Regression: Scientific Discovery via LLM-Driven Equation Search with Granular Feedback
An Evidential Route to Asymptotic Bayes Optimality under Sparsity
Breaking the Computational Barrier: Provably Efficient Actor–Critic for Low-Rank MDPs
Elastic Diffusion Transformer
RA-VLA: Retrieval-Augmented VLA for Test-Time Adaptation
Interleaved Selective State Space Models for Efficient WiFi-Based 3D Multi-Person Pose Estimation
Computing Provable Bounds for Exact Shapley Values of Neural Networks
Benchmarking and Evolving Reason-Reflect-Rectify for Reflective Visual Generation
Text Generation as Continuous Latent Dynamics via Reinforcement Learning
Regression Language Models for Code
Consistency Deep Equilibrium Models
Just Y-Prediction: Enabling Historical Cumulative Inconsistency in Label Diffusion for Learning with Noisy Label
Conformal Reliability: A New Evaluation Metric for Conditional Generation
EqGINO: Equivariant Geometry-Informed Fourier Neural Operators for 3D Partial Differential Equations
Bridging the Perceptual Gap: Residual-Enhanced Downscaling and Manifold-Aware Perception Alignment Adaptation for NR-IQA
Suppress and Diversify: Refining Robust Pathways for Corruption Robustness
FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching
Rethinking Contrastive Learning for Graph Collaborative Filtering: Limitations and A Simple Remedy
Subgroup Discovery with the Cox Model
Escaping the Mode: Multi-Answer Reinforcement Learning in LMs
MoSSP: A Momentum-Based Single-Loop Stochastic Penalty Method for Nonconvex Constrained DC Optimization
Rethinking Low-Confidence Pseudo Labels: Influence-Aware Semi-Supervised Fine-Tuning for Hyperspectral Change Detection
Spectral Guidance for Flexible and Efficient Control of Diffusion Models
Draft-and-Audit Reinforcement Learning for Optimization Modeling
HodgeFlow Policy Search by Topologically Dissecting Temporal-Difference Signals in Non-Markovian Environments
AlphaRouter: Token-level Routing Between SLM and LLM with Reinforcement Learning and Tree Search
Dual-View Predictive Diffusion: Lightweight Speech Enhancement via Spectrogram-Image Synergy
CauSciBench: Evaluating LLM Causal Inference for Scientific Research
On Effectiveness and Efficiency of Agentic Tool-calling and RL Training
Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design
Spectral Bridge Variational Inference: Dynamic LoRA via Bures-Wasserstein Gradient Flows
Learning Rate Scaling across LoRA Ranks and Transfer to Full Finetuning
Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models
Efficient and Minimax-optimal In-context Nonparametric Regression with Transformers
Effective Reasoning Chains Reduce Intrinsic Dimensionality
MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks
Reasoning Models Struggle to Control their Chains of Thought
GLARE: Scalable Neuro-Symbolic Reward Shaping for LLM Agents via Group-Level Automata
Direct 3D-Aware Object Insertion via Decomposed Visual Proxies
Extracting alignment data in open models
CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation
Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization
Adapting Noise to Data: Generative Flows from Learned 1D Processes
Optimizing KV Cache Eviction from an Output Perturbation Perspective
Induction Meets Biology: Mechanisms of Repeat Detection in Protein Language Models
Mantis: Lightweight Foundation Model for Time Series Classification
SVRG and Beyond via Posterior Correction
Sequential Kernel-based Conditional Independence Testing via Adaptive Betting
Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration
DuetServe: Harmonizing Prefill and Decode for LLM Serving via Adaptive GPU Multiplexing
Zero-Flow Encoders
CalPro: Prior-Aware Evidential Conformal Prediction with Structure-Aware Sensitivity Bounds for Protein Structures
SCalDA: Semantics-Calibrated and Diffusion-Enhanced Data Augmentation
MoLoRA: Composable Specialization via Per-Token Adapter Routing
A Very Big Video Reasoning Suite
Active Learning with Foundation Model Priors: Efficient Learning under Class Imbalance
Vegas: Self-Speculative Decoding with Verification-Guided Sparse Attention
Path-dependent Discrete Amortized Inference
Data Provenance Auditing of Fine-Tuned Large Language Models with a Text-Preserving Technique
Euclean: Automated Geometry Problem Formalization with Unified Verification in Lean
GAAVI: Global Asymptotic Anytime Valid Inference for the Conditional Mean Function
Split Group Knockoffs: Controlling False Discovery Rate in Transformational Group Sparsity
From Diagrams to Code: Multilingual Programming with Visual Design
Power-Calibrated LLM Watermarking: A Statistical Framework
Dyn-VPP: Video Prediction Policy Optimization for Improved Visual Dynamics
PRIM:Cooperative Dynamic Token Compression for Efficient Large Multimodal Models
A Unified Sparse Attention via Multi-Granularity Compression
AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation
MINIF2F-DAFNY: LLM-Guided Mathematical Theorem Proving via Auto-Active Verification
KFStego: Key-Free Secure Image Distribution via Bipartite Structural Invariants
Escaping the Diversity Trap in Robotic Manipulation via Anchor-Centric Adaptation
How Reasoning Evolves from Post-Training Data: An Empirical Study Using Chess
Likelihood over Estimation: Robust Quadratic Discriminant Analysis for Heavy-Tailed Distributions with Theory and Evidence
Diffusion-based learning framework for Constrained Nonconvex Optimization with Weighted Bootstrapped Refinement
HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation
FocalPolicy: Frequency-Optimized Chunking and Locally Anchored Flow Matching for Coherent Visuomotor Policy
Weaving in the Clouds: Achieving Synergistic Collaboration among LLM Agents via Federated Learning
Implicit Preference Alignment for Human Image Animation
BiRQA: Bidirectional Robust Quality Assessment for Images
Geometrically Constrained Outlier Synthesis
DMCO: Budget-Aware Co-Optimization of Data Cleaning and AutoML
AURA: Visually Interpretable Affective Understanding via Robust Archetypes
Removing Noise, not Finding Gold: Quality Filtering for Large-Scale Pretraining
VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization
ScaLoRA: Optimally Scaled Low-Rank Adaptation for Efficient High-Rank Fine-Tuning
Retriever Portfolios: A Principled Approach to Adaptive RAG
Uncovering Hidden Triggers: Backdoor Attribution in Language Models
AnyBand-Diff: A Unified Remote Sensing Image Generation and Band Repair Framework with Spectral Priors
ProbeLLM: Automating Principled Diagnosis of LLM Failures
Stable Deep Reinforcement Learning via Isotropic Gaussian Representations
Multipole Semantic Attention: A Fast Approximation of Softmax Attention for Pretraining
Referring Multiple Regions with Large Multimodal Models via Contextual Latent Steering
GO-PRE:Goal-Oriented Next-Best-View Selection via Predictive Rendering Entropy for Active 3D Reconstruction
FunCQNet: A Functional Censored Quantile Neural Network for Predicting Long-Term Post-Transplant Kidney Survival
Thinking with Geometry: Active Geometry Integration for Spatial Reasoning
Fast-SAM3D: 3Dfy Anything in Images but Faster
Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Edit via In-Context Learning
Biologically plausible heavy-tailed connectivity enhances generalizations on cognitive tasks in recurrent neural networks
AIR: Improving Agent Safety through Incident Response
HDFlow: Hierarchical Diffusion-Flow Planning for Long-horizon Tasks
Expo-GS: Exposure-Aware Signed Distance Function in Gaussian Splatting for High Dynamic Range
Transferable Reinforcement Learning via Probabilistic Latent Embeddings and Dynamic Policy Adaptation for Sim-to-Real Deployment
GraphPFN: A Prior-Data Fitted Network for Graph Node-Level Tasks
Making Learner Weakness Actionable for Learning from Demonstration with Novice Teachers
Rethinking GNNs and Missing Features: Challenges, Evaluation and a Robust Solution
The Cost of Information: Phase Transitions in Contextual Bandits with Paid Observations
Future Dynamic 3D Reconstruction: A 3D World Model with Disentangled Ego-Motion
EpiTwin: Spatiotemporal Graph Transformers for Epileptic sEEG Signal Reconstruction
DSGym: A Standardized and Holistic Framework for Advancing Data Science Agents
Second-Order Bilevel Optimization with Accelerated Convergence Rates
MolAlign3D: Enhancing Fixed-Dimensional E(3)-Equivariant Latent Space for High-Fidelity 3D Molecular Reconstruction and Editing
Risk Awareness Injection: Calibrating Vision-Language Models for Safety without Compromising Utility
The Perception–Physics Paradox: Probing Scientific Alignment with TC-Atlas
RACER: Risk-Aware Calibrated Efficient Routing for Large Language Models
Learn to Think: Improving Multimodal Reasoning through Vision-Aware Self-Improvement Training
Hybrid-Gym: Training Coding Agents to Generalize Across Tasks
Refining Context-Entangled Content Segmentation via Curriculum Selection and Anti-Curriculum Promotion
EAKV: An Entropy-Driven Adaptive KV Compression Framework for Long Video Understanding
BRIDGE: Predicting Human Task Completion Time From Model Performance
Real-Time Aligned Reward Model beyond Semantics
Proximal Splitting Methods for Hybrid Differentiable Models
StructMamPose: From Sequential Perception to Structural Reasoning for 3D Human Pose Estimation
CGSVD: Cascaded Granular Singular Value Decomposition for Large Language Model Compression
Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL
End-to-End Compression for Tabular Foundation Models
Ripple Perturbations Through Structure: Likelihood-Constrained Adversarial Attacks on Heterogeneous Tabular Data
Towards Whole-corpus Reconstruction of Heterogeneous RAG Knowledge Bases
Two-Parameter Flows for Learning Population Dynamics of Physical Systems
Doubly Outlier-Robust Online Infinite Hidden Markov Model
LAVA: A Unified Framework for Finetuning Language and Vision Models
Trust3R: Unifying Feed-Forward Pointmap Prediction and Evidential Learning for Trust-Aware 3D Reconstruction
Little By Little: Continual Learning via Incremental Mixture of Rank-1 Associative Memory Experts
Step-resolved data attribution for looped transformers
Consistency Training Is Not Neutral to Alignment
Fine-Tuning Masked Diffusion for Provable Self-Correction
Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
USE : A Unified Self-Ensembling Framework for Test-Time Prompt Tuning
Structurally Aligned Subtask-Level Memory for Software Engineering Agents
Geometric Control of Out-of-Distribution Shift in Safe Offline RL
Maximum-Likelihood Learning of Latent Dynamics Without Reconstruction
TimeGuard: Channel-wise Pool Training for Backdoor Defense in Time Series Forecasting
Adaptively Grouped Contextual Bandits for Heterogeneous Human-AI Decision Making with Conformal Prediction Sets
A theory of learning data statistics in diffusion models, from easy to hard
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents
Logit-Attention Divergence: Mitigating Position Bias in Multi-Image Retrieval via Attention-Guided Calibration
Anomaly-Preference Image Generation
Learning Useful Supervision for Reinforcement Learning in Reasoning Models
Stronger Semantic Encoders Can Harm Relighting Performance: A Probe of Visual Priors via Augmented Latent Intrinsics
DIYHealth Suite: Dataset, Model, and Benchmark for Health Management at Home
Order Matters in Retrosynthesis: Structure-aware Generation via Reaction-Center-Guided Discrete Flow Matching
Spherical SO(3) Equivariant Local Attention
Physics-Informed Residual Flows
The Convergent Representation of Vision-Language Contrastive Learning: Geometry, Modality Gap and Shared Space Alignment
From Outcomes to Actions: Leveraging Hindsight for Long-Horizon Language Agent Training
Toward Subspace-Perturbed Trajectory-Aware Backdoor Attacks in Deep Reinforcement Learning
RQ-MoE: Residual Quantization via Mixture of Experts for Efficient Input-Dependent Vector Compression
Credit Assignment via Neural Manifold Noise Correlation
MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers
SparseInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation
Kuramoto Oscillatory Phase Encoding: Neuro-inspired Synchronization for Improved Learning Efficiency
Optimal Top-$k$ Identification from Pairwise Comparisons
Alignment between Brains and AI: Evidence for Convergent Evolution across Modalities, Scales and Training Trajectories
Tail Annealing for Heavy-Tailed Flow Matching
Efficient Neural Controlled Differential Equations via Attentive Kernel Smoothing
Disentangling a Large Language Model’s Computation from its Chain-of-Thought
New Algorithms for Fully-Dynamic k-center with Outliers
Reflector: Internalizing Step-wise Reflection against Indirect Jailbreaks
An Interactive Paradigm for Deep Research
Foreground-Aware Token Routing Vision Transformer for Real-Time Satellite Video Tracking
De-attribute to Forget for LLM Unlearning
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking
PGC: Peak-Guided Calibration for Generalizable AI-Generated Image Detection
DataGuard: A Non-intrusive Dataset Auditing Framework via Differential Information Forensics
NEMO: Execution-Aware Optimization Modeling via Autonomous Coding Agents
Criterion-Conditional In-Context Learning: Evaluating Criterion-Shift Adaptation in Vision-Language Models
Scaling Laws in Model Fine-tuning for Audio DeepFake Detection
DeepSight: Long-Horizon World Modeling via Latent States Prediction for End-to-End Autonomous Driving
Mitigating Error Propagation in Low-Rank Approximation of Large Models via Distribution-Aware Whitening
Mind the Gap: Mixtures of Gaussians in Approximate Differential Privacy
Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation
D-ARL: A Distribution-Matched Asynchronous Reinforcement Learning Framework for Language Reasoning
Neural-HSS: Hierarchical Semi-Separable Neural PDE Solver
Don't Walk the Line: Boundary Guidance for Filtered Generation
Improving Sampling for Masked Diffusion Models via Information Gain
Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension
Self-Supervised Learning as Discrete Communication
Randomized Feasibility Methods for Constrained Optimization with Adaptive Step Sizes
FLAG: Foundation model representation with Latent diffusion Alignment via Graph for spatial gene expression prediction
A Linearly Convergent Proximal Subgradient Algorithm for Sparse Portfolio Optimization with Transaction Cost
Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization
Multivariate distributional reinforcement learning using sliced divergences
Step-Size Stability in Stochastic Optimization: A Theoretical Perspective
LipoPU: Pocket-level Prediction of Lipid-Protein Interactions via Positive-Unlabeled Learning
LFQ: Logit-aware Final-block Quantization for Boosting the Generation Quality of Low-Bit Quantized LLMs
LLM-Guided Loop Bound Generation for Program Termination Verification
PortraitRL: Reinforcement Learning for Personalized Portrait Pose Transfer with Multi-Objective Reward Modeling
SWING: Unlocking Implicit Graph Representations for Graph Random Features
More Capable, Less Cooperative? When LLMs Fail at Zero-Cost Collaboration
Orthogonal Model Merging
SpecPL: Disentangling Spectral Granularity for Prompt Learning
Beyond Pixel Context Windows: Neural World Simulators with Persistent 3D State
Markovian Projection of Star-Shaped Diffusion for Exponential Family Distributions
Unsupervised Partner Design Enables Robust Ad-hoc Teamwork
Logical Guidance for the Exact Composition of Diffusion Models
Focusing Where Vision Matters: Selective Training for Large Vision Language Models via Visual Information Gain
Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo
VBA: Vector Bundle Attention for Intrinsically Geometric Representation Learning
Reinforcement Learning with Verifiable Rewards: GRPO's Loss, Dynamics, and Success Amplification
The Ideal Expression Is Not a Local Optimum: A Revisit of EQL with Zero-Point Constraints
Pull Requests as a Training Signal for Repo-Level Code Editing
Learning to Refine: Spectral-Decoupled Iterative Refinement Framework for Precipitation Nowcasting
Deep networks learn to parse uniform-depth context-free languages from local statistics
Cluster-Aware Causal Mixer for Online Anomaly Detection in Multivariate Time Series
TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches
Dynamic Fractal Mamba: A Neural Renormalization Group Flow for Scale-Invariant Sequence Modeling
When Sample Selection Bias Precipitates Model Collapse
SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads?
Persistent Semantic Entities in Tool-Augmented LLM Systems
Exposing Hidden Biases in Text-to-Image Models via Automated Prompt Search
Optimal Statistical Guarantees for Diffusion Models on Low-Dimensional, Multi-Modal Data
Stop When Further Reasoning Won’t Help: Attention-State Adaptive Generation in Reasoning Models
Memora: A Harmonic Memory Representation Balancing Abstraction and Specificity
Old Habits Die Hard: How Conversational History Geometrically Traps LLMs
Textual Stochastic Gradient Descent: Discrete Optimization of External Memory for Reasoning Language Agents
Token-Level LLM Collaboration via FusionRoute
ECHO: Entropy-Confidence Hybrid Optimization for Test-Time Reinforcement Learning
InfraRL: A Benchmark for Constrained Resource Allocation in Large-Scale Infrastructure Asset Management
Spectral-Progressive Thought Flow for Lightweight Multimodal Reasoning
Solving Positive Linear Programs with Differential Privacy
DyLLM: Efficient Diffusion LLM inference via saliency-based token selection and partial attention
ASTRA: Communication-Efficient Acceleration for Multi-Device Transformer Inference
Relevance-Based Embeddings: Lightweight Candidate Selection via Heavy Ranker Calls
From Extrinsic to Intrinsic: Geodesic-Guided Representation Learning for 3D Geometric Data
OptMaster: A DAG-Based Framework for Formulation and Heuristic Discovery in Optimization
AVI-Bench: Toward Human-like Audio-Visual Intelligence of Omni-MLLMs
Can local learning match self-supervised backpropagation?
Learning to Evict from Key-Value Cache
TextMesh4D: Zero-shot Text-to-4D Mesh Generation
ConServe: Fine-Grained GPU Harvesting for LLM Online and Offline Co-Serving
General and Efficient Steering of Unconditional Diffusion Models
AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search
Adversarial Attacks and Robust Training for Hypergraph Neural Networks
Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data
FreeRet: MLLMs as Training-Free Retrievers
TIMI: Training-Free Image-to-3D Multi-Instance Generation with Spatial Fidelity
Formalizing Learning from Language Feedback with Provable Guarantees
Memory Savings at What Cost? A Study of Alternatives to Backpropagation
POLCA: Stochastic Generative Optimization with LLM
SEM-CTRL: Semantically Controlled Decoding
Coupled Variational Reinforcement Learning for Language Model General Reasoning
Towards Multimodal Large Language Models with Both Training and Inference Efficiency
Catch-22: On the Fundamental Tradeoff Between Detectability and Robustness in LLM Watermarking
Uncovering Bias Mechanisms in Observational Studies
Seeing the Unseen: Physics-as-Representation for Generalizable Gaze Perception
Understanding Reasoning Collapse in LLM Agent Reinforcement Learning
Forward-KL Convergence of Time-Inhomogeneous Langevin Diffusions
Connecting Independently Trained Modes via Layer-Wise Connectivity
Position: Evidence and Implications of Texture Bias in Deep Neural Networks
Rethinking Thinking Tokens: LLMs as Improvement Operators
Position: Code Benchmarks Should Prioritize Rigor, Reliability, and Reproducibility
Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Vision-Language Models
LLM-MatLogic: Executable Exchange Contracts for Knowledge-Graph Query Answering with Scoped Negation
Position: Federated Learning is a Lens towards a Democratized Future for the Scaling Law Era
Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering
Position: Agentic AI Is a Foreseeable Pathway to AGI
Position: AI/ML Deepfake Research is Misaligned with AI Generated Non-Consensual Intimate Imagery (AIG-NCII)
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
Position: Accountable Deployment of Agentic AI Demands Layered, System-Level Interpretability
Position: Temporal Measurement Interval Determines Computational and Model Complexity in Single-Cell Perturbation Analysis
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
Position: Stop Chasing the C-index when Evaluating Survival Analysis Models
Position: Bridge Human Interpretation and Machine Representation With Explicit Specification For Qualitative Data Analysis In LLM Era
Distortion of AI Alignment Revisited: RLHF is a Decent Utilitarian Aligner
Position: We Need Practical AI Alignment Methods that Mirror Human Reasoning
Position: Video LLMs Must Not Ignore the Pixel Dynamics in Plain Sight
MaMi-HOI: Harmonizing Global Kinematics and Local Geometry for Human-Object Interaction Generation
Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities
Position: Vision encoders should be image size agnostic and task driven
WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference
Position: Quantum Deep Learning Still Needs a Quantum Leap
Position: There are futures that benchmark-driven AI cannot see
Position: Trustworthy AI Suffers from Invariance Conflicts and Causality is The Solution
Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning
Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performance
Genome-Factory: A Library for Tuning, Deploying, and Interpreting Genomic Foundation Models
Position: Embodied AI Requires a Privacy-Utility Tradeoff
Segmentation From Attention: Training-Free Layer Selection and One-Shot Tuning for Segmentation in VLMs
Position: VLM Causal Reasoning Benchmarks Should Probe Temporal Understanding, Not Presume It
Position: From Crowdsourcing to Crowd-LLM-Sourcing and LLM-Sourcing
Position: Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services
Position: Sustainable Open-Source AI Requires Tracking the Cumulative Footprint of Derivatives
Position: The Inevitable Transition to Machine Learning in Quantum Chemistry
Position: AI Leaderboards Are Underserving the Global South: A Case Study from India
Position: Retire the "Positive Backdoor" Label—Secret Alignment Requires Strict and Systematic Evaluation
Position: `AI Alignment' Encompasses Competing Technical Priorities
Position: Reasoning After Perception Means Reasoning Without Vision
Position: Agent Security Needs Redefinition through a Holistic Framework
Thinking in Latent Space: Progressive Multimodal Simplification for Visual Reasoning
Position: The Term “Machine Unlearning” Is Overused in LLMs
Position: Safety Must Precede the Deployment of Open-Ended AI Agents
Position: Graph Condensation Needs a Reset—Move Beyond Full-dataset Training and Model-Dependence
On the Effect of Misspecifying the Embedding Dimension in Low-rank Network Models
Position: Quantum Kernel Machines Should Move Beyond Scalar-Valued Kernels to Realize Their Potential
Position: Collaborative Agentic AI Needs Interoperability Across Ecosystems
Position: Early-Stage Quality Assurance in Annotation Pipelines Is More Cost-Effective Than Late-Stage Validation
ARLArena: Demystifying Policy Gradient Stability in Agentic Reinforcement Learning
Position: Agentic Safety is an Epistemic Property, Not a Behavioral One
Position: Human-Centric Vision Requires Topological Generalization Beyond Fixed Skeletal Topologies
Position: Improved Documentation is Necessary for Benchmarking AI Systems in Geometry
Position: State-of-the-Art Claims Require State-of-the-Art Evidence
FedPissa: Towards Federated Personalized Adaptation of Foundation Models via LoRA Subspace Mapping
Rethinking Federated Prompt Learning for Medical Images: From Textual Tuning to Visual Manifold Anchoring
API: Adaptive Prototype Imputation for Incomplete Multimodal Sentiment Analysis
The Geometry of Reasoning: Self-Evaluation via Layerwise Trajectory Evolution
EchoRL: Reinforcement Learning via Rollout Echoing
SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems
Beyond Detection: A Structure-Aware Framework for Scene Text Tracking
R$^3$L: Reasoning 3D Layouts from Relative Spatial Relations
FairJudge : An Adaptive, Debiased, and Consistent LLM-as-a-Judge
Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment
Autoregressive, Yet Revisable: In Decoding Revision for Secure Code Generation
Convergence Analysis of Decentralized Hessian-/Jacobian-Free Algorithm for Nonconvex Stochastic Bilevel Optimization
On the Fragility of Data Attribution When Learning Is Distributed
Online Learning and Inference for Cox Proportional Hazards Model Using Renewable Sieve Estimation
Learning Permutation-invariant Macroscopic Dynamics
PLATE: Plasticity-Tunable Efficient Adapters for Geometry-Aware Continual Learning
Non-Parametric Optimization for Scalable Learning in Stochastic Decision Problems
Logarithmic Switching Regret for Online Convex Optimization
Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers
From Player to Master: Enhancing Test-Time Learning of LLM Agents via Reinforcement Learning over Memory
What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
Frequency-Aware Perceptual Optimization for Low-Complexity Implicit Image Compression
PINNfluence: Interpreting PINNs through Influence Functions
Lottery Prior: Randomized Neural Compression for Zero-Shot Inverse Problems
Transformer Circuits Can Realize Clustering Algorithms
Low-Rank and Sparsity Are All You Need: Exploring Robust Hierarchical Latent Subspaces for Transferable Adversarial Attack
Not All Answers Are Contextually Persuadable: Inference Dynamics in Large Language Models under Contextual Influence
Uncovering Grounding IDs: How External Cues Shape Multi-Modal Binding
JADE: Expert-Grounded Dynamic Evaluation for Open-Ended Professional Tasks
VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
Sentinel-VLA: A Metacognitive VLA Model with Active Status Monitoring for Dynamic Reasoning and Error Recovery
DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction
Bridging Functional and Representational Similarity via Usable Information
Representation Unlearning: Forgetting through Information Compression
StretchTime: Adaptive Time Series Forecasting via Symplectic Attention
CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill
ForesightKV: Optimizing KV Cache Eviction for Reasoning Models by Learning Long-Term Contribution
Formalizing the Binding Problem
GenCircuit-RL: Reinforcement Learning from Hierarchical Verification for Genetic Circuit Design
Contrastive Spectral Rectification: Test-Time Defense towards Zero-shot Adversarial Robustness of CLIP
MIST: Moment-Aligned Invariant Stability Transform for Robust Flow Matching
A Regime-Aware Trajectory Prediction Framework for 1000+ Systems Biology Models
Mean Flow Distillation: Robust and Stable Distillation for Flow Matching Models
Bias in Zeroth-Order Normal Estimation for Decision-Based Attacks
Intrinsic Gradient Suppression for Label-Noise Prompt Tuning in Vision–Language Models
PCRNet: Phase-aware Complex Refinement Network for EEG-based Auditory Attention Decoding
Temporal Straightening for Latent Planning
AgentWebBench: Benchmarking Multi-Agent Coordination in Agentic Web
Position: Quantum Program Generation Must Prioritize Validity Over Probabilistic Scaling
Q-Tab: Quantized Tabular Data Generator
Evaluating and Steering Modality Preferences in Multi-modal LLMs
Position: AI Researchers Must Lead Arms Control to Mitigate Military AI Risks
Stable Localized Conformal Prediction via Transduction
WBMM: Windowed Batch Matrix Multiplication for Efficient Large Receptive Field Convolution
Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion
Auto-regressive In-context Demonstration Selection
XRPO: Pushing the Limits of GRPO with Targeted Exploration and Exploitation
TSMGen: Target-Specific Molecule Generation via Higher-Order Structural Dependencies and Context-Aware Bidirectional Fusion
Geometric Pocket-Centric Protein Encoding for Polypharmacology-Guided Multi-Target Drug Design
Exact Unlearning in Reinforcement Learning
SciAgentGym: Benchmarking Multi-Step Scientific Tool-Use in LLM Agents
PlugGuard: A Streaming Safeguard for Large Models via Latent Dynamics-Guided Risk Detection
Patterning: The Dual of Interpretability
Scalable Medical Multimodal Fusion via Symmetric Consistency Modeling
Same Graph Cross-Task Transfer in GNNs: Protocols and Predictors
Continuous Viewpoint Adaptation for Single View 3D Object Reconstruction
Towards Hierarchy–Uniformity Equilibrium: Recovering Semantic Depth in Hypergraph Contrastive Learning
Clipped Q-Learning: Your Value Clipping Is Secretly A Robust Operator
Asymmetric Multi-View Clustering with Hyperbolic Uncertainty Modeling
LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning
ActiveScope: Actively Seeking and Correcting Perception for MLLMs
On the Collapse of Generative Paths: A Criterion and Correction for Diffusion Steering
Dynamic Symmetric Point Tracking: Tackling Non-ideal Reference in Analog In-memory Training
Lookahead Sample Reward Guidance for Test-Time Scaling of Diffusion Models
Recovering Hidden Reward in Diffusion-Based Policies
LiftQuant: Continuous Bit-Width Control for Pareto-Optimal LLM Deployment
RADIO1D: Elastic Representations for Condensed Vision Modeling
Learning Protein Structure-Function Relationships through Knowledge-guided Representation Decomposition
Autoregressive Boltzmann Generators
Robust Stochastic Gradient Posterior Sampling with Lattice Based Discretisation
SafeLab: An Interactive High-Fidelity Benchmark for Embodied Safety in Scientific Robotics
Why DDIM Hallucinates More than DDPM: A Theoretical Analysis of Reverse Dynamics
Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density
Steady-State Behavior of Constant-Stepsize Stochastic Approximation: Gaussian Approximation and Tail Bounds
Moving Out: Physically-grounded Human-AI Collaboration
Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge
Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning
Demystifying Action Space Design for Robotic Manipulation Policies
ScoreMix: Synthetic Data Generation by Score Composition in Diffusion Models Improves Face Recognition
Recurrent Equivariant Constraint Modulation: Learning Per-Layer Symmetry Relaxation from Data
HiDe: Rethinking The Zoom-IN method in High Resolution MLLMs via Hierarchical Decoupling
Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning
Block Rotation is All You Need for MXFP4 Quantization
ViTok-v2: Scaling Native-Resolution Autoencoders to 5B
CRPO: Character-centric Group Relative Policy Optimization for Role-aware Reasoning in Role-playing Agents
Partitioning for Intrinsic Model Inversion Resistance in Collaborative Inference
VEQ: Modality-Adaptive Quantization for MoE Vision-Language Models
Training–Inference Consistent Segmented Execution for Long-Context LLMs
Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE
Is Data Shapley Not Better than Random in Data Selection? Ask NASH
EMFormer: Efficient Multi-Scale Transformer for Accumulative Context Weather Forecasting
EVOLVING ROLLOUTS: Harnessing Historical Experience for Web Agent Evolution in Reinforcement Learning
NeuronCtrl: Geometry-Aware Safe Closed-Loop Generative Control for Neuronal Microenvironment Dynamics
Learning Task-Sufficient World Models by Synergizing Agentic Exploration and Structured Modeling
Q-Flow: Stable and Expressive Reinforcement Learning with Flow-based Policy
How can embedding models bind concepts?
Necessary Conditions for Compositional Generalization of Embedding Models
Knowledge Diversion for Efficient Morphology Control and Policy Transfer
Breaking the Scale Barrier: One-Shot Knowledge Transfer via Frequency Transform
Self-Supervised Weight Templates for Scalable Vision Model Initialization
When Do Diffusion Models learn to Generate Multiple Objects?
Bridging Scaling Laws to On-Policy Reinforcement Learning via Adaptive Batch Scaling
U$^3$CF: Unbiased, Unconfounding, and Unified Causal Framework for Multi-Target Domain Adaptation
AlignVid: Taming Visual Dominance via Training-Free Attention Modulation in Text-guided Image-to-Video Generation
Incentivizing Truthfulness and Collaborative Fairness in Bayesian Learning
Towards Optimal Robustness in Learning-Augmented Paging
Gradient Flow Dynamics and Implicit Bias of Diagonal Linear Networks under Infinitesimal Initialization
Towards Understanding Adam Convergence on Highly Degenerate Polynomials
Position: Scale is a False Promise for Endangered Languages
Agentic Framework for Epidemiological Modeling
Less Is More in Federated Continual Learning: RieSelect for Conflict-Aware Layer Selection in LLMs
Expand Neurons, Not Parameters
Variational Flow Maps: Make Some Noise for One-Step Conditional Generation
BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning
Lower Complexity Bounds for Nonconvex-Strongly-Convex Bilevel Optimization with First-Order Oracles
Securing Multimodal AI through Internal Information Decomposition
PolyFlow: Safe and Efficient Polytope-Constrained Flow Matching with Constraint Embedding and Projection-free Update
LILO: Bayesian Optimization with Natural Language Feedback
Flow Inverse Reinforcement Learning
Opt-Verifier: Unleashing the Power of LLMs for Optimization Modeling via Dual-Side Verification
CLARITree: Cholesky and Lookahead Accelerations for Regression with Interpretable Piecewise Linear Trees
SafeDec: Constrained Decoding for Safe Autoregressive Generalist Robot Navigation Policies
3D-DLP: Self-supervised 3D Object-centric Scene Representation Learning
LaRA-Fusion: Latent-Robust Adaptation via Dual-Loop Constraints for Infrared and Visible Image Fusion
EnsembleVLA: Ensemble Learning for Vision-Language Action Models
On Group Relative Policy Optimization Collapse in Agent Search: The Lazy Likelihood-Displacement
Arboreal Neural Network
DPsurv: Dual-Prototype Evidential Fusion for Uncertainty-Aware and Interpretable Whole Slide Image Survival Prediction
When RAG Hurts: Diagnosing and Mitigating Attention Distraction in Retrieval-Augmented LVLMs
Deep Trajectory Supervision: Deep Supervision Strikes Back
Holonomy Grid Codes for Generalisation Under Directed Actions
NAACA: Training-Free NeuroAuditory Attentive Cognitive Architecture with Oscillatory Working Memory for Salience-Driven Attention Gating
R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model?
The Differences Between Direct Alignment Algorithms are a Blur
Beyond Static Allocation: Dynamic Sensitivity-Aware Fine-Tuning for Vision Transformers
Modeling Spectral Energy Shifts in Spatio-Temporal Graph Anomaly Detection
SpatioLM: Towards General Physical Spatial Intelligence in Vision-Language Models
MobileFusion: Mobile-Friendly Infrared and Visible Image Fusion via Structural Re-parameterization
MatchFixAgent: Language-Agnostic Autonomous Repository-Level Code Translation Validation and Repair
MIMOMamba: From Scalar Duality to Matrix-Valued Attention
MacroGuide: Topological Guidance for Macrocycle Generation
Conformal Prediction for Early Stopping in Mixed Integer Optimization
DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents
Threat2Traffic: Multi-Agent Environment Synthesis for Malware Traffic Generation from Threat Intelligence
Geometry-Misalignment in Distributional Learning
PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception
LIVE: Long-horizon Interactive Video World Modeling
FedHera: Towards Drift-Resilient Federated Fine-tuning with Heterogeneous Resources
TIC-VLA: A Think-in-Control Vision-Language-Action Model for Robot Navigation in Dynamic Environments
Towards Execution-Grounded Automated AI Research
Bio-Vision-Inspired Spiking Neural Networks for Object Detection with Event Cameras
ORBIT: A Prognostic World Model for Ocular Reasoning Based on Imagined Trajectories
Don't Ignore the Tail: Decoupling top-K Probabilities for Efficient Language Model Distillation
MedREK: Retrieval-Based Editing for Medical LLMs with Key-Aware Prompts
AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks
Can vision language models learn intuitive physics from interaction?
Routing and Reasoned Evaluation with Large Language Models
Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models
Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models
History-Bootstrapped Flow Matching for Inverse Boiling Reconstruction
Copula-SVI: Vine-Copula Variational Inference for Instance-Level Correlation Capturing
Walrus: A Cross-domain Foundation Model for Continuum Dynamics
PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards
FedARC: Anchor-Guided Residual Compensation for Data and Model Heterogeneous Federated Learning
Cross-View Lewis Weight Fusion Empowering Exemplar Replay for Federated Class-Incremental Learning
PGS: Effective LLM Code Refinement via Property-Oriented and Structurally Minimal Feedback
Rethinking Serialization in Linear 3D Vision: Decoupling Anisotropic Geometry from Isotropic Semantics
Derivative Informed Learning of Exchange-Correlation Functionals
QHyer: Q-conditioned Hybrid Attention-mamba Transformer for Offline Goal-conditioned RL
Game-Theoretic Co-Evolution for LLM-Based Heuristic Discovery
Profiling the Irrational Agent: Cognitive Modeling of LLM Behaviors in Sequential Jailbreaks
Multicalibration Yields Better Matchings
Semantic Tube Prediction: Beating LLM Data Efficiency with JEPA
Reward Shaping Control Variates for Off-Policy Evaluation Under Sparse Rewards
EmBrace: A Collective Knowledge Fusion Framework Toward Unified EEG Foundation Models
Rate or Fate? RLV$^{\varepsilon}$R: Reinforcement Learning with Verifiable Noisy Rewards
Seeing Realism from Simulation: Efficient Video Transfer for Vision-Language-Action Data Augmentation
HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning
Steal the Patch Size: Adversarially Manipulate Vision Language Models
Training-Free Bayesian Filtering with Generative Emulators
Self-Evolving LLM Agents under Offline Data Support
When Data Is Scarce: Scaling Sparse Language Models with Repeated Training
MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning
Discovering Scaling Exponents with Physics-Informed Müntz-Szász Networks
Proteus: Lookup-Free Trellis-Coded Quantization by Lattice-Breaking Compute Codes for 2-Bit LLMs
DeCoDe: Decoupling Binding Position and Molecular Conformation in 3D Ligand Diffusion for Structure-Based Drug Design
Compile to Compress: Boosting Formal Theorem Provers by Compiler Outputs
Improved Bounds for Private and Robust Alignment
A Noise Sensitivity Exponent Controls Large Statistical-to-Computational Gaps in Single- and Multi-Index Models
Direct Flow Q-Learning
Taming the Loss Landscape of PINNs with Noisy Feynman–Kac Supervision: Operator Preconditioning and Non-Asymptotic Error Bounds
Learning Reward–Cost Balance in Safe RL via Score-Based World Models
Set-Preserving Calibration from Conformal P-Values to E-Values
Inducing LLM Workflows with Bilevel Optimization and Textual Gradients
Error Propagation and Model Collapse in Diffusion Models: A Theoretical Study
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
Reasoning about Reasoning: BAPO Bounds on Chain-of-Thought Token Complexity in LLMs
Mixture of Distributions Matters: Dynamic Sparse Attention for Efficient Video Diffusion Transformers
Scaling Long-Horizon Agent via Context Folding
SE3Set: Harnessing Equivariant Hypergraph Neural Networks for Molecular Representation Learning
DSENet: A Novel Dual-Stream Enhancement Network for Multi-Scale Non-Stationary Time Series Forecasting
On the existence of consistent adversarial attacks in high-dimensional linear classification
Polyphonia: Training-Free Context-Aware Music Editing with Acoustic-Informed Attention Calibration
MUSE: Resolving Manifold Misalignment in Visual Tokenization via Topological Orthogonality
REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge
Low Kruskal-Rank Adaptation
WS-GRPO: Weakly-Supervised Group-Relative Policy Optimization for Rollout-Efficient Reasoning
Trifuse: Enhancing Attention-Based GUI Grounding via Multimodal Fusion
An Embarrasingly Simple Way to Optimize Orthogonal Matrices at Scale
Position: Peer Review in ML/AI Conferences Should Separate Publication from Presentation and Offer Non-Anonymous Review Tracks
Expected Returns and Policy Inconsistency-Aware Offline Federated Deep Reinforcement Learning
SFedPO: Streaming Federated Learning with a Prediction Oracle under Temporal Shifts
Statistical Consistency and Generalization of Contrastive Representation Learning
Student-Centered Distillation Narrows the Agentic Gap Between Small and Large LLMs
Sufficiency is Relative: Evaluating LLM Explanations under Model-Induced Input Distributions
SMD: Multi-view Safety-Critical Driving Video Generation in the Real-world Domain
Towards Disentangled Preference Optimization Dynamics
FedLog: Personalized Federated Classification with Less Communication and More Flexibility
Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models
Edge-colored Clustering in Hypergraphs: A MaxECC Approximation
GHOST: Geometry-Guided Hallucination of Opaque Surface Textures
Anytime-Valid Inference Under Outcome Delay: A Design-Based Approach
X-MoGe: A Cross-Modal Adaptation Framework with Mixture-of-Experts and Geometry Guidance for Heterogeneous Collaborative Perception
Fast and Scalable Analytical Diffusion
When More Data Doesn't Help: Limits of Adaptation in Multitask Learning
Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory
Symbiosis-Inspired Knowledge Distillation for Incremental Object Detection
Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following
Exposing Vulnerabilities in Explanation for Time Series Classifiers via Dual-Target Attacks
PRISM: Sequence Modeling as Parallel Residual Iteration
DomED: Redesigning Ensemble Distillation for Domain Generalization
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
HyMTRL: A Hybrid Multi-Task Reinforcement Learning Framework via Phased Policy Evolution
Geometry-Correct Diffusion Posterior Sampling with Denoiser-Pullback Curvature Guidance and Manifold-Aligned Damping
Generative Augmented Inference
Sparse Tokens Suffice: Jailbreaking Audio Language Models via Token-Aware Gradient Optimization
How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data
Revisiting Pre-Propagation GNNs: Robust Diffusion Operators and Hidden-State Re-Propagation
Olaf-World: Orienting Latent Actions for Video World Modeling
FastSESR: Fast Scene-level Explicit Surface Reconstruction
Eyes-on-Me: Scalable RAG Poisoning through Transferable Attention-Steering Attractors
Maximum Likelihood Reinforcement Learning
From Observations to States: Latent Time Series Forecasting
Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism
Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity
Revisiting Efficiency–Accuracy Scaling in Mixture-of-Experts Architectures
Grounding Functional Similarity by Invariance-Aware Model Stitching
GFFMERGE: Efficient Merging of Graph Neural Force Fields and Beyond
Hierarchical Abstract Tree for Cross-Document Retrieval Augmented Generation
Near-Optimal Private Linear Regression via Iterative Hessian Mixing
Addressing Instrument-Outcome Confounding in Mendelian Randomization through Representation Learning
SSDCN: Spatial-Spectral Dual-Clustering-based Network for Hyperspectral Image Super-resolution
Probably Approximately Correct Labels
SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion
Towards Scalable and Consistent 3D Editing
In-Training Defenses Against Emergent Misalignment in Language Models
VlogReward: Learning Multi-Dimensional Evaluation for Vlog Editing
Plain Transformers are Surprisingly Powerful Link Predictors
rePIRL: Learn PRM with Inverse RL for LLM Reasoning
AES: Curing Optimizer Blindness in Long-Tailed Recognition via State-Aware Correction
Task-Aware Preference Calibration for Direct Preference Optimization
A Hypertoroidal Covering for Perfect Color Equivariance
SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning
No Need to Train Your RDB Foundation Model
AutoBaxBuilder: Bootstrapping Code Security Benchmarking
Desirable Effort Fairness and Optimality Trade-offs in Strategic Learning
GenAlign: Towards Unified Alignment Framework of MLLMs via Generative Reward Model
Grounding LLMs in Scientific Discovery via Embodied Actions
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers
The Geometric Origin of Grokking: Accelerating Generalization via Active Structural Reorganization
A Theory of Contrastive Learning with Natural Images
Learning-Augmented Online Minimization with Dual Predictions
Transport or Discard: Robust Unbalanced Optimal Transport for Cross-Domain Policy Adaptation
HyperMLP: An Integrated Perspective for Sequence Modeling
Artificial Hippocampus Networks for Efficient Long-Context Modeling
NeuralFLoC: Neural Flow-Based Joint Registration and Clustering of Functional Data
BLIPs: Bayesian Learned Interatomic Potentials
Barriers to Counterfactual Credit Attribution for Autoregressive Models
Bridging Tokens and Geometry: Token-wise 3D Supervision for CAD Generation
Leveraging Gauge Freedom for Learning Non-Gradient Population Dynamics of Stochastic Systems
Efficient, Validation-Free Intrinsic Quality Estimation for Large-Scale Face Recognition Datasets
Bipartite Graph Attention-based Clustering for Large-scale scRNA-seq Data
Singular Vectors of Attention Heads Align with Features
Efficient Distributionally Robust Assortment Optimization in MNL Bandits
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
Reinforcement Learning from Human Feedback with Active Queries
The Fairness Hierarchy: A viewpoint from causal inference
High-accuracy sampling for diffusion models and log-concave distributions
Shape of Thought: Progressive Object Assembly via Visual Chain-of-Thought
Reinforcement Learning via Self-Distillation
What Language is This? Ask Your Tokenizer.
Graph Rewiring based on Flow Alignment for Improving Fluid Simulation
Multi-Scale Wavelet Transformers for Operator Learning of Dynamical Systems
Understanding and Mitigating Token-Pruning-Induced Vulnerabilities in VLMs
Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts
LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization
Efficient Distributed MLLM Training with ModalGlue
FasterVAR: Plug-and-Play Acceleration for Visual Autoregressive Models
Learning $U$-Statistics with Active Inference
Unlocking the Potential of Continual Model Merging: An ODE Perspective
Universal Multiclass Transductive Online Learning
CONGA:Confidence-and-Gradient-Aware Learning Rate Schedule for Test Time Adaptation
SEMIR: Semantic Minor-Induced Representation Learning on Graphs for Visual Segmentation
Synergistic Space-Vision Processing for Predicate Inference
Exploration-free Algorithms for Multi-group Mean Estimation
Certificates for Complex-Compatible Learned Cochain Laplacians
TiME: Test-Time Mixture-of-Experts Routing via Asymmetric CO-Optimal Transport for Continual Test-Time Adaptation
UrbanMLLM: Joint Learning of Cross-view Imagery for Urban Understanding
LLM-Guided Diagnostic Evidence Alignment for Medical Vision–Language Pretraining under Limited Pairing
Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining
PESD-TSF: A Period-Aware and Explicit Structured Decomposition Framework for Long-Term Time Series Forecasting
VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding
SOLAR for Offline MARL: Plateau-Triggered Potential Shaping under World-Model Uncertainty
Calibrated Knowledge Aggregation in Bayesian Mixture-of-Experts for Continual VQA
EPSVec: Efficient and Private Synthetic Text Generation via Dataset Vectors
HInT: Hypergraph Infusion at the Structural Layers Improves Table Understanding
Batched First-Order Methods for Parallel LP Solving in MIP
BIOARC: Discovering Optimal Neural Architectures for Biological Foundation Models
Position: AI Evaluation Should Work With Humans
Escaping the Verifier: Learning to Reason via Demonstrations
Position: Artificial Intelligence Needs Meta Intelligence - the Case for Metacognitive AI
Distributed Stochastic $K$-Level Optimization Over Networks
Approximation Preserving Coresets
Provably Learning Attention with Queries
Adaptive Node Feature Selection for Graph Neural Networks
Correcting in Hindsight: Editing Past Key-Value States for Robust LLM Reasoning
Causal discovery for time series with endogenous context variables
When does predictive inverse dynamics outperform behavior cloning?
LoRDO: Distributed Low-Rank Optimization with Infrequent Communication
Unlearning in Diffusion Models: A Unified Framework with KL Divergence and Likelihood Constraints
Adversarial Reinforcement Learning for Robust Diffusion Large Language Model Unlearning
Hierarchical Causal Abduction: A Foundation Framework for Explainable Model Predictive Control
Riemannian Networks over Full-Rank Correlation Matrices
A Bayesian Approach to Quantify the Uncertainty of Human Ratings in a Single-Instance Multimodal Framework
A geometric relation of the error introduced by sampling a language model's output distribution to its internal state
CLoVE: Personalized Federated Learning through Clustering of Loss Vector Embeddings
HSGG: Training-Free Hierarchical Scene Graph Generation with Geometry-Guided Relation Reasoning
Data-driven Mixed Integer Optimization through Probabilistic Multi-variable Branching
The Heterogeneous Safety Impacts of Benign Multilingual Fine-Tuning
Causal-EPIG: Causally Aligned Active CATE Estimation
Continual GUI Agents
A Direct Approach for Handling Contextual Bandits with Latent State Dynamics
Cross-Chirality Generalization by Axial Vectors for Hetero-Chiral Protein-Peptide Interaction Design
Discriminative Visual Process Rewards for Scaling Thinking at Test-Time with Images
A Diagnostic Study of Multi-Agent LLMs for Real-World Debates
From Imagined Futures to Executable Actions: Mixture of Latent Actions for Robot Manipulation
IPMark: A Sentence-Level Watermark for LLMs with Hierarchical Personalization and Efficient Detection
CLINIC : Evaluating Multilingual Trustworthiness in Language Models for Healthcare
Structured Diffusion Bridges: Inductive Bias for Denoising Diffusion Bridges
A Time-Reparameterized Cumulative Intensity Extrapolation Sampler for Discrete Flow Matching
Exploring 3D Dataset Pruning
From Teacher Pathways to Invariant Manifolds: Consensus Subspace Distillation for TSFMs
UNIVERSAL REPRESENTATION OF GENERALIZED CONVEX FUNCTIONS AND THEIR GRADIENTS
BiCrossNet with Decoupled Dual Generators: A Parameter‑Efficient and Generalizable Few‑Shot Custom Gesture Recognition Framework
Architecture Matters for Multi-Agent Security
Asymptotic Theory of Iterated Empirical Risk Minimization, with Applications to Active Learning
Universal Skeleton Understanding via Differentiable Rendering and MLLMs
SAGE: A Dataflow-Native Framework for Modular, Controllable, and Transparent LLM-Augmented Reasoning
Unsupervised Process-Aware Coreset Selection for In-Context Learning
Learning to Share: Selective Memory for Efficient Parallel Agentic Systems
Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation
Conf-Gen: Conformal Uncertainty Quantification for Generative Models
Beyond Explicit Edges: Robust Reasoning over Noisy and Sparse Knowledge Graphs
EEG-Based Multimodal Learning via Hyperbolic Mixture-of-Curvature Experts
Rethinking Time-Series Imputation as Conditional Inference along Temporal Evolution
Wait, Wait, Wait... Why Do Reasoning Models Loop?
Benchmarking Dense and Indiscernible Object Counting with Blueberries
Efficient Equivariant High-Order Crystal Tensor Prediction via Cartesian Local-Environment Many-Body Coupling
Natural Hypergradient Descent: Algorithm Design, Convergence Analysis, and Parallel Implementation
Unifying Adversarial Robustness and Training Across Text Scoring Models
The Catastrophic Failure of *the* k-Means Algorithm in High Dimensions, and How Hartigan's Algorithm Avoids It
Expressive Graph Neural Networks via Equivariant Use of Noise
Meta-Black-Box Optimization Can Do Search Guidance for Expensive Constrained Multi-Objective Optimization
Language Generation in the Limit: Complexity Barriers and Implications for Learning
Is Fixing Schema Graphs Necessary? Full-Resolution Graph Structure Learning for Relational Deep Learning
REG: In-Sample RL via Regularizing the Evaluation Gap
Identifiable Equivariant Networks are Layerwise Equivariant
Gradient-Aware Scheduling: Coupling Curriculum and Staleness for Async Reinforcement Learning
A Two-Tier Perspective on Inference-Time Parallelism in Multi-Agent LLM Systems
LALM-as-a-Judge: Benchmarking Large Audio-Language Models for Safety Evaluation in Multi-Turn Spoken Dialogues
SMART: Scalable Mesh‑free Aerodynamic Simulations from Raw Geometries using a Transformer‑based Surrogate Model
AC-ODM: Actor–Critic Online Data Mixing for Sample-Efficient LLM Pretraining
Awakening Visual Reasoning: Mitigating Post-Training Failure in Vision-Text Compression
Target-Driven Policy Optimization for Sequential Counterfactual Outcome Control
How RL Unlocks the Aha Moment in Geometric Interleaved Reasoning
Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning
BTSP-CAM: A Brain-Inspired Geometric Memory for Class-Incremental Learning
GXPO: Group Cross-Lingual Relative Policy Optimization for Code Generation
TokenRatio: Principled Token-Level Preference Optimization via Ratio Matching
Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering
A Minimax Approach for Optimal Intervention Policy Learning with Two-Stage Outcomes
Clarify Before You Draw: Proactive Agents for Robust Text-to-CAD Generation
VJEPA: Variational Joint Embedding Predictive Architectures as Probabilistic World Models
Object-level Semantic and Spatial Distillation for Open Vocabulary Detection
Learning Taxonomic Trees with Hierarchical Representation Regularization for Large Multimodal Models
COFT: Counterfactual–Conformal Decoding for Fair Chain‑of‑Thought Reasoning in Large Language Models
Fast Estimation for Forest Matrix of Signed Graphs
HECTOR: Hybrid Editable Compositional Object References for Video Generation
SD-MoE: Spectral Decomposition for Effective Expert Specialization
MPFM: Cross Multi-Domain Prototype Flow Matching for Log Anomaly Detection
Generalization Bounds for Out-of-distribution Generalization
Faithful Relational Reasoning with Region-based Embeddings: Expressivity of Convex Coordinate-wise Models
Generative Modeling of Discrete Latent Structures via Dynamic Policy Gradients
Position: Certified Correctness in Neural Constraint Reasoning Requires Symbolic Integration
Position: Breaking the Dual Curse of Multilingual AI Requires Socio-Technical Guardrails, Not Post-Hoc Alignment
Position: Unlabeled ≠ No Human Supervision in Visual Learning
Position: *Beyond Text* The Text-Centric Bias in Foundation Models Must Be Revisited for a Speech-First Future
Position: Assistive AI requires Personalized Specialists, not Generalists
Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
Position: Preregister Experiments with AI Agents
Position: Prioritize Identifying Structure, Not Complex Models, for Scientific Discovery
Synthesizing world models for bilevel planning
Position: Assistive Agents Need Accessibility Alignment
Position: Ideas Should be the Center of Machine Learning Research
Position: Age Estimation Models Do Not Process Biometric Data
Position: The Age of AI Agents Demands A New Scientific Paradigm To Sustain Trustworthy Science
Position: Model identity in machine learning is a convention, not a property
Position: Sycophancy is an Educational Safety Risk: Why LLM Tutors Need Sycophancy Benchmarks
Position: AI Must Become Planet-Centered, Not Human-Centered
(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models
Position: Generative Engine Optimization Creates Underexamined Risks, Governance Must Target Concentration, Disclosure, and Academic Blind Spots
Position: LLM Benchmark Datasets should be Contamination-Resistant
Transforming Weather Data from Pixel to Latent Space
Position: We Need AI Efficiency Incentives for Accessibility and Sustainability
Position: Stop Reactively Patching Your Model Every Time and Start Proactive Test-Driven AI Development
Position: AI Capabilities Are Not Increasing Exponentially
Adaptive Policy Backbone via Shared Network
An Information-Theoretic Criterion for Efficient Data Synthesis
Position: Metaphysical Concepts in AI Should Be Judged by Their Consequences
Position: Good Embodied Reward Models Need Bad Behavior Data
Position: Generative Distributional Integrity against Backdoor Attacks
DITING: A Weak Degradation Listener for Battery Lifetime Early Prediction
Learning 3D-Gaussian Simulators from RGB Videos
Cache Coherent Resampling for Efficient Test Time Scaling in LLM Reasoning via Adaptive Sequential Monte Carlo
GameDevBench: Evaluating Agentic Capabilities Through Game Development
Explaining Concept Shift with Interpretable Feature Attribution
Can Simple Denoising Improve Uniform State Diffusion Models?
On The Variability Of Concept Activation Vectors
Second-Order Smooth Planning with Optimal-Transport Bellman Smoothing
Asymptotic Optimality of the High-Dimensional Gaussian Mechanism and Improved Low-Dimensional Mechanisms for Differential Privacy
INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic
ScoreMatchingRiesz: Score Matching for Debiased Machine Learning and Policy Path Estimation
WaterSIC: information-theoretically (near) optimal linear layer quantization
Alleviating Observation Bias via Causal-Invariant Meta-Learning for Unbalanced Incomplete Multi-view Clustering
Language Generation with Feedback: Queries and Mistakes
Learning-Augmented Online Covering Problems
Accuracy and Normalized Accuracy under Length Bias: Analysis, Guidelines, and a Bayesian Alternative
Synthesizable Molecular Generation via Soft-constrained GFlowNets with Rich Chemical Priors
A Theory of Data Acquisition and Pricing at Scale
Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory
From Generative to Episodic: Sample-Efficient Replicable RL
Human-AI Collaborative Uncertainty Quantification
A Statistical Framework for Analyzing Specification Resistance to Learnware-Inversion Risks
Offline Reinforcement Learning with Universal Horizon Models
Fast and Optimal Algorithms for Private Hypothesis Selection
Opt-Miner: Empowering Information-Seeking Agent with Tree-Guided Data Synthesis for Optimization Modeling
Bullet Trains: Parallelizing Training of Temporally Precise Spiking Neural Networks
Implicit Intelligence - Evaluating Agents on What Users Don’t Say
Syntax vs. Semantics: How Transformers Learn Deep Dependencies
Proximal-IMH: Proximal Posterior Proposals for Independent Metropolis–Hastings with Approximate Operators
Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection
Strategic Candidacy in Generative AI Arenas
Influence-Disentangled Federated Training: Learning Models That Are Easy to Unlearn
Activation-Free Backbones for Image Recognition: Polynomial Alternatives for Spatial and Channel Mixing
Online Compatible Reward Identification from Preference Feedback
Feedback Control for Multi-Objective Graph Self-Supervision
Eigenvectors of Experts are Training-free Non-collapsing Routers
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs
Probabilistic Salient Object Ranking
Discrete Tilt Matching
PromptPilot: Game-Theoretic Multi-Agent Prompt Optimization for Segment Anything
Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents
Efficient privacy loss accounting for subsampling and random allocation
ClimateAR: Multi-Scale Autoregressive Generative Modeling for Seasonal-to-Interannual Climate Forecasting
Accelerating Langevin Monte Carlo via Efficient Stochastic Runge-Kutta Methods beyond Log-Concavity
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
RL4RLA: Teaching ML to Discover Randomized Linear Algebra Algorithms through Curriculum Design and Graph-based Search
MaxSAT-Based Compression for Tsetlin Machines
MFH-NAS:A Hybrid Neural Architecture Search Framework for Multimodal Fusion Object Detection
The surprising strength of weak classifiers for validating neural posterior estimates
All ERMs Can Fail in Stochastic Convex Optimization Lower Bounds in Linear Dimension
The Power of Power Law: Asymmetry Enables Compositional Reasoning
Streaming Sliced Optimal Transport
SafeHarbor: Defining Precise Decision Boundaries via Hierarchical Memory-Augmented Guardrail for LLM Agent Safety
Corrigibility Transformation: Constructing Goals That Accept Updates
A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents
Black-Box Assisted Regression: Phase Transitions and Minimax Optimality
DFlash: Block Diffusion for Flash Speculative Decoding
Evaluating and Rewarding LALMs for Expressive Role-Play TTS via Mean Continuation Log-Probability
TFTF: Training-Free Targeted Flow for Conditional Sampling
Deep Networks Learn Deep Hierarchical Models
KernelCraft: Benchmarking for Agentic Close-to-Metal Kernel Generation on Emerging Hardware
Skipping the Zeros in Diffusion Models for Sparse Data Generation
DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation
Exploration Hacking: LLMs Can Learn to Resist RL Training
Parsimonious Learning-Augmented Online Metric Matching
Rethinking Sparse Mixture of Experts from a Unified Perspective
Score-Repellent Monte Carlo: Toward Efficient Non-Markovian Sampler with Constant Memory in General State Spaces
Bregman meets Lévy: Stochastic Mirror Descent with Heavy-Tailed Noise in Continuous and Discrete Time
PepCompass: Navigating Peptide Embedding Spaces Using Riemannian Geometry
Dimension-free convergence of diffusion models for approximate Gaussian mixtures
Local Mechanisms of Compositional Generalization
BEAT: Tokenizing and Generating Symbolic Music by Uniform Temporal Steps
Beyond First-order Asymptotics in Sequential Mean Testing
Stochastic Gradient Methods under Heavy-Tailed Noises in Weakly Convex Optimization
SC$^{2}$-WM: A Self-Correcting World Model with Closed-Loop Feedback for Vision-and-Language Navigation in Continuous Environments
Consistent Diffusion Language Models
Spatial Conformal Inference through Localized Quantile Regression
Near-Optimal Convergence of Accelerated Gradient Methods under Generalized and $(L_0,L_1)$-Smoothness
Bilevel Optimization over Saddle Points of Zero-Sum Markov Games
PVDepth: Panoramic Video Depth Estimation via Geometry-Aware Spatiotemporal Adaptation
Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime
Reduction of Probabilistic Chemical Reaction Networks
Image Restoration via Diffusion Models with Dynamic Resolution
Exactly Computing do-Shapley Values
Learning Anisotropic Value Geometry with Finsler Reinforcement Learning
Neutral-Reference Prompting for Vision–Language Models
Ubiquity of Homeostatic Hebbian Dynamics in Regularized Learning
Success-Conditioning as Policy Improvement: The Optimization Problem Solved by Imitating Success
Differentially Private Continual Release with Relative Error
Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training
Graph of States: Solving Abductive Tasks with Large Language Models
Beyond Mode Collapse: Distribution Matching for Diverse Reasoning
Learning Transferable Interaction Primitives from Game Videos for Humanoids
Prior Diffusiveness and Regret in the Linear-Gaussian Bandit
How high is ‘high’? Rethinking the roles of dimensionality in topological data analysis and manifold learning
Who can we trust? LLM-as-a-jury for Comparative Assessment
Multimodal Function Vectors for Spatial Relations
Interactive Segmentation with Elaborate Focus Prior
Online Tensor Learning: Computational and Statistical Trade-offs, Adaptivity and Optimal Regret
RouteFinder: Towards Foundation Models for Vehicle Routing Problems
PruneFuse: Efficient Data Selection via Weight Pruning and Network Fusion
Probabilistic Pretraining for Improved Neural Regression
Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations
Generalization Bounds for Discrete Diffusion: Statistical Advantage of Masking
Improved Convergence of Score-Based Diffusion Models via Prediction-Correction
On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization
Expert Routing with Synthetic Data for Domain Incremental Learning
UCB Exploration for Fixed-Budget Bayesian Best Arm Identification
Physics-Aware Spatiotemporal Causal Graph Network for Forecasting with Limited Data
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
Zono-Conformal Prediction: Zonotope-Based Uncertainty Quantification for Regression and Classification Tasks
Understanding Deep Representation Learning via Layerwise Feature Compression and Discrimination
Position: Let’s Build a Trustworthy Model Context Protocol!
Position: Medical AI Neglects Real Treatment Outcomes
Position: Make Planning Research Rigorous Again!
Position: Predictive Uncertainty Is Not Enough -- Joint Distribution for Full Uncertainty Representation
Hyperbolic neural population geometry benefits computation
Position: AI Evaluations Should be Grounded on a Theory of Capability
Position: No Retroactive Cure for Infringement during Training
Position: AGI Requires a Coordination Layer on Top of Pattern Repositories
PolicyGuard: Towards Test-time and Step-level Backdoor Defense for Reinforcement Learning Agent
Position: The Case for Theory-Level Autoformalization
Position: It’s Time to Optimize for Self-Consistency
Position: Significant impact of numerical precision in scientific machine learning
Position: Evaluating LLMs in Finance Requires Explicit Bias Consideration
Position: Regulating Algorithms Is Not Enough. A Study of Content Discovery in Online Platforms
Position: Explainability Research Must Prioritize Foundations over Ad-hoc Methods
Position: Hallucinations Undermine Trust; Metacognition is a Way Forward
Compositional Generative Modeling from Decentralized Data
Position: Privacy Is a Claim, Not a Property of Synthetic Data
Position: We Need Large Language Models Optimized For Our Well-Being
Position: Natural Language Should Not Fully Replace Formal Languages
FUSE: Quantifying Uncertainty in Multimodal LLMs by Bayesian Fusing Epistemic and Aleatoric Uncertainty
Position: Time to Close The Validation Gap in LLM Social Simulations
Position: Knowing Isn’t Understanding: Re-grounding Generative Proactivity with Epistemic and Behavioral Insight
Position: Web Agents Should Use Typed Actions Instead of Click-Based Browsing
Position: We need to re-think the concept of “real” images.
Position: EU AI Act's Research Exemptions Can Break the Publication Norms of Major AI Conferences
Position: Comprehensive AI governance requires addressing non-model capability gains
Position: Collusion Risks Among AI Reasoning Agents Justify Certification Requirements for Making Market Decisions
Position: Reframing Hallucination: Latent Space Geodesics as a Pathway for Generative Discovery
Position: Agentic Systems Should be General
Position: Interpretability in Deep Time Series Models Demands Semantic Alignment
Position: Beyond Sensitive Attributes, ML Fairness Should Quantify Structural Injustice via Social Determinants
Position: It is Time to Virtualize Foundation Models with a Self-evolving Operating System Layer
Position: Beyond Prediction: Toward Verifiable Physiological Waveform Reasoning with Foundation Models and Agentic LLMs
Position: Every Ground Truth is a Human Construction, not an Objective Truth
Position: The Data Provenance–Parametric Divide in Large Language Models
Position: Your VLM May Not Be Thinking with Interleaved Images
Position: ICML Should Treat Hosted LLM APIs as Versioned Dependencies and Require Drift-Audit Artifacts
Position: Adopting AI in Practice Does Not Guarantee the Productivity Boost
Position: Towards Responsible Evaluation for Text-to-Speech
Position: LLM for Physics Research Requires Domain-Specialized Training and Tooling
Position: The Turing-Completeness of Real-World Autoregressive Transformers Relies Heavily on Context Management
Position: LLM Agents Are the Antidote to Walled Gardens
Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!
Position: LLM-Safety Evaluations Lack Robustness
Position: Interestingness is an Inductive Heuristic for Future Compression Progress
Position: LLMs Should Incorporate Explicit Mechanisms for Human Empathy
Position: Reliable AI Needs to Externalize Implicit Knowledge: A Human–AI Collaboration Perspective
Position: Responsible AI for AI companions must actively combat violence toward intimate partners
Position: Weight Space Should Be a First-Class Generative AI Modality
Position: Token Taxes Can Mitigate AI's Economic Risks
Position: Use Sparse Autoencoders to Discover Unknowns
Position: LLM Serving Needs Mathematical Optimization and Algorithmic Foundations, Not Just Heuristics
UnMaskFork: Test-Time Scaling for Masked Diffusion via Deterministic Action Branching
Low-Compute Watermark Removal via Dual-Domain Natural Projection
CANDI: Hybrid Discrete-Continuous Diffusion Models
Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL
Steer Like the LLM: Activation Steering that Mimics Prompting
The Consistency Trap in LLMs: Generator-Evaluator Agreement and Vulnerability to Mistakes
Introspection Adapters: Training LLMs to Report Their Learned Behaviors
Beyond Looking Up, Try Looking Around: Harmonizing Global Structure and Local Consistency in Optimal Transport for Short Text Clustering
Online Contract Design With Unknown Technology
Cross-Tactile Sensor Representation Learning
Mixtures Closest To A Given Measure: A Semidefinite Programming Approach
Understanding LoRA as Knowledge Memory: An Empirical Analysis
Phy-CoSF: Physics-Guided Continuous Spectral Fields Reconstruction and Spectral Super-Resolution for Snapshot Compressive Imaging
Reflective Hamiltonian Monte Carlo: Mixing Analysis and Application to Sampling on Stiefel Manifold
Reinforcement-aware Knowledge Distillation for LLM Reasoning
SciNet: Evaluating AI Agents in Relation-Aware Scientific Literature Retrieval
Amortized Maximum Inner Product Search with Learned Support Functions
AudioChat: Unified Audio Storytelling, Editing, and Understanding with Transfusion Forcing
ACO-MoE-LoRA: Evolving-while-Training for Adapting Segment Anything Model 2 to Specialized Domains
Learning to Approximate Uniform Facility Location via Graph Neural Networks
Language Bias in LVLMs: From In-Depth Analysis to Simple and Effective Mitigation
StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
CausalProfiler: Generating Synthetic Benchmarks for Rigorous and Transparent Evaluation of Causal Machine Learning
Shift-Dependent Asymmetry: Orthogonal Inverse Low-Rank Adaptation for Federated Medical Segmentation
Computational Arbitrage in AI Model Markets
Safe Reinforcement Learning with Preference-based Constraint Inference
SLQ: Bridging Modalities via Shared Latent Queries for Retrieval with Frozen MLLMs
Directional Neural Collapse for Self-Supervised Visual Representation Learning
Float8@2bits: Entropy Coding Enables Data-Free Model Compression
Quantifying Frontier LLM Capabilities for Container Sandbox Escape
Towards Functional Correctness of Large Code Models with Selective Generation
ABSINT-AI: Agentic Heap Abstractions for Abstract Interpretation
FluxNet: Learning Capacity-Constrained Local Transport Operators for Conservative and Bounded PDE Surrogates
DirectEdit: Step-Level Accurate Inversion for Flow-Based Image Editing
Self Optimizing Language Models
Online Bayesian Experimental Design for Partially Observed Dynamical Systems
When RL Meets Adaptive Speculative Training: A Unified Training-Serving System
Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints
From Welfare to Utility: Generalized Objectives in Budget-Feasible Procurement
Symbol-Equivariant Recurrent Reasoning Models
Refining Dual Spectral Sparsity in Transformed Tensor Singular Values
Generation is Required for Data-Efficient Perception
PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding
EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions
Vision Language Models Cannot Reason About Physical Transformation
Model-Dowser: Data-Free Importance Probing to Mitigate Catastrophic Forgetting in Multimodal Large Language Models
UniFLoW: Universal Multi-Modal Federated LoRA Fine-Tuning Framework with Analytical Aggregation
MoCL: Metabolic Optimization for Curvature-Aware Continual Learning
LOTTERY: Learning from Reference-Only Samples in Two-Sample Testing under Size Asymmetry
Understanding Transfer Learning of RNA Foundation Models on Downstream Tasks
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
A Fine-Grained Understanding of Uniform Convergence for Halfspaces
Why Are Linear RNNs More Parallelizable?
Prototype-Grounded Concept Models for Verifiable Concept Alignment
Mixture of Horizons in Action Chunking
DiscoForcing: A Unified Framework for Real-Time Audio-Driven Character Control with Diffusion Forcing
Information Geometry Loss for Time Series Forecasting
COPF: An Online Framework for Deployment-Stable Counterfactual Fairness in Evolving Graphs
TINNs: Time-Induced Neural Networks for Solving Time-Dependent PDEs
Hyper-LLaVA: Hyperbolic Uncertainty-aware Modality-Balanced Routing for Multimodal Continual Instruction Tuning
Break the Block: Dynamic-size Reasoning Blocks for Diffusion Large Language Models via Monotonic Entropy Descent with Reinforcement Learning
Towards Efficient LLMs Annealing with Principled Sample Selection
MEDUSA: Motion Elimination in Diffusion Using Spectral Attack
Op-CAD: Benchmarking and Investigating Operation-oriented CAD Generation
Beyond Logits: Metastable Latent Dynamics for Sample-Efficient Best-of-N Selection in LLMs
MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics
FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA
Keep Everyone Happy: Online Fair Division of Numerous Items with Few Copies
XSpecMesh: Quality-Preserving Auto-Regressive Mesh Generation Acceleration via Multi-Head Speculative Decoding
Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective
Learning to Correct: Reinforcement Learning for Multi-Attempt Chain-of-Thought
Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum
Understanding Multimodal Learning: A Loss Landscape Smoothness Perspective
A Sketch-and-Project Analysis of Subsampled Natural Gradient Algorithms
Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models
FPTQuant: Function-Preserving Transforms for LLM Quantization
A Penalty Approach For Differentiation Through Black-box Quadratic Programming Solvers
MICE-Bench: A Challenging and Comprehensive Benchmark for Multi-Reference Image Creation and Editing
Cello: A Universal Cell-wise Feature Aggregation framework for Reliable Pathology Images Analysis
Being More Lightweight and Practical: Mini-sized Contrastive Learning Pre-trained Models for Fine-grained Traffic Task
Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles
Efficient Preference Poisoning Attack on Offline RLHF
Exploring and Exploiting Stability in Latent Flow Matching
A Probabilistic Framework for LLM-Based Model Discovery
Towards a Holistic Understanding of Selection Bias for Causal Effect Identification
pTNAS: Progressive Neural Architecture Search for Tabular Data
Adversarial Latent Embedding Repair for LLM Continual Learning
Multi-Integration of Labels across Categories for Component Identification (MILCCI)
Inner-layer Token Self-Modulation as Another Scaling Axis for LLMs
HeraSys: Collaborative Serving of Multiple LLM Workflows via Fine-Grained End-to-End Optimization
Fleet: Few-Shots Lead Effective AIGI Detection
Spike-HTR: Spiking Neural Transformer for Handwritten Text Recognition
TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering
SG2Loc: Sequential Visual Localization on 3D Scene Graphs
LocalV: Exploiting Information Locality for IP-level Verilog Generation
On Contraction of Sequential and Offset Rademacher Complexities
Learning to Label: A Reinforced Self-Evolving Framework for Semi-supervised Referring Expression Segmentation
Distilling Task-Level Coordination Policies for Generalizable Multi-Agent Cooperation
GraphFlow: A Graph-Based Workflow Management for Efficient LLM-Agent Serving
SL-VC: A Benchmark and Automated Framework for Separation Logic Verification Condition Proving
The Generalization Spectrum: A Chromatographic Approach to Evaluating Learning Algorithms
Quantifying the Generalization Gap in Seizure Detection: A Large-Scale Empirical Benchmark via the SzCORE Challenge
CSPO: Constraint-Sensitive Policy Optimization for Safe Reinforcement Learning
OptiFluence: Principled Design of Privacy Canaries
Robust and Consistent Ski Rental with Distributional Advice
Adversarial Training for Process Reward Models
Independent Component Discovery in Temporal Count Data
Constrained Bayesian Experimental Design via Online Planning
ExpWeaver: LLM Agents Learn from Experience via Latent RAG
Evolving Interdependent Operators with Large Language Models for Multi-Objective Combinatorial Optimization
Esoteric Language Models
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training
Which Reasoning Traces Are Worth Generating Further? Data Curation for Training Reasoning Models
MGAL: A Multilingual Granularity-Aware Long-Context Benchmark
Modality-Decoupled Online Recursive Editing
MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models
SAOT: Self-Supervised Continual Graph Learning with Structure-Aware Optimal Transport
CLIMB: Taming the LoRA Residency Cliff in Multi-LoRA Serving
Segment-Aligned Policy Optimization for Multi-Modal Reasoning
Beyond Single-View Indexing: Structure-Aware Multi-View Retrieval for Knowledge-Based VQA
Not All Frequencies Are Equal: Energy-Adaptive Diffusion for Time Series Forecasting
Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation
VT-Bench: A Unified Benchmark for Visual-Tabular Multi-Modal Learning
Semi-Supervised Neural Super-Resolution for Mesh-Based Simulations
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
FedPDG: Prediction Discrepancy–Guided Data Generation for Heterogeneous Federated Learning
KernelFoundry: Hardware-Aware Evolutionary GPU Kernel Optimization
Adaptive Code Watermarking Through Reinforcement Learning
Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling
SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection
Predicting the Emergence of Induction Heads in Language Model Pretraining
Focus-Then-Contact: Speeding Up Robotic Contact-Rich Task Learning with Affordance-Guided Real-World Residual Reinforcement Learning
From Evaluation to Design: Using Potential Energy Surface Smoothness Metrics to Guide ML Interatomic Potential Architectures
Progressive Cramming: Reliable Token Compression and What It Reveals
Equilibrium Propagation for Non-Conservative Systems
Learn to change the world: Multi-level reinforcement learning with model-changing actions
Regret Minimization With a Crowd of Awakening Experts
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Geodesic Calculus on Implicitly Defined Latent Manifolds
MIMO-LP: A Multi-Input Multi-Output Framework for Subgraph-based Link Prediction
Towards Diverse Scientific Hypothesis Search with Large Language Models
Self-Prophetic Decoding to Unlock Visual Search in LVLMs
SpecExit: Accelerating Large Reasoning Model via Speculative Exit
Adaptively Robust Resettable Streaming
Skill Neologisms: Towards Skill-based Continual Learning
Improved Algorithms for Nash Welfare in Linear Bandits
Target-Aware Bandit Allocation for Scalable Surrogate Optimization in Chemical Space
Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment
ProtoVAR: Efficient Dataset Distillation via Prototype-Guided Visual Autoregressive Modeling
Learning Locally, Revising Globally: Global Reviser for Federated Learning with Noisy Labels
Designing noise schedules for diffusion models with spectral analysis
Training-Free Adaptation of Diffusion Models via Doob's $h$-Transform
Transform Trained Transformer for Accelerating Native 4K Video Generation
GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning
Physics in 2-Steps: Locking Motion Priors Before Visual Refinement Erases Them
Adaptive Estimation and Inference in Semi-parametric Heterogeneous Clustered Multitask Learning via Neyman Orthogonality
TSFAdv: Frequency-Guided Black-Box Adversarial Attacks on Time Series Forecasting
Spatial Memory for Out-of-Vision Manipulation in Vision-Language-Action
Evaluating Language Models in Realistic Conversational Contexts
Causes and Consequences of Representational Similarity in Machine Learning Models
Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution
AREA: Attribute Extraction and Aggregation for CLIP-Based Class-Incremental Learning
Sparser Block-Sparse Attention via Token Permutation
TextAtlas5M: A Large-Scale Dataset for Long Text Image Generation
Theory of Minimal Weight Perturbations in Deep Networks and its Applications for Low-Rank Activated Backdoor Attacks
On Expressive Power of Floating-Point Transformers
MLUBench: A Benchmark for Lifelong Unlearning Evaluation in MLLMs
Primal-Spectral Generative Modeling: Fast Analytical Generation via Pseudoinverse Lévy Inversion
LagLLM: LLM-empowered lead–lag dependency learning for spatial-temporal time series forecasting
Learning Adaptive Topology with FiLM-Guided Distillation for Tertiary Structure-Based RNA Design
Personalized Policy Learning through Discrete Experimentation
OMAC: A Holistic Optimization Framework for LLM-Based Multi-Agent Collaboration
Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization
When AI Agents Compete for Jobs: Strategic Capabilities and Economic Dynamics of AI Labour Markets
Learning Global Representation from Queries for Vectorized HD Map Construction
Variance-Reduced $(\varepsilon, \delta)-$Unlearning using Forget Set Gradients
Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning
INDEXGUARD: Index-only Backdoor Vetting for Secure Federated PEFT of Large Language Models
Approximating f -Divergences with Rank Statistics
OvisOCR: End-to-End Document Parsing via Aligning Specialized Perception with General Reasoning
Learning-augmented Rent-or-Buy with a Sample
Auditing Sybil: Explaining Deep Lung Cancer Risk Prediction Through Generative Interventional Attributions
Uncovering the Latent Potential of Deep Intermediate Representations
OCNR: Stabilizing Self-Play by Mitigating Iteration-Collapse With One-Class Novelty Rewards
FedUSD: Unbiased Synthetic Data for Federated Learning
Algorithmic Recourse of In-Context Learning for Tabular Data
Evaluating Parameter Efficient Methods for RLVR
Learn from A Rationalist: Distilling Intermediate Interpretable Rationales
RedDebate: Safer Responses Through Multi-Agent Red Teaming Debates
iLoRA: Bayesian Low-Rank Adaptation with Latent Interaction Graphs for Microbiome Diagnosis
RECTOR: Masked Region-Channel-Temporal Modeling for Affective and Cognitive Representation Learning
SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models
Agent JIT Compilation for Latency-Optimizing Computer-Use Agent Planning and Scheduling
Safety Game: Inference-Time Alignment of Black-Box LLMs via Constrained Optimization
$\mathbb{R}^{2k}$ is Theoretically Large Enough for Embedding-based Top-$k$ Retrieval
Conditional Equivalence of DPO and RLHF: Assumptions, Failure Modes, and Provable Alignment
PPDL: LLM-Based Flows as Probabilistic Programs
Federated Bilevel Performative Prediction
TetraJet-v2: Accurate NVFP4 Training for Large Language Models with Oscillation Suppression and Outlier Control
Mixtures of geodesic factor analyzers on Riemannian homogeneous spaces
Token-Efficient Change Detection in LLM APIs
IVQA-LD: Inclusive Multimodal Understanding for Population with Limb-Deficiency
DiffStyle3D: Consistent 3D Gaussian Stylization via Attention Optimization
Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
VisualScore: Learning Holistic Visual Quality Scores via Multi-Task Reasoning
AdaSplash-2: Faster Differentiable Sparse Attention
FaPS: A General and Fast Training Method for Diffusion Models
Beyond Literal Translation: Evaluating Cultural Effectiveness in Social Media UGC
Sharpness-Aware Minimization Can Hallucinate Minimizers
Unpaired Visual Editing with Self-Consistent Flow Matching
Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner
Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models
Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models
Width Independent Bounds for the Local Lipschitz Constant of Deep Neural Networks at Random Initialization and after Lazy Training
Simple Policy Gradients for Reasoning with Diffusion Language Models
SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes
How Language Models Process Negation
Temporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via Past-Future
Modeling temporal scRNA-seq data with latent Gaussian process and optimal transport
Asymptotically Fast Clebsch-Gordan Tensor Products with Vector Spherical Harmonics
Stem: Rethinking Causal Information Flow in Sparse Attention
Judgment Operators: A Composition-Invariant Substrate for Multi-Agent Action Spaces
Emergent Analogical Reasoning in Transformers
TMS: Trajectory-Mixed Supervision for Reward-Free, On-Policy SFT
Rethinking Attention in Spiking Transformers: Overcoming Density Bias with Set Similarity
WaveSSM: Multiscale State-Space Models for Non-stationary Signal Attention
Directly Optimizing Natural Language Explanations for Behavioral Faithfulness: Simulatability and Recoverability
A Strictly Proper Scoring Rule and a Calibration Metric for Interval-Censored Data Analysis
TD-VAD: Breaking Visual Dependence in Video Anomaly Detection with Text-Driven Learning
TabPack: Efficient Hyperparameter Ensembles for Tabular Deep Learning
What If We Let Forecasting Forget? A Sparse Bottleneck for Cross-Variable Dependencies
DuRP: Dual-Stage Physics-Embedded Learning for Joint Radiance and Polarization Restoration
Capacitated Fair-Range Clustering: Hardness and Approximation Algorithms
Anti-Backdoor Coreset Selection via Cumulative Entropy
REVIS: Sparse Latent Steering to Mitigate Object Hallucination in Large Vision-Language Models
Scaling Real-World Robot Policy Evaluation via Discrete Diffusion World Model
Variational Inference for Uncertain Optimal Transport via Sinkhorn Parametrization
Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment
RVAS: Referring Video Active Exploration and Segmentation
Learning to Discover at Test Time
QUATRO: Query-Adaptive Trust Region Policy Optimization for LLM Fine-tuning
Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction
Select to Think: Unlocking SLM Potential with Local Sufficiency
Training-Free Multimodal Large Language Model Orchestration
When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop
Marrying Generative Model of Healthcare Events with Digital Twin of Human-Environment Interaction for Disease Reasoning
Contextualized Visual Personalization in Vision-Language Models
Thinking in Flow: A Dissipative Stabilization Operator for Robust Autoregressive Reasoning
Maximin Relative Improvement: Fair Learning as a Bargaining Problem
Less Is More: Elevating RAG via Performance-Driven Context Compression
Spherical Steering: Geometry-Aware Activation Rotation for Language Models
S$^3$GNN: Efficient Global Mixing and Local Message Passing for Long-Range Graph Learning
Riemannian Optimization for Fair Spectral Clustering
Monitorability as a Free Gift: How RLVR Spontaneously Aligns Reasoning
Rethinking Feature Alignment in Generalist Graph Anomaly Detection: A Relational Fingerprint-based Approach
Analytic Bijections for Smooth and Interpretable Normalizing Flows
Very Efficient Listwise Multimodal Reranking for Long Documents
Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR
Understanding Behavior Cloning with Action Quantization
Geometric Convergence of Gauss–Newton for Neural Networks: Riemannian Geometry and Adaptive Damping
Resilient Coresets and Consistent Clustering
SleepLM: Natural-Language Intelligence for Human Sleep
Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models
DDIM Inversion as a Perturbation Amplifier: Breaking Mimicry Protection via Reconstruction Error Minimization
GemDepth: Geometry-Embedded Features for 3D-Consistent Video Depth
MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks
$L^3$: Large Lookup Layers
Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching
Align Your Trajectory Tangent: Training Better Consistency Models via Manifold-Aligned Tangents
Beyond Sample-Level Forgetting: Improving Reliability in Multimodal Unlearning
On Testing Conditional Mean Independence for Manifold-Valued Data
Multi-Objective Preference Optimization: Improving Human Alignment of Generative Models
Message Tuning Outshines Graph Prompt Tuning: A Prismatic Space Perspective
Weight-Space Learning for Certifiable Few-shot Transfer Learning
On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs
From Retrieval to Translation: Translating Query into Graph-level Clues for Retrieval-Augmented Generation
Respecting Modality Gap in Post-hoc Out-of-distribution Detection with Pre-trained Vision-Language Models
General Quantification of Covariate and Concept Shifts
Graph-GRPO: Training Graph Flow Models with Reinforcement Learning
Beyond ReLU: Bifurcation, Oversmoothing, and Topological Priors
Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Transformers Provably Learn Algorithmic Solutions for Graph Connectivity, But Only with the Right Data
$E^2$PO: Embedding-perturbed Exploration Preference Optimization for Flow Models
Over-Alignment vs Over-Fitting: The Role of Feature Learning Strength in Generalization
Clipping Makes Distributed and Federated Asynchronous SGD Robust to Stragglers
Learning Realistic Depth via Physics-Grounded Noise Disentanglement with Semantic-Geometric Collaboration
A Unified Approach to Interpreting Knowledge Distillation for Large Language Models via Interactions
Error-Driven Graph Augmentation for Mesh-Based PDE Surrogates
Identifying Connectivity Distributions from Neural Dynamics Using Flows
The Viscosity of Logic: Phase Transitions and Hysteresis in DPO Alignment
Learning-Augmented Scalable Linear Assignment Problem Optimization via Neural Dual Warm-Starts
Local Intrinsic Dimension of Representations Predicts Alignment and Generalization in AI Models and Human Brain
LERD: Latent Event-Relational Dynamics for Neurodegenerative Classification
Conformal Risk-Averse Decision Making with Action Conditional Guarantee
Cooperative variance estimation and Bayesian neural networks disentangle aleatoric and epistemic uncertainties
Pessimistic Verification for Open-Ended Math Questions
See, Act, Adapt: Active Perception for Unsupervised Cross-Domain Visual Adaptation via Personalized VLM-Guided Agent
Off-Policy Evaluation Beyond Overlap under Network Interference
RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization
Nash Equilibria in Games with Playerwise Concave Coupling Constraints: Existence and Computation
HybridOM: Hybrid Physics-Based and Data-Driven Global Ocean Modeling with Efficient Regional Downscaling
Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations
FairMerging: Rethinking Model Merging through the Lens of Fairness
MINT: Minimal Information Neuro-Symbolic Tree for Objective-Driven Knowledge-Gap Reasoning and Active Elicitation
Enhancing LLMs for Graph Tasks via Graph-aware LoRA Generation
PACER: Acyclic Causal Discovery from Large-scale Interventional Data
Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking
Membership Inference Attacks for Unseen Classes
Gradient Flow Through Diagram Expansions: Learning Regimes and Explicit Solutions
MotionMAR: Multi-scale Auto-Regressive Human Motion Reconstruction from Sparse Observations
Joint-Embedding Predictive Learning of Latent Market States in U.S. Equities
RiboSphere: Learning Unified and Efficient Representations of RNA Structures
ITSPACE: Monotone Gaussian Optimal Transport Updates
Benchmarking at the Edge of Comprehension
Risk-Averse and Optimistic Advertiser Incentive Compatibility in Auto-bidding
CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability
Efficiently Learning Drifting Halfspaces with Massart Noise
When Agents Go Rogue: Activation-Based Detection of Malicious Behaviors in Multi-Agent Systems
$\text{DT}^\text{2}$: Decision-Targeted Digital Twins
Prism-MoE: Efficient Dense-to-MoE Conversion for Visual Autoregressive Generation
Hyperbolic RQ-VAE enhanced Generative Recommendation with Differential-Length Codebook Strategy
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
HPS: Hyperspherical Parameter Sharing for Efficient Multi-Agent Reinforcement Learning
CoCoQuant: Breaking the Bandwidth Wall via Co-Optimized Communication and Computation Quantization
Interventional Processes For Causal Uncertainty Quantification
Where Concept Erasure Should Occur: Concept–Layer Alignment in Text-to-Video Diffusion Models
UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations
ABCD: All Biases Come Disguised
Geometric Conformal Prediction with Spatial Ranks and Multivariate Quantiles
Multiple Choice Learning of Low-Rank Adapters for Language Modeling
Selective Deferred Routing: Enabling Cost-Efficient Collaboration between Local SLMs and Remote LLMs
Multi-Objective Protein Design via Memory-Aware Test-Time Scaling in Diffusion Models
Solving Imperfect-Recall Games via Sum-of-Squares Optimization
AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning
Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations
Normalization Equivariance for Arbitrary Backbones, with Application to Image Denoising
Preconditioning Neural Tangent Kernel for Adaptive Optimization
Global Directional Priors with Local Statistical Validation for Scalable Causal Discovery
QuantumBoost: A lazy, yet fast, quantum algorithm for learning with weak hypotheses
Tighter Regret Lower Bound for Gaussian Process Bandits with Squared Exponential Kernel in Hypersphere
Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs
Deep sequence models tend to memorize geometrically; it is unclear why.
Decision Tree Learning on Product Spaces
Predicting What Matters: Robust Generalist Robot Policy Learning via Future Semantic Mask
Correcting Overparameterization Effects in Fair Empirical Risk Minimization
PLASH: Provably Linear-Time Attention with Selective Higher-Order Feature Sketching
InfVSR: Toward Consistency-Driven Streaming Generative Video Super-Resolution
At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization
Spatiotemporal Imputation with Graph-Informed Flow Matching
Data- and Variance-dependent Regret Bounds for Online Tabular MDPs
Learning to Search and Searching to Learn for Generalization in Planning
Adaptive Generation of Bias-Eliciting Questions for LLMs
Formally Exploring Visual Anomaly Detection Evaluation Metrics
CARE: Class-Adaptive Expert Consensus for Reliable Learning with Long-Tailed Noisy Labels
Private and Stable Test-time Adaptation with Differential Privacy
Stochastic Order Learning: An Approach to Rank Estimation Using Noisy Data
Trajectory Consistency for One-Step Generation on Euler Mean Flows
Factored Causal Representation Learning for Robust Reward Modeling in RLHF
Optimal and Scalable MAPF via Multi-Marginal Optimal Transport and Schrödinger Bridges
Learning the ESG Geometry with Domain Aware Language Models
CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers
Schur-A*: Layer-wise Optimal Expert Pruning for Sparse MoEs via Schur-Complement Guided A* Search
DiP-G: Discrete Prompting for Graph Neural Networks
PrivCode++ : Latent-Conditioned Differentially Private Code Generation for Comprehensive Guarantees
Hierarchical Reinforcement Learning for Sparse-Reward Search in Commutative Algebra
PISA: Privacy-Preserving Split Adaptation with Model IP Protection
Conflict-Aware Additive Guidance for Flow Models under Compositional Rewards
Ultrafast On-Chip Online Learning via Spline Locality in Kolmogorov–Arnold Networks
UniFast-HGR: Scalable and Efficient Maximal Correlation for Multimodal Models
Transolver-3: Scaling Up Transformer Solvers to Industrial-Scale Geometries
LangForce: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries
ASRU: Activation Steering Meets Reinforcement Unlearning for Multimodal Large Language Models
R-Diverse: Mitigating Diversity Illusion in Self-Play LLM Training
Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models
Are VLMs Seeing or Just Saying? Uncovering the Illusion of Visual Re-examination
Generative Inverse Design with Abstention via Diagonal Flow Matching
Graph-Preference Learning: Debiasing Network-Sampled Human Feedback for Target Welfare Estimation
Positive Distribution Shift as a Framework for Understanding Tractable Learning
When Do Graph Foundation Models Transfer? A Data-Centric Theory
The Implicit Bias of Adam and Muon on Smooth Homogeneous Neural Networks
Zeus: Towards Tuning-Free Foundation Model for Time Series Analysis
Training-Free Rate-Distortion-Perception Traversal With Diffusion
Practical and Scalable Hamiltonian Monte Carlo Without the Metropolis Test
Balancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspective
Efficient Test-time Inference for Generative Planning Models with OCL Search
Semi-Supervised Gaze Estimation via Disentangled Subspace Contrastive Learning
Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs
Epistemic Uncertainty Quantification for Pre-trained VLMs via Riemannian Flow Matching
Interpretable Embeddings with Sparse Autoencoders: A Data Analysis Toolkit
Learning from Pairwise Preferences in Long-Term Decision Problems
Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation
Spectral Reach: Understanding Neural Scaling through Kernel Alignment Dynamics
GAUSS: Graph-Assisted Uncertainty Quantification using Structure and Semantics for Long-Form Generation in LLMs
Principle-Evolvable Scientific Discovery via Uncertainty Minimization
Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models
Bandit Social Leaning Dynamics with Exploration Episodes
RESIDUAL-GUIDED MULTI-RESOLUTION REFINEMENT OF FOUNDATION MODELS - A CASE STUDY IN DROUGHT FORECASTING
Escaping the Likelihood Trap: Geometric Diversity Optimization for Long-Form Image Captioning
WebWorld: A Large-Scale World Model for Web Agent Training
DART: Distribution-Aware Adaptive Relational Transfer for Adversarial Attacks against Closed-Source MLLMs
Evaluating Sample Utility for Efficient Data Selection by Mimicking Model Weights
Weasel: Out-of-Domain Generalization for Web Agents via Importance-Diversity Data Selection
GEM-FI: Gated Evidential Mixtures with Fisher Modulation
KAST-BAR: Knowledge-Anchored Semantically-Dynamic Topology Brain Autoregressive Modeling for Universal Neural Interpretation
Unleashing the Representational Power of Fourier Shapes for Attacking Infrared Object Detection
Realizable Bayes-Consistency for General Metric Losses
Backward SDE–Based Diffusion for Physics-Constrained Generation
Projection-Free Algorithms for Minimax Problems
Any-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
Theory of Continual Learning Against Data Poisoning Attacks
Task-Aware Exploration via a Predictive Bisimulation Metric
MarketSim: Simulating Stock Markets with Large-Scale Generative Agents
LoCoT2V-Bench: Benchmarking Long-Form and Complex Text-to-Video Generation
Dimensionality Reduction with Point-distributions Similarity Invariant
Learning Unmasking Policies for Diffusion Language Models
Bridging Your Imagination with Audio-Video Generation via a Unified Director
3D MeanFlow: One-Step Point Cloud Completion and Generation via Average-Velocity Transport
Steering Beyond the Support: Adversarial Training on Unsupervised Jailbroken Activation Simulation
Manifold-Aligned Guided Integrated Gradients for Reliable Feature Attribution
Predicting evolutionary rate as a pretraining task improves genome language model representations
Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning
Forget-It-All: Multi-Concept Machine Unlearning via Concept-Aware Neuron Masking
Leveraging Machine Unlearning for Cost-Efficient Preference Alignment
Mitigating Gradient Pathology in PINNs through Aligned Constraint
HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-bench
Convex Basins in Single-Index Model Loss Landscapes: Applications to Robust Recovery under Strong Adversarial Corruption
Where Signals Are Sparse, We Synthesize: Reinforcing Self-Corrective Reasoning in Vision–Language Models via Rollout Augmentation
Multi-timescale Reinforcement Learning by Value Reconstruction
EAGer: Entropy-Aware GEneRation for Adaptive Inference-Time Scaling
HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction
Discrete Adjoint Schrödinger Bridge Sampler
Row-stochastic matrices can provably outperform doubly stochastic matrices in decentralized learning
MiniMax Learning of Interpretable Factored Stochastic Policies from Conjoint Data, with Uncertainty Quantification
Matrix-Free GPU Semidefinite Programming for Quantum Ordered Search at the k=6 Frontier
InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
PrivAct: Internalizing Contextual Privacy Preservation via Multi-Agent Preference Training
Which LLM Multi-Agent Protocol to Choose?
Train Once, Reuse Everywhere: Generalizable Implicit ICL by Routing Attention
Failure-Driven Workflow Refinement
An analytic theory of convolutional neural network inverse problems solvers
Dynamics of neural scaling laws in random feature regression with powerlaw-distributed kernel eigenvalues
DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
Constitutional Black-Box Monitoring for Scheming in LLM Agents
PPT-Eval: A Benchmark for Computer-Use Agents on PowerPoint Tasks
A Computational Framework for Evaluating Human-likeness in LLMs' Open-ended Human Behaviors
Evaluating and Explaining Prompt Sensitivity of LLMs Using Interactions
Entropic Mirror Monte Carlo
Gecko: A Simulation Environment with Stateful Feedback for Refining Agent Tool Calls
Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search
Seg-ReSearch: Segmentation with Interleaved Reasoning and External Search
Hierarchical Anchor Graph Learning for Multi-View Clustering
SEMA: a Scalable and Efficient Mamba like Attention via Token Localization and Averaging
Moving Beyond Sparse Grounding with Complete Screen Parsing Supervision
DRFusion: Drift-Resilient Temporally Consistent Infrared–Visible Video Fusion
MARS-SQL: A Multi-Agent Reinforcement Learning Framework For Text-To-SQL
Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning
ATLAS: Learning to Optimally Memorize the Context at Test Time
A General Framework for Dynamic Consistent Submodular Maximization
How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning
Adaptive Multi-Round Allocation with Stochastic Arrivals
Counterfactual Occlusion-Aware Learning via Visibility Intervention for LiDAR Anomaly Detection
MoRGEN: Mixture-of-Resolutions Generative Forecasting for Irregularly Sampled Medical Time-Series Data
UniCode: Augmenting Evaluation for Code Reasoning
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement
Decoupling The "What" and "Where" With Polar Coordinate Positional Embedding
MeshTok: Efficient Multi-Scale Tokenization for Scalable PDE Transformers
Active Attacks: Red-teaming LLMs via Adaptive Environments
Verifiable Multimodal Reasoning: Fact-level Attribution with Multimodal Sources
Do Natural Language Interpretability Methods Convey Privileged Information?
AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection
DEER: A Benchmark for Evaluating Deep Research Agents on Expert Report Generation
WIND: Weather Inverse Diffusion for Zero-Shot Atmospheric Modeling
TeamTR: Trust-Region Fine-Tuning for Multi-Agent LLM Coordination
Understanding the Ability of LLMs to Handle Character-Level Perturbation
Mean-Shift PCA by Knockoff Mean
Watermarking Graph Neural Networks via Explanations for Ownership Protection
MAST: Motif-Augmented Diffusion with Search Tree for Spectroscopic Molecular Structure Elucidation
3DGS-HPC: Distractor-free 3D Gaussian Splatting with Hybrid Patch-wise Classification
Efficient LLM Moderation with Multi-Layer Latent Prototypes
Causal Flow Q-Learning for Robust Offline Reinforcement Learning
A Robust Optimization Guided Pruning Framework for Vision and Large Language Models
Factorized Scheduling Principle: Learning Interpretable and Transferable Policies via Structured Additive Functions
SCOPE and SCION: Benchmark and Method for Ontology Induction and Fusion from Text
Is One Layer Enough? Understanding Inference Dynamics in Tabular Foundation Models
BOOSTAPR: Boosting Automated Program Repair via Execution-Grounded Reinforcement Learning with Dual Reward Models
Beyond Point-wise Neural Collapse: A Topology-Aware Hierarchical Classifier for Class-Incremental Learning
Entropy-Aware On-Policy Distillation of Language Models
FoeGlass: When Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors
A robust PPG foundation model using multimodal physiological supervision
Out-of-Distribution Evaluation of Rule-Based and Strategic Reasoning in Chess Transformers
Disentangling meaning from language in LLM-based machine translation
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
SIPO: Stabilized and Improved Preference Optimization for Aligning Diffusion Models
Parameter-Masked Decoupled Optimization for Cross-Domain Class-Incremental Learning
A Provable Expressiveness Hierarchy in Hybrid Linear-Full Attention
PPI Candidate Ranking: Large-Scale Evaluation of a Domain Knowledge–Guided Pipeline
Scaling Laws and Architectural Frontiers in Metagenomic Foundation Models
Dynamic TMoE: A Drift-Aware Dynamic Mixture of Experts Framework for Non-Stationary Time Series Forecasting
Non-Stationary Online Structured Prediction with Surrogate Losses
Dissect and Prune: Enhancing Robustness in AI-Generated Image Detection
SPR: A Structured Prompt Refinement Network for Modality Missing
Keeping a Secret Requires a Good Memory: Space Lower-Bounds for Private Algorithms
CPMöbius: Iterative Coach–Player Reasoning for Data-Free Reinforcement Learning
ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning
Can LLM Agents Stick to the Script? Modeling Commitment in Interactive Narratives
FIRE: Multi-fidelity Regression with Distribution-conditioned In-context Learning using Tabular Foundation Models
HARD-KV: Head-Adaptive Regularization for Decoding-time KV Compression
The cost of commitment in option-based hierarchical RL
EVMbench: Evaluating AI Agents on Smart Contract Security
Scalable Training of 3D Gaussian Splatting via Out-of-Core Optimization
Subliminal Effects in Your Data: A General Mechanism via Log-Linearity
Decouple and Cache: KV Cache Construction for Streaming Video Understanding
Identifying Partially Observed Causal Models from Heterogeneous/Nonstationary Data
You Don't Protect if You Don't Expect: Breaking the Key Assumption behind CLIP's Test-Time Defenses
MaMa: A Game-Theoretic Approach for Designing Safe Agentic Systems
Riemannian Metric Matching for Scalable Geometric Modeling of Distributions
Towards Long-Horizon Interpretability: Efficient and Faithful Multi-Token Attribution for Reasoning LLMs
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
CARD: Coarse-to-fine Autoregressive Modeling with Radix-based Decomposition for Transferable Free Energy Estimation
FHAIM: Fully Homomorphic AIM for Private Synthetic Data Generation
Probing RLVR Training Instability through the Lens of Objective-Level Hacking
Inference-Aware Meta-Alignment of LLMs via Non-Linear GRPO
SIMPC: Learning Self-Induced Mirror-Point Consistency for Unsupervised Point Cloud Denoising
BroRL: Scaling Reinforcement Learning via Broadened Exploration
Compact Conformal Subgraphs
On the Learning Dynamics of RLVR at the Edge of Competence
Focus and Dilution: The Multi-stage Learning Process of Attention
Latent Forcing: Reordering the Diffusion Trajectory for Pixel-Space Image Generation
OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph
Parameter Decorrelation via Transition-Variance Alignment for Multivariate Time-series Forecasting
Mitigating Hallucinations in Large Vision-Language Models via Causal Route Gating
Contractive Anchor Resolvent Diffusion for Incomplete Multi-View Clustering
Robust Harmful Features Under Jailbreak Attacks: Mechanistic Evidence from Attention Head Specialization in Large Language Models
Incentivized Exploration with Stochastic Covariates: A Two-Stage Mechanism Design for Recommender System
Mitigating Bias in Locally Constrained Decoding via Tractable Proposals
Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space
FUSE: Ensembling Verifiers with Zero Labeled Data
STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack
Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards
Ambient Dataloops: Generative Models for Dataset Refinement
Multi-marginal temporal Schrödinger Bridge Matching from unpaired data
Letting Trajectories Spread: Quality-Preserving Control for Diverse Flow Matching
Stein Diffusion Guidance: Training-Free Posterior Correction for Sampling Beyond High-Density Regions
From LLM-Generated Conjectures to Lean Formalizations: Automated Polynomial Inequality Proving via Sum-of-Squares Certificates
Extra-Merge: Tracing the Rank-1 Subspace of Model Merging in Language Model Pre-Training
A Stronger Benchmark for Online Bilateral Trade: From Fixed Prices to Distributions
Efficient Bilevel Optimization for CKA-Guided MoE Upcycling
OnePO: Direct One-stage Policy Optimization for SFT-free Domain Adaptation
When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models
Adaptive Residual-Update Steering for Low-Overhead Hallucination Mitigation in Large Vision-Language Models
The Secret Engine Behind RLHF: It's Contarstive Learning All Along
Unlocking Zero-Shot Geospatial Reasoning via Indirect Rewards
SteeringSafety: Benchmarking Representation Steering in LLMs Across Safety Perspectives
The impact of LoRA on Oversmoothing $\colon$ Understanding Catastrophic Forgetting in Mean-Field Attention Dynamics
Optimal Unconstrained Self-Distillation in Ridge Regression: Strict Improvements, Precise Asymptotics, and One-Shot Tuning
Efficient Online Influence Maximization under the Independent Cascade Model with Node-Level Feedback
Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling
A Call to Lagrangian Action: Learning Population Mechanics from Temporal Snapshots
Is the Last Layer Sufficient for Uncertainty Quantification?
Geometric Embedding Alignment via Curvature Matching in Transfer Learning
MAPS: Memory-Aware Predictive Scheduling Framework for Large Language Models Serving
Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning
Last-iterate Convergence of ADMM on Multi-affine Quadratic Equality Constrained Problem
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier
Modeling Hierarchical Thinking in Large Reasoning Models
L-Drive: Beyond a Single Mapping—Latent Context Drives Time Series Forecasting
SRPO: Self-Reflective Policy Optimization for Long-Horizon Reasoning
SURGE: Surrogate Gradient Adaptation in Binary Neural Networks
Can Muon Fine-tune Adam-Pretrained Models?
GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models
The Latent Guardian: Defending Collaborative Perception via Feature-Level Consistency Verification
Ratio-Variance Regularized Policy Optimization
Separating representation from reconstruction enables scalable text encoders
XPERT: Expert Knowledge Transfer for Effective Training of Language Models
Words Towards Explainability: Caption Label-Free Learning via Dual Loop Agentic Time Series Captioning
KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering
Identifiable Nonlinear Differentiable Causal Discovery via Independence and Adaptive Group Sparsity
Reasoning Is Not Free: Robust Adaptive Cost-Efficient Router for LLM-as-a-Judge
Flatland: The Adventures of Gradient Descent with Large Step Sizes
Revisiting the Role of Pretrained Weights in Model Merging: On Near-Optimality within the Core Subspace
AIR-VLA: Vision-Language-Action Systems for Aerial Manipulation
Goal-Oriented Lower-Tail Calibration of Gaussian Processes for Bayesian Optimization
NNiT: Width-Agnostic Neural Network Generation with Structurally Aligned Weight Spaces
Co-Generative De Novo Functional Protein Design
TiMi: Empower Time Series Transformers with Multimodal Mixture of Experts
An Exploration of Non-Euclidean Gradient Descent: Muon and its Many Variants
PACEAttention: Principled and Adaptive Feature Compression-Expansion Grounded in the Geometry of $\text{MCR}^2$
MALICE: Memory-aware Loop Invariants Generation on Symbolic Execution Traces
$\tau^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment
The Fisher Dimension: Instance-Dependent Complexity for Causal Discovery
Channel Adapter for Time Series Foundation Models in Zero-Shot Multivariate Forecasting
Context Distillation Retains Post-Training Capabilities in Continually Trained LMs
Toward Effective Multimodal Graph Foundation Model: A Divide-and-Conquer Based Approach
Spiral RoPE: Rotate Your Rotary Positional Embeddings in the 2D Plane
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing
Approximation Theory for Lipschitz Continuous Transformers
Recursive Binding on a Budget: Subspace Carving in Order-$p$ Tensor Memories
Lookahead Unmasking Elicits Reliable Decoding in Diffusion Language Models
The Expressive Power of Low Precision Softmax Transformers with (Summarized) Chain-of-Thought
Front-Loaded Robust Conformal Prediction: Heavy Calibration, Minimal Test-Time Cost
Empirical Gaussian Processes
Adaptive Sharpness-Aware Minimization with a Polyak-type Step size: A Theory-Grounded Scheduler
A Mechanistic Understanding of Sim-and-Real Co-Training in Generative Policies
Designing Observation and Action Models for Efficient Reinforcement Learning with LLMs
Streaming Covariate Balancing via Discrepancy-Based Feature Coresets
Provably Data-driven Lagrangian Relaxation for Mixed Integer Linear Programming
Convex Optimization for Alignment and Preference Learning on a Single GPU
Linguistic Relative Policy Optimization for Video Anomaly Reasoning
Hardware-Aware Dynamic Sparse Training for Large Output Spaces
Experience-Evolving Multi-Turn Tool-Use Agent with Hybrid Episodic–Procedural Memory
DIVER: Diving Deeper into Distilled Data via Expressive Semantic Recovery
Multilingual Unlearning in LLMs: Transfer, Dynamics, and Reversibility
Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents
OpenDeception: Learning Deception and Trust in Human–AI Interaction via Multi-Agent Simulation
Efficient Online Variational Estimation via Monte Carlo Sampling
SLAT: Segment-Level Adaptive Trimming for Efficient CoT Reasoning
ShapCCS: Shapley-Driven Client Coreset Selection in Federated Learning
PGT: Procedurally Generated Tasks for improving fine-grained understanding in MLLMs
GIPO: Gaussian Importance Sampling Policy Optimization
Source-Free Open-World RF Fingerprint Identification
Robust Sequential Experimental Design for A/B Testing
Item Response Scaling Laws: A Measurement Theory Approach for Efficient and Generalizable Neural Scaling Estimation
Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction
ConFu: Contemplate the Future for Better Speculative Sampling
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings
DecFus: Decentralized Layer-wise Fusion with Dynamic Exploration and Exploitation
Spectral Heat Flow for Conservative Token Condensation in Vision-Language Models
Bulk-Calibrated Credal Ambiguity Sets: Fast, Tractable Decision Making under Out-of-Sample Contamination
Two-Stage Unit Tying for Simplifying Differentiable Logic Gate Networks
AI Engram: In Search of Memory Traces in Artificial Intelligence
Unitary Convolutions for Message-passing and Positional Encodings on Directed Graphs
Multi-agent imitation learning with function approximation: linear Markov games and beyond
Hierarchical Goal Abstractions via Learned Subset Relations
BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback
A Studentized Spherical Harmonics–Based Nonparametric Two-Sample Test for Compositional and Directional Data
Towards Realistic Lifelong Re-identification: Identity Recurrence with Changing Clothes
EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization
HOBIT: Hardness Optimized Batch Sampling for InfoNCE Training
Toward Stable Value Alignment: Introducing Independent Modules for Consistent Value Guidance
SF-Mamba: Rethinking State Space Model for Vision
Accelerated Multiple Wasserstein Gradient Flows for Multi-objective Distributional Optimization
RefChess: Monte-Carlo Move Selection for Zero-Shot Referring Image Segmentation
SoftJAX & SoftTorch: Empowering Automatic Differentiation Libraries with Informative Gradients
TD3B: Transition-Directed Discrete Diffusion for Allosteric Binder Generation
Hierarchical ODE: Learning Continuous-Time Physical Prototypes for Early Link Failure Detection
A Fourier perspective on the learning dynamics of neural networks: from sample complexities to mechanistic insights
Dynamic High-Dimensional Facility Location with Low Recourse
Escaping the Subspace Trap: The Role of Optimizer Geometry in Model Width Expansion
AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions
Orthogonal Hierarchical Decomposition for Structure-Aware Table Understanding with Large Language Models
Stochastic Minimum-Cost Reach-Avoid Reinforcement Learning
SecCodePRM: A Process Reward Model for Code Security
A Coin Flip for Safety: LLM Judges Fail to Reliably Measure Adversarial Robustness
Understand and Accelerate Memory Processing Pipeline for Large Language Model Inference
Zero-Shot Rankability: Revealing Latent Ordinal Structure in Multimodal Large Language Models via Language
Multi-Round Human–AI Collaboration with User-Specified Requirements
On the Difficulty of Learning a Meta-network for Training Data Selection
Gradient-Free Approaches is a Key to an Efficient Interaction with Markovian Stochasticity
R2-Router: A New Paradigm for LLM Routing with Reasoning
Self-Supervised Foundation Model for Calcium-imaging Population Dynamics
Synergistic Intra- and Cross-Layer Regularization Losses for MoE Expert Specialization
Rotation-Invariant Spherical Watermarking via Third-Order SO(3) Representation Coupling
dTRPO : Trajectory Reduction in Policy Optimization of Diffusion Large Language Models
TextME: Bridging Unseen Modalities Through Text Descriptions
Three Years of r/ChatGPT: Societal Impact Evaluations from Social Media Data
Active Continual Learning with Metaplastic Binary Bayesian Neural Networks
Demystifying Multimodal Biomolecular Co-design With Intrinsic Geodesic Coupling
Ego3S: Select, Strengthen, and Synchronize for Efficient Egocentric Reasoning
DualCOIL: Offline Imitation Learning from Contrasting Demonstrations
CORAL: Uncertainty-Aware Regulation of Exposure Concentration in Recommender Systems
Revisiting Robustness for LLM Safety Alignment via Selective Geometry Control
Balancing plasticity and stability with Fast and Slow Successor Features
Locate then Correct: Debiasing Attention Heads in CLIP
Mitigating Error Accumulation in Continuous Navigation via Memory-Augmented Kalman Filtering
ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards
CE$^4$L: Continual Ego, Exo, and Ego-Exo Learning
Neural Control: Adjoint Learning Through Equilibrium Constraints
SyMerge: From Non-Interference to Synergistic Merging via Single-Layer Adaptation
ST-TGExplainer: Disentangling Stability and Transition Patterns for Temporal GNN Interpretability
The Truth Lies Somewhere in the Middle (of the Generated Tokens)
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Efficient Transformer Attention for SNNs via Hadamard Simplification
Sparse Regression with $\ell_0$ Constraints for $\alpha$-Mixing Time Series: Algorithms and Guarantees
Equilibrium Pricing in Oligopolistic Data Markets
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
The Implicit Bias of Depth: From Neural Collapse to Softmax Codes
Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model
3DMedAgent: Unified Perception-to-Understanding for 3D Medical Analysis
KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices
Semi-Supervised Learning for Molecular Graphs via Ensemble Consensus
Reading Between the Tokens: Improving Preference Predictions through Mechanistic Forecasting
AutoMat: Physics-Guided Agentic Reasoning for Solving Ill-Posed Inverse Microscopy Problems
SCNS: Continual Personalization of Diffusion Models via Submodular Concept Neuron Selection
BlueCodeAgent: A Blue Teaming Agent Powered by Automated Red Teaming for CodeGen AI
Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning
DISCO: Mitigating Bias in Deep Learning with Conditional Distance Correlation
A Distributional View for Visual Mechanistic Interpretability: KL-Minimal Soft-Constraint Principle
Zero-Shot 3D Question Answering via Hierarchical View-to-Token Transportation
A Consensus Anchor-guided Hypergraph Framework For Incomplete Multi-view Clustering
D-FUSEr: Diverse Failure, Unified Success via Error-Distribution Shaping in LLM Reasoning
Monitoring LLM-based Multi-Agent Systems Against Corruptions via Node Evaluation
Active Policy Optimization for Individualized Dosing via Gradient Variance Minimization
Greedy Coordinate Diffusion: Effective and Semantically Coherent Adversarial Attacks via Diffusion Guidance
CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models
Time-Series Decomposition as a standalone Task: A Mechanism-Driven Diagnostic Benchmark
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
SynGR: Unleashing the Potential of Cross-Modal Synergy for Generative Recommendation
Class-Conditional Distribution Balancing for Group Robust Classification
Stable Spectral Copula Alignment for Robust Multimodal Learning
Pruning at Initialisation through the lens of Graphon Limit: Convergence, Expressivity, and Generalisation
Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems
Improved Analysis of the Accelerated Noisy Power Method with Applications to Decentralized PCA
On Local Policies for Graph-Structured Markov Decision Processes
Rashomon Sets of Falling Trees
Quantifying LLM Attention-Head Stability: Implications for Circuit Universality
DF-ExpEnse: Diffusion Filtered Exploration for Sample Efficient Finetuning
Machine Learning Hamiltonians are Accurate Energy-Force Predictors
Smoothness Errors in Dynamics Models and How to Avoid Them
Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding
Adaptive Quasimetric Mapping : Principled Topological Abstraction for Robust Offline Goal-Conditioned Navigation
From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs
When Is Rank-1 Enough? Geometry-Guided Initialization for Parameter-Efficient Fine-Tuning
Token-Free Hierarchical Indexing for RAG beyond LLM-based Summarization
A Random Matrix Theory of Masked Self-Supervised Learning
GeoReward: Mitigating Contextual Variable Overestimation in Vision-Language Models for Cross-Market Preference Prediction
Value-as-Return: A Two-Stage Framework to Align on the Optimal Score Function
TEAM: Temporal–Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration
Large Language Models as Topological Thinkers: A Benchmark on Graph Persistent Homology
Dependence-Aware Label Aggregation for LLM-as-a-Judge via Ising Models
Orthogonal Concept Erasure for Diffusion Models
Breaking the Lock-in: Diversifying Text-to-Image Generation via Representation Modulation
Recognize Your Orchestrator: An Entropy Dynamics Perspective for LLM Multi-Agent Systems
Anti-causal domain generalization: Leveraging unlabeled data
On Densest $k$-Subgraph Mining and Diagonal Loading: Optimization Landscape and Finite-Step Exact Convergence Analysis
End-to-end Graph-structured Brain Representation Learning
Beyond Blind Noising: Disentangled Visual Rectification for Hallucination Mitigation in MLLMs
Mitigating Visual Hallucinations via Semantic Curriculum Preference Optimization in MLLMs
Adversarially Robust Control of Conditional Value-at-Risk via Kelly Conformal Inference
MODEL MERGING SCALING LAWS IN LARGE LANGUAGE MODELS
Recurrent Structural Policy Gradient for Partially Observable Mean Field Games
Decoupled Training with Local Reinforcement Fine-Tuning in Federated Learning
Expected Return Causes Outcome-Level Mode Collapse in Reinforcement Learning and How to Fix It with Inverse Probability Scaling
Learning from Comparison: Constrained Projection Policy Optimization for Pareto-Front Improvement
Diversity Matters: Revisiting Test-Time Compute in Vision-Language Models
OrchJail: Jailbreaking Tool-Calling Text-to-Image Agents by Orchestration-Guided Fuzzing
SAGE: Shaping Anchors for Guided Exploration in RLVR of LLMs
Efficient Diffusion LLMs via Temporal-Spatial Parallel Decoding and Confidence Extrapolation
AnyEdit++: Adaptive Long-Form Knowledge Editing via Bayesian Surprise
MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation
Path-Decoupled Hyperbolic Flow Matching for Few-Shot Adaptation
WorldTravel: A Realistic Multimodal Travel-Planning Benchmark with Tightly Coupled Constraints
Semantic Cache Distillation: Efficient State Transfer via Reuse and Selective Patching
Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training
Deep Ensemble Clustering for Visual Representation Learning
Rethinking Visual Intelligence: Insights from Video Pretraining
Conversation for Non-verifiable Learning: Self-Evolving Large Language Models through Meta-Evaluation
Content-Style Identification via Differential Independence
FlashOptim: Memory Efficient Optimizers for Large-Scale Training
GeoEvo: Identity-Aware Potential Game with Geometric Evolution for Personalized Multimodal Federated Learning
CatFlow: Co-generation of Slab-Adsorbate Systems via Flow Matching
Reason, Then Re-reason: Cross-view Revisiting Improves Spatial Reasoning
STFlow: Data-Coupled Flow Matching for Geometric Trajectory Simulation
WeatherSyn: An Instruction Tuning MLLM For Weather Forecasting Report Generation
Information-Theoretic Disentangled Latent Modeling with Conditional Diffusion for Incomplete Multi-View Clustering
RBCBF: Decoding Time Safety Alignment via Risk Guided Rollback and Barrier Control
FEDEMOE: IMPROVING PERSONALIZATION ON HET- EROGENEOUS FEDERATED LEARNING VIA ELASTIC MIXTURE OF EXPERTS ARCHITECTURE
FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents
PhotoAgent: Exploratory Visual Aesthetic Planning with Large Vision Models
Self-correcting for Debiasing Large Language Models
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning
Less Precise Can Be More Reliable: A Systematic Evaluation of Quantization’s Impact on VLMs Beyond Accuracy
Building Social World Model with Large Language Models
Compressed Sensing for Capability Localization in Large Language Models
The Role of Target Update Frequencies in Q-Learning
Diffract: Spectral View of LLM Domain Adaptation
Procedural Pretraining: Warming Up Language Models with Abstract Data
$\tau$-Knowledge: Evaluating Conversational Agents over Unstructured Knowledge
Euler–Poincaré Neural Dynamics: A Geometric-Mechanics Framework for Scientific Simulation
Contrastive Geometric Learning Unlocks Unified Structure- and Ligand-Based Drug Design
ManiSoft: Towards Vision-Language Manipulation for Soft Robotics
Faithful Mobile GUI Agents with Guided Advantage Estimator
FlowMAP: Flow Matching for Generalizable Agent Planning
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Trust Region Inverse Reinforcement Learning
A Perturbation Approach to Unconstrained Linear Bandits
Domain Adaptation with Adaptive $f$-Divergence: Tighter Variational Representation and Generalization Bounds
MM-DeepResearch: A Simple and Effective Multimodal Agentic Search Baseline
PRAC: Principal-Random Subspace for LLM Activation Compression and Memory-Efficient Training
Functional Adjoint Sampler: Scalable Sampling on Infinite Dimensional Spaces
Unlearning’s Blind Spots: Over‑Unlearning and Prototypical Relearning Attack
Semi-knockoffs: a model-agnostic conditional independence testing method with finite-sample guarantees
M+Adam: Low-Precision Training via Mantissa–Exponent Optimization
Fair Decisions from Calibrated Scores: Achieving Optimal Classification While Satisfying Sufficiency
CoRe: Collaborative Reasoning via Cross Teaching
LightAVSeg: Lightweight Audio-Visual Segmentation
Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate
$f$-Divergence Regularized RLHF: Two Tales of Sampling and Unified Analyses
Needles in the Haystack: Addressing Signal Dilution Improves scRNA-seq Perturbation Response Modeling and Evaluation
Exploring More to Solve More: Boosting Diversity in Text Diffusion Models via Entropy-Based Guidance
Normalization-equivariant Diffusion Models: Learning Posterior Samplers From Noisy And Partial Measurements
Automated Formal Proofs of Combinatorial Identities via Wilf–Zeilberger Guidance and LLMs
Reflex: Real-Time Vision-Language-Action Control through Streaming Inference
Mind Your Margin and Boundary: Are Your Distilled Datasets Truly Robust?
IVQ: Structured and Lightweight Vector Quantization via Binary Hierarchical Composition Inspired by $\textit{IChing}$
InteractBench: Benchmarking LLMs on Competitive Programming under Unrevealed Information
Reranker Helps, but Not Enough: Towards Strong Poisoning Attacks Against Retrieval-Augmented Generation
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
Unifying Deep Stochastic Processes for Image Enhancement
When Can We Trust Survival Model Evaluation ?
Per-example Gradients: a New Frontier for Understanding and Improving Optimizers
Privacy Amplification in Differentially Private Zeroth-Order Optimization with Hidden States
TUR-DPO: Topology- and Uncertainty-Aware Direct Preference Optimization
Peer-Preservation in Frontier Models
Von Mises-Fisher Mixture Model with Dynamic Shrinkage for Realistic Test-Time Transduction
LLM4Branch: Large Language Model for Discovering Efficient Branching Policies of Integer Programs
Learning to Execute Graph Algorithms Exactly with Graph Neural Networks
MAS-Architect: Declarative Multi-Agent System Design via Separation of Concerns
Interpretable Self-Supervised Learning via Representer Landmarks and Nyström Approximation
Neural Feature Geometry Evolves as Discrete Ricci Flow
Robust Inter-Series Dependency Modeling for Time Series Forecasting via Information-Theoretic Alignment
LithoDreamer: A Physics-Informed World Model for Multi-Stage Computational Lithography
Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping
Rethinking LLM Ensembling from the Perspective of Mixture Models
SENDAI: A Hierarchical Sparse-measurement, EfficieNt Data AssImilation Framework
Unsat Core Prediction through Polarity-Aware Representation Learning over Clause-Literal Hypergraphs
DANCE: Dynamic, Available, Neighbor-gated Condensation for Federated Text-Attributed Graphs
Long-term Fairness with Selective Labels
How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs
Beyond the Bellman Recursion: A Pontryagin-Guided Framework for Non-Exponential Discounting
Stabilizing Reinforcement Learning for Diffusion Language Models
ECCO: Evidence-Driven Causal Reasoning for Compiler Optimization
Learning Treatment Representations for Downstream Instrumental Variable Regression
Dustin: Draft-Augmented Sparse Verification for Efficient Long-Context Generation with Speculative Decoding
The Double-Edged Nature of the Rashomon Set for Trustworthy Machine Learning
PDFBench: A Benchmark for De Novo Protein Design from Function
Training Diffusion Language Models for Black-Box Optimization
Learning to Perceive the World Through Control: Empowerment-Based Representation Learning
Decision-Focused Learning via Tangent-Space Projection of Prediction Error
Bend the Basics: Degradation-Aware Deformable Tokenization for All-in-One Image Restoration
Context-Driven Incremental Compression for Multi-Turn Dialogue Generation
Diffusion Models Preferentially Memorize Prototypical Examples or: Why Does My Diffusion Model Love Slop?
Turning Adaptation into Assets: Cross-Domain Bridging for Online Vision-Language Navigation
Active Timepoint Selection for Learning Measure-Valued Trajectories
Learning-to-Optimize via Deep Unfolded Flows
CORRECT: COndensed eRror RECognition via knowledge Transfer in multi-agent systems
Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals
Density-Aware Translation of Spurious Correlations in Zero-Shot VLMs
Beyond Magnitude: Scale-Invariant Evidential Fusion for Multi-View Classification
DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs
Clustering as Reasoning: A $k$-Means Interpretation of Chain-of-Thought Graph Learning
Deep Learning for Bioimaging: What are we actually learning?
Reviving Error Correction in Modern Deep Time-Series Forecasting
Efficient RL Training for LLMs with Experience Replay
SI-IGCL: Subject Invariance-aware Inverse Graph Contrastive Learning for Psychiatric Disorder Identification
Task-Awareness Improves LLM Generations and Uncertainty
Learning-To-Measure: In-Context Active Feature Acquisition
Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation
Selecting Samples on Graphs: A Unified Dataset Pruning Framework for Lossless Training Acceleration
Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes
Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents
Provably Valid Uncertainty Quantification for Deep Computed Tomography
Theoretical Characterization of Generalization in Knowledge Distillation
Geodesic Flow Matching for Denoising High-Dimensional Structured Representations
Causal-aware Anomaly Detection for Tabular Data
Unlocking Cross-Modal Biosignal Synthesis: A Temporally-Aware VAE-Diffusion Model
T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning
A Dirac-Frenkel-Onsager principle: Instantaneous residual minimization with gauge momentum for nonlinear parametrizations of PDE solutions
On the Power of Statistics in Class-Incremental Learning with Pretrained Models
The Geometry of Representational Failures in Vision Language Models
Foundations of Equivariant Deep Learning: Unifying Graph and Sheaf Neural Networks
ReQAT: Achieving Full-Precision Reasoning Accuracy with 4-bit Floating-Point Quantization-Aware Training
Riemannian stochastic optimization for sufficient dimension reduction
DistFlow: A Fully Distributed RL Framework for Scalable and Efficient LLM Post-Training
AlignedNorm: Prompting Vision–Language Models via Coupled Prompt Field
SMM Transformer: Leveraging Spiking Neural Networks for Multimodal Tasks
LeakGFN: Robust Molecular Generation in Generative Flow Networks via Flow Decomposition
Evolutionary Multi-View Classification with Label Noise via Gradient and Feature Dual-Perception
Merge to Remember: Sharpness-Aware Isotropic Merging for Continual Learning
Rethinking Forgery Attacks on Semantic Watermarks in Black-Box Settings: A Geometric Distortion Perspective
DELTA4: Sparse Matrix-Vector Multiplication for Low Sparsity
HTAC: Hierarchical Task-Aware Composition for Continual Offline Reinforcement Learning
Closing the Loop: Universal Repository Representation with RPG-Encoder
ML-Embed: Inclusive and Efficient Embeddings for a Multilingual World
EMBGUARD: Constructing Hazard-Aware Guardrails for Safe Planning in Embodied Agents
Learning High-Frequency Continuous Action Chunks in Latent Space
Rethinking Multimodal Time-Series Forecasting Evaluation
Hugging Carbon: Quantifying the Training Carbon Emissions of AI Models at Scale
$\mu$pscaling small models: Principled warm starts and hyperparameter transfer
Linearizing Vision Transformer with Test-Time Training
Language Model Augmented Semi-Supervised Statistical Inference
Smooth Multi-Policy Causal Effect Estimation in Longitudinal Settings
TGPO: Efficient Policy Optimization through Sequence Anchor and Information Gating
LiveFigure: Generating Editable Scientific Illustration with VLM Agents
CRAMER: Control via Request-Aware Masking for Editing Recommenders
EvoEGF-Mol: Evolving Exponential Geodesic Flow for Structure-based Drug Design
UniMapping: Unified SLAM Framework for Map-Centric Embodied Perception
Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception
ProtDBench: A Unified Benchmark of Protein Binder Design and Evaluation
MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games
FloorplanQA: A Benchmark for Spatial Reasoning in LLMs using Structured Representations
IMPACT: Influence Modeling for Open-Set Time Series Anomaly Detection
From Noise to Control: Parameterized Diffusion Policies
From Guessing to Placeholding: A Cost-Theoretic Framework for Uncertainty-Aware Code Completion
Dissecting Causal Mechanism Shifts via FANS: Function And Noise Separation
RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference
Interpreting Genomic Language Models using Sparse Autoencoders
Segment Anything with Robust Uncertainty-Accuracy Correlation
MonoScale: Scaling Multi-Agent System with Monotonic Improvement
Learning to Reason for Factuality
Learn to Merge: Meta-Learning for Adaptive Multi-Task Model Merging
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies
InfoDLM: an Information-Adaptive Framework for Discrete Diffusion Language Model Pretraining
DisjunctiveNet: Neural Symbolic Learning via Differentiable Convexified Optimization Layers
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Guidance: Sentence-Level Citation Enforcement via Prefix-Tail Guidance during LLM Decoding
BESplit: Bias-Compensated Split Federated Learning with Evidential Aggregation
BLISS: A Lightweight Bilevel Influence Scoring Method for Data Selection in Language Model Pretraining
Certifying Capabilities from Finite Tests: When Is It Possible?
Hom-PGD+: Fast Reparameterized Optimization over Non-convex Ball-Homeomorphic Set
Fair-FedMOE: Group-Fair One-Shot Federated Learning via Prototype-Guided Experts for Medical Imaging Analysis
Offline Two-Player Zero-Sum Markov Games with KL Regularization
Fast Mixing Steady-State Control in Markov Decision Processes
GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training
MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training
Hide&Seek: Learning to explain in an end-to-end differentiable network
On the Learnability of Test-Time Adaptation: A Recovery Complexity Perspective
VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics
Sharp Concentration Bounds for Vector Bundle-Valued Statistics on Manifolds
Compositional Planning with Jumpy World Models
LayerT2V: A Unified Multi-Layer Video Generation Framework
MASH: Modeling Abstention via Selective Help-Seeking
SemRep: Code Transformation with Semantics-Preserving Representations
KernelBand: Steering LLM-based Kernel Optimization via Hardware-Aware Multi-Armed Bandits
The Latent Color Subspace: Emergent Order in High-Dimensional Chaos
Decentralized Online Convex Optimization with Efficient Communication: Improved Algorithm and Lower Bounds
Robust Multi-View Fusion via Prototype-Anchored Unbalanced Optimal Transport
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Cognitive Fatigue in Autoregressive Transformers: Formalization and Measurement
Avoid What You Know: Divergent Trajectory Balance for GFlowNets
Embodiment-Conditioned Mixture of Experts Increases the Evolvability of Robots
Predictive variational inference: Learn the predictively optimal posterior distribution
Representation Learning for Equivariant Inference with Guarantees
Do You Want to Know if Two Distributions Are Close to Each Other?Testing the Closeness With Statistical Significance
Learning Unanimously Acceptable Lotteries via Queries
Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach
Dreaming in Code for Curriculum Learning in Open-Ended Worlds
When to Trust the Cheap Check: Weak and Strong Verification for Reasoning
Who’s in Charge? Disempowerment Patterns in Real-World LLM Usage
Incremental Transformer Neural Processes
Improved Bounds for Reward-Agnostic and Reward-Free Exploration
Tracing the Persona Circuit: How Large Language Models Encode and Express Character Traits
XYZFlow: Scaling Multidimensional Shortcut Flows for Efficient Generative Modeling
An Efficient Joint Learning Approach for Item Response Theory
Reinforcement Learning for Non-Verifiable Problems
The Safety-Aware Denoiser for Text Diffusion Models
Optimizing Visual Generative Models via Distribution-wise Rewards
LoBCD-GW: A Fast and Data-Dependent Algorithm for Computing Gromov-Wasserstein Distance via Localized Block Coordinate Descent
A Tight Theory of Error Feedback Algorithms in Distributed Optimization
Text-Driven Fusion for Infrared and Visible Images: Achieving Image Scene Adaptation on Hyperbolic Space
Let Language Constrain Geometry: Vision–Language Models as Semantic and Spatial Critics for 3D Generation
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process
Proxy Compression for Language Modeling
Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization
What Does Flow-Matching Bring to TD-Learning?
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning
Signature-Informed Transformer for Asset Allocation
CODiff: One-Step Diffusion Model for Camouflaged Object Detection
From Moments to Models: Graphon-Mixture Learning for Mixup and Contrastive Learning
Fast Mixture of Curvature-Aware Experts for Diverse and Dynamic Graph Topologies
CB-SLICE: Concept-Based Interpretable Error Slice Discovery
PACT: Self-Evolving Physical Safety Alignment for Diffusion Policies in Embodied Manipulation
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
An Exponential Separation Between Quantum and Quantum-Inspired Classical Algorithms for Linear Systems
Neural–Evolutionary Symbolic Regression with Global Constraints: Constraint-Aware Decoding and Reward Shaping
Neuro-Fuzzy Concept Learning for Interpretable Large Multimodal Models
SWE-ABS: Adversarial Benchmark Strengthening Exposes Inflated Success Rates on Test-based Benchmark
Minimizing Upper Confidence Bounds: A Data-Driven Framework for Stochastic Programming
GFedCL: Graph-Based Federated Continual Learning with Spatial and Temporal Awareness
From Talking to Singing: A New Challenge for Audio-Visual Deepfake Detection
State Space Model with Continuous Limit of HiPPO Matrix: Eigenvalue Analysis and Explicit Solution Formula
PosterAgent: Agentic Poster Generation via Stage-Aware Reinforcement Learning
Realistic Adaptive Merging
Scaling Small Agents Through Strategy Auctions
ReNF: Rethinking the Principles of Neural Long-Term Time Series Forecasters
MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models
Prediction-Powered Adaptive Inference with Pretrained AI Models for Contextual Bandits
POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation
Building Reliable Long-Form Generation via Hallucination Rejection Sampling
FUSE: FK-Steered Multi-Modal Flow Matching for Efficient Simulation-Based Posterior Estimation
Scaling Laws of Global Weather Models
Riemannian Neural Optimal Transport
Learning Discriminative and Generalizable Anomaly Detector for Dynamic Graph with Limited Supervision
Utonia: Toward One Encoder for All Point Clouds
Privacy-Aware Data Integration for Enhanced Quantile Inference under Heterogeneity
Foundation VAE for CT Reconstruction, Augmentation, and Generation
Affine-Equivariant Kernel Space Encoding for NeRF Editing
A General Neural Backbone for Mixed-Integer Linear Optimization via Dual Attention
Hyperparameter Transfer with Mixture-of-Expert Layers
Adversarial Vulnerability from Interference Between Features in Superposition
AgentScore: Autoformulation of Deployable Clinical Scoring Systems
OLion: Approaching the Hadamard Ideal by Intersecting Spectral and L inf Implicit Biases
Data Agent: Learning to Select Data via End-to-End Dynamic Optimization
Constrained Multi-Objective Reinforcement Learning with Max-Min Criterion
Provable Bounds for the Learnability of Sample-Compressible Families from Noisy Samples
LassoFlexNet: a Flexible Neural Architecture for Tabular Data
RefineEvo: Planning-Guided Heuristic Evolution with Bidirectional Experience
Correcting Split Selection in Online Decision Trees via Anytime-Valid Inference
From Volume to Value: Preference-Aligned Memory Construction for On-Device RAG
Landmark-Guided Policy Optimization for Multi-Objective Language Model Selection
Why Dedicated Critics: Eliminating Target Drift in Multi-Constraint RL
Dissecting Multimodal In-Context Learning: Modality Asymmetries and Circuit Dynamics in modern Transformers
Bimodal masked language modeling for bulk RNA-seq and DNA methylation representation learning
Privacy-Aware Video Anomaly Detection: Guided Orthogonal Projection and a Comprehensive Evaluation Framework
PS-PPO : Prefix-Sampling PPO for Critic-Free RLHF
Next-Token Prediction and Regret Minimization
Representational Curvature Shapes Behavioral Uncertainty in Large Language Models
SPARD: Defending Harmful Fine-Tuning Attack via Safety Projection with Relevance–Diversity Data Selection
HiCI: Hierarchical Construction–Integration for Long-Context Attention
One Intervention per Component is Enough: Towards Identifiability in Linear Stochastic Dynamics from Steady State
LMCleaner: Efficient and Certified Online Unlearning via Influence Propagation Truncation
Narrowing the ANN–SNN Gap for 1D Signal Classification with Multi-Scale Temporal Encoding and Sparsity-Regularized Transform Encoding
Differentiable Weightless Controllers: Learning Logic Circuits for Continuous Control
Truthfulness Does Not Scale Like Reasoning: Why Polling Fails as a Proxy Verifier
STRIDE: Post-Training LLMs to Reason and Refine Bio-Sequences via Edit Trajectories
Sponge Tool Attack: Stealthy Denial-of-Efficiency against Tool-Augmented Agentic Reasoning
Transport Clustering: Solving Low-Rank Optimal Transport via Clustering
Towards One-for-All Anomaly Detection for Tabular Data
D$^3$: Dynamic Directional Graph-Constrained Data Scheduling for LLM Training
Test-time Generalization for Physics through Neural Operator Splitting
Semi-Supervised Learning with Noisy Covariates: Generalization Bounds and Distribution Regression
Global Geometry Is Not Enough for Vision Representations
CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing
Unifying Value Alignment and Assignment in Cross-Domain Offline Reinforcement Learning with Heterogeneous Datasets
Scalable Sampling via Generalized Fixed-Point Diffusion Matching
LakeQA: A Benchmark for Complex Exploratory QA over a Million-Scale Data Lake
FreeText: Training-Free Text Rendering via Attention Localization and Spectral Glyph Injection
SCOPE: Selective Conformal Optimized Pairwise LLM Judging
Sharper Generalization Guarantees for Asynchronous SGD: Beyond Lipschitzness, Smoothness and Data Homogeneity
Hide and Seek in Embedding Space: Geometry-based Steganography and Detection in Large Language Models
CAMEL: Confidence-Gated Reflection for Reward Modeling
Mind the Gap: Catching Hallucinations via Evidence Drop on the Reasoning Manifold
Imposing Boundary Conditions on Neural Operators via Learned Function Extensions
Many Experiments, Few Repetitions, Unpaired Data, and Sparse Effects: Is Causal Inference Possible?
Benign Overfitting in Adversarial Training for Vision Transformers
Lions and Muons: Optimization via Stochastic Frank-Wolfe
DiScoFormer: Plug-In Density and Score Estimation with Transformers
Time-CoT: Hierarchical Reasoning with Temporal Semantic Codes for Multivariate Time Series Classification
Safe Autoregressive Image Generation with Iterative Self-Improving Codebooks
This State Looks Like That: Self-Interpretable Reinforcement Learning Agents using Prototype Soft Actor-Critic
WAVE: Window-Aware Vocabulary-Efficient Early-Exit for Training-Free LLM Acceleration
R2R2: Robust Representation for Intensive Experience Reuse via Redundancy Reduction in Self-Predictive Learning
Deep neural networks divide and conquer dihedral multiplication
RedVisor: Reasoning-Aware Prompt Injection Defense via Zero-Copy KV Cache Reuse
DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone
StyleDistillation: A New Insight of Image Style Enables Personalized Aesthetic Manipulation
CONTINUUM: Restoring the Contiguous Tensor Abstraction Efficiently for Dynamic AI Workloads via Hardware Virtualization
Noise as a Natural Regularizer in Markov Decision Processes: Connecting Environmental Stochasticity and Policy Simplicity
Betting on Equilibrium: Monitoring Strategic Behavior in Multi-Agent Systems
Names Don’t Matter: Symbol-Invariant Transformer for Open-Vocabulary Learning
Privately Fine-Tuned LLMs Preserve Temporal Dynamics in Tabular Data
MAC-NeRF: Motion-Aware Curriculum Learning for Dynamic LiDAR NeRFs
Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching
On the Relationship Between Activation Outliers and Feature Death in Sparse Autoencoders
Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving
SpecMD: A Comprehensive Study On Speculative Expert Prefetching
ReJump: A Tree-Jump Representation for Analyzing and Improving LLM Reasoning
Stochastic Lifting for Generating Trajectories of Stochastic Physical Systems
LASER: Learning Active Sensing for Continuum Field Reconstruction
Towards One-to-Many Temporal Grounding
TSP with predictions
Equalized Generative Treatment: Matching f-divergences for Fairness in Generative Models
UniRTL: Unifying Code and Graph for Robust RTL Representation Learning
Brep2Shape: Boundary and Shape Representation Alignment via Self-supervised Transformers
Solving Spatial-Spectral Fusion with Latent Spectral Operators
Diffusion Language Model Parallel Decoding via Product-of-Experts Bridge
Fine-to-Coarse Fairness-Informed Multi-View Clustering
Harnessing Spectrum Video for Subject-Level Few-Shot and Cross-Montage EEG Generalization
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning
Identifying Learnwares via Reduced Neural Conditional Mean Embedding
A Random Matrix Perspective on the Consistency of Diffusion Models
Stronger Benchmarks for Prediction as a Service with Constraints
RSF-GLLM: Bridging the Semantic Gap in Multi-Hop Knowledge Graph QA via Recurrent Soft-Flow and Decoupled LLM Generation
Tight Margin-Based Generalization Bounds for Voting Classifiers over Finite Hypothesis Sets
Alterbute: Editing Intrinsic Attributes of Objects in Images
Many Needles in a Haystack: Active Hit Discovery for Perturbation Experiments
Evolutionary Generation of Multi-Agent Systems
Does Reasoning Improve Seeing? Understanding When Vision-Language Models Benefit from Thinking
Co-Evolving Latent Action World Models
Neural Attention Search Linear: Towards Adaptive Token-Level Hybrid Attention Models
ReflFlow: Learning Geometry-Guided Ray Tracing for Dynamic Specular Reconstruction
RAPNet: Accelerating Algebraic Multigrid with Learned Sparse Corrections
Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control
Towards Uniformity and Alignment for Multimodal Representation Learning
BRIDGE: Triangular Fixed-Point Refinement for Long-Horizon Persona Consistency
Semi-Supervised Noise Adaptation: Transferring Knowledge from Noise Domain
Continual Learning through Control Minimization
BYORn: Bootstrap Your Own Responses to Defend Large Vision-Language Models Against Backdoor Attacks
DPO Unchained: Your Training Algorithm is Secretly Disentangled in Human Choice Theory (and Its Loss' Convexity is Dispensable)
Hierarchical Decision Making with Structured Policies: A Principled Design via Inverse Optimization
Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning
scChord: A Probabilistic Manifold Rectification Framework for RNA-to-Protein Translation
Plug-and-Play Diffusion Meets ADMM: Dual-Variable Coupling for Robust Medical Image Reconstruction
Stability and Generalization of Nonconvex Optimization with Heavy-Tailed Noise
NBCG: Nash-Bargained Causal Game for Long-Tailed Multi-Label NLP
Protein Fold Classification at Scale: Benchmarking and Pretraining
Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning
Concept Heterogeneity-aware Representation Steering
The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind
LIFT: A Novel Framework for Enhancing Long-Context Understanding of LLMs via Long Input Fine-Tuning
LLM Self-Recognition: Steering and Retrieving Activation Signatures
Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding
Reinforcement Learning with Action-Triggered Observations
VSCD: Video-based Scene Change Detection in Unaligned Scenes
Unveiling the Role of Data Uncertainty in Tabular Deep Learning
Learning Permutation Distributions via Reflected Diffusion on Ranks
Understanding MARS: When Scaling Momentum Provably Helps
A Theory of How Pretraining Shapes Inductive Bias in Fine-Tuning
Doubly Robust Distributionally Robust Offline Contextual Pricing
A Graph Foundation Model with Cross-Modal Alignment and Modality-Aware Expert Fusion for Multi-Modal Graphs
Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch
Budget-Efficient Attacks and Robustness Training for Cooperative MARL
Meta-learning Structure-Preserving Dynamics
A Bi-metric Framework for Efficient Nearest Neighbor Search
Omitted Variable Bias in Language Models Under Distribution Shift
Can LLMs Reason Structurally? Benchmarking via the lens of Data Structures
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations
Flexibility-Aware Geometric Latent Diffusion for Full-Atom Peptide Design
Balanced LoRA: Removing Parameter Invariance to Accelerate Convergence
Anchoring Self-Play for Code Repair
Disentangling Geometry, Performance, and Training in Language Models
Large Language Model Agents Are Not Always Faithful Self-Evolvers
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining
LEGO: An LLM-Enabled Hierarchical Optimizer for Tensor Computation Graphs with Structure-Aware Search and Compositional Synthesis
Compass-RoPE: Isotropic Rotary Position Embeddings for Vision Transformers
Supervised Classification Heads as Semantic Prototypes: Unlocking Vision-Language Alignment via Weight Recycling
xLSTM Distillation: Achieving Teacher-Student Parity Through Efficient Hybrid Architectures
Equivariant Covariance Tensors: Guaranteed SPD Uncertainty for Tensor-Valued Geometric Learning
Towards the Training of Deeper Predictive Coding Neural Networks
Compositional Behavioral Semantics and Metrics for State Abstraction in Reinforcement Learning
Torus Graphs for Large Scale Neural Phase Analysis
Quantitative Estimation of Target Task Performance from Unsupervised Pretext Task in Semi/Self-Supervised Learning
When Softmax Fails at the Top: Extreme‑Value Corrections for InfoNCE
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
Mosaic: Unlocking Over 30$\times$ Context Length for Diffusion LLMs Inference via Global Memory Planning and Dynamic Peak Taming
Cascaded Flow Matching for Heterogeneous Tabular Data with Mixed-Type Features
From Internal Diagnosis to External Auditing: A VLM-Driven Paradigm for Data-Free Online Backdoor Defense
DDSVM: A Differentiable Framework for Deep Support Vector Machines with Iterative Geometry-Aware Optimization
Does AI Reviewer See the Full Picture? Attacking and Defending Multimodal Peer Review
The Value Function Semi-Algebraic Set in Partially Observable Markov Decision Processes
CURVE: Learning Causality-Inspired Invariant Representations for Robust Scene Understanding via Uncertainty-Guided Regularization
STAR-KV: Low-Rank KV Cache Compression via Soft Thresholding for Adaptive Rank Control
Categorical Flow Maps
Giving Sensors a Voice: Multimodal JEPA for Semantic Time-Series Embeddings
Vision Transformer Finetuning Benefits from Non-Smooth Components
From Poisoned to Aware: Fostering Backdoor Self-Awareness in LLMs
Conditional Diffusion Sampling
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
Fast k-means Seeding Under The Manifold Hypothesis
Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks
Breaking Multi-Task Curse: Reward-Weighted Evolution for Black-Box Many-Task Optimization
Minimum Distance Summaries for Robust Neural Posterior Estimation
Focus, Align, and Sustain: Counteracting Gradient Dilution in Incremental Object Detection
Distributionally Robust Causal Abstractions
Fair Classification with Efficient and Post-hoc Controllable Fairness-Accuracy Trade-off
PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching
Distributional Active Inference
Dual-branch Robust Unlearnable Examples
Editable Proof Sketch for Automated Theorem Proving
Neural Vector Lyapunov–Razumikhin Certificates for Delayed Interconnected Systems
dgMARK: Decoding-Guided Watermarking for Diffusion Language Models
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing
Reliable Neighborhood-Aware Multi-View Outlier Detection
Spatial Priors via Space Filling Curves for Small and Limited Data Vision Transformers
Neuromem: A Granular Decomposition of the Streaming Lifecycle in External Memory for LLMs
NeuroMamba: A Universal Spatiotemporal Module for Robust Perception in Degraded Sensory Streams
Don't Force the Fit: Bounded Log-Likelihood Loss for Enhanced Reasoning in Large Language Models
Approximation Error Upper and Lower Bounds for Hölder Class with Transformers
Skewness-Robust Causal Discovery in Location-Scale Noise Models
Once-for-All: Scalable Simultaneous Forecasting via Equilibrium State Estimation
MineDraft: A Framework for Batch Parallel Speculative Decoding
Trading Complexity for Expressivity Through Structured Generalized Linear Token Mixing
CADFit: Precise Mesh-to-CAD Program Generation with Hybrid Optimization
Excited Pfaffians: Generalized Neural Wave Functions Across Structure and State
Toward Identifiable Sparse Autoencoders
OOVDet: Low-Density Prior Learning for Zero-Shot Out-of-Vocabulary Object Detection
OptProver: Bridging Olympiad and Optimization through Continual Training in Formal Theorem Proving
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space
Fast Byte Latent Transformer
LATO: 3D Mesh Flow Matching with Structured TOpology Preserving LAtents
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in Unified Multimodal Models via Decompositional Verifiable Reward
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Hierarchical Retrieval at Scale: Bridging Transparency and Efficiency
Shifting the Breaking Point of Flow Matching for Multi-Instance Editing
A Pure Hierarchical Spectral Parcellation Network for Brain Network Analysis
Softsignum: Smooth Your Signum For Better Heterogeneity Handling
Revenue Efficiency of Correlated Equilibria in First Price Auctions
FormAct: Agentic Source Editing for Rich-Format Document Generation
Time-PEFT: Temporal and Multichannel Complexity-Based Fine-Tuning for Time-Series Foundation Models
Early Decisions Matter: Proximity Bias and Initial Trajectory Shaping in Non-Autoregressive Diffusion Language Models
DecAEvolve: Decompose, Adapt, and Evolve, or, Three Pillars of Effective LLM-based Scientific Equation Discovery
HiPPO Zoo: Making Implicit State Space Memory Explicit
World Guidance: World Modeling in Condition Space for Action Generation
Divide and Conquer: Reliable Multi-View Evidential Learning for Deepfake Detection
VELR: Efficient Video Reward Feedback via Ensemble Latent Reward Models
AgentVocab: Structure-Aware Vocabulary Adaptation for Efficient LLM Agents
BrainJanus: A Foundation Model for Unified Understanding and Generation across Brain, Vision, and Language
Revisiting OOD Generalization in Programmatic RL
Evaluating LLMs When They Do Not Know the Answer: Statistical Evaluation of Mathematical Reasoning via Comparative Signals
Telescope: Improving Zero Shot Detection of LLM Generated Content By Measuring Token Repetition Probability
Particles Don’t Care About Z: Towards Scaling Entropy Estimation of Unnormalized Densities
LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning
Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards
Dual Quaternion SE(3) Synchronization with Recovery Guarantees
Know Thyself, Know Thy User: Intrinsic Dual-Perspective Reasoning for Role-Playing LLMs
What is Missing? Explaining Neurons Activated by Absent Concepts
Efficient Hallucination Detection for LLMs Using Uncertainty-Aware Attention Heads
CodeTaste: Can LLMs Generate Human-Level Code Refactorings?
Class-Grouped-Normalized-Momentum and Faster Hyperparameter Exploration to Tackle Class Imbalance in Federated Learning
Discriminative Attribute Graph Clustering Through Topology-Guided Contrastive Learning
Post-Training with Policy Gradients: Optimality and the Base Model Barrier
The Expert Strikes Back: Interpreting Mixture-of-Experts Language Models at Expert Level
Dual-stage Contrastive Learning-enhanced Multi-view Variational Clustering
Query-Based Asymmetric Modeling with Decoupled Input–Output Rates for Speech Restoration
Cost-aware Stopping for Bayesian Optimization
Knowing Who, Not How Much: Learning-Augmented Mechanisms for Consumer Utility Maximization
OPIC: Enhancing Language Model Merging via Optimizing In-Context Capability
Beyond Correctness: Distance-Based Social Dynamics of Multi-Agent Debate
CLASP: Online learning algorithms for Convex Losses And Squared Penalties
Training LLM Agents to Empower Humans
Improving Few-Shot Design Optimization By Exploiting Auxiliary Information
SOLAR: Self-supervised Joint Learning for Symmetric Multimodal Retrieval
Semantic Granularity Navigation in Image Editing
Stabilizing In-Context Multi-Source Domain Adaptation for Biomedical Images Through Controls
Normalizing Flows with Iterative Denoising
ScaleMoE: Mixture-of-Experts for Scalable Continuous Control in Actor-Critic Reinforcement Learning
Sampled hard labels from sparse targets mislead rotation invariant algorithms
ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment
A recipe for scalable attention-based ML potentials: unlocking long-range accuracy with all-to-all node attention
$\mathcal{O}(\log N)$ Latent Dimension Suffices for Universal Approximation of Permutation-invariant Function
InvGNN: Learning Invertible Node Representations on Graphs
Bioacoustic Geolocation: Species Sounds as Geographic Signals
A proximal ADMM for multiblock problems with block anti-upper triangular constraints
Recursive Monte-Carlo Tree Search
Attribution-Guided and Coverage-Maximized Pruning for Structural MoE Compression
On the Intrinsic Limits of Transformer Image Embeddings in Non-Solvable Spatial Reasoning
Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
LORD-GoF: A Robust Online Detection Approach for LLM Watermarks in Sparse and Mixed Streams
TimeSeed: Effective Time Series Forecasting with Sparse Endogenous Variables
NeuroCLUS: A Foundation Model with Functional Clustering for Intracranial Neural Decoding
scDEBART: Predicting in silico Single-Cell Perturbation Responses via Large-Scale Differential Expression Learning
Investigating Advanced Reasoning of Large Language Models via Black-Box Interaction
Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates
Optimizing Network Simulation: Enhancing Performance Prediction Accuracy via Neural Architecture Search
Boost the Identity-Preserving Embedding for Consistent Visual Generation
FOVI: A biologically-inspired foveated interface for deep vision models
Disentangling Intent from Role: Adversarial Self-Play for Persona-Invariant Safety Alignment
Learning Context-Conditioned Predicate Semantics via Prototype Feedback
Improved Convergence Analysis of Topology Dependence in Decentralized SGD
No Global Plan in Sight: Uncover the Myopic Planning Horizon of LLMs
Adaptive Visual Autoregressive Acceleration via Dual-Linkage Entropy Analysis
Non-Euclidean Gradient Descent Operates at the Edge of Stability
M-IDoL: Information Decomposition for Modality-Specific and Diverse Representation Learning in Medical Foundation Model
Attn-QAT: 4-Bit Attention With Quantization-Aware Training
TeamWork: Multivariate Time Series Anomaly Detection via Asymmetric Role-aware Channel Modeling
Memory-Efficient LLM Pretraining via Minimalist Optimizer Design
Large Vision–Language Models Get Lost in Attention
LLM Priors for ERM over Programs
Practical Mechanism for Fault-Tolerant Spiking Neural Networks via Simple Input Control Based on Learnable Fragmentation
LLM4Cov: Execution-Grounded Agent Learning for High-Coverage Hardware Verification
Self-Distillation Enables Continual Learning
Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge
Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model
Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs
LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs
Instance-Level Costs for Nuanced Classifier Evaluation
JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments
MetaBio: Learning from metadata for bioacoustics foundation models
Optimal Rates for Feasible Payoff Set Estimation in Games
The Interplay Between Interpolation and Aggregation in Regression: Optimal Sample Complexity
Training-free Composition of Pre-trained GFlowNets for Multi-Objective Generation
When less gives more: bias from small dataset can speed up training
Collaborative Learning for Semi-Supervised LiDAR Semantic Segmentation
An Exterior Method for Nonnegative Matrix Factorization
Lost in Context: Discovering Context Anxiety in Large Language Models
HEARTS: Benchmarking LLM Reasoning on Health Time Series
Memory as a Markov Matrix: Sample Efficient Knowledge Expansion via Token-to-Dictionary Mapping
Vision-aligned Latent Reasoning for Multi-Modal Large Language Model
DREAM: Dual-Standard Semantic Homogeneity with Dynamic Optimization for Graph Learning with Label Noise
Nested birth-death processes are competitive with parameter-heavy neural networks as time-dependent models of protein evolution
CausalX: A Unified and Causally-Interpretable Plug-and-Play Model for Multi-modal Spatio-Temporal Forecasting
Diversity-Aware Recursive Feature Multiple Kernel Learning
ConFlux: Multivariate Time Series in Flux, One Unified Forecast in Confluence
Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History
EpiCoCo: De Novo Epitope Generation via MHC-Context Co-Modeling and Contrastive Affinity Guidance
Parametric Prior Mapping Framework for Non-stationary Probabilistic Time Series Forecasting
RouterInterp: Understanding Superposed Specialisation in Mixture of Experts Routing
Practical and Optimal Algorithm for Linear Contextual Bandits with Rare Parameter Updates
IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL
Perceptrons and Localization of Attention’s Mean-Field Landscape
Identifying and Correcting Label Noise for Robust GNNs via Influence Contradiction
Fast Reconstruction of Mixtures of Bernoulli Product Distributions
How does Bayesian Sampling help Membership Inference Attacks?
MINIM: Privacy-Aware Minimal View for Agents via Trusted Local Sanitization
MIRA: A Score for Conditional Distribution Accuracy and Model Comparison
Partial Identification under High-Dimensional Potential Outcomes and Confounders via Optimal Transport
Test-Time Anchoring for Discrete Diffusion Posterior Sampling
Distribution Matching Variational AutoEncoder
Bits That Count: Quantifying and Predicting Capabilities of Language Models
OSF: On Pre-training and Scaling of Sleep Foundation Models
Twice Sequential Monte Carlo for Tree Search
Amortized Simulation-Based Inference in Generalized Bayes via Neural Posterior Estimation
General Covariant Action Modeling: Constructing Generalized Manifolds via Spatio-Temporal Decoupling
Closing the Sim-to-Real Gap in Non-Markovian Spreading Processes via GPU-Accelerated Distributional RL
d3LLM: Ultra-Fast Diffusion LLM using Pseudo-Trajectory Distillation
Instance-Specific Approximation Ratios for Correlation Clustering and Max-Cut
Robust In-Context Reinforcement Learning Under Reward Poisoning Attacks
VIA-SD: Verification via Intra-Model Routing for Speculative Decoding
A Unifying View of Variational Generative Wasserstein Flows
Towards the Explainability of Temporal Graph Networks via Memory Backtracking and Topological Attribution
ExpAlign: Expectation-Guided Vision–Language Alignment for Open-Vocabulary Grounding
MotionGRPO: Overcoming Low Intra-Group Diversity in GRPO-Based Egocentric Motion Recovery
Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding
Revisiting Regularized Policy Optimization for Stable and Efficient Reinforcement Learning in Two-Player Games
Optimal Self-Consistency for Efficient Reasoning with Large Language Models
Continuity-Regularized Flow Matching for Offline Reinforcement Learning
Causally Evaluating the Learnability of Formal Language Tasks
Understanding Self-Supervised Learning via Latent Distribution Matching
Learn from Your Mistakes: Tree-like Self-Play on Vulnerability Nodes for Secure Code LLMs
Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective
Personalized Image Generation via Human-in-the-loop Bayesian Optimization
ProactiveLLM: Learning Active Interaction for Streaming Large Language Models
Automatic Layer Selection for Hallucination Detection
Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning
CAST: Modeling Visual State Transitions for Consistent Video Retrieval
Latent Collaboration in Multi-Agent Systems
Fast KV Compaction via Attention Matching
Improving the Performance and Learning Stability of Parallelizable RNNs Designed for Ultra-Low Power Applications
Clipping Low-Probability Tokens in SFT Yields a Generalizable Initialization for RL
PCA of Probability Measures: Sparse and Dense Sampling Regimes
Universal Algorithm-Implicit Learning
Improved Dynamic Algorithm for Non-monotone Submodular Maximization under Cardinality Constraint
3D Scene Assertion Verification
Tvcache: A Tool-Value Cache for Post-Training LLM Agents
Sample from What You See: Visuomotor Policy Learning via Diffusion Bridge with Observation-Embedded Stochastic Differential Equation
Darwinian Memory: A Training-Free Self-Regulating Memory System for GUI Agent Evolution
Efficient Inference for Noisy LLM-as-a-Judge Evaluation
Principled RL for Flow Matching Emerges From the Chunk-level Policy Optimization
A Graphop Analysis of Graph Neural Networks on Sparse Graphs: Generalization and Universal Approximation
Toward Structural Multimodal Representations: Specialization, Selection, and Sparsification via Mixture-of-Experts
JADAI: Jointly Amortizing Adaptive Design and Bayesian Inference
Blocking the Leakage: Manifold-Aware Gradient Projection for Long-Horizon Test-Time Adaptation
Diversity Over Frequency: Rethinking Tool Use in Visual Chain-of-Thought Agents
HVR-Met: A Hypothesis-Verification-Replaning Agentic System for Extreme Weather Diagnosis
Variational Learning for Insertion-based Generation
Continuous Variable Hamiltonian Learning at Heisenberg Limit via Displacement-Random Unitary Transformation
VERA-V: Variational Inference Framework for Jailbreaking Vision-Language Models
Rethinking the Hardness of PbRL: A Provable General Regret Bound
Positional Encoding for Spiking Transformers
Thinned Mean Field Langevin Dynamics
Multi-Objective Bayesian Optimization via Adaptive $\varepsilon$-Constraint Decomposition
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning
LineageFlow: Flow Matching for High-Fidelity Family-Aware Protein Sequence Generation
Interpretability and Generalization Bounds for Learning Spatial Physics
Collaborative and Efficient Fine-tuning: Leveraging Task Similarity
Faster Activation Functions at the Edge for Post-Training Speedups
BEST: Benchmarking Efficiency in Space and Time for LLM-Generated Code
Demystifying Mergeability: Interpretable Properties to Predict Model Merging Success
Priority-Aware Shapley Value
DiscoverLLM: From Executing Intents to Discovering Them
From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing
Flexible Kernels for Protein Property Prediction
Improving Backward Conformal Prediction via Non-Conformity Score Transformation
Performative Learning Theory
TimeOmni-VL: Unified Models for Time Series Understanding and Generation
$\textit{S}$-SPPO: Semantic-Calibrated Self-Play Preference Optimization
Spectra: Rethinking Optimizers for LLMs Under Spectral Anisotropy
Chain-of-Thought Gradient Descent
Expanding the Chaos: Neural Operator for Stochastic (Partial) Differential Equations
$\texttt{ShaplEIG}$: Bayesian Experimental Design for Shapley Value Estimation
Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning
DiasR: Dual-Modal Identity-Anchored Sparse Routing for Efficient Multi-Subject Video Generation
When More Experts Hurt: Underfitting in Multi-Expert Learning to Defer
Self-Augmenting Retrieval for Diffusion Language Models
MMClima: A Framework for Multimodal Climate Science Data and Evaluation
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers
From geometry to dynamics: Learning overdamped Langevin dynamics from sparse observations with geometric constraints
Hallucination is a Consequence of Space-Optimality: A Rate-Distortion Theorem for Membership Testing
Optimal structure learning and conditional independence testing
Diagnosing and Correcting Concept Omission in Multimodal Diffusion Transformers
Spectral Gradient Descent Mitigates Anisotropy-Driven Misalignment: A Case Study in Phase Retrieval
Evidential Copula Concept Embedding Models
Toward Safe Quantization-Aware Fine-tuning: Understanding and Mitigating Safety Alignment Degradation
SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations
CORE-MTL: Rethinking Gradient Balancing via Causal Orthogonal Representations
Joint Model and Data Sparsification via the Marginal Likelihood
Natural Language Actor–Critic Is Bilevel: Learning to Reason with Textual Feedback
Security–Fidelity Tradeoffs: No Universal Defense Against Prompt Injection
Reverse Flow Matching: A Unified Framework for Online Reinforcement Learning with Diffusion and Flow Policies
Probing the Inductive Bias of Neural Networks through Learning Random Cellular Automata
Evaluating AI Grading on Real-World Handwritten College Mathematics: A Large-Scale Study Toward a Benchmark
Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization
Cutting LLM Evaluation Costs with SySRs: A Bandit Algorithm that Provably Exploits Model Similarity
Feature Collapse Under Corruption: An Entropy Perspective on Robust Neural Networks
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models
On the Convergence of Steepest Descent and Adaptive Gradient Methods under Non-Uniform Smoothness
The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?
FiGuRO - Intrinsic Dimension Estimation for Multi-Modal Data
On Efficient Scaling of GNNs via IO-Aware Layers Implementations
DRIFT-BENCH: Diagnosing CoopeRative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction
Dynamics and representation structure of local approximations to gradient-based learning in linear recurrent neural networks
SpeedVFI: One-step Diffusion for Efficient Video Frame Interpolation
Drop-in Circulant Structural Priors for Transformer Decoding of Cyclic Codes
Global Merger-Arbitrage Forecasting with Language Models
GuidedBridge: Training-freely Improving Bridge Models with Prior Guidance
Segment-driven Structural Induction and Semantic Alignment for Heterogeneous Tabular Representation
Tightening the Score Matching Gap for Diffusion Models
Vector Linking via Cross-Model Local Isometric Consistency
FRIGID: Scaling Diffusion-Based Molecular Generation from Mass Spectra at Training and Inference Time
Who Gets Credit or Blame? Attributing Accountability in Modern AI Systems
Multilingual Safety Alignment via Representation-Space Separability
CoFrGeNet: Continued Fraction Architectures for Language Generation
Discrete Survival Knowledge Distillation for Competing Risks Analysis
Interpreting and Enhancing Emotional Circuits in Large Vision-Language Models via Cross-Modal Information Flow
Impact of Connectivity on Laplacian Representations in Reinforcement Learning
ProjQ: Project-and-Quantize for Adapter-Aware LLM Compression
Learning to Theorize the World from Observation
MODEL SOUPS NEED ONLY ONE INGREDIENT
GenUnfold: Rapidly Predict Protein Mechanical Unfolding Trajectory via a Physics-Guided Diffusion Model
Improved Distribution Estimation in $\ell_\infty$
MOC: Multi-Order Communication in LLM-based Multi-Agent Systems
Safe and Scalable Web Agent Learning via Recreated Websites
Towards Professional-Grade Financial Agents: Benchmarking, Tooling, and Structured Reasoning
PSMix: Robust Point Cloud Recognition through Spectral Domain Mixing
RL-SPH: Learning to Achieve Feasible Solutions for Integer Linear Programs
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
Normalized Rewards for Preference Optimization
An Odd Estimator for Shapley Values
Forensic Prompting with Dual-Action Policy Optimization for Vision-Language Forgery Detection and Localization
Gradient Smoothing: Coupling Layer-wise Updates for Improved Optimization
Federated Manifold Learning (FML): Tackling Domain Heterogeneity with Structural Knowledge Transfer
TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning
MulFCoder: Framework-conditioned Multi-agent for MLLM-based Multi-framework Front-end Code Generation
ElicitR: Unlocking Latent Reasoning in Dense Retrievers via Generative Regularization
SCOUT: Cyclic Causal Discovery Under Soft Interventions with Unknown Targets
FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation
Open Materials Generation with Inference-Time Reinforcement Learning
Deep Incentive Design with Differentiable Equilibrium Blocks
CoarseBind: Fast and Accurate Binding Affinity Prediction through Coarse Structural Representations
Learning, Solving and Optimizing PDEs with TensorGalerkin: an efficient high-performance Galerkin assembly algorithm
On the Sharp Input-Output Analysis of Nonlinear Systems under Adversarial Attacks
Physics from Video: Identifiability of Time-Invariant Second-Order ODEs under Minimal Trajectory Conditions
LIF Recurrent Memory Enables Long-Horizon Spiking Computation
HumanLM: Simulating Users with State Alignment Beats Response Imitation
Fourier Features Let Agents Learn High Precision Policies with Imitation Learning
Learning Efficient Guardrails for Compliance
PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling
A Judge-Aware Ranking Framework for Evaluating Large Language Models without Ground Truth
GeoAlign: Geometric Rollout Curation for Robust LLM Reinforcement Learning
Modular Pretraining Enables Access Control
Unfolding Generative Flows with Koopman Operators: Trajectory-Preserving Linearization
Robust Cross-Modal Retrieval via Generative Semantic Refinement and Exclusion-Guided Adaptation
Meta-iLaD: Identifiable Latent Dynamics via Meta-Learning of Dynamics Environments
What Information Matters? Graph Out-of-Distribution Detection via Tri-Component Information Decomposition
When LLMs Encounter Open-world Graph Learning: A Fresh View on Unlabeled Data Uncertainty
Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck
N2M: Bridging Navigation and Manipulation by Learning Pose Preference from Rollout
On Regret Bounds of Thompson Sampling for Bayesian Optimization
Optimization, Generalization and Differential Privacy Bounds for Gradient Descent on Kolmogorov–Arnold Networks
Refined Analysis of Entropy-Regularized Actor-Critic
Efficient Multi-Agent Reasoning via Confidence-Guided Adaptive Debate
Exploiting weight-space symmetries for approximating curvature
ReMoE: Boosting Expert Reuse through Router Fine-Tuning in Memory-Constrained MoE LLM Inference
AGoQ: Activation and Gradient Quantization for Memory-Efficient Distributed Training of LLMs
Chunk-Guided Q-Learning
Di-BiLPS: Denoising induced Bidirectional Latent-PDE-Solver under Sparse Observations
Failure is Feedback: History-Aware Backtracking for Agentic Traversal in Multimodal Graphs
IO-Adam: Rethinking Memory-Efficient Adaptive Optimizers from Gradient Computation
InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training
Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs
Textual Supervision Enhances Geospatial Representations in Vision-Language Models
Turning Back Without Forgetting: Selective Backward Refinement for Parameter-Efficient Continual Learning
IRIS: Implicit Reward-Guided Internal Sifting for Mitigating Multimodal Hallucination
No Free Lunch: Non-Asymptotic Analysis of Prediction-Powered Inference
Generalized Discrete Diffusion with Self-Correction
VeRO: An Evaluation Harness for Agents to Optimize Agents
Generative Large Neighborhood Search: Scalable Set Cover Optimization via Discrete Diffusion
Learning Multi-Timescale Abstractions for Hierarchical Combinatorial Planning
How to Avoid Debate: Scalable AI Safety via Doubly-Efficient Interactive Proofs
Target-Agnostic Calibration under Distribution Shift with Frequency-Aware Gradient Rectification
RaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM Inference
TapSampling: Inference-Time Sampling with a Task-Progress-Understanding Verifier for Robotic Manipulation
Polishing-Only Policies in Peer Reviews are Currently Not Enforceable
ROAMM: A Benchmark Dataset for Multimodal Human Attention Decoding and EEG-to-Text Modeling During Naturalistic Reading
Why Specialist Models Still Matter: A Heterogeneous Multi-Agent Paradigm for Medical Artificial Intelligence
Attention's forward pass and Frank-Wolfe
Grouter: Decoupling Routing from Representation for Accelerated MoE Training
HiST: A Hierarchical Sparse Transformer for Cross-Modal Spatial Transcriptomics Modeling
A3: an Analytical Low-Rank Approximation Framework for Attention
CompleteP for RL: Maintaining Feature Learning When Scaling Deep Reinforcement Learning
Optimizing Agentic Reasoning with Retrieval via Synthetic Semantic Information Gain Reward
FlashSketch: Sketch-Kernel Co-Design for Fast Sparse Sketching on GPUs
Optimal Attention Temperature Improves the Robustness of In-Context Learning under Distribution Shift in High Dimensions
When Embedding-Based Defenses Fail: Rethinking Safety in LLM-Based Multi-Agent Systems
Scaling Law for Quantization-Aware Training
LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws
What If We Allocate Test-Time Compute Adaptively?
LUCID: Attention with Preconditioned Representations
Richer Bayesian Last Layers with Subsampled NTK Features
Spectral Collapse Drives Loss of Plasticity in Deep Continual Learning
We use cookies to store which papers have been visited.
I agree
Successful Page Load
ICML uses cookies for essential functions only. We do not sell your personal information.
Our Privacy Policy »
Accept
We use cookies to store which papers have been visited.
I agree