Papers

Recent research papers from ArXiv and academia

247 articles

One Mask to Rule Them All: On Hidden Facts after Editing and How to Find Them
AI & ML

One Mask to Rule Them All: On Hidden Facts after Editing and How to Find Them

ai & ml
2 min read★★★☆☆
Read Breakdown →
Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents
AI & ML

Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?
AI & ML

Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?

ai & ml
2 min read★★★☆☆
Read Breakdown →
Molecular Lead Optimization via Agentic Tool Planning
AI & ML

Molecular Lead Optimization via Agentic Tool Planning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Self-Play Reinforcement Learning under Imperfect Information in Big 2
AI & ML

Self-Play Reinforcement Learning under Imperfect Information in Big 2

ai & ml
2 min read★★★☆☆
Read Breakdown →
Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision
AI & ML

Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision

ai & ml
2 min read★★★☆☆
Read Breakdown →
Continuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Models
AI & ML

Continuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation
AI & ML

PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation

ai & ml
2 min read★★★☆☆
Read Breakdown →
TaxDistill: Improving Metagenomic Taxonomic Annotation via Distilled Genomic Foundation Models
AI & ML

TaxDistill: Improving Metagenomic Taxonomic Annotation via Distilled Genomic Foundation Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Balancing Multimodal Learning through Label Space Reshaping
AI & ML

Balancing Multimodal Learning through Label Space Reshaping

ai & ml
2 min read★★★☆☆
Read Breakdown →
Representation Alignment Rests on Linear Structure
AI & ML

Representation Alignment Rests on Linear Structure

ai & ml
2 min read★★★☆☆
Read Breakdown →
Pre-Registering the Detectable Effect: A Paired-MDE Budget for 4-bit Quantization Benchmarks, with a Pilot Audit
AI & ML

Pre-Registering the Detectable Effect: A Paired-MDE Budget for 4-bit Quantization Benchmarks, with a Pilot Audit

ai & ml
2 min read★★★☆☆
Read Breakdown →
Towards Continuous-time Causal Foundation Models
AI & ML

Towards Continuous-time Causal Foundation Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Context Distillation as Latent Memory Management
AI & ML

Context Distillation as Latent Memory Management

ai & ml
2 min read★★★☆☆
Read Breakdown →
Feature Geometry of LoRA Adapters: A Sparse Autoencoder Analysis of Representational Divergence in Fine-Tuned Language Models
AI & ML

Feature Geometry of LoRA Adapters: A Sparse Autoencoder Analysis of Representational Divergence in Fine-Tuned Language Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Spectral Guidance for Flexible and Efficient Control of Diffusion Models
AI & ML

Spectral Guidance for Flexible and Efficient Control of Diffusion Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Sequential Physics-Constrained Neural Operator Forward Modeling for the $\textit{Norne}$ Reservoir System
AI & ML

Sequential Physics-Constrained Neural Operator Forward Modeling for the $\textit{Norne}$ Reservoir System

ai & ml
2 min read★★★☆☆
Read Breakdown →
Cycle-Space Informed Detection of Autoencoded Blind False Data Injection Attacks on Power Systems
AI & ML

Cycle-Space Informed Detection of Autoencoded Blind False Data Injection Attacks on Power Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL
AI & ML

When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL

ai & ml
2 min read★★★☆☆
Read Breakdown →
CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models
AI & ML

CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Conf-Gen: Conformal Uncertainty Quantification for Generative Models
AI & ML

Conf-Gen: Conformal Uncertainty Quantification for Generative Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio
AI & ML

A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio

ai & ml
2 min read★★★☆☆
Read Breakdown →
Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detection
AI & ML

Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detection

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Hamilton-Jacobi Theory of Deep Learning
AI & ML

The Hamilton-Jacobi Theory of Deep Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Learning Robust and Task-Invariant Functional Representation from fMRI through Siamese Self-Supervised Learning
AI & ML

Learning Robust and Task-Invariant Functional Representation from fMRI through Siamese Self-Supervised Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks
AI & ML

FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks

ai & ml
2 min read★★★☆☆
Read Breakdown →
FedQHD: Closed-Form Function-Space Federated Reinforcement Learning
AI & ML

FedQHD: Closed-Form Function-Space Federated Reinforcement Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers
TECH BUSINESS

LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers

tech business
2 min read★★★☆☆
Read Breakdown →
Causal Intelligence for Constraint-Aware Intervention Design to Induce State Transitions
AI & ML

Causal Intelligence for Constraint-Aware Intervention Design to Induce State Transitions

ai & ml
2 min read★★★☆☆
Read Breakdown →
Label-Free Reinforcement Learning via Cross-Model Entropy
AI & ML

Label-Free Reinforcement Learning via Cross-Model Entropy

ai & ml
2 min read★★★☆☆
Read Breakdown →
ICG: Improving Cover Image Generation via MLLM-based Prompting and Personalized Preference Alignment
AI & ML

ICG: Improving Cover Image Generation via MLLM-based Prompting and Personalized Preference Alignment

ai & ml
2 min read★★★☆☆
Read Breakdown →
LCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasks
AI & ML

LCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasks

ai & ml
2 min read★★★☆☆
Read Breakdown →
Unlocking Fine-Grained and Within-Utterance Speaking Style Control in Prompt-Based Text-to-Speech Models
AI & ML

Unlocking Fine-Grained and Within-Utterance Speaking Style Control in Prompt-Based Text-to-Speech Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Enhancing LLM Medical Coding with Structured External Knowledge
AI & ML

Enhancing LLM Medical Coding with Structured External Knowledge

ai & ml
2 min read★★★☆☆
Read Breakdown →
OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis
AI & ML

OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis

ai & ml
2 min read★★★☆☆
Read Breakdown →
BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking
AI & ML

BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking

ai & ml
2 min read★★★☆☆
Read Breakdown →
Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models
AI & ML

Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
From AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizons
AI & ML

From AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizons

ai & ml
2 min read★★★☆☆
Read Breakdown →
Modeling Community Attitude through Reaction Tone: A Human-AI Collaborative Framework for Evaluating LLM Alignment with Linguistic Behaviors in Online Communities
AI & ML

Modeling Community Attitude through Reaction Tone: A Human-AI Collaborative Framework for Evaluating LLM Alignment with Linguistic Behaviors in Online Communities

ai & ml
2 min read★★★☆☆
Read Breakdown →
EvoSpec: Evolving Speculative Decoding via Real-Time Vocabulary and Parameter Adaptation
AI & ML

EvoSpec: Evolving Speculative Decoding via Real-Time Vocabulary and Parameter Adaptation

ai & ml
2 min read★★★☆☆
Read Breakdown →
StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation
AI & ML

StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation

ai & ml
2 min read★★★☆☆
Read Breakdown →
Debate Helps Weak Judges Reward Stronger Models
AI & ML

Debate Helps Weak Judges Reward Stronger Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
PAST2HARM: A Simple Adaptive Past Tense Attack for Jailbreaking Multimodal AI
AI & ML

PAST2HARM: A Simple Adaptive Past Tense Attack for Jailbreaking Multimodal AI

ai & ml
2 min read★★★☆☆
Read Breakdown →
Keyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomies
AI & ML

Keyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomies

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Future of Facts: Tracing the Factual Generation-Verification Gap
AI & ML

The Future of Facts: Tracing the Factual Generation-Verification Gap

ai & ml
2 min read★★★☆☆
Read Breakdown →
Can Hallucinations Be Useful? Solving Multi-Hop Questions With SLMs By Chaining System-I/II Reasoning
AI & ML

Can Hallucinations Be Useful? Solving Multi-Hop Questions With SLMs By Chaining System-I/II Reasoning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering
AI & ML

Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering

ai & ml
2 min read★★★☆☆
Read Breakdown →
Learning to Translate from Soft to Hard LLM Prompts
AI & ML

Learning to Translate from Soft to Hard LLM Prompts

ai & ml
2 min read★★★☆☆
Read Breakdown →
Disentangling Language Roles in Multilingual LLM Task Execution
AI & ML

Disentangling Language Roles in Multilingual LLM Task Execution

ai & ml
2 min read★★★☆☆
Read Breakdown →
Cultural Fidelity in English-to-Hindi Translation: A Preservation-Fluency Frontier for Gender Recoverability
AI & ML

Cultural Fidelity in English-to-Hindi Translation: A Preservation-Fluency Frontier for Gender Recoverability

ai & ml
2 min read★★★☆☆
Read Breakdown →
TRACES: Proactive Safety Auditing for Multi-Turn LLM Agents via Trajectory-State Modeling
AI & ML

TRACES: Proactive Safety Auditing for Multi-Turn LLM Agents via Trajectory-State Modeling

ai & ml
2 min read★★★☆☆
Read Breakdown →
Chain-based Adaptive Reconfiguration Over Lattices for Hallucination Reduction
AI & ML

Chain-based Adaptive Reconfiguration Over Lattices for Hallucination Reduction

ai & ml
2 min read★★★☆☆
Read Breakdown →
ReverseMath: Answer Inversion for Scalable and Verifiable Mathematical Problem Generation
AI & ML

ReverseMath: Answer Inversion for Scalable and Verifiable Mathematical Problem Generation

ai & ml
2 min read★★★☆☆
Read Breakdown →
Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs
AI & ML

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs

ai & ml
2 min read★★★☆☆
Read Breakdown →
UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind
AI & ML

UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind

ai & ml
2 min read★★★☆☆
Read Breakdown →
UNIQUE: Universal Top-k Sparse Attention for Training-free Inference and Sparsity-aware Training
AI & ML

UNIQUE: Universal Top-k Sparse Attention for Training-free Inference and Sparsity-aware Training

ai & ml
2 min read★★★☆☆
Read Breakdown →
Escape the Language Prior: Mitigating Late-Stage Modality Collapse in Audio Reasoning via Modality-Aware Policy Optimization
AI & ML

Escape the Language Prior: Mitigating Late-Stage Modality Collapse in Audio Reasoning via Modality-Aware Policy Optimization

ai & ml
2 min read★★★☆☆
Read Breakdown →
Reading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editions
AI & ML

Reading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editions

ai & ml
2 min read★★★☆☆
Read Breakdown →
UniMaia: Steering Chess Policies with Language for Human-like Play
AI & ML

UniMaia: Steering Chess Policies with Language for Human-like Play

ai & ml
2 min read★★★☆☆
Read Breakdown →
Do Models Know Why They Changed Their Mind? Interpretability and Faithfulness of Chain-of-Thought Under Knowledge Conflict
AI & ML

Do Models Know Why They Changed Their Mind? Interpretability and Faithfulness of Chain-of-Thought Under Knowledge Conflict

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Download: unlocking lithium and controlling Ebola
AI & ML

The Download: unlocking lithium and controlling Ebola

ai & ml
2 min read★★★☆☆
Read Breakdown →
The deadly Ebola outbreak is proving difficult to control
AI & ML

The deadly Ebola outbreak is proving difficult to control

ai & ml
2 min read★★★☆☆
Read Breakdown →
How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment
AI & ML

How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment

ai & ml
2 min read★★★☆☆
Read Breakdown →
Behavior-Induced Mirror-Prox Temporal-Difference Learning for Faster Off-Policy Prediction
AI & ML

Behavior-Induced Mirror-Prox Temporal-Difference Learning for Faster Off-Policy Prediction

ai & ml
2 min read★★★☆☆
Read Breakdown →
Behavior-Aware Auxiliary Corrections for Off-Policy Temporal-Difference Prediction
AI & ML

Behavior-Aware Auxiliary Corrections for Off-Policy Temporal-Difference Prediction

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling
AI & ML

The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling

ai & ml
2 min read★★★☆☆
Read Breakdown →
Ultra-Reduced-Impact-Encased-Logging (URIEL): propose a new method for selective sustainable logging and post-harvest silvicultural treatment in tropical forest using airborne robotics systems
ENGINEERING

Ultra-Reduced-Impact-Encased-Logging (URIEL): propose a new method for selective sustainable logging and post-harvest silvicultural treatment in tropical forest using airborne robotics systems

engineering
2 min read★★★☆☆
Read Breakdown →
Review Arcade: On the Human Alignment and Gameability of LLM Reviews
AI & ML

Review Arcade: On the Human Alignment and Gameability of LLM Reviews

ai & ml
2 min read★★★☆☆
Read Breakdown →
Orthogonal Concept Erasure for Diffusion Models
AI & ML

Orthogonal Concept Erasure for Diffusion Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Frontier LLM-based agents can overcome the ontology curation bottleneck for natural phenotypes
AI & ML

Frontier LLM-based agents can overcome the ontology curation bottleneck for natural phenotypes

ai & ml
2 min read★★★☆☆
Read Breakdown →
VFEAgent: A Multimodal Agent Framework for End-to-End Automated Finite Element Analysis
AI & ML

VFEAgent: A Multimodal Agent Framework for End-to-End Automated Finite Element Analysis

ai & ml
2 min read★★★☆☆
Read Breakdown →
BEAMS: Benchmarking and Evaluating AI for Modeling and Simulation
TECH BUSINESS

BEAMS: Benchmarking and Evaluating AI for Modeling and Simulation

tech business
2 min read★★★☆☆
Read Breakdown →
Adopt $\neq$ Adapt: Longitudinal Analyses of LLM Conversations in the Wild
AI & ML

Adopt $\neq$ Adapt: Longitudinal Analyses of LLM Conversations in the Wild

ai & ml
2 min read★★★☆☆
Read Breakdown →
When Models Disagree: Rethinking LLM Evaluation for Public Comment Analysis
AI & ML

When Models Disagree: Rethinking LLM Evaluation for Public Comment Analysis

ai & ml
2 min read★★★☆☆
Read Breakdown →
Mind Your Tone: Does Tone Alter LLM Performance?
AI & ML

Mind Your Tone: Does Tone Alter LLM Performance?

ai & ml
2 min read★★★☆☆
Read Breakdown →
Practitioner Beliefs and Behaviors in AI-Enhanced Education: DOT Framework Survey Evidence
AI & ML

Practitioner Beliefs and Behaviors in AI-Enhanced Education: DOT Framework Survey Evidence

ai & ml
2 min read★★★☆☆
Read Breakdown →
Differentiable Belief-based Opponent Shaping
AI & ML

Differentiable Belief-based Opponent Shaping

ai & ml
2 min read★★★☆☆
Read Breakdown →
Hallucination Mitigation with Agentic AI, Nested Learning, and AI Sustainability via Semantic Caching
AI & ML

Hallucination Mitigation with Agentic AI, Nested Learning, and AI Sustainability via Semantic Caching

ai & ml
2 min read★★★☆☆
Read Breakdown →
Robust and Efficient Guardrails with Latent Reasoning
AI & ML

Robust and Efficient Guardrails with Latent Reasoning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Bridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semantics
AI & ML

Bridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semantics

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane
ENGINEERING

The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

engineering
2 min read★★★☆☆
Read Breakdown →
The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure
AI & ML

The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure

ai & ml
2 min read★★★☆☆
Read Breakdown →
Trends in AI and Human-AI Interaction in Clinical Trials -- A Hybrid Human-AI Exploration
AI & ML

Trends in AI and Human-AI Interaction in Clinical Trials -- A Hybrid Human-AI Exploration

ai & ml
2 min read★★★☆☆
Read Breakdown →
Beyond Consensus: Trace-Level Synthesis in Mixture of Agents
AI & ML

Beyond Consensus: Trace-Level Synthesis in Mixture of Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
PRO-CUA: Process-Reward Optimization for Computer Use Agents
AI & ML

PRO-CUA: Process-Reward Optimization for Computer Use Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models
AI & ML

The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
Governing Technical Debt in Agentic AI Systems
AI & ML

Governing Technical Debt in Agentic AI Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
Better Later Than Sooner: Neuro-Symbolic Knowledge Graph Construction via Ontology-grounded Post-extraction Correction
AI & ML

Better Later Than Sooner: Neuro-Symbolic Knowledge Graph Construction via Ontology-grounded Post-extraction Correction

ai & ml
2 min read★★★☆☆
Read Breakdown →
Paper Agents, Paper Gains: An Empirical Analysis of DeFi Investment Agents
AI & ML

Paper Agents, Paper Gains: An Empirical Analysis of DeFi Investment Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
ReasonOps: Operator Segmentation for LLM Reasoning Traces
AI & ML

ReasonOps: Operator Segmentation for LLM Reasoning Traces

ai & ml
2 min read★★★☆☆
Read Breakdown →
GTA: Generating Long-Horizon Tasks for Web Agents at Scale
AI & ML

GTA: Generating Long-Horizon Tasks for Web Agents at Scale

ai & ml
2 min read★★★☆☆
Read Breakdown →
BenchTrace: A Benchmark for Testing Reflection Ability and Controlled Evolution in LLM Agents
AI & ML

BenchTrace: A Benchmark for Testing Reflection Ability and Controlled Evolution in LLM Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
Tailoring the Curriculum: Student-Centered Reasoning Distillation via Dynamic Data-Model Compatibility
AI & ML

Tailoring the Curriculum: Student-Centered Reasoning Distillation via Dynamic Data-Model Compatibility

ai & ml
2 min read★★★☆☆
Read Breakdown →
How a new extraction process could unlock the world’s lithium
AI & ML

How a new extraction process could unlock the world’s lithium

ai & ml
2 min read★★★☆☆
Read Breakdown →
Personalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneity
AI & ML

Personalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneity

ai & ml
2 min read★★★☆☆
Read Breakdown →
IGADA-IoT: IoT Sensor Energy Optimization in Wireless Sensor Networks Driven by Automatic Data Augmentation
AI & ML

IGADA-IoT: IoT Sensor Energy Optimization in Wireless Sensor Networks Driven by Automatic Data Augmentation

ai & ml
2 min read★★★☆☆
Read Breakdown →
A Simple State Space Model Excels at Multivariate Time Series Classification
TECH BUSINESS

A Simple State Space Model Excels at Multivariate Time Series Classification

tech business
2 min read★★★☆☆
Read Breakdown →
$E^3$-Agent: An Executable and Evolving Agent for Resource Management of Edge Generative Inference
AI & ML

$E^3$-Agent: An Executable and Evolving Agent for Resource Management of Edge Generative Inference

ai & ml
2 min read★★★☆☆
Read Breakdown →
Tackling Multimodal Learning Challenges with Mixture-of-Expert: A Survey
AI & ML

Tackling Multimodal Learning Challenges with Mixture-of-Expert: A Survey

ai & ml
2 min read★★★☆☆
Read Breakdown →
Metric-Aware PCA as a Linear Instance of Geometric Deep Learning
AI & ML

Metric-Aware PCA as a Linear Instance of Geometric Deep Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Comparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utility
AI & ML

Comparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utility

ai & ml
2 min read★★★☆☆
Read Breakdown →
Architecture-driven Shift: towards a lightweight selector for capturing the trends of logit shift
AI & ML

Architecture-driven Shift: towards a lightweight selector for capturing the trends of logit shift

ai & ml
2 min read★★★☆☆
Read Breakdown →
Detect by Yourself: Self-Designing Agentic Workflows for Few-Shot Graph Anomaly Detection
AI & ML

Detect by Yourself: Self-Designing Agentic Workflows for Few-Shot Graph Anomaly Detection

ai & ml
2 min read★★★☆☆
Read Breakdown →
HEAL: Resilient and Self-* Hub-based Learning
AI & ML

HEAL: Resilient and Self-* Hub-based Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Balancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspective
AI & ML

Balancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspective

ai & ml
2 min read★★★☆☆
Read Breakdown →
Resource-Constrained Affect Modelling via Variance Regularisation Pruning
AI & ML

Resource-Constrained Affect Modelling via Variance Regularisation Pruning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Energy-Structured Low-Rank Adaptation for Continual Learning
AI & ML

Energy-Structured Low-Rank Adaptation for Continual Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Federated Learning for Multivariate Time Series Anomaly Detection in Industrial Automation
TECH BUSINESS

Federated Learning for Multivariate Time Series Anomaly Detection in Industrial Automation

tech business
2 min read★★★☆☆
Read Breakdown →
GenSBI: Generative Methods for Simulation-Based Inference in JAX
AI & ML

GenSBI: Generative Methods for Simulation-Based Inference in JAX

ai & ml
2 min read★★★☆☆
Read Breakdown →
SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training
AI & ML

SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Fundamental Limits of Fraud Detection in Card Payment Networks
AI & ML

The Fundamental Limits of Fraud Detection in Card Payment Networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals
AI & ML

Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals

ai & ml
2 min read★★★☆☆
Read Breakdown →
Gradient Transformer: Learning to Generate Updates for LLMs
AI & ML

Gradient Transformer: Learning to Generate Updates for LLMs

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Energy Blind Spot: NVIDIA's Flagship Edge AI Hardware Cannot Support Process-Level Energy Attribution
AI & ML

The Energy Blind Spot: NVIDIA's Flagship Edge AI Hardware Cannot Support Process-Level Energy Attribution

ai & ml
2 min read★★★☆☆
Read Breakdown →
Evaluating Local Explainability Metrics for Machine Learning Models on Tabular Data
TECH BUSINESS

Evaluating Local Explainability Metrics for Machine Learning Models on Tabular Data

tech business
2 min read★★★☆☆
Read Breakdown →
Supervised Distributional Reduction via Optimal Transport and Dependence Maximization
AI & ML

Supervised Distributional Reduction via Optimal Transport and Dependence Maximization

ai & ml
2 min read★★★☆☆
Read Breakdown →
Hurwitz Quaternion Multiplicative Quantization for KV Cache Compression
AI & ML

Hurwitz Quaternion Multiplicative Quantization for KV Cache Compression

ai & ml
2 min read★★★☆☆
Read Breakdown →
Faster Thermal Profiling of a Lunar Rover with Machine Learning Adapted Finite Difference Model
AI & ML

Faster Thermal Profiling of a Lunar Rover with Machine Learning Adapted Finite Difference Model

ai & ml
2 min read★★★☆☆
Read Breakdown →
Transferable Reinforcement Learning via Probabilistic Latent Embeddings and Dynamic Policy Adaptation for Sim-to-Real Deployment
AI & ML

Transferable Reinforcement Learning via Probabilistic Latent Embeddings and Dynamic Policy Adaptation for Sim-to-Real Deployment

ai & ml
2 min read★★★☆☆
Read Breakdown →
How the Optimizer Shapes Learned Solutions in Equivariant Neural Networks
AI & ML

How the Optimizer Shapes Learned Solutions in Equivariant Neural Networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
Aligning LLMs with Human Uncertainty: A Beta-Bernoulli Calibrator for LLM Forecasting
AI & ML

Aligning LLMs with Human Uncertainty: A Beta-Bernoulli Calibrator for LLM Forecasting

ai & ml
2 min read★★★☆☆
Read Breakdown →
When do complex-valued neural networks help? A study of representation, geometry, and optimization
AI & ML

When do complex-valued neural networks help? A study of representation, geometry, and optimization

ai & ml
2 min read★★★☆☆
Read Breakdown →
Test-Time Collective Action: Proxy-Based Perturbations for Correcting Algorithmic Harms
AI & ML

Test-Time Collective Action: Proxy-Based Perturbations for Correcting Algorithmic Harms

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Download: climate tech goes public and the AI Hype Index returns
ENGINEERING

The Download: climate tech goes public and the AI Hype Index returns

engineering
2 min read★★★☆☆
Read Breakdown →
Climate tech companies are going public. What’s next?
AI & ML

Climate tech companies are going public. What’s next?

ai & ml
2 min read★★★☆☆
Read Breakdown →
The AI Hype Index: AI gets booed in graduation season
AI & ML

The AI Hype Index: AI gets booed in graduation season

ai & ml
2 min read★★★☆☆
Read Breakdown →
Identifying and Understanding Human Values in Text: A Tailorable LLM-based Architecture
AI & ML

Identifying and Understanding Human Values in Text: A Tailorable LLM-based Architecture

ai & ml
2 min read★★★☆☆
Read Breakdown →
Soro: A Lightweight Foundation Model and Chatbot for Tajik
AI & ML

Soro: A Lightweight Foundation Model and Chatbot for Tajik

ai & ml
2 min read★★★☆☆
Read Breakdown →
On the Origin of Synthetic Information by Means of Steganographic Inheritance
AI & ML

On the Origin of Synthetic Information by Means of Steganographic Inheritance

ai & ml
2 min read★★★☆☆
Read Breakdown →
DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents
AI & ML

DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
Why LLMs Fail at Causal Discovery and How Interventional Agents Escape
AI & ML

Why LLMs Fail at Causal Discovery and How Interventional Agents Escape

ai & ml
2 min read★★★☆☆
Read Breakdown →
RULER: Representation-Level Verification of Machine Unlearning
AI & ML

RULER: Representation-Level Verification of Machine Unlearning

ai & ml
2 min read★★★☆☆
Read Breakdown →
LaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generation
AI & ML

LaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generation

ai & ml
2 min read★★★☆☆
Read Breakdown →
Discovery Agents for Real-Time Analytics: Toward Proactive Insight Systems
AI & ML

Discovery Agents for Real-Time Analytics: Toward Proactive Insight Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
Agyn: An Open-Source Platform for AI Agents with Scalable On-Demand Execution, Agent Definition as a Code, and Zero-Trust Access
AI & ML

Agyn: An Open-Source Platform for AI Agents with Scalable On-Demand Execution, Agent Definition as a Code, and Zero-Trust Access

ai & ml
2 min read★★★☆☆
Read Breakdown →
You Are in Control of Your State: Why Human Outcomes Are Controllable Through Causal State Intervention
AI & ML

You Are in Control of Your State: Why Human Outcomes Are Controllable Through Causal State Intervention

ai & ml
2 min read★★★☆☆
Read Breakdown →
Cyberbullying Governance on Social Media: A Unified Framework from Content Identification to Intervention
AI & ML

Cyberbullying Governance on Social Media: A Unified Framework from Content Identification to Intervention

ai & ml
2 min read★★★☆☆
Read Breakdown →
Voluntary Collusion with Secret Tools in Competing LLM Agents
AI & ML

Voluntary Collusion with Secret Tools in Competing LLM Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
Laguna M.1/XS.2 Technical Report
AI & ML

Laguna M.1/XS.2 Technical Report

ai & ml
2 min read★★★☆☆
Read Breakdown →
Reasoning and Planning with Dynamically Changing Norms
AI & ML

Reasoning and Planning with Dynamically Changing Norms

ai & ml
2 min read★★★☆☆
Read Breakdown →
Intelligence as Managed Autonomy: Failure, Escalation, and Governance for Agentic AI Systems
ENGINEERING

Intelligence as Managed Autonomy: Failure, Escalation, and Governance for Agentic AI Systems

engineering
2 min read★★★☆☆
Read Breakdown →
Behavioural Analysis of Alignment Faking
AI & ML

Behavioural Analysis of Alignment Faking

ai & ml
2 min read★★★☆☆
Read Breakdown →
Cross-Entropy Games and Frost Training
AI & ML

Cross-Entropy Games and Frost Training

ai & ml
2 min read★★★☆☆
Read Breakdown →
Hierarchical Prompt-Domain Control and Learning for Resource-Constrained Agentic Language Models
AI & ML

Hierarchical Prompt-Domain Control and Learning for Resource-Constrained Agentic Language Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
DeepSciVerify: Verifying Scientific Claim--Citation Alignment via LLM-Driven Evidence Escalation
AI & ML

DeepSciVerify: Verifying Scientific Claim--Citation Alignment via LLM-Driven Evidence Escalation

ai & ml
2 min read★★★☆☆
Read Breakdown →
Prefix-Safe Bayesian Belief Tracking for LLM Reasoning Reliability:Separating Calibration from Ranking
AI & ML

Prefix-Safe Bayesian Belief Tracking for LLM Reasoning Reliability:Separating Calibration from Ranking

ai & ml
2 min read★★★☆☆
Read Breakdown →
A Policy-Driven Runtime Layer for Agentic LLM Serving
AI & ML

A Policy-Driven Runtime Layer for Agentic LLM Serving

ai & ml
2 min read★★★☆☆
Read Breakdown →
Asking Is Not Enough: Protocol Sensitivity in LLM Confidence Calibration
AI & ML

Asking Is Not Enough: Protocol Sensitivity in LLM Confidence Calibration

ai & ml
2 min read★★★☆☆
Read Breakdown →
SkillGrad: Optimizing Agent Skills Like Gradient Descent
AI & ML

SkillGrad: Optimizing Agent Skills Like Gradient Descent

ai & ml
2 min read★★★☆☆
Read Breakdown →
Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems
AI & ML

Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
Auditable Decision Models with Learned Abstention and Real-Time Steering
AI & ML

Auditable Decision Models with Learned Abstention and Real-Time Steering

ai & ml
2 min read★★★☆☆
Read Breakdown →
Diagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profiles
AI & ML

Diagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profiles

ai & ml
2 min read★★★☆☆
Read Breakdown →
A Query Engine for the Agents
AI & ML

A Query Engine for the Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
A Fixed-Budget, Cluster-Aware Standard for LLM-as-a-Judge Evaluation: A Multi-Hop RAG Stress Test
AI & ML

A Fixed-Budget, Cluster-Aware Standard for LLM-as-a-Judge Evaluation: A Multi-Hop RAG Stress Test

ai & ml
2 min read★★★☆☆
Read Breakdown →
GraD-IBD: Graph Representation Learning from Diagnosis Trajectories for Early Detection of Inflammatory Bowel Disease
AI & ML

GraD-IBD: Graph Representation Learning from Diagnosis Trajectories for Early Detection of Inflammatory Bowel Disease

ai & ml
2 min read★★★☆☆
Read Breakdown →
GEM: Geometric Entropy Mixing for Optimal LLM Data Curation
AI & ML

GEM: Geometric Entropy Mixing for Optimal LLM Data Curation

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Constraint Tax: Measuring Validity-Correctness Tradeoffs in Structured Outputs for Small Language Models
AI & ML

The Constraint Tax: Measuring Validity-Correctness Tradeoffs in Structured Outputs for Small Language Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion
AI & ML

AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion

ai & ml
2 min read★★★☆☆
Read Breakdown →
SilIF: Silhouette-Augmented Isolation Forest for Unsupervised Transaction Fraud Detection
AI & ML

SilIF: Silhouette-Augmented Isolation Forest for Unsupervised Transaction Fraud Detection

ai & ml
2 min read★★★☆☆
Read Breakdown →
Neural Bayesian Sequential Routing
AI & ML

Neural Bayesian Sequential Routing

ai & ml
2 min read★★★☆☆
Read Breakdown →
TSFMAudit: Data Contamination Auditing in Forecasting Time Series Foundation Models
AI & ML

TSFMAudit: Data Contamination Auditing in Forecasting Time Series Foundation Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
On the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approach
AI & ML

On the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approach

ai & ml
2 min read★★★☆☆
Read Breakdown →
Planning Neural Dynamics with Lie Group Embedding through Supervised Projective Manifold Learning
AI & ML

Planning Neural Dynamics with Lie Group Embedding through Supervised Projective Manifold Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
When Rule Violations Are Rare: Chimera Training for Logical Anomaly Detection
AI & ML

When Rule Violations Are Rare: Chimera Training for Logical Anomaly Detection

ai & ml
2 min read★★★☆☆
Read Breakdown →
ARBITER: Reasoning Trajectory Basins and Majority Vote Failures in Test-Time Sampling
AI & ML

ARBITER: Reasoning Trajectory Basins and Majority Vote Failures in Test-Time Sampling

ai & ml
2 min read★★★☆☆
Read Breakdown →
InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization
AI & ML

InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization

ai & ml
2 min read★★★☆☆
Read Breakdown →
GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training
AI & ML

GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training

ai & ml
2 min read★★★☆☆
Read Breakdown →
Max-Window Scale Estimation for Near-Lossless HiF8 W8A8 Quantization-Aware Training
AI & ML

Max-Window Scale Estimation for Near-Lossless HiF8 W8A8 Quantization-Aware Training

ai & ml
2 min read★★★☆☆
Read Breakdown →
HRVConformer: Neonatal Hypoxic-Ischemic Encephalopathy Classification from the Heart Rate signals
AI & ML

HRVConformer: Neonatal Hypoxic-Ischemic Encephalopathy Classification from the Heart Rate signals

ai & ml
2 min read★★★☆☆
Read Breakdown →
Modeling Dynamic Mixtures of Time-Delay Systems from Streaming Time Series
TECH BUSINESS

Modeling Dynamic Mixtures of Time-Delay Systems from Streaming Time Series

tech business
2 min read★★★☆☆
Read Breakdown →
Co-folding model guided by structural proteomics
AI & ML

Co-folding model guided by structural proteomics

ai & ml
2 min read★★★☆☆
Read Breakdown →
Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection
TECH BUSINESS

Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection

tech business
2 min read★★★☆☆
Read Breakdown →
On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series
AI & ML

On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series

ai & ml
2 min read★★★☆☆
Read Breakdown →
From Privacy to Generalization: Linear Max-Information Bounds for DP-SGD
AI & ML

From Privacy to Generalization: Linear Max-Information Bounds for DP-SGD

ai & ml
2 min read★★★☆☆
Read Breakdown →
Provably Communication-Efficient and Privacy-Preserving Federated Graph Neural Networks
AI & ML

Provably Communication-Efficient and Privacy-Preserving Federated Graph Neural Networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works
AI & ML

The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works

ai & ml
2 min read★★★☆☆
Read Breakdown →
Unified Neural Scaling Laws
AI & ML

Unified Neural Scaling Laws

ai & ml
2 min read★★★☆☆
Read Breakdown →
Quantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusion
AI & ML

Quantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusion

ai & ml
2 min read★★★☆☆
Read Breakdown →
Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization
AI & ML

Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization

ai & ml
2 min read★★★☆☆
Read Breakdown →
Two-Parameter Flows for Learning Population Dynamics of Physical Systems
AI & ML

Two-Parameter Flows for Learning Population Dynamics of Physical Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
Stateful Inference for Low-Latency Multi-Agent Tool Calling
AI & ML

Stateful Inference for Low-Latency Multi-Agent Tool Calling

ai & ml
2 min read★★★☆☆
Read Breakdown →
Dynamic Link Prediction with Temporally Enhanced Signed Graph Neural Networks
AI & ML

Dynamic Link Prediction with Temporally Enhanced Signed Graph Neural Networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
Classification and detection of multiple UAVs using rational Gaussian wavelet neural networks
AI & ML

Classification and detection of multiple UAVs using rational Gaussian wavelet neural networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
Curriculum Learning for Safety Alignment
AI & ML

Curriculum Learning for Safety Alignment

ai & ml
2 min read★★★☆☆
Read Breakdown →
MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding
AI & ML

MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Download: keeping up with AI, and the future of IVF
AI & ML

The Download: keeping up with AI, and the future of IVF

ai & ml
2 min read★★★☆☆
Read Breakdown →
BrickAnything: Geometry-Conditioned Buildable Brick Generation with Structure-Aware Tokenization
AI & ML

BrickAnything: Geometry-Conditioned Buildable Brick Generation with Structure-Aware Tokenization

ai & ml
2 min read★★★☆☆
Read Breakdown →
Can LLMs Introspect? A Reality Check
AI & ML

Can LLMs Introspect? A Reality Check

ai & ml
2 min read★★★☆☆
Read Breakdown →
Personalizing Embodied Multimodal Large Language Model Agents over Long-term User Interactions
ENGINEERING

Personalizing Embodied Multimodal Large Language Model Agents over Long-term User Interactions

engineering
2 min read★★★☆☆
Read Breakdown →
Constraint acquisition needs better benchmarks
TECH BUSINESS

Constraint acquisition needs better benchmarks

tech business
2 min read★★★☆☆
Read Breakdown →
Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems
AI & ML

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
Experiments in Agentic AI for Science
AI & ML

Experiments in Agentic AI for Science

ai & ml
2 min read★★★☆☆
Read Breakdown →
Anchor: Mitigating Artifact Drift in Agent Benchmark Generation
AI & ML

Anchor: Mitigating Artifact Drift in Agent Benchmark Generation

ai & ml
2 min read★★★☆☆
Read Breakdown →
OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling
AI & ML

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

ai & ml
2 min read★★★☆☆
Read Breakdown →
JobBench: Aligning Agent Work With Human Will
AI & ML

JobBench: Aligning Agent Work With Human Will

ai & ml
2 min read★★★☆☆
Read Breakdown →
Managing Uncertainty in LLM-Generated Procedural Knowledge for Virtual Laboratory Planning
AI & ML

Managing Uncertainty in LLM-Generated Procedural Knowledge for Virtual Laboratory Planning

ai & ml
2 min read★★★☆☆
Read Breakdown →
ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence
ENGINEERING

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

engineering
2 min read★★★☆☆
Read Breakdown →
Automatic Layer Selection for Hallucination Detection
AI & ML

Automatic Layer Selection for Hallucination Detection

ai & ml
2 min read★★★☆☆
Read Breakdown →
Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL
AI & ML

Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL

ai & ml
2 min read★★★☆☆
Read Breakdown →
Advancing Creative Physical Intelligence in Large Multimodal Models
AI & ML

Advancing Creative Physical Intelligence in Large Multimodal Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator
AI & ML

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

ai & ml
2 min read★★★☆☆
Read Breakdown →
Reasoning, Code, or Both? How Large Language Models Handle Variations in Math Questions
AI & ML

Reasoning, Code, or Both? How Large Language Models Handle Variations in Math Questions

ai & ml
2 min read★★★☆☆
Read Breakdown →
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence
AI & ML

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

ai & ml
2 min read★★★☆☆
Read Breakdown →
Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning
TECH BUSINESS

Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning

tech business
2 min read★★★☆☆
Read Breakdown →
PolyFusionAgent: A Multimodal Foundation Model and Autonomous AI Assistant for Polymer Property Prediction and Inverse Design
ENGINEERING

PolyFusionAgent: A Multimodal Foundation Model and Autonomous AI Assistant for Polymer Property Prediction and Inverse Design

engineering
2 min read★★★☆☆
Read Breakdown →
MobileExplorer: Accelerating On-Device Inference for Mobile GUI Agents via Online Exploration
AI & ML

MobileExplorer: Accelerating On-Device Inference for Mobile GUI Agents via Online Exploration

ai & ml
2 min read★★★☆☆
Read Breakdown →
MedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoning
AI & ML

MedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoning

ai & ml
2 min read★★★☆☆
Read Breakdown →
AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents
AI & ML

AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
FAST-GOAL: Fast and Efficient Global-local Object Alignment Learning
AI & ML

FAST-GOAL: Fast and Efficient Global-local Object Alignment Learning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Tail-Aware HiFloat4: W4A4 Post-Training Quantization for Wan2.2
AI & ML

Tail-Aware HiFloat4: W4A4 Post-Training Quantization for Wan2.2

ai & ml
2 min read★★★☆☆
Read Breakdown →
UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems
AI & ML

UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
Completion vs Optimality: Policy Gradient in Long-Horizon Cumulative-Damage Problems
AI & ML

Completion vs Optimality: Policy Gradient in Long-Horizon Cumulative-Damage Problems

ai & ml
2 min read★★★☆☆
Read Breakdown →
MemFail: Stress-Testing Failure Modes of LLM Memory Systems
AI & ML

MemFail: Stress-Testing Failure Modes of LLM Memory Systems

ai & ml
2 min read★★★☆☆
Read Breakdown →
Mind the Tool Failures: Achieving Synergistic Tool Gains for Medical Agents
AI & ML

Mind the Tool Failures: Achieving Synergistic Tool Gains for Medical Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
AI & ML

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation

ai & ml
2 min read★★★☆☆
Read Breakdown →
Rethinking organizational design in the age of agentic AI
AI & ML

Rethinking organizational design in the age of agentic AI

ai & ml
2 min read★★★☆☆
Read Breakdown →
Algometrics: Forecasting Under Algorithmic Feedback
AI & ML

Algometrics: Forecasting Under Algorithmic Feedback

ai & ml
2 min read★★★☆☆
Read Breakdown →
Parameter Efficient Multi-Class Intelligent Scheduling for Multimodal Online Distributed Industrial Anomaly Detection
AI & ML

Parameter Efficient Multi-Class Intelligent Scheduling for Multimodal Online Distributed Industrial Anomaly Detection

ai & ml
2 min read★★★☆☆
Read Breakdown →
CAFD: Concept-Aware DNN Fault Detection using VLMs
AI & ML

CAFD: Concept-Aware DNN Fault Detection using VLMs

ai & ml
2 min read★★★☆☆
Read Breakdown →
Towards Verifiable Transformers: Solver-Checkable Circuit Explanations
AI & ML

Towards Verifiable Transformers: Solver-Checkable Circuit Explanations

ai & ml
2 min read★★★☆☆
Read Breakdown →
Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation
AI & ML

Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation

ai & ml
2 min read★★★☆☆
Read Breakdown →
Hidden-State Privacy Has an Empty Middle
AI & ML

Hidden-State Privacy Has an Empty Middle

ai & ml
2 min read★★★☆☆
Read Breakdown →
LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs
AI & ML

LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs

ai & ml
2 min read★★★☆☆
Read Breakdown →
A Large-Scale Dataset and Benchmark: Do Protein-Ligand Models Learn Binding Sites or Just Binding Likelihood?
AI & ML

A Large-Scale Dataset and Benchmark: Do Protein-Ligand Models Learn Binding Sites or Just Binding Likelihood?

ai & ml
2 min read★★★☆☆
Read Breakdown →
Mixture of Complementary Agents for Robust LLM Ensemble
AI & ML

Mixture of Complementary Agents for Robust LLM Ensemble

ai & ml
2 min read★★★☆☆
Read Breakdown →
Truthful Online Preference Aggregation for LLM Fine-Tuning in Mobile Crowdsourcing
AI & ML

Truthful Online Preference Aggregation for LLM Fine-Tuning in Mobile Crowdsourcing

ai & ml
2 min read★★★☆☆
Read Breakdown →
Cascade-KDE: Robust Time-Series Restoration under Out-of-Distribution Impulse Corruptions
AI & ML

Cascade-KDE: Robust Time-Series Restoration under Out-of-Distribution Impulse Corruptions

ai & ml
2 min read★★★☆☆
Read Breakdown →
Feature Lottery? A Bifurcation Theory of Concept Emergence
AI & ML

Feature Lottery? A Bifurcation Theory of Concept Emergence

ai & ml
2 min read★★★☆☆
Read Breakdown →
Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning
AI & ML

Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning

ai & ml
2 min read★★★☆☆
Read Breakdown →
Spectral Probe-Circuits: A Three-Step Recipe for Identifying Attention-Head Circuits in Pretrained Transformers
AI & ML

Spectral Probe-Circuits: A Three-Step Recipe for Identifying Attention-Head Circuits in Pretrained Transformers

ai & ml
2 min read★★★☆☆
Read Breakdown →
Federated Learning over Human-Body Communication for On-Body Edge Intelligence: A Survey, Taxonomy, and BODYFED-HBC Scheduling Vignette
AI & ML

Federated Learning over Human-Body Communication for On-Body Edge Intelligence: A Survey, Taxonomy, and BODYFED-HBC Scheduling Vignette

ai & ml
2 min read★★★☆☆
Read Breakdown →
Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion
AI & ML

Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion

ai & ml
2 min read★★★☆☆
Read Breakdown →
Not All Transitions Matter: Evidence from PPO
AI & ML

Not All Transitions Matter: Evidence from PPO

ai & ml
2 min read★★★☆☆
Read Breakdown →
Verified SHAP: Provable Bounds for Exact Shapley Values of Neural Networks
AI & ML

Verified SHAP: Provable Bounds for Exact Shapley Values of Neural Networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
Overcoming "Physics Shock" in Earth Observation A Heteroscedastic Uncertainty Framework for PINN-based Flood Inference
AI & ML

Overcoming "Physics Shock" in Earth Observation A Heteroscedastic Uncertainty Framework for PINN-based Flood Inference

ai & ml
2 min read★★★☆☆
Read Breakdown →
Riemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributions
AI & ML

Riemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributions

ai & ml
2 min read★★★☆☆
Read Breakdown →
Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis
AI & ML

Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis

ai & ml
2 min read★★★☆☆
Read Breakdown →
PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection
AI & ML

PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection

ai & ml
2 min read★★★☆☆
Read Breakdown →
Filtered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalization
AI & ML

Filtered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalization

ai & ml
2 min read★★★☆☆
Read Breakdown →
Characterizing the Representational Capacity of Neural Processes
AI & ML

Characterizing the Representational Capacity of Neural Processes

ai & ml
2 min read★★★☆☆
Read Breakdown →
Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning
AI & ML

Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning

ai & ml
2 min read★★★☆☆
Read Breakdown →
PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets
AI & ML

PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets

ai & ml
2 min read★★★☆☆
Read Breakdown →
Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions
AI & ML

Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions

ai & ml
2 min read★★★☆☆
Read Breakdown →
Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence
AI & ML

Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence

ai & ml
2 min read★★★☆☆
Read Breakdown →
A lift for input-convex neural network training
AI & ML

A lift for input-convex neural network training

ai & ml
2 min read★★★☆☆
Read Breakdown →
Fourier Feature Pyramids for Physics-Informed Neural Networks
AI & ML

Fourier Feature Pyramids for Physics-Informed Neural Networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
The Download: puncturing the AI jobs panic
AI & ML

The Download: puncturing the AI jobs panic

ai & ml
2 min read★★★☆☆
Read Breakdown →