AI & MLMicrosoft under fire for threatening security researcher with criminal investigationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLOne Mask to Rule Them All: On Hidden Facts after Editing and How to Find Themai & ml2 min read★★★☆☆Read Breakdown →
AI & MLRepresentation Signatures and Risk-Feedback Alignment in LLM Trading Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLMechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSelf-Play Reinforcement Learning under Imperfect Information in Big 2ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLEmergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervisionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLContinuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPrismFlow: Residual Dynamics for Flow Matching in Time-Series Generationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTaxDistill: Improving Metagenomic Taxonomic Annotation via Distilled Genomic Foundation Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBalancing Multimodal Learning through Label Space Reshapingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPre-Registering the Detectable Effect: A Paired-MDE Budget for 4-bit Quantization Benchmarks, with a Pilot Auditai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFeature Geometry of LoRA Adapters: A Sparse Autoencoder Analysis of Representational Divergence in Fine-Tuned Language Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSpectral Guidance for Flexible and Efficient Control of Diffusion Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSequential Physics-Constrained Neural Operator Forward Modeling for the $\textit{Norne}$ Reservoir Systemai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCycle-Space Informed Detection of Autoencoded Blind False Data Injection Attacks on Power Systemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLWhen LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RLai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLConf-Gen: Conformal Uncertainty Quantification for Generative Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLA Training-Time Diagnostic for Generalization via the Log-Alignment Ratioai & ml2 min read★★★☆☆Read Breakdown →
AI & MLComparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detectionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLLearning Robust and Task-Invariant Functional Representation from fMRI through Siamese Self-Supervised Learningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFedQHD: Closed-Form Function-Space Federated Reinforcement Learningai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSLoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solverstech business2 min read★★★☆☆Read Breakdown →
AI & MLCausal Intelligence for Constraint-Aware Intervention Design to Induce State Transitionsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLLabel-Free Reinforcement Learning via Cross-Model Entropyai & ml2 min read★★★☆☆Read Breakdown →
AI & MLICG: Improving Cover Image Generation via MLLM-based Prompting and Personalized Preference Alignmentai & ml2 min read★★★☆☆Read Breakdown →
AI & MLLCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLUnlocking Fine-Grained and Within-Utterance Speaking Style Control in Prompt-Based Text-to-Speech Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLEnhancing LLM Medical Coding with Structured External Knowledgeai & ml2 min read★★★☆☆Read Breakdown →
AI & MLOralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysisai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Rankingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFrom AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizonsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLModeling Community Attitude through Reaction Tone: A Human-AI Collaborative Framework for Evaluating LLM Alignment with Linguistic Behaviors in Online Communitiesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLEvoSpec: Evolving Speculative Decoding via Real-Time Vocabulary and Parameter Adaptationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLStoryMI: Steerable Multi-Agent Therapeutic Dialogue Generationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPAST2HARM: A Simple Adaptive Past Tense Attack for Jailbreaking Multimodal AIai & ml2 min read★★★☆☆Read Breakdown →
AI & MLKeyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomiesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe Future of Facts: Tracing the Factual Generation-Verification Gapai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCan Hallucinations Be Useful? Solving Multi-Hop Questions With SLMs By Chaining System-I/II Reasoningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSimorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answeringai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDisentangling Language Roles in Multilingual LLM Task Executionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCultural Fidelity in English-to-Hindi Translation: A Preservation-Fluency Frontier for Gender Recoverabilityai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTRACES: Proactive Safety Auditing for Multi-Turn LLM Agents via Trajectory-State Modelingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLChain-based Adaptive Reconfiguration Over Lattices for Hallucination Reductionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLReverseMath: Answer Inversion for Scalable and Verifiable Mathematical Problem Generationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBeyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLUserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mindai & ml2 min read★★★☆☆Read Breakdown →
AI & MLUNIQUE: Universal Top-k Sparse Attention for Training-free Inference and Sparsity-aware Trainingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLEscape the Language Prior: Mitigating Late-Stage Modality Collapse in Audio Reasoning via Modality-Aware Policy Optimizationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLReading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editionsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLUniMaia: Steering Chess Policies with Language for Human-like Playai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDo Models Know Why They Changed Their Mind? Interpretability and Faithfulness of Chain-of-Thought Under Knowledge Conflictai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSThese researchers would be in Africa fighting ebola—but Trump cut their fundingtech business2 min read★★★☆☆Read Breakdown →
OPEN SOURCEgalilai-group/stable-worldmodel — A platform for reproducible world model research and evaluationopen source2 min read★★★☆☆Read Breakdown →
AI & MLHow the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI momentai & ml2 min read★★★☆☆Read Breakdown →
AI & MLLondon-based Inherent, which aims to combine human scientific research with AI to produce innovations, emerges from stealth with $50M led by Index Venturesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLResearchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 daysai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBlaming the model won't fix your workflow — a white paper on structural enforcement for AI agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLWhat are important data systems problems, ignored by research? (2024)ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBehavior-Induced Mirror-Prox Temporal-Difference Learning for Faster Off-Policy Predictionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBehavior-Aware Auxiliary Corrections for Off-Policy Temporal-Difference Predictionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modelingai & ml2 min read★★★☆☆Read Breakdown →
ENGINEERINGUltra-Reduced-Impact-Encased-Logging (URIEL): propose a new method for selective sustainable logging and post-harvest silvicultural treatment in tropical forest using airborne robotics systemsengineering2 min read★★★☆☆Read Breakdown →
AI & MLReview Arcade: On the Human Alignment and Gameability of LLM Reviewsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFrontier LLM-based agents can overcome the ontology curation bottleneck for natural phenotypesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLVFEAgent: A Multimodal Agent Framework for End-to-End Automated Finite Element Analysisai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSBEAMS: Benchmarking and Evaluating AI for Modeling and Simulationtech business2 min read★★★☆☆Read Breakdown →
AI & MLAdopt $\neq$ Adapt: Longitudinal Analyses of LLM Conversations in the Wildai & ml2 min read★★★☆☆Read Breakdown →
AI & MLWhen Models Disagree: Rethinking LLM Evaluation for Public Comment Analysisai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPractitioner Beliefs and Behaviors in AI-Enhanced Education: DOT Framework Survey Evidenceai & ml2 min read★★★☆☆Read Breakdown →
AI & MLHallucination Mitigation with Agentic AI, Nested Learning, and AI Sustainability via Semantic Cachingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semanticsai & ml2 min read★★★☆☆Read Breakdown →
ENGINEERINGThe Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Planeengineering2 min read★★★☆☆Read Breakdown →
AI & MLThe Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressureai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTrends in AI and Human-AI Interaction in Clinical Trials -- A Hybrid Human-AI Explorationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBeyond Consensus: Trace-Level Synthesis in Mixture of Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPRO-CUA: Process-Reward Optimization for Computer Use Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBetter Later Than Sooner: Neuro-Symbolic Knowledge Graph Construction via Ontology-grounded Post-extraction Correctionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPaper Agents, Paper Gains: An Empirical Analysis of DeFi Investment Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLReasonOps: Operator Segmentation for LLM Reasoning Tracesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGTA: Generating Long-Horizon Tasks for Web Agents at Scaleai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBenchTrace: A Benchmark for Testing Reflection Ability and Controlled Evolution in LLM Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTailoring the Curriculum: Student-Centered Reasoning Distillation via Dynamic Data-Model Compatibilityai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAI researchers ran 15-day simulations of worlds governed by different AI models: Claude Sonnet 4.6 recorded zero crime, while Gemini 3 Flash had the most at 683ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGitHub bans security researcher who posted zero-day Windows exploitsai & ml2 min read★★★☆☆Read Breakdown →
AI & ML[D] Where do you go for serious AI research discussion online? [D]ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLUS healthcare still stupidly expensive, with pathetic outcomes, study findsai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSResearchers develop a new process to get lithium out of rockstech business2 min read★★★☆☆Read Breakdown →
AI & MLKept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R]ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLEMA-Gated Temporal Sequence Compression in Vision Transformers [P]ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLnoisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P]ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe OpenClaw crisis is the most complete case study of agentic AI security failure. Here's the full timeline and technical breakdown.ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLHow a new extraction process could unlock the world’s lithiumai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPersonalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneityai & ml2 min read★★★☆☆Read Breakdown →
AI & MLIGADA-IoT: IoT Sensor Energy Optimization in Wireless Sensor Networks Driven by Automatic Data Augmentationai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSA Simple State Space Model Excels at Multivariate Time Series Classificationtech business2 min read★★★☆☆Read Breakdown →
AI & ML$E^3$-Agent: An Executable and Evolving Agent for Resource Management of Edge Generative Inferenceai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTackling Multimodal Learning Challenges with Mixture-of-Expert: A Surveyai & ml2 min read★★★☆☆Read Breakdown →
AI & MLMetric-Aware PCA as a Linear Instance of Geometric Deep Learningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLComparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utilityai & ml2 min read★★★☆☆Read Breakdown →
AI & MLArchitecture-driven Shift: towards a lightweight selector for capturing the trends of logit shiftai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDetect by Yourself: Self-Designing Agentic Workflows for Few-Shot Graph Anomaly Detectionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBalancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspectiveai & ml2 min read★★★☆☆Read Breakdown →
AI & MLResource-Constrained Affect Modelling via Variance Regularisation Pruningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLEnergy-Structured Low-Rank Adaptation for Continual Learningai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSFederated Learning for Multivariate Time Series Anomaly Detection in Industrial Automationtech business2 min read★★★☆☆Read Breakdown →
AI & MLGenSBI: Generative Methods for Simulation-Based Inference in JAXai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Trainingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe Fundamental Limits of Fraud Detection in Card Payment Networksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLInformation-theoretic Multimodal Representation Learning for Electrocardiogram Signalsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGradient Transformer: Learning to Generate Updates for LLMsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe Energy Blind Spot: NVIDIA's Flagship Edge AI Hardware Cannot Support Process-Level Energy Attributionai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSEvaluating Local Explainability Metrics for Machine Learning Models on Tabular Datatech business2 min read★★★☆☆Read Breakdown →
AI & MLSupervised Distributional Reduction via Optimal Transport and Dependence Maximizationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLHurwitz Quaternion Multiplicative Quantization for KV Cache Compressionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFaster Thermal Profiling of a Lunar Rover with Machine Learning Adapted Finite Difference Modelai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTransferable Reinforcement Learning via Probabilistic Latent Embeddings and Dynamic Policy Adaptation for Sim-to-Real Deploymentai & ml2 min read★★★☆☆Read Breakdown →
AI & MLHow the Optimizer Shapes Learned Solutions in Equivariant Neural Networksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAligning LLMs with Human Uncertainty: A Beta-Bernoulli Calibrator for LLM Forecastingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLWhen do complex-valued neural networks help? A study of representation, geometry, and optimizationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTest-Time Collective Action: Proxy-Based Perturbations for Correcting Algorithmic Harmsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAWS is rolling out Resilient Network Graphs, a “quasi-random” networking architecture that uses a flat mesh design, and says it accelerates information flowsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLUK researchers win access to Google's Willow quantum chip, which it says completes a calculation in five minutes that takes supercomputers 10 septillion yearsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLIdentifying and Understanding Human Values in Text: A Tailorable LLM-based Architectureai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSoro: A Lightweight Foundation Model and Chatbot for Tajikai & ml2 min read★★★☆☆Read Breakdown →
AI & MLOn the Origin of Synthetic Information by Means of Steganographic Inheritanceai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLWhy LLMs Fail at Causal Discovery and How Interventional Agents Escapeai & ml2 min read★★★☆☆Read Breakdown →
AI & MLRULER: Representation-Level Verification of Machine Unlearningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLLaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDiscovery Agents for Real-Time Analytics: Toward Proactive Insight Systemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAgyn: An Open-Source Platform for AI Agents with Scalable On-Demand Execution, Agent Definition as a Code, and Zero-Trust Accessai & ml2 min read★★★☆☆Read Breakdown →
AI & MLYou Are in Control of Your State: Why Human Outcomes Are Controllable Through Causal State Interventionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCyberbullying Governance on Social Media: A Unified Framework from Content Identification to Interventionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLVoluntary Collusion with Secret Tools in Competing LLM Agentsai & ml2 min read★★★☆☆Read Breakdown →
ENGINEERINGIntelligence as Managed Autonomy: Failure, Escalation, and Governance for Agentic AI Systemsengineering2 min read★★★☆☆Read Breakdown →
AI & MLHierarchical Prompt-Domain Control and Learning for Resource-Constrained Agentic Language Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDeepSciVerify: Verifying Scientific Claim--Citation Alignment via LLM-Driven Evidence Escalationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPrefix-Safe Bayesian Belief Tracking for LLM Reasoning Reliability:Separating Calibration from Rankingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAsking Is Not Enough: Protocol Sensitivity in LLM Confidence Calibrationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSkillGrad: Optimizing Agent Skills Like Gradient Descentai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGot a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAuditable Decision Models with Learned Abstention and Real-Time Steeringai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDiagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profilesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLA Fixed-Budget, Cluster-Aware Standard for LLM-as-a-Judge Evaluation: A Multi-Hop RAG Stress Testai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGraD-IBD: Graph Representation Learning from Diagnosis Trajectories for Early Detection of Inflammatory Bowel Diseaseai & ml2 min read★★★☆☆Read Breakdown →
AI & MLI used autoresearch to improve my AGENTS.md, measured against real tasksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAmazon says it is making the “architecture, starter code, and learnings” from Alexa for Shopping available to third-party retailers, starting with Kate Spadeai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGEM: Geometric Entropy Mixing for Optimal LLM Data Curationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe Constraint Tax: Measuring Validity-Correctness Tradeoffs in Structured Outputs for Small Language Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSilIF: Silhouette-Augmented Isolation Forest for Unsupervised Transaction Fraud Detectionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTSFMAudit: Data Contamination Auditing in Forecasting Time Series Foundation Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLOn the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approachai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPlanning Neural Dynamics with Lie Group Embedding through Supervised Projective Manifold Learningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLWhen Rule Violations Are Rare: Chimera Training for Logical Anomaly Detectionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLARBITER: Reasoning Trajectory Basins and Majority Vote Failures in Test-Time Samplingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLInfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantizationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Trainingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLMax-Window Scale Estimation for Near-Lossless HiF8 W8A8 Quantization-Aware Trainingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLHRVConformer: Neonatal Hypoxic-Ischemic Encephalopathy Classification from the Heart Rate signalsai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSModeling Dynamic Mixtures of Time-Delay Systems from Streaming Time Seriestech business2 min read★★★☆☆Read Breakdown →
TECH BUSINESSBridging Classification and Reconstruction: Cooperative Time Series Anomaly Detectiontech business2 min read★★★☆☆Read Breakdown →
AI & MLOn the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Seriesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFrom Privacy to Generalization: Linear Max-Information Bounds for DP-SGDai & ml2 min read★★★☆☆Read Breakdown →
AI & MLProvably Communication-Efficient and Privacy-Preserving Federated Graph Neural Networksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Worksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLQuantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLScaling World-Model Reinforcement Learning Through Diffusion Policy Optimizationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTwo-Parameter Flows for Learning Population Dynamics of Physical Systemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLStateful Inference for Low-Latency Multi-Agent Tool Callingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDynamic Link Prediction with Temporally Enhanced Signed Graph Neural Networksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLClassification and detection of multiple UAVs using rational Gaussian wavelet neural networksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLMULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understandingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDatacurve releases the DeepSWE coding benchmark, a 113-task test across 91 open-source repositories and five languages, and says GPT-5.5 is the leader at 70%ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLBrickAnything: Geometry-Conditioned Buildable Brick Generation with Structure-Aware Tokenizationai & ml2 min read★★★☆☆Read Breakdown →
ENGINEERINGPersonalizing Embodied Multimodal Large Language Model Agents over Long-term User Interactionsengineering2 min read★★★☆☆Read Breakdown →
TECH BUSINESSConstraint acquisition needs better benchmarkstech business2 min read★★★☆☆Read Breakdown →
AI & MLYour Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAnchor: Mitigating Artifact Drift in Agent Benchmark Generationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLOmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modelingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLManaging Uncertainty in LLM-Generated Procedural Knowledge for Virtual Laboratory Planningai & ml2 min read★★★☆☆Read Breakdown →
ENGINEERINGScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidenceengineering2 min read★★★☆☆Read Breakdown →
AI & MLExploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RLai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAdvancing Creative Physical Intelligence in Large Multimodal Modelsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFrom Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulatorai & ml2 min read★★★☆☆Read Breakdown →
AI & MLReasoning, Code, or Both? How Large Language Models Handle Variations in Math Questionsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLThe MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligenceai & ml2 min read★★★☆☆Read Breakdown →
TECH BUSINESSWhich Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoningtech business2 min read★★★☆☆Read Breakdown →
ENGINEERINGPolyFusionAgent: A Multimodal Foundation Model and Autonomous AI Assistant for Polymer Property Prediction and Inverse Designengineering2 min read★★★☆☆Read Breakdown →
AI & MLMobileExplorer: Accelerating On-Device Inference for Mobile GUI Agents via Online Explorationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLMedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFAST-GOAL: Fast and Efficient Global-local Object Alignment Learningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTail-Aware HiFloat4: W4A4 Post-Training Quantization for Wan2.2ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLUnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCompletion vs Optimality: Policy Gradient in Long-Horizon Cumulative-Damage Problemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLMemFail: Stress-Testing Failure Modes of LLM Memory Systemsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLMind the Tool Failures: Achieving Synergistic Tool Gains for Medical Agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTowards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLInitial benchmarks show Nvidia's Vera CPU, which features 88 in-house-designed Olympus cores, packs a heavy-hitting punch, beating Intel's and AMD's x86_64 CPUsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLDeepSWE: A contamination-free benchmark for long-horizon coding agentsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLParameter Efficient Multi-Class Intelligent Scheduling for Multimodal Online Distributed Industrial Anomaly Detectionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTowards Verifiable Transformers: Solver-Checkable Circuit Explanationsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLIterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLLLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLA Large-Scale Dataset and Benchmark: Do Protein-Ligand Models Learn Binding Sites or Just Binding Likelihood?ai & ml2 min read★★★☆☆Read Breakdown →
AI & MLTruthful Online Preference Aggregation for LLM Fine-Tuning in Mobile Crowdsourcingai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCascade-KDE: Robust Time-Series Restoration under Out-of-Distribution Impulse Corruptionsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFeature Lottery? A Bifurcation Theory of Concept Emergenceai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSigns Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLSpectral Probe-Circuits: A Three-Step Recipe for Identifying Attention-Head Circuits in Pretrained Transformersai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFederated Learning over Human-Body Communication for On-Body Edge Intelligence: A Survey, Taxonomy, and BODYFED-HBC Scheduling Vignetteai & ml2 min read★★★☆☆Read Breakdown →
AI & MLGenerative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLVerified SHAP: Provable Bounds for Exact Shapley Values of Neural Networksai & ml2 min read★★★☆☆Read Breakdown →
AI & MLOvercoming "Physics Shock" in Earth Observation A Heteroscedastic Uncertainty Framework for PINN-based Flood Inferenceai & ml2 min read★★★☆☆Read Breakdown →
AI & MLRiemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributionsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLKnowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysisai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detectionai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFiltered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalizationai & ml2 min read★★★☆☆Read Breakdown →
AI & MLCharacterizing the Representational Capacity of Neural Processesai & ml2 min read★★★☆☆Read Breakdown →
AI & MLAgent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoningai & ml2 min read★★★☆☆Read Breakdown →
AI & MLPrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasetsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLRethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditionsai & ml2 min read★★★☆☆Read Breakdown →
AI & MLOptimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherenceai & ml2 min read★★★☆☆Read Breakdown →
AI & MLFourier Feature Pyramids for Physics-Informed Neural Networksai & ml2 min read★★★☆☆Read Breakdown →