Research

Papers, benchmarks, and academic breakthroughs

37 articles

First MacBook Neo Benchmarks Are In
AI & ML

First MacBook Neo Benchmarks Are In

ai & ml
2 min read★★★☆☆
Read Breakdown →
crosspoint-reader: Firmware for the Xteink X4 e-paper display reader
AI & ML

crosspoint-reader: Firmware for the Xteink X4 e-paper display reader

ai & ml
2 min read★★★☆☆
Read Breakdown →
DiligenceSquared uses AI, voice agents to make M&A research affordable
AI & ML

DiligenceSquared uses AI, voice agents to make M&A research affordable

ai & ml
2 min read★★★☆☆
Read Breakdown →
TheCraigHewitt/seomachine — A specialized Claude Code workspace for creating long-form, SEO-optimized blog content for any busin
OPEN SOURCE

TheCraigHewitt/seomachine — A specialized Claude Code workspace for creating long-form, SEO-optimized blog content for any busin

open source
2 min read★★★☆☆
Read Breakdown →
KeygraphHQ/shannon — Shannon Lite is a fully autonomous AI pentester for web apps and APIs. 96.15% (100/104 exploits) on
OPEN SOURCE

KeygraphHQ/shannon — Shannon Lite is a fully autonomous AI pentester for web apps and APIs. 96.15% (100/104 exploits) on

open source
2 min read★★★☆☆
Read Breakdown →
SkillNet: Create, Evaluate, and Connect AI Skills
TECH BUSINESS

SkillNet: Create, Evaluate, and Connect AI Skills

tech business
2 min read★★★☆☆
Read Breakdown →
Capability Thresholds and Manufacturing Topology: How Embodied Intelligence Triggers Phase Transitions in Economic Geography
ENGINEERING

Capability Thresholds and Manufacturing Topology: How Embodied Intelligence Triggers Phase Transitions in Economic Geography

engineering
2 min read★★★☆☆
Read Breakdown →
Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding
AI & ML

Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding

ai & ml
2 min read★★★☆☆
Read Breakdown →
Discovering mathematical concepts through a multi-agent system
AI & ML

Discovering mathematical concepts through a multi-agent system

ai & ml
2 min read★★★☆☆
Read Breakdown →
Adaptive Memory Admission Control for LLM Agents
AI & ML

Adaptive Memory Admission Control for LLM Agents

ai & ml
2 min read★★★☆☆
Read Breakdown →
Self-Attribution Bias: When AI Monitors Go Easy on Themselves
AI & ML

Self-Attribution Bias: When AI Monitors Go Easy on Themselves

ai & ml
2 min read★★★☆☆
Read Breakdown →
ECG-MoE: Mixture-of-Expert Electrocardiogram Foundation Model
TECH BUSINESS

ECG-MoE: Mixture-of-Expert Electrocardiogram Foundation Model

tech business
2 min read★★★☆☆
Read Breakdown →
Towards automated data analysis: A guided framework for LLM-based risk estimation
AI & ML

Towards automated data analysis: A guided framework for LLM-based risk estimation

ai & ml
2 min read★★★☆☆
Read Breakdown →
When Agents Persuade: Propaganda Generation and Mitigation in LLMs
AI & ML

When Agents Persuade: Propaganda Generation and Mitigation in LLMs

ai & ml
2 min read★★★☆☆
Read Breakdown →
Using Vision + Language Models to Predict Item Difficulty
AI & ML

Using Vision + Language Models to Predict Item Difficulty

ai & ml
2 min read★★★☆☆
Read Breakdown →
Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models
ENGINEERING

Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models

engineering
2 min read★★★☆☆
Read Breakdown →
From Offline to Periodic Adaptation for Pose-Based Shoplifting Detection in Real-world Retail Security
TECH BUSINESS

From Offline to Periodic Adaptation for Pose-Based Shoplifting Detection in Real-world Retail Security

tech business
2 min read★★★☆☆
Read Breakdown →
Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery
TECH BUSINESS

Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery

tech business
2 min read★★★☆☆
Read Breakdown →
Interactive Benchmarks
AI & ML

Interactive Benchmarks

ai & ml
2 min read★★★☆☆
Read Breakdown →
Memory as Ontology: A Constitutional Memory Architecture for Persistent Digital Citizens
AI & ML

Memory as Ontology: A Constitutional Memory Architecture for Persistent Digital Citizens

ai & ml
2 min read★★★☆☆
Read Breakdown →
CONE: Embeddings for Complex Numerical Data Preserving Unit and Variable Semantics
AI & ML

CONE: Embeddings for Complex Numerical Data Preserving Unit and Variable Semantics

ai & ml
2 min read★★★☆☆
Read Breakdown →
Visioning Human-Agentic AI Teaming: Continuity, Tension, and Future Research
AI & ML

Visioning Human-Agentic AI Teaming: Continuity, Tension, and Future Research

ai & ml
2 min read★★★☆☆
Read Breakdown →
HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel
AI & ML

HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel

ai & ml
2 min read★★★☆☆
Read Breakdown →
Evaluating the Search Agent in a Parallel World
TECH BUSINESS

Evaluating the Search Agent in a Parallel World

tech business
2 min read★★★☆☆
Read Breakdown →
MOOSEnger -- a Domain-Specific AI Agent for the MOOSE Ecosystem
AI & ML

MOOSEnger -- a Domain-Specific AI Agent for the MOOSE Ecosystem

ai & ml
2 min read★★★☆☆
Read Breakdown →
Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction
TECH BUSINESS

Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction

tech business
2 min read★★★☆☆
Read Breakdown →
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling
TECH BUSINESS

Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling

tech business
2 min read★★★☆☆
Read Breakdown →
EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue
ENGINEERING

EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue

engineering
2 min read★★★☆☆
Read Breakdown →
LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks
AI & ML

LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks

ai & ml
2 min read★★★☆☆
Read Breakdown →
VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment
AI & ML

VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

ai & ml
2 min read★★★☆☆
Read Breakdown →
Design Behaviour Codes (DBCs): A Taxonomy-Driven Layered Governance Benchmark for Large Language Models
AI & ML

Design Behaviour Codes (DBCs): A Taxonomy-Driven Layered Governance Benchmark for Large Language Models

ai & ml
2 min read★★★☆☆
Read Breakdown →
On Multi-Step Theorem Prediction via Non-Parametric Structural Priors
TECH BUSINESS

On Multi-Step Theorem Prediction via Non-Parametric Structural Priors

tech business
2 min read★★★☆☆
Read Breakdown →
Causally Robust Reward Learning from Reason-Augmented Preference Feedback
TECH BUSINESS

Causally Robust Reward Learning from Reason-Augmented Preference Feedback

tech business
2 min read★★★☆☆
Read Breakdown →
K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation
TECH BUSINESS

K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation

tech business
2 min read★★★☆☆
Read Breakdown →
SEA-TS: Self-Evolving Agent for Autonomous Code Generation of Time Series Forecasting Algorithms
ENGINEERING

SEA-TS: Self-Evolving Agent for Autonomous Code Generation of Time Series Forecasting Algorithms

engineering
2 min read★★★☆☆
Read Breakdown →
AI tools can unmask anonymous accounts
TECH BUSINESS

AI tools can unmask anonymous accounts

tech business
2 min read★★★☆☆
Read Breakdown →
NotebookLM can now summarize research in ‘cinematic’ video overviews
AI & ML

NotebookLM can now summarize research in ‘cinematic’ video overviews

ai & ml
2 min read★★★☆☆
Read Breakdown →