AI & Machine Learning
AI & MLShow HN: Rust compiler in PHP emitting x86-64 executables
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLGiving LLMs a personality is just good engineering
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLBetter JIT for Postgres
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSources: President Trump met with Coinbase CEO Brian Armstrong on March 3 before publicly admonishing banks over the GENIUS Act, echoing Coinbase's position
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAgentic Engineering Patterns
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLA CPU that runs entirely on GPU
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLog messages are mostly for the people operating your software
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSecurity researchers successfully prompted the AI behind a Utah prescription renewal pilot to reclassify meth as an “unrestricted therapeutic”, and more
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLFederated Inference: Toward Privacy-Preserving Collaborative and Incentivized Model Serving
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLEngineering Reasoning and Instruction (ERI) Benchmark: A Large Taxonomy-driven Dataset for Foundation Models and Agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLEstimating Visual Attribute Effects in Advertising from Observational Data: A Deepfake-Informed Double Machine Learning Approach
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLCan machines be uncertain?
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLCOOL-MC: Verifying and Explaining RL Policies for Platelet Inventory Management
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLVL-KGE: Vision-Language Models Meet Knowledge Graph Embeddings
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLDiagnosing Retrieval vs. Utilization Bottlenecks in LLM Agent Memory
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLPRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLRevealing Positive and Negative Role Models to Help People Make Good Decisions
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLNeuroProlog: Multi-Task Fine-Tuning for Neurosymbolic Mathematical Reasoning via the Cocktail Effect
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLA Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLiveAgentBench: Comprehensive Benchmarking of Agentic Systems Across 104 Real-World Challenges
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSUN: Shared Use of Next-token Prediction for Efficient Multi-LLM Disaggregated Serving
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSee and Remember: A Multimodal Agent for Web Traversal
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSorryDB: Can AI Provers Complete Real-World Lean Theorems?
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLRetrieval-Augmented Robots via Retrieve-Reason-Act
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLA Natural Language Agentic Approach to Study Affective Polarization
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLEvoSkill: Automated Skill Discovery for Multi-Agent Systems
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLRethinking Code Similarity for Automated Algorithm Design with LLMs
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAgentified Assessment of Logical Reasoning Agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLGuideline-Grounded Evidence Accumulation for High-Stakes Agent Verification
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLLM-based Argument Mining meets Argumentation and Description Logics: a Unified Framework for Reasoning about Debates
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLRetrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLRxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLIs Retraining-Free Enough? The Necessity of Router Calibration for Efficient MoE Compression
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSelf-Play Only Evolves When Self-Synthetic Pipeline Ensures Learnable Information Gain
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLNExT-Guard: Training-Free Streaming Safeguard without Token-Level Labels
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLForecasting as Rendering: A 2D Gaussian Splatting Framework for Time Series Forecasting
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLMedFeat: Model-Aware and Explainability-Driven Feature Engineering with LLMs for Clinical Tabular Prediction
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLCharacterizing and Predicting Wildfire Evacuation Behavior: A Dual-Stage ML Approach
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSubspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLScaling Reward Modeling without Human Supervision
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLEfficient Sparse Selective-Update RNNs for Long-Range Sequence Modeling
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLRouting Absorption in Sparse Attention: Why Random Gates Are Hard to Beat
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLNeural Paging: Learning Context Management Policies for Turing-Complete Agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSafety Training Persists Through Helpfulness Optimization in LLM Agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLGeneralized Discrete Diffusion with Self-Correction
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLPhysics-Informed Neural Networks with Architectural Physics Embedding for Large-Scale Wave Field Reconstruction
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLBeyond Binary Preferences: A Principled Framework for Reward Modeling with Ordinal Feedback
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAdaptive Personalized Federated Learning via Multi-task Averaging of Kernel Mean Embeddings
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLStructured vs. Unstructured Pruning: An Exponential Gap
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLTalking with Verifiers: Automatic Specification Generation for Neural Network Verification
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLCUDABench: Benchmarking LLMs for Text-to-CUDA Generation
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLConcept Heterogeneity-aware Representation Steering
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLength Generalization Bounds for Transformers
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLHigh-order Knowledge Based Network Controllability Robustness Prediction: A Hypergraph Neural Network Approach
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLBoosting Meta-Learning for Few-Shot Text Classification via Label-guided Distance Scaling
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLPRISM: Exploring Heterogeneous Pretrained EEG Foundation Model Transfer to Clinical Differential Diagnosis
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLGraph Attention Based Prioritization of Disease Responsible Genes from Multimodal Alzheimer's Network
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLA Comparative Study of UMAP and Other Dimensionality Reduction Methods
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLTemporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLNumber Research Inc
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLCoruna: The Mysterious Journey of a Powerful iOS Exploit Kit
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLA pretty looking web for a quantum mechanics tool
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLSpeculative Speculative Decoding (SSD)
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLThe largest acidic geyser has been putting on quite a show
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLWeave – A language aware merge algorithm based on entities
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLMount Mayhem at Netflix: Scaling Containers on Modern CPUs
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLTikTok will not introduce end-to-end encryption, saying it makes users less safe
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLCalifornia's Digital Age Assurance Act, and FOSS
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLClaude's Cycles [pdf]
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLaunch HN: Cekura (YC F24) – Testing and monitoring for voice and chat AI agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLDon't make me talk to your chatbot
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLGoogle launches Gemini 3.1 Flash-Lite, which it says delivers “enhanced performance” at a fraction of the cost of larger models and outperforms 2.5 Flash
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLOpenAI releases GPT-5.3 Instant, which it says delivers more accurate answers and better-contextualized results when searching the web, for all ChatGPT users
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLOpenAI says GPT-5.3 Instant's tone should feel less “cringe” than GPT-5.2 Instant and the model has a smoother, more to-the-point conversational style
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLZiff Davis agrees to sell its Connectivity division, including Ookla and Downdetector, to Accenture for $1.2B in cash, to focus on enthusiast websites like IGN
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLClaude is an Electron App because we’ve lost native
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLDon Knuth's "Claude-like" directed Hamiltonian cycles decompositions
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLNew MacBook Airs come with M5, double the storage, and higher starting prices
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLLMs can unmask pseudonymous users at scale with surprising accuracy
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLResearch roundup: Six cool science stories we almost missed
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLApple's new iPhone 17e has an A19 chip, MagSafe, and 256GB of storage for $599
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLClaude Code rolls out a voice mode capability
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLChatGPT’s new GPT-5.3 Instant model will stop telling you to calm down
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAnthropic’s Claude reports widespread outage
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[D] Quantified analysis of 2,218 Gary Marcus claims - two independent LLM pipelines, scored against evidence
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[P] I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[D] How much time do you actually lose trying to reproduce ML papers?
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[R] Boundary-Metric Evaluation for Thin-Structure Segmentation under 2% Foreground Sparsity
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[P] Bridging the gap between arXiv PDFs and runnable implementations: Announcing ResearchClaw (Open Source)
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[R] How often do you implement research papers?
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[P] *Free Code* Real-time voice-to-voice with your LLM & full reasoning LLM interface (Telegram + 25 tools, vision, docs, memory) on a Mac Studio running Qwen 3.5 35B — 100% local, zero API cost. Full build open-sourced. cloudfare + n8n + Pipecat + MLX unlock insane possibilities on consumer hardwar
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[D] How to get credits to run experiments on closed source models as a student researcher.
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[R] Toward Guarantees for Clinical Reasoning in Vision Language Models via Formal Verification
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[P] On-device Qwen3-TTS (1.7B/0.6B) inference on iOS and macOS via MLX-Swift — voice cloning, voice design, and streaming TTS with no cloud
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[R] Benchmarked 94 LLM endpoints for jan 2026. open source is now within 5 quality points of proprietary
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[R] CVPR 2026 Camera Ready Paper
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & ML[R] Tiny transformers (<100 params) can add two 10-digit numbers to 100% accuracy
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLChatGPT Uninstalls Surge 295% After OpenAI’s DoD Deal Sparks Backlash
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLWarning: Trae IDE's New Token Pricing Destroyed My Workflow Overnight – Don't Get Caught Off Guard
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLScientists made AI agents ruder — and they performed better at complex reasoning tasks
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLearning how to steer agentic AI in the right direction is a useless skill #changemymind
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLClaude hits No. 1 on App Store as ChatGPT users defect in show of support for Anthropic's Pentagon stance
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLMulti-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLDIG to Heal: Scaling General-purpose Agent Collaboration via Explainable Dynamic Decision Paths
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLEmCoop: A Framework and Benchmark for Embodied Cooperation Among LLM Agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLConservative Equilibrium Discovery in Offline Game-Theoretic Multiagent Reinforcement Learning
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLConfusion-Aware Rubric Optimization for LLM-based Automated Grading
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLMED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLOptimizing In-Context Demonstrations for LLM-based Automated Grading
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLFrom Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLDenoiseFlow: Uncertainty-Aware Denoising for Reliable LLM Agentic Workflows
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLOGIGEN: Logic-Driven Generation of Verifiable Agentic Tasks
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAdvancing Multimodal Judge Models through a Capability-Oriented Benchmark and MCTS-Driven Data Generation
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLDraft-Thinking: Learning Efficient Reasoning in Long Chain-of-Thought LLMs
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLFair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLLiTS: A Modular Framework for LLM Tree Search
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLInfoPO: Information-Driven Policy Optimization for User-Centric Agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLK^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLMemPO: Self-Memory Policy Optimization for Long-Horizon Agents
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLGoogle’s latest Pixel drop allows Gemini to order groceries for you and more
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLHow the experts figure out what’s real in the age of deepfakes
ai & ml
2 min read★★★☆☆
Read Breakdown → AI & MLAnthropic upgrades Claude’s memory to attract AI switchers
ai & ml
2 min read★★★☆☆
Read Breakdown →