Models

LLMs, foundation models, and model releases

34 articles

Mozilla says Claude Opus 4.6 found 100+ bugs in Firefox in two weeks in January, 14 of them high-severity, more than the bugs typically reported in two months
AI & ML
2 min read★★★☆☆
Hardening Firefox with Anthropic’s Red Team
AI & ML
3 min read★★★☆☆
Clinejection — Compromising Cline’s Production Releases just by Prompting an Issue Triager
AI & ML
2 min read★★★☆☆
QwenLM/Qwen-Agent — Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpr
OPEN SOURCE
2 min read★★★☆☆
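lingfengQAQ/webnovel-writer — A Claude Code-based writing-assistant system for long-form web novels that addresses the "forgetting" and "hallucination" problems in AI writing and supports serialized works on the scale of 2 million characters.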
OPEN SOURCE
2 min read★★★☆☆
openai/skills — Skills Catalog for Codex
OPEN SOURCE
2 min read★★★☆☆
OpenAI says GPT-5.4 produces presentations with stronger, more varied aesthetics and makes more effective use of its image generation tools
AI & ML
2 min read★★★☆☆
GPT-5.4 is priced at $2.50/1M input and $15/1M output tokens while GPT-5.4 Pro is $30/1M input and $180/1M output tokens, more than GPT-5.2 and GPT-5.2 Pro
AI & ML
2 min read★★★☆☆
OpenAI says GPT-5.4's “individual claims are 33% less likely to be false and its full responses are 18% less likely to contain any errors, relative to GPT-5.2”
AI & ML
2 min read★★★☆☆
OpenAI says users can now use ChatGPT directly in Microsoft Excel and Google Sheets and debuts a suite of financial-services tools to better tackle office work
AI & ML
2 min read★★★☆☆
ByteDance's Seedance 2.0 AI model is held back by limited compute resources that create a bottleneck, forcing users to wait hours to generate a single video
TECH BUSINESS
2 min read★★★☆☆
Anthropic says Claude's free active users grew 60%+ and daily signups grew 4x since the start of the year, with Monday being its strongest day ever
AI & ML
2 min read★★★☆☆
Cursor launches Automations, a new tool that lets users automatically launch agents triggered through new additions to a codebase, a Slack message, or a timer
AI & ML
2 min read★★★☆☆
OpenAI introduces GPT-5.4 with more knowledge-work capability
AI & ML
2 min read★★★☆☆
Large genome model: Open source AI trained on trillions of bases
OPEN SOURCE
2 min read★★★☆☆
Lawsuit: Google Gemini sent man on violent missions, set suicide "countdown"
AI & ML
2 min read★★★☆☆
OpenAI launches GPT-5.4 with Pro and Thinking versions
AI & ML
2 min read★★★☆☆
ChatGPT uninstalls surged by 295% after DoD deal
AI & ML
2 min read★★★☆☆
Users are ditching ChatGPT for Claude — here’s how to make the switch
AI & ML
2 min read★★★☆☆
Anthropic’s Claude reports widespread outage
AI & ML
2 min read★★★☆☆
TheCraigHewitt/seomachine — A specialized Claude Code workspace for creating long-form, SEO-optimized blog content for any busin
OPEN SOURCE
2 min read★★★☆☆
inclusionAI/AReaL — Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
OPEN SOURCE
2 min read★★★☆☆
Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding
AI & ML
2 min read★★★☆☆
Adaptive Memory Admission Control for LLM Agents
AI & ML
2 min read★★★☆☆
ECG-MoE: Mixture-of-Expert Electrocardiogram Foundation Model
TECH BUSINESS
2 min read★★★☆☆
Towards automated data analysis: A guided framework for LLM-based risk estimation
AI & ML
2 min read★★★☆☆
When Agents Persuade: Propaganda Generation and Mitigation in LLMs
AI & ML
2 min read★★★☆☆
Using Vision + Language Models to Predict Item Difficulty
AI & ML
2 min read★★★☆☆
Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models
ENGINEERING
2 min read★★★☆☆
Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery
TECH BUSINESS
2 min read★★★☆☆
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling
TECH BUSINESS
2 min read★★★☆☆
LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks
AI & ML
2 min read★★★☆☆
VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment
AI & ML
2 min read★★★☆☆
OpenAI’s new GPT-5.4 model is a big step toward autonomous agents
AI & ML
2 min read★★★☆☆