OpenAI on April 24, 2026, released GPT-5.5, calling it the company’s first base model trained entirely from scratch rather ...
OpenAI’s GPT-5.5 has posted an 82.7% score on Terminal-Bench 2.0, a benchmark that throws AI agents into difficult, ...
Tencent just open-sourced Hy3 preview, a model that punches above its weight on coding agents, reasoning, and search—built in ...
Chinese AI company MiniMax has released the weights for MiniMax M2.7, a 229-billion-parameter Mixture-of-Experts model that participated in its own development cycle – marking what the company calls ...
OpenAI scientists have designed MLE-bench — a compilation of 75 extremely difficult tests that can assess whether a future advanced AI agent is capable of modifying its own code and improving itself.