OpenAI says the cyber capabilities of its frontier AI models are accelerating and warns Wednesday that upcoming models are ...
The SWE-Bench Verified evaluation is basically a test of AI processing accuracy. It measures how well the AI solves a set of coding problems. According to OpenAI, GPT-5.1-Codex-Max "reaches the same ...
OpenAI has reported a surge in performance as GPT-5.1-Codex-Max reaching 76% in capability assessments, and warned of ...
OpenAI has shipped new products at a relentless clip in the second half of 2025. Not only has the company released several new AI models, but also new features within ChatGPT, an AI-powered web ...
Interesting Engineering on MSN
OpenAI adds layered safeguards as frontier AI reaches higher cyber capability
In anticipation, OpenAI says it is preparing safeguards as if every new model could reach that threshold, ensuring progress ...
Check Point Research has found a flaw in OpenAI’s AI coding tool, Codex, that would allow bad actors to exfiltrate data ...
A major supply chain vulnerability in the OpenAI Codex CLI has been patched after discovery by Check Point Research.
OpenAI launched Codex, an AI tool to write codes and fix bugs for developers. As an AI Agent, Codex could also help users with an Amazon order or a dinner reservation. Codex and GPT-4.5, which was ...
The competitive stakes have intensified as Google's Gemini 3 topped LMArena leaderboards and earned widespread praise for ...
American AI giants are backing a new effort to establish open standards for building agentic software and tools.
For the first time, OpenAI is publishing a report describing the use of its own products in companies. It is of little help ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results