Weekly active users of OpenAI's coding agent, 'Codex,' have increased by 400% since the start of 2026, surpassing 5 million. To support this momentum, OpenAI announced the acquisition of cloud ...
Anthropic has handed Petri, its open-source toolbox of AI alignment tests, to Meridian Labs. The company also released Petri 3.0, which expands how the open-source alignment-testing toolkit ...
A starting point for building your own developer portal on top of Azure API Management AI Gateways. Fork it, open it in VS Code with GitHub Copilot (or any coding agent), and shape it to fit your ...
The new unified desktop app comes at a time when OpenAI’s popularity is being challenged by Anthropic. OpenAI is planning to combine its AI chatbot, coding tool and web browser into a desktop ...
DeepEval is a simple-to-use, open-source framework for evaluating large language model (LLM) systems. It is similar to Pytest but specialized for unit testing LLM apps. DeepEval incorporates ...
PromptFoo conducted an experiment in which DeepSeek-R1 was tasked with answering 1,360 prompts covering 'sensitive topics in China.' These included the Taiwanese and Tibetan independence movements, ...
DeepSeek-R1, the viral open-source AI assistant recently released by a Chinese company, refuses to answer 85% of prompts on sensitive topics in Beijing, researchers have found. But restrictions can be ...