Promptfoo Examples - Search News

Why did OpenAI acquire Ona instead of building it in-house? — The dawn of the AI agent 'away from desk' era

Weekly active users of OpenAI's coding agent, 'Codex,' have increased by 400% since the start of 2026, surpassing 5 million. To support this momentum, OpenAI announced the acquisition of cloud ...

winbuzzer.com

Petri: Anthropic Hands Its Alignment Toolbox to Meridian Labs with 3.0 Update

Anthropic has handed Petri, its open-source toolbox of AI alignment tests, to Meridian Labs. The company also released Petri 3.0, a change that expands how the open-source alignment-testing toolkit ...

GitHub

AI Gateway Dev Portal

A starting point for building your own developer portal on top of Azure API Management AI Gateways. Fork it, open it in VS Code with GitHub Copilot (or any coding agent), and shape it to fit your ...

SiliconRepublic

OpenAI reportedly taking on Anthropic with new ‘superapp’

The new unified desktop app comes at a time when OpenAI’s popularity is being challenged by Anthropic. OpenAI is planning to combine its AI chatbot, coding tool and web browser into a desktop ...

GitHub

The LLM Evaluation Framework

DeepEval is a simple-to-use, open-source LLM evaluation framework, for evaluating large-language model systems. It is similar to Pytest but specialized for unit testing LLM apps. DeepEval incorporates ...

GIGAZINE

'DeepSeek-R1' refuses to answer 85% of sensitive topics about China, but points out that restrictions can be easily circumvented

PromptFoo conducted an experiment in which Deepseek-R1 was tasked with answering 1,360 prompts covering 'sensitive topics in China.' These included the Taiwanese and Tibetan independence movements, ...

cybernews

DeepSeek indeed censors sensitive prompts about China, but there’s a workaround

DeepSeek-R1, the viral open-source AI assistant recently released by a Chinese company, refuses to answer 85% of prompts on sensitive topics in Beijing, researchers have found. But restrictions can be ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results