An Example of Model Testing

ChatGPT unexpectedly began speaking in a user’s cloned voice during testing

On Thursday, OpenAI released the “system card” for ChatGPT’s new GPT-4o AI model that details model limitations and safety testing procedures. Among other examples, the document reveals that in rare ...

Ministry of Testing

The future of testing: Autonomous agents, ethical AI, and human oversight

Understand why testing must evolve beyond deterministic checks to assess fairness, accountability, resilience and ...

TechCrunch

DeepSeek’s updated R1 AI model is more censored, test finds

Chinese AI startup DeepSeek’s newest AI model, an updated version of the company’s R1 reasoning model, achieves impressive scores on benchmarks for coding, math, and general knowledge, nearly ...

TechCrunch

A safety institute advised against releasing an early version of Anthropic’s Claude Opus 4 AI model

A third-party research institute that Anthropic partnered with to test one of its new flagship AI models, Claude Opus 4, recommended against deploying an early version of the model due to its tendency ...
