OpenAI pledges to publish AI safety test results more often

Parfect News

Technology

OpenAI pledges to publish AI safety test results more often

sdtech2532@gmail.com

May 14, 2025

OpenAI is moving to publish the results of its internal AI model safety evaluations more regularly in what the outfit is pitching as an effort to increase transparency.

On Wednesday, OpenAI launched the Safety evaluations hub, a web page showing how the company’s models score on various tests for harmful content generation, jailbreaks, and hallucinations. OpenAI says that it’ll use the hub to share metrics on an “ongoing basis,” and that it intends to update the hub with “major model updates” going forward.

Introducing the Safety Evaluations Hub—a resource to explore safety results for our models.

While system cards share safety metrics at launch, the Hub will be updated periodically as part of our efforts to communicate proactively about safety.https://t.co/c8NgmXlC2Y

— OpenAI (@OpenAI) May 14, 2025

“As the science of AI evaluation evolves, we aim to share our progress on developing more scalable ways to measure model capability and safety,” wrote OpenAI in a blog post. “By sharing a subset of our safety evaluation results here, we hope this will not only make it easier to understand the safety performance of OpenAI systems over time, but also support community efforts⁠ to increase transparency across the field.”

OpenAI says that it may add additional evaluations to the hub over time.

In recent months, OpenAI has raised the ire of some ethicists for reportedly rushing the safety testing of certain flagship models and failing to release technical reports for others. The company’s CEO, Sam Altman, also stands accused of misleading OpenAI executives about model safety reviews prior to his brief ouster in November 2023.

Late last month, OpenAI was forced to roll back an update to the default model powering ChatGPT, GPT-4o, after users began reporting that it responded in an overly validating and agreeable way. X became flooded with screenshots of ChatGPT applauding all sorts of problematic, dangerous decisions and ideas.

OpenAI said that it would implement several fixes and changes to prevent future such incidents, including introducing an opt-in “alpha phase” for some models that would allow certain ChatGPT users to test the models and give feedback before launch.

Source link

OpenAI is moving to publish the results of its internal AI model safety evaluations more regularly in what the outfit is pitching as an effort to increase transparency.

Introducing the Safety Evaluations Hub—a resource to explore safety results for our models.

While system cards share safety metrics at launch, the Hub will be updated periodically as part of our efforts to communicate proactively about safety.https://t.co/c8NgmXlC2Y

— OpenAI (@OpenAI) May 14, 2025

OpenAI says that it may add additional evaluations to the hub over time.

Source link

It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using ‘Content here, content here’, making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for ‘lorem ipsum’ will uncover many web sites still in their infancy.

The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using ‘Content here, content here’, making

The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using ‘Content here, content here’, making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for ‘lorem ipsum’ will uncover many web sites still in their infancy.

JP, Vajpayee, Morarji, Charan Singh…vs Mrs G

Fordoward Thinking

Bihar: Kings & kingmakers

Census, consensus

Census & fairness

What TN case tells us about the need to ease Centre-state friction

Parfect News

News Elementor

RECENT NEWS

‘Centre aims to enhance farmers income by increasing production’

Omics technologies crucial to address challenges in agriculture

’11 against the world’: Denny Hamlin backs up smack talk with Martinsville win

OpenAI pledges to publish AI safety test results more often

sdtech2532@gmail.com

RECENT POSTS

Why a16z VC believes that Cluely, the ‘cheat on everything’ startup, is the new blueprint for AI startups

Horoscope Today, June 27, 2025: Stars are aligned in favour of these zodiac signs | Astrology

Travis Kalanick is trying to buy Pony AI — and Uber might help

CATEGORIES

AI

Astrology

Business

CITHARA PATRA: The Shell | 50-Word Stories

Wholesale inflation dips to 13-month low of 0.85% in April

Leave a Reply Cancel reply

HELP/SUPPORT

USEFUL LINKS

SUBSCRIBE US