News
OpenAI pushes further into healthcare with release of HealthBench to evaluate AI models
By Heather Landi, May 13, 2025 1:00pm
Artificial Intelligence, OpenAI, generative AI ...
The dataset — called HealthBench — is OpenAI's first major independent health care project. It includes 5,000 “realistic health conversations,” each with detailed grading tools to evaluate ...
OpenAI has introduced HealthBench, a comprehensive dataset designed to assess how well AI models respond to health care-related questions. This release aims to enhance the evaluation of AI's ...
April 17, 2025: OpenAI has released o3 and o4-mini, two reasoning AI models designed to be extra good at programming, math, ...
OpenAI's HealthBench is a suite of text prompts concerning medical situations and conditions that could reasonably be submitted to a chatbot by a person seeking medical advice.
OpenAI launched HealthBench to test AI health care responses. The dataset includes 5,000 health conversations and more than 57,000 criteria. Experts say it ...
Experts say it improves AI evaluation but warn that more review is needed.
TUESDAY, May 13, 2025 (HealthDay News) - OpenAI has unveiled a large dataset to help test how well artificial intelligence (AI ...