News

OpenAI has unveiled a large dataset to help test how well artificial intelligence (AI) models answer health care questions.
OpenAI Launches HealthBench, a Dataset That Benchmarks Health Care AI Models. This is a major leap by the ChatGPT creator into health care.
OpenAI has announced the launch of HealthBench, a benchmark to evaluate AI models in healthcare using real-world applicability and physician judgment. "The 5,000 conversations in HealthBench simulate ...
OpenAI, the maker of ChatGPT, released an open-source benchmark designed to measure the performance and safety of large language models in healthcare. The large data set, called HealthBench, goes ...
The dataset — called HealthBench — is OpenAI's first major independent health care project. It includes 5,000 “realistic health conversations,” each with detailed grading tools to evaluate ...
OpenAI, the creator of artificial intelligence chatbot ChatGPT, has a new open-source benchmark called HealthBench that lets the health care industry evaluate AI models, the company ...
OpenAI launched HealthBench to test AI health care responses. The dataset includes 5,000 health conversations and more than 57,000 criteria. Experts say it ...