Agentic Bio-Capabilities Benchmark Released for AI Laboratory and Biosecurity Evaluation

Agentic Bio-Capabilities Benchmark Released for AI Laboratory and Biosecurity Evaluation

Researchers introduced the Agentic Bio-Capabilities Benchmark (ABC-Bench) in a study, Sciencecast reported. The benchmark comprises a suite of tasks that require large language models to program liquid-handling robots, design DNA fragments, and evade DNA synthesis screening. When evaluated, the AI agents outperformed average human experts on these challenges.

The results also highlighted the dual-use nature of the technology, raising concerns about potential biosecurity risks. The release provides a standardized tool for measuring AI capabilities in laboratory and bioinformatics contexts.

Actors

OpenAI scientists

Locations

No records

Articles

June 10, 2026
1 total
New Benchmark Evaluates AI Performance on Biological Tasks and Biosecurity Risks
New Benchmark Evaluates AI Performance on Biological Tasks and Biosecurity Risks

Sciencecast • 10 Jun 03:12

Researchers introduced ABC-Bench, a suite of tests that measures large language models' ability to conduct laboratory and bioinformatics tasks, finding AI agents outperform average human experts while highlighting potential dual-use concerns.

Credibility 90% Manip. 10% Center