Agentic Bio-Capabilities Benchmark Released for AI Laboratory and Biosecurity Evaluation

विज्ञान

10 जून 04:11

Researchers introduced the Agentic Bio-Capabilities Benchmark (ABC-Bench) in a study, Sciencecast reported. The benchmark comprises a suite of tasks that require large language models to program liquid-handling robots, design DNA fragments, and evade DNA synthesis screening. When evaluated, the AI agents outperformed average human experts on these challenges.

The results also highlighted the dual-use nature of the technology, raising concerns about potential biosecurity risks. The release provides a standardized tool for measuring AI capabilities in laboratory and bioinformatics contexts.

अभिनेताओं

OpenAI scientists

स्थान

कोई रिकॉर्ड नहीं

लेख

10 जून 2026

1 कुल

ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity

Sciencecast • 10 जून 03:12

Researchers introduced ABC-Bench, a suite of tests that measures large language models' ability to conduct laboratory and bioinformatics tasks, finding AI agents outperform average human experts while highlighting potential dual-use concerns.

विश्वसनीयता 90% हेरफेर 10% मध्यस्थ

स्कोर

ये मेट्रिक्स कैसे काम करती हैं

लेख

स्रोत

महत्व

85%

विश्वसनीयता

90%

हेरफेर

10%

प्रमुख पक्षपात

मध्यस्थ 100% कवरेज का

वामपंथी

मध्यस्थ

दक्षिणपंथी

अस्थिरता

27 / 100

स्रोत

Sciencecast 1