AI has appeared to score poorly on lesser-used tests involving visual pattern recognition
- A team of technology experts issued a global call on Monday seeking the toughest questions to pose to artificial intelligence systems, which increasingly have handled popular benchmark tests like child's play.
Hendrycks co-authored two 2021 papers that proposed tests of AI systems that are now widely used, one quizzing them on undergraduate-level knowledge of topics like U.S. history, the other probing models' ability to reason through competition-level math. As one example, the Claude models from the AI lab Anthropic have gone from scoring about 77% on the undergraduate-level test in 2023, to nearly 89% a year later, according to a prominent capabilities leaderboard.AI has appeared to score poorly on lesser-used tests involving plan formulation and visual pattern-recognition puzzles, according to Stanford University’s AI Index Report from April.
The exam will include at least 1,000 crowd-sourced questions due November 1 that are hard for non-experts to answer. These will undergo peer review, with winning submissions offered co-authorship and up to $5,000 prizes sponsored by Scale AI.
AI Humanity's Last Exam Chatgpt Center For AI Safety (CAIS) Startup Scale AI
پاکستان تازہ ترین خبریں, پاکستان عنوانات
Similar News:آپ اس سے ملتی جلتی خبریں بھی پڑھ سکتے ہیں جو ہم نے دوسرے خبروں کے ذرائع سے جمع کی ہیں۔
Contentious California AI bill passes legislature, awaits governor's signatureBill mandates safety testing for advanced AI models, tech companies have largely balked at it
مزید پڑھ »
Llama AI models being used by banks, tech companiesLlama AI models being used by banks, tech companies
مزید پڑھ »
Google appoints former Character.AI founder as co-lead of its AI modelsCharacter.AI has raised $193 million and was valued at $1 billion last year
مزید پڑھ »
Honda 125 2025 Model launched in Pakistan; Check Price and FeaturesGet ready to experience thrill with the all-new Honda CG 125 which comes with same powerful performance but with new stickers.
مزید پڑھ »
Pakistani technology experts attend world AI moot in Saudi ArabiaThis is the third edition of the Global AI Summit
مزید پڑھ »
IT experts call for joint AI ventures with Saudi firms in smart cities, healthcareGlobal Artificial Intelligence (AI) summit in Riyadh
مزید پڑھ »