xAnts

OpenAI's DeepResearch can complete 26% of 'Humanity's Last Exam' — a benchmark for the frontier of human knowledge

2025-02-12Fortune on MSN.com

OpenAI's DeepResearch can complete 26% of 'Humanity's Last Exam' — a benchmark for the frontier of human knowledge

OpenAI's o1 and DeepSeek's R1 models, which previously sat atop the leaderboard, could only get through roughly 9% of the exam. ...Read more

Recommendations

Wiki Finance Expo Hong Kong 2025: Asia's premier fintech and Web3.0 summit returns on March 27
2025-02-07MSN
L&T Fin to acquire Paul Merchants Finance's gold loan biz for Rs 537 cr
2025-02-07Business Standard
Chocolate Finance whets Singaporeans' appetite for higher cash yields; assets approach S$1 billion
2025-02-11Business Times
5 Altcoins Ready to Go Ballistic in the 2025 Bull Run: Avalanche, Toncoin, Chainlink, Cardano, Rexas Finance (RXS)
2025-02-08MSN
Mutuum Finance vs XRP Price Prediction: Which Crypto Will 10x in 2025 and Why?
2025-02-16cryptopolitan on MSN.com
Mutuum Finance (MUTM) Set for an 18,544% Blowout as Dogecoin's (DOGE) Forecast for 2025 Looks Promising
2025-02-16MSN
Stocks in news: Eicher Motors, Lupin, MTAR Tech, AB Capital, Nykaa, Bata & SBFC Finance
2025-02-11Business Today on MSN.com
Brexit reset tested as Brussels and London fight about finance
2025-02-12Politico Europe
Workday debuts AI agents, with CEO saying they'll 'peacefully coexist' with humans rather than replace them
2025-02-11Fortune on MSN.com
Manappuram, Fusion, IIFL Finance, Repco Home shares fall up to 6% today; stock price targets
2025-02-14Business Today on MSN.com

Loading...