OpenAI’s Deep Research beats DeepSeek on the hardest AI exam



On Sunday, OpenAI unveiled Deep Research, an agentic AI tool that can conduct multi-step research on the internet for complex tasks. The ChatGPT maker says the tool can simulate a human research analyst and claims that what the agent accomplishes in ten minutes would take a human several hours.

And it now seems the tool is living up to the hype. According to shared benchmark results for Humanity’s Last Exam, arguably the hardest AI exam, which was released less than two weeks ago, Deep Research holds a significant lead over ChatGPT o3-mini and DeepSeek’s V3-powered R1 model (via TechRadar).


