OPENAI perplexed while new models show increasing hallucination rates

The latest models of OpenAi, O3 and O4-Mini, more frequently hallucinate than the old company AI systems, according to internal tests and third-party research. On the Personqa of Openai reference, O3 hallucinated 33%of the time – double the rate of older models O1 (16%) and O3 -Mini (14.8%). The O4-Mini still made it possible to perform, hallucinating 48% of the time. The AI non -profit AI Laboratory has discovered the O3 manufacturing processes that it claimed to use, including the current code on a MacBook Pro 2021 “outside of Chatgpt”. Stanford’s auxiliary professor Kian Katanforoosh, noted that his team had found that O3 frequently generates broken website links.

OPENAI says in its technical report that “more research is necessary” to understand why the hallucinations aggravate as the reasoning models increase.

OPENAI perplexed while new models show increasing hallucination rates

Lewis Hamilton groans that he is “nowhere” near his Formula One Rival after another disappointing outing in Saudi Arabia – while Ferrari’s star admits that he can only “pray” for a good result in the breed of Jeddah

Young AMEX card holders have best Fico scores, said the CEO

Young AMEX card holders have best Fico scores, said the CEO