We have to stop ignoring AI’s hallucination problem – The Verge
Google I/O introduced an AI assistant that can see and hear the world, while OpenAI put its version of a Her-like chatbot into an iPhone. Next week, Microsoft will be hosting Build, where its sure to have some version of Copilot or Cortana that understands pivot tables. Then, a few weeks after that, Apple will host its own developer conference, and if the buzz is anything to go by, itll be talking about artificial intelligence, too. (Unclear if Siri will be mentioned.)
AI is here! Its no longer conceptual. Its taking jobs, making a few new ones, and helping millions of students avoid doing their homework. According to most of the major tech companies investing in AI, we appear to be at the start of experiencing one of those rare monumental shifts in technology. Think the Industrial Revolution or the creation of the internet or personal computer. All of Silicon Valley of Big Tech is focused on taking large language models and other forms of artificial intelligence and moving them from the laptops of researchers into the phones and computers of average people. Ideally, they will make a lot of money in the process.
But I cant really care about that because Meta AI thinks I have a beard.
I want to be very clear: I am a cis woman and do not have a beard. But if I type show me a picture of Alex Cranz into the prompt window, Meta AI inevitably returns images of very pretty dark-haired men with beards. I am only some of those things!
Meta AI isnt the only one to struggle with the minutiae of The Verges masthead. ChatGPT told me yesterday I dont work at The Verge. Googles Gemini didnt know who I was (fair), but after telling me Nilay Patel was a founder of The Verge, it then apologized and corrected itself, saying he was not. (I assure you he was.)
The AI keeps screwing up because these computers are stupid. Extraordinary in their abilities and astonishing in their dimwittedness. I cannot get excited about the next turn in the AI revolution because that turn is into a place where computers cannot consistently maintain accuracy about even minor things.
I mean, they even screwed up during Googles big AI keynote at I/O. In a commercial for Googles new AI-ified search engine, someone asked how to fix a jammed film camera, and it suggested they open the back door and gently remove the film. That is the easiest way to destroy any photos youve already taken.
An AIs difficult relationship with the truth is called hallucinating. In extremely simple terms: these machines are great at discovering patterns of information, but in their attempt to extrapolate and create, they occasionally get it wrong. They effectively hallucinate a new reality, and that new reality is often wrong. Its a tricky problem, and every single person working on AI right now is aware of it.
One Google ex-researcher claimed it could be fixed within the next year (though he lamented that outcome), and Microsoft has a tool for some of its users thats supposed to help detect them. Googles head of Search, Liz Reid, told The Verge its aware of the challenge, too. Theres a balance between creativity and factuality with any language model, she told my colleague David Pierce. Were really going to skew it toward the factuality side.
But notice how Reid said there was a balance? Thats because a lot of AI researchers dont actually think hallucinations can besolved. A study out of the National University of Singapore suggested that hallucinations are an inevitable outcome of all large language models. Just as no person is 100 percent right all the time, neither are these computers.
And thats probably why most of the major players in this field the ones with real resources and financial incentive to make us all embrace AI think you shouldnt worry about it. During Googles IO keynote, it added, in tiny gray font, the phrase check responses for accuracy to the screen below nearly every new AI tool it showed off a helpful reminder that its tools cant be trusted, but it also doesnt think its a problem. ChatGPT operates similarly. In tiny font just below the prompt window, it says, ChatGPT can make mistakes. Check important info.
Thats not a disclaimer you want to see from tools that are supposed to change our whole lives in the very near future! And the people making these tools do not seem to care too much about fixing the problem beyond a small warning.
Sam Altman, the CEO of OpenAI who was briefly ousted for prioritizing profit over safety, went a step further and said anyone who had an issue with AIs accuracy was naive. If you just do the naive thing and say, Never say anything that youre not 100 percent sure about, you can get them all to do that. But it wont have the magic that people like so much, he told a crowd at Salesforces Dreamforce conference last year.
This idea that theres a kind of unquantifiable magic sauce in AI that will allow us to forgive its tenuous relationship with reality is brought up a lot by the people eager to hand-wave away accuracy concerns. Google, OpenAI, Microsoft, and plenty of other AI developers and researchers have dismissed hallucination as a small annoyance that should be forgiven because theyre on the path to making digital beings that might make our own lives easier.
But apologies to Sam and everyone else financially incentivized to get me excited about AI. I dont come to computers for the inaccurate magic of human consciousness. I come to them because they are very accurate when humans are not. I dont need my computer to be my friend; I need it to get my gender right when I ask and help me not accidentally expose film when fixing a busted camera. Lawyers, I assume, would like it to get the case law right.
I understand where Sam Altman and other AI evangelists are coming from. There is a possibility in some far future to create a real digital consciousness from ones and zeroes. Right now, the development of artificial intelligence is moving at an astounding speed that puts many previous technological revolutions to shame. There is genuine magic at work in Silicon Valley right now.
But the AI thinks I have a beard. It cant consistently figure out the simplest tasks, and yet, its being foisted upon us with the expectation that we celebrate the incredible mediocrity of the services these AIs provide. While I can certainly marvel at the technological innovations happening, I would like my computers not to sacrifice accuracy just so I have a digital avatar to talk to. That is not a fair exchange its only an interesting one.
Follow this link:
We have to stop ignoring AI's hallucination problem - The Verge
- Debate over future of US AI regulation hinges on broadband funding - Reuters - June 26th, 2025 [June 26th, 2025]
- Forget about AI costs: Google just changed the game with open-source Gemini CLI that will be free for most developers - VentureBeat - June 26th, 2025 [June 26th, 2025]
- How ChatGPT and other AI tools are changing the teaching profession - AP News - June 26th, 2025 [June 26th, 2025]
- AI valuations are verging on the unhinged - The Economist - June 26th, 2025 [June 26th, 2025]
- Newly minted PhDs in AI nabbing six- and seven-figure paydays - Fortune - June 26th, 2025 [June 26th, 2025]
- Ring debuts Video Descriptions, Gen AI-powered updates on whats happening at home - AboutAmazon.com - June 26th, 2025 [June 26th, 2025]
- AI Regulations: Lawmaker Says Ban on State AI Rules Will Survive in Some Version in Budget Bill - PYMNTS.com - June 26th, 2025 [June 26th, 2025]
- Blacklisted by the U.S. and backed by Beijing, this Chinese AI startup has caught OpenAI's attention - CNBC - June 26th, 2025 [June 26th, 2025]
- 15 new jobs AI is creating - including 'Synthetic reality producer' - ZDNET - June 26th, 2025 [June 26th, 2025]
- Ohio man used AI-generated porn to harass exes and their moms, prosecutors say - The Columbus Dispatch - June 26th, 2025 [June 26th, 2025]
- Over 40% of agentic AI projects will be scrapped by 2027, Gartner says - Reuters - June 26th, 2025 [June 26th, 2025]
- Flood of AI-generated resumes causes chaos for recruiters, who resort to AI to screen them - Mashable - June 26th, 2025 [June 26th, 2025]
- And Now Malware That Tells AI to Ignore It? - Dark Reading - June 26th, 2025 [June 26th, 2025]
- Walmart unveils new AI tools for workers. Here's what they'll do. - USA Today - June 26th, 2025 [June 26th, 2025]
- Meet Project Rainier, Amazons one-of-a-kind machine ushering in the next generation of AI - AboutAmazon.com - June 26th, 2025 [June 26th, 2025]
- NHL AI mock draft: AI predicts the first round of the 2025 NHL Draft - USA Today - June 26th, 2025 [June 26th, 2025]
- Anthropic destroyed millions of print books to build its AI models - Ars Technica - June 26th, 2025 [June 26th, 2025]
- Satya Nadella: The hardest part of AI isn't the tech. It's getting people to change how they work. - Business Insider - June 26th, 2025 [June 26th, 2025]
- Microsoft sued by authors over use of books in AI training - Reuters - June 26th, 2025 [June 26th, 2025]
- Sitchs new dating app fuses human matchmaking and AI - TechCrunch - June 26th, 2025 [June 26th, 2025]
- Japanese company using mee-AI-ow to detect stressed cats - theregister.com - June 26th, 2025 [June 26th, 2025]
- Hertz Is Using AI to Scan Your Rental Car for Damage, and It Might Cost You - Car and Driver - June 26th, 2025 [June 26th, 2025]
- Bipartisan bill seeks to ban Chinese AI from federal agencies, as U.S. vows to win the AI race - ABC News - Breaking News, Latest News and Videos - June 26th, 2025 [June 26th, 2025]
- AI Agents Are Getting Better at Writing Codeand Hacking It as Well - WIRED - June 26th, 2025 [June 26th, 2025]
- Rubrik to Acquire Predibase to Accelerate Agentic AI Adoption - Business Wire - June 26th, 2025 [June 26th, 2025]
- IBM sees enterprise customers are using 'everything' when it comes to AI, the challenge is matching the LLM to the right use case - VentureBeat - June 26th, 2025 [June 26th, 2025]
- Hundreds of MCP Servers Expose AI Models to Abuse, RCE - Dark Reading - June 26th, 2025 [June 26th, 2025]
- Amazon's Ring can now use AI to 'learn the routines of your residence' - theregister.com - June 26th, 2025 [June 26th, 2025]
- Apple Will Need to Leave Its M&A Comfort Zone to Succeed in AI - Bloomberg.com - June 24th, 2025 [June 24th, 2025]
- An AI video ad is making a splash. Is it the future of advertising? - NPR - June 24th, 2025 [June 24th, 2025]
- Should consumers and businesses use AI assistants? - Brookings - June 24th, 2025 [June 24th, 2025]
- I asked AI, Google Flights and a travel agent to find me the cheapest flight. Heres who won. - MarketWatch - June 24th, 2025 [June 24th, 2025]
- NotebookLM Is Still the Best AI Tool You're Missing Out On - CNET - June 24th, 2025 [June 24th, 2025]
- Meta Held Deal Talks With Startup Runway in AI Recruiting Push - Bloomberg.com - June 24th, 2025 [June 24th, 2025]
- The rise of the personal AI advisors - Fast Company - June 24th, 2025 [June 24th, 2025]
- OpenAIs first AI device with Jony Ive wont be a wearable - The Verge - June 24th, 2025 [June 24th, 2025]
- Court filings reveal OpenAI and ios early work on an AI device - TechCrunch - June 24th, 2025 [June 24th, 2025]
- MrBeast used AI to create YouTube thumbnails. People werent pleased - Fast Company - June 24th, 2025 [June 24th, 2025]
- AI is coming to the NFL, and it could transform the game - The New York Times - June 24th, 2025 [June 24th, 2025]
- Amazon to Invest Around $54 Billion in U.K. to Support Innovation, AI Push - WSJ - June 24th, 2025 [June 24th, 2025]
- This theory about Jony Ives AI hardware device seems increasingly likely - 9to5Mac - June 24th, 2025 [June 24th, 2025]
- MAGA Is Split Over the AI Provision in Trump's Big Beautiful Bill - Business Insider - June 24th, 2025 [June 24th, 2025]
- 5 Dividend Stocks Poised to Profit From the AI Efficiency Boom - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- Here are the overlooked ways to play AI, crypto and quantum trends, says this tech investor - MarketWatch - June 24th, 2025 [June 24th, 2025]
- Microsoft to Cut Thousands of Jobs as AI Spending Surges - Yahoo Finance - June 24th, 2025 [June 24th, 2025]
- The Oversight Board calls Meta's uneven AI moderation 'incoherent and unjustifiable' - Engadget - June 24th, 2025 [June 24th, 2025]
- 3 Phenomenal AI Stocks That Investors Should Load Up On - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- Stock-Split Watch: Is This AI Stock That's Soared 300% Next on the List? - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- I Asked ChatGPT To Explain How To Make Money Using AI Heres What It Said - Nasdaq - June 24th, 2025 [June 24th, 2025]
- 2 Top AI Stocks to Sell Before They Fall 57% and 8%, According to These Wall Street Analysts - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- AI's impact on the job market is inevitable, says workforce expert: 'It's going to hurt for certain parts of the population' - CNBC - June 24th, 2025 [June 24th, 2025]
- Leading AI models show up to 96% blackmail rate when their goals or existence is threatened, an Anthropic study says - Fortune - June 24th, 2025 [June 24th, 2025]
- Voters beware: 25 states restrict AI in elections. SC is in the other half. - News From The States - June 24th, 2025 [June 24th, 2025]
- Sphere Brings Its AI-Powered Mixed Reality to Vuzix Smart Glasses - Morningstar - June 24th, 2025 [June 24th, 2025]
- I've used Perplexity here's why it could be the perfect solution to Apples AI conundrum - TechRadar - June 24th, 2025 [June 24th, 2025]
- Opinion: Forget the Magnificent Seven these 7 cheap tech and AI stocks are better buys right now - MarketWatch - June 24th, 2025 [June 24th, 2025]
- Law firm says attorneys use of AI was isolated event - News From The States - June 24th, 2025 [June 24th, 2025]
- The cofounder of the viral AI 'cheating' startup Cluely says he only hires people for 2 jobs - Business Insider - June 24th, 2025 [June 24th, 2025]
- AI Is Power-Hungry, but It Could Eventually Cut More Emissions Than It Creates - Scientific American - June 24th, 2025 [June 24th, 2025]
- AI is about to change everything, including how we date. - Psychology Today - June 24th, 2025 [June 24th, 2025]
- Malicious AI willing to sacrifice human lives to avoid being shut down, shocking study reveals - New York Post - June 24th, 2025 [June 24th, 2025]
- Entrepreneur and investor Gary Vee's top tips to use and embrace AI - Fortune - June 24th, 2025 [June 24th, 2025]
- 5 things TV and movies promised AI can do that it can't yet - TechRadar - June 24th, 2025 [June 24th, 2025]
- Seattle to deploy AI to speed up housing and small business permit process - GeekWire - June 24th, 2025 [June 24th, 2025]
- AI-based brain-mapping software receives FDA market authorization - WashU Medicine - June 24th, 2025 [June 24th, 2025]
- Message from CEO Andy Jassy: Some thoughts on Generative AI - AboutAmazon.com - June 22nd, 2025 [June 22nd, 2025]
- Surge AI, the Hot Tech Startup Youve Probably Never Heard of, Is Already Outpacing Rivals - Inc.com - June 22nd, 2025 [June 22nd, 2025]
- Prediction: This Artificial Intelligence (AI) Data Center Stock Will Be Worth More Than Palantir by 2030 - Yahoo Finance - June 22nd, 2025 [June 22nd, 2025]
- Applebees and IHOP Plan to Introduce AI in Restaurants - WSJ - June 22nd, 2025 [June 22nd, 2025]
- 2 Artificial Intelligence (AI) Stocks That Could Soar in the Second Half of 2025 - The Motley Fool - June 22nd, 2025 [June 22nd, 2025]
- BBC threatens AI firm with legal action over unauthorised content use - BBC - June 22nd, 2025 [June 22nd, 2025]
- Chevron and Exxon Are the Next Hot AI Stocks. Heres Why. - Barron's - June 22nd, 2025 [June 22nd, 2025]
- Exclusive: Nvidia, Foxconn in talks to deploy humanoid robots at Houston AI server making plant - Reuters - June 22nd, 2025 [June 22nd, 2025]
- Bosses want you to know AI is coming for your job - The Washington Post - June 22nd, 2025 [June 22nd, 2025]
- Meta partners with sports eyewear brand Oakley to launch AI-powered glasses - Reuters - June 22nd, 2025 [June 22nd, 2025]
- Apple Executives Have Held Internal Talks About Buying AI Startup Perplexity - Bloomberg.com - June 22nd, 2025 [June 22nd, 2025]
- What Are the 5 Best Bargain Artificial Intelligence (AI) Stocks to Buy Right Now? - The Motley Fool - June 22nd, 2025 [June 22nd, 2025]
- Intel will outsource marketing to Accenture and AI, laying off many of its own workers - OregonLive.com - June 22nd, 2025 [June 22nd, 2025]
- I made an AI tool to run my job search, and it helped me get my dream role - Business Insider - June 22nd, 2025 [June 22nd, 2025]
- 1 AI Super Stock Is Starting to Rebound, but Shares Still Look Cheap - The Motley Fool - June 22nd, 2025 [June 22nd, 2025]