We have to stop ignoring AI’s hallucination problem – The Verge
Google I/O introduced an AI assistant that can see and hear the world, while OpenAI put its version of a Her-like chatbot into an iPhone. Next week, Microsoft will be hosting Build, where it's sure to have some version of Copilot or Cortana that understands pivot tables. Then, a few weeks after that, Apple will host its own developer conference, and if the buzz is anything to go by, it'll be talking about artificial intelligence, too. (Unclear if Siri will be mentioned.)
AI is here! It's no longer conceptual. It's taking jobs, making a few new ones, and helping millions of students avoid doing their homework. According to most of the major tech companies investing in AI, we appear to be at the start of experiencing one of those rare monumental shifts in technology. Think the Industrial Revolution or the creation of the internet or personal computer. All of Silicon Valley, of Big Tech, is focused on taking large language models and other forms of artificial intelligence and moving them from the laptops of researchers into the phones and computers of average people. Ideally, they will make a lot of money in the process.
But I can't really care about that because Meta AI thinks I have a beard.
I want to be very clear: I am a cis woman and do not have a beard. But if I type "show me a picture of Alex Cranz" into the prompt window, Meta AI inevitably returns images of very pretty dark-haired men with beards. I am only some of those things!
Meta AI isn't the only one to struggle with the minutiae of The Verge's masthead. ChatGPT told me yesterday I don't work at The Verge. Google's Gemini didn't know who I was (fair), but after telling me Nilay Patel was a founder of The Verge, it then apologized and corrected itself, saying he was not. (I assure you he was.)
The AI keeps screwing up because these computers are stupid. Extraordinary in their abilities and astonishing in their dimwittedness. I cannot get excited about the next turn in the AI revolution because that turn is into a place where computers cannot consistently maintain accuracy about even minor things.
I mean, they even screwed up during Google's big AI keynote at I/O. In a commercial for Google's new AI-ified search engine, someone asked how to fix a jammed film camera, and it suggested they "open the back door and gently remove the film." That is the easiest way to destroy any photos you've already taken.
An AI's difficult relationship with the truth is called "hallucinating." In extremely simple terms: these machines are great at discovering patterns of information, but in their attempt to extrapolate and create, they occasionally get it wrong. They effectively "hallucinate" a new reality, and that new reality is often wrong. It's a tricky problem, and every single person working on AI right now is aware of it.
One Google ex-researcher claimed it could be fixed within the next year (though he lamented that outcome), and Microsoft has a tool for some of its users that's supposed to help detect hallucinations. Google's head of Search, Liz Reid, told The Verge it's aware of the challenge, too. "There's a balance between creativity and factuality" with any language model, she told my colleague David Pierce. "We're really going to skew it toward the factuality side."
But notice how Reid said there was a balance? That's because a lot of AI researchers don't actually think hallucinations can be solved. A study out of the National University of Singapore suggested that hallucinations are an inevitable outcome of all large language models. Just as no person is 100 percent right all the time, neither are these computers.
And that's probably why most of the major players in this field (the ones with real resources and financial incentive to make us all embrace AI) think you shouldn't worry about it. During Google's I/O keynote, it added, in tiny gray font, the phrase "check responses for accuracy" to the screen below nearly every new AI tool it showed off: a helpful reminder that its tools can't be trusted, but it also doesn't think it's a problem. ChatGPT operates similarly. In tiny font just below the prompt window, it says, "ChatGPT can make mistakes. Check important info."
That's not a disclaimer you want to see from tools that are supposed to change our whole lives in the very near future! And the people making these tools do not seem to care too much about fixing the problem beyond a small warning.
Sam Altman, the CEO of OpenAI who was briefly ousted for prioritizing profit over safety, went a step further and said anyone who had an issue with AI's accuracy was naive. "If you just do the naive thing and say, 'Never say anything that you're not 100 percent sure about,' you can get them all to do that. But it won't have the magic that people like so much," he told a crowd at Salesforce's Dreamforce conference last year.
This idea that there's a kind of unquantifiable magic sauce in AI that will allow us to forgive its tenuous relationship with reality is brought up a lot by the people eager to hand-wave away accuracy concerns. Google, OpenAI, Microsoft, and plenty of other AI developers and researchers have dismissed hallucination as a small annoyance that should be forgiven because they're on the path to making digital beings that might make our own lives easier.
But apologies to Sam and everyone else financially incentivized to get me excited about AI. I don't come to computers for the inaccurate magic of human consciousness. I come to them because they are very accurate when humans are not. I don't need my computer to be my friend; I need it to get my gender right when I ask and help me not accidentally expose film when fixing a busted camera. Lawyers, I assume, would like it to get the case law right.
I understand where Sam Altman and other AI evangelists are coming from. There is a possibility in some far future to create a real digital consciousness from ones and zeroes. Right now, the development of artificial intelligence is moving at an astounding speed that puts many previous technological revolutions to shame. There is genuine magic at work in Silicon Valley right now.
But the AI thinks I have a beard. It can't consistently figure out the simplest tasks, and yet, it's being foisted upon us with the expectation that we celebrate the incredible mediocrity of the services these AIs provide. While I can certainly marvel at the technological innovations happening, I would like my computers not to sacrifice accuracy just so I have a digital avatar to talk to. That is not a fair exchange; it's only an interesting one.