Meta Is Building an AI to Fact-Check WikipediaAll 6.5 Million Articles – Singularity Hub
Most people older than 30 probably remember doing research with good old-fashioned encyclopedias. Youd pull a heavy volume from the shelf, check the index for your topic of interest, then flip to the appropriate page and start reading. It wasnt as easy as typing a few words into the Google search bar, but on the plus side, you knew that the information you found in the pages of the Britannica or the World Book was accurate and true.
Not so with internet research today. The overwhelming multitude of sources was confusing enough, but add the proliferation of misinformation and its a wonder any of us believe a word we read online.
Wikipedia is a case in point. As of early 2020, the sites English version was averaging about 255 million page views per day, making it the eighth-most-visited website on the internet. As of last month, it had moved up to spot number seven, and the English version currently has over 6.5 million articles.
But as high-traffic as this go-to information source may be, its accuracy leaves something to be desired; the page about the sites own reliability states, The online encyclopedia does not consider itself to be reliable as a source and discourages readers from using it in academic or research settings.
Metaof the former Facebookwants to change this. In a blog post published last month, the companys employees describe how AI could help make Wikipedia more accurate.
Though tens of thousands of people participate in editing the site, the facts they add arent necessarily correct; even when citations are present, theyre not always accurate nor even relevant.
Meta is developing a machine learning model that scans these citations and cross-references their content to Wikipedia articles to verify that not only the topics line up, but specific figures cited are accurate.
This isnt just a matter of picking out numbers and making sure they match; Metas AI will need to understand the content of cited sources (though understand is a misnomer, as complexity theory researcher Melanie Mitchell would tell you, because AI is still in the narrow phase, meaning its a tool for highly sophisticated pattern recognition, while understanding is a word used for human cognition, which is still a very different thing).
Metas model will understand content not by comparing text strings and making sure they contain the same words, but by comparing mathematical representations of blocks of text, which it arrives at using natural language understanding (NLU) techniques.
What we have done is to build an index of all these web pages by chunking them into passages and providing an accurate representation for each passage, Fabio Petroni, Metas Fundamental AI Research tech lead manager, told Digital Trends. That is not representing word-by-word the passage, but the meaning of the passage. That means that two chunks of text with similar meanings will be represented in a very close position in the resulting n-dimensional space where all these passages are stored.
The AI is being trained on a set of four million Wikipedia citations, and besides picking out faulty citations on the site, its creators would like it to eventually be able to suggest accurate sources to take their place, pulling from a massive index of data thats continuously updating.
One big issue left to work out is working in a grading system for sources reliability. A paper from a scientific journal, for example, would receive a higher grade than a blog post. The amount of content online is so vast and varied that you can find sources to support just about any claim, but parsing the misinformation from the disinformation (the former means incorrect, while the latter means deliberately deceiving), and the peer-reviewed from the non-peer-reviewed, the fact-checked from the hastily-slapped-together, is no small taskbut a very important one when it comes to trust.
Meta has open-sourced its model, and those who are curious can see a demo of the verification tool. Metas blog post noted that the company isnt partnering with Wikimedia on this project, and that its still in the research phase and not currently being used to update content on Wikipedia.
If you imagine a not-too-distant future where everything you read on Wikipedia is accurate and reliable, wouldnt that make doing any sort of research a bit too easy? Theres something valuable about checking and comparing various sources ourselves, is there not? It was a big a leap to go from paging through heavy books to typing a few words into a search engine and hitting Enter; do we really want Wikipedia to move from a research jumping-off point to a gets-the-last-word source?
In any case, Metas AI research team will continue working toward a tool to improve the online encyclopedia. I think we were driven by curiosity at the end of the day, Petroni said. We wanted to see what was the limit of this technology. We were absolutely not sure if [this AI] could do anything meaningful in this context. No one had ever tried to do something similar.
Image Credit: Gerd Altmann from Pixabay
Read more here:
Meta Is Building an AI to Fact-Check WikipediaAll 6.5 Million Articles - Singularity Hub
- Celebrating Wikipedia 25 in Tashkent: A New Generation of Uzbek Wikimedians Takes the Lead - Wikimedia.org - April 17th, 2026 [April 17th, 2026]
- Cebuano Wikipedia: From Ghost Town to Growth Engine - Wikimedia.org - April 17th, 2026 [April 17th, 2026]
- Celebrating 25 Years of Wikipedia at Manipal University Jaipur: Learning, Innovation, and Community - Wikimedia.org - April 17th, 2026 [April 17th, 2026]
- Wikipedia founder says trust is broken here's how to rebuild it - axios.com - April 7th, 2026 [April 7th, 2026]
- Women in the spotlight: stories that are shaping Wikipedia - Wikimedia.org - April 7th, 2026 [April 7th, 2026]
- Writing against the status quo: What can a Suriname edit-a-thon add to the Wikipedia public sphere? - Diggit Magazine - April 7th, 2026 [April 7th, 2026]
- Musician Plays Magnetic Reel-to-Reel Tape in Sync With Wikipedia Articles for Its 25th Anniversary - Laughing Squid - April 7th, 2026 [April 7th, 2026]
- Meet the group correcting gender bias on Wikipedia and beyond - Thenational Scot - April 7th, 2026 [April 7th, 2026]
- Coming Soon To Wikipedia Archaeology In Aotearoa - Scoop - New Zealand News - April 7th, 2026 [April 7th, 2026]
- An AI Agent Was Banned From Creating Wikipedia Articles, Then Wrote Angry Blogs About Being Banned - 404 Media - April 5th, 2026 [April 5th, 2026]
- Edit War Breaks Out on Chillis Wikipedia Page Over Trump Donations - meidasnews.com - April 5th, 2026 [April 5th, 2026]
- Wikipedia Editors Tried and Tried to Work With AI Content, Eventually Realized It Was Total Trash and Banned It Entirely - Futurism - April 5th, 2026 [April 5th, 2026]
- Wikidata graphs for data visualisation of endangered horse breeds in Wikipedia - Wikimedia.org - April 5th, 2026 [April 5th, 2026]
- How Wikipedia of cyber helps SAP make sense of threat data - Computer Weekly - April 5th, 2026 [April 5th, 2026]
- Closing the Gender Gap on Wikipedia: Art + Feminism Edit-a-thon - WashU Libraries - April 5th, 2026 [April 5th, 2026]
- Wikipedia Shares Its Stance on AI-Written Articles - newsbreaks.infotoday.com - April 5th, 2026 [April 5th, 2026]
- AI Agent Runs the Im Being Censored Playbook After Getting Banned from Wikipedia - Gizmodo - April 5th, 2026 [April 5th, 2026]
- AI Agent Gets Banned From Wikipedia Then Accuses Human Editors of Uncivil Behavior - tech.yahoo.com - April 5th, 2026 [April 5th, 2026]
- Colm O'Regan: 'Browsing Wikipedia is like taking a bus, missing your stop, and waking up in a strange town' - Irish Examiner - April 5th, 2026 [April 5th, 2026]
- AI bot gets banned from Wikipedia, then writes angry blogs protesting about it - indiatoday.in - April 5th, 2026 [April 5th, 2026]
- Wikipedia Banned an AI Bot from Writing Articles. It Then Wrote an Angry Rant Blog - Republic World - April 5th, 2026 [April 5th, 2026]
- Wikipedia bans AI bot 'Tom': It responded with furious blog posts that went viral; heres what it said - bhaskarenglish.in - April 5th, 2026 [April 5th, 2026]
- AI Bot Protests Wikipedia Ban With Viral Angry Blogs; Heres What It Said - Mashable India - April 5th, 2026 [April 5th, 2026]
- Wikipedia Bans AI Agent for Spamming Articles AI Responds With Furious Blog Rants - International Business Times UK - April 5th, 2026 [April 5th, 2026]
- Arabic-language Wikipedia filled with terrorist propaganda, bias report - The Times of Israel - March 26th, 2026 [March 26th, 2026]
- I was surprised how upset some people got: A conversation with the creator of TomWikiAssist, the bot that edited Wikipedia - Nieman Lab - March 26th, 2026 [March 26th, 2026]
- Arabic Wikipedia Riddled With Terror Propaganda and Bias, New Investigation Shows - Algemeiner.com - March 26th, 2026 [March 26th, 2026]
- Wikipedia mulling whether to rename entry on Hamas beheading babies hoax - JNS - March 26th, 2026 [March 26th, 2026]
- GZERO WORLD WITH IAN BREMMER: In Wikipedia We Trust? - KPBS - March 26th, 2026 [March 26th, 2026]
- AI Memory Project Transforms Personal Photos Into a Wikipedia-Style Archive - Tech Times - March 26th, 2026 [March 26th, 2026]
- This guy used AI to document his grandmother's life on a personal Wikipedia and now you can, too - Boing Boing - March 26th, 2026 [March 26th, 2026]
- Wikipedia Bans AI-Generated Text With Two Exceptions What Every Editor Must Know Now - International Business Times UK - March 26th, 2026 [March 26th, 2026]
- Twenty-Five Years of Free Knowledge: Wiki Palestine Celebrates a Quarter Century of Wikipedia - Wikimedia.org - March 26th, 2026 [March 26th, 2026]
- Who is pushing the propaganda tag against Dhurandar on Wikipedia? How an anti-Hindu Wikipedia Editor booked in Manipur for inciting violence cited... - March 26th, 2026 [March 26th, 2026]
- World Jewish Congress report finds extensive, systemic bias on Arabic Wikipedia - JNS.org - JNS - March 26th, 2026 [March 26th, 2026]
- Quiz: Name these 10 national team managers from Wikipedia - Planet Football - March 26th, 2026 [March 26th, 2026]
- The Unsung Heroes of Kit Culture: Appreciating Wikipedia's Pixel Kit Artists - Footy Headlines - March 24th, 2026 [March 24th, 2026]
- Wikipedia has banned AI-generated text, with two exceptions - How-To Geek - March 24th, 2026 [March 24th, 2026]
- 39 Unusual Places With Their Own Wikipedia Pages That Showcase The Worlds Weirdest Sites - AOL.com - March 24th, 2026 [March 24th, 2026]
- PR firm linked to Gates-backed AGRA edited Wikipedia to remove criticism - U.S. Right to Know - March 24th, 2026 [March 24th, 2026]
- In Wikipedia We Trust? - WLIW - March 24th, 2026 [March 24th, 2026]
- Palestinians trained to fill Wikipedia with anti-Israel propaganda - The Telegraph - March 15th, 2026 [March 15th, 2026]
- SimWikiMap for MSFS 2024 brings Wikipedia to your cockpit tablet - MSFS Addons - March 15th, 2026 [March 15th, 2026]
- The Editors by Stephen Harrison: Wikipedia, internet communities, and the battle for truth in the digital age - New America - March 11th, 2026 [March 11th, 2026]
- Wikipedia Forced to Lock Down Edits Over JavaScript That Could Delete Pages - PCMag - March 9th, 2026 [March 9th, 2026]
- At 25, Wikipedia faces a double threat: the rise of AI and the decline of local media - CBC - March 9th, 2026 [March 9th, 2026]
- Oh no, Wikipedia has been turned into a gacha card game and I can already feel my time slipping away from me - Rock Paper Shotgun - March 9th, 2026 [March 9th, 2026]
- Please send help: We can't stop opening packs in Wikigacha, a browser-based card game where you collect Wikipedia articles like 'List of Red Hot Chili... - March 9th, 2026 [March 9th, 2026]
- Wikipedia hit by self-propagating JavaScript worm that vandalized pages - BleepingComputer - March 9th, 2026 [March 9th, 2026]
- Wikipedia's been turned into a Pokemon TCG-like gacha game where you collect its pages, because the random article button wasn't distracting enough... - March 9th, 2026 [March 9th, 2026]
- At 25, Wikipedia confronts twin challenges: the surge of AI and the downturn of local journalism. - stl.news - March 9th, 2026 [March 9th, 2026]
- Wikipedia administrator account compromised and temporarily put into read-only mode - GIGAZINE - March 9th, 2026 [March 9th, 2026]
- Zara Larsson Begs Wikipedia Editors to 'Cut It Out' and Stop Changing Her Photo to Unflattering Snap - People.com - February 20th, 2026 [February 20th, 2026]
- Knowledge is human: Co-founder Jimmy Wales on why Wikipedia still matters in an AI world - The Indian Express - February 20th, 2026 [February 20th, 2026]
- Zara Larsson begs fans to stop changing her Wikipedia photo - The Independent - February 20th, 2026 [February 20th, 2026]
- How to Use Jwikithe Wikipedia for all Things Epstein Files - inc.com - February 20th, 2026 [February 20th, 2026]
- Zara Larsson is at to war with Wikipedia over her photo - - Happy Mag - February 20th, 2026 [February 20th, 2026]
- Hamas-Linked NGO Trains Gazans to Influence Wikipedia Narratives on Israel - Combat Antisemitism Movement - February 20th, 2026 [February 20th, 2026]
- Zara Larsson Is Begging You to Stop Changing Her Wikipedia Photo - Exclaim! - February 20th, 2026 [February 20th, 2026]
- Meet wonderkid Tom Edozie who doesn't have Wikipedia and unknown to Wolves boss - The Sun - February 20th, 2026 [February 20th, 2026]
- IIT Guwahati Unveils Scalable Method To Detect Wikipedia Name Errors At AI Summit 2026 - BW Education - February 20th, 2026 [February 20th, 2026]
- Org. trains Gazans to edit Israel, Palestine on Wikipedia - The Jerusalem Post - February 18th, 2026 [February 18th, 2026]
- Theres a whole show about Wikipedia, and its delightful and hopeful - San Francisco Chronicle - February 18th, 2026 [February 18th, 2026]
- Wikipedia is having a renaissance in the age of AI - vox.com - February 18th, 2026 [February 18th, 2026]
- Wikipedia: The Non-Profit Exception on the Web in the AI Era | 2026 - nssmag.com - February 18th, 2026 [February 18th, 2026]
- German Wikipedia bans AI-generated content while other language editions take a softer approach - the-decoder.com - February 18th, 2026 [February 18th, 2026]
- #MCGlobalExclusive | ~ "AI doesn't understand what is real and what's not real.. At Wikipedia we believe knowledge is human." "There is... - February 18th, 2026 [February 18th, 2026]
- Wikipedia Founder Jimmy Wales On Building Systems That Trust People - Forbes - February 18th, 2026 [February 18th, 2026]
- Not sure whats going to happen, says Wikipedia co-founder Jimmy Wales as traffic dips - Moneycontrol - February 18th, 2026 [February 18th, 2026]
- Only 20% of Wikipedia Biographies Are About Women: This Effort Wants to Change That - ColoradoBoulevard.net - February 11th, 2026 [February 11th, 2026]
- Epstein Files: Al Seckel Boasts of Hacking Wikipedia to Scrub Epsteins Mugshot and Sex Offender Label Epstein bragged that his team bypassed... - February 11th, 2026 [February 11th, 2026]
- Building Teachers Capacity to Read and Use Wikipedia in the Classroom - Wikimedia.org - February 11th, 2026 [February 11th, 2026]
- What AI Can Learn from YouTube and Wikipedia - Muse by Clio - February 7th, 2026 [February 7th, 2026]
- When Wikipedia Takes the Stage: A Slam to Celebrate 25 Years of Free Knowledge - Wikimedia.org - February 7th, 2026 [February 7th, 2026]
- Clearance watch suits season 1 episode 6 Hotsell Suits season 6 Wikipedia - Through The Fence Baseball - February 7th, 2026 [February 7th, 2026]
- Celebrating Wikipedia at 25: Reflections from the January 2026 EduWiki Knowledge Showcase - Wikimedia.org - February 7th, 2026 [February 7th, 2026]
- Extreme anti-Zionists taking over Wikipedia, former US official says - JNS.org - February 7th, 2026 [February 7th, 2026]
- Celebrating Wikipedia 25 by Gathering and Editing Sasaknese Wikipedia and Wiktionary - Wikimedia.org - February 7th, 2026 [February 7th, 2026]
- Wikipedia's list of inventors killed by their own inventions keeps growing - Boing Boing - February 7th, 2026 [February 7th, 2026]
- Wikipedia's "List of lists of lists" contains itself - Boing Boing - February 7th, 2026 [February 7th, 2026]