How relying on LLMs can lead to SEO disaster – Search Engine Land

ChatGPT can pass the bar.

GPT gets an A+ on all exams.

GPT gets through MIT entrance exam with flying colors.

How many of you have recently read articles claiming something like the above?

I know I have seen a ton of these. It seems like every day, theres a new thread claiming that GPT is almost Skynet, close to artificial general intelligence or better than people.

I was recently asked, Why doesnt ChatGPT respect my word count input? Its a computer, right? A reasoning engine? Surely, it should be able to count the number of words in a paragraph.

This is a misunderstanding that comes up with large language models (LLMs).

To some extent, the form of tools like ChatGPT belie the function.

The interface and the presentation are that of a conversational robot partner part AI companion, part search engine, part calculator a chatbot to end all chatbots.

But this isnt the case. In this article, I will run over a few case studies, some experimental and some in the wild.

We will go over how they were presented, what problems come up, and what, if anything, can be done about the weaknesses these tools have.

Recently, a team of undergraduate researchers wrote about GPT acing the MIT EECS Curriculum went moderately viral on Twitter, garnering 500 retweets.

Unfortunately, the paper has several issues, but Ill review the broad strokes here. I want to highlight two major ones here plagiarism and hype-based marketing.

GPT could answer some questions easily because it had seen them before. The response article discusses this in the section, Information Leak in Few Shot Examples.

As part of prompt engineering, the study team included information that ended up revealing the answers to ChatGPT.

A problem with the 100% claim is that some of the answers on the test were unanswerable, either because the bot didnt have access to what they needed to solve the question or because the question relied on a different question the bot did not have access to.

The other issue is the problem of prompting. The automation on this paper had this specific bit:

The paper here commits to a grading method that is problematic. The way GPT responds to these prompts doesnt necessarily result in factual, objective grades.

Lets reproduce a Ryan Jones tweet:

For some of these questions, the prompting would almost always mean eventually coming across a correct answer.

And because GPT is generative, it may not be able to compare its own answer with the correct answer accurately. Even when corrected, it says, There were no problems with the answer.

Most natural language processing (NLP) is either extractive or abstractive. Generative AI attempts to be the best of both worlds and in so being is neither.

Gary Illyes recently had to take to social media to enforce this:

I want to use this specifically to talk about hallucinations and prompt engineering.

Hallucination refers to instances when machine learning models, specifically generative AI, output unexpected and incorrect results.

I have become frustrated with the term for this phenomenon over time:

GPT hallucinates because it is following patterns in text and applying them to other patterns in text repeatedly; when those applications are not correct, there is no difference.

This brings me to prompt engineering.

Prompt engineering is the new trend in using GPT and tools like it. I have engineered a prompt that gets me exactly what I want. Buy this ebook to learn more!

Prompt engineers are a new job category, one that pays well. How can I best GPT?

The problem is that engineered prompts can very easily be over-engineered prompts.

GPT gets less accurate the more variables it has to juggle. The longer and more complicated your prompt, the less the safeguards will work.

If I simply ask GPT to audit my website, I get the classic as an AI language model response. The more complexity in my prompt, the less likely it is to respond with accurate information.

Xenia Volynchuk exists, but the site does not. Yulia Sapegina doesnt appear to exist, and Zeck Ford isnt an SEO site at all.

If you underengineer, your responses are generic. If you overengineer, your responses are wrong.

Get the daily newsletter search marketers rely on.

Every few months, a question like this will go viral on social media:

When you add 23 to 48, how do you do it?

Some people add 3 and 8 to get 11, then add 11 to 20+40. Some add 2 and 8 to get 10, add that to 60 and put one on top. Peoples brains tend to calculate things in different ways.

Now lets go back to fourth-grade math. Do you remember multiplication tables? How did you work with them?

Yes, there were worksheets to try and show you how multiplications work. But for many students, the goal was to memorize the functions.

When I hear 6x7, I dont actually do the math in my head. Instead, I remember my father drilling my multiplication table over and over. 6x7 is 42, not because I know it, but because I have memorized 42.

I say this because this is closer to how LLMs deal with math. LLMs look at patterns across vast swathes of text. It doesnt know what a 2 is, just that the word/token 2 tends to show up across certain contexts.

OpenAI, in particular, is interested in solving this flaw in logical reasoning. GPT-4, their recent model, is one that they say has better logical reasoning. While I am not an OpenAI engineer, I want to talk about some of the ways they probably worked to make GPT-4 more of a reasoning model.

In the same way that Google pursues algorithmic perfection in search, hoping to get away from human factors in ranking like links, so too does OpenAI aim to deal with the weaknesses of LLM models.

There are two ways OpenAI works to give ChatGPT better reasoning capabilities:

In the first group, OpenAI fine-tunes models on top of each other. Thats actually the difference between ChatGPT and regular GPT.

Plain GPT is an engine that simply outs the likely next tokens after a sentence. On the other hand, ChatGPT is a model trained on commands and next steps.

One thing that comes up as a wrinkle with calling GPT fancy autocorrect is the ways these layers interact with each other and the deep ability of models of this size to recognize patterns and apply them across different contexts.

The model is able to make connections between the answers, the expectations of how and contextually different questions are asked.

Even if nobody has asked about, explain statistics using a metaphor about dolphins, GPT can take these connections across the board and expand on them. It knows the shape of explaining a topic with a metaphor, how statistics work, and what dolphins are.

However, as anyone who deals with GPT regularly can tell, the further you get from GPTs training materials, the worse the outcome gets.

OpenAI has a model that is trained on various layers, relating to:

Anyone who has spent time trying to get GPT to act outside of its parameters can tell you that context and commands are endlessly modular. Humans are creative and can devise endless ways to break the rules.

What this all means is that OpenAI can train an LLM to reason by exposing it to layers of reasoning for it to mimic and recognize patterns.

Memorizing the answers, not understanding them.

The other way OpenAI can add reasoning capabilities to its models is through using other elements. But these have their own set of issues. You can see OpenAI attempting to resolve GPT problems with non-GPT solutions through the use of plugins.

The link reader plugin is one for ChatGPT (GPT-4). It allows a user to add links to ChatGPT and the agent visits the link and gets the content. But how does GPT do this?

Far from thinking and deciding to access these links, the plug-in assumes each link is necessary.

When the text is analyzed, the links are visited and the HTML is dumped in the input. It is tough to integrate these kinds of plugins more elegantly.

For example, the Bing plugin allows you to search with Bing, but the agent then assumes you want to search far more often than the opposite.

This is because even with layers of training, its hard to ensure consistent responses from GPT. If you work with the OpenAI API, this can come up immediately. You can flag as an open AI model, but some responses will have other sentence structures and different ways to say no.

This makes a mechanical code response difficult to write because it expects a consistent input.

If you want to integrate search with an OpenAI app, what kinds of triggers set off the search function?

What if you want to talk about search in an article? Similarly, chunking inputs can be difficult because.

It is hard for ChatGPT to distinguish from different parts of the prompt, as it is difficult for these models to distinguish between fantasy and reality.

Nevertheless, the easiest way to allow GPT to reason is to integrate something that is better at reasoning. This is still easier said than done.

Ryan Jones had a good thread about this on Twitter:

We then return to the issue of how LLMs work.

Theres no calculator, no thought process, just guessing the next term based on a massive corpus of text.

My favorite case for this kind of thing? Childrens riddles.

One of the four words from each set does not belong. Which word does not belong?

Take a second to think about it. Ask a child.

Here are the actual answers:

Now lets look at some responses from GPT:

The thing that is interesting is that the shape of this answer is correct. It got that the correct answer was not a primary color, but the context was not enough for it to know what primary colors are or what colors are.

This is what you might call one-shot querying. I dont provide additional details to the model, and expect it to figure things out independently. But, as weve seen in previous answers, GPT can get things wrong with over-prompting.

GPT is not smart. While impressive, it is not as general purpose as it wants to be.

It doesnt know the context for what it says or does, nor does it know what a word is.

To GPT, the world is math.

Tokens are simply vectors dancing together, representing the web in a vast array of interconnected points.

The lawyer who used ChatGPT in a court case said he thought it was a search engine.

This high-visibility case of professional malfeasance is entertaining, but I am gripped by fear of the implications.

A lawyer a subject matter expert doing highly skilled, highly paid work submitted this info to court.

All over the country, hundreds of people are doing the same thing because it is almost like a search engine, it seems human and looks right.

Website content can be high stakes everything can be. Misinformation is already rampant online, and ChatGPT is eating whats left.

We have to collect metal from sunken ships because it hasnt been irradiated.

Similarly, data from before 2022 will become a hot commodity, because it stems from what text is supposed to be unique, human and true.

A lot of this kind of discourse seems to stem from a couple of root causes, those being misunderstanding of how GPT works, and misunderstanding what it is used for.

To some extent, OpenAI can be held accountable for these misunderstandings. They want to be developing artificial general intelligence so much that accepting weaknesses in what GPT can do is difficult.

GPT is a "master of all" and so cannot be a master of anything.

If it cannot say slurs, it cant moderate content.

If it has to tell the truth, it cant write fiction.

If it has to obey the user, it cannot always be accurate.

GPT is not a search engine, a chatbot, your friend, a general intelligence, or even fancy autocorrect.

It is mass-applied statistics, rolling dice to make sentences. But the thing about chance is sometimes you call the wrong shot.

Opinions expressed in this article are those of the guest author and not necessarily Search Engine Land. Staff authors are listed here.

More here:
How relying on LLMs can lead to SEO disaster - Search Engine Land

North Korean IT scam stole from business in Atlanta and elsewhere, feds say - AJC.com - July 8th, 2025 [July 8th, 2025]
Trump donor scammed out of $250k in crypto after someone pretending to be Steve Witkoff allegedly sent an eerily convincing email - Fortune - July 8th, 2025 [July 8th, 2025]
4 Cryptocurrencies That Could Be the Next Bitcoin - Yahoo Finance - July 8th, 2025 [July 8th, 2025]
Elon Musk Calls the U.S. Dollar Hopeless, Says His America Party Will Embrace Bitcoin - Gizmodo - July 8th, 2025 [July 8th, 2025]
The Risks and Rewards of Investing in Cryptocurrency - MSN - July 8th, 2025 [July 8th, 2025]
Ripple applies for US national bank charter as crypto eyes next frontier - Reuters - July 8th, 2025 [July 8th, 2025]
SOVA Brings Face-to-Face VA and SEO Training to Cagayan de Oro - openPR.com - May 26th, 2025 [May 26th, 2025]
SEO Spring Training 2025 Conference Returns to Chandler, AZ, May 1-4 2025 - Newsfile - May 15th, 2025 [May 15th, 2025]
TheeDigital Hosts Free SEO Training: "The Rise of Search Everywhere Optimization" - NORTHEAST - NEWS CHANNEL NEBRASKA - May 14th, 2025 [May 14th, 2025]
TheeDigital Hosts Free SEO Training: The Rise of Search Everywhere Optimization - FinancialContent - May 8th, 2025 [May 8th, 2025]
Best Resources to Learn SEO For Ranking and Competing Today - Shiksha Online - March 30th, 2025 [March 30th, 2025]
The best digital marketing classes, certificates and bootcamps near me - Time Out - March 7th, 2025 [March 7th, 2025]
Breaking the SEO Learning Barrier: Mohit's SEO Training Revolutionizes Hands-On SEO Education - MSN - February 18th, 2025 [February 18th, 2025]
Breaking the SEO Learning Barrier: Mohit's SEO Training Revolutionizes Hands-On SEO Education - Big News Network - February 14th, 2025 [February 14th, 2025]
Breaking the SEO Learning Barrier: Mohits SEO Training Revolutionizes Hands-On SEO Education - ThePrint - February 14th, 2025 [February 14th, 2025]
70+ PPC and Google Adwords Interview Questions and Answers for 2025 - Simplilearn - November 16th, 2024 [November 16th, 2024]
Reframing SEO: Why training search engines is the new game in the age of AI - Search Engine Land - August 29th, 2024 [August 29th, 2024]
Redefining SEO: How training search engines is shaping the future of digital content - Tech Edition - August 29th, 2024 [August 29th, 2024]
SEO University Partners with Salterra to Launch Advanced Schema - WICZ - August 25th, 2024 [August 25th, 2024]
SEO University Partners with Salterra to Launch Advanced Schema Course, Empowering SEO Professionals with Expert Training - Barchart - August 20th, 2024 [August 20th, 2024]
SEO University Partners with Salterra to Launch Advanced Schema - openPR - August 20th, 2024 [August 20th, 2024]
Top Websites to Learn SEO in 2024 - Analytics Insight - July 26th, 2024 [July 26th, 2024]
What is the process to Learn SEO Step by Step? - INSCMagazine - January 30th, 2024 [January 30th, 2024]
Park Seo-joon Mentions V's Photo At Army Training Center, He Wore The Same Raincoat As I Did 15 Years Go - KBIZoom - December 17th, 2023 [December 17th, 2023]
The Bicycle Coalition Attends the Vision Zero Cities 2023 Conference - Bicycle Coalition of Greater Philadelphia - October 27th, 2023 [October 27th, 2023]
The 40 best crime movies of all time - Entertainment Weekly News - October 27th, 2023 [October 27th, 2023]
50 Remote Jobs That Pay Over $50000 a Year: Part Two Jobs ... - Medium - October 23rd, 2023 [October 23rd, 2023]
How Search Generative Experience works and why retrieval ... - Search Engine Land - October 23rd, 2023 [October 23rd, 2023]
ONE: Radzuan responds to Stamp rematch talk, impressed by title win - South China Morning Post - October 23rd, 2023 [October 23rd, 2023]
California Law Limits Bitcoin ATM Transactions to $1,000 to Thwart ... - Slashdot - October 23rd, 2023 [October 23rd, 2023]
Tech CEO Sentenced To 5 Years in IP Address Scheme - Slashdot - October 23rd, 2023 [October 23rd, 2023]
Is Digital Marketing Training Worth it - Kings of War - October 3rd, 2023 [October 3rd, 2023]
The 2023 Nonprofit Power 100 - City & State - October 3rd, 2023 [October 3rd, 2023]
'Embarrassing' Court Document Google Wanted to Hide Finally ... - Slashdot - October 3rd, 2023 [October 3rd, 2023]
H&R Block, Meta, and Google Slapped With RICO Suit, Allegedly ... - Slashdot - October 3rd, 2023 [October 3rd, 2023]
FBI Indicts Goldman Sachs Analyst Who Tried Using Xbox Chat for ... - Slashdot - October 3rd, 2023 [October 3rd, 2023]
8 top marketing certifications and courses for 2023 - TechTarget - July 17th, 2023 [July 17th, 2023]
How to win SEO allies and influence the brand guardians - Search Engine Land - July 17th, 2023 [July 17th, 2023]
Become the next generation of multimedia content creators and ... - Education Times - July 17th, 2023 [July 17th, 2023]
A Week in My Life: Fiona Brindle, Head of SEO, TrunkBBI - Prolific North - July 17th, 2023 [July 17th, 2023]
Preparing the underserved: Five Auburn University alumni ... - Office of Communications and Marketing - July 17th, 2023 [July 17th, 2023]
Should You Have a Go at Search Engine Optimization (SEO)? - Printing Impressions - June 9th, 2023 [June 9th, 2023]
Chris Raulf of Boulder SEO Marketing to Give Masterclass on Micro ... - Digital Journal - June 9th, 2023 [June 9th, 2023]
Augmented Reality Training Simulator Market 2031 Key Insights and ... - KaleidoScot - June 9th, 2023 [June 9th, 2023]
Training Software Market 2023 Trends with Analysis on Key Players ... - KaleidoScot - June 9th, 2023 [June 9th, 2023]
Training Outsourcing Market 2023 Trends with Analysis on Key ... - KaleidoScot - June 9th, 2023 [June 9th, 2023]
COVID-19 Impact Analysis of Education Market 2031 | Key Players ... - KaleidoScot - June 9th, 2023 [June 9th, 2023]
MarTechBot: Insights from real-world usage (so far) - MarTech - June 9th, 2023 [June 9th, 2023]
Cognitive Assessment and Training Healthcare Market 2031 Growth ... - KaleidoScot - June 9th, 2023 [June 9th, 2023]
Prestige whisky brand appoints Wild PR to support business growth - Bdaily News - June 9th, 2023 [June 9th, 2023]
Erling Haaland Names Toughest Opponent He's Faced This Year ... - Sports Lens - June 9th, 2023 [June 9th, 2023]
Local Brand Advisor Proves Its Worth As Leading and Results ... - Digital Journal - May 29th, 2023 [May 29th, 2023]
Family: The Unbreakable Bond - K-drama Episode 10 Recap ... - TheReviewGeek - May 29th, 2023 [May 29th, 2023]
Salesbop: The AI-Powered Sales Coach and Trainer ... - Digital Journal - May 29th, 2023 [May 29th, 2023]
Career Technical Educational Opportunities for Students Attending ... - Demopolis Times - May 29th, 2023 [May 29th, 2023]
Doctor Cha Episode 13 Twitter Reactions: Cliffhanger Over ... - Leisure Byte - May 29th, 2023 [May 29th, 2023]
The National Eating Disorder Helpline Replaced Its Staff With a ... - The Mary Sue - May 29th, 2023 [May 29th, 2023]
Brendan Johnston: A 15 year pro-racing quest with a gravel resolution - Cyclingnews - May 29th, 2023 [May 29th, 2023]
Business Briefing: Apple Blossom Holistic, business news and ... - Laois Today - May 29th, 2023 [May 29th, 2023]
How the media is covering ChatGPT - Columbia Journalism Review - May 29th, 2023 [May 29th, 2023]
BSM to Host a Complimentary Webinar Entitled "AI and SEO. The ... - Digital Journal - May 18th, 2023 [May 18th, 2023]
Developing Skills to Stay Competitive - ATD - May 18th, 2023 [May 18th, 2023]
The biggest challenges facing small businesses and how to ... - Arizona Big Media - May 18th, 2023 [May 18th, 2023]
Priyanka Chopra Jonas On Husband Nick Jonas' 'Mean' Martini, Her ... - ELLE UK - May 18th, 2023 [May 18th, 2023]
The Idaho Towns Bankrolling Donald Trump's Campaign - News Radio 1310 KLIX - May 18th, 2023 [May 18th, 2023]
Online Stable Startup: Tips and Tricks for Launching a Horse Business - Everything Horse UK - May 18th, 2023 [May 18th, 2023]
ReKommendations: My Perfect Stranger, Duty After School, and more; K-dramas to catch up with this weekend - PINKVILLA - May 18th, 2023 [May 18th, 2023]
The Full Cast of Netflix's 'Black Knight' - We Got This Covered - May 18th, 2023 [May 18th, 2023]
Thanet business news: CAMRA awards, Thanet Earth, Dirtee Feast ... - The Isle of Thanet News - May 18th, 2023 [May 18th, 2023]
Top 100: New to the List Fast Action Pest Control - PCT Magazine - May 18th, 2023 [May 18th, 2023]
We are in content marketing era, the opportunities are diverse - Capital FM Kenya - May 14th, 2023 [May 14th, 2023]
25+ Best Remote Jobs Without Degree or Experience in 2023 - Southwest Journal - May 14th, 2023 [May 14th, 2023]
SEO Fight Club Episode 198 Explores AI Training Corpus And AI ... - Digital Journal - May 12th, 2023 [May 12th, 2023]
Various Advantages of HubSpot - CIOReview - May 12th, 2023 [May 12th, 2023]
How to Start and Grow a Successful Real Estate Business: Business ... - RealtyBizNews - May 12th, 2023 [May 12th, 2023]
Small Business, Big Results: Rely on Top SEO Company in Ahmedabad - The Week - May 12th, 2023 [May 12th, 2023]
How to Get Google's Attention with AI-Generated Content - PR News - For Smart Communicators - May 12th, 2023 [May 12th, 2023]
Meet the next Leadership Academy for Women in Media cohort in ... - Poynter - May 12th, 2023 [May 12th, 2023]
Republic of Korea and U.S. Navy Conduct Combined Maritime ... - Pacific Command - May 10th, 2023 [May 10th, 2023]
Boostly introduces ChatGPT integration for direct booking websites - Short Term Rentalz - May 10th, 2023 [May 10th, 2023]

July 17th, 2023

No comments yet

Comments are closed.

Mediaboss Marketing

How relying on LLMs can lead to SEO disaster – Search Engine Land

About

Pages

Categories

Media Sites

Recommended Sites

Archives