How Google’s AI search overview is losing touch with reality

Opinion | 3 June 2024

Download The Business Of podcast today on your favourite podcast platform.

Using AI to write search results is risky for Google, the internet, and the whole idea of ‘truth’, writes UNSW Sydney’s Toby Walsh

Google has rolled out its latest experimental search feature on Chrome, Firefox and the Google app browser to hundreds of millions of users. “AI Overviews” saves you from clicking on links by using generative AI – the same technology that powers rival product ChatGPT – to provide summaries of the search results. Ask “how to keep bananas fresh for longer,” and it uses AI to generate a useful summary of tips such as storing them in a cool, dark place and away from other fruits like apples.

But ask it a left-field question, and the results can be disastrous or even dangerous. Google is currently scrambling to fix these problems one by one, but it is a PR disaster for the search giant and a challenging game of whack-a-mole.

Google’s AI Overviews may damage the tech giant’s reputation for providing reliable results.jpg — Google’s AI Overviews may damage the tech giant’s reputation for providing reliable results. Photo: Google/The Conversation

AI Overviews helpfully tells you that “Whack-A-Mole is a classic arcade game where players use a mallet to hit moles that pop up at random for points. The game was invented in Japan in 1975 by the amusement manufacturer TOGO and was originally called Mogura Taiji or Mogura Tataki.”

But AI Overviews also tells you that “astronauts have met cats on the moon, played with them, and provided care”. More worryingly, it also recommends “you should eat at least one small rock per day“ as “rocks are a vital source of minerals and vitamins”, and suggests putting glue in pizza topping.

Why is this happening?

One fundamental problem is that generative AI tools don’t know what is true, just what is popular. For example, there aren’t a lot of articles on the web about eating rocks as it is so self-evidently a bad idea.

There is, however, a well-read satirical article from The Onion about eating rocks. And so Google’s AI based its summary on what was popular, not what was true.

Some AI Overview results appear to have mistaken jokes and parodies for factual information.jpg — Some AI Overview results appear to have mistaken jokes and parodies for factual information. Photo: Google/The Conversation

Another problem is that generative AI tools don’t have our values. They’re trained on a large chunk of the web.

And while sophisticated techniques (that go by exotic names such as “reinforcement learning from human feedback“ or RLHF) are used to eliminate the worst, it is unsurprising they reflect some of the biases, conspiracy theories and worse to be found on the web. Indeed, I am always amazed at how polite and well-behaved AI chatbots are, given what they’re trained on.

Is this the future of search?

If this is really the future of search, then we’re in for a bumpy ride. Google is, of course, playing catch-up with OpenAI and Microsoft.

The financial incentives to lead the AI race are immense. Google is therefore being less prudent than in the past in pushing the technology out into users’ hands.

In 2023, Google chief executive Sundar Pichai said: “We’ve been cautious. There are areas where we’ve chosen not to be the first to put a product out. We’ve set up good structures around responsible AI. You will continue to see us take our time.”

That no longer appears to be so true, as Google responds to criticisms that it has become a large and lethargic competitor.

Learn more: develop AI strategies that focus on transformation and generating new value

A risky move

It’s a risky strategy for Google. It risks losing the trust that the public has in Google being the place to find (correct) answers to questions.

But Google also risks undermining its own billion-dollar business model. If we no longer click on links and just read their summary, how does Google continue to make money?

The risks are not restricted to Google. I fear the use of AI might be harmful to society more broadly. Truth is already a somewhat contested and fungible idea. AI untruths are likely to make this worse.

In a decade’s time, we may look back at 2024 as the golden age of the web, when most of it was quality human-generated content, before the bots took over and filled the web with synthetic and increasingly low-quality AI-generated content.

Subscribe to BusinessThink for the latest research, analysis and insights from UNSW Business School

Has AI started breathing its own exhaust fumes?

The second generation of large language models are likely and unintentionally being trained on some of the outputs of the first generation. And lots of AI startups are touting the benefits of training on synthetic, AI-generated data.

However, training on the exhaust fumes of current AI models risks amplifying even small biases and errors. Just as breathing in exhaust fumes is bad for humans, it is bad for AI.

These concerns fit into a much bigger picture. Globally, more than US$400 million (A$600 million) is being invested in AI every day. And governments are only now just waking up to the idea we might need guardrails and regulations to ensure AI is used responsibly, given this torrent of investment.

Pharmaceutical companies aren’t allowed to release drugs that are harmful. Nor are car companies. But so far, tech companies have largely been allowed to do what they like.

Toby Walsh is Chief Scientist of UNSW’s AI Institute UNSW.AI. He is a Fellow of the Australia Academy of Science. His most recent book is Machines Behaving Badly: the morality of AI. Prof Walsh is supported by the ARC by means of an ARC Laureate Fellowship exploring trustworthy AI. A version of this article was originally published on The Conversation.

Regulation Data Digital Risk Technology Innovation Artificial Intelligence

Republish this article

Republish

You are free to republish this article both online and in print. We ask that you follow some simple guidelines.

Please do not edit the piece, ensure that you attribute the author, their institute, and mention that the article was originally published on Business Think.

By copying the HTML below, you will be adhering to all our guidelines.

Press Ctrl-C to copy

<h1>How Google’s AI search overview is losing touch with reality</h1>

<figure><img src="https://assets-us-01.kc-usercontent.com:443/4df0558d-5779-0012-00f2-b3fa06d6c950/b8df064a-557d-4583-8db7-b7cf448683da/Google%27s%20AI%20Overviews%20provide%20search%20result%20summaries%2C%20but%20can%20be%20misleading%201.mp4?w=1320" alt="Google's AI Overviews provide search result summaries, but can be misleading" /><figcaption>Google's AI Overviews provide search result summaries, but can be misleading</figcaption></figure>
 Google has rolled out its <a href="https://blog.google/products/search/generative-ai-google-search-may-2024/" data-new-window="true" target="_blank" rel="noopener noreferrer">latest experimental search feature</a> on Chrome, Firefox and the Google app browser to hundreds of millions of users. “AI Overviews” saves you from clicking on links by using generative AI – the same technology that powers rival product ChatGPT – to provide summaries of the search results. Ask “how to keep bananas fresh for longer,” and it uses AI to generate a useful summary of tips such as storing them in a cool, dark place and away from other fruits like apples.
But ask it a left-field question, and the results can be disastrous or even dangerous. Google is currently scrambling to <a href="https://www.theverge.com/2024/5/24/24164119/google-ai-overview-mistakes-search-race-openai" data-new-window="true" target="_blank" rel="noopener noreferrer">fix these problems one by one</a>, but it is a PR disaster for the search giant and a challenging game of whack-a-mole.
<figure class="figure"><img src="https://assets-us-01.kc-usercontent.com:443/4df0558d-5779-0012-00f2-b3fa06d6c950/5267fb2e-93d9-4753-9f4e-af7d4a31941c/Google%E2%80%99s%20AI%20Overviews%20may%20damage%20the%20tech%20giant%E2%80%99s%20reputation%20for%20providing%20reliable%20results.jpg" class="figure-img" alt="Google’s AI Overviews may damage the tech giant’s reputation for providing reliable results.jpg"><figcaption class="figure-caption">Google’s AI Overviews may damage the tech giant’s reputation for providing reliable results. Photo: Google/The Conversation</figcaption></figure>
AI Overviews helpfully tells you that “Whack-A-Mole is a classic arcade game where players use a mallet to hit moles that pop up at random for points. The game was invented in Japan in 1975 by the amusement manufacturer TOGO and was originally called Mogura Taiji or Mogura Tataki.”
But AI Overviews also tells you that “<a href="https://www.smh.com.au/business/companies/cats-on-the-moon-google-s-ai-tool-is-producing-misleading-responses-that-have-experts-worried-20240525-p5jgmk.html" data-new-window="true" target="_blank" rel="noopener noreferrer">astronauts have met cats on the moon</a>, played with them, and provided care”. More worryingly, it also recommends “you should <a href="https://www.reddit.com/r/google/comments/1cziil6/a_rock_a_day_keeps_the_doctor_away/" data-new-window="true" target="_blank" rel="noopener noreferrer">eat at least one small rock per day</a>“ as “rocks are a vital source of minerals and vitamins”, and suggests putting <a href="https://x.com/petergyang/status/1793480607198323196" data-new-window="true" target="_blank" rel="noopener noreferrer">glue in pizza topping</a>.
<h2>Why is this happening?</h2>
One fundamental problem is that generative AI tools don’t know what is true, just what is popular. For example, there aren’t a lot of articles on the web about eating rocks as it is so self-evidently a bad idea.
There is, however, a well-read <a href="https://www.theonion.com/geologists-recommend-eating-at-least-one-small-rock-per-1846655112" data-new-window="true" target="_blank" rel="noopener noreferrer">satirical article</a> from The Onion about eating rocks. And so Google’s AI based its summary on what was popular, not what was true.
<figure class="figure"><img src="https://assets-us-01.kc-usercontent.com:443/4df0558d-5779-0012-00f2-b3fa06d6c950/c359edc0-9d29-4713-9159-db981312fb58/Some%20AI%20Overview%20results%20appear%20to%20have%20mistaken%20jokes%20and%20parodies%20for%20factual%20information.jpg" class="figure-img" alt="Some AI Overview results appear to have mistaken jokes and parodies for factual information.jpg"><figcaption class="figure-caption">Some AI Overview results appear to have mistaken jokes and parodies for factual information. Photo: Google/The Conversation</figcaption></figure>
Another problem is that generative AI tools don’t have our values. They’re trained on a large chunk of the web.
And while sophisticated techniques (that go by exotic names such as “<a href="https://huggingface.co/blog/rlhf" data-new-window="true" target="_blank" rel="noopener noreferrer">reinforcement learning from human feedback</a>“ or RLHF) are used to eliminate the worst, it is unsurprising they reflect some of the biases, conspiracy theories and worse to be found on the web. Indeed, I am always amazed at how polite and well-behaved AI chatbots are, given what they’re trained on.
<h2>Is this the future of search?</h2>
If this is really the future of search, then we’re in for a bumpy ride. Google is, of course, <a href="https://www.theinformation.com/articles/why-google-and-openai-dont-see-eye-to-eye-on-voice-assistants" data-new-window="true" target="_blank" rel="noopener noreferrer">playing catch-up</a> with OpenAI and Microsoft.
The financial incentives to lead the AI race are <a href="https://www.bloomberg.com/company/press/generative-ai-to-become-a-1-3-trillion-market-by-2032-research-finds/" data-new-window="true" target="_blank" rel="noopener noreferrer">immense</a>. Google is therefore being less prudent than in the past in pushing the technology out into users’ hands.
In 2023, Google chief executive Sundar Pichai <a href="https://www.bloomberg.com/news/articles/2023-06-12/google-ceo-sundar-pichai-urges-caution-amid-ai-hype-cycle" data-new-window="true" target="_blank" rel="noopener noreferrer">said</a>: “We’ve been cautious. There are areas where we’ve chosen not to be the first to put a product out. We’ve set up good structures around responsible AI. You will continue to see us take our time.”
That no longer appears to be so true, as <a href="https://stratechery.com/2023/ai-and-the-big-five/" data-new-window="true" target="_blank" rel="noopener noreferrer">Google responds to criticisms</a> that it has become a large and lethargic competitor.
<a class="o-btn o-btn--primary" href="https://bit.ly/3X1Ij7q" target="_blank">Learn more: develop AI strategies that focus on transformation and generating new value</a>
<h2>A risky move</h2>
It’s a risky strategy for Google. It risks losing the trust that the public has in Google being the place to find (correct) answers to questions.
But Google also risks undermining its own billion-dollar business model. If we no longer click on links and just read their summary, how does Google continue to make money?
The risks are not restricted to Google. I fear the use of AI might be harmful to society more broadly. Truth is already a somewhat contested and fungible idea. AI untruths are likely to make this worse.
In a decade’s time, we may look back at 2024 as the golden age of the web, when most of it was quality human-generated content, before the bots took over and <a href="https://www.theatlantic.com/technology/archive/2024/04/generative-ai-search-llmo/678154/" data-new-window="true" target="_blank" rel="noopener noreferrer">filled the web</a> with synthetic and increasingly low-quality <a href="https://www.theguardian.com/technology/article/2024/may/19/spam-junk-slop-the-latest-wave-of-ai-behind-the-zombie-internet" data-new-window="true" target="_blank" rel="noopener noreferrer">AI-generated content</a>.
<a class="o-btn o-btn--primary" href="https://www.businessthink.unsw.edu.au/subscribe" target="_self">Subscribe to BusinessThink for the latest research, analysis and insights from UNSW Business School</a>
<h2>Has AI started breathing its own exhaust fumes?</h2>
The second generation of large language models are likely and unintentionally being trained on some of the <a href="https://www.newyorker.com/tech/annals-of-technology/chatgpt-is-a-blurry-jpeg-of-the-web" data-new-window="true" target="_blank" rel="noopener noreferrer">outputs of the first generation</a>. And lots of AI startups are touting the benefits of training on <a href="https://www.nytimes.com/2024/04/06/technology/ai-data-tech-companies.html" data-new-window="true" target="_blank" rel="noopener noreferrer">synthetic, AI-generated data</a>.
However, training on the exhaust fumes of current AI models risks amplifying even small biases and errors. Just as breathing in exhaust fumes is bad for humans, it is bad for AI.
These concerns fit into a much bigger picture. Globally, <a href="https://www.idc.com/getdoc.jsp?containerId=prUS50454123" data-new-window="true" target="_blank" rel="noopener noreferrer">more than US$400 million</a> (A$600 million) is being invested in AI every day. And governments are only now just waking up to the idea we might need guardrails and regulations to ensure AI is used responsibly, given this torrent of investment.
Pharmaceutical companies aren’t allowed to release drugs that are harmful. Nor are car companies. But so far, tech companies have largely been allowed to do what they like.
Toby Walsh is Chief Scientist of UNSW’s AI Institute <a href="http://unsw.ai/" data-new-window="true" target="_blank" rel="noopener noreferrer">UNSW.AI</a>. He is a Fellow of the Australia Academy of Science. His most recent book is Machines Behaving Badly: the morality of AI. Prof Walsh is supported by the ARC by means of an ARC Laureate Fellowship exploring trustworthy AI. A version of this article was originally published on The Conversation.

Comments

Opinion

How Google’s AI search overview is losing touch with reality

Download The Business Of podcast today on your favourite podcast platform.

Why is this happening?

Is this the future of search?

A risky move

Has AI started breathing its own exhaust fumes?

Republish

Related

GenAI opportunities and risks for the Australian public service

Beyond chatbots: Navigating AI's industrial transformation

What if your first job no longer exists?

Find an expert

Connect