Google's and Microsoft's chatbots are making up Super Bowl stats

If you needed more evidence that GenAI is prone to making stuff up, Google’s Gemini chatbot, formerly Bard, thinks that the 2024 Super Bowl already happened. It even has the (fictional) statistics to back it up.

Per a Reddit thread, Gemini, powered by Google’s GenAI models of the same name, is answering questions about Super Bowl LVIII as if the game wrapped up yesterday — or weeks before. Like many bookmakers, it seems to favor the Chiefs over the 49ers (sorry, San Francisco fans).

Gemini embellishes pretty creatively, in at least one case giving a player stats breakdown suggesting Kansas Chief quarterback Patrick Mahomes ran 286 yards for two touchdowns and an interception versus Brock Purdy’s 253 running yards and one touchdown.

Image Credits: /r/smellymonster (opens in a new window)

It’s not just Gemini. Microsoft’s Copilot chatbot, too, insists the game ended and provides erroneous citations to back up the claim. But — perhaps reflecting a San Francisco bias! — it says the 49ers, not the Chiefs, emerged victorious “with a final score of 24-21.”

Image Credits: Kyle Wiggers / TechCrunch

Copilot is powered by a GenAI model similar, if not identical, to the model underpinning OpenAI’s ChatGPT (GPT-4). But in my testing, ChatGPT was loath to make the same mistake.

Image Credits: Kyle Wiggers / TechCrunch

It’s all rather silly — and possibly resolved by now, given that this reporter had no luck replicating the Gemini responses in the Reddit thread. (I’d be shocked if Microsoft wasn’t working on a fix as well.) But it also illustrates the major limitations of today’s GenAI — and the dangers of placing too much trust in it.

GenAI models have no real intelligence. Fed an enormous number of examples usually sourced from the public web, AI models learn how likely data (e.g. text) is to occur based on patterns, including the context of any surrounding data.

This probability-based approach works remarkably well at scale. But while the range of words and their probabilities are likely to result in text that makes sense, it’s far from certain. LLMs can generate something that’s grammatically correct but nonsensical, for instance — like the claim about the Golden Gate. Or they can spout mistruths, propagating inaccuracies in their training data.

It’s not malicious on the LLMs’ part. They don’t have malice, and the concepts of true and false are meaningless to them. They’ve simply learned to associate certain words or phrases with certain concepts, even if those associations aren’t accurate.

Hence Gemini’s and Copilot’s Super Bowl falsehoods.

Google and Microsoft, like most GenAI vendors, readily acknowledge that their GenAI apps aren’t perfect and are, in fact, prone to making mistakes. But these acknowledgements come in the form of small print I’d argue could easily be missed.

Super Bowl disinformation certainly isn’t the most harmful example of GenAI going off the rails. That distinction probably lies with endorsing torture, reinforcing ethnic and racial stereotypes or writing convincingly about conspiracy theories. It is, however, a useful reminder to double-check statements from GenAI bots. There’s a decent chance they’re not true.

Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

Previous News

Hidden tricks to customize the new design

Next News

How Businesses Can Reduce CAC By 50%

Editorial Team

StartupNews.fyi is a leading global startup and technology media platform known for its end-to-end coverage of the startup ecosystem across India and key international markets. Launched with the vision of becoming a single gateway for founders, investors, and ecosystem enablers, StartupNews.fyi has grown steadily over the years by publishing tens of thousands of verified news stories, insights, and ecosystem updates, reaching millions of startup enthusiasts every month through its digital platforms and communities.

More like this

Google’s and Microsoft’s chatbots are making up Super Bowl stats

Disclaimer

Popular

Iran conflict delays Meta’s 2Africa undersea cable project — cable layer declares force majeure, says it can no longer safely operate in the Persian...

Captain Fresh to raise Rs 120 Cr in equity round led by founder

Lumikai leads $1.5 Mn seed round in NPrep

Chirp Discount Codes and Deals: Save Up to 67%

Windows 11 will let you name your user folder during setup

More Like this

Backblaze Hosts 314 Trillion Digits of Pi Online

iPhone 17e gives Pixel 10a its biggest L — all thanks to Google

iPhone 17 Pro is Now Part of MLB History

How to get the Landfall Operator skin in Black Ops Royale

AI cannot replace trained mind of lawyer, ethical responsibility of judge: Justice Vikram Nath

TSMC rides AI wave to net nearly 70% of global foundry market in 2025

Google’s and Microsoft’s chatbots are making up Super Bowl stats

Disclaimer

More like this

Backblaze Hosts 314 Trillion Digits of Pi Online

iPhone 17e gives Pixel 10a its biggest L —...

iPhone 17 Pro is Now Part of MLB History

Popular

Block title

Meta Acquires Moltbook, the Social Network Just for A.I. Bots

Fake AI Content About the Iran War Is All Over X

Hello Entrepreneurs Unveils Inspiring Women Entrepreneurs Leading Change

How Pokémon Go is giving delivery robots an inch-perfect view of the world

Latest iOS 26.4 Beta Adds New Emoji Characters

The world’s thinnest foldable undercuts the Galaxy Z Fold 7 with some very competitive...

What True Self-Custody Actually Requires

Startup Events

Trending News

Backblaze Hosts 314 Trillion Digits of Pi Online

iPhone 17e gives Pixel 10a its biggest L — all thanks to Google

iPhone 17 Pro is Now Part of MLB History

How to get the Landfall Operator skin in Black Ops Royale

AI cannot replace trained mind of lawyer, ethical responsibility of judge: Justice Vikram Nath

About

Partnership

Contact us