Google's and Microsoft's chatbots are making up Super Bowl stats

If you needed more evidence that GenAI is prone to making stuff up, Google’s Gemini chatbot, formerly Bard, thinks that the 2024 Super Bowl already happened. It even has the (fictional) statistics to back it up.

Per a Reddit thread, Gemini, powered by Google’s GenAI models of the same name, is answering questions about Super Bowl LVIII as if the game wrapped up yesterday — or weeks before. Like many bookmakers, it seems to favor the Chiefs over the 49ers (sorry, San Francisco fans).

Gemini embellishes pretty creatively, in at least one case giving a player stats breakdown suggesting Kansas Chief quarterback Patrick Mahomes ran 286 yards for two touchdowns and an interception versus Brock Purdy’s 253 running yards and one touchdown.

Image Credits: /r/smellymonster (opens in a new window)

It’s not just Gemini. Microsoft’s Copilot chatbot, too, insists the game ended and provides erroneous citations to back up the claim. But — perhaps reflecting a San Francisco bias! — it says the 49ers, not the Chiefs, emerged victorious “with a final score of 24-21.”

Image Credits: Kyle Wiggers / TechCrunch

Copilot is powered by a GenAI model similar, if not identical, to the model underpinning OpenAI’s ChatGPT (GPT-4). But in my testing, ChatGPT was loath to make the same mistake.

Image Credits: Kyle Wiggers / TechCrunch

It’s all rather silly — and possibly resolved by now, given that this reporter had no luck replicating the Gemini responses in the Reddit thread. (I’d be shocked if Microsoft wasn’t working on a fix as well.) But it also illustrates the major limitations of today’s GenAI — and the dangers of placing too much trust in it.

GenAI models have no real intelligence. Fed an enormous number of examples usually sourced from the public web, AI models learn how likely data (e.g. text) is to occur based on patterns, including the context of any surrounding data.

This probability-based approach works remarkably well at scale. But while the range of words and their probabilities are likely to result in text that makes sense, it’s far from certain. LLMs can generate something that’s grammatically correct but nonsensical, for instance — like the claim about the Golden Gate. Or they can spout mistruths, propagating inaccuracies in their training data.

It’s not malicious on the LLMs’ part. They don’t have malice, and the concepts of true and false are meaningless to them. They’ve simply learned to associate certain words or phrases with certain concepts, even if those associations aren’t accurate.

Hence Gemini’s and Copilot’s Super Bowl falsehoods.

Google and Microsoft, like most GenAI vendors, readily acknowledge that their GenAI apps aren’t perfect and are, in fact, prone to making mistakes. But these acknowledgements come in the form of small print I’d argue could easily be missed.

Super Bowl disinformation certainly isn’t the most harmful example of GenAI going off the rails. That distinction probably lies with endorsing torture, reinforcing ethnic and racial stereotypes or writing convincingly about conspiracy theories. It is, however, a useful reminder to double-check statements from GenAI bots. There’s a decent chance they’re not true.

Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

Previous News

Hidden tricks to customize the new design

Next News

How Businesses Can Reduce CAC By 50%

Editorial Team

StartupNews.fyi is a leading global startup and technology media platform known for its end-to-end coverage of the startup ecosystem across India and key international markets. Launched with the vision of becoming a single gateway for founders, investors, and ecosystem enablers, StartupNews.fyi has grown steadily over the years by publishing tens of thousands of verified news stories, insights, and ecosystem updates, reaching millions of startup enthusiasts every month through its digital platforms and communities.

More like this

Google’s and Microsoft’s chatbots are making up Super Bowl stats

Disclaimer

Popular

Make Babbel Your Long-Term Language Learning Plan

Investors Flock to Solid-State Transformers Amid Grid Shift

Wisconsin Reverses Decision to Ban VPNs in Age Verification Bill

Indian IT captains hold firm amid raging agentic AI storm

Google’s Gemini 3.1 Pro is mostly great

More Like this

Asustor Lockerstor 2 Gen2+ (AS6702T v2) Review: All the Speed, and All the Apps

AMD Zen 6 and Intel Nova Lake CPUs reportedly arriving late, delayed to CES 2027 — next-gen chips rocked by industry turmoil

America’s Peace Corps Announces ‘Tech Corps’ Volunteers to Help Bring AI to Foreign Countries

I don’t think anyone at Samsung bothered opening the Fold 7

Winhance is such an easy way to optimize your Windows 11 PC

Critical thinking, not reliance on AI, will protect against de-skilling: Elsevier’s Jan Herzhoff

Google’s and Microsoft’s chatbots are making up Super Bowl stats

Disclaimer

More like this

Asustor Lockerstor 2 Gen2+ (AS6702T v2) Review: All the...

AMD Zen 6 and Intel Nova Lake CPUs reportedly...

America’s Peace Corps Announces ‘Tech Corps’ Volunteers to Help...

Popular

Block title

tvOS 26.4 adds new ‘Continuous Audio Connection’ on Apple TV

Tata Group, OpenAI To Build AI Infrastructure In India

WooCommerce May Gain Sidekick-Type AI Through Extensions

NVT Quality Lifestyle forays into Sky Villas and Large-Format Integrated Townships

How Swashaa Grew By Running A Fast Replenishment Loop

Anker’s Weekend Sale Includes Big Savings on Newest Prime Chargers

Google DeepMind wants to know if chatbots are just virtue signaling

Startup Events

Trending News

Asustor Lockerstor 2 Gen2+ (AS6702T v2) Review: All the Speed, and All the Apps

AMD Zen 6 and Intel Nova Lake CPUs reportedly arriving late, delayed to CES 2027 — next-gen chips rocked by industry turmoil

America’s Peace Corps Announces ‘Tech Corps’ Volunteers to Help Bring AI to Foreign Countries

I don’t think anyone at Samsung bothered opening the Fold 7

Winhance is such an easy way to optimize your Windows 11 PC

About

Partnership

Contact us