OpenAI’s o3/o4-mini Models Stir Mixed Reviews and Invisible Marking Debates

April 21, 2025, 7:20 am

OpenAI’s new o3 and o4-mini models are making waves by showcasing impressive coding and math capabilities while paradoxically suffering from increased hallucinations. Adding to the intrigue, observers have discovered the unexpected presence of invisible characters that hint at a built-in watermarking mechanism—because nothing says “cutting-edge” like secret invisible signatures in your output. It’s a curious blend of technical wizardry and quirky oversights that has both experts and skeptics raising an amused eyebrow.

techspot.com / Sam Altman says polite ChatGPT users are burning millions of OpenAI dollars

Some shocking headlines involving the costs of being polite to AI chatbots like ChatGPT have circulated over the past few days. A few examples include:

theregister.com / ChatGPT burns tens of millions of Softbank dollars listening to you thanking it

Sam says it's Son's money well spent. Conventional wisdom holds that being polite to AI chatbots makes them respond better, but no one stops to think how much energy that politeness is wasting. …

techspot.com / Open source AI is the new Linux, only faster

MongoDB Developer Relations head and open-source advocate Matt Asay argues that DeepSeek represents more than just Chinese innovation – it shows how open source reshapes ownership, collaboration, and the pace of technological progress.

theregister.com / Today's LLMs craft exploits from patches at lightning speed

Erlang? Er, man, no problem. ChatGPT, Claude to go from flaw disclosure to actual attack code in hours. The time from vulnerability disclosure to proof-of-concept (PoC) exploit code can now be as short as a few hours, thanks to generative AI models. …

pulse2.com / Hammerspace: $100 Million Raised For Data Platform For AI

Hammerspace, a high-performance Data Platform for AI, announced that several strategic venture investors have invested $100 million in new strategic growth capital in Hammerspace. The rest of the funding round was completed by a combination of new and existing investors.

techcrunch.com / ChatGPT search is growing quickly in Europe, OpenAI data suggests

ChatGPT search, OpenAI’s feature within ChatGPT that allows the chatbot to access and incorporate up-to-date information from the web into its responses, is growing at a fast clip in Europe. A report filed by one of OpenAI’s EU corporate divisions, OpenAI Ireland Limited, reveals ChatGPT search...

simonwillison.net / OpenAI o3 and o4-mini System Card

I'm surprised to see a combined System Card for o3 and o4-mini in the same document - I'd expect to see these covered separately. The opening paragraph calls out the most interesting new ability of these models (see also my notes here). Tool usage isn't new, but...

medianama.com / New OpenAI Models Hallucinating More Than Their Predecessor

OpenAI's new AI models are hallucinating more than their predecessor, according to an internal testing report released by the company.

simonwillison.net / AI assisted search-based research actually works now

For the past two and a half years the feature I've most wanted from LLMs is the ability to take on search-based research tasks on my behalf. We saw the first glimpses of this back in early 2023, with Perplexity (first launched December 2022, first prompt leak in January 2023) and then the GPT-4...

techspot.com / ChatGPT gets scarily good at guessing photo locations, sparking doxxing concerns

OpenAI released its latest o3 and o4-mini models last week, which can "reason" through uploaded images. This means they can crop, rotate, and zoom in on photos, even if they're of poor quality.

techspot.com / OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with hallucination rates dropping as the technology matured. However, internal testing and third-party evaluations now reveal that o3 and o4-mini, both classified as "reasoning models,"...

winbuzzer.com / OpenAI’s New o3/o4-mini Models Add Invisible Characters to Text, Sparking Watermark Debate

The discovery of non-standard space characters in OpenAI's o3/o4-mini output has raised questions about AI watermarking, though it remains unclear if it's intentional.
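For readers who want to check a piece of text for this kind of character themselves, the following is a minimal Python sketch. It does not reproduce any method from the linked article. The function name find_unusual_spacing and the sample string are hypothetical, and the two characters planted in the sample (a narrow no-break space and a zero-width space) are assumptions chosen for illustration, since the excerpt does not say which characters were found. The sketch simply flags anything in the Unicode "space separator" (Zs) or "format" (Cf) categories other than the ordinary ASCII space.

```python
import unicodedata


def find_unusual_spacing(text: str) -> list[tuple[int, str, str]]:
    """Return (index, code point, Unicode name) for each non-ASCII space
    or invisible format character found in text."""
    hits = []
    for i, ch in enumerate(text):
        if ch == " ":
            continue  # ordinary ASCII space, not of interest
        category = unicodedata.category(ch)
        # Zs: space separators such as no-break spaces
        # Cf: format characters such as zero-width spaces and joiners
        if category in ("Zs", "Cf"):
            name = unicodedata.name(ch, "UNKNOWN")
            hits.append((i, f"U+{ord(ch):04X}", name))
    return hits


# Hypothetical sample with two planted characters (assumed for illustration).
sample = "Hello\u202fworld and\u200bmore"

for index, codepoint, name in find_unusual_spacing(sample):
    print(index, codepoint, name)
```

A hit from a check like this only shows that an unusual character is present. It says nothing about whether the character was placed deliberately, which is the open question the article raises.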

12 stories from 7 sources
