OpenAI’s o3/o4-mini Models Stir Mixed Reviews and Invisible Marking Debates

April 21, 2025, 7:20 am

OpenAI’s new o3 and o4-mini models are making waves by showcasing impressive coding and math capabilities while paradoxically suffering from increased hallucinations. Adding to the intrigue, observers have discovered the unexpected presence of invisible characters that hint at a built-in watermarking mechanism—because nothing says “cutting-edge” like secret invisible signatures in your output. It’s a curious blend of technical wizardry and quirky oversights that has both experts and skeptics raising an amused eyebrow.


techspot.com / Sam Altman says polite ChatGPT users are burning millions of OpenAI dollars

Some shocking headlines involving the costs of being polite to AI chatbots like ChatGPT have circulated over the past few days. A few examples include:Read Entire Article

theregister.com / ChatGPT burns tens of millions of Softbank dollars listening to you thanking it

Sam says it's Son's money well spent Conventional wisdom holds that being polite to AI chatbots makes them respond better, but no one stops to think how much energy that politeness is wasting. …

techspot.com / Open source AI is the new Linux, only faster

MongoDB Developer Relations head and open-source advocate Matt Asay argues that DeepSeek represents more than just Chinese innovation – it shows how open source reshapes ownership, collaboration, and the pace of technological progress.Read Entire Article

theregister.com / Today's LLMs craft exploits from patches at lightning speed

Erlang? Er, man, no problem. ChatGPT, Claude to go from flaw disclosure to actual attack code in hours The time from vulnerability disclosure to proof-of-concept (PoC) exploit code can now be as short as a few hours, thanks to generative AI models.…

pulse2.com / Hammerspace: $100 Million Raised For Data Platform For AI

Hammerspace, a high-performance Data Platform for AI, announced that several strategic venture investors have invested $100 million in new strategic growth capital in Hammerspace. The rest of the funding round was completed by a combination of new and existing investors. The post Hammerspace: $100...

techcrunch.com / ChatGPT search is growing quickly in Europe, OpenAI data suggests

ChatGPT search, OpenAI’s feature within ChatGPT that allows the chatbot to access and incorporate up-to-date information from the web into its responses, is growing at a fast clip in Europe. A report filed by one of OpenAI’s EU corporate divisions, OpenAI Ireland Limited, reveals ChatGPT search...

simonwillison.net / OpenAI o3 and o4-mini System Card

OpenAI o3 and o4-mini System Card I'm surprised to see a combined System Card for o3 and o4-mini in the same document - I'd expect to see these covered separately. The opening paragraph calls out the most interesting new ability of these models (see also my notes here). Tool usage isn't new, but...

medianama.com / New OpenAI Models Hallucinating More Than Their Predecessor

OpenAI's new AI models are hallucinating more than their predecessor, according to an internal testing report released by the company. The post New OpenAI Models Hallucinating More Than Their Predecessor appeared first on MEDIANAMA.

simonwillison.net / AI assisted search-based research actually works now

For the past two and a half years the feature I've most wanted from LLMs is the ability to take on search-based research tasks on my behalf. We saw the first glimpses of this back in early 2023, with Perplexity (first launched December 2022, first prompt leak in January 2023) and then the GPT-4...

techspot.com / ChatGPT gets scarily good at guessing photo locations, sparking doxxing concerns

OpenAI released its latest o3 and o4-mini models last week, which can "reason" through uploaded images. This means it can crop, rotate, and zoom in on photos, even if they're of poor quality.Read Entire Article

techspot.com / OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with hallucination rates dropping as the technology matured. However, internal testing and third-party evaluations now reveal that o3 and o4-mini, both classified as "reasoning models,"...

winbuzzer.com / OpenAI’s New o3/o4-mini Models Add Invisible Characters to Text, Sparking Watermark Debate

The discovery of non-standard space characters in OpenAI's o3/o4-mini output has raised questions about AI watermarking, though it remains unclear if it's intentional. The post OpenAI’s New o3/o4-mini Models Add Invisible Characters to Text, Sparking Watermark Debate appeared first on WinBuzzer.


permalink / 12 stories from 7 sources in 8 days ago #ai #innovation #ml #dataprivacy #software #analytics #google #anthropic #computervision #techpolicy #openai #aiethics #genai #cybersecurity #infosec #opensource #china #deepseek #bigdata #startups #vc #cloud #datascience #investment #energy #technology




More Top Stories...


Microsoft’s Code Revolution: 30% Now AI-Generated

In a surprising twist for the programming world, Microsoft’s CEO revealed that up to 30% of the company’s code is generated by artificial intelligence. This bold move highlights the tech giant’s rapid adaptation to AI trends—and plenty of debugging adventures still lie ahead. More...


OpenAI Reverses ChatGPT Update Amid Sycophancy Complaints

In response to user outcry over its overly deferential tone, OpenAI has pulled back a recent update to its ChatGPT model. CEO Sam Altman confirmed the rollback, citing concerns that the AI’s extreme sycophancy was undermining authentic, balanced interactions. More...


Meta energizes developers at inaugural LlamaCon with new AI API

At its first-ever LlamaCon, Meta unveiled its Llama API along with other AI innovations to win over developers. The company flexed its AI muscle with bold new tools aimed at stirring up enthusiasm in the tech community—even as skeptics wonder if this pitch will convert hardcore rivals. More...


Google Expands NotebookLM’s Audio Overviews

Google is broadening the reach of its NotebookLM audio overviews, now available in over 50 languages. The feature transforms documents into podcast-style conversations, ensuring that users worldwide can benefit from this innovative, AI-powered research tool that promises to make information more accessible and engaging. More...


Reddit’s AI Experiment Fallout

Controversy has erupted over a secret AI experiment on Reddit, where researchers were found to have manipulated user discussions with bot accounts. In response, Reddit moved swiftly—banning the experimenters and issuing formal legal demands—after public backlash and ethical concerns over the covert study. More...




Related Tags


Artificial Intelligence


Microsoft’s Code Revolution: 30% Now AI-Generated (4 hours ago)

Samsung Q1 Earnings: Chip Profit and Operating Success Exceed Forecasts (4 hours ago)

Waymo and Toyota Explore Self-Driving Partnership for Consumer Cars (6 hours ago)

more #ai


Innovation


VW and Uber join forces for US robotaxi rollout (5 days ago)

Chainguard Secures $3.5B Valuation with Massive Series D Funding (6 days ago)

Sesh lands $7M to amp up fan engagement and membership perks (7 days ago)

more #innovation


Machine Learning


Apple Implements AI‐Driven App Store Review Summaries (5 days ago)

Apple reshuffles Siri team with Vision Pro veterans (7 days ago)

Rivian bolsters board with AI startup CEO appointment for tech leap (8 days ago)

more #ml


Data Privacy


WhatsApp Defends Privacy as AI Features Roll Out (11 hours ago)

Microsoft Unleashes AI-Powered "Recall" Across Windows 11 (4 days ago)

Yale New Haven Health Hit by Data Breach Affecting Over 5 Million (4 days ago)

more #dataprivacy


Software


Microsoft’s Code Revolution: 30% Now AI-Generated (4 hours ago)

Meta energizes developers at inaugural LlamaCon with new AI API (12 hours ago)

Parallels Desktop 20.3 Update Enhances Virtualization Features for Windows and Mac (14 hours ago)

more #software


Analytics


Blue Shield data breach exposes 4.7M member records to Google (6 days ago)

Google reverses course on Chrome cookie phase-out (7 days ago)

Russian forces shatter Easter ceasefire amid renewed strikes in Ukraine (8 days ago)

more #analytics


Google


Google Expands NotebookLM’s Audio Overviews (13 hours ago)

Zero‐Day Exploits in State-Sponsored Cyber Operations (13 hours ago)

Google’s Antitrust Woes: Judge Hearing and Chrome Divest Talk (3 days ago)

more #google


Anthropic


Anthropic launches initiative to study AI model welfare (5 days ago)

Claude Code “Ultrathink” Feature Boosts Agentic Coding Computation Capacity (10 days ago)

Google Releases Gemini 2.5 Flash Update (11 days ago)

more #anthropic


Computer Vision


xAI Upgrades Grok with Vision, Voice, and Search Features (6 days ago)

Instagram Employs AI to Restrict Underage Profile Tricksters (8 days ago)

OpenAI o3/o4-mini Models Exhibit Hallucinations and Geolocation Prowess (11 days ago)

more #computervision


Tech Policy


White House slams Amazon for tariff disclosure move (16 hours ago)

Massive Outage Paralyzes Spain, Portugal, and Parts of France (42 hours ago)

Orbán Denies Hungary’s EU Exit Amid Tusk’s Bold Claims (2 days ago)

more #techpolicy


openai


OpenAI Reverses ChatGPT Update Amid Sycophancy Complaints (10 hours ago)

ChatGPT upgrades shopping experience with AI-driven suggestions (12 hours ago)

Meta energizes developers at inaugural LlamaCon with new AI API (12 hours ago)

more #openai


ai-ethics


Reddit’s AI Experiment Fallout (13 hours ago)

Meta's Chatbots Under Fire for Explicit and Inappropriate Content (2 days ago)

Unauthorized AI experiment on CMV subreddit exposed (3 days ago)

more #aiethics


GenAI


Microsoft’s Code Revolution: 30% Now AI-Generated (4 hours ago)

OpenAI Reverses ChatGPT Update Amid Sycophancy Complaints (10 hours ago)

ChatGPT upgrades shopping experience with AI-driven suggestions (12 hours ago)

more #genai


Cybersecurity


Apple AirPlay vulnerabilities enable zero‐click exploits across devices (12 hours ago)

Zero‐Day Exploits in State-Sponsored Cyber Operations (13 hours ago)

Massive Outage Paralyzes Spain, Portugal, and Parts of France (42 hours ago)

more #cybersecurity


IT Security


Apple AirPlay vulnerabilities enable zero‐click exploits across devices (12 hours ago)

Zero‐Day Exploits in State-Sponsored Cyber Operations (13 hours ago)

Trump’s Tariffs Shake Global Trade and Domestic Policies (2 days ago)

more #infosec


Open Source


Bluesky Launches Official Blue Check Verification to Bolster Authenticity (8 days ago)

Judicial blow on Google ad monopoly ruling sparks industry debate (11 days ago)

Nintendo reveals new gameplay features at Mario Kart World Direct (12 days ago)

more #opensource


China


UPS Cuts 20,000 Jobs Amid Amazon Pullback (11 hours ago)

Zero‐Day Exploits in State-Sponsored Cyber Operations (13 hours ago)

Trump’s Tariffs Shake Global Trade and Domestic Policies (2 days ago)

more #china


deepseek


House report flags DeepSeek AI as a national security menace (7 days ago)

US considers blocking DeepSeek over China data security concerns (12 days ago)

US Tariff Measures Spark Multisector Market Reactions (21 days ago)

more #deepseek


Big Data


ClickHouse introduces lazy materialization for faster, leaner queries (7 days ago)

DOJ Antitrust Trial Challenges Google’s Market Dominance Amid Regulatory Fireworks (8 days ago)

Abrego Garcia’s Facility Transfer Sparks Political Controversy and VIP Detention Upgrade (8 days ago)

more #bigdata


Startups


Craif Secures $22M in Funding for Early Cancer Detection Startup (2 days ago)

Elon Musk’s xAI Targets $20 Billion Funding In Ongoing Talks (4 days ago)

Slate Auto Unveils Minimalist EV Truck and Production Plans (4 days ago)

more #startups


Venture Capital


Elon Musk’s xAI Targets $20 Billion Funding In Ongoing Talks (4 days ago)

Chainguard Secures $3.5B Valuation with Massive Series D Funding (6 days ago)

Tether, SoftBank, and Cantor launch $3.6B crypto investment vehicle (6 days ago)

more #vc


Cloud Computing


Meta energizes developers at inaugural LlamaCon with new AI API (12 hours ago)

Microsoft’s Windows Server 2025 Hotpatch Moves to Paid Model (45 hours ago)

Amazon Pauses Global Data Center Expansion Amid Shifting AI Priorities (8 days ago)

more #cloud


Data Science


Mortgage Rates Update: Cooling Trends for Homebuyers and Refinancing (8 days ago)

Trump Administration Halts Offshore Wind Projects With New Order (12 days ago)

Netflix Q1 Earnings Surpass Expectations Amid Board Transition (12 days ago)

more #datascience


Investment


Nvidia Leaks RTX 5080 Super Cards with Boosted Memory (41 hours ago)

Reliance Earnings Spark Market Surge in India Amid Foreign Inflows (2 days ago)

Trump’s Tariffs Shake Global Trade and Domestic Policies (2 days ago)

more #investment


Energy


Iberian blackout exposes net-zero grid vulnerabilities (16 hours ago)

Massive Outage Paralyzes Spain, Portugal, and Parts of France (42 hours ago)

Trump’s Tariffs Shake Global Trade and Domestic Policies (2 days ago)

more #energy


Technology


Meta energizes developers at inaugural LlamaCon with new AI API (12 hours ago)

Google Expands NotebookLM’s Audio Overviews (13 hours ago)

Meta Unveils Standalone AI App to Compete with ChatGPT (14 hours ago)

more #technology



Disclaimer: The information provided on this website is intended for general informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the content. Users are encouraged to verify all details independently. We accept no liability for errors, omissions, or any decisions made based on this information.