• Home
  • Altcoin
  • Bitcoin
  • Blockchain
  • Cryptocurrency
  • DeFi
  • Dogecoin
  • Ethereum
  • Legal Hub
  • More
    • Market & Analysis
    • NFTs
    • XRP
    • Regulations
  • Shop
    • Bitcoin Coin
    • Bitcoin Hat
    • Bitcoin Book
    • Bitcoin Miner
    • Bitcoin Standard
    • Bitcoin Miner Machine
    • Bitcoin Merch
    • Bitcoin Wallet
    • Bitcoin Shirt
No Result
View All Result
Card Bitcoin
Shop
Card Bitcoin
No Result
View All Result
Home NFTs

Anthropic’s Mythos is evolving faster than expected, reports AI safety agency

by n70products
May 14, 2026
in NFTs
0
Anthropic’s Mythos is evolving faster than expected, reports AI safety agency
74
SHARES
1.2k
VIEWS
Share on FacebookShare on Twitter


aiburst-gettyimages-2189115060

Eugene Mymrin/ Moment via Getty Images

Follow ZDNET: Add us as a preferred source on Google.


ZDNET’s key takeaways

  • The latest version of Claude Mythos has already advanced.
  • External researchers found that it achieved several firsts in testing. 
  • AI capabilities may be improving much faster than anticipated. 

Anthropic’s Claude Mythos, which the company maintains is too powerful to be released generally, already appears to have gained new capabilities. 

In a blog post published Wednesday, the UK AI Security Institute (AISI) reported that it had tested a newer version of Mythos, which outperformed both its earlier results and OpenAI’s GPT-5.5 — just a month after Mythos’ initial release. 

Also: Apple, Google, and Microsoft join Anthropic’s Project Glasswing to defend world’s most critical software

“The newer Mythos Preview checkpoint completed both our cyber ranges, solving the range ‘The Last Ones’ in 6 of 10 attempts and the previously unsolved ‘Cooling Tower’ in 3 of 10 attempts,” the blog authors wrote. “This was the first time that a model completed the second of our two cyber ranges.” 

When Anthropic first announced Mythos Preview and Project Glasswing — the cybersecurity testing alliance it formed with rival tech companies and AI labs, to which it gave limited access to Mythos — last month, UK AISI evaluated it, finding that the model “represents a step up over previous frontier models in a landscape where cyber performance was already rapidly improving.” 

That third-party perspective helped balance claims that the hype around Mythos was either solely marketing or, at the other end, signaled a catastrophic shift in AI capabilities. The truth about what the model can do is likely somewhere in the middle. 

Also: How to learn Claude Code for free with Anthropic’s AI courses – one took me just 20 minutes

AISI’s updated test also exemplifies that capability improvements aren’t restricted to individual model releases, but can happen within versions of a single model. 

A rapidly accelerating cyber threat 

AISI noted that AI models are rapidly advancing in their ability to handle cyber tasks, with serious implications for cybersecurity, especially given Mythos’ knack for detecting software vulnerabilities. 

“In February 2026, we internally estimated that the length of cyber tasks AI models could complete had doubled every 4.7 months since late 2024 – already an acceleration from our November 2025 estimate of 8 months,” the blog authors wrote. “Since then, AISI reported on two new models, Claude Mythos Preview and [OpenAI’s] GPT-5.5, which substantially exceeded both doubling rate trends.” 

Also: The third major Linux kernel flaw in two weeks has been found – thanks to AI

The authors added that it’s unclear whether that trend will hold or whether these findings indicate a lasting increase. Mythos and GPT-5.5 could simply be notable breaks from the overall pattern of model evolution. 

Still, AISI clarified that there are several unknowns its testing could not account for. The tests capped tasks at 2.5 million tokens, which let researchers better compare performance results over time. That inherently “understates what frontier models can do,” they wrote. 

“Mythos Preview and GPT-5.5 have large upper-bound error bars due to near-100% success rates on our narrow cyber suite’s longest tasks, even with the 2.5M token limit,” the blog continued. “Our tasks are also not long enough to determine how sharply the models’ reliability would deteriorate at higher task lengths. This places some of the latest models at the limit of what our narrow test suite can measure.”

Also: I put GPT-5.5 through a 10-round test: It scored 93/100, losing points only for exuberance

While this makes the point of model failure hard to measure, it also means model success rates on these tasks would be much higher without the token cap — so high, in fact, that “time horizons become impossible to calculate.” Models with more token access and complex agent infrastructure would be much more capable. 

“A 2.5M token limit is relatively low — in our cyber range experiment we use up to 100M tokens and find performance would likely still improve beyond that budget, especially for recent models, which disproportionately benefit from higher token limits,” the blog added. 





Source link

Tags: AgencyAnthropicsevolvingExpectedfasterMythosreportssafety

Product categories

  • Bitcoin Book
  • Bitcoin Coin
  • Bitcoin Hat
  • Bitcoin Merch
  • Bitcoin Miner
  • Bitcoin Miner Machine
  • Bitcoin Shirt
  • Bitcoin Standard
  • Bitcoin Wallet
  • Products
  • Uncategorized

Recent Posts

  • Anthropic’s Mythos is evolving faster than expected, reports AI safety agency
  • Dogecoin (DOGE) Breaks Away From Pack As Momentum Turns Aggressive
  • Clear Signing: Making Transaction Approvals Safer on Ethereum
  • The Last Setups Were Explosive
  • You can buy Meta smart glasses for up to 20% off right now – Ray-Bans included

Recent Comments

No comments to show.

Archives

  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025

Categories

  • Altcoin
  • Bitcoin
  • Blockchain
  • Cryptocurrency
  • DeFi
  • Dogecoin
  • Ethereum
  • Market & Analysis
  • NFTs
  • Regulations
  • XRP

CATEGORIES

  • Altcoin
  • Bitcoin
  • Blockchain
  • Cryptocurrency
  • DeFi
  • Dogecoin
  • Ethereum
  • Market & Analysis
  • NFTs
  • Regulations
  • XRP

BROWSE BY TAG

Analyst Bank Bitcoin Blog Bottom Breakout BTC Bullish Business Buy Coinbase Crypto deals DOGE Dogecoin ETF ETH Ethereum Foundation Heres Hypergrid Institutional Investors Key Level Major Market Means Move Price Rally Ripple Risk Samsung Shows SOL Solana Stablecoin Support Surge Time Traders Whats XRP zone

© 2024 Card Bitcoin | All Rights Reserved

Feature

U.S. Regulated
 

Beginner Friendly
 

Advanced Tools
 

Free Bitcoin Offer
 

Mobile App
 

Close the CTA

10$
 

5$
 

Varies
 

No Result
View All Result
  • Home
  • Altcoin
  • Bitcoin
  • Blockchain
  • Cryptocurrency
  • DeFi
  • Dogecoin
  • Ethereum
  • Legal Hub
  • More
    • Market & Analysis
    • NFTs
    • XRP
    • Regulations
  • Shop
    • Bitcoin Coin
    • Bitcoin Hat
    • Bitcoin Book
    • Bitcoin Miner
    • Bitcoin Standard
    • Bitcoin Miner Machine
    • Bitcoin Merch
    • Bitcoin Wallet
    • Bitcoin Shirt

© 2024 Card Bitcoin | All Rights Reserved