
In a really cagily-written story from Bloomberg, Anthropic confirmed Tuesday that it has acquired a report that an unauthorized thriller group is accessing Claude Mythos—the mannequin it says is simply too harmful to launch. “We’re investigating a report claiming unauthorized entry to Claude Mythos Preview by way of one in every of our third-party vendor environments,” says an Anthropic spokesperson’s assertion to Bloomberg.
Bloomberg apparently confirmed the obvious breach by taking a look at a reside demo and screenshots despatched over by a member of the group liable for the unauthorized entry.
In understandably obfuscatory language, Bloomberg explains that an nameless supply says they’re a member of an unnamed group that has abused their entry “as a employee at a third-party contractor for Anthropic” and employed “generally used web sleuthing instruments usually employed by cybersecurity researchers,” to realize some type of entry to the mannequin.
However don’t fear, this secret group that apparently has entry to essentially the most feared piece of know-how on this planet is “involved in taking part in round with new fashions, not wreaking havoc with them,” the supply apparently defined to Bloomberg.
The sequence of occasions within the obvious breach seems one thing like this:
- A Discord group exists which makes use of bots to smell round on GitHub for details about unreleased AI fashions
- There was an information breach on the AI coaching startup Mercor
- The group mixed data from the Mercor breach with entry out there to Bloomberg’s supply as a result of they work for an Anthropic contractor
- This allowed the group to guess the web location of Claude Mythos
- The group has been freely messing round with Claude Mythos ever since April 7, the identical day because the announcement of Undertaking Glasswing
So to recap: Anthropic says it has the scariest AI mannequin on this planet, and for what it’s price, a complete lot of highly effective establishments appear to consider it. If we take Anthropic at its phrase, we’re all trusting it to not abuse this energy that it and solely it controls. Nevertheless, some unknown entity has accessed this scary AI mannequin, but when we take them at their phrase, they simply used it for some vibe coding exams they usually swear they’re not doing something evil with it.
