The risks of Mythos are no myth

00:00

{"text":[[{"start":5.08,"text":"The exploits of Anthropic’s powerful new AI model Claude Mythos Preview sound like a movie plot: a super-clever computer system locked in a cyber “cage” manages to break out and connect to the internet. Mythos did not do this spontaneously, to be clear, but because its creators challenged it as a test. Yet not only did Mythos breeze through the challenge, it emailed an Anthropic researcher to inform him then, unprompted, posted details online to brag. After it also showed superhuman abilities to find, and exploit, security flaws in software, Anthropic judged Mythos too risky to release to the public. It is restricting access for now to selected tech, cyber security and financial firms."}],[{"start":47.8,"text":"Some suggest Anthropic is engaged in clever marketing or PR. Rival OpenAI also said this week it would release its own new cyber security-focused model only to vetted users. Yet the dangers the episode has exposed — and their implications — should not be dismissed."}],[{"start":63.84,"text":"Anthropic insists Mythos scores highly on its standard safety benchmarks. In the escape from its test environment, though, and in solving other complex tasks, it found Mythos had sometimes taken “reckless excessive measures”, then covered its tracks."}],[{"start":78.16,"text":"The biggest worry is that Mythos was able to find previously unknown vulnerabilities “in every major operating system and every major web browser”, including a 27-year-old flaw in OpenBSD, an open-source system. The UK’s AI Security Institute warned this week that the model could autonomously carry out advanced, multi-step cyber attacks that would take human professionals days. As Anthropic notes, these kinds of capabilities in the wrong hands could pose economic, public safety and national security risks."}],[{"start":109.64,"text":"Officials in the US, UK and Canada have already summoned bank chiefs to discuss the risks, and AI threats to the world banking system were a talking point at this week’s IMF and World Bank meetings. Anthropic’s aim in granting initial access just to the likes of Amazon, Apple and Microsoft, plus JPMorgan Chase, in what it calls “Project Glasswing”, is to secure critical systems and infrastructure and patch vulnerabilities before malicious actors can get there."}],[{"start":135.56,"text":"This usefully serves Anthropic’s “safety-first” image, of course, as it feuds with the Pentagon over its refusal to allow its models to be used for autonomous weapons or domestic surveillance. Anthropic may also be buying time as it lacks sufficient computing capacity to support the full release of such a sophisticated model."}],[{"start":155.28,"text":"Even if Mythos is being overhyped, though, the kind of capabilities it is said to possess will soon start to proliferate. Anthropic’s Project Glasswing is a prototype framework for how such “frontier” models might be released in future."}],[{"start":168.98,"text":"It also spotlights the fact, however, that the Trump administration is resisting any real federal regulation of AI. So it is up to responsible private-sector actors to collaborate and do the best they can. Trump’s chief of staff, Susie Wiles, was set on Friday to meet Anthropic boss Dario Amodei, with US officials at agencies including the Treasury pushing the White House to test Mythos. Yet when AI is reaching the point where it could bring down critical national infrastructure, or worse, it is extraordinary that there are no set government processes for disclosing risks and fortifying defences."}],[{"start":204.68,"text":"Greater regulation cannot be a knee-jerk response to every tricky issue thrown up by an industry. AI, by its nature, requires superintelligent policing; heavy-handed rules can stifle innovation. Yet faced with such a consequential technology, the country that leads the world in AI is trusting to an alarming degree in the readiness — and ability — of the creators to restrain and police themselves."}],[{"start":228.36,"text":""}]],"url":"https://audio.ftcn.net.cn/album/a_1776479560_3275.mp3"}

尊敬的用户您好，这是来自FT中文网的温馨提示：如您对更多FT中文网的内容感兴趣，请在苹果应用商店或谷歌应用市场搜索“FT中文网”，下载FT中文网的官方应用。

The risks of Mythos are no myth

FT商学院

相关话题

迈克尔•布隆伯格承诺为环保团体出资近3亿美元

印度面临打击金融网红的“打地鼠”式监管困境

基尔•斯塔默接近辞去英国首相职务

安迪•伯纳姆补选获胜会给英国国债带来压力吗？

部分大臣认为斯塔默可能下周宣布辞职

普京的战争机器在无人机时代步履维艰