Ethical Innovations: Embracing Ethics in Technology

Ethical Innovations: Embracing Ethics in Technology

Menu

US Orders Anthropic to Kill Fable 5 and Mythos 5

Anthropic has disabled access to two of its artificial intelligence models, Fable 5 and Mythos 5, for all customers worldwide after receiving an export control directive from the United States Commerce Department. The order, signed by Commerce Secretary Howard Lutnick with assistance from the Bureau of Industry and Security, was received at 5:21 p.m. Eastern Time on Friday, June 12, 2026. It instructed the company to prevent any foreign national from accessing the models, whether inside or outside the United States, including foreign national employees of Anthropic itself. Because of the way the models are deployed, Anthropic said it had no choice but to take both systems offline for every user.

The government cited national security concerns as the basis for the action. According to Anthropic's understanding, the government became aware of a technique that could bypass, or "jailbreak," certain safeguards built into Fable 5. Anthropic reviewed a demonstration of the technique and found it was used to identify a small number of previously known, minor software vulnerabilities. The company stated that other publicly available models, including OpenAI's GPT-5.5, are capable of discovering the same vulnerabilities without requiring any bypass method. The government provided only verbal evidence of the issue and did not spell out the specific national security threat in detail in its letter.

Anthropic said it disagrees with the government's decision, arguing that a narrow, non-universal jailbreak should not be grounds for recalling a commercial model deployed to hundreds of millions of people. The company warned that applying this standard across the industry would essentially halt all new model deployments by every major AI provider. Anthropic stated that it believes the government should have the ability to block unsafe deployments, but only as part of a statutory process that is transparent, fair, clear, and grounded in technical facts, and that this action did not meet those principles. The company apologized to its customers for the disruption, called the situation a misunderstanding, and said it is working to restore access as soon as possible.

Fable 5 and Mythos 5 had been released just days earlier. Fable 5 was made publicly available with strict safety restrictions, particularly around cybersecurity and biology topics. Mythos 5 was shared with a limited group of trusted partners, including cybersecurity and infrastructure companies, without those same safeguards. Both models originated from Claude Mythos Preview, an advanced model for security research that was initially available only through a program called Project Glasswing. Anthropic said before launching Fable 5, it worked with the U.S. government, the UK AI Safety Institute, multiple private third-party organizations, and internal teams to test the model's safeguards for thousands of hours. The company adopted what it calls a defense in depth strategy, designed to make any jailbreaks narrow in scope or expensive to produce, combined with monitoring to detect and shut down successful attacks. This approach also led to a 30-day customer data retention policy for Fable, which the company said allows it to research and mitigate jailbreaks.

Access to all other Claude models, including the latest Claude Opus 4.8, was not affected by the directive. On the Claude Platform, requests to Fable 5 will return an error, and existing Fable 5 sessions will end with an error. New sessions will run on the user's selected default model or Opus 4.8.

The directive marks the latest escalation in a strained relationship between Anthropic and the federal government. Earlier in the year, negotiations between Anthropic and the Department of Defense collapsed after the company sought exemptions barring the military from using its tools in combat without human oversight and prohibiting AI-powered surveillance. The Pentagon subsequently labeled Anthropic a supply chain risk, a designation historically reserved for foreign adversaries, requiring defense contractors to certify they will not use Anthropic's Claude models in military work. Anthropic sued the Trump administration to reverse that designation and won a preliminary injunction in California, though a parallel case in Washington, D.C. is ongoing. In February, President Donald Trump ordered all federal agencies to stop using Anthropic's models after the company refused contract terms allowing its AI to be used for any lawful purpose.

Despite the conflict, signs of reconciliation had appeared in recent weeks. The National Security Agency has been using Mythos to find vulnerabilities in popular software, including Microsoft products. President Trump also signed a memorandum directing federal agencies to strengthen cyber defenses and establish voluntary early access to advanced AI models. Trump told reporters he is open to the idea of the U.S. government owning stakes in major AI labs and plans to discuss it with company executives.

The export control decision could affect Anthropic's plans for a public listing. The company confidentially filed for an IPO earlier in June, and a recent funding round valued it at $965 billion. Several prominent Trump administration technology advisors have publicly criticized Anthropic and its leadership. Former AI and crypto czar David Sacks has called the company "woke and leftist" and accused it of regulatory capture. Pentagon undersecretary of defense for research and engineering Emil Michael has also been openly critical. Meanwhile, AI policy expert Dean Ball, who briefly served in the Trump administration, called the export directive "cartoonish" and questioned the logic of blocking foreign nationals from using American AI models while supporting the export of advanced AI chips to China. Cybersecurity researcher Peter Girnus noted that by publicly describing Mythos as too dangerous to release widely, Anthropic may have given the government a legal basis to treat it as a controlled technology. Gary Marcus, a frequent critic of the AI industry, said the directive could push Chinese-born AI researchers working at American labs to return to China and make investors question whether U.S. AI companies are safe investments.

Original Sources: 1, 2, 3, 4, 5, 6, 7, 8 (anthropic) (openai)

Real Value Analysis

This article provides limited practical value to a normal person. It reports on a government order requiring Anthropic to disable two of its AI models, Fable 5 and Mythos 5, for all users worldwide. While the story is significant for the AI industry and for Anthropic's customers, it does not offer actionable steps, tools, or choices that a reader can use. There are no instructions to follow, no resources mentioned, and no clear actions a person can take based on this information. The article simply recounts what happened, why it happened, and how Anthropic responded. A person who reads this cannot apply it to their own life in any direct way, unless they happen to be a current user of those specific models, in which case the only action is to wait for Anthropic to resolve the situation.

The educational depth is moderate but uneven. The article explains several useful concepts, such as what a jailbreak means in the context of AI safety, what a defense in depth strategy involves, and why customer data retention can help companies detect and respond to security threats. It also provides context about the relationship between AI companies and government agencies, including the role of export control directives and the concept of supply chain risk designations. However, the article does not explain how export control directives work in general, what statutory processes exist for challenging government orders, or how a normal person could learn more about the regulatory framework governing AI. The numbers mentioned, such as the 30 day data retention period and the reference to hundreds of millions of users, are presented without deeper context for what they mean in the broader landscape of AI deployment and regulation. The article teaches some things but leaves significant gaps for a reader who wants to understand the full picture.

Personal relevance is narrow for most people. The story directly affects users of Fable 5 and Mythos 5, who have lost access to tools they may have been using for work or research. For those individuals, the disruption is real and meaningful. For everyone else, the relevance is indirect. The article touches on broader themes like AI safety, government oversight, and the balance between innovation and security, which are important topics. But it does not explain how likely an average person is to be affected by similar government actions in the future, what to do if they are concerned about the safety of AI tools they use, or how to evaluate whether a particular AI product is trustworthy. The relevance is limited to people who follow AI policy or who personally use Anthropic's products.

The public service function is weak. The article does not offer warnings, safety guidance, or emergency information that helps the public act responsibly. It does not tell readers how to evaluate AI tools for safety, what to look for when choosing an AI provider, or how to engage with policy decisions about AI regulation. The article appears to exist mainly to report on a significant event in the AI industry rather than to help people make better decisions or stay safe. It informs but does not guide.

There is no practical advice in the article. No steps or tips are given that an ordinary reader can follow. The guidance is entirely absent. The article does not even suggest general actions a person might take when thinking about AI safety or evaluating the tools they use.

The long term impact is minimal for most readers. The information does not help a person plan ahead, improve habits, or make stronger choices. It focuses on a single government order with no lasting benefit for the average reader. However, for people who work in AI or who rely on AI tools professionally, the story highlights the risk that government intervention could disrupt access to important technology. This is a useful lesson, but the article does not develop it into practical guidance.

The emotional and psychological impact is mixed. The article creates feelings of uncertainty and concern, particularly for users of the affected models. The tone is somewhat alarming, with phrases like "forced to disable" and "hundreds of millions of people" affected. However, the article also provides reassurance by explaining that the jailbreak in question appears narrow and that the vulnerabilities found were minor and already known. It does not cause harm, but it does not offer much clarity either. A reader may finish the article feeling informed but uncertain about what it means for them.

There is no clickbait or ad driven language. The article is straightforward and does not use exaggerated or dramatic claims. It does not sensationalize the event or rely on shock to maintain attention. The tone is factual and measured, though the subject matter itself is inherently dramatic.

The article misses several chances to teach or guide. It presents a significant event in AI regulation but fails to provide context, examples, or ways for the reader to learn more. It could have explained how government export controls work compared to other forms of regulation, what factors influence whether a government orders a product recalled, or what patterns exist in how countries respond to AI safety concerns. A reader who wants to learn more could compare independent news sources on AI regulation, look for patterns in how different governments respond to AI safety issues, or consider general civic practices like understanding how regulatory agencies function and knowing what questions to ask elected officials about technology policy.

To add real value, a reader can take several practical steps based on general reasoning and universal critical thinking principles. When evaluating any AI tool, remember that no system is perfectly secure, so it is wise to avoid relying on a single tool for critical work. If you or someone you know uses AI tools professionally, ask the provider specific questions about what safety testing has been done, what data retention policies exist, and what would happen if the tool were suddenly discontinued. When considering whether to adopt a new AI product, look for verification from multiple independent sources and consider whether the provider has a track record of transparency about limitations and risks. If you want to stay informed about AI policy, practical habits like following multiple news sources, understanding basic regulatory concepts, and knowing how to contact your representatives often matter more than reading individual articles. When reading about government actions affecting technology you use, look for background information from established educational or governmental organizations rather than relying on single news articles that may simplify or omit important details. When making decisions about which AI tools to rely on, diversify your toolkit so that the loss of any single product does not cripple your work, and maintain local backups of important data and outputs. These steps are realistic, widely applicable, and grounded in common sense. They help a reader assess AI tools, choose more reliable providers, and prepare for disruptions that could affect their work or personal projects, even though the original article offered none of this guidance.

Bias analysis

The text uses the phrase "substantially more effective" to describe Fable's safeguards compared to other models. This is a strong claim with no numbers or proof shown in the text. It helps Anthropic look good by making their safety work sound much better than anyone else's. The word "substantially" pushes the reader to believe the difference is big without showing any real evidence.

The text says "perfect jailbreak resistance likely is not possible for any model provider today." This softens the reader's expectations before the company defends itself. It makes the problem sound like everyone's fault, not just Anthropic's. This helps Anthropic by sharing the blame with all other companies.

The phrase "defense in depth strategy" makes Anthropic's approach sound smart and careful. It uses a military-style word to make the company seem serious about safety. This helps Anthropic look responsible even though the government still found a problem. The word "strategy" makes it sound like they had a strong plan all along.

The text says the jailbreak found was "narrow in scope" and produced "entirely benign responses." These soft words make the problem sound very small and harmless. This helps Anthropic by making the government's worry seem like an overreaction. The word "benign" means harmless, which pushes the reader to think there was no real danger.

The text says the capability shown is "widely available from other models, including OpenAI's GPT-5.5." This brings another company into the problem to make Anthropic look less alone. It helps Anthropic by saying everyone can do this, so picking on them is not fair. This also makes the government's order seem too harsh by comparison.

The phrase "used daily by cybersecurity defenders" makes the jailbreak sound like a normal, good thing that helpful people do every day. This helps Anthropic by making the government's worry seem wrong. It changes the meaning of the jailbreak from a danger to a tool that good people use.

The text says the government gave only "verbal evidence" of the jailbreak. This makes the government's case sound weak and not well supported. It helps Anthropic by making the government seem like they did not have strong proof. The word "verbal" suggests it was just talk, not solid facts.

The phrase "transparent, fair, clear, and grounded in technical facts" is used to describe what the government should have done. This makes Anthropic look like they care about good rules and fairness. It helps the company by making the government seem like they did not follow those good words. This is a way of saying the government was not fair without saying it directly.

The text says "Anthropic apologized to its customers for the disruption, called the situation a misunderstanding." The word "misunderstanding" makes it sound like nobody did anything wrong. It was just a mix-up. This helps Anthropic by making the whole problem seem smaller and less serious than the government said it was.

The text uses passive voice in "the government became aware of a technique." This hides who found the technique or how they found it. It makes the discovery sound like it just happened on its own. This helps Anthropic by not showing if the government did good work or if someone else found the problem first.

The text says "Anthropic has been forced to disable both models for all users." The word "forced" makes Anthropic sound like a victim with no choice. It helps the company by making them look like they are just following orders and not responsible for any problems. This pushes the reader to feel sorry for Anthropic.

The phrase "hundreds of millions of people" is used to show how many users are affected by the shutdown. This big number helps Anthropic by making the government's order seem like it hurts a lot of people. It pushes the reader to think the government's action is too big and causes too much harm.

The text says "the company warned that applying this standard across the industry would essentially halt all new model deployments." This is a big guess about the future with no proof. It helps Anthropic by making the government's decision sound like it could hurt the whole AI industry. The word "essentially" makes the claim sound certain even though it is just a guess.

The text mentions "the UK AI Safety Institute, multiple private third-party organizations, and internal teams" tested Fable. This long list of groups makes the testing sound very thorough and trustworthy. It helps Anthropic by showing many groups checked their work. But the text does not say what any of those groups actually found, so the list is used to look good without giving real proof.

Emotion Resonance Analysis

The text expresses several emotions that work together to shape how the reader feels about the government's order and Anthropic's response. The strongest emotion is frustration, which runs throughout the company's statements. This frustration appears in phrases like "Anthropic disagrees with the government's decision" and "this action does not meet those principles." The strength of this frustration is moderate and controlled, not explosive or angry, but clearly present. It serves to make the reader feel that Anthropic believes the government made a mistake, and it positions the company as someone who is being treated unfairly despite doing everything right. The frustration is meant to create sympathy for Anthropic and to make the reader question whether the government's action was really necessary.

A related emotion is defensiveness, which shows up when Anthropic explains all the testing it did before launching Fable 5. The text says the company worked with the US government, the UK AI Safety Institute, multiple private third-party organizations, and internal teams to test the model's safeguards for thousands of hours. This long list of groups and the mention of thousands of hours are meant to show that Anthropic was very careful and responsible. The defensiveness serves to protect the company's reputation by proving that it did not cut corners. It makes the reader feel that Anthropic took safety seriously and that the government's order came despite all that hard work, not because of any laziness or carelessness on the company's part.

There is also a clear sense of worry about the future, which appears when the company warns that applying this standard across the industry would essentially halt all new model deployments for every frontier AI provider. This worry is about what might happen next, not just what has already happened. It serves to make the reader feel that this situation is not just about Anthropic but could affect the entire AI industry and everyone who uses AI tools. The word "essentially" makes the claim sound very certain, which increases the worry. The purpose is to make the reader feel that the government's decision could have big, negative consequences for a lot of people, not just one company.

A softer emotion of regret appears when Anthropic apologizes to its customers for the disruption and calls the situation a misunderstanding. The word "misunderstanding" is important because it makes the whole problem sound like a mistake that could have been avoided, not something anyone did on purpose. The regret serves to make Anthropic look humble and caring, like a company that feels bad about letting people down. It softens the frustration that comes earlier in the text and makes the reader feel that Anthropic is not just angry but also sorry for the trouble it caused. This combination of frustration and regret makes the company seem both strong and sympathetic at the same time.

A subtle emotion of pride appears when Anthropic says its safeguards were substantially more effective than those of any previously deployed model. The word "substantially" makes the claim sound big and important, and the comparison to other models makes Anthropic look like a leader in safety. This pride serves to build trust in the company by showing that it has done something impressive. It makes the reader feel that Anthropic is a good company that cares about doing things right, which makes the government's order seem even more unfair by comparison.

There is also a faint emotion of fear hidden in the text, particularly around the idea of jailbreaks and national security. The text mentions that the government cited national security concerns and that a technique could bypass the safety restrictions built into Fable 5. These words sound scary because they suggest that the AI could be used in dangerous ways. However, Anthropic quickly reduces this fear by saying the vulnerabilities found were minor, simple, and already known, and that other models can find them too. The fear serves a dual purpose: it explains why the government acted, but Anthropic's response to it is meant to make the reader feel that the fear was overblown and that the government reacted too strongly.

The writer uses several tools to increase the emotional impact of the text. One tool is repeating the idea that the jailbreak was narrow and not a big deal. The text says the technique identified "a small number of previously known, minor software vulnerabilities" and that the responses were "entirely benign." These phrases are repeated in different ways throughout the text to make the reader feel that the government's worry was too big for such a small problem. Another tool is comparing Anthropic's situation to other companies. The text says the capability shown is "widely available from other models, including OpenAI's GPT-5.5," which makes the reader wonder why Anthropic was singled out when other companies can do the same thing. This comparison serves to make the government's action seem unfair and inconsistent.

The writer also uses numbers and time to make the story feel real and urgent. The text says Anthropic received the directive at 5:21pm Eastern Time on June 12, 2026, which makes the event feel specific and recent. The mention of "hundreds of millions of people" affected by the shutdown makes the problem feel very big, even though the jailbreak itself was described as small. This contrast between the small problem and the big consequence is a tool to make the reader feel that the government's response was too extreme. The phrase "30 days of customer data retention" is used to show that Anthropic was willing to accept real costs to keep people safe, which makes the company look responsible and makes the government's order seem even more unjustified.

The writer also uses formal, careful language to make the emotions feel more serious and trustworthy. Words like "statutory process," "transparent, fair, clear, and grounded in technical facts" sound official and make Anthropic look like it cares about doing things the right way. This formal language serves to make the reader feel that Anthropic is a serious, responsible company, not one that is just complaining. It gives the frustration and disagreement more weight because they come from a source that sounds smart and careful.

Together, these emotions guide the reader to feel sympathetic toward Anthropic and skeptical of the government's decision. The frustration and defensiveness make the reader feel that Anthropic is being treated unfairly. The worry about the future makes the reader feel that this situation could have bigger consequences. The regret and apology make the reader feel that Anthropic cares about its customers. The pride in the company's safety record makes the reader trust Anthropic more. And the reduced fear around jailbreaks makes the reader feel that the government overreacted. All of these emotions work together to shape a message that is less about the technical details of the jailbreak and more about whether the government's response was fair and proportionate. The writer uses these emotions to persuade the reader that Anthropic is a responsible company that did its best and that the government's order, while perhaps well-intentioned, was not the right way to handle the situation.

Cookie settings
X
This site uses cookies to offer you a better browsing experience.
You can accept them all, or choose the kinds of cookies you are happy to allow.
Privacy settings
Choose which cookies you wish to allow while you browse this website. Please note that some cookies cannot be turned off, because without them the website would not function.
Essential
To prevent spam this site uses Google Recaptcha in its contact forms.

This site may also use cookies for ecommerce and payment systems which are essential for the website to function properly.
Google Services
This site uses cookies from Google to access data such as the pages you visit and your IP address. Google services on this website may include:

- Google Maps
Data Driven
This site may use cookies to record visitor behavior, monitor ad conversions, and create audiences, including from:

- Google Analytics
- Google Ads conversion tracking
- Facebook (Meta Pixel)