Meta's New AI Policy: No More Development on 'Critical Risk' Models
Meta has established a new AI policy to guide its future projects, committing to halt the development of AI models it classifies as a "critical risk." This marks a significant shift in the company's approach, especially as Meta has steadily grown into a prominent player in the AI field.
Unlike many other leading AI companies that focus on proprietary models, Meta has taken an open-source route by releasing its Llama family of models. This strategy lets researchers and developers access and build on its technology, fostering a more collaborative environment.
One of the primary concerns for Meta, and for the broader AI industry, is ensuring that AI systems are developed responsibly and are not misused. To this end, Meta is a signatory of the Frontier AI Safety Commitments, and its new Frontier AI Framework aligns with those safety principles.
The company has pinpointed specific catastrophic outcomes it aims to prevent, and these will guide its AI development decisions. From those outcomes, Meta maps the potential pathways that could lead to disaster, considering how various actors, including governments, might misuse advanced AI technologies. It also describes specific threat scenarios that could lead to significant harm and establishes risk thresholds based on how much frontier AI would increase the likelihood of those scenarios occurring.
By focusing on outcomes, Meta hopes to create a durable set of risk thresholds: while technological capabilities may change, the outcomes it strives to avoid are more stable and enduring. Meta also recognizes that as its understanding of frontier AI evolves, it may need to adjust its identified outcomes and threat scenarios, possibly expanding them to reflect new risks that emerge from advances in AI capabilities.
Through this outcomes-driven framework, Meta intends to prioritize its efforts on the most pressing dangers. For instance, it will concentrate on risks associated with cyberattacks and biological weapons rather than spreading its resources thinly across a wide range of hypothetical threats.
The new policy also clarifies how Meta defines a critical risk AI model and outlines the actions it will take if a model is deemed too dangerous.
Meta defines its risk thresholds based on how effectively a frontier AI model could enable specific threat scenarios. If a model is assessed to have reached the critical-risk threshold and cannot be safely managed, Meta will halt its development and apply its predetermined measures. The company also evaluates high- and moderate-risk models to gauge their potential to escalate to the critical level.
AI Industry Needs More Open Safety Policies
Meta's detailed approach to its AI safety policies contrasts sharply with the often hurried push toward advanced AI technologies seen in much of the industry. Many voices within the sector have raised concerns about the lack of comprehensive safety measures in place.
By establishing clear safety goals and committing to halt the development of critical-risk AI models, Meta positions itself as a leader in responsible AI development. Other companies in the industry, such as Anthropic, have adopted similar safety-first strategies, and hopefully this trend will inspire wider change across the AI landscape.