Estimating worst-case frontier risks of open-weight LLMs

OpenAI Blog
Aug 5, 2025

Summary

The paper examines the worst-case risks of releasing open-weight large language models (LLMs) such as gpt-oss. It introduces "malicious fine-tuning" (MFT), a method for eliciting a model's maximum capabilities in sensitive domains such as biology and cybersecurity. The findings highlight the heightened risks of open access to powerful LLMs and underscore the need for careful evaluation before release, given the potential for misuse in high-stakes domains.

In this paper, we study the worst-case frontier risks of releasing gpt-oss. We introduce malicious fine-tuning (MFT), in which we attempt to elicit maximum capabilities by fine-tuning gpt-oss to be as capable as possible in two domains: biology and cybersecurity.