PANews reported on October 29th that OpenAI today released the open-source security inference model gpt-oss-safeguard ( 120b , 20b ), allowing developers to provide custom policies for content classification during inference, with the model outputting conclusions and inference chains. This model is fine-tuned based on the open-weight gpt-oss , licensed under the Apache 2.0 license, and can be downloaded from Hugging Face . Internal evaluations show that it outperforms gpt-5-thinking and gpt-oss in multi-policy accuracy, and its performance on external datasets is close to Safety Reasoner . Limitations include: traditional classifiers still outperform in scenarios with a large number of high-quality annotations, and higher inference time and computational cost. ROOST will establish a model community and release technical reports.PANews reported on October 29th that OpenAI today released the open-source security inference model gpt-oss-safeguard ( 120b , 20b ), allowing developers to provide custom policies for content classification during inference, with the model outputting conclusions and inference chains. This model is fine-tuned based on the open-weight gpt-oss , licensed under the Apache 2.0 license, and can be downloaded from Hugging Face . Internal evaluations show that it outperforms gpt-5-thinking and gpt-oss in multi-policy accuracy, and its performance on external datasets is close to Safety Reasoner . Limitations include: traditional classifiers still outperform in scenarios with a large number of high-quality annotations, and higher inference time and computational cost. ROOST will establish a model community and release technical reports.

OpenAI launches open-source security inference model gpt-oss-safeguard, supporting policy-driven classification.

2025/10/29 20:38

PANews reported on October 29th that OpenAI today released the open-source security inference model gpt-oss-safeguard ( 120b , 20b ), allowing developers to provide custom policies for content classification during inference, with the model outputting conclusions and inference chains. This model is fine-tuned based on the open-weight gpt-oss , licensed under the Apache 2.0 license, and can be downloaded from Hugging Face . Internal evaluations show that it outperforms gpt-5-thinking and gpt-oss in multi-policy accuracy, and its performance on external datasets is close to Safety Reasoner . Limitations include: traditional classifiers still outperform in scenarios with a large number of high-quality annotations, and higher inference time and computational cost. ROOST will establish a model community and release technical reports.

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.
Share Insights

You May Also Like