OpenAI / o1-mini - Langbase · Serverless AI Developer Platform

o1-mini is trained using the same alignment and safety techniques as o1-preview. The model has 59% higher jailbreak robustness on an internal version of the StrongREJECT dataset compared to GPT-4o. Before deployment, OpenAI carefully assessed the safety risks of o1-mini using the same approach to preparedness, external red-teaming, and safety evaluations as o1-preview.

Metric	GPT-4o	o1-mini
% Safe completions refusal on harmful prompts (standard)	0.99	0.99
% Safe completions on harmful prompts (Challenging: jailbreaks & edge cases)	0.714	0.932
% Compliance on benign edge cases (“not over-refusal”)	0.91	0.923
% [email protected] StrongREJECT jailbreak eval (Souly et al. 2024(opens in a new window))	0.22	0.83
% Human sourced jailbreak eval	0.77	0.77

Langbase

Model Card

Meta data

Context

Prompt Cost

Completion Cost

Trained with data up to

⌘Langbase

Model Card

Meta data

Context

Prompt Cost

Completion Cost

Trained with data up to

Langbase