1. What is a Data Scientist?
A Data Scientist at OpenAI is a force multiplier for model quality, safety, and product impact. You will define and operationalize north-star metrics for safety and reliability, drive statistical rigor into decisions, and turn ambiguous risks into measurable, monitorable signals. Your work directly influences how advanced models are evaluated, deployed, and improved—especially in domains like Safety Systems, Trustworthy AI, and Pretraining Safety where metrics and analyses drive model and policy interventions.
You will collaborate across research, engineering, product, and policy to ensure our systems are both state-of-the-art and safe-by-design. That includes building evaluation pipelines for LLMs and multimodal models, creating robust dashboards used company-wide, and designing experiments that capture real-world risk, misuse, and user outcomes. Expect work at cutting-edge scale: large datasets, high-throughput evaluation frameworks, and complex socio-technical questions requiring both scientific rigor and practical judgment.
This role is critical because safety and reliability are not afterthoughts at OpenAI—they are central to our mission and products. As a Data Scientist, you turn broad safety and performance goals into operational metrics, production-grade measurement, and evidence-backed decisions that shape model design, alignment methods (e.g., RLHF, adversarial training), and deployment readiness.
