Software Engineer, Infrastructure
New Yesterday
About The Role
All teams are deeply collaborative, work on mission-critical services, and are responsible for building distributed, scalable infrastructure to bring OpenAI's technology to the world through products like ChatGPT and the OpenAI API. You'll work closely with stakeholders to understand infrastructure, data and compute needs, setting the technical strategy that supports cutting-edge research and product development. This is a critical role for someone who is passionate about solving complex engineering problems at scale, ensuring their performance, scalability and reliability.
Team Focus Areas
- Distributed Systems: Owning and building important, highly scalable, available, performant, and reliable distributed systems (and their building blocks) to power the entire stack at OpenAI
- Systems Engineering: Work across layers of the stack-debugging system bottlenecks, evolving core infrastructure, and solving novel problems in performance and scalability.
- Reliability Engineering: Build scalable, fault-tolerant systems and lead efforts around service health, incident response, and resilience.
- Observability: Design and maintain observability tooling (metrics, logs, tracing) to give teams visibility into production systems at scale.
- Developer Productivity: Create tools, environments, and workflows that help engineers ship high-quality software faster and more safely.
- Cloud Infrastructure: Own the cloud-native infrastructure (compute, networking, storage) that underpins all services and research workloads.
In this role you will:
- Design, build, and maintain reliable and performant systems used across engineering. Work with your team to define technical strategy, architecture, and long-term goals.
- Collaborate with other engineers, product managers, and researchers to build infrastructure that meets evolving needs.
- Improve internal tooling, automation, and developer experience.
- Contribute to incident response, postmortems, and the development of best practices around system reliability and scalability.
You might thrive in this role if you:
- Strong software engineering skills with experience in Python, Go, Rust, or similar languages.
- Experience designing, operating, or scaling distributed systems or developer infrastructure.
- Comfort working in Linux environments, and with tools like Kubernetes, Terraform, CI/CD pipelines, and modern observability stacks.
- Ability to navigate complex systems and a willingness to dig deep when debugging tricky issues.
- Excellent communication and collaboration skills, especially in cross-functional settings.
Qualifications:
- 4+ years of relevant industry experience, with 2+ years leading large scale, complex projects or teams as an engineer or tech lead
- A passion for distributed systems at scale with a focus on reliability, scalability, security, and continuous improvement.
- Excellent communication skills, with ability to build consensus among stakeholders both internally and externally.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.
OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement
For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
- Location:
- San Francisco
We found some similar jobs based on your search
-
New Today
Software Engineer, Infrastructure (All Levels)
-
San Francisco, CA, United States
- IT & Technology
Software Engineer, Infrastructure (All Levels) Join to apply for the Software Engineer, Infrastructure (All Levels) role at Jobright.ai Software Engineer, Infrastructure (All Levels) 1 day ago Be among the first 25 applicants Join to apply for t...
More Details -
-
New Today
Software Engineer - Infrastructure
-
San Francisco
Infrastructure Software Engineer Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast. We're trusted by leading AI-driven innovators like Writer, Abridge, Bland, Patreon, Descript, Retool, an...
More Details -
-
New Today
Software Engineer, Cloud Infrastructure
-
San Francisco, CA, United States
- Computer And Mathematical Occupations
About Us: Notion helps you build beautiful tools for your life's work. In today's world of endless apps and tabs, Notion provides one place for teams to get everything done, seamlessly connecting docs, notes, projects, calendar, and email-with AI bu...
More Details -
-
New Today
Senior Flight Software Engineer, Development Infrastructure
-
San Francisco
Senior Flight Software Engineer, Development Infrastructure Welcome to Planet. We believe in using space to help life on Earth. Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation deliv...
More Details -
-
New Today
Backend Software Engineer, Test Infrastructure San Francisco, California, United States
-
San Francisco
Backend Software Engineer, Test Infrastructure Postman is the world's leading API platform, used by more than 40 million developers and 500,000 organizations, including 98% of the Fortune 500. Postman is helping developers and professionals across t...
More Details -
-
New Yesterday
Software Engineer, Infrastructure Applications
-
San Francisco
Software Engineer, Infrastructure Applications San Francisco Bay Area, California, United States Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create,...
More Details -