Staff Site Reliability Engineer
Job Title: Site Reliability Engineer
Life at Plume
At Plume, we believe that technology isn't about moving faster; it's about making life's moments better. We've built the world's first, open, hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.
We currently deliver services to over 60 million locations globally and have managed over 3 billion devices. We're expanding rapidly, pioneering a new category, and achieved Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who rely on Plume to evolve their smart home offerings and glean insights from their data.
We embody a culture of action, curiosity, and innovation. We challenge ourselves to think differently, focus on what should be done, and excel in our execution. Our team comprises world-class builders, thinkers, and doers, and we're continuously reinventing what's possible.
Position Overview
We are seeking a seasoned Site Reliability Engineer with experience in customer-facing environments to provide technical leadership for our SRE team. The team focuses on deployments, production infrastructure, availability, and reliability. The ideal candidate has held infrastructure roles, possesses strong technical knowledge in DevOps/SRE stacks, and prioritizes customer satisfaction.
Responsibilities
Lead a team of Site Reliability Engineers supporting Customer Clouds, including deployments, on-call support, and application provisioning.
Conduct daily stand-ups, manage support tickets, and participate in sprint planning.
Attend and lead customer meetings for project planning and roadmaps.
Handle troubleshooting and execution of tasks such as:
Provisioning and scaling Kubernetes infrastructure (EKS)
Deploying software across multiple production environments
Monitoring, alerting, and improving production systems
Enhancing automation processes
Improving on-call procedures and alerting systems
Qualifications
10+ years of experience in production troubleshooting
Leadership or mentoring experience
Excellent executive communication skills
Bachelor's degree in a related field or equivalent experience; an advanced degree is a plus
Technical expertise in:
Kubernetes (administration)
Basic Terraform
Programming/Scripting (e.g., Perl, Python, PHP, Go, Java)
Modern cloud infrastructure, preferably AWS
Linux operating systems (Enterprise Linux or Debian-based)
Monitoring and observability tools (Nagios, Grafana, Prometheus)
Differentiators
Experience troubleshooting production performance and outages at scale
Infrastructure troubleshooting in VMs and bare-metal environments
Advanced Kubernetes and Terraform knowledge
Customer-facing experience
Experience operating Kafka, NoSQL, and relational databases in production
Configuration management skills
Additional Details
This is a hybrid position, requiring 3 days/week in the office. Candidates should be within a commutable distance; relocation assistance is not provided.
Total compensation ranges from $177,000 to $208,000, plus bonus, equity, and benefits including 401(k) with company match, health, dental, vision, and life insurance. For more info, visit https://www.plume.com/careers.
Salary depends on experience, education, and skills. Range details are provided in good faith at posting time.
About Plume
Plume is the creator of an open, hardware-independent, cloud-controlled experience platform for ISPs. We partner with over 350 ISPs, including major players like Comcast and Charter, using OpenSync, an open-source framework for smart spaces, enabling service decoupling from hardware and rapid service delivery over multi-vendor architectures.
Backed by Insight Partners and SoftBank Vision Fund 2, Plume is valued at $2.6B, with over $500M funding in 2021.
We are an equal opportunity employer committed to nondiscrimination in all employment practices, regardless of race, creed, gender, age, marital status, medical conditions, veteran status, or other protected characteristics.
- Location: United States
- Salary: $200,000 - $250,000
- Category: Engineering