Lead Data Engineer
New Today
Posting Type
Hybrid
Job Overview
Join our team as we innovate the future of data platform architecture, enabling massive scaling and data processing for ML and Gen AI projects. You'll be at the forefront of processing vast unstructured data, building high-throughput APIs, and supporting distributed compute frameworks for seamless model deployment. Ready to dive into the heart of cutting-edge tech? Job Description and Requirements
Your role in action
Assist in building our next-generation data platform tooling and services to support the ingestion and processing of large volumes of documents at scale. Contribute to improving and extending our Spark-based distributed data processing pipeline.
Help enhance our Rust-based distributed query engine used to request large amounts of document data.
Create tools to automate and optimize processes across disciplines.
Participate in the on-call schedule to investigate and fix production issues related to our data processing pipeline or query engine.
Participate in code reviews for projects written by your team.
Focus on quality through comprehensive unit and integration testing.
Your Skills
6+ years of software development experience in writing performant, commercial-grade systems and applications
Experience with monitoring and troubleshooting production environments Proficiency in programming languages used in high volume data processing and applications like: Java or Scala and Python
Experience building data pipelines with distributed compute frameworks like Hadoop. Spark, or Dask
Knowledge of Linux/Unix systems, Docker/Kubernetes and CI/CD including scripting in Python or other scripting languages to automate build and deployment processes
Knowledge of professional software engineering practices & software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
Leverages best practices and past experiences to mentor and improve the productivity of the team
We'd particularly love it if you have: Deep experience building and debugging distributed data pipelines Experience with columnar databases and storage formats like Delta Lake and Parquet Experience deploying and managing services on Kubernetes Experience building with Rust
Relativity is committed to competitive, fair, and equitable compensation practices.
This position is eligible for total compensation which includes a competitive base salary, an annual performance bonus, and long-term incentives.
The expected salary range for this role is between following values:
$150,000 and $224,000 The final offered salary will be based on several factors, including but not limited to the candidate's depth of experience, skill set, qualifications, and internal pay equity. Hiring at the top end of the range would not be typical, to allow for future meaningful salary growth in this position.
- Location:
- Virginia Beach, VA, United States
- Category:
- Computer And Mathematical Occupations