ETL Engineer / Data Scientist

New Today

Overview GovCIO is seeking a talented ETL Engineer or Data Scientist to join our dynamic ETL Team. In this role, you will focus on designing, developing, and maintaining efficient ETL pipelines and data infrastructure within AWS GovCloud environments. While the position is based in Hanover, MD, it offers the flexibility of being fully remote. Responsibilities Your main duties will include: Data Pipeline Development: Design and implement Extract, Transform, Load (ETL) solutions to efficiently process and store data from diverse sources such as AWS S3, CloudWatch, and EventBridge. Utilizing AWS Services: Leverage services like Lambda, Kinesis, and Data Prepper to create scalable data pipelines. Infrastructure Management: Use AWS CloudFormation to deploy and manage data pipeline infrastructure in an Infrastructure-as-Code (IaC) environment. Monitoring & Alerts: Set up CloudWatch alarms and synthetic canaries to ensure system health and proactively identify issues. Log Aggregation: Configure Fluent-bit agents for collecting system and application logs, developing custom Lua functions and regex parsers for log transformation. Data Analysis & Visualization: Design visualizations and dashboards to monitor system performance and detect anomalies, implementing AWS CloudWatch alarms and utilizing Amazon SNS for timely notifications. Cluster Management: Manage an OpenSearch cluster, ensuring optimal performance through precise field mappings and index state management. Application Performance Monitoring: Use OpenTelemetry for monitoring application performance. Qualifications We require candidates to have the following: HS Diploma with at least 9 years of relevant experience. Top Secret clearance. IAT level II/III certification (e.g., CompTIA Security+(CE)). Experience with Linux and AWS GovCloud technologies. Preferred Skills While not required, the following skills will enhance your application: Strong experience with AWS services, including Lambda, Kinesis, CloudWatch, S3, EventBridge, and CloudFormation. Proficiency in Python for developing data pipelines. Experience with NoSQL databases like OpenSearch, Elasticsearch, or MongoDB. Knowledge of Infrastructure-as-Code (IaC) principles. Excellent written and oral communication skills. Company Overview At GovCIO, we are passionate about transforming government IT by delivering innovative solutions that improve agency operations and citizen services. We are looking for great people to join our team and help us achieve our mission. Employee Perks We offer a range of perks and benefits, including: Employee Assistance Program (EAP) Corporate Discounts Learning & Development platform Training, Education and Certification Assistance Referral Bonus Program Flexible Work Environment Join us at GovCIO to be a part of a culture that values your contributions and invests in your professional growth. We are an Equal Opportunity Employer, and all qualified applicants will receive consideration for employment. Location: Hanover, MD (Remote) Posted Salary Range: USD $140,000.00 - USD $160,000.00 /Yr.
Location:
Washington, DC, United States
Category:
Computer And Mathematical Occupations