Technical Data Scientist/ETL Engineer

New Yesterday

Technical Data Scientist/ETL Engineer at GovCIO in Boise, Idaho, United States Job Description Overview GovCIO is currently hiring foran ETL Engineer (or Data Scientist) to join our ETL Team focusedon designing, developing, and maintaining robust ETL pipelines and data infrastructure within AWS GovCloud environments. This position will be located in Hanover, MD and will be a fully remote position. Responsibilities Design, develop, and maintain robust ETL pipelines and data infrastructure within AWS GovCloud environments. You will work closely with cross-functional teams to ensure that data from a variety of sources is efficiently processed, analyzed, and visualized to monitor and optimize system performance. You will use a combination of AWS native services and open-source software to create scalable and high-performance data pipelines, manage complex data flows, and contribute to operational monitoring efforts across a large AWS environment. Data Pipeline Development: + Design and implement Extract, Transform, Load (ETL) solutions to move, process, and store data from a wide range of sources, including AWS S3, CloudWatch, EventBridge, and other cloud-based services. + Leverage AWS services such as Lambda, Kinesis, and Data Prepper to create data pipelines that span multiple AWS accounts. + Use AWS CloudFormation to deploy and manage data pipeline infrastructure in an Infrastructure-as-Code (IaC) environment. + Deploy and manage CloudWatch alarms and synthetic canaries to ensure system health and proactively detect issues before they impact users. Log Aggregation & Metrics Collection: + Deploy and configure Fluent-bit agents to collect system and application logs from hundreds of critical systems. + Develop custom Lua functions and regex parsers for transforming and routing logs as required. Data Analysis, Visualization & Alerting: + Design and build data visualizations, dashboards, and alerts to monitor and detect anomalous system activity. + Design, implement, and maintain AWS CloudWatch alarms to monitor the health and performance of cloud applications, infrastructure, and services. + Configure notifications for alerts using Amazon SNS, and integrate with communication tools (e.g., Slack, email) to ensure timely awareness of critical issues. + Develop custom visualizations using Vega, create alerts using Query DSL and Painless scripting, and produce reports via OpenSearch SQL. Cluster Management & Data Storage: + Manage and maintain an OpenSearch cluster to support large-scale data ingestion and querying. + Implement ex To view full details and how to apply, please login or create a Job Seeker account
Location:
Boise, ID, United States
Category:
Computer And Mathematical Occupations