Data Engineer - Real-Time Systems
New Yesterday
Data Engineer - Real-Time Systems
Division: DATUM, Impac Exploration Services
Location: Remote, Oklahoma City (OK), Houston (TX), San Jose (CA)
Type: Full-Time
We're done with the "load it tonight, analyze it tomorrow" paradigm. At DATUM, decisions happen in milliseconds, not morning reports. We need a data engineer who believes streaming is the default, not the exception—someone who gets genuinely frustrated when people suggest "just run it as a nightly job."
Our data doesn't wait for convenient processing windows. It flows from sensors, cameras, and systems that never sleep. Your infrastructure will catch it, process it, and serve insights before traditional pipelines even know it arrived. If you think Kafka is table stakes and real-time inference is the only kind worth doing, we should talk.
What You'll Build
Streaming pipelines that handle millions of events per second
Infrastructure for real-time ML inference at the edge and core
Systems that treat historical data as streaming replay, not static files
Data architectures that scale
Fault-tolerant pipelines that keep flowing when hardware fails
Your Philosophy
The best data lake is a flowing river
Every millisecond of latency is a missed opportunity
Static ETL is where good data goes to get stale
If it's not real-time, it's not real
Technical Reality
Core streaming stack:
Apache Kafka/Pulsar/Redpanda (or better alternatives you'll introduce)
Flink/Spark Streaming for complex event processing
Time-series databases that can actually keep up (TimescaleDB, InfluxDB, or custom)
Languages: Python/Java/Rust—whatever makes it fast
Container orchestration without cloud vendor lock-in
What you won't use:
Traditional ETL tools that think "streaming" means every 5 minutes
Cloud services that hold your data hostage
Architectures that fall over when AWS hiccups
You're Our Person If
You've built streaming systems that stayed up when it mattered
"Eventually consistent" makes you uncomfortable
Real-time inference excites you more than data warehousing
Especially If
You've built on-premise streaming infrastructure that rivals cloud offerings
You've done inference at the edge before edge was cool
You understand hardware—from NVMe optimization to network tuning
You've migrated from batch to streaming and never looked back
You can make time-series data sing at scale
You believe data gravity is a solvable problem
Why This Matters
Your pipelines will power:
ML models making decisions while drill bits are turning
Computer vision processing streams from harsh environments
Analytics that prevent problems rather than explaining them later
Systems where "historical analysis" means 30 seconds ago
This isn't building dashboards for quarterly reviews. This is infrastructure for decisions that can't wait.
Growth Path
Today: Building streaming pipelines that embarrass traditional ETL Six months: Architecting systems that make cloud vendors nervous. One year: Publishing approaches that redefine industrial data processing.
When Databricks or Confluent tries to hire you, it'll be because you built something better than what they're selling.
Reality Check
You'll fight against decades of batch processing mindset. You'll optimize systems down to microseconds. You'll build infrastructure in places with challenging connectivity. You'll explain why "real-time" isn't just a buzzword.
But you'll also enable genuinely new capabilities. You'll prove that industrial systems can be as responsive as trading platforms. You'll build the foundation for AI that reacts as fast as physics demands.
Ready to Stream?
Show us streaming systems you've built that others said were impossible. Tell us why you believe batch processing is (mostly) dead. Share your vision for data infrastructure unchained from cloud providers.
We're looking for someone who sees "process nightly" and thinks "why wait?"
- Location:
- Weatherford
- Job Type:
- FullTime
- Category:
- Technology