- B2B
- Growth StageExpanding market presence
- Recently fundedRaised funding in the past six months
Data Engineer
- $125k – $175k
- Remote •+2
- 2 years of exp
- Full Time
Reposted: 2 years ago
Job Location
Visa Sponsorship
Not Available
Hires remotely
Everywhere
RelocationAllowed
Skills
Python
SQL
Distributed Systems
Apache Spark
Apache Airflow
About the job
Who You Are
You are a thoughtful engineer. You understand the complexities of distributed systems and how to triage and solve issues that arise with them. Scalability is top of mind when designing any system or writing code. You believe building a better ETL system requires close collaboration with the machine learning and data science teams. You avoid reinventing the wheel unless necessary and are excited by opportunities to contribute to the open-source community.
Day 5
- Learn about Viaduct’s history and mission
- Get to know every team member
- Set up your development environment
- Understand Viaduct’s ETL pipelines and run your first DAGs
- Deep dive into the nuances of vehicle data
- Attend our weekly ML lunch
Day 30
- Take ownership of ETL pipelines
- Identify scalability bottlenecks in the existing ETL pipelines
- Be familiar with the day-to-day work of machine learning engineers and data scientists
- Learn the architecture of data engineering systems and services
Day 90
- Be the ETL pipeline expert at Viaduct
- Improve overall data quality and discoverability
- Confident in the scalability of Viaduct’s ETL pipelines
- Present your work at our weekly ML lunch
- Comfortable contributing to our engineering infrastructure and systems
Expected Skills
- 2+ years working with large-scale data processing tools (Spark, Hadoop, Airflow, etc)
- Expertise in Python, Scala, or Go
- Experience managing Spark clusters and tuning Spark jobs
- Active user of and/or contributor to open-source projects
- Exceptional Skills
- Familiar with Terraform, Docker, and Kubernetes
- Familiar with managing data engineering infrastructure (Airflow, Kubernetes, etc)
Why Viaduct
- Contribute to the open-source ecosystem
- Work with established experts in deep learning, time-series analytics, and convex optimization
- Endless opportunities for technical learning and personal growth
- Full health, vision, and dental benefits
About the company
- B2B
- Growth StageExpanding market presence
- Recently fundedRaised funding in the past six months
Similar Jobs
GVOS
An Edge Cloud for Autonomous Driving
C3.ai
C3 AI is a leading enterprise AI software provider for accelerating digital transformation
Uncountable
Accelerating R&D via Machine Learning
StarTree
Fast, Fresh, Actionable Insights at Scale!
Scale AI
The API For Training Data
Genies
Empowering humans to create and own their own avatar ecosystems
Scale AI
The API For Training Data
Scale AI
The API For Training Data
DoorDash
Your favorite restaurants, delivered