- Early StageStartup in initial stages
Data Engineer
- Full Time
Not Available
About the job
Who are we?
Coastal Carbon is a seed-funded startup on a mission to create positive impact through earth observation and AI. Founded at the University of Waterloo by a team of PhDs and engineers, we’re backed by some of the best AI and climate tech investors like HF0, Inovia Capital and Propeller Ventures, angels like James Tamplin (cofounder Firebase) and Sid Gorham (cofounder OpenTable, Granular), and partners like Amazon AWS and the United Nations.
What do we do?
We’re building multimodal foundation models for the natural world. We believe there’s more to the world than the internet + more to intelligence than memorizing the internet. Our models are trained on satellite remote sensing and real world ground truth data, and are used by our customers in nature conservation, carbon dioxide removal, and government to protect and positively impact our increasingly changing world. Our ultimate goal is to build AGI of the natural world.
About the role
We are seeking a Data Engineer to join our team and help us build out a digital twin of the natural world. The successful candidate will be responsible for supporting the design, building, monitoring, and maintenance of the underlying database and related tooling.
The role will involve:
Developing and maintaining AWS infrastructure to support a multi-Petabyte database
Supporting upstream data pipeline design and implementation
Heavy focus on scalability and optimization for performance
Creating downstream applications to support machine learning and visualization
Requirements
Bachelor’s degree in engineering, computer science or a related field, or equivalent
5+ years of relevant experience
Fluency with SQL programming
Proficiency in Python
Aptitude in parallel processing
Demonstrated experience with managing, ingesting, and transforming geospatial data
Knowledge of Earth observations and methods including satellite remote sensing and weather reanalysis data
Familiarity with object store databases like Redshift/Snowflake
Experience building data pipelines and tooling to support downstream applications
Team player, willing to undertake various tasks to support our collective goals
Nice to have
Proficiency in PyTorch or Tensorflow (with interest in learning PyTorch)
Knowledge of AWS networking and security protocols
Experience with containerization and orchestration technologies such as Docker, AirFlow, and/or Kubernetes.
Location wise, strong preference for in-person in Waterloo, however hybrid work is possible for exceptional candidates.
About the company
- Early StageStartup in initial stages