- B2B
- Growth Stage: Expanding market presence
- Recently funded: Raised funding in the past six months
Senior Machine Learning Engineer
- Full Time
About the job
We're a team of machine learning engineers training task-specific generative models for psychology. Our goal is to build an AI therapist that helps people change their minds and their lives in the ways they want to. We partner with organizations around the globe to power use cases such as AI-assisted crisis text response, while securing best-in-class datasets for our models.
Success to us means every human being in need of support has somewhere to go. We're a well-funded, seed-stage startup backed by top-tier tech investors involved in Hugging Face, ElevenLabs, Replit, Captions, Shopify, Plaid, Notion, Canva, Twitch, Airtable, and others.
We're building a powerful team by empowering our engineers with the autonomy, flexibility, and resources to do their best work. We dream big and iterate fast. If that sounds like home, we'd love to hear from you.
The Role
As a machine learning engineer, you’ll contribute to our ML research and development in areas including data collection, data curation, continued pre-training, ablation studies, evals, creation of hand-crafted supervised fine-tuning data, preference optimization, and state-of-the-art reinforcement learning research. You'll also contribute to our internal tooling for improving our models and understanding how they perform in production. You’ll be able to work at a faster pace than almost anywhere else while writing high-quality code and producing meaningful scientific insights.
You’ll primarily work with 70B-parameter models and fine-tune GPT-4o through our partnership with OpenAI. You will be responsible for reading papers and identifying state-of-the-art techniques for us to learn from. Our application backend is written in Kotlin, and our ML stack (PyTorch) uses modern ML tooling, including some that we’ve developed in-house. We write high-quality, typed, Zen code.
About you:
3+ years developing deep learning models in PyTorch, TensorFlow, or JAX, including 1+ years in a production environment.
Experience fine-tuning language models, like Llama.
A demonstrated track record of excellence, including coming up with new ideas or improving upon existing ideas in machine learning.
Experience with software engineering best practices and a deep appreciation for what good code looks like.
You're fast-paced and pragmatic. You'd rather prove out an idea through quick MVP code than present a slide deck to explain it.
You can explain complex ideas to non-technical people.
You understand why deep learning is magic.
What We Offer
Competitive compensation (90th percentile)
Hybrid environment, highly collaborative, fast-paced culture
Work with a crazy passionate team that cares deeply about the impact of our work on mental health, especially in a post-AGI world