AI Research Engineer (LLM Optimization)

Palo Alto
|Full Time

Posted: 1 year ago

Visa Sponsorship

Not Available

RelocationAllowed

About the job

Chai is one of the fastest-growing, generative AI startups in Silicon Valley. YouTube but for LLM's - we have over 1 million active users.

Who we are looking for:

We need a relentless engineer with 3+ years of experience overseeing and being responsible for optimizing our LLMs. Ensuring they are performant, scaleable, and cost-efficient. You will work alongside equally talented and driven teammates implementing cutting-edge AI inference engines. We need someone who is reliable and has high standards.

Here's why we might not be the right fit for you:

• We work hard and have a high-velocity environment with lots of growth opportunities.

• We value exceptional performance and continuous improvement. We believe that if you aren't constantly learning, you aren't growing.

• You will be responsible and accountable for making high-impact decisions that determine Chai's future

Here are the top 2 reasons why you should join us:

• Exponential growth. 1 Million MAU. Join the team that gets us to 100 million MAU

• Craftsmanship. Build something beautiful

Requirements:

• Familiar with vLLM, quantization, and current techniques of LLM optimization

• 3+ years of experience in software engineering

• Bachelor or Master degree from a leading academic institution

Here is our tech stack:

• Front end: Python, Flutter, Dart

• Back end: Python, GCP, Redis, Kubernetes

Process:

Exceptionally fast, application to offer within 7 days