Avatar for Crunchyroll
Crunchyroll
Actively Hiring
The world’s largest destination for anime & manga focused on creating 360° fan experiences
  • B2C
  • Scale Stage
    Rapidly increasing operations
  • Valuation $1B+
    This company has a valuation of $1B or more

Staff Site Reliability Engineer

Posted: 2 weeks ago• Recruiter recently active
Visa Sponsorship

Not Available

RelocationAllowed
Hiring contact

Kat Mercado

About the job

Who We Are

We're a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our collection of brands.

About the Team

The Site Reliability Engineering (SRE) team is dedicated to ensuring the reliability, scalability, and performance of our data infrastructure. We focus on standardizing and implementing monitoring and alerting across all datastores to track key metrics like errors, latency, and throughput, and to ensure critical systems are covered. Our team also leads efforts to keep databases up-to-date, implements Infrastructure as Code (IaC) for high availability and performance, and automates key processes to enhance operational efficiency.

We lead and evangelize the principle of 100% automation. Additionally, we define and document operational requirements, develop incident response processes, and automate monitoring and compliance checks to maintain a secure and reliable data environment. By continuously improving load testing and optimizing data governance practices, we support the overall health and efficiency of our data systems.

About the Role

Crunchyroll is growing and changing, presenting unique challenges and opportunities to support millions of anime fans around the world. The Data Engineering team provides seamless help to our internal stakeholders, ensuring an exceptional experience for all Crunchyroll fans.

As a Staff Site Reliability Engineer for the Data Engineering team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. Your work will directly impact the availability and performance of our data services, enabling the organization to better decisions. You will collaborate closely with data engineers, and software engineers to develop and drive 100% automation, best practices for deep monitoring and alerting. This role will report to our Director of Data Engineering and will be based out of our Mexico City office.

About You

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, data operations.
  • Extensive experience with AWS cloud platform and their data-related services.
  • Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights).
  • Proficiency in one or more programming languages (e.g. Python, Java)
  • Proficiency in automation frameworks (e.g., Terraform, Cloud Formation).
  • Strong understanding of various performance metrics both at a high level and at a low level like Disk/IO saturation.
  • Experience in identifying and eliminating the bottlenecks in the system.
  • Strong understanding of database internals like types of indexes, schemas, query plans.
  • Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures.
  • Strong understanding and hands-on implementation of CI/CD pipelines and DataOps practices.
  • Experience with data governance, compliance, and lifecycle management.
  • Ability to own and execute projects while effectively collaborating with the team to influence and shape the vision of the data engineering organization.

#LifeAtCrunchyroll #LI-Hybrid

About the company

Crunchyroll company logo

Crunchyroll

Actively Hiring
The world’s largest destination for anime & manga focused on creating 360° fan experiences501-1000 Employees
  • B2C
  • Scale Stage
    Rapidly increasing operations
  • Valuation $1B+
    This company has a valuation of $1B or more

Employees joined from

Learn more about Crunchyroll image

Funding

AMOUNT RAISED
$504.8M
FUNDED OVER
4 rounds
Rounds
U
$500,000,000
Unknown - Apr 2014+3

Perks

Healthcare benefits
Medical, dental, vision, STD, LTD, and life insurance; Health care and dependent care FSA
Retirement benefits
401(k) plan with employer match
Parental leave
Generous vacation
“Use What You Need” time away from work policy
Company meals
Catered lunch and dinner 4 days per week
Wellness benefits
On-site gym, showers, yoga, and wellness classes
Commuter benefits
Employer paid commuter benefit
Pet-friendly office
Pet friendly environment - pet insurance and dog friendly office
Professional development
Company events

Similar Jobs

Altinity company logo
Altinity
The leading service and software provider for ClickHouse
MindCloud company logo
MindCloud
MindCloud is a rapidly growing full service software integration startup
Collinear.ai company logo
Collinear.ai
Collinear solves the fundamental problem of LLM customization; Elevate your LLM game today
testRigor AI company logo
testRigor AI
Executable Specifications for Software Testing
Tendo company logo
Tendo
It’s time to reimagine what’s possible in healthcare
DoctusTech company logo
DoctusTech
We are passionate about helping healthcare organization improve their HCC training