- B2C
- Scale StageRapidly increasing operations
- Top InvestorsThis company has received a significant amount of investment from top investors
- +1
Site Reliability Engineer
- Full Time
Not Available
Lingke Wang
About the job
About the Role
The Ethos Site Reliability Engineering group is looking for a fellow SRE to collaborate with other teams to improve service reliability, automation, observability and improve their application maturity, reliability, and production readiness. In this role, you will create new infrastructure and support tools, help other teams debug issues, and act as advisors regarding production readiness when new services are being designed and brought online. The person in this role will exhibit good mentoring and coding skills as well as the ability to solve complex problems related to reliability. This role also requires good interpersonal skills and the ability to motivate other teams to adopt best practices concerning reliability.
Duties and Responsibilities:
- Collaborate with other engineering teams to enhance the reliability and performance of applications, up to and including submitting PRs to their codebases with improvements
- Assist with automation, continuous integration, and delivery of applications and configurations
- Assist other teams with designing solid foundations and best practices around running services in production environments
- Assist other teams implementing infrastructure level dependencies
- Help other teams identify, design, and monitor proper service-level indicators (SLI) and service-level objectives (SLO)
- Design and implement cross company resilient and reliable tools, resources, infrastructure and processes
Qualifications and Skills:
- 2+ years of full time software engineering experience in relevant role (Backend SWE / SRE / DevOps / Infrastructure / Operations)
- Experienced with software engineering best practices – design patterns, code reviews, testing, etc.
- Ability to communicate technical specifications both verbal and written
- Strong interpersonal skills
- A strong desire to remove any toil from a system
- Preferred background in distributed systems and infrastructure
- Preferred engineering experience in TypeScript, GoLang, Python, PostgreSQL, AWS
- Preferred experience with Kubernetes and Kubernetes operators
- Preferred knowledge of CI/CD systems
- Preferred experience with: terraform, argoCD, Atlantis, GitHub actions and DataDog
#LI-DG1
#LI-Onsite
About the company
Ethos
- B2C
- Scale StageRapidly increasing operations
- Top InvestorsThis company has received a significant amount of investment from top investors
- Valuation $1B+This company has a valuation of $1B or more