Site Reliability Engineer (3+ years exp)
CloudRaft
Job Type: Full Time
Visa Sponsorship: Not Available
Remote Work Policy: Remote only
Relocation: Not Allowed
Hiring contact: Anjul Sahu
The Role
CloudRaft is a consulting company specializing in cloud-native solutions, DevOps, and platform engineering. We partner with startups and growing companies to help them scale by leveraging cloud-native technologies and modern engineering practices. You can learn more about us at https://cloudraft.io.
We are looking for an SRE to join our team. The ideal candidate will have 3-5 years of experience in managing production systems. This position is fully remote (India only) with occasional visits to client locations.
What we are looking for:
- You have a good understanding of, and professional experience with, running Kubernetes on-prem and in the cloud (OpenShift, EKS, AKS, and GKE).
- You are comfortable with programmable infrastructure and can program in Golang or Python.
- You are experienced with production-grade CI/CD using tools like GitHub Actions, Argo CD, Flux, or Jenkins.
- You can set up observability pipelines and backends using popular products like Vector, Fluentd, OpenTelemetry, Prometheus, Grafana, etc.
- You have production experience in troubleshooting and resolving system issues
- You have a good understanding of, and implementation experience with, SRE concepts such as SLIs and SLOs
- You can represent CloudRaft and collaborate with and coach customer teams
- You have the curiosity to learn and develop skills in emerging fields such as AI, MLOps, Edge Computing, etc
- You like sharing your work through writing (blogs) and speaking at community events and conferences
Qualifications:
- Bachelor's degree in Computer Science, IT, or a related field
- 3-5 years of experience in DevOps
- Strong understanding of at least two of AWS, OpenShift, Azure, and Google Cloud
- Hands-on production experience in designing and managing Kubernetes clusters
- Hands-on experience in CI/CD (GitHub Actions, Jenkins, Argo CD, etc) and setting up developer tooling
- Programming skills in any modern programming language (Python, Golang, or Node)
- Infrastructure as Code (Terraform, CDK, Pulumi, etc)
- Understanding of security concepts and tooling
- Excellent problem-solving and troubleshooting skills
- Strong communication and teamwork skills
- Ability to write well, as we prefer async communication
- A product mindset and customer empathy are a big plus
Benefits:
- Competitive salary and benefits package
- Opportunity to work on cutting-edge technology
- Collaborative and supportive work environment
- Chance to make a real impact on the company's success
This is a chance to work as a founding engineer and be part of building a rocket ship. You will also be involved in other areas of company development and will have the opportunity to grow beyond your core role. If you are interested in this position, please submit your resume and cover letter to [email protected].