Sr/Staff Site Reliability Engineer
(5+ years exp)SilverCloud Health
Job Location
Job Type
Full TimeVisa Sponsorship
Not AvailableHires remotely in
Relocation
AllowedSkills
Hiring contact
Pete Karl IIThe Role
SilverCloud Health is the world’s leading digital mental health company, enabling healthcare organizations to deliver clinically validated digital therapeutic care, that improves outcomes, increases access & scale while reducing costs. Today, SilverCloud is being used by over 300 organizations globally to meet the mental health needs of their end users/patients. The Platform has been deeply validated working with global experts, through full randomized control trials as well as real world data from over 500,000 SilverCloud end users. The platform continues to lead the industry with its effectiveness, engagement and range of clinical programs covering the spectrum of mental health needs.
As a fast-growing company, doubling year on year and with our proven ability to attract and retain customers, we are now at the exciting phase of scaling up all aspects of the company as we look to expand into new markets. A critical component of this is our continued investment in our existing customers and ensuring they are experiencing all SilverCloud has to offer. SilverCloud Health is looking for a strong candidate to set up our Site Reliability Engineering team for US operations. This candidate will have experience managing systems at scale and be passionate about reliability, resiliency, and scalability. This candidate would have a background managing systems at scale in the cloud.
SilverCloud Health will support you to:
- Manage large migrations to new cloud platform
- Innovate on the platform (develop and impact SLOs + KPIs, etc.)
- Provide guidance to CISO and technology leadership on platform investment
- Own incident management + postmortem history + on-call
- Fine tune our systems’ performance tuning with a focus on high availability and scalability.
- Monitor and troubleshoot the application and infrastructure.
- Assist engineering team with accelerating processes through automation.
- Ensure SCH are up to date on the latest patches, security issues through automation and building security into our SDLC processes.
- Accelerate code velocity and improve process for engineering to improve ship time.
- Continually improve reliability of systems.
- Review and provide feedback on GitHub Pull Requests to team members & cross functional engineering teams.
- Champion the concepts of immutable containers, Infrastructure as Code, stateless applications, and software observability throughout the organization.
As a Sr. Site Reliability Engineer, you...
- Have 5+ years’ experience in DevOps or SRE.
- Are a strong communicator in both written and verbal form.
- Have a strong understanding of:
- AWS or GCP or Azure (We are moving to Azure)
- Terraform or Kubernetes or Ansible (we use Ansible today)
- Bash
- Docker
- Advanced networking concepts
- Cloud architecture
- Have a knowledge of:
- Postgres/MySQL
- Jenkins
- Python
- Have strong attention to detail.
- Are proactive.
- Have an enthusiasm and passion for quality.
- Are a Team player with an ability to bring teams together.