Avatar for Plume Design
Plume Design
Actively Hiring
Plume is the backbone of the services customers demand from operators
  • Top 10% of responders
    Plume Design is in the top 10% of companies in terms of response time to applications
  • Responds within two weeks
    Based on past data, Plume Design usually responds to incoming applications within two weeks
  • B2B
  • +2

Manager, Site Reliability Engineering

Posted: 1 month ago• Recruiter recently active
Visa Sponsorship

Not Available

RelocationAllowed
Hiring contact

Jean Baptiste

About the job

We’re looking for a seasoned Technical Manager, experienced with Customer Facing environments, to Captain our Site Reliability Engineering Team. This team is focused on deployments, fixes, and sustainability. The right candidate needs to have strong technical knowledge in key areas while focusing on customer satisfaction.

What You’ll Do:

  • Supervise a team of Site Reliability Engineers who provide first-line support to Customer Clouds. Deployments, On-call, Application Provisioning are some of the routine tasks.
  • Attend and conduct customer Meetings for Project and Roadmap specification.
  • Manage growth and performance of SRE team members.
  • Be able to step in and execute or triage issues as much as the Engineers. Hands-on past experience is beneficial. Some examples are as follows:

    • Provision and scale multi-datacenter Kubernetes Infrastructure and Applications (EKS)
    • Deploy Software in multiple Production Environments
    • Own monitoring and alerting to production systems, improvements and changes
    • Contribute improvements to the current automation
    • Contribute improvements to our on-call process and alerting
  • Play a key role in the recruitment and retention of top talent.

What You’ll Bring

  • Availability to be in on-call rotation for Production issues
  • Availability to work with a distributed team in different timezones
  • Advanced communication skills
  • Experience managing people

Desired Skill Set

  • 10+ Years of experience with Production Troubleshooting
  • Minimum 5+ Years of experience leading or managing teams
  • Bachelor’s degree in related field or equivalent experience, Advanced degree preferred.
  • This is a leadership role, but you must have Technical knowledge and working experience with:

    • Kubernetes (operate)
    • Basic Terraform Knowledge
    • Experience Programming/Scripting - one of the following (eg. Perl, Python, PHP, GoLang, Java, etc)
    • Experience with modern cloud infrastructure, preferably AWS
    • Experience with modern Linux Operating systems (Enterprise Linux or Debian based)
    • Experience both setting up and utilizing self-managed Monitoring and observability tools (e.g. Nagios/Icinga, Grafana, Prometheus)

Differentiators

  • Troubleshooting production performance/service degradation or outage issues at scale
  • Experience with Infrastructure Troubleshooting in VMs and/or Bare Metal (ssh/Linux)
  • Advanced Kubernetes knowledge
  • Advanced Terraform knowledge
  • Customer Facing experience in previous roles
  • Experience operating Kafka in Production
  • Experience operating NoSQL Databases in Production
  • Experience operating Relational Databases in Production
  • Configuration Management experience

HYBRID - Candidates must be in commutable distance. We are not offering relocation at this time.

Total Compensation package would include: anticipated compensation range of $181,000 - $213,000 + bonus + equity + benefits. Benefits include: a 401k plan and a company match, basic life insurance plus unparalleled health, dental, vision and other benefits and perks. For more details please see: https://www.plume.com/careers

An employee’s base salary and its position within the range may depend on a number of factors including job related knowledge, education, skills, experience and other business related considerations. Published ranges are provided in good faith at the time of posting.

About the company

Plume Design company logo

Plume Design

Actively Hiring
Plume is the backbone of the services customers demand from operators51-200 Employees
  • Top 10% of responders
    Plume Design is in the top 10% of companies in terms of response time to applications
  • Responds within two weeks
    Based on past data, Plume Design usually responds to incoming applications within two weeks
  • B2B
  • Scale Stage
    Rapidly increasing operations
  • Valuation $1B+
    This company has a valuation of $1B or more
Learn more about Plume Design image

Funding

AMOUNT RAISED
$37.5M
FUNDED OVER
1 round
Round
A
$37,500,000
Series A - Jun 2017

Founders

Fahri Diner
Founder • 3 years
United States
image
Adam Hotchkiss
Founder • 3 years
Palo Alto
image
Aman Singla
Founder • 3 years
Palo Alto
image
View the team image

Similar Jobs

Distro company logo
Distro
The AI-powered commerce platform for industrial distributors
Pinterest company logo
Pinterest
Dream about, plan and prepare for things to to do in life
stakefish company logo
stakefish
We are the leading staking service provider for blockchain projects. Delegate to us, stake with us
Assured company logo
Assured
Automated claims is now a reality
Typeface company logo
Typeface
GenAI early stage start up backed by top investors