- B2B
- Scale StageRapidly increasing operations
- Top InvestorsThis company has received a significant amount of investment from top investors
- +4
Staff Site Reliability Engineer, Application SRE
- Full Time
Not Available
About the job
About the role
Please note, this team is hiring across all levels and candidates are individually assessed and appropriately leveled based upon their skills and experience.
The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as our RBI and SSPM services. We are a team of software engineers focused on improving availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of the engineering stacks. If you are passionate about solving complex problems and developing cloud services at scale, we would like to speak with you.
What’s in it for you
- You will be part of a high caliber engineering team in the exciting space of cloud tools and infrastructure management.
- You will have an opportunity to work on hybrid cloud (Google Cloud, On-prem cloud) and work with cutting edge tooling like spinnaker, kubernetes, docker and more.
- You will solve complex, exciting challenges and improve the depth and breadth of your technical and analytical skills
- Your contributions to our market-leading product support will significantly impact our rapidly-growing global customer base.
What you will be doing
- Partner closely with our development teams and product managers to architect and build features that are highly available, performant and secure
- Develop innovative ways to smartly measure, monitor & report application and infrastructure health
- Gain deep knowledge of our application stack
- Experience improving the performance of micro-services and solve scaling/performance issues
- Capacity management and planning
- Function well in a fast-paced and rapidly-changing environment
- Participate with the dev teams in a 24X7 on-call rotations.
- Ability to debug and optimize code and automate routine tasks.
- Drive efficiencies in systems and processes: capacity planning, configuration management, performance tuning, monitoring and root cause analysis.
Required skills and experience
- 5+ years of experience troubleshooting Unix/Linux
- Experience in managing a large-scale web operations role
- Experience in one or more of the following: C, C++, Java, Python, Go, Perl or Ruby
- Experience with algorithms, data structures, complexity analysis, and software design
- Hands-on working with private or public cloud services in a highly available and scalable production environment.
- Experience with continuous integration and deployment automation tools such as Jenkins, Ansible etc.
- Knowledge of distributed systems a big plus
- Previous experience working with geographically-distributed coworkers.
- Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other SREs, developers, Product Managers, etc
- Should have led teams, collaborating cross-functionally to deliver complex software features and solutions.
Education
- BSCS or equivalent required, MSCS or equivalent strongly preferred
#LI-DB1
About the company
- B2B
- Scale StageRapidly increasing operations
- Top InvestorsThis company has received a significant amount of investment from top investors
- Valuation $1B+This company has a valuation of $1B or more
- 4.2Highly ratedNetskope is highly rated on Glassdoor, with 4.2 out of 5 stars
- 4.1Work / Life BalanceEmployees rate Netskope 4.1/5 on Glassdoor for work / life balance
- 4.1Strong LeadershipEmployees rate Netskope 4.1/5 on Glassdoor for faith in leadership