Zilliz
Actively Hiring
Vector database for production-ready AI
- B2B
- Growth StageExpanding market presence
- Top InvestorsThis company has received a significant amount of investment from top investors
Staff Site Reliability Engineer Cloud Platform
- Full Time
Posted: 4 months ago
Visa Sponsorship
Not Available
RelocationAllowed
About the job
What you will do:
- Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms.
- Ensure the reliability, availability, and performance of Zilliz’s distributed database systems.
- Develop and implement strategies for monitoring, incident management, and disaster recovery.
- Automate system operations and maintenance tasks to improve efficiency and reduce manual intervention.
- Design and build tools to manage and monitor infrastructure, ensuring scalability and robustness.
- Collaborate with software engineers to enhance system reliability, scalability, and performance.
- Maintain and improve the CI/CD pipeline to ensure smooth and rapid deployment of changes.
- Actively contribute to the Milvus open-source community, focusing on improving reliability and operational efficiency. What we are looking for:
- 4+ years of experience in site reliability engineering or similar roles with a focus on cloud-native systems.
- Proficiency in scripting languages such as Python, Go, or Java.
- Strong knowledge of container orchestration technologies like Kubernetes and Docker.
- Expertise with cloud platforms such as AWS, GCP, or Azure, and their respective monitoring and management tools.
- Experience with infrastructure as code tools such as Terraform or Ansible.
- Familiarity with CI/CD tools such as Jenkins, GitLab CI, or Argo.
- Proven ability to troubleshoot complex distributed systems and resolve issues promptly.
- Bachelor’s degree or above in computer science, software engineering, or other relevant disciplines.
- Ability to thrive in a fast-paced, startup environment and handle multiple projects simultaneously. Benefits:
- Competitive compensation (cash + equity)
- Regular bonus and equity refresh opportunities
- Medical, dental, and vision insurance
- Paid time off, including vacation, sick leave, and global reset/wellbeing days
- Generous 401(k) and regional retirement plans
About the company
51-200
Artificial Intelligence
Enterprise Software Company
- B2B
- Growth StageExpanding market presence
- Top InvestorsThis company has received a significant amount of investment from top investors
Similar Jobs
C3.ai
C3 AI is a leading enterprise AI software provider for accelerating digital transformation
Zūm
Student transportation that's safe, reliable, and sustainable
Vorticity
The Fastest Scientific Computing Platform on the Planet
BlueVine
Empower small businesses with innovative banking designed for them
BlueVine
Empower small businesses with innovative banking designed for them
Zuora
Cloud-based software that enables any company to transform into a subscription business
Archesys
Improving the government services that impact everyday lives
Archesys
Improving the government services that impact everyday lives
Alarm.com
Get best-in-class security, plus smart home automation the whole family will love