Binance
Actively Hiring
Trade bitcoin, bnb, and hundreds of other cryptocurrencies in minutes
- Scale StageRapidly increasing operations
Senior DevOps Engineer (Monitoring - Grafana, Prometheus)
- Full Time
Posted: 7 months ago
Visa Sponsorship
Not Available
RelocationAllowed
About the job
Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 230 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.
Responsibilities:
- Design, implement, and manage comprehensive monitoring solutions to ensure high availability, performance of our microservices infrastructure and applications.
- Utilize advanced monitoring tools and scripting to automate the monitoring of our cloud environments, focusing on AWS.
- Develop and maintain robust logging and alerting mechanisms to identify and mitigate potential issues proactively.
- Collaborate with infra team to integrate monitoring solutions into the CI/CD pipeline, ensuring seamless deployments and operations.
- Conduct performance analysis, capacity planning, and scalability testing to ensure our systems meet current and future demands.
- Lead incident response and troubleshooting efforts, utilizing monitoring data to quickly resolve operational issues.
Requirements:
- Minimum of 5 years of hands-on experience with Kubernetes, Elasticsearch, Promtheus, Grafana and AWS, with a strong emphasis on monitoring and observability in cloud-native environments.
- Proficiency in programming languages (such as Python, Go or Rust) for automation of monitoring tasks.
- Experience with infrastructure as code (IaC) tools, and strong understanding of CI/CD principles, including experience with Docker and Kubernetes for container orchestration.
- Deep knowledge monitoring tools (such as Prometheus, Grafana or ELK stack) and strategies for large-scale environments.
- Proven track record in managing and troubleshooting large-scale distributed systems, with an emphasis on performance tuning and optimization.
- Excellent problem-solving skills, with a focus on delivering high-quality, reliable, and scalable infrastructure solutions.
- Strong communication and teamwork skills, with the ability to work effectively in a fast-paced, collaborative environment.
About the company
Binance
Actively Hiring
5000+
Financial Exchanges
Stock Exchanges
- Scale StageRapidly increasing operations
Employees joined from
Perks
Company sponsored holidays
International transfers mid-career
Free language classes
Regular team-building activities
Competitive salary and benefits
Flexible work hours
Exchange-career-relocation support
Option to be paid in crypto
Similar Jobs
LogiNext
SaaS for Delivery and Transportation Business
FORMCEPT
#1 Augmented Data Management Company Trusted by Fortune 1000 Brands Globally
Marvin
The best user research platform for designers, product teams and consultants
Cloud Scale®
Transforming Cloud, Data Center Management & profitability with Integrated Data-Insights
eLitmus.com
Accurate skill matching using Data Analytics, Research & Technology
StackBOX
At StackBOX, we are helping our clients win at the last mile
| Networth Corp |
Fast-tracking of global problem solving and value generation from innovation
Get My Parking
We Integrate Parking and Mobility
Zenduty
Incident Management System for SRE, DevOps, ITOps and Support teams