Binance
Actively Hiring
Trade bitcoin, bnb, and hundreds of other cryptocurrencies in minutes
  • Scale Stage
    Rapidly increasing operations

Senior DevOps Engineer (Monitoring - Grafana, Prometheus)

Posted: 7 months ago
Visa Sponsorship: Not Available
Relocation: Allowed

About the job

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 230 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

Responsibilities:

  • Design, implement, and manage comprehensive monitoring solutions to ensure high availability and performance of our microservices infrastructure and applications.
  • Utilize advanced monitoring tools and scripting to automate the monitoring of our cloud environments, focusing on AWS.
  • Develop and maintain robust logging and alerting mechanisms to identify and mitigate potential issues proactively.
  • Collaborate with the infrastructure team to integrate monitoring solutions into the CI/CD pipeline, ensuring seamless deployments and operations.
  • Conduct performance analysis, capacity planning, and scalability testing to ensure our systems meet current and future demands.
  • Lead incident response and troubleshooting efforts, utilizing monitoring data to quickly resolve operational issues.

Requirements:

  • Minimum of 5 years of hands-on experience with Kubernetes, Elasticsearch, Prometheus, Grafana, and AWS, with a strong emphasis on monitoring and observability in cloud-native environments.
  • Proficiency in programming languages (such as Python, Go or Rust) for automation of monitoring tasks.
  • Experience with infrastructure as code (IaC) tools, and strong understanding of CI/CD principles, including experience with Docker and Kubernetes for container orchestration.
  • Deep knowledge of monitoring tools (such as Prometheus, Grafana, or the ELK stack) and strategies for large-scale environments.
  • Proven track record in managing and troubleshooting large-scale distributed systems, with an emphasis on performance tuning and optimization.
  • Excellent problem-solving skills, with a focus on delivering high-quality, reliable, and scalable infrastructure solutions.
  • Strong communication and teamwork skills, with the ability to work effectively in a fast-paced, collaborative environment.

About the company

Binance

5000+ Employees

Funding

Amount raised: $10M
Funded over: 1 round
Series A (Sep 2017): $10,000,000

Perks

Company sponsored holidays
International transfers mid-career
Free language classes
Regular team-building activities
Competitive salary and benefits
Flexible work hours
Exchange-career-relocation support
Option to be paid in crypto

Founders

Changpeng Z.
Founder • 3 years
