Avatar for Stord
Stord
Actively Hiring
The Cloud Supply Chain
  • B2B
  • Scale Stage
    Rapidly increasing operations
  • Top Investors
    This company has received a significant amount of investment from top investors
  • +4

Sr Site Reliability Engineer

Posted: 2 months ago
Visa Sponsorship

Not Available

Hires remotely in
RelocationAllowed
Hiring contact

Jonathan Lehrman

About the job

About the SRE Position:

Stord is looking for a mission-driven Senior SRE to be a driving force behind an exceptionally resilient, efficient, and secure infrastructure and platform. You will be looked upon to expertly deliver a catalog of high-quality, world-class products and services to our customers at scale. We aim to establish a dynamic operational environment that seamlessly integrates cutting-edge technologies, embraces automation, has a high degree of ownership and fosters a culture of continuous improvement.

The SRE team is committed to accelerating development, enabling continuous delivery, enhancing security and ensuring operational excellence. This role is integral to designing and implementing the infrastructure and developer tooling that will enable Stord to scale our systems and processes, enhance reliability and availability in an efficient manner.

SRE operates cross-functionally, collaborating with product management, software developers, data science and other operations teams. At Stord, each member of the team has the ability to impact all aspects of the development process from ideation, design, delivery, maintenance, and operations.

What You'll Do:

  • Collaborate with cross-functional teams to design and implement CI/CD pipelines that automate fast and safe delivery of software to our customers, enable experimentation, create fast feedback loops and developer self-service capabilities.
  • Lead efforts in automating deployment, monitoring, and infrastructure management.
  • Proactively identify and resolve performance bottlenecks, system failures, and security vulnerabilities.
  • Minimize or eliminate degradations and failures related to fault tolerance, security, availability, and performance.
  • Develop SLOs and SLIs to manage risk through continuous monitoring and measurement of system performance.
  • Build, manage and deploy highly available, self-healing, customer facing production infrastructure and applications (microservice and event based architectures) using Docker, Kubernetes, Helm and Terraform.
  • Leverage 12 Factor App methodology when building and deploying all our services and systems.
  • Implement best practice infrastructure as code (IaC) principles for configuration management and deployment of infrastructure.
  • Enhance operational efficiency by identifying repetitive tasks and developing automation to eliminate toil work.
  • Implement robust metrics, monitoring and alerting for proactive issue identification and resolution.
  • Participate in incident response, on-call rotation and post-incident reviews to ensure 24/7 availability of critical systems and to learn from failures and continuously improve system reliability.
  • Implement and enforce security best practices for infrastructure and applications.
  • Collaborate with security teams to ensure compliance with industry standards and regulations.
  • Empower others by sharing knowledge through documentation, training, and mentorship.

What You'll Need:

  • Proven experience as a Senior DevOps Engineer or Senior Site Reliability Engineer.
  • Strong expertise in cloud platforms such as AWS, GCP or Azure.
  • Strong experience with CI/CD tools (Github Actions, GitLab CI, CircleCI) and version control systems (Git).
  • Proficiency with infrastructure-as-code tools (e.g., Terraform, Ansible, Cloudformation).
  • Hands-on experience with container orchestration tools like Docker and Kubernetes.
  • Solid understanding of networking, security, and system engineering.
  • Experience with monitoring and logging tools (e.g., Datadog, Prometheus, Grafana, ELK stack).
  • Strong scripting skills in languages such as Python, Shell or similar.
  • Familiarity with security best practices and compliance requirements.
  • Excellent problem-solving and troubleshooting skills.
  • Ability to work collaboratively in a fast-paced, agile environment.
  • Passion for building the highest-quality solutions for the long term that delight the customer (both internal and external customers).
  • Automation first mindset.
  • High degree of ownership and pride for work.

Bonus Points:

  • Industry certifications - (AWS, GCP, Linux Foundation - CKA, CKS, CKAD)
  • Bachelor's or higher degree in Computer Science, Information Technology, or a related field.
  • Previous startup experience
  • Previous logistics or supply chain experience

#LI-Remote

About the company

Stord company logo

Stord

Actively Hiring
The Cloud Supply Chain501-1000 Employees
  • B2B
  • Scale Stage
    Rapidly increasing operations
  • Top Investors
    This company has received a significant amount of investment from top investors
  • Valuation $1B+
    This company has a valuation of $1B or more
  • 5.0
    Highly rated
    Stord is highly rated on Glassdoor, with 5.0 out of 5 stars
  • 4.1
    Work / Life Balance
    Employees rate Stord 4.1/5 on Glassdoor for work / life balance
  • 4.9
    Strong Leadership
    Employees rate Stord 4.9/5 on Glassdoor for faith in leadership
Learn more about Stord image

Funding

AMOUNT RAISED
$125.2M
FUNDED OVER
4 rounds
Rounds
C
$75,000,000
Series C - Mar 2021+3

Perks

Benefits:
Medical, Dental, and Vision Insurance
Benefits:
401(k)
Benefits:
Flexible Parental Leave
Below are a few perks of joining our team:
Unlimited, flexible PTO and generous holiday schedule
Below are a few perks of joining our team:
Catered Lunches
Benefits:
Life and Disability Insurance
Benefits:
Health Savings Account (HSA) Eligibility
Below are a few perks of joining our team:
Competitive salary
Below are a few perks of joining our team:
Gym reimbursement program

Founders

Jacob Boudreau
CTO • 3 years • 9 years
Atlanta
image
Sean Henry
Founder • 3 years • 9 years
Atlanta
image
View the team image

Similar Jobs

Archesys company logo
Archesys
Improving the government services that impact everyday lives
swivl company logo
swivl
Self Storage Automation Platform
FanDuel company logo
FanDuel
FanDuel is America's #1 Sportsbook. We make every moment more
AnswerRocket company logo
AnswerRocket
The AI-powered analytics solution for everyone
Archesys company logo
Archesys
Improving the government services that impact everyday lives
FanDuel company logo
FanDuel
FanDuel is America's #1 Sportsbook. We make every moment more
FanDuel company logo
FanDuel
FanDuel is America's #1 Sportsbook. We make every moment more
Halen Technology company logo
Halen Technology
Halen is a super-app that offers a variety of services in one app