Observability Platform System Administrator
(4+ years exp)Archesys
Job Location
Job Type
Full TimeVisa Sponsorship
Not AvailableRemote Work Policy
Onsite or remoteHires remotely in
Preferred Timezones
Relocation
Not AllowedSkills
Hiring contact
Ketan PatelThe Role
Archesys is a technology consulting firm specializing in innovative cloud/devops solutions and services for clients in public-sector industries. We pride ourselves on our cutting-edge technologies, exceptional customer service, and collaborative work environment.
We're seeking a talented Observability Platform System Administrator to uphold the availability, performance, and security of our customers production systems. You'll be a champion of reliability, utilizing your diverse technology background to design, build, and maintain the infrastructure that powers our customers solutions. This plays a pivotal role in impacting our customer's critical production environments.
This is a fully remote, full-time position.
As a Observability Platform System Administrator , Your combination of people skills and system administrator expertise makes you the team hero, solving one problem after another. What if you could use those skills to improve the technology supporting healthcare? We are looking for a system administrator who specializes in Observability solutions to help develop and manage observability enabling solutions to increase the performance and reliability of a large-scale consumer facing system.
As a system administrator on our project, you will ensure that knowledge objects (dashboards, alerts, etc) are properly configured to help identify areas for improvement in the system architecture and constantly look for ways to make the platform better using the latest technology and the best implementation strategies.
Your system administrator expertise will be vital as you identify problem areas and opportunities for improvement in a mission-critical network. You will help your team better understand the network by turning metrics into information and explaining their meaning.
This is an opportunity to broaden your skillset into areas like observability in a system of systems operated in cloud. We focus on growing as a team, so you will share your expertise through leadership and mentoring as you help the teamwork through challenges and develop new methodologies. As a system administrator leader, you’ll identify new opportunities to modernize the network, so your clients achieve their goals. Work with us and resolve daily challenges as we improve healthcare for millions of consumers.
Key Responsibilities:
- System Reliability & Monitoring: Design, develop, and manage robust monitoring and alerting systems (e.g., Datadog, Cloudwatch, Splunk, Newrelic). Proactively identify and resolve potential performance bottlenecks.
- Process Improvement: Contribute to the development and implementation of efficient change management, incident response, and disaster recovery processes. Lead post-incident reviews and root cause analysis.
- Incident Response: Serve as a key player in rapidly analyzing, troubleshooting, and resolving production issues, collaborating with development teams to minimize downtime. Automation: Champion automation to streamline CI/CD pipelines (Jenkins, Terraform, CloudFormation), reduce toil, and enhance our infrastructure's self-healing capabilities.
- Capacity Planning: Assess future needs, proactively manage resource allocation, and optimize performance within our cloud environment (AWS, Azure).
- Continuous Improvement: Embrace Agile methodologies and DevOps principles to drive continuous improvement in our systems' reliability, scalability, and resilience.
Qualifications:
Educational Background:
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
Certification Requirements:
- Certifications in cloud technologies like AWS Certified Solutions Architect, Azure Solutions Architect Expert, or Google Professional Cloud Architect.
- Certifications in container technologies such as Docker Certified Associate or Certified Kubernetes Administrator (CKA) are highly desirable.
- Splunk Enterprise Certified Admin or Splunk Enterprise Certified Architect or Splunk Core Certified Consultant or Splunk IT Service Intelligence Certified Admin
Experience:
- 3-5 years of hands-on experience with cloud computing, DevOps practices, and Site Reliability Engineering (SRE).
- Proven experience in implementing and managing containerized environments.
Skills:
- Proficiency in scripting languages such as Python, Bash, or PowerShell.
- Deep understanding of CI/CD tools and version control systems like Git.
- Expertise in implementing and managing container technologies, including Docker and Kubernetes.
- Understanding of implementing monitoring tools such as AWS cloudwatch,AWS cloudwatch canaries, AWS cloudwatch X-ray, Splunk Enterprise, Splunk ITSI, Splunk on-call, Newrelic
Personal Attributes:
- Excellent problem-solving abilities and analytical thinking.
- Strong communication skills with the ability to articulate complex technical concepts clearly.
- A collaborative mindset, able to work effectively in team environments and contribute to knowledge sharing and best practices.
All work must be conducted within the U.S., excluding U.S. territories. Some federal contracts require U.S. citizenship to be eligible for employment.
You must be legally authorized to work in the U.S. now and in the future without sponsorship.
As the US Government is our clientele, you may be required to obtain a public trust or security clearance.
Some of our available roles are on federal contracts that require a degree or additional years of experience as a substitute.
What We Offer
- Competitive salary and benefits package, including health, dental, and vision insurance, retirement plan, and generous paid time off.
- Opportunity to work with a talented team of professionals on exciting and innovative projects.
- Flexible work arrangements, including remote work options.
- Continuous learning and development opportunities, including access to training resources and professional development programs. A collaborative and inclusive work environment that values diversity and encourages growth.
Join us at Archesys and be part of a team dedicated to delivering cutting-edge cloud solutions for clients in the public sector. Your expertise and passion for technology will help us continue to innovate and grow. We look forward to welcoming you to our team and supporting your success as a Observability Platform System Administrator.
Archesys participate in E-Verify. Upon hire, we will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S.
Archesys is an equal opportunity employer committed to creating a diverse and inclusive workplace. We welcome applications from all qualified candidates, regardless of race, color, religion, sex.