Intermediate Site Reliability Engineer (1 year contract)

Remote •
United States
|Full Time

Posted: 1 month ago

Visa Sponsorship

Not Available

Hires remotely

Everywhere

RelocationAllowed

About the job

PointClickCare is a leading North American healthcare technology platform enabling meaningful care collaboration and real‐time patient insights. For over 20 years, the company has been focused on realizing its vision: to help create a world in which providers and plans can confidently deliver frictionless care. Since its inception, PointClickCare has grown exponentially, with over 2,200 employees working to impact millions across North America. Recognized by Forbes as one of the top 100 private cloud companies and acknowledged by Waterstone Human Capital as Canada’s Most Admired Corporate Cultures, PointClickCare leads the way in creating cloud-based healthcare software. At PointClickCare, we offer a wealth of opportunities and a vibrant culture that empowers our employees. Our dynamic environment is the perfect place to advance your career while engaging in meaningful work alongside incredible colleagues. Here, you’ll discover a space where your talents can thrive, your career can grow, and your work will have a lasting impact on healthcare across North America. We believe that work becomes profoundly fulfilling when driven by a higher purpose. Join us and be part of a team that is making a real impact. To learn more about us, check out Life at PointClickCare and connect with us on Glassdoor and LinkedIn.

**This is a 1-year contract**

About the Team The SaaS Ops Systems Team at PointClickCare is dedicated to designing, implementing, and maintaining efficient systems that enhance organizational productivity and streamline operations. We focus on best of breed technology while ensuring a rock solid and secure technology infrastructure. Our team is made up of like-minded individuals who have a passion for technology while at the same time personalities that build off one another’s strengths. About the Role: As an Intermediate Site Reliability Engineer, you will bridge the gap between development and operations, ensuring our systems are highly reliable, scalable, and efficient. You will be responsible for maintaining the uptime and performance of our systems, applications, automating processes, and implementing best practices in site reliability and system administration. You’ll collaborate closely with cross-functional teams to optimize system architecture and drive operational excellence. You’ll have complete autonomy in making infrastructure performance enhancements to ensure we have the most stable and secure environment possible for our world class healthcare solution. Key Responsibilities:· Experience administering Linux networks -- preferably Ubuntu or other Debian-based distros· Experience in setting up servers and services in both On Premise and Cloud infrastructure· Experience with continuous improvement of process and systems - Puppet, Nagios and Splunk knowledge· VMware administration and maintenance enhancements· iSCSI Storage Administration in a HPE Nimble environment is a plus· Dell Hardware Experience in an OnPrem environment Your Key Strengths: · Expertise in Linux Environments: Deep understanding of Debian based Linux distributions and their configurations, file systems, and management.· Scripting and Automation Skills: Proficiency in scripting languages like Bash, Python, or Perl to automate tasks, streamline processes, and improve system efficiency.· Networking Knowledge: Strong grasp of networking concepts, including TCP/IP, firewalls, DNS, and routing, to effectively manage and troubleshoot network-related issues.· Security Best Practices: Knowledge of Linux security protocols and practices, including user permissions, access controls, firewalls, and intrusion detection systems, to ensure system integrity and data protection.· System Monitoring and Performance Tuning: Ability to use monitoring tools (like Nagios,OpsGenie or Prometheus) to analyze system performance, identify bottlenecks, and optimize resources for better efficiency.· Disaster Recovery and Backup Solutions: Expertise in implementing and managing backup and recovery strategies to protect data and ensure business continuity in case of failures.· Collaboration and Communication Skills: Strong ability to work collaboratively with development teams, stakeholders, and other IT personnel, effectively communicating complex technical concepts to non-technical audiences.

Optional: Bonus Skills:· Azure Cloud and best practices around Infrastructure As Code· Puppet Configuration Management· Identity Access Management in a Service Provider role · Windows Active Directory Administration