- Early StageStartup in initial stages
MLOps Engineer
- No equity
- 2 years of exp
- Full Time
Not Available
In office - WFH flexibility
About the job
Job Responsibilities -
• Build software products with R&D teams that are openly collaborative, are non-hierarchical, respect contributions, and work with agility.
• Provide vision & leadership for the technology roadmap of our products.
• Plan and execute PoCs as necessary.
• Build and maintain the next-generation ML platforms and infrastructure
• Create & Maintain CI/CD Pipelines for Machine learning models on AWS. Define, deploy and manage processes and tools for continuous integration (CI/CD), test-driven development, and release management for ML/DL models (Machine Learning and Deep Learning-based) and data pipelines.
• Work closely with the Dev team to create software deployment strategies and solutions and be
accountable for designing, building, and optimizing automation systems with quality and speed
• Accountable for architecture and technical leadership of complete DevOps infrastructure
Skill Requirement – Mandatory
• 2+ years of relavant experience.M
• Understanding of Machine learning pipeline
• Experience with productionizing deep learning applications
• Experience with training, inference and deploying deep learning models using DevOps principles
• Familiarity with commonly used frameworks like tensorflow, torch, sklearn, etc
• Experience with containerization
• Experience with version control tools such as Git, Bitbucket etc
• Good understanding of NLP models like GPT, BERT, etc
• Expertise in Installation, Configuration and file system management of Linux.
• Performance tuning, perform backup and restore.
• Experience on configuration management tool like git and SVN.
• Configuring and managing Apache webserver and MySQL server.
• Hands on experience on Amazon Web Services.
• Hands on different operating systems.
• Hands on experience on virtualization software such as Virtual Box, VMware, Vagrant and
Docker
• Designing and deploying a multiple application using almost all of the AWS features (including EC2, Route53, VPN, IAM, S3, RDS) focusing on high-availability, fault tolerance and auto scaling.
• Linux user and group Management.
• Centos and Ubuntu new installation, patching, trouble shooting.
• Installation and Configuration of Mail Servers like postfix, send-mail and Dovecot.
• Creating AWS Route53 to route traffic between different regions.
• Experience with monitoring tools such as New-Relic, Cloud Watch.
• Configuration & File sharing using NFS, FTP.
• RPM and YUM package installations, patch and other server management.
• Firewall Security with iptables configuration.
• System monitoring CPU, Disk, Memory utilization etc.
• Monitoring & troubleshooting with performance related issues.
• Managing Disks and partitions (LVM) and Swap Space management.
• Installations of different operating systems on servers/desktops.
• Experience on virtualization software such as Virtual Box, VMware.
• Managed and troubleshoot issue related to OS.
File permissions, Backup and restoring Files.
• Packages and Patch Administration.
• Adding and configuring new hardware in the servers.
• Install, configure & operate simple routed LAN, Troubleshooting of hardware problem.