Technical Program Manager - Network Operations Tools
Lambda
Job Location
Job Type
Full TimeVisa Sponsorship
Not AvailableRelocation
AllowedHiring contact
Jeri VillegasThe Role
Lambda's GPU cloud is used by deep learning engineers at Stanford, Berkeley, and Carnegie Mellon. Lambda's on-prem systems power research and engineering at Intel, Microsoft, Kaiser Permanente, major universities, and the Department of Defense.
If you'd like to build the world's best deep learning cloud, join us.
*Note: This position requires presence in one of our San Francisco Bay Area office locations (Currently San Jose, expanding to Peninsula/SF) 5 days per week.
What You’ll Do
We are currently seeking a seasoned Senior Technical Program Manager to spearhead the development and implementation of a robust incident management program tailored to our fast-paced production environment and data centers. In this role, you will leverage your strategic insight and technical prowess to foster collaboration across teams and deliver innovative solutions. Your primary responsibilities will include defining project objectives, orchestrating timelines, and aligning resources to ensure the seamless rollout of an incident management program that enhances operational efficiency and meets organizational goals.
Key Responsibilities:
- Strategy Development: Collaborate closely with stakeholders to analyze business needs and craft a comprehensive strategy for implementing an incident management program, encompassing the integration of new methodologies and optimization of existing workflows.
- Project Planning and Execution: Lead the planning, execution, and successful delivery of projects related to incident management, including defining project scope, objectives, milestones, and deliverables.
- Cross-Functional Collaboration: Foster strong relationships with cross-functional teams, including Data Center Operations, Technical Support, Supply Chain, and Manufacturing, to ensure seamless alignment and collaboration throughout the project lifecycle.
- Resource Management: Identify resource requirements, delegate tasks, and effectively manage project resources to ensure timely delivery within budgetary constraints.
- Risk Management: Proactively identify risks and dependencies, develop robust mitigation strategies, and address issues to minimize project risks and ensure successful project outcomes.
- Quality Assurance: Establish and enforce stringent quality standards to ensure that incident management protocols adhere to the highest benchmarks of performance, reliability, and usability.
- Stakeholder Communication: Provide regular updates to stakeholders on project progress, milestones, and potential risks, facilitating informed decision-making and maintaining project momentum.
- Continuous Improvement: Drive continuous improvement initiatives to optimize processes, tools, and workflows, identifying opportunities to enhance operational efficiency and drive business value.
You
- Bring a minimum of 7 years of experience in program/project management, with a proven track record of success in complex environments.
- Demonstrate proficiency in agile and waterfall management techniques.
- Possess a strong technical background, ideally with an Engineering degree or equivalent experience.
- Exhibit exceptional leadership, communication, and organizational skills.
- Have a collaborative approach, with the ability to lead by influence and example.
- Proficient in various project management tools.
- Comfortable navigating environments characterized by ambiguity and uncertainty.
- Experience in production networks/data centers
Nice to Have
- Experience in the machine learning or computer hardware industry.
- Previous experience in a manufacturing environment, particularly with complex systems integration.
- Familiarity with large-scale distributed data center environments.
About Lambda
- We offer generous cash & equity compensation
- Investors include Gradient Ventures, Google’s AI-focused venture fund
- We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
- Ourresearch papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
- We have a wildly talented team of 200, and growing fast
- Health, dental, and vision coverage for you and your dependents
- Commuter/Work from home stipends
- 401k Plan with 2% company match
- Flexible Paid Time Off Plan that we all actually use
Salary Range Information
Based on market data and other factors, the salary range for this position is $150,000 - $220,000. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.
A Final Note:
You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.
Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.