This position has been filled

Summary

The Site Reliability Manager maintains availability, reliability, and performance of internal applications and SaaS platforms at Macmillan Learning. This role focuses on incident management, system optimization, and operational excellence through automation and monitoring strategies.

Key Responsibilities: Lead incident management processes, conduct root cause analyses, and implement preventive measures. Design and maintain monitoring systems with SLIs, SLOs, and SLAs; automate operational tasks; and collaborate with cross-functional teams to ensure system performance and proactive stakeholder communication.
Skills & Tools: Expertise with monitoring tools (Splunk, Azure Monitor), cloud platforms (Azure, AWS), strong scripting skills (Python, Bash), and Infrastructure as Code tools. Proficiency with ITIL frameworks, ServiceNow, and PagerDuty, along with excellent problem-solving and communication abilities.
Qualifications: 5+ years of proven experience in Site Reliability Engineering, DevOps, or related fields with experience managing SaaS platforms like Google Workspace. Familiarity with ITIL frameworks and advanced automation practices required.
Location: New York, New York, United States
Compensation: $120,000 – $130,000/year

Job Description

The Site Reliability Manager (SRM) is responsible for maintaining the availability, reliability, and performance of internal applications and SaaS platforms. This role involves managing incidents, optimizing system performance, and ensuring operational excellence through automation and monitoring strategies.

What you'll do:

  • Lead incident management processes, ensuring swift resolution and communication during outages. Conduct root cause analyses and implement preventive measures.
  • Design and maintain robust monitoring systems for internal and third-party applications, establishing SLIs, SLOs, and SLAs.
  • Automate operational tasks and develop self-healing systems to reduce manual intervention.
  • Collaborate with cross-functional teams and vendors to maintain system performance and address potential reliability issues proactively.
  • Provide leadership in system performance reporting, ensuring proactive communication with stakeholders on system health, ongoing initiatives, incident updates, and post-resolution analysis.

What you'll bring:

  • Expertise with monitoring tools (e.g., Splunk, Azure Monitor) and cloud platforms (e.g., Azure, AWS).
  • Familiarity with ITIL frameworks and advanced automation practices.
  • Strong scripting skills (e.g., Python, Bash) and familiarity with Infrastructure as Code tools.
  • Excellent problem-solving and communication skills.

Ideal experience:

  • Proven experience (5+ years) in Site Reliability Engineering, DevOps, or related fields.
  • Experience with ServiceNow and PagerDuty (or similar).
  • Experience managing SaaS platforms like Google Workspace.

The annual salary range for this role is $120,000 – $130,000.

Macmillan Publishers is the U.S. trade company that is part of the Holtzbrinck Publishing Group, a large family-owned group of media companies headquartered in Stuttgart, Germany. Holtzbrinck Publishing Group's publishing companies include prominent imprints around the world that publish a broad range of award-winning books for children and adults in all categories and formats.

U.S. publishers include Celadon Books, Farrar, Straus and Giroux, Flatiron Books, Henry Holt & Company, Macmillan Audio, Macmillan Children’s Publishing Group, The St. Martin's Publishing Group, and Tor Publishing Group. In the UK, Australia, India, and South Africa, companies in the Holtzbrinck Publishing Group publish under the Pan Macmillan name. The German publishing company, Holtzbrinck Deutsche Buchverlage, includes among its imprints S. Fischer, Kiepenheuer & Witsch, Rowohlt, and Droemer Knaur.

We are an Equal Opportunity Employer. We are actively seeking job applicants who reflect a broad representation of differences, including race, ethnicity, religion, sex, sexual orientation, gender identity/expression, physical ability, neurodiversity, age, family status, economic background and status, geographical background and status, and perspective. We believe that the best companies reflect the incredible diversity in viewpoints, backgrounds, and identities of the world in their staffs, and are committed to inclusive hiring across departments and levels. The successful candidate for this position will be an employee of Macmillan Publishing Group, LLC.
