
Senior Manager - Performance and Reliability Engineering

  • Hyderabad, India
  • Full time
  • Competitive
  • 25th October 2024

Full Description

We are looking for a passionate and accomplished leader to join our Reliability Engineering org. The ideal candidate will possess deep expertise in Site Reliability Engineering (SRE) practices, performance engineering, cloud technologies, distributed systems architecture, and SQL/NoSQL databases.

  • You have the drive to proactively learn and stay up to date in a fast-changing environment, including but not limited to evolving the org to meet strategic challenges, evaluating and adopting new technology, managing significant changes in existing systems, and addressing security and compliance needs.
  • You have the ability to understand our platform that powers hundreds of eCommerce websites for our sports fans globally, and involves moving atoms, not just bits, with order management, fulfilment, and manufacturing systems on the backend.
  • You have strong execution and collaboration skills in driving alignment across the org for key initiatives and delivering on Objectives and Key Results (OKRs).
  • You are known for sharing knowledge, building trust, admitting mistakes, and fostering a safe and inclusive team environment.

Responsibilities

  • Manage a team of software engineers and performance engineers.
  • Drive and contribute to reliability and performance engineering initiatives across the tech org, building new capabilities and introducing new practices.
  • Build and manage platform tooling that provides standardisation on Service Level Objectives (SLOs), Operational Readiness Reviews (ORRs), incident metrics, availability and performance, etc.
  • Develop and maintain working relationships with engineering leads across our distributed teams and evangelise SRE best practices and tooling. Develop paved-path solutions and drive adoption of standard tools.
  • Manage planning, scheduling and resourcing to deliver for our OKRs and Keeping The Lights On (KTLO) deliverables.
  • Own the scale-in and scale-out of critical services during high-traffic days/events with a focus on keeping costs optimal.
  • Work closely with production support and incident management teams in supporting our websites, order intake systems, fulfilment systems, analytics and reporting workflows, infrastructure across on-prem and AWS, corporate tools, vendor integrations and other third-party software.
  • Be proactive in analysing fan-impacting incidents related to availability and performance and help develop automations/tooling to reduce Mean Time to Detect (MTTD) and Mean Time to Repair (MTTR) for those incidents.
  • Find opportunities to apply new technologies such as GenAI to increase productivity and improve operational efficiency.

Qualifications

  • Overall 12+ years of experience in Information Technology with proven experience in engineering leadership.
  • Ability to mentor, develop talent and drive technical excellence.
  • A solid foundation in software development, with 10+ years in two or more of GoLang, Java, Python, ReactJS and NodeJS.
  • 5+ years of experience in building in-house tools or setting up vendor tools for monitoring, alerting, alert correlation, on-call management, auto-remediation, chaos engineering, SLIs/SLOs/Error Budget tracking, performance testing, profiling, incident management, change management and reporting dashboards.
  • 5+ years of experience in designing and implementing solutions on AWS.
  • 5+ years of experience in implementing tools for reliability and performance engineering practices in medium- to large-sized organisations.
  • Experience in SQL and NoSQL DBs, e.g., SQL Server, MySQL, Cassandra, and Scylla.
  • Experience in building globally responsible and autonomous teams in Global Capability Centres (GCCs) in India.
  • Good understanding of DNS, networking and service discovery is a plus.
  • Experience with Kubernetes or OpenShift is a plus.
  • Experience in developing Slack apps/bots is a plus.
  • Experience in eCommerce and/or supply chain systems is a plus.

The organisation

Fanatics
  • Data & Technology
  • New York, USA
  • 2000+ employees

Relentlessly Enhancing the Fan Experience
