Senior Manager - Performance and SRE
- Hyderabad, India
- Full time
- Competitive
- 24th December 2024
Full Description
We are looking for a passionate and accomplished leader to join our Reliability Engineering org. The ideal candidate will possess deep expertise in Site Reliability Engineering (SRE) practices, performance engineering, cloud technologies, distributed systems architecture, and SQL/NoSQL databases.
- You have the drive to pro-actively learn and stay up to date in a fast changing environment, including but not limited to evolving the org to meet strategic challenges, evaluation and adoption of new technology, managing significant changes in existing systems, and addressing security and compliance needs.
- You have the ability to understand our platform that powers hundreds of eCommerce websites for our sports fans globally, and involves moving atoms, not just bits, with order management, fulfilment, and manufacturing systems on the backend.
- You have strong execution and collaboration skills in driving alignment across the org for key initiatives and delivering on Objectives and key results (OKRs).
- You are known for sharing knowledge, building trust, admitting mistakes, and fostering a safe and inclusive team environment.
Responsibilities
- Manage a team of software engineers and performance engineers.
- Drive/contribute to the reliability and performance engineering initiatives across the tech org building new capabilities and introducing new practices.
- Build and manage platform tooling that provides standardisation on Service Level Objectives (SLOs), Operational Readiness Reviews (ORRs), incident metrics, availability and performance, etc
- Develop and maintain working relationships with engineering leads across our distributed teams and evangelising SRE best practices and tooling. Develop paved-path solutions and drive adoption of standard tools.
- Manage planning, scheduling and resourcing to deliver for our OKRs and Keeping The Lights On (KTLO) deliverables.
- Own the scale in and scale out of critical services during high traffic volume days/events with a focus on keeping the costs optimal.
- Work closely with production support and incident management teams in supporting our websites, order intake systems, fulfilment systems, analytics and reporting workflows, infrastructure across on-prem and AWS, corporate tools, vendor integrations and other third party software.
- Be proactive in analysing fan impacting incidents related to availability and performance and help develop automations/tooling to reduce Mean time to detect (MTTD) and Mean time to repair (MTTR) for those incidents.
- Find opportunities to apply new technologies such as GenAI to increase productivity and improve operational efficiency.
Qualifications
- Overall 12+ years of experience in Information Technology with proven experience in engineering leadership.
- Ability to mentor, develop talent and drive technical excellence.
- A solid foundation in software development, with 10+ years in two or more of GoLang, Java, Python, ReactJS and NodeJS.
- 5+ years of experience in building in-house tools or setting up vendor tools for monitoring, alerting, alert correlation, on-call management, auto-remediation, chaos engineering, SLIs/SLOs/Error Budget tracking, performance testing, profiling, incident management, change management and reporting dashboards.
- 5+ years of experience in designing and implementing solutions on AWS.
- 5+ years of experience in implementing tools for reliability and performance engineering practices in medium to large sized organisations.
- Experience in SQL and NoSQL DBs, e.g., SQL Server, MySQL, Cassandra, and Scylla.
- Experience in building globally responsible and autonomous teams in Global Capability Centres (GCCs) in India.
- Good understanding of DNS, networking and service discovery is a plus.
- Experience with Kubernetes or Openshift is a plus.
- Experience in developing Slack apps/bots is a plus.
- Experience in eCommerce and/or supply chain systems is a plus.
The organisation
Fanatics
- Data & Technology
- New York, USA
- 2000+ employees
- Website
Relentlessly Enhancing the Fan Experience
More jobs from Fanatics
Social Media Director
- New York, USA
- Full time
- Competitive
Promotion Operations Associate
- Leeds, UK
- Full time
- Competitive
Planner Athlete Partnerships - Fanatics Collectibles
- Milton Keynes, UK
- Full time
- Competitive
Hobby Business Sales Manager - Fanatics Collectibles
- Paris, France
- Full time
- Competitive
Retail Associate - Miami Marlins
- Miami, USA
- Part time
- Competitive
Create a job alert
Get notified as soon as new jobs matching your ambitions go live.