Site Reliability Engineering Lead

Experian · Technology
Apply Now ↗
📍 Hyderabad, India · Full time

About this role

Company Description

Experian is a global data and technology company, powering opportunities for people and businesses around the world. We help to redefine lending practices, uncover and prevent fraud, simplify healthcare, create marketing solutions, and gain deeper insights into the automotive market, all using our unique combination of data, analytics and software. We also assist millions of people to realize their financial goals and help them save time and money.

We operate across a range of markets, from financial services to healthcare, automotive, agribusiness, insurance, and many more industry segments.

We invest in people and new advanced technologies to unlock the power of data. As a FTSE 100 Index company listed on the London Stock Exchange (EXPN), we have a team of 22,500 people across 32 countries. Our corporate headquarters are in Dublin, Ireland. Learn more at experianplc.com.

Job Description

  • Leadership & Strategy
    • Define and implement SRE best practices across the organization.
    • Bring proven expertise in production support, resilience engineering, disaster recovery (DCR), automation, and cloud operations.
    • Mentor and guide a team of SREs, fostering growth and technical excellence.
    • Collaborate with senior stakeholders to align reliability goals with business objectives.
  • Reliability & Performance
    • Establish SLIs, SLOs, and SLAs for critical services and ensure adherence.
    • Drive initiatives to improve system resilience and reduce operational toil.
    • Design systems that detect and remediate issues without manual intervention, such as self-healing systems and runbook automation.
    • Exposure to chaos engineering tools such as Gremlin, Chaos Monkey, and AWS FIS for simulating outages and improving fault tolerance.
  • Incident Management
    • Act as the primary point of escalation for critical production issues and lead major incident response, root cause analysis, and postmortems.
    • Perform detailed post-incident investigations to identify underlying causes. Document findings and share learnings to prevent recurrence.
    • Implement preventive measures and continuous improvement processes.
  • Observability
    • Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS CloudWatch.
    • Build real-time dashboards to visualize system health and reliability metrics.
    • Configure intelligent alerting based on anomaly detection and thresholds.
    • Combine metrics, logs, and traces to enable root cause analysis and reduce Mean Time to Resolution (MTTR).
    • Knowledge of AIOps or ML-based anomaly detection for proactive reliability management.
  • Collaboration
    • Work closely with development teams to integrate reliability into application design and deployment.
    • Promote a culture of shared responsibility for uptime and performance across engineering teams.
    • Strong interpersonal and communication skills for technical and non-technical audiences.

Qualifications

  • A B.Sc. in Computer Science, MCA in Computer Science, Bachelor of Technology in Engineering, or a higher degree.
  • Hands-on technologist with a minimum of 12 years of experience in software development, including at least 5 years leading an SRE team (currently in that role).
  • Deep expertise with a range of AWS services.
  • Advanced knowledge of monitoring and observability tools.
  • Strong leadership capabilities, with a focus on setting clear direction, aligning team efforts with organizational goals, and maintaining high levels of motivation and engagement across the team.
  • Skilled in working with geographically distributed teams, fostering inclusive collaboration across diverse cultures and backgrounds to enhance productivity and innovation.
  • Excellent communication skills, with the ability to articulate complex ideas, solutions, and feedback clearly to both technical and non-technical stakeholders.
  • Adept at managing conflict constructively and facilitating consensus.
  • Proven track record of building secure, mission-critical, high-volume, web-based transaction systems, preferably in regulated environments (finance and insurance industries).
  • Passionate about solving technical business problems and about designing and developing solutions.
  • Strong individual contributor and team player, capable of collaborating effectively within cross-functional teams.

Additional Information

Our uniqueness is that we celebrate yours. Experian's culture and people are important differentiators. We take our people agenda very seriously and focus on what matters: DEI, work/life balance, development, authenticity, collaboration, wellness, reward & recognition, volunteering... the list goes on. Experian's people-first approach is award-winning: World's Best Workplaces™ 2024 (Fortune Top 25), Great Place To Work™ in 24 countries, and Glassdoor Best Places to Work 2024, to name a few. Check out Experian Life on social or our Careers Site to understand why.

Experian is proud to be an Equal Opportunity and Affirmative Action employer. Innovation is an important part of Experian's DNA and practices, and our diverse workforce drives our success. Everyone can succeed at Experian and bring their whole self to work, irrespective of their gender, ethnicity, religion, colour, sexuality, physical ability or age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity.

Experian Careers - Creating a better tomorrow together

Find out what it's like to work for Experian by clicking here

Frequently Asked Questions

Is the salary disclosed for the Site Reliability Engineering Lead position at Experian?
The salary for this Site Reliability Engineering Lead role at Experian is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Where is the Site Reliability Engineering Lead position at Experian located?
This Site Reliability Engineering Lead role at Experian is based in Hyderabad, India. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Site Reliability Engineering Lead role at Experian full-time or part-time?
This is listed as a full-time position. It is posted as a Site Reliability Engineering Lead role in the Technology department at Experian.
Which team or department does the Site Reliability Engineering Lead at Experian belong to?
This Site Reliability Engineering Lead position is part of the Technology department at Experian. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Site Reliability Engineering Lead position at Experian?
Click the "Apply Now" button on this page. You will be redirected to Experian's official application portal, hosted on SmartRecruiters, where you can submit your application directly.
When was the Site Reliability Engineering Lead job at Experian posted?
This Site Reliability Engineering Lead position at Experian was posted on Jun 10, 2026. Apply as soon as possible; early applications are often reviewed first.