Jun 24, 2024

Site Reliability Engineer - Manager (Hybrid-Flexible Options)

  • Broadridge
  • Multinational Bancorp Center, 6805 Ayala Ave, Makati, Metro Manila, Philippines
  • Full time
  • Information Technology

Job Description

At Broadridge, we’ve built a culture where the highest goal is to empower others to accomplish more. If you’re passionate about developing your career while helping others along the way, come join the Broadridge team.

Broadridge, a global Fintech leader with more than $6 billion in revenues, provides the critical infrastructure that powers investing, corporate governance and communications to enable better financial lives. We deliver technology-driven solutions that drive digital transformation for our clients and help them get ahead of today’s challenges to capitalize on what’s next.

Role Overview

As an Application Site Reliability Engineer (SRE), you will play a critical role in ensuring the stability, scalability, and reliability of our products and services. You will work closely with cross-functional teams to design, develop, and deploy solutions that enhance the performance and uptime of our applications.

The Application SRE is part of the Enterprise Platform (EP) group and is responsible for supporting and running our standard platforms efficiently and effectively. You will be expected to collaborate closely with other functions within EP (DevOps/Cloud Platforms, Quality Engineering and Developer Experience) to provide robust, integrated and best-in-class solutions for our product engineering teams.

Key Responsibilities

  • Implementing Site Reliability Engineering best practices, including error budgeting, service level objectives (SLOs), and monitoring and alerting systems

  • Building automation tools and processes to improve the efficiency and reliability of running our products and standard platforms

  • Performing capacity planning and system design to ensure that our systems can handle increasing traffic and load

  • Troubleshooting complex technical issues and providing root cause analysis to prevent future incidents

  • Participating in incident calls to respond to system outages and emergencies

  • Collaborating with software developers to define and implement reliability requirements for new products/applications/services

  • Conducting post-mortem analyses to identify opportunities for improvement and prevent recurring issues

  • Using data-driven decision making to proactively prevent potential incidents and problems

  • Supporting product development teams in the implementation of tools, processes, and practices to improve stability, reliability, and extensibility of their products

  • Collaborating across the EP function to ensure that standard platforms are best-in-class

  • Driving standard implementation of non-functional requirements (NFRs) in new product development and owning the “deep-dive” process to improve problematic applications

  • Managing and governing vulnerabilities and end-of-life components within our products

Skills and Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, Software Engineering, or a related field

  • 3+ years of experience supporting production applications (i.e., in SRE and/or DevOps roles)

  • 2+ years of experience managing technology teams

  • Practical understanding of implementing SLOs and SLIs

  • Knowledge of Windows and/or Linux systems administration and networking fundamentals

  • Experience in implementing observability and alerting tools (e.g., Datadog, Splunk)

  • Ability to automate application operations using languages and tools such as Python, Java, shell scripting, SQL, Terraform, Chef, Puppet, and Ansible

  • Knowledge of AWS

  • Experience in supporting middleware such as databases, web servers, MQ, and Kafka

  • Familiarity with containerization technologies, such as Docker and Kubernetes

  • Excellent problem-solving skills and attention to detail

  • Ability to work well under pressure and prioritize tasks in a fast-paced environment

  • Fluency in English is essential.

  • Ability to collaborate closely with others

  • Continual improvement mindset

Leadership Responsibilities

As a senior member of the team, you will be responsible for:

  • Overseeing initiatives and deliverables across the team

  • Owning the technical and design decisions made by the team

  • Coaching and mentoring members of the team

  • Keeping your finger on the pulse: identifying and developing new ideas and initiatives

  • Acting as an advocate for SRE across the Enterprise Platform team and the wider Broadridge community

  • Reviewing work and improving SRE processes

  • Collaborating with other teams outside Enterprise Platform

  • Contributing to the strategic direction of the function

Manager Responsibilities

As a people manager, you will be responsible for general line management, including:

  • Hiring of staff

  • Career development of your direct reports

  • Setting, tracking, and reviewing goals and objectives

  • End of year appraisals

  • Accountability for the overall performance (deliverables and output) of the SRE team

  • Compliance governance (mandatory training, timesheets, holiday approvals, etc.)

What Broadridge Offers

  • An opportunity to be part of a global leader in fintech innovation.

  • A culture of inclusivity, collaboration, and professional development.

  • Competitive salary, comprehensive benefits, and a commitment to work-life balance.

  • Access to state-of-the-art technologies and tools.

  • Continuous learning opportunities through professional development programs and educational assistance.

Broadridge is proud to be an equal opportunity employer. We celebrate diversity and are dedicated to creating an inclusive environment for all employees. We encourage applications from individuals of all backgrounds.

Broadridge associates helped us envision our Connected Workplace, a work model that allows associates around the globe, depending on their role responsibilities, to take advantage of the benefits of both on-site and off-site work to support our clients, one another, and the communities where we live and work. Our Connected Workplace is grounded in the concept of FACS: Flexible, Accountable, Connected, and Supported, which is our commitment to our associates. FACS supports our strong culture and allows us to achieve business goals while supporting meaningful work-life integration for our associates.

We are dedicated to fostering a diverse, equitable, and inclusive environment and committed to providing a workplace that empowers associates to be authentic and bring their best to work. We believe that associates can only do their best when they feel safe, understood, and valued, and we work diligently and collaboratively to ensure Broadridge is a company—and ultimately a community—that recognizes and celebrates diversity in all its dimensions.