Software Developer/ Engineer/ Architect

Site Reliability Engineer (SRE) - Consumer Data Management

Overview

The Mastercard Shared Services group is on a mission to drive adoption of and evolve our Consumer Data Management (CDM) Services. These are internally consumed capabilities that are the foundation for Mastercard's product and service offerings globally. We provide a secure, unified source of consumer PII/PCI data for Mastercard products while meeting stringent regulatory requirements. Our reusable API services manage consumers' PII/PCI data so that market-facing programs do not need to develop their own solutions and can focus on their core business services and capabilities instead.

The Mastercard CDM team in Dublin is looking for an early-career Site Reliability Engineer to drive our Consumer Data Management vision and strategy forward. The SRE team within CDM is responsible for pipeline development, change management, site reliability and performance, monitoring and alerting, and supporting emergency response situations. The ideal candidate creates a bridge between development and operations by applying a software engineering mindset to service management. We are seeking an individual who is highly motivated, intellectually curious, and entrepreneurial in seeking out opportunities for improvement.

The Role:

This role involves working with a team of talented SREs to support highly scalable Java-based services. In this role, you will be responsible for:

• Build and maintain pipelines in accordance with Mastercard tooling and conventions.

• Participate in the software development lifecycle, working closely with the development team to ensure that designed solutions meet non-functional requirements such as availability, performance, security, and maintainability standards.

• Maintain services through monitoring of metrics, system health, and analysis of reports.

• Provide support for production and in-house systems. Participate in the on-call production support rota.

• Manage incidents and conduct post-incident reviews and 5-Whys analysis.

• Improve processes and systems within the program.

Requirements:

• Experience with CI/CD and build pipelines using Jenkins.

• Experience with public and private cloud offerings (PCF, Azure, AWS, etc.).

• Knowledge of NoSQL and SQL databases such as Mongo, Oracle, and Postgres.

• Experience and knowledge of managing distributed systems and working with microservices.

• Familiarity with Unix tooling, with strong scripting skills.

• Exposure to monitoring and alerting tools such as Splunk and Dynatrace.

• Proficiency in one of the following: Python, Java, Go, or equivalent.

• Familiarity with defining SLOs and SLAs.

• Prior experience working in an SRE team and an excellent understanding of SRE principles.

• High degree of initiative and self-motivation, with a willingness to take on challenging opportunities.

• Excellent communication, relationship-building, and collaboration skills.