Software Developer / Engineer / Architect

Site Reliability Engineer

The Site Reliability Engineer is a technical engineering role within the Platform Reliability Engineering organization, responsible for designing and operating ActiveCampaign’s Production services and ensuring that they are up and performing reliably. You will develop, deploy and operate our global infrastructure leveraging AC’s first engineering principles, delivering 24x7 availability with high performance, scalability and reliability.

  • Leverage your Cloud Engineering skills to help us build and operate a global SaaS infrastructure
  • Conduct operational reviews of ActiveCampaign’s major software components, systems, and features to improve the availability, scalability, latency, and efficiency of ActiveCampaign’s services
  • Lead sustainable incident response, blameless postmortems, and production improvements
  • Provide guidance to other team members on managing end-to-end availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions
  • Drive or contribute to infrastructure as code
  • Work on improving the security posture of the infrastructure stack
  • Be available as primary and secondary on-call, and serve as SME on SRE-related incidents
  • Experience with Pulumi, CloudFormation, Terraform, or similar IaC tools
  • Experience coding with Python, Ruby, PHP, Go, or shell scripting
  • Experience with large scale multi-regional infrastructure deployments is a plus
  • Solid experience with and understanding of AWS
  • Experience with EKS and with migrating from a monolith to K8s would be nice to have
  • Experience with PHP, Python, and MySQL-based applications
  • Understanding of Customer Experience Platforms or equivalent would be great
  • BS degree in a technical discipline or equivalent experience in software / web development
  • Instrumented, operated and scaled backend services, preferably across multiple AWS clusters, regions and environments with High Availability
  • Triaged Site Availability Incidents and proactively worked towards reducing MTTR for customer-impacting incidents
  • Partnered with Service owners to implement Service Level Metrics & Service Level Objectives that act as service-level health indicators
  • Established solid patterns for monitoring, benchmarking and deploying new features for backend services
  • Have a demonstrated track record of cultivating strong working relationships and driving collaboration across multiple product and platform teams
  • Curiosity to dig several layers deep into technical solutions with an eye toward continuous improvement
  • 4+ years of experience as a Site Reliability Engineer, Production Engineer or Backend Software Engineer for web-scale or similar platforms

ActiveCampaign is an employee-first culture. We take care of our employees at work and outside of work. Some of our most popular benefits include our comprehensive health and wellness benefits (including no premiums for employees on our HSA plan, tele-health and tele-mental health, and access to the Calm app for meditation), open paid time off, generous 401(k) matching with no vesting, a generous stipend to outfit your remote office, and a focus on career growth including access to personal and professional coaching. We take a proactive approach to diversity and inclusion, offer parental leave and career pathing, and support employees’ ongoing learning and development through Udemy and access to life coaches via Modern Health. We also offer cool swag.

ActiveCampaign is an equal opportunity employer. We recruit, hire, pay, grow and promote regardless of gender, race, color, sexual orientation, religion, age, protected veteran status, physical and mental abilities, or any other identities protected by law.

Our Employee Resource Groups (ERGs) strive to foster a diverse, inclusive environment by supporting each other, building a strong sense of belonging, and creating opportunities for mentorship and professional growth for their members.