Software Developer/ Engineer/ Architect

Linux System Engineer

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category : Products and Technology

Job Details

Salesforce is seeking a systems engineer candidate to join the Production Engineering organization. Working closely with counterparts in the Infrastructure and R&D organizations, this organization provides a global team of engineers monitoring cloud service availability and ready to swiftly repair any service-impacting issues. This objective is met by monitoring the services, reacting to problems, and proactively addressing issues before they affect performance or availability.
When not fighting fires, the team is responsible for fire prevention through monitoring, automation, self-healing and resiliency initiatives. The incumbent in this role would demonstrate a strong focus on tactical operations, as well as large-scale production engineering and orchestration.
 Responsibilities:

  • Keep the customer facing services available at top performance by maintaining the constant health of the supporting systems.
  • Ensuring that work carried out by the Site Reliability team is executed in such a way as to comply with the company’s internal compliance policy and directives.
  • Being available to discuss and resolve technical issues and escalations with other technical staff as required.
  • Participate in on-call rotation
  • Ability to operate in the high-pressure environment and troubleshoot complex issues quickly, while successfully handling multiple priorities.
  • Work to automate detection and resolution of recurring issues in the production environment.
  • Apply software development workflows to operational environments.
  • Measure everything, providing critical operational insight into our applications.
  • Partnering with application teams on new services/features and capacity planning per business needs.
  • Managing middleware services supporting the application platform (search & indexing, distributed messaging, caching).
  • you write software tools for fun and even your personal projects are well organized with code and configurations in version control and with fully automated builds and deployments
  • you are obsessed with eliminating repetitive manual tasks

Required Skills:

  • BS/BA Degree in Computer Science or equivalent industry experience(3-5 years in an Enterprise scale internet service engineering or support role)
  • Expertise in TCP/IP related technologies (networking protocols, network programming, etc.)
  • Expertise in enterprise support of Linux systems with significant exposure to CentOS and Ubuntu Linux
  • Strong understanding of monitoring implementations and administration
  • Strong communication skills (written and oral)
  • Past experience in Incident Management and good understanding of ITIL service operations
  • Experience in working in a 24/7 team managing large data centers
  • Exposure to RDBMS and storage infrastructure
  • Experience with distributed search, messaging and caching systems

Preferred Qualifications:

  • Perl/Python scripting experience
  • Prior Chef/Puppet or automated deployment experience
  • Experience in supporting and maintaining monitoring systems
  • Experience supporting and troubleshooting relational databases and distributed platforms
  • Experience in supporting and maintaining Java applications
  • you have written software in at least 3 of the following languages:
  • Java, php, Ruby, python, bash, puppet, node.js
  • you have used at least three software build and deployment tools like:
  • Ant, Gradle, make, Maven, Ivy, Jenkins, Ansible, Puppet
  • you have a good understanding of at least 2 web and application servers or middleware such as:
  • Apache, nginx, haproxy, jetty, Tomcat, memcache, redis, RabbitMQ, ActiveMQ
  • You have used at least two virtualization, container or cloud platform such as:
  • VMWare, Virtualbox, Vagrant, Docker, AWS, GCP, k8s