Software Developer/Engineer/Architect

Staff Network Engineer

Salesforce is seeking an experienced leader for our Global Network Operations organization. Working closely with counterparts in the Site Reliability organization, this global team of engineers is staffed by “follow the sun” teams who are always at the ready to swiftly respond to all service-impacting network issues.

As a leader of the Global Network Operations team, you will be responsible for leading a highly specialized team with the primary focus of ensuring the uptime of the Salesforce network. When not remediating service-impacting incidents, our team is the service owner for our network, responsible for incident root cause analysis, proactive measures to increase network stability, process improvement to reduce human error, and game day exercises, while driving monitoring improvements and alert-handling automation. The leader in this role must be focused on people and team health, key industry-leading engineering practices, service ownership, and agile leadership. You should have an uncanny ability to understand network topologies, pinpoint problem areas, and drive high availability. You will represent the Global Network Operations organization in cross-functional meetings and own decisions related to our network operations policies and methods.

Responsibilities

  • Oversee the day-to-day functions of front-line Network Operations; drive accountability and operational excellence to ensure all network incidents are appropriately actioned and SLAs/OLAs are met.
  • Participate in high-severity outage bridges to ensure appropriate network resources are engaged, provide technical guidance as necessary, and communicate status to senior leaders.
  • Develop and track success metrics and identify areas for improvement in processes, monitoring, and team technical skills.
  • Ensure prompt completion of incident-related root cause analyses.
  • Identify and provide feedback to the Network Engineering organization related to fundamental reliability issues with the network infrastructure, providing strategy and direction for improvement.
  • Contribute to the development of change management policies and procedures, and proactively identify high-risk change activity for mitigation.
  • Manage performance and development of individual contributors. Act as a technical mentor to Network Operations team members.

Required Skills

  • 5+ years of experience in network engineering and/or operations, with a strong preference for someone with experience on a 24x7 team responsible for maintaining the uptime of a large-scale network.
  • 2+ years of experience managing both individual engineers and managers, encompassing mentorship, strategic leadership, and hiring.
  • Exceptional leadership abilities, with experience in mentoring/coaching, performance management, goal setting, and metrics-based reporting.
  • Extensive experience with incident management, and the ability to rapidly assess impact, marshal resources, and direct troubleshooting efforts in order to speed resolution.
  • Self-motivated and goal-oriented, with the ability to effectively prioritize and execute tasks in a high-pressure environment.
  • Highly developed organizational and planning skills, with a strong analytical approach to problem solving and data-driven decision making.
  • Excellent written and verbal communication skills and the ability to effectively convey highly technical information to senior executives.
  • Expert-level knowledge of TCP/IP networking, architecture, and core technologies such as BGP, IS-IS, OSPF, and QoS.
  • High level of proficiency in router and switch configuration, troubleshooting, and maintenance, with emphasis on Cisco and Juniper.
  • Strong knowledge of load balancing, with emphasis on F5.
  • Must demonstrate integrity and maturity, as well as a constructive approach to challenges.

Desired Skills

  • 7+ years of progressive leadership in network/operations roles with 3+ years of people leadership.
  • High level of proficiency and experience maintaining large-scale networks.
  • Experience deploying large-scale, distributed, multi-vendor network environments.
  • Highly skilled in packet analysis, network analysis tools, and analytical fault diagnosis.
  • Experience in deploying, customizing, and scaling a network monitoring solution.
  • Practical knowledge of Agile development methodologies and ITIL principles.

Education

  • MS in Computer Science or related field, or
  • BS in Computer Science plus relevant job-related experience, or
  • Industry-leading job experience equivalent to a college degree.