New

Site Reliability Engineering Technical Leader

Cisco Systems, Inc.
United States, North Carolina, Charlotte
Aug 07, 2025
The application window is expected to close on: October 8th 2025. Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received. The preferred location for this role is RTP, North Carolina, US. Onsite 2 days a week. Meet the Team We in Cisco IT are going through a major transformation increasing automation of our business processes to reduce time to capability and improve performance. To support this transformation, we are looking for an experienced and dedicated individual to expand our capabilities by adopting industry leading automation technologies. Work independently, receive minimal guidance and direction from manager then use the best approach to accomplish work. Solve sophisticated problems. Collaborate with customers, peers, account leaders and external and internal partners for network solving. Contribution result in business or process improvements globally within the function. Solve complex problems; use sophisticated analytical thought to exercise judgment and identify innovative solutions. Anticipate business and industry issues; recommend functional process or service improvements; have an expert understanding of and act as a catalyst to the improvement of Cisco's products, services, and customers. Major focus on LAN, WAN, Wireless management & operations with Scripting skills. As we transition to AI-driven network operations, it is essential to develop models and automation that harness the power of AI and intelligent agents. These tools will streamline common tasks, minimize manual effort, and significantly reduce operational toil within our network. Your Impact Lead and mentor a team of Site Reliability Engineers, fostering a culture of collaboration, learning, and continuous improvement. Define and implement SRE standard practices, processes, principles, and strategies across the organization. Partner with engineering, product, and operations teams to align reliability goals with business objectives. Lead/Implement tools and automation to improve efficiency in monitoring, alerting, capacity planning, and incident management. Reduce manual intervention through automation of operational workflows and processes. Lead incident response during outages and ensure root cause analysis is conducted. Advocate for retrospective and continuous feedback loops to prevent recurrence of issues. Act as the SRE domain expert, enabling teams to adopt and implement reliability standard processes. Work as lead engineer on user incidents and cases to resolution and provide phenomenal user experience Communicate effectively with partners to provide clarity into health and incident status. Drive multi-functional initiatives to improve operational and engineering efficiency. Minimum Qualifications Bachelor's degree in computer science, computer engineering, a related field, and 5+ years relevant work experience. Expertise with development coding languages such as Java or Python in a CI/CD, DevOps model with knowledge of Jenkins, GIT. Advanced knowledge of network controllers for operations & monitoring technologies such as Catalyst Center (CatC), Meraki Dashboard, etc. Certifications a plus: Devnet Professional, CCIE, CWNP, etc. AI literacy, data analytics & visualization, AI-driven automation, prompt engineering or AI security. Preferred Qualifications Experience working in an agile environment with exposure to Agile tools like Jira. Strong solving and problem-solving skills with a focus on getting to the root of the problem. Excellent communication and collaborator leadership skills. Oversee on-call rotations and ensure effective response to production incidents. Experience in leading and mentoring technical teams to ensure high availability and reliability of services. Why Cisco? At Cisco, we're revolutionizing how data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint. Simply put - we power the future. Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you'll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere. We are Cisco, and our power starts with you.