About Supermicro:
Supermicro is a top-tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC, and IoT/Embedded customers worldwide. We are the #5 fastest-growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has given us the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.
Job Summary:
Supermicro is seeking an experienced AI Network Software Solution Architect to lead the design and development of next-generation network infrastructure solutions optimized for AI workloads. This role requires deep expertise in GPU fabric design, high-speed switching, network automation, and network observability at scale. You will architect and develop robust, scalable network solutions in collaboration with external solution providers, VARs, and Supermicro internal teams. You will also define strategy roadmaps and ensure our networking infrastructure is ready for the most demanding AI platforms. This role is based at our headquarters in San Jose, CA.
Essential Duties and Responsibilities:
Includes the following essential duties and responsibilities (other duties may also be assigned):
- Design & Develop Cutting-Edge Solutions: Develop network solutions in collaboration with external solution providers, VARs, and Supermicro internal teams.
- AI-Optimized Network Architecture: Design low-latency, high-throughput AI network fabrics (scale-out, scale-up, converged) to support GPU traffic patterns for training and distributed inferencing.
- Fabric Design & Topology: Architect RAIL, Clos-based multiplane leaf-spine topologies using 100G/400G/800G infrastructure across various networking platforms.
- Control Plane & Protocol Integration: Create multitenant BGP, EVPN, VXLAN, and routing designs for both scale-out internal cluster traffic and external ingress/egress paths to the internet and cloud.
- Define and Drive Strategy: Define and drive networking strategy aligned with business growth, automation goals, and AI infrastructure scalability.
- Network Automation & Orchestration: Develop infrastructure-as-code workflows using Ansible, Terraform, and Python to automate provisioning, configuration, and monitoring.
- System Performance & Observability: Implement telemetry pipelines and traffic analytics for proactive visibility, capacity planning, and SLA adherence.
- HLD/LLD Documentation & Standards: Develop high-level and low-level network solution design documentation, playbooks, and operational standards to support scalable deployments and troubleshooting.
- Technology & Market Evaluation: Evaluate emerging technologies from NVIDIA, AMD, hyperscalers, and connectivity providers to influence roadmap decisions.
- Cross-Functional Collaboration & Leadership: Work closely with platform, hardware, facilities, and security teams to deliver integrated network solutions and infrastructure for AI/ML workloads.
Qualifications:
- 15-20 years of experience in network engineering or architecture roles, including large-scale data center or AI infrastructure environments
- Bachelor's degree in Computer Science, Electrical Engineering, or equivalent experience
- Strong business acumen: able to balance performance, cost, and scalability in architecture decisions.
- Customer-Focused Mindset: Experience working closely with customers to design solutions that meet their unique needs and to resolve complex technical challenges.
- Strong Communication & Leadership: Exceptional communication skills, both written and verbal, with an ability to manage relationships, negotiate effectively, and work with high-level executives.
- Strong hands-on experience with Open Networking switching platforms & SONiC.
- Proven track record designing data center fabrics using BGP, OSPF, EVPN-VXLAN, and overlay networks
- Expertise with InfiniBand, RoCEv2, and RDMA-based networking in GPU environments
- Proficient in network automation using Ansible, Terraform, Python, and Git-based workflows
- Ability to define business-aligned network strategy roadmaps for scalable AI infrastructure
- Experience leading HLD/LLD design efforts and technical documentation
- Strong understanding of telemetry, observability, and proactive network health management
Salary Range:
$200,000 - $220,000
The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.
EEO Statement:
Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.