Position Description:
Lead the development, integration, and maintenance of server hardware with a focus on CPU, chipset performance, and firmware. Collaborate across teams to enhance hardware reliability and efficiency for our data centers.
Responsibilities:
- Design and enhance server hardware architecture for data centers
- Customize hardware solutions with ODM partners
- Innovate firmware management strategies for system stability and security
- Oversee BMC protocols for reliable remote management
- Implement Metal as a Service (MaaS) for server provisioning automation
- Monitor and maintain server fleet health and performance
- Develop automation scripts using Terraform, Ansible, Python, Linux, and Bash
- Optimize data center functionality in collaboration with network engineering
- Manage hardware lifecycle from procurement to decommissioning
- Contribute to open-source hardware initiatives
- Support cloud service architecture balancing hardware and cloud performance
- Stay updated on CPU, chipset, firmware, and BMC innovations
Qualifications:
- Expertise in server hardware architecture, CPU, chipset, and integration
- Experience in firmware management and system reliability
- Proficiency with BMC and remote server management
- Background in open hardware standards and community contributions
- Mastery of MaaS and server fleet management automation
- Scripting skills with Terraform, Ansible, Python, Linux, and Bash
- Insight into hardware and network architecture interplay in data centers
- Strong problem-solving, strategic thinking, and integration skills
- Collaborative spirit and ability to manage multiple high-stake projects
- Relevant industry certifications are a plus
Required Experience:
- 10+ years in server hardware engineering with emphasis on CPU, chipset, firmware, and BMC
- 6+ years in large-scale data center orchestration
- Proven leadership in innovative hardware projects
- 6+ years working with OEMs and ODMs
- 5+ years with security protocols and high-availability systems
- 5+ years in large-scale data center deployment and management
- 3+ years with Redfish, BMC, and IPMI standards
- 4+ years with cloud storage solutions
- 4+ years in collaborative development environments
- 4+ years in C/C++, Bash, Python, or GO scripting
Desired Experience:
- 10+ years in professional software development
- 8+ years in systems architecture and design
- 6+ years in open-source frameworks
- 4+ years with cloud services (AWS, GCP, Azure)
- 3+ years in technical leadership