SR. Lead GPU Server Validation Engineer
Req ID: 129191
Region: Americas
Country: USA
State/Province: Texas
City: Austin
General Overview
Functional Area: Engineering
Career Stream: Design - Software Engineering
SAP Short Name: SLE-ENG-DSE
Job Level: Level 09
IC/MGR: Individual Contributor
Direct/Indirect Indicator: Indirect
Summary
The Senior Lead Storage and Server Test Engineer will play a pivotal role in the design, development, and execution of comprehensive test strategies for our AI data center's storage and server infrastructure. This leadership position requires deep expertise in enterprise storage systems, server architectures, networking, and a strong understanding of the unique performance and reliability demands of AI/ML workloads. The ideal candidate will be a hands-on technical leader, capable of mentoring junior engineers, driving test automation, and collaborating across engineering teams to deliver robust and high-performing solutions.
Knowledge/Skills/Competencies
• Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related technical field.
• 7+ years of experience in hardware and/or software testing, with at least 5 years focused on enterprise-level storage and server systems.
• 3+ years of experience in a lead or senior technical role, mentoring junior engineers or leading test initiatives.
• Proven experience in a lead or senior technical role, mentoring and guiding other engineers.
• Deep expertise in various storage technologies including NVMe, SAS/SATA SSDs/HDDs, RAID, distributed file systems (e.g., Ceph, Lustre, GPFS), SAN, and NAS.
• Strong understanding of server architectures (x86, ARM, GPU servers), CPU/memory subsystems, PCIe, and power management.
• Strong understanding of server architectures (x86, ARM, GPU servers), CPU/memory subsystems, PCIe, power management, and Baseband Management Controllers (BMC) functionality.
• Proficiency in scripting languages (e.g., Python, Bash) for test automation and data analysis.
• Experience with Linux operating systems (e.g., Ubuntu, CentOS, RHEL) and command-line tools.
• Familiarity with networking concepts (Ethernet, TCP/IP, InfiniBand) and network testing methodologies.
• Experience with test methodologies such as performance testing, reliability testing, stress testing, and fault injection.
• Excellent problem-solving, analytical, and debugging skills.
• Strong communication and interpersonal skills, with the ability to collaborate effectively across diverse teams.
Preferred Qualifications
• Familiarity with OCP (Open Compute Project)
• Experience with cloud environments (AWS, Azure, GCP) and virtualization technologies.
• Knowledge of containerization technologies (Docker, Kubernetes).
• Familiarity with AI/ML frameworks (e.g., TensorFlow, PyTorch) and their infrastructure requirements.
• Experience with performance profiling tools (e.g., fio, Iometer, Perf, VTune).
• Contributions to open-source projects related to storage, servers, or testing.
• Certifications in relevant technologies (e.g., NetApp, Dell EMC, HPE, NVIDIA).
Notes
This job description is not intended to be an exhaustive list of all duties and responsibilities of the position. Employees are held accountable for all duties of the job. Job duties and the % of time identified for any function are subject to change at any time.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Celestica's policy on equal employment opportunity prohibits discrimination based on race, color, creed, religion, national origin, gender, sexual orientation, gender identity, age, marital status, veteran or disability status, or other characteristics protected by law.
This policy applies to hiring, promotion, discharge, pay, fringe benefits, job training, classification, referral and other aspects of employment and also states that retaliation against a person who files a charge of discrimination, participates in a discrimination proceeding, or otherwise opposes an unlawful employment practice will not be tolerated. All information will be kept confidential according to EEO guidelines.
COMPANY OVERVIEW:
Celestica (NYSE, TSX: CLS) enables the world's best brands. Through our recognized customer-centric approach, we partner with leading companies in Aerospace and Defense, Communications, Enterprise, HealthTech, Industrial, Capital Equipment and Energy to deliver solutions for their most complex challenges. As a leader in design, manufacturing, hardware platform and supply chain solutions, Celestica brings global expertise and insight at every stage of product development – from drawing board to full-scale production and after-market services for products from advanced medical devices, to highly engineered aviation systems, to next-generation hardware platform solutions for the Cloud. Headquartered in Toronto, with talented teams spanning 40+ locations in 13 countries across the Americas, Europe and Asia, we imagine, develop and deliver a better future with our customers.
Celestica would like to thank all applicants, however, only qualified applicants will be contacted.
Celestica does not accept unsolicited resumes from recruitment agencies or fee based recruitment services.
This location is a US ITAR facility and these positions will involve the release of export controlled goods either directly to employees or through the employee's movement within the facility. As such, Celestica will require necessary information from all applicants upon an applicant's acceptance of employment to determine if any export control exemptions or licenses must be filed.
Recommended Jobs
Electrical Estimator
Position: Electrical Estimator Location: Houston, TX Reports To: Director of Estimating Employment Type: Full-Time About Us: Hays Electrical Services, Inc. is a national electric…
Invoice Coordinator
Job Responsibilities: Review and analyze blocked invoices to identify root causes for non-payment or delayed processing. Facilitate and manage invoicing alongside one other employee for this lo…
Accounts Payable Manager
About Maddox Maddox is the nation’s leading provider of electrical transformers to the commercial and industrial market, with primary locations in South Carolina, Washington State, Texas, Idaho,…
Automotive Floorplan Territory Manager
McAllen, TX| Remote Company Overview: About Westlake Floorplan Company Westlake Floorplan Company was established in 2013 as a division of Westlake Financial Services – the leading lender for in…
Architecture Lead Analyst
Citibank, N.A. seeks an Architecture Lead Analyst for its Irving, Texas location. Duties: Develop architecture, strategy, planning, and software solutions on an enterprise level. Document techni…
Technician, Fiber Operations
Technician, Fiber Operations Position Summary Centric Fiber (“Centric”) delivers industry-leading high-speed internet through Centric Fiber and natural gas services through its UniGas division.…
Receptionist (Bi-lingual)
We are seeking a Bilingual Receptionist for an opportunity in Houston, TX. This person will greet, assist, and provide direction and information to clients, visitors, and other guests of the organiz…
Home Health Nurse LVN / RN
About Us: Amazing Care Home Health provides Private Duty Nursing which differs from other Home Health Nursing. Rather than short visits, traveling to multiple patients per shift, you work with on…
Survey CAD Technician I
Assist in the preparation of project deliverables and contract documents in accordance with the company design standards and client requirements.
Remote Income Specialist - Mentorship Included
Remote Client Agent – Work From Home | Training & Mentorship Included Are you looking for a remote opportunity where your income reflects your effort — not a capped hourly rate? The Gainey Agency …