Principal Software Engineer AI/HPC Middleware

Cornelis Networks, Inc.
Austin, TX

Cornelis Networks delivers the world’s highest performance scale-out networking solutions for AI and HPC datacenters. Our differentiated architecture seamlessly integrates hardware, software and system level technologies to maximize the efficiency of GPU, CPU and accelerator-based compute clusters at any scale. Our solutions drive breakthroughs in AI & HPC workloads, empowering our customers to push the boundaries of innovation. Backed by top-tier venture capital and strategic investors, we are committed to innovation, performance and scalability - solving the world’s most demanding computational challenges with our next-generation networking solutions.

We are a fast-growing, forward-thinking team of architects, engineers, and business professionals with a proven track record of building successful products and companies. As a global organization, our team spans multiple U.S. states and six countries, and we continue to expand with exceptional talent in onsite, hybrid, and fully remote roles.

Cornelis Networks is seeking a highly experienced Principal Software Engineer to lead the design, development, and upstream enablement of our AI and HPC communication middleware stack. This role focuses on enabling and optimizing HPC middleware (MPI and SHMEM) and AI collective communication libraries (CCL — e.g., NCCL, RCCL, and related stacks) over Cornelis Networks interconnects. You will deliver performance, correctness, and deployability across real customer workloads by collaborating across the full system stack — including kernel/driver, switches, system software, and application/framework teams — while contributing upstream to key open-source projects.

Key Responsibilities:


  • Lead design and implementation enabling and optimizing HPC middleware (MPI and SHMEM) and AI middleware CCL stacks (e.g., NCCL/RCCL and related collective communication libraries).

  • Deliver performance-critical communication paths including low-latency small and medium message transfers, bulk SDMA data movement, GPU-Direct and IPC communication, and collective acceleration.

  • Design and tune collective communication algorithms (latency-optimized and bandwidth-optimized), including GPU-aware collectives.

  • Integrate middleware with underlying transports and provider layers such as libfabric/OFI, UCX, and verbs-style interfaces to achieve performance, portability, and maintainability.

  • Implement and optimize memory registration strategies, progress and execution models, completion semantics, multi-rail communication behavior, and GPU memory handling.

  • Drive upstream contributions across MPI/SHMEM projects, CCL ecosystems, and related components with a focus on upstreamable design and long-term maintainability.

  • Represent Cornelis Networks in open-source communities through technical reviews, design discussions, and sustained technical leadership.

  • Implement and prototype Ultra Ethernet capabilities supporting MPI/SHMEM and AI collective communication use cases.

  • Collaborate with ecosystem partners to validate deployment models and performance scaling on customer-relevant configurations.

  • Work closely with kernel, driver, and switch teams to deliver end-to-end performance aligned with the Cornelis product roadmap.

  • Participate in architecture reviews, performance tuning, scaling validation, and multi-layer root-cause investigations.

  • Analyze performance traces and triage advanced customer issues, translating findings into robust fixes and upstream improvements.

  • Publish internal and external best practices, including tuning guidance, reference configurations, and debugging methodologies.

  • Mentor senior engineers and promote best practices for design, testing, documentation, and code quality.

  • Help define the long-term middleware technical roadmap aligned with product evolution and customer needs.

Minimum Qualifications:


  • 12+ years of experience in high-performance systems programming in C/C++ on Linux.

  • Hands-on experience with MPI internals (Open MPI, MPICH, MVAPICH) and/or SHMEM implementations.

  • Experience implementing or optimizing collective communications for HPC and/or AI workloads, including NCCL/RCCL (CUDA/ROCm) or related CCL stacks.

  • Demonstrated ability to design low-latency/high-throughput communication paths and diagnose performance issues using profiling and tracing tools.

  • Working knowledge of transport and integration layers such as OFI/libfabric, UCX, and verbs-style networking concepts.

  • Strong understanding of RDMA and performance tuning.

  • Proven open-source contribution track record.

  • Demonstrated technical leadership in complex HPC or AI system software.

Preferred Qualifications:


  • Experience developing or maintaining libfabric providers.

  • Familiarity with Ultra Ethernet (UEC/UET) specifications.

  • Experience with RoCEv2, congestion control, or Ethernet-based RDMA deployments.

  • Experience with cluster-scale benchmarking, profiling, and optimization.

  • Background with Omni-Path/OPX or other Ethernet-based HPC fabrics.

Location: This is a remote position for employees residing within the United States.

We offer a competitive compensation package that includes equity, cash, and incentives, along with health and retirement benefits. Our dynamic, flexible work environment provides the opportunity to collaborate with some of the most influential names in the semiconductor industry.

At Cornelis Networks your base salary is only one component of your comprehensive total rewards package. Your base pay will be determined by factors such as your skills, qualifications, experience, and location relative to the hiring range for the position. Depending on your role, you may also be eligible for performance-based incentives, including an annual bonus or sales incentives.

In addition to your base pay, you’ll have access to a broad range of benefits, including medical, dental, and vision coverage, as well as disability and life insurance, a dependent care flexible spending account, accidental injury insurance, and pet insurance. We also offer generous paid holidays, 401(k) with company match, and Open Time Off (OTO) for regular full-time exempt employees. Other paid time off benefits include sick time, bonding leave, and pregnancy disability leave.

Cornelis Networks does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. Cornelis Networks is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Posted 2026-02-28

Recommended Jobs

Kitchen Team Member

Chick-fil-A - Sherman (TX)
Sherman, TX

Description Our kitchen staff work in a fast paced, high energy environment. The responsibility of our kitchen staff is to produce the highest quality food with precision and accuracy, continua…

View Details
Posted 2026-01-15

Email Developer

Biote
Irving, TX

Description Biote Medical is the world leader in hormone optimization and we are adding to our team! We partner with providers to take a complete approach to healthier aging through patient-sp…

View Details
Posted 2026-02-28

Welder (Steel and Aluminum) - 3rd Shift

Openwork
Houston, TX

Build Your Future in Welding Now Hiring: Welder (Steel and Aluminum) – 3rd Shift Full-Time | 9:00 PM – 7:00 AM | $24–$25 per hour Openwork is partnering with a respected steel and alum…

View Details
Posted 2026-01-27

Robotics Technical Advisor

NOV
Houston, TX

Job Description NOV is a global leader in technology, equipment, and services for the oil and gas industry, supporting full-field drilling, completion, and production operations. Since 1862, we'…

View Details
Posted 2025-11-21

Dog Daycare Attendant/Bark Ranger

Muttz El Paso
El Paso, TX

Job Description: Daycare Attendant / Bark-Ranger Muttz is seeking a dedicated and enthusiastic Daycare Attendant / Bark-Ranger to join our team. If you have a love for DOGS, a passion for delive…

View Details
Posted 2026-02-18

Medical Assistant (Float)

Family Medicine Austin
Austin, TX

Schedule: Monday - Friday. Hours: As Early As 7:00 AM. As Late As 5:30 PM - 60-minute lunch. Clinic Locations: ~1106 Clayton Lane, Austin TX 78723 ~1500 W 38th Street, Austin TX 78731 ~27…

View Details
Posted 2026-02-07

Salon Business Development Manager (Outside Sales)

AVEDA Institutes and Neill Corporation
Dallas, TX

: Neill Corporation is looking for a talented individual to join our team as a beauty and salon industry Business Development Outside Sales Manager for our Dallas, Texas region. Neill is an exclusive…

View Details
Posted 2026-02-24

Golf Sales Associate, Part-Time - Edwin Watts Golf

Worldwide Golf Shops
Tyler, TX

What We Offer: Our Golf Sales Associates receive base pay plus commission! We offer flexible scheduling, 401(k) matching, and a strong employee discount for all part-time employees! There is n…

View Details
Posted 2026-02-24

Pharmacy Technician PRN

ScionHealth
Round Rock, TX

At ScionHealth , we empower our caregivers to do what they do best. We value every voice by caring deeply for every patient and each other. We show courage by running toward the challenge and …

View Details
Posted 2025-12-03

08 - Specialist, IT Solutions 1

Celestica International LP
Richardson, TX

Req ID: 132307  Region: Americas  Country: USA  State/Province: Texas  City:  Richardson  General Overview Functional Area:   Information Technology (ITM) Career Stream: IT Solutions (…

View Details
Posted 2026-01-24