Serving Engineer, Cloud Engineering (Senior / Staff Engineer)
This role involves the following activities:
- Build a scalable LLM inference platform using inference techniques (e.g. disaggregated serving and KV-cache management, advanced parallelism, speculative algorithms, model optimization, specialized kernels).
- Contribute to the development of LLM serving packages (e.g. vLLM, SGLang, TGI, Triton Inference Server, Dynamo, llm-d).
- Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams.
- Work at the forefront of GenAI by understanding advanced algorithms (e.g. attention mechanisms, MoEs) and numerics to identify new optimization opportunities.
- Drive efficient serving through smart autoscaling, load balancing and routing.
- Engage with open-source serving communities to evolve the framework.
Candidates for this position will demonstrate the following:
- Hands-on experience in one or more of the following LLM serving/orchestration packages (Triton Inference Server, vLLM, SGLang, Ollama, llm-d, KServe, LMCache, MoonCake).
- Deep understanding of foundational LLMs, VLMs, SLMs, and transformer-based architectures.
- Strong experience in developing language models using PyTorch.
- Strong computer science fundamentals: algorithms, data structures, parallel and distributed programming.
- Understanding of computer architecture, ML accelerators, in-memory processing and distributed systems.
- Strong Python development skills for large-scale projects, with a passion for software engineering.
- Experience in analyzing, profiling, and optimizing deep learning workloads.
- Proactive learning about the latest inference optimization techniques.
- Excellent communication and problem-solving skills, with the ability to thrive in a fast-paced and collaborative environment.
- MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering.
Bonus Skills:
- Open-source contribution to any GenAI package.
- Experience architecting and developing large-scale distributed systems.
- High-level kernel design experience (PyTorch, CUDA, Triton).
- Knowledge of torch.compile or TorchDynamo.
- PhD in Computer Science, Computer Engineering or Machine Learning.
Minimum Qualifications:
Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR Master's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may email [email protected] or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able to participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries.)
To all Staffing and Recruiting Agencies:
Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.
EEO Employer: Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.
Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or propr