Backend Software Developer Jobs at NVIDIA with Visa Sponsorship
Backend Software Developer roles at NVIDIA sit at the intersection of high-performance computing, AI infrastructure, and distributed systems work. NVIDIA has a strong track record of sponsoring international engineers across H-1B, E-3, and Green Card pathways, making it a realistic target for skilled backend developers seeking long-term U.S. work authorization.
See All Backend Software Developer at NVIDIA JobsOverview
Showing 5 of 99+ Backend Software Developer Jobs at NVIDIA jobs


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?
See all 99+ Backend Software Developer Jobs at NVIDIA
Sign up for free to unlock all listings, filter by visa type, and get alerts for new Backend Software Developer Jobs at NVIDIA.
Get Access To All Jobs
INTRODUCTION
We are now looking for a Deep Learning Software Engineer, FlashInfer. NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate for AI inference. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtimes components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads.
ROLE AND RESPONSIBILITIES
What you'll be doing:
- Innovating and developing new AI systems technologies for efficient inference
- Designing, implementing, and optimizing kernels for high impact AI workloads
- Designing and implementing extensible abstractions for LLM serving engines
- Building efficient just-in-time domain specific compilers and runtimes
- Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
- Contributing to open source communities like FlashInfer, vLLM, and SGLang
BASIC QUALIFICATIONS
What we need to see:
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); PhD are preferred
- Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX, etc) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC.
- Strong Python and C/C++ programming skills
PREFERRED QUALIFICATIONS
Ways to stand out from the crowd:
- Background in domain specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
- Expertise in inference engines like vLLM and SGLang
- Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
- Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)
- Open source project ownership or contributions
COMPENSATION
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD for Level 2, and 152,000 USD - 241,500 USD for Level 3. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until April 28, 2026. This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

INTRODUCTION
We are now looking for a Deep Learning Software Engineer, FlashInfer. NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate for AI inference. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtimes components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads.
ROLE AND RESPONSIBILITIES
What you'll be doing:
- Innovating and developing new AI systems technologies for efficient inference
- Designing, implementing, and optimizing kernels for high impact AI workloads
- Designing and implementing extensible abstractions for LLM serving engines
- Building efficient just-in-time domain specific compilers and runtimes
- Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
- Contributing to open source communities like FlashInfer, vLLM, and SGLang
BASIC QUALIFICATIONS
What we need to see:
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); PhD are preferred
- Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX, etc) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC.
- Strong Python and C/C++ programming skills
PREFERRED QUALIFICATIONS
Ways to stand out from the crowd:
- Background in domain specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
- Expertise in inference engines like vLLM and SGLang
- Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
- Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)
- Open source project ownership or contributions
COMPENSATION
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD for Level 2, and 152,000 USD - 241,500 USD for Level 3. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until April 28, 2026. This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
See all 99+ Backend Software Developer at NVIDIA jobs
Sign up for free to unlock all listings, filter by visa type, and get alerts for new Backend Software Developer at NVIDIA roles.
Get Access To All JobsTips for Finding Backend Software Developer Jobs at NVIDIA Jobs
Align your portfolio to NVIDIA's infrastructure stack
NVIDIA's backend roles emphasize GPU computing frameworks, CUDA-adjacent services, and high-throughput distributed systems. Before applying, reframe your projects around latency-sensitive pipelines or large-scale data processing to match what their engineering teams are actually building.
Target roles listed under specific NVIDIA business units
NVIDIA posts backend roles across distinct divisions, including AI Enterprise, GeForce, and data center platforms. Roles tied to product lines with active deployment cycles tend to move faster through internal headcount approval, which directly affects how quickly sponsorship paperwork gets initiated.
Use Migrate Mate to filter NVIDIA backend openings by visa type
NVIDIA sponsors H-1B, E-3, and EB-2/EB-3 pathways, but not every open role is flagged clearly on general job boards. Migrate Mate lets you filter NVIDIA's backend postings specifically by the visa type you need, so you're only spending time on roles where sponsorship is confirmed.
Request clarity on H-1B cap timing during final interviews
If you're not currently on H-1B status, NVIDIA would need to file during the April cap season with an October 1 start date. Raise this timeline in late-stage conversations so the recruiting team can loop in their immigration counsel before an offer is extended.
Document your degree's equivalency if it's a three-year program
USCIS requires a U.S. bachelor's equivalent for H-1B specialty occupation status. If your degree is a three-year program, get a credential evaluation from NACES-member agency before your offer stage. NVIDIA's legal team will need this to complete the H-1B petition accurately.
Understand how PERM timing affects your Green Card trajectory
NVIDIA sponsors EB-2 and EB-3 Green Cards, but PERM labor certification with DOL can take 12 to 18 months before the I-140 petition is even filed. If you're from an oversubscribed country, ask your recruiter early about where they typically initiate the process in your tenure.
Backend Software Developer at NVIDIA jobs are hiring across the US. Find yours.
Find Backend Software Developer at NVIDIA JobsFrequently Asked Questions
Does NVIDIA sponsor H-1B visas for Backend Software Developers?
Yes, NVIDIA sponsors H-1B visas for Backend Software Developer roles. The company works with experienced immigration counsel to file petitions during the annual cap season, with an October 1 start date for selected registrations. If you're already on an H-1B with another employer, NVIDIA can file an H-1B transfer, which lets you start work once the petition is received by USCIS.
How do I apply for Backend Software Developer jobs at NVIDIA?
You can apply directly through NVIDIA's careers portal or through Migrate Mate, which aggregates NVIDIA's backend openings filtered by visa sponsorship type. NVIDIA's backend hiring process typically includes an initial recruiter screen, multiple technical rounds covering systems design and distributed computing, and a final team-fit interview. Roles move at different speeds depending on the business unit, so applying to multiple relevant postings improves your chances.
Which visa types does NVIDIA commonly use for Backend Software Developer roles?
NVIDIA sponsors H-1B visas for most international backend hires, along with the E-3 visa for Australian citizens, which has no lottery and allows two-year renewable status. For longer-term pathways, NVIDIA also sponsors EB-2 and EB-3 Green Card petitions. The right pathway depends on your nationality, current immigration status, and how far along you are in your career.
What qualifications does NVIDIA expect for Backend Software Developer roles?
NVIDIA typically looks for a bachelor's degree or higher in computer science, electrical engineering, or a closely related field, which also satisfies the specialty occupation requirement for H-1B purposes. Strong candidates have hands-on experience with distributed systems, C++ or Python at scale, and ideally some exposure to GPU computing or high-performance infrastructure. Roles tied to AI platform teams may also require familiarity with model serving or inference pipelines.
How long does the visa sponsorship process take if I receive an offer from NVIDIA?
Timeline depends heavily on your current status. H-1B cap-subject petitions require filing in April for an October start, which can mean a wait of several months after your offer. H-1B transfers and E-3 visas move faster, with E-3 consular processing in Australia often completing within two to four weeks. PERM-based Green Card sponsorship, which NVIDIA offers for EB-2 and EB-3 cases, typically takes 12 to 18 months just for the DOL certification phase.
See which Backend Software Developer at NVIDIA employers are hiring and sponsoring visas right now.
Search Backend Software Developer at NVIDIA Jobs