Infrastructure Software Engineer Jobs in USA with Visa Sponsorship
Infrastructure Software Engineers are strong H-1B candidates, the role qualifies as a specialty occupation requiring a bachelor's degree in computer science, computer engineering, or a related field. Employers routinely sponsor both H-1B and L-1 visas for this role, and cap-exempt positions exist at universities and nonprofits. For detailed occupation requirements, see the O*NET profile.
See All Infrastructure Software Engineer JobsOverview
Showing 5 of 964+ Infrastructure Software Engineer jobs


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?
See all 964+ Infrastructure Software Engineer jobs
Sign up for free to unlock all listings, filter by visa type, and get alerts for new Infrastructure Software Engineer roles.
Get Access To All Jobs
About Etched
Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.
Job Summary
Building cutting-edge model-specific ASICs requires crafting custom infrastructure and toolchains to support ultra-fast, reliable, and scalable development across the stack - from simulation to silicon. We build this infrastructure as software - and we engineer it with the same best practices we apply to our products. We use the same rigor, design discipline, and quality standards and testing as we do to our ASIC, software, and platform. You will lead the development and adoption of next-generation infrastructure tooling, enabling Etched ASIC, Software, and Platform engineers to iterate faster, build more reliably, and push the boundaries of AI performance. This includes building and scaling our hybrid high-performance compute (HPC) cluster, optimized for massively parallel CI, EDA workflows, Emulation, and hardware-aware job execution. You’ll also architect and implement a state-of-the-art observability stack with LLM integration and a strong emphasis on streaming health and performance telemetry, log aggregation, distributed tracing, insight generation, synthetic testing, and smart alerting - across CI pipelines, simulation clusters, and service endpoints. This role demands a strong software engineering mindset, quality instincts, and deep understanding of systems. It’s not just about writing scripts - it’s about writing code that builds and manages infrastructure with precision, repeatability, and intent.
Key Responsibilities
- Architect and Scale Distributed Compute Systems: Design and build the orchestration layers that drive our hybrid high-performance clusters—enabling simulation, synthesis, and continuous integration of AI ASICs at unprecedented scale.
- Build Infrastructure-as-Code Systems: Develop and maintain a fully programmable infrastructure control plane to ensure reproducibility, auditability, and rapid iteration across the entire stack.
- Optimize End-to-End Developer Experience: Create tools and abstractions that empower engineers to harness massive parallelism without worrying about the underlying complexity.
- Workload Elasticity, Reliability, and Efficiency: Prototype and execute workload orchestration and migration strategies between on-premise and cloud environments, balancing performance, storage availability and replication, uptime, and cost across heterogeneous hardware and compute backends.
- Implement real-time telemetry, tracing systems that surface insights from millions of metrics, enabling proactive debugging and system optimization.
- Push the Limits of Observability: Build a full observability stack that includes dashboards, alerting, automated responses, and a synthetic testing framework to proactively test infrastructure performance and reliability for various application and data flows, ensuring we remain proactive against issues impacting development and productivity workflows.
Representative projects
- Design and deploy a fully automated, scalable hybrid HPC cluster, combining bare-metal servers and switches with cloud instances, provisioned through MaaS and orchestrated via SLURM and Kubernetes, optimized for mixed EDA workloads and parallel CI pipelines.
- Develop a real-time observability system for ASIC toolchain jobs and distributed builds, integrating Prometheus, Grafana, and VictoriaMetrics with streaming telemetry, tracing, and alerting to detect performance regressions before they hit silicon.
- Architect and implement a programmable infrastructure-as-code control plane, using Terraform, Ansible, and Puppet, to version, audit, and redeploy every layer of Etched's development stack with deterministic reproducibility.
- Create a zero-downtime interactive development environment that provisions and connects Jupyter and VS Code sessions to GPUs and high-memory nodes via a secure zero-trust network, abstracting away cluster state and machine failures.
- Prototype and evaluate dynamic workload migration strategies between on-premise and cloud environments to optimize for latency, reliability, and cost across simulation and synthesis pipelines.
- Design a synthetic testing and fault injection framework to validate the behavior of infrastructure under high-load, degraded hardware, and intermittent network partitions - before they happen in production.
You may be a good fit if you
- Are a systems-minded software engineer who loves building foundational platforms, working close to the metal and cloud, solving high-leverage problems at scale.
- Are a deeply technical engineer who treats infrastructure as a software problem - prioritizing clean abstractions, version control, small change lists, easy roll backs, testing, and long-term maintainability over ad hoc configuration.
- Have strong programming skills in languages such as Python, Go, Rust, and C++, and are comfortable building production-grade tooling.
- Possess expert-level knowledge of Linux, virtualization, containerization, and CI/CD pipelines, with a deep understanding of how to debug, optimize, and scale complex systems.
- Are familiar with Infrastructure as Code tools like OpenTofu, Ansible, or Puppet, and enjoy designing declarative, reproducible infrastructure systems.
- Understand and use PromQL and other telemetry/query languages and have used LLM to extract insight from real-time metrics, and know how to architect and tune observability stacks.
- Have a track record of debugging and resolving difficult hardware-software integration problems across bare-metal systems, networks, and distributed workloads.
- Can lead and mentor technical teams, guiding design decisions and helping others develop sound engineering instincts.
- Have 8+ years of experience in infrastructure engineering, systems programming, or backend software development - ideally in environments where performance, scale, or hardware interaction mattered.
- Are driven by curiosity, take initiative, and have an innate sense of ownership — you thrive in uncharted territory, design for edge cases, and love making systems more powerful, reliable, and elegant.
Strong Candidates May Also Have Experience With
- Familiarity with Bazel build system
- Deep understanding of ASIC development flows, especially those involving Synopsys, Cadence, and Verilator, including how EDA tools interact with infrastructure for simulation, synthesis, and verification.
- Hands-on experience architecting systems with AWS, GCP, or Azure, including hybrid on-prem/cloud deployments, workload migration strategies, and cloud-native orchestration tooling.
- Experience monitoring, provisioning, and debugging bare-metal servers, network hardware, and high-performance storage systems in rack-scale environments.
- Comfortable in profiling and optimizing compute environments for single-threaded latency, memory-bound workloads, or I/O throughput, especially in the context of simulation or CI performance.
- Proficiency building or operating telemetry systems at scale using Prometheus, Grafana, Loki, VictoriaMetrics, and tools for distributed tracing, log aggregation, and real-time alerting across heterogeneous mediums (SMS, email, push alerts, etc.)
Benefits
- Medical, dental, and vision packages with generous premium coverage + $500 per month credit for waiving medical benefits
- Housing subsidy of $2k per month for those living within walking distance of the office
- Relocation support for those moving to San Jose (Santana Row)
- Various wellness benefits covering fitness, mental health, and more
- Daily lunch + dinner in our office
How We’re Different
Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs. We are a fully in-person team in San Jose (Santana Row), and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.
Compensation Range: $150K - $250K

About Etched
Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.
Job Summary
Building cutting-edge model-specific ASICs requires crafting custom infrastructure and toolchains to support ultra-fast, reliable, and scalable development across the stack - from simulation to silicon. We build this infrastructure as software - and we engineer it with the same best practices we apply to our products. We use the same rigor, design discipline, and quality standards and testing as we do to our ASIC, software, and platform. You will lead the development and adoption of next-generation infrastructure tooling, enabling Etched ASIC, Software, and Platform engineers to iterate faster, build more reliably, and push the boundaries of AI performance. This includes building and scaling our hybrid high-performance compute (HPC) cluster, optimized for massively parallel CI, EDA workflows, Emulation, and hardware-aware job execution. You’ll also architect and implement a state-of-the-art observability stack with LLM integration and a strong emphasis on streaming health and performance telemetry, log aggregation, distributed tracing, insight generation, synthetic testing, and smart alerting - across CI pipelines, simulation clusters, and service endpoints. This role demands a strong software engineering mindset, quality instincts, and deep understanding of systems. It’s not just about writing scripts - it’s about writing code that builds and manages infrastructure with precision, repeatability, and intent.
Key Responsibilities
- Architect and Scale Distributed Compute Systems: Design and build the orchestration layers that drive our hybrid high-performance clusters—enabling simulation, synthesis, and continuous integration of AI ASICs at unprecedented scale.
- Build Infrastructure-as-Code Systems: Develop and maintain a fully programmable infrastructure control plane to ensure reproducibility, auditability, and rapid iteration across the entire stack.
- Optimize End-to-End Developer Experience: Create tools and abstractions that empower engineers to harness massive parallelism without worrying about the underlying complexity.
- Workload Elasticity, Reliability, and Efficiency: Prototype and execute workload orchestration and migration strategies between on-premise and cloud environments, balancing performance, storage availability and replication, uptime, and cost across heterogeneous hardware and compute backends.
- Implement real-time telemetry, tracing systems that surface insights from millions of metrics, enabling proactive debugging and system optimization.
- Push the Limits of Observability: Build a full observability stack that includes dashboards, alerting, automated responses, and a synthetic testing framework to proactively test infrastructure performance and reliability for various application and data flows, ensuring we remain proactive against issues impacting development and productivity workflows.
Representative projects
- Design and deploy a fully automated, scalable hybrid HPC cluster, combining bare-metal servers and switches with cloud instances, provisioned through MaaS and orchestrated via SLURM and Kubernetes, optimized for mixed EDA workloads and parallel CI pipelines.
- Develop a real-time observability system for ASIC toolchain jobs and distributed builds, integrating Prometheus, Grafana, and VictoriaMetrics with streaming telemetry, tracing, and alerting to detect performance regressions before they hit silicon.
- Architect and implement a programmable infrastructure-as-code control plane, using Terraform, Ansible, and Puppet, to version, audit, and redeploy every layer of Etched's development stack with deterministic reproducibility.
- Create a zero-downtime interactive development environment that provisions and connects Jupyter and VS Code sessions to GPUs and high-memory nodes via a secure zero-trust network, abstracting away cluster state and machine failures.
- Prototype and evaluate dynamic workload migration strategies between on-premise and cloud environments to optimize for latency, reliability, and cost across simulation and synthesis pipelines.
- Design a synthetic testing and fault injection framework to validate the behavior of infrastructure under high-load, degraded hardware, and intermittent network partitions - before they happen in production.
You may be a good fit if you
- Are a systems-minded software engineer who loves building foundational platforms, working close to the metal and cloud, solving high-leverage problems at scale.
- Are a deeply technical engineer who treats infrastructure as a software problem - prioritizing clean abstractions, version control, small change lists, easy roll backs, testing, and long-term maintainability over ad hoc configuration.
- Have strong programming skills in languages such as Python, Go, Rust, and C++, and are comfortable building production-grade tooling.
- Possess expert-level knowledge of Linux, virtualization, containerization, and CI/CD pipelines, with a deep understanding of how to debug, optimize, and scale complex systems.
- Are familiar with Infrastructure as Code tools like OpenTofu, Ansible, or Puppet, and enjoy designing declarative, reproducible infrastructure systems.
- Understand and use PromQL and other telemetry/query languages and have used LLM to extract insight from real-time metrics, and know how to architect and tune observability stacks.
- Have a track record of debugging and resolving difficult hardware-software integration problems across bare-metal systems, networks, and distributed workloads.
- Can lead and mentor technical teams, guiding design decisions and helping others develop sound engineering instincts.
- Have 8+ years of experience in infrastructure engineering, systems programming, or backend software development - ideally in environments where performance, scale, or hardware interaction mattered.
- Are driven by curiosity, take initiative, and have an innate sense of ownership — you thrive in uncharted territory, design for edge cases, and love making systems more powerful, reliable, and elegant.
Strong Candidates May Also Have Experience With
- Familiarity with Bazel build system
- Deep understanding of ASIC development flows, especially those involving Synopsys, Cadence, and Verilator, including how EDA tools interact with infrastructure for simulation, synthesis, and verification.
- Hands-on experience architecting systems with AWS, GCP, or Azure, including hybrid on-prem/cloud deployments, workload migration strategies, and cloud-native orchestration tooling.
- Experience monitoring, provisioning, and debugging bare-metal servers, network hardware, and high-performance storage systems in rack-scale environments.
- Comfortable in profiling and optimizing compute environments for single-threaded latency, memory-bound workloads, or I/O throughput, especially in the context of simulation or CI performance.
- Proficiency building or operating telemetry systems at scale using Prometheus, Grafana, Loki, VictoriaMetrics, and tools for distributed tracing, log aggregation, and real-time alerting across heterogeneous mediums (SMS, email, push alerts, etc.)
Benefits
- Medical, dental, and vision packages with generous premium coverage + $500 per month credit for waiving medical benefits
- Housing subsidy of $2k per month for those living within walking distance of the office
- Relocation support for those moving to San Jose (Santana Row)
- Various wellness benefits covering fitness, mental health, and more
- Daily lunch + dinner in our office
How We’re Different
Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs. We are a fully in-person team in San Jose (Santana Row), and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.
Compensation Range: $150K - $250K
How to Get Visa Sponsorship as an Infrastructure Software Engineer
Target employers with a proven H-1B filing history
Large tech companies, cloud providers, and financial institutions file hundreds of H-1B petitions annually for infrastructure roles. Filtering by employers with consistent filing history reduces the risk of landing at a company that's never navigated the process before.
Emphasize systems-level depth in your resume
USCIS scrutinizes whether a role genuinely requires a specialized degree. Highlighting distributed systems design, kernel-level work, or cloud orchestration makes the specialty occupation argument much harder to challenge during adjudication or an RFE.
Understand the difference between cap-subject and cap-exempt employers
Universities, nonprofit research institutions, and certain government contractors are exempt from the H-1B lottery. Infrastructure roles at these organizations can be filed at any time of year with no lottery risk, which is a meaningful advantage if you're on a tight OPT timeline.
Prepare for technical and visa questions simultaneously
Many infrastructure interviews include a sponsorship conversation alongside systems design rounds. Having a clear, confident explanation of your current status, work authorization timeline, and visa pathway saves time and signals you've done the groundwork employers expect from sponsored candidates.
Consider the L-1B if you're transferring within a multinational
If you've worked for a company abroad for at least one year, the L-1B intracompany transfer visa is a lottery-free alternative to the H-1B. Infrastructure engineers with specialized knowledge of proprietary systems, architecture, or internal platforms often qualify under the specialized knowledge standard.
Line up your OPT STEM extension early if you're on F-1
Infrastructure Software Engineer is a STEM-eligible role, making you eligible for a 24-month OPT extension beyond the initial 12 months. Filing early gives you up to 36 months of work authorization, covering multiple H-1B lottery cycles if needed.
Infrastructure Software Engineer jobs are hiring across the US. Find yours.
Find Infrastructure Software Engineer JobsSee all 964+ Infrastructure Software Engineer jobs
Sign up for free to unlock all listings, filter by visa type, and get alerts for new Infrastructure Software Engineer roles.
Get Access To All JobsFrequently Asked Questions
Does an Infrastructure Software Engineer role qualify for H-1B sponsorship?
Yes. Infrastructure Software Engineer is a well-established specialty occupation under USCIS criteria. The role requires theoretical and practical application of highly specialized knowledge, typically in distributed systems, networking, or cloud infrastructure, and routinely requires a bachelor's degree or higher in computer science, computer engineering, or a related field. RFE rates for software engineering roles have risen in recent years, so employers should document the degree requirement clearly in the Labor Condition Application and offer letter.
What visa options exist for infrastructure engineers beyond the H-1B?
Several alternatives are worth knowing. The L-1B applies if you've worked for a multinational employer abroad for at least one year and have specialized knowledge of the company's systems. The O-1A is available if you can demonstrate extraordinary ability through publications, patents, or significant industry recognition. Australian citizens can use the E-3, which has no lottery. TN status covers Canadians and Mexicans in qualifying engineering roles. Each has different eligibility thresholds and timelines.
Does my degree field matter for H-1B approval as an infrastructure engineer?
It matters more than most candidates expect. USCIS evaluates whether your degree field is directly related to the job duties. Computer science, computer engineering, electrical engineering, and information systems are the strongest fits. A degree in an unrelated field, even with years of relevant experience, can trigger an RFE or denial unless supported by a detailed advisory opinion or a credential evaluation showing the coursework aligns with the role's technical requirements.
How can I find infrastructure engineering jobs that offer visa sponsorship?
Migrate Mate is built specifically for this, the job board filters for roles that sponsor work visas, so you're not spending time applying to positions that will reject you at the sponsorship question. Infrastructure engineering roles from companies with active H-1B filing histories are listed, which is a faster signal of sponsorship willingness than anything you'd piece together from a general job search.
Can I switch employers mid-H-1B if I'm working as an infrastructure software engineer?
Yes, H-1B portability allows you to change employers while your new employer's H-1B transfer petition is pending, as long as your current H-1B was approved and you've maintained valid status. For infrastructure roles, the new employer must file a fresh I-129 with an LCA matching the new position's duties and wage level. You can start work as soon as the petition is filed, you don't need to wait for approval.
What is the prevailing wage requirement for sponsored Infrastructure Software Engineer jobs?
U.S. employers sponsoring a visa must pay at least the prevailing wage, which is what workers in the same role, area, and experience level typically earn. The Department of Labor sets this rate to make sure companies aren't hiring foreign workers simply because they'd accept lower pay than a U.S. worker. It varies by job title, location, and experience. You can look up current prevailing wage rates for any occupation and location using the OFLC Wage Search page.
See which Infrastructure Software Engineer employers are hiring and sponsoring visas right now.
Search Infrastructure Software Engineer Jobs