Data Engineer Jobs at Snorkel AI with Visa Sponsorship
Snorkel AI hires Data Engineers to build and maintain the data infrastructure behind its AI training platform, and the company has a track record of sponsoring work visas for technical roles. If you're on F-1 OPT, H-1B, or TN status, this is a team worth targeting.
See All Data Engineer Jobs at Snorkel AI
Overview
Showing 5 of 31+ Data Engineer Jobs at Snorkel AI


See all 31+ Data Engineer Jobs at Snorkel AI
Sign up for free to unlock all listings, filter by visa type, and get alerts for new Data Engineer Jobs at Snorkel AI.
Get Access To All Jobs
About Snorkel
At Snorkel, we believe meaningful AI doesn’t start with the model; it starts with the data.
We’re on a mission to help enterprises transform expert knowledge into specialized AI at scale. The AI landscape has changed dramatically from 2015, when Snorkel started as a research project in the Stanford AI Lab, to the generative AI breakthroughs of today. But one thing has remained constant: the data you use to build AI is the key to achieving differentiation, high performance, and production-ready systems. We work with some of the world’s largest organizations to empower scientists, engineers, financial experts, product creators, journalists, and more to build custom AI with their data faster than ever before. Excited to help us redefine how AI is built? Apply to be the newest Snorkeler!
The Core Services team within Snorkel AI's Infrastructure organization will own the data platform that powers everything at Snorkel: the pipelines, access layers, event systems, governance, and compute infrastructure that every product team and customer deployment depends on. We are a small team with a large surface area, and we are in the middle of a foundational architecture shift: moving from a single-database data path to a multi-source, event-driven platform with dedicated stores for different workloads (transactional data in Postgres/RDS, analytical data in Snowflake, bulk storage in S3, and a metrics platform). The decisions being made now will define how data flows at Snorkel for years. You will be making them.
You’ll also shape our AI-native development workflow, contribute to modernizing our CI/CD workflows (Buildkite, GitHub Actions), and integrate AI SRE tooling. Your work will directly accelerate developer velocity, reliability, and product quality across the company.
- Build and maintain the shared data access library and SDKs that Platform, Packaging, and Dataset API teams use to read from and write to multiple data sources (Snowflake, S3, RDS). Design interfaces that abstract source-level complexity while providing built-in auth, RBAC enforcement, pagination, and query governance.
- Design and implement event-driven data flows using event brokers, CDC connectors, a schema registry, event routing, and dead letter queues. Make sure events flow reliably and failures are visible and recoverable.
- Build the systems that track how data moves through the platform (lineage), enforce who can access what (governance and RBAC), and log what happened (auditing). This includes PII handling, retention policy enforcement, and audit infrastructure for enterprise and federal compliance.
- Instrument the data platform with OpenTelemetry, define and monitor SLOs for query latency and pipeline success rates, and build alerting that catches issues before they become incidents. You will be on-call for the systems you build.
- Contribute to infrastructure cost visibility and optimization: query cost estimation, workload right-sizing, and routing data to the most cost-effective storage tier for its access pattern.
What You'll Bring
- 4+ years building platform infrastructure, data infrastructure, data platforms, or backend systems with significant data components. You have built and operated pipelines, data access layers, or ETL/ELT systems in production.
- Strong proficiency in Python. Our stack is Python-heavy across Prefect, FastAPI, dbt, and the SDK layer.
- Hands-on experience with SQL and at least two of: Snowflake, Redshift, Postgres. You understand the performance characteristics of each and can write queries that don't bring down production.
- Experience with AWS — S3, RDS, EKS, EventBridge, IAM. Comfortable working in a Terraform-managed environment.
- Experience with Kubernetes. Our workloads run on EKS and you will deploy, debug, and scale services on K8s.
- Familiarity with data orchestration tools (Prefect, Airflow, or Dagster) and transformation frameworks (dbt).
- Understanding of data governance concepts — RBAC, PII handling, audit logging, data lineage.
- Fluency with AI-assisted development tools (Claude Code, Cursor, or similar). This is a hard requirement — the team uses these tools daily and we expect engineers to leverage them for code generation, debugging, and investigation.
Nice to Have
- Experience building shared libraries or SDKs consumed by multiple teams — versioning, backwards compatibility, migration support.
- Experience with event-driven architectures — CDC, event buses, schema registries, at-least-once delivery semantics.
- Experience with OpenTelemetry, ClickHouse, or similar observability infrastructure.
- Prior work in regulated environments (SOC 2, FedRAMP, HIPAA) where compliance requirements shaped system design.
- Experience with Ray for distributed compute workloads.
Salary Range
- $172,000 to $215,000 USD
Be Your Best at Snorkel
Joining Snorkel AI means becoming part of a company that has market-proven solutions, robust funding, and is scaling rapidly, offering a unique combination of stability and the excitement of high growth. As a member of our team, you’ll have meaningful opportunities to shape priorities and initiatives, influence key strategic decisions, and directly impact our ongoing success. Whether you’re looking to deepen your technical expertise, explore leadership opportunities, or learn new skills across multiple functions, you’re fully supported in building your career in an environment designed for growth, learning, and shared success.
Snorkel AI is proud to be an Equal Employment Opportunity employer and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. Snorkel AI embraces diversity and provides equal employment opportunities to all employees and applicants for employment. Snorkel AI prohibits discrimination and harassment of any type on the basis of race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local law. All employment is decided on the basis of qualifications, performance, merit, and business need.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Tips for Finding Data Engineer Jobs at Snorkel AI
Align your portfolio with programmatic data
Snorkel AI's platform is built around programmatic labeling and weak supervision. Projects demonstrating experience with large-scale data pipelines, annotation workflows, or ML training data will resonate far more than general ETL or analytics work.
Time your application around H-1B cap deadlines
USCIS opens H-1B registration in March each year for an October 1 start. If you need cap-subject H-1B sponsorship, target Snorkel AI's interview process in Q4 or Q1 so an offer can be finalized before the registration window opens.
Verify TN eligibility before your interview
Canadian and Mexican nationals can pursue TN status for Data Engineer roles if the position maps to a qualifying USMCA occupation category. Confirm the job description aligns with a recognized TN category before your interview so you can discuss it confidently with the recruiter.
Use Migrate Mate to surface open roles
Snorkel AI posts Data Engineer openings across multiple channels and not all listings flag sponsorship availability. Use Migrate Mate to filter specifically for Snorkel AI roles that have confirmed visa sponsorship, so you're not applying blind.
Data Engineer jobs at Snorkel AI are open across the US. Find yours.
Find Data Engineer Jobs at Snorkel AI
Frequently Asked Questions
Does Snorkel AI sponsor H-1B visas for Data Engineers?
Yes, Snorkel AI sponsors H-1B visas for Data Engineer roles. As a technology company building AI infrastructure, Snorkel AI regularly hires engineers who require work visa sponsorship. If you're on a cap-subject H-1B, timing your offer before USCIS's March registration window is important, as the petition must be filed ahead of an October 1 start date.
How do I apply for Data Engineer jobs at Snorkel AI?
Applications go through Snorkel AI's careers page, where Data Engineer roles are posted when open. You can also browse and filter for current openings with confirmed visa sponsorship on Migrate Mate. When applying, tailor your resume to highlight experience with data pipelines, ML training data, or large-scale labeling systems, which are directly relevant to Snorkel AI's platform.
Which visa types are commonly used for Data Engineer roles at Snorkel AI?
Snorkel AI sponsors H-1B visas for most foreign national hires and supports F-1 OPT for recent graduates who need a bridge while awaiting H-1B selection. Canadian and Mexican nationals may be eligible for TN status if the Data Engineer role maps to a qualifying USMCA occupation. The right visa type depends on your nationality, degree, and current immigration status.
What qualifications and experience does Snorkel AI expect for Data Engineer roles?
Snorkel AI's Data Engineer positions typically require a bachelor's or master's degree in Computer Science, Data Engineering, or a related technical field. Hands-on experience with distributed data systems, pipeline orchestration tools, and working in Python-heavy environments is expected. Experience adjacent to machine learning workflows or data labeling infrastructure is a meaningful differentiator given the nature of Snorkel AI's product.
What is the H-1B filing timeline if Snorkel AI extends me an offer?
Once Snorkel AI extends an offer requiring H-1B sponsorship, the company files a Labor Condition Application with the DOL before submitting the I-129 petition to USCIS. If you're subject to the annual cap, USCIS registration opens in March and the lottery runs shortly after. Premium processing is available and shortens USCIS adjudication to 15 business days, which limits the wait after selection.
See which Data Engineer roles at Snorkel AI are open and sponsoring visas right now.
Search Data Engineer Jobs at Snorkel AI