Data Contributor Jobs at Liquid AI with Visa Sponsorship
Data Contributor roles at Liquid AI sit at the intersection of machine learning infrastructure and high-quality training data, typically requiring strong annotation, data pipeline, or labeling expertise. Liquid AI has a track record of sponsoring work visas for this function, making it a genuine option if you're on F-1 OPT, CPT, or need H-1B or TN sponsorship.
See All Data Contributor at Liquid AI JobsOverview
Showing 5 of 25+ Data Contributor Jobs at Liquid AI jobs


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?


Have you applied for this role?
See all 25+ Data Contributor Jobs at Liquid AI
Sign up for free to unlock all listings, filter by visa type, and get alerts for new Data Contributor Jobs at Liquid AI.
Get Access To All Jobs
About Liquid AI
Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to on-device hardware, ensuring low latency, minimal memory usage, privacy, and reliability. We partner with enterprises across consumer electronics, automotive, life sciences, and financial services. We are scaling rapidly and need exceptional people to help us get there.
The Opportunity
Our Data team powers Liquid Foundation Models across pre-training, vision, audio, and emerging modalities. Public data sources are plateauing. Model performance increasingly depends on purpose-built datasets. We need ML-minded engineers who can collect, filter, and synthesize high-quality data at scale. We treat data as a research problem, not an infrastructure problem. Our engineers run experiments, design ablations, and measure how data decisions move model quality. We will match you to the team where you can grow the fastest and have the most impact: pre-training, post-training RL, vision-language, audio, or multimodal.
While San Francisco and Boston are preferred, we are open to other locations.
What We're Looking For
We need someone who:
- Thinks like a researcher, ships like an engineer: We need people who form hypotheses, run experiments, and measure results. Our engineers understand deep-theoretical research, and our researchers ship production systems.
- Learns fast and adapts: We work across modalities that evolve weekly. We need people who pick up new domains quickly and thrive with ambiguity.
- Obsesses over data quality: We believe data quality is non-negotiable. Filtering, deduplication, augmentation, and evaluation are first-class concerns for our team, not afterthoughts.
- Solves problems independently: Our data engineers sit within training groups (pre-training and multimodal). We collaborate closely, but we expect ownership and self-direction.
The Work
- Build and maintain data processing, filtering, and selection pipelines at scale
- Create pipelines for pretraining, midtraining, SFT, and preference optimization datasets
- Design synthetic data generation systems using LLMs, structured prompting, and domain-specific generators
- Design and run evaluations and ablations to measure dataset's impact on model performance
- Monitor public datasets across text, vision, and audio domains
- Collaborate with pre-training, vision, and audio teams on modality-specific data needs
Desired Experience
Must-have:
- Strong Python skills with the ability to quickly comprehend problems and translate them into clean, working code
- Solid ML fundamentals: experience training, evaluating, and iterating on models (PyTorch preferred)
- Track record of learning new technical domains quickly
- 3+ years relevant experience with an M.S., or 1+ year with a Ph.D. (5+ years with a B.S.)
Nice-to-have:
- Experience with synthetic data generation, data curation, or ML evaluation (designing evals, benchmarking, measuring data and model quality)
- Experience with LLMs, VLMs, computer vision, or audio data pipelines
- Open-source contributions or publications at NeurIPS, ICML, ICLR, or CVPR
What Success Looks Like (Year One)
- You own a critical data pipeline end-to-end for one of our modalities
- You have built or improved data systems that measurably moved model performance
- You have identified and integrated at least one external dataset that moved the needle
What We Offer
- Impact at scale: Your pipelines directly determine model quality across all of Liquid's foundation models.
- Compensation: Competitive base salary with equity in a unicorn-stage company
- Health: We pay 100% of medical, dental, and vision premiums for employees and dependents
- Financial: 401(k) matching up to 4% of base pay
- Time Off: Unlimited PTO plus company-wide Refill Days throughout the year

About Liquid AI
Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to on-device hardware, ensuring low latency, minimal memory usage, privacy, and reliability. We partner with enterprises across consumer electronics, automotive, life sciences, and financial services. We are scaling rapidly and need exceptional people to help us get there.
The Opportunity
Our Data team powers Liquid Foundation Models across pre-training, vision, audio, and emerging modalities. Public data sources are plateauing. Model performance increasingly depends on purpose-built datasets. We need ML-minded engineers who can collect, filter, and synthesize high-quality data at scale. We treat data as a research problem, not an infrastructure problem. Our engineers run experiments, design ablations, and measure how data decisions move model quality. We will match you to the team where you can grow the fastest and have the most impact: pre-training, post-training RL, vision-language, audio, or multimodal.
While San Francisco and Boston are preferred, we are open to other locations.
What We're Looking For
We need someone who:
- Thinks like a researcher, ships like an engineer: We need people who form hypotheses, run experiments, and measure results. Our engineers understand deep-theoretical research, and our researchers ship production systems.
- Learns fast and adapts: We work across modalities that evolve weekly. We need people who pick up new domains quickly and thrive with ambiguity.
- Obsesses over data quality: We believe data quality is non-negotiable. Filtering, deduplication, augmentation, and evaluation are first-class concerns for our team, not afterthoughts.
- Solves problems independently: Our data engineers sit within training groups (pre-training and multimodal). We collaborate closely, but we expect ownership and self-direction.
The Work
- Build and maintain data processing, filtering, and selection pipelines at scale
- Create pipelines for pretraining, midtraining, SFT, and preference optimization datasets
- Design synthetic data generation systems using LLMs, structured prompting, and domain-specific generators
- Design and run evaluations and ablations to measure dataset's impact on model performance
- Monitor public datasets across text, vision, and audio domains
- Collaborate with pre-training, vision, and audio teams on modality-specific data needs
Desired Experience
Must-have:
- Strong Python skills with the ability to quickly comprehend problems and translate them into clean, working code
- Solid ML fundamentals: experience training, evaluating, and iterating on models (PyTorch preferred)
- Track record of learning new technical domains quickly
- 3+ years relevant experience with an M.S., or 1+ year with a Ph.D. (5+ years with a B.S.)
Nice-to-have:
- Experience with synthetic data generation, data curation, or ML evaluation (designing evals, benchmarking, measuring data and model quality)
- Experience with LLMs, VLMs, computer vision, or audio data pipelines
- Open-source contributions or publications at NeurIPS, ICML, ICLR, or CVPR
What Success Looks Like (Year One)
- You own a critical data pipeline end-to-end for one of our modalities
- You have built or improved data systems that measurably moved model performance
- You have identified and integrated at least one external dataset that moved the needle
What We Offer
- Impact at scale: Your pipelines directly determine model quality across all of Liquid's foundation models.
- Compensation: Competitive base salary with equity in a unicorn-stage company
- Health: We pay 100% of medical, dental, and vision premiums for employees and dependents
- Financial: 401(k) matching up to 4% of base pay
- Time Off: Unlimited PTO plus company-wide Refill Days throughout the year
See all 25+ Data Contributor at Liquid AI jobs
Sign up for free to unlock all listings, filter by visa type, and get alerts for new Data Contributor at Liquid AI roles.
Get Access To All JobsTips for Finding Data Contributor Jobs at Liquid AI Jobs
Tailor your portfolio to AI training data
Liquid AI builds novel liquid neural network architectures, so Data Contributor applicants who can demonstrate experience with structured annotation, dataset curation, or model evaluation pipelines stand out well beyond a generic data entry background.
Align your degree to the specialty occupation standard
For H-1B eligibility, USCIS requires that your role qualify as a specialty occupation tied to a specific degree field. A background in computer science, linguistics, or statistics strengthens the nexus between your credentials and a Data Contributor job description.
Confirm E-Verify enrollment before accepting an offer
STEM OPT extensions require your employer to be enrolled in E-Verify. Before signing an offer for a Data Contributor role, verify Liquid AI's E-Verify status directly through the Department of Homeland Security's employer search tool.
Use Migrate Mate to surface active Data Contributor openings
Sponsorship-eligible roles at AI companies fill quickly and aren't always flagged clearly on general job boards. Use Migrate Mate to filter specifically for Data Contributor positions at Liquid AI that match your visa type and current work authorization status.
Request an explicit sponsorship commitment in writing
Before USCIS filing begins, ask your Liquid AI recruiter to confirm the company will cover the H-1B petition and any premium processing costs. Get this in writing so there's no ambiguity when the employer-side filing process starts.
Data Contributor at Liquid AI jobs are hiring across the US. Find yours.
Find Data Contributor at Liquid AI JobsFrequently Asked Questions
Does Liquid AI sponsor H-1B visas for Data Contributors?
Yes, Liquid AI sponsors H-1B visas for Data Contributor roles. The company has filed H-1B petitions for this function, which means they're familiar with the employer obligations involved, including the Labor Condition Application filed with the DOL and the I-129 petition filed with USCIS. You should still confirm current sponsorship intent directly with the recruiter during the interview process, as sponsorship decisions can vary by role level and business need.
How do I apply for Data Contributor jobs at Liquid AI?
You can browse and apply for Data Contributor openings at Liquid AI through Migrate Mate, which filters specifically for visa-sponsoring employers and surfaces roles matched to your work authorization. When applying, tailor your resume to highlight data labeling, annotation tooling, or dataset quality experience relevant to AI model training. Follow up with the recruiting team to confirm sponsorship availability early in the process, before investing time in multiple interview rounds.
Which visa types does Liquid AI use for Data Contributor roles?
Liquid AI sponsors H-1B and TN visas and also hires Data Contributors on F-1 OPT and F-1 CPT. The TN visa is available to Canadian and Mexican nationals in eligible professional categories. For F-1 students, CPT is typically used for internships or co-ops, while OPT covers post-graduation employment. If you're on STEM OPT and Liquid AI is E-Verify enrolled, you may also be eligible for a 24-month extension before an H-1B filing becomes necessary.
What qualifications does Liquid AI expect for Data Contributor roles?
Liquid AI's Data Contributor positions generally call for experience with data annotation, quality assurance workflows, or training data pipelines for machine learning systems. A bachelor's degree in a relevant field such as computer science, linguistics, or cognitive science supports H-1B specialty occupation eligibility. Familiarity with annotation platforms, inter-annotator agreement metrics, or prompt and response evaluation for large language models is a practical differentiator for candidates at AI-focused companies like Liquid AI.
How do I time my H-1B filing if I get a Data Contributor offer from Liquid AI?
H-1B cap-subject petitions can only be filed once per year, with USCIS accepting registrations in March for an October 1 start date. If you receive an offer outside that window, you'll likely bridge the gap on F-1 OPT, TN status, or another valid authorization. Confirm your current status end date with your international student office or immigration attorney, then work backward from the USCIS registration period to ensure Liquid AI initiates the H-1B process on time.
See which Data Contributor at Liquid AI employers are hiring and sponsoring visas right now.
Search Data Contributor at Liquid AI Jobs