Senior Mlops Engineer Jobs in Georgia
Senior Mlops Engineer jobs in Georgia are open across Atlanta and Alpharetta and other Georgia metros, with employers like BlueAlly, Honeywell, and McKinsey & Company hiring at every experience level. Find a role that fits below and apply directly.
Find Senior Mlops Engineer JobsOverview
Showing 5 of 5+ Senior Mlops Engineer jobs










Looking for more senior mlops engineer jobs?
Explore related role searches to find more openings that fit.
See related jobs
Company Description
At BlueAlly, our mission is to make technology more accessible, more certain, and more impactful for every organization.
From cloud to cybersecurity, infrastructure to application modernization, we thrive on cutting-edge technologies and services. Elevate the impact of technology across your enterprise with world-class expertise that produces game-changing insights. Turn complex decisions into clear opportunities with a trusted guide to technology that ensures the next digital advance will be your decisive advantage. Trade IT complexity for capability with solutions that elevate possibilities, and advance with certainty, knowing you have BlueAlly as your ally in next. BlueAlly. Conquer Complexity.
Job Description
We are hiring a Senior AI Engineer to design, build, and operate enterprise AI systems across our client portfolio. You will work end-to-end across the AI stack — from inference engines and platform infrastructure (vLLM, KV cache, Dynamo-style serving, GPU-accelerated AI Factory platforms) up through application-level engineering (RAG pipelines, agent workflows, prompt engineering, evaluation methodology).
This role is for an engineer who can lead workstreams independently, mentor more junior engineers, and serve as the technical authority that clients trust to deliver production AI outcomes. You'll engage directly with client architects, data scientists, application teams, and executives — and you'll leave each engagement having raised both the client's capability and BlueAlly's practice.
Key Responsibilities:
- Lead end-to-end design, build, and operation of AI systems on AI Factory platforms (HPE PCAI, Dell AI Factory, Nutanix Enterprise AI, and adjacent ecosystem layers) across multiple client engagements.
- Engineer and tune LLM inference serving stacks — primary depth in vLLM with breadth across the inference ecosystem — for client latency, throughput, and cost targets.
- Tune inference performance through KV cache management, paged attention, batching strategies, and Dynamo-based disaggregated serving.
- Architect and operate MLOps pipelines covering model lifecycle, registries, deployment, rollback, and observability.
- Design and engineer RAG applications on top of vector databases — chunking strategies, retrieval tuning, reranking, citation handling, and context-window management.
- Build and tune prompt-engineering patterns at production scale — system prompts, structured output, tool and function calling.
- Design and maintain LLM evaluation harnesses — golden sets, regression suites, and online quality metrics.
- Engineer high-performance storage and networking for AI workloads — parallel filesystems, object storage tiers, and high-throughput, low-latency RDMA fabrics.
- Operate Kubernetes clusters underpinning AI workloads — namespaces, RBAC, resource quotas, network policies, storage classes, and ingress.
- Build and maintain container images, registries, and CI/CD pipelines for AI/ML services.
- Implement monitoring, alerting, logging, and capacity planning across the AI stack.
- Harden environments to meet client security and compliance requirements.
- Lead troubleshooting across bare metal, BIOS/firmware, OS, containers, GPUs, frameworks, and models.
- Engage directly with client stakeholders — technical and executive — to communicate status, root cause, options, and recommendations.
- Mentor and code-review work from less senior engineers; raise the technical bar of every engagement you join.
- Author runbooks, reference architectures, and knowledge base content; lead client knowledge transfer and enablement sessions.
- Participate in on-call rotation and incident response for production AI workloads.
- Contribute reusable patterns, tooling, and reference designs back to the practice.
Qualifications
- Experience: 7+ years of software, data, or infrastructure engineering, with 3+ years specifically working with modern AI / LLM systems.
- Software engineering: Production-quality Python at engineering level — testing, code review, version control fluency, and shipping code that other engineers depend on.
- Linux engineering: Deep production Linux experience, including system internals, performance tuning, and troubleshooting.
- Containers: Deep proficiency with Docker — image build, registry management, runtime tuning, and container security.
- Hardware fundamentals: Strong server-platform skills including CPU/GPU topologies, PCIe, BMC management, BIOS/firmware lifecycle, and physical-to-logical troubleshooting.
- AI Factory platforms: Hands-on experience deploying and operating one or more of HPE PCAI, Dell AI Factory, or Nutanix Enterprise AI.
- Inference stack — vLLM: Production experience deploying, tuning, and operating vLLM.
- Inference stack breadth: Working knowledge of multiple inference and model-serving frameworks beyond vLLM, with the ability to choose and tune the right tool for each workload.
- High-performance storage and networking: Hands-on experience with high-throughput, low-latency storage and network fabrics for AI workloads — including RDMA-class interconnects, parallel/object storage tiers, KV cache management, and Dynamo-style disaggregated serving.
- MLOps: Practical experience operating MLOps tooling and patterns — model registries, deployment pipelines, GitOps, lineage, and rollback.
- Vector databases and RAG: Hands-on experience deploying, tuning, and integrating vector databases and RAG pipelines, including the application-level engineering that sits on top of them.
- Prompt engineering and tool use: Production experience designing system prompts, structured output, function calling, and tool-using LLM patterns.
- Evaluation methodology: Demonstrated experience designing LLM evaluation harnesses — golden sets, regression suites, and quality/cost metrics.
- Client-facing skills: Demonstrated ability to engage directly with client stakeholders — running working sessions, presenting recommendations, and translating technical detail for non-technical audiences.
- Communication: Strong written and verbal communication — clear reference architectures, runbooks, and incident reports.
- Mentorship: Track record of mentoring more junior engineers and raising team technical quality through code review and pairing.
- Networking fundamentals: TCP/IP, DNS, load balancing, VLANs, and firewall administration.
- Multi-client delivery: Comfort working across multiple concurrent client environments and managing competing priorities under SLA.
Preferred Qualifications:
- GPU operations: Experience with GPU drivers, CUDA toolchains, GPU partitioning (MIG/vGPU), and GPU-level monitoring.
- NVIDIA AI Enterprise: Deployment and operations experience with the NVAIE software stack.
- Ray: Familiarity with Ray for distributed training and inference scaling.
- Kubernetes: Working knowledge of Kubernetes administration — Helm, ingress, RBAC, storage classes.
- Identity and access: Integrating SSO and enterprise identity (LDAP, AD, OIDC/SAML), secrets management, tenant isolation.
- Fine-tuning: Familiarity with LoRA/QLoRA/PEFT and supervised fine-tuning workflows.
- Token economics: Experience optimizing inference cost — caching, prompt caching, model routing, and distillation.
- MSP / multi-tenant operations: Service-provider experience including chargeback/showback and tenant isolation patterns.
- Compliance frameworks: SOC 2, HIPAA, FedRAMP, FISMA, or CMMC environments.
- Public cloud and hybrid: Working experience with one or more public clouds and hybrid architectures.
- Infrastructure as Code: Terraform, Ansible, Helm, or similar.
Certifications (Preferred):
- Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD).
- Cloud certifications — AWS, Azure, or Google Cloud.
- Linux certifications — RHCE, RHCSA, or LFCS.
- NVIDIA-Certified Associate: AI Infrastructure and Operations (NCA-AIIO) or higher NVIDIA certifications.
- HPE, Dell Technologies, or Nutanix platform certifications.
What Sets You Apart:
- Genuine curiosity about how AI systems work end-to-end — from kernel and GPU up through frameworks and models.
- Track record of restoring production AI services under pressure.
- Ability to translate complex technical concepts into clear, client-facing communication.
- Comfort with ambiguity and rapid change in the AI/LLM ecosystem.
- Service-oriented mindset: you treat each client environment as if it were your own.
- Bias toward leaving the practice better than you found it — patterns, tooling, and reference designs.
Additional Information
About BlueAlly
BlueAlly is a leading provider of IT services and solutions, helping organizations conquer IT complexity across cloud, cybersecurity, infrastructure, data, and application modernization. Headquartered in Cary, North Carolina, with delivery teams across the United States and globally, BlueAlly serves clients ranging from mid-market enterprises to large public-sector and commercial organizations.
Founded in 2011, BlueAlly delivers across the full technology lifecycle — from strategy and design through implementation, managed services, and continuous optimization. The company is recognized on CRN's Tech Elite 150 and MSP 500 lists and partners deeply with leading technology vendors. As enterprise AI moves from pilot to production, BlueAlly is investing in the people, platforms, and practices required to deliver AI Factory outcomes for our clients — and this role is at the center of that investment.
Equal Employment Opportunity
BlueAlly is an Equal Opportunity Employer. We are committed to building a diverse and inclusive workforce and to making employment decisions based on merit, qualifications, and business need. BlueAlly does not discriminate in employment on the basis of race, color, religion, sex (including pregnancy), national origin, age, disability, genetic information, sexual orientation, gender identity or expression, marital status, veteran status, or any other protected characteristic under applicable federal, state, or local law.
BlueAlly provides reasonable accommodations to qualified applicants and employees with disabilities. If you require an accommodation to participate in the application or interview process, please contact our People team.
See All 5 Senior Mlops Engineer Jobs in Georgia
Find roles in Georgia that match your experience and apply in just a few clicks.
Find Senior Mlops Engineer JobsSenior Mlops Engineer Jobs by City in Georgia
Where Georgia roles are concentrated, by current openings.
Senior Mlops Engineer Job Market in Georgia
A snapshot from current Georgia openings, updated as new roles post.
Who's Hiring
- BlueAlly1

- Honeywell1

- McKinsey & Company1

- Navis, LP1

- Waystar1

Top Industries Hiring
- Technology & Software3
- Chemicals & Materials1
- Consulting & Professional Services1
What Georgia Employers Look For
The qualifications that appear most often in senior mlops engineer jobs across Georgia.
- 3 or more years of experience building and maintaining production ML pipelines
- Proficiency with orchestration tools such as Airflow, Kubeflow, or Prefect
- Hands-on experience with at least one major cloud platform: AWS, GCP, or Azure
- Strong Python skills including packaging, testing, and environment management
- Experience with containerization and Kubernetes for model serving and scaling
- Familiarity with CI/CD practices applied to machine learning workflows and model registries
Senior Mlops Engineer Jobs in Georgia: Frequently Asked Questions
How many senior mlops engineer jobs are there in Georgia?
There are 5+ senior mlops engineer openings in Georgia on Migrate Mate as of June 2026, with the most roles in Atlanta and Alpharetta. New positions post regularly as employers across Georgia hire.
How much do senior mlops engineers make in Georgia?
Senior mlops engineers in Georgia earn a median of about $104,610 a year, based on May 2025 Bureau of Labor Statistics wage data, ranging from around $60,920 for the lowest 10% to over $173,060 for the top 10%. Pay rises with experience, specialty, and employer.
Which Georgia cities have the most senior mlops engineer jobs?
Atlanta and Alpharetta have the most senior mlops engineer openings in Georgia right now, with additional roles spread across smaller metros statewide.
Which companies hire senior mlops engineers in Georgia?
Employers hiring senior mlops engineers in Georgia include BlueAlly, Honeywell, and McKinsey & Company, based on current listings on Migrate Mate as of June 2026.
Are there remote senior mlops engineer jobs in Georgia?
Yes. About 20% of senior mlops engineer openings tied to Georgia are remote or hybrid as of June 2026. The rest are on-site roles based in Georgia metros.
How do I apply for senior mlops engineer jobs in Georgia?
You can apply to senior mlops engineer jobs in Georgia directly on Migrate Mate. Search the listings above, find roles that match your experience and preferred Georgia location, then apply to each one that fits.
See All 5 Senior Mlops Engineer Jobs in Georgia
Find roles in Georgia that match your experience and apply in just a few clicks.
Find Senior Mlops Engineer Jobs