AI DevOps and Cloud Infrastructure Engineer

Crowe
remote work
United States, Florida, Sarasota
Jan 11, 2026

Your Journey at Crowe Starts Here:

At Crowe, you can build a meaningful and rewarding career. With real flexibility to balance work with life moments, you're trusted to deliver results and make an impact. We embrace you for who you are, care for your well-being, and nurture your career. Everyone has equitable access to opportunities for career growth and leadership. Over our 80-year history, delivering excellent service through innovation has been a core part of our DNA across our audit, tax, and consulting groups. That's why we continuously invest in innovative ideas, such as AI-enabled insights and technology-powered solutions, to enhance our services. Join us at Crowe and embark on a career where you can help shape the future of our industry.

Job Description:

About Crowe AI Transformation

Everything we do is about making the future of human work more purposeful. We do this by leveraging state-of-the-art technologies, modern architecture, and industry experts to create AI-powered solutions that transform the way our clients do business.

The new AI Transformation team will build on Crowe's established AI foundation, furthering the capabilities of our Applied AI / Machine Learning team. By combining Generative AI, Machine Learning and Software Engineering, this team empowers Crowe clients to transform their business models through AI, irrespective of their current AI adoption stage.

As a member of AI Transformation, you will help distinguish Crowe in the market and drive the firm's technology and innovation strategy. The future is powered by AI; come build it with us.

About the Team

  • We invest in expertise. You'll have the time, space, and support to go deep in your projects and build lasting technical and strategic mastery. You'll work with developers, product stakeholders, and project managers as a trusted leader and domain expert.

  • We believe in continuous growth. Our team is committed to professional development and knowledge-sharing.

  • We protect balance. Our distributed team culture is grounded in trust and flexibility. We offer unlimited PTO, a flexible remote work policy, and a supportive environment that prioritizes sustainable, long-term performance.

About the Role

The AI DevOps and Cloud Infrastructure Manager leads teams responsible for designing, operating, and scaling AI/ML infrastructure, cloud platforms, and DevOps automation that support enterprise model training, inference, and generative AI workloads. This role is responsible for the strategy and execution of cloud-native, Kubernetes-based platforms that enable reliable, secure, and cost-efficient AI systems.

As a manager, this position combines hands-on technical leadership with people management, delivery ownership, and strategic decision-making. The role oversees distributed compute environments, GPU clusters, CI/CD pipelines, and vector-search infrastructure while ensuring high availability, resilience, and compliance with security and responsible AI standards. The manager partners closely with AI engineering, data engineering, product, and security teams, serves as the primary technical owner for assigned initiatives, and communicates system risks, tradeoffs, and progress to leadership.

Key responsibilities include:

  • Leading engineering teams responsible for AI/ML infrastructure, cloud operations, and MLOps automation.

  • Defining cloud, Kubernetes, and infrastructure strategy to support scalable model training, inference, and generative AI platforms.

  • Guiding the design and operation of distributed compute environments, GPU clusters, and vector database infrastructure.

  • Overseeing CI/CD pipelines that automate model training, testing, deployment, monitoring, and lifecycle management.

  • Managing incident response, failure analysis, and reliability engineering across AI platforms.

  • Directing performance testing, capacity planning, and cost optimization for AI infrastructure.

  • Ensuring compliance with cloud security, IAM practices, governance requirements, and responsible AI frameworks.

  • Implementing multi-cloud resilience patterns, high availability, and automated failover for critical AI workloads.

  • Supporting platform modernization initiatives, including adoption of optimized LLM runtimes and new orchestration technologies.

  • Evaluating third-party infrastructure tools, GPU scheduling solutions, and platform enhancements.

  • Communicating system status, dependencies, risks, and technical decisions to senior leadership.

  • Managing 4-5 direct reports, including coaching, performance management, and career development.

  • Owning project delivery, including budget, timelines, and quality of outcomes.

  • Coordinating with sales and stakeholders on project sizing, feasibility, and strategic opportunities.

  • Driving continuous improvement initiatives to advance DevOps maturity and AI infrastructure operational readiness.

Qualifications

  • 7+ years of professional experience in DevOps, cloud engineering, MLOps, or platform engineering.

  • 2+ years of experience in engineering leadership or senior technical leadership roles.

  • Expert proficiency with distributed cloud systems, Kubernetes, and infrastructure-as-code.

  • Advanced ability to troubleshoot infrastructure, networking, container, and deployment issues.

  • Proficiency in Python, Bash, or similar automation and scripting languages.

  • Strong understanding of monitoring, observability, and reliability engineering patterns.

  • Hands-on experience supporting infrastructure for ML or generative AI workloads.

  • Strong leadership, communication, and cross-functional collaboration skills.

Preferred Qualifications

  • Bachelor's degree in computer science, engineering, cloud computing, or a related field.

  • Master's degree in a technical discipline.

  • Cloud and AI certifications, including Azure (AZ-900, AZ-104, AZ-305, AZ-700, AZ-800, AI-102) or equivalent AWS/GCP certifications.

  • Extensive experience with Kubernetes platforms (EKS, AKS, GKE) and cloud ML services (Azure ML, SageMaker).

  • Experience with GPU workload orchestration, optimization, and multi-tenant inference environments.

  • Expertise in observability and distributed tracing (Prometheus, Grafana, CloudWatch, OpenTelemetry).

  • Strong experience with Terraform and infrastructure governance at scale.

  • Familiarity with service mesh architectures (Istio, Linkerd) and advanced deployment patterns (blue/green, canary).

  • Advanced experience supporting generative AI platforms, including LLM inference runtimes (vLLM, TGI), RAG infrastructure, and vector databases (Pinecone, Weaviate, FAISS).

  • Experience operating fine-tuned LLMs (LoRA, QLoRA), managing GenAI CI/CD pipelines, and implementing hallucination, drift, and reliability monitoring.

  • Demonstrated ability to make strategic technical decisions within defined delivery and budget constraints.

We expect the candidate to uphold Crowe's values of Care, Trust, Courage, and Stewardship. These values define who we are. We expect all of our people to act ethically and with integrity at all times.

The application deadline for this role is 04/30/2026.

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire. Crowe is not sponsoring for work authorization at this time.

The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Crowe, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $102,400.00 - $204,100.00 per year.

Our Benefits:
Your exceptional people experience starts here. At Crowe, we know that great people are what makes a great firm. We care about our people and offer employees a comprehensive total rewards package. Learn more about what working at Crowe can mean for you!

How You Can Grow:
We will nurture your talent in an inclusive culture that values diversity. You will have the chance to meet regularly with your Career Coach, who will guide you in your career goals and aspirations. Learn more about where talent can prosper!

More about Crowe:
Crowe (www.crowe.com) is one of the largest public accounting, consulting and technology firms in the United States. Crowe uses its deep industry expertise to provide audit services to public and private entities while also helping clients reach their goals with tax, advisory, risk and performance services. Crowe is recognized by many organizations as one of the country's best places to work. Crowe serves clients worldwide as an independent member of Crowe Global, one of the largest global accounting networks in the world. The network consists of more than 200 independent accounting and advisory services firms in more than 130 countries around the world.

Crowe LLP provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, sexual orientation, gender identity or expression, genetics, national origin, disability or protected veteran status, or any other characteristic protected by federal, state or local laws.

Crowe LLP does not accept unsolicited candidates, referrals or resumes from any staffing agency, recruiting service, sourcing entity or any other third-party paid service at any time. Any referrals, resumes or candidates submitted to Crowe, or any employee or owner of Crowe without a pre-existing agreement signed by both parties covering the submission will be considered the property of Crowe, and free of charge.

Crowe will consider for employment all qualified applicants, including those with criminal histories, in a manner consistent with the requirements of applicable state and local laws, including the City of Los Angeles' Fair Chance Initiative for Hiring Ordinance, Los Angeles County Fair Chance Ordinance, San Francisco Fair Chance Ordinance, and the California Fair Chance Act.

Please visit our webpage to see notices of the various state and local Ban-the-Box laws and Fair Chance Ordinances, where applicable.
