CrawlJobs Logo

Gen AI Engineering and Scaled AI Transformation

Canada, Mississauga Employment contract 145100.00 - 217700.00 USD / Year · Job Posted May 04, 2026
Apply Position
Job Link Share

Job Responsibility

  • Acts as a senior technical authority on Large Language Models, including both commercial and open‑source ecosystems (OpenAI, Gemini, Claude, Llama)
  • Leads model selection and deployment strategy, balancing use‑case fit, data sensitivity, cost efficiency, latency, accuracy, and regulatory constraints
  • Guides decisions on hosted vs. private vs. fine‑tuned models, ensuring optimal trade‑offs between performance, control, and operational risk
  • Establishes enterprise standards for LLM lifecycle management, including upgrades, regression validation, and decommissioning
  • Demonstrates hands‑on leadership in building GenAI applications using LangChain, LangGraph, LlamaIndex, and Hugging Face, translating experimentation into production systems
  • Architects agentic and multi‑step workflows, enabling tool‑use, reasoning chains, state management, and orchestration at enterprise scale
  • Sets reusable reference patterns and accelerators for GenAI adoption across application teams
  • Ensures solutions are built with enterprise-grade reliability, explainability, and extensibility
  • Designs and delivers robust RAG architectures that ground GenAI outputs in trusted, auditable enterprise data
  • Leads implementation of vector databases and embedding strategies (pgvector, Pinecone, Weaviate, FAISS), aligned with data access and security models
  • Applies advanced retrieval techniques including hybrid search, re‑ranking, metadata filtering, and context optimization to improve response accuracy and relevance
  • Ensures RAG solutions support data lineage, auditability, and regulatory compliance
  • Establishes prompt engineering and orchestration standards to ensure consistency, maintainability, and quality across GenAI solutions
  • Optimizes GenAI workflows by actively managing latency, throughput, token cost, and accuracy trade‑offs in production environments
  • Implements evaluation and experimentation frameworks to continuously improve output quality and business value
  • Drives disciplined use of caching, batching, fallback models, and token optimization techniques
  • Applies strong grounding in ML/DL fundamentals, enabling informed architectural decisions and credible engagement with data science teams
  • Leverages PyTorch and TensorFlow for embeddings, training pipelines, and targeted fine‑tuning where business value is clear
  • Ensures GenAI capabilities integrate seamlessly into the broader ML, data, and MLOps ecosystem
  • Balances rapid GenAI delivery with long‑term model sustainability and governance
  • Leads deployment of GenAI systems into secure, scalable production environments using Docker, cloud‑native architectures, and hardened APIs
  • Establishes observability and monitoring for GenAI applications, covering performance, drift, quality, reliability, and failure modes
  • Ensures GenAI platforms meet enterprise availability, resilience, and disaster recovery expectations
  • Drives operational readiness, incident management, and ongoing optimization of AI services
  • Brings strong hands‑on software engineering credibility, setting standards for Python‑based GenAI services
  • Leads development of high‑performance AI‑powered APIs using FastAPI and async programming patterns
  • Champions clean architecture, testability, and security best practices across AI engineering teams
  • Acts as a bridge between traditional application engineering and AI‑native development
  • Leads the implementation of AI evaluation and governance frameworks, including hallucination detection, confidence scoring, and human‑in‑the‑loop validation
  • Designs and enforces guardrails, moderation layers, and usage controls to prevent misuse or unintended outcomes
  • Partners with Risk, Compliance, Legal, and Security teams to embed Responsible AI principles into all GenAI solutions
  • Ensures GenAI adoption withstands audit, regulatory, and reputational scrutiny
  • Operates as a hands‑on SVP, combining strategic influence with deep technical execution
  • Leads senior engineers and GenAI specialists, building sustainable internal AI capability rather than point solutions
  • Communicates complex GenAI concepts clearly to executive and non‑technical stakeholders
  • Drives delivery in agile, fast‑moving environments, with a strong bias for outcomes and measurable value

Requirements

  • 10+ years of progressive experience in software engineering, ML, or AI platforms, with 5+ years leading senior engineers and architects
  • 3+ years of hands‑on experience deploying LLM‑based systems in production environments at enterprise scale
  • Demonstrated authority across commercial and open‑source LLM ecosystems (e.g., OpenAI, Anthropic, Google, Llama), including model selection, fine‑tuning, and hosting strategies
  • Proven ability to define enterprise-wide GenAI standards, reference architectures, and reusable accelerators
  • Demonstrated leadership in establishing prompt engineering standards and orchestration patterns
  • Experience optimizing latency, throughput, accuracy, and token cost across large‑scale GenAI workloads
  • Bachelor’s degree/University degree or equivalent experience
  • Master’s degree preferred

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Gen AI Engineering and Scaled AI Transformation

8 matching positions

Senior Software Engineer Applied Gen AI Engineering

At Citi, we are pioneering the future of enterprise operations through innovativ...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of professional software engineering experience, demonstrating a strong track record of designing, building, and delivering scalable enterprise-grade solutions in commercial production environments, not just proofs-of-concept
  • Expert-level proficiency in Python is a must-have, with a deep understanding of its ecosystem for AI/ML development, data engineering, and backend services
  • Extensive hands-on experience with Generative AI concepts, Large Language Models (LLMs), transformer architectures, RAG, and advanced agentic frameworks (e.g., LangChain, LangGraph, Google ADK)
  • Deep comfort and practical experience with containers and orchestration technologies, specifically OpenShift
  • Demonstrated ability to architect, develop, and deploy highly performant, large-scale AI/ML systems into production environments
  • Strong understanding of modern software development principles, clean code practices, data structures, algorithms, and distributed systems
  • Proficiency with Relational (preferably, PostgreSQL) and Vector (preferably, pgvector) databases
Job Responsibility
Job Responsibility
  • Architect & Build Production Systems
  • Pioneer Automation with Agents
  • Master Containerized Deployments
  • Drive Technical Direction & Ownership
  • Champion Engineering Excellence
  • Innovate & Research
  • Mentor & Collaborate
  • Iterate & Deliver
  • Ensure Responsible AI
What we offer
What we offer
  • Unprecedented Impact & Visibility
  • Cutting-Edge Technology
  • Growth & Development
  • Collaborative Environment
  • Flexible Work Environment
  • Global Scale
  • Fulltime
Read More
Arrow Right

Gen Ai Tech Engineering Lead - Senior Vice President

This role is for an innovative Generative AI Engineer to drive the adoption of L...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep hands-on experience in engineering and executing enterprise solutions that scale effectively
  • Fluency in at least two programming languages, with a strong preference for Python and Java. Proficiency in Javascript/Typescript and Golang is also highly valued
  • Strong understanding of Language Models (LLMs), transformers, agentic frameworks, vector stores, and search algorithms
  • Experience with relevant frameworks such as Spring (AI, Boot, Core, etc.), Flask, LangChain, LangGraph, ADK, and MLFlow
  • Proficiency in database technologies, such as Oracle, Postgres, or MongoDB
  • Experience with messaging and integration platforms like Kafka or JMS/MQ
  • UI development skills with technologies such as React JS or StreamLit
  • Experience designing and implementing REST and WebSocket APIs
  • Familiarity with one of the major cloud platforms including AWS, GCP, or Azure
  • Strong knowledge of infrastructure tools, including Docker, Kubernetes, Terraform, and Helm
Job Responsibility
Job Responsibility
  • Contribute significantly to both engineering and research initiatives within the Generative AI domain
  • Adopt a product-focused approach, ensuring the development of robust, scalable, and user-friendly solutions
  • Thrive in a fast-paced environment by continuously testing, learning, and tackling cutting-edge problems
  • Actively engage in pair programming, promote lean methodologies, and streamline processes by removing unnecessary bureaucracy
  • Prioritize rapid delivery and iterative development, demonstrating adaptability and a willingness to pivot rather than pursuing a perfect upfront solution
  • Develop foundational components and mature technology capabilities in Artificial Intelligence (AI) and Large Language Models (LLMs)
  • Drive the enterprise-wide adoption and successful integration of Generative AI solutions
  • Demonstrate strong problem-solving capabilities and a proactive learning mindset, continuously acquiring new skills
  • Exhibit excellent communication and collaboration skills, effectively engaging with diverse stakeholders
  • Be adaptable and resourceful in navigating complex technical and organizational challenges
  • Fulltime
Read More
Arrow Right

Gen AI Tech Engineering Senior Lead

This role is for an innovative Generative AI Engineer to drive the adoption of L...
Location
Location
Canada , Mississauga
Salary
Salary:
145100.00 - 217700.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep hands-on experience in engineering and executing enterprise solutions that scale effectively
  • Fluency in at least two programming languages, with a strong preference for Python and Java. Proficiency in Javascript/Typescript and Golang is also highly valued
  • Strong understanding of Language Models (LLMs), transformers, agentic frameworks, vector stores, and search algorithms
  • Experience with relevant frameworks such as Spring (AI, Boot, Core, etc.), N8N, Flask, LangChain, LangGraph, and MLFlow
  • Proficiency in database technologies, such as Oracle, Postgres, or MongoDB
  • Experience with messaging and integration platforms like Kafka or JMS/MQ
  • UI development skills with technologies such as React JS or StreamLit
  • Experience designing and implementing REST and WebSocket APIs
  • Familiarity with one of the major cloud platforms including AWS, GCP, or Azure
  • Strong knowledge of infrastructure tools, including Docker, Kubernetes, Terraform, and Helm
Job Responsibility
Job Responsibility
  • Contribute significantly to both engineering and research initiatives within the Generative AI domain
  • Adopt a product-focused approach, ensuring the development of robust, scalable, and user-friendly solutions
  • Thrive in a fast-paced environment by continuously testing, learning, and tackling cutting-edge problems
  • Actively engage in pair programming, promote lean methodologies, and streamline processes by removing unnecessary bureaucracy
  • Prioritize rapid delivery and iterative development, demonstrating adaptability and a willingness to pivot rather than pursuing a perfect upfront solution
  • Develop foundational components and mature technology capabilities in Artificial Intelligence (AI) and Large Language Models (LLMs)
  • Drive the enterprise-wide adoption and successful integration of Generative AI solutions
  • Demonstrate strong problem-solving capabilities and a proactive learning mindset, continuously acquiring new skills
  • Exhibit excellent communication and collaboration skills, effectively engaging with diverse stakeholders
  • Be adaptable and resourceful in navigating complex technical and organizational challenges
  • Fulltime
Read More
Arrow Right

Gen AI Python Developer

The Applications Development Intermediate Programmer Analyst is an intermediate ...
Location
Location
India , Chennai
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4-8 years of relevant experience in Apps Development or systems analysis role
  • Strong foundational knowledge in Machine Learning (ML modeling), Data Science, Statistics, and AI fundamentals, including Natural Language Processing (NLP), Neural Networks, and Large Language Models (LLMs)
  • Extensive hands-on experience with leading LLMs such as Google Gemini, OpenAI models, Anthropic Claude, Mistral, Llama, and various other open-source LLMs
  • Deep working knowledge and hands-on experience with Retrieval-Augmented Generation (RAG) pipelines, including advanced RAG techniques and their detailed implementation
  • Proven ability to build, tune, and deploy LLM-based applications using platforms like Vertex AI, Hugging Face, etc.
  • Expertise in developing robust prompt engineering strategies, prompt tuning, and creating reusable prompt templates
  • Hands-on experience with agentic framework-based use case implementation
  • Working knowledge of Guardrails and methodologies for assessing the performance and safety of GenAI features
  • Strong programming proficiency in Python, including extensive experience with libraries such as Pandas, NumPy, scikit-learn, PyTorch, TensorFlow, Transformers, FastAPI, Seaborn, LangChain, and LlamaIndex
  • Proficiency in integrating generative AI with enterprise applications using APIs, knowledge graphs, and orchestration tools
Job Responsibility
Job Responsibility
  • Participation in the establishment and implementation of new or revised application systems and programs in coordination with the Technology team
  • Contribute to applications systems analysis and programming activities
  • Fulltime
Read More
Arrow Right

Gen Ai Tech Engineer - Avp

Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience in engineering and executing enterprise solutions that scale effectively
  • Fluency in at least two programming languages, with a strong preference for Python and Java
  • Proficiency in Javascript/Typescript and Golang is also highly valued
  • Strong understanding of Language Models (LLMs), transformers, agentic frameworks, vector stores, and search algorithms
  • Experience with relevant frameworks such as Spring (AI, Boot, Core, etc.), Flask, LangChain, LangGraph, ADK and MLFlow
  • Proficiency in database technologies, such as Oracle, Postgres, or MongoDB
  • Experience with messaging and integration platforms like Kafka or JMS/MQ
  • UI development skills with technologies such as React JS or StreamLit
  • Experience designing and implementing REST and WebSocket APIs
  • Familiarity with one of the major cloud platforms including AWS, GCP, or Azure
Job Responsibility
Job Responsibility
  • Contribute significantly to both engineering and research initiatives within the Generative AI domain
  • Adopt a product-focused approach, ensuring the development of robust, scalable, and user-friendly solutions
  • Thrive in a fast-paced environment by continuously testing, learning, and tackling cutting-edge problems
  • Actively engage in pair programming, promote lean methodologies, and streamline processes by removing unnecessary bureaucracy
  • Prioritize rapid delivery and iterative development, demonstrating adaptability and a willingness to pivot rather than pursuing a perfect upfront solution
  • Develop foundational components and mature technology capabilities in Artificial Intelligence (AI) and Large Language Models (LLMs)
  • Drive the enterprise-wide adoption and successful integration of Generative AI solutions
  • Demonstrate strong problem-solving capabilities and a proactive learning mindset, continuously acquiring new skills
  • Exhibit excellent communication and collaboration skills, effectively engaging with diverse stakeholders
  • Be adaptable and resourceful in navigating complex technical and organizational challenges
  • Fulltime
Read More
Arrow Right

Quality Engineering Transformation Leader

The Quality Engineering (QE) Technical & Transformational Leader is a pivotal se...
Location
Location
United Kingdom , London or Norwich
Salary
Salary:
Not provided
whitehallresources.com Logo
Whitehall Resources Ltd
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 20-25+ years in QE engineering, transformation, and automation leadership
  • Mastery of: Playwright, Appium, Selenium, Cypress, Java
  • API Automation (REST Assured, Karate, Postman)
  • Performance (JMeter, LoadRunner)
  • Cloud‑native testing (Azure, AWS, GCP)
  • Test orchestration, CI/CD pipelines
  • TDM, environment strategy, observability tools
  • Outstanding communication, influence, and executive presence
  • Proven ability to operate in demanding, politically dynamic client environments
  • Strong analytical and problem‑solving capability
Job Responsibility
Job Responsibility
  • Architect and drive end‑to‑end QE & Testing transformation aligned to Client’s engineering, digital, and modernization roadmap
  • Define and own QE operational model evolution, embedding continuous testing, shift‑left, shift‑right, DevOps alignment, automation‑first culture, and AI/GenAI adoption
  • Identify and drive transformation levers such as automation scaling, test data modernization, environment optimization, and productivity uplift
  • Partner with domain leaders to introduce modern testing practices (in‑sprint automation, service virtualization, early performance testing, observability, etc.)
  • Build and govern standard frameworks, reusable assets, accelerators, and next‑gen engineering solutions
  • Evaluate, propose, and institutionalize new tooling, frameworks, and engineering approaches through innovation, benchmarking, and continuous improvement
  • Present complex technical concepts clearly to Client’s senior leadership
  • Navigate challenging conversations with clarity, data, confidence, and strategic storytelling
  • Establish KPIs, productivity metrics, and value dashboards for QE across programs
  • Lead analytics‑driven decision‑making on Productivity improvement, Defect trends, Cost of quality, Test efficiency & cycle time
Read More
Arrow Right

Machine Learning Scientist II - Gen AI

We are seeking a highly motivated and experienced Machine Learning Scientist to ...
Location
Location
United States , Boston
Salary
Salary:
127300.00 - 186700.00 USD / Year
simplisafe.com Logo
SimpliSafe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MS or PhD in Computer Science, Artificial Intelligence, or a related field
  • Experience training or fine-tuning large language models (LLMs) using modern frameworks
  • Strong grasp of deep learning, particularly transformer architectures and foundational model training techniques for text and vision modalities
  • Proficient in Python and relevant ML libraries (e.g., PyTorch, TensorFlow, HuggingFace Transformers)
  • Hands-on experience in developing and deploying LLM- or VLM-powered applications
  • Familiarity with prompt engineering, retrieval-augmented generation (RAG), MCP (Model Context Protocol, Agentic AI and evaluation of generative models
  • Understanding of MLOps practices and how to scale experiments into production-grade solutions
  • Strong communication and documentation skills
  • Collaborative mindset with the ability to thrive in a fast-paced, interdisciplinary environment
Job Responsibility
Job Responsibility
  • Develop and fine-tune large language models (LLMs) and vision-language models (VLMs) to address real-world challenges in the home security space
  • Work with key stakeholders to identify key research initiatives that can have impact on business outcomes
  • Take research initiatives from idea generation to production
  • Collaborate with engineers and product managers to integrate capabilities into our existing systems
  • Stay up-to-date on the latest advancements in LLMs, VLMs, and multimodal systems. Evaluate new techniques for potential adoption and improvement of internal capabilities
What we offer
What we offer
  • A mission- and values-driven culture and a safe, inclusive environment where you can build, grow and thrive
  • A comprehensive total rewards package that supports your wellness and provides security for SimpliSafers and their families
  • Free SimpliSafe system and professional monitoring for your home
  • Employee Resource Groups (ERGs) that bring people together, give opportunities to network, mentor and develop, and advocate for change
  • Participation in our annual bonus program, equity, and other forms of compensation, in addition to a full range of medical, retirement, and lifestyle benefits
  • Fulltime
Read More
Arrow Right

Principal Machine Learning Data Scientist, Gen AI

Xometry is seeking a Principal Data & ML Scientist to join our Generative AI tea...
Location
Location
United States , Waltham
Salary
Salary:
164000.00 - 213000.00 USD / Year
cherry.vc Logo
Cherry Ventures
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A bachelor’s degree is required, but an advanced degree (M.S. or PhD) in computer science, machine learning, AI, or a related field is highly preferred
  • 7+ years of experience in data science and machine learning, focusing on generative models, LLMs, or computer vision
  • Expertise in large-scale language and vision models (e.g., Transformers, GPT, VLMs)
  • Experience with multimodal data processing (e.g., combining text, image, and 3D data)
  • Proficient in Python, including key libraries such as PyTorch, TensorFlow, pandas, and numpy
  • Strong background in probability, statistics, and optimization techniques relevant to generative modeling
  • Familiarity with cloud computing resources and tools for model training and deployment (e.g., AWS SageMaker)
  • Familiar with software engineering principles, including version control, reproducibility, and continuous integration
Job Responsibility
Job Responsibility
  • Provide technical leadership to the Generative AI team, setting technical direction, defining best practices, and ensuring the team follows industry standards in AI and ML development
  • Lead strategic planning and roadmap development for generative AI initiatives, identifying high-impact projects and aligning them with Xometry’s business objectives
  • Develop and deploy generative AI models and large language models (LLMs) for multimodal document processing, focusing on extracting structured data from technical drawings, purchasing orders, and other complex documents
  • Lead the exploration and development of innovative text and image-based data processing solutions, including training and fine-tuning generative and language models
  • Design and implement efficient workflows for data preparation, cleaning, and augmentation to support the training of generative AI models
  • Utilize cloud platforms (e.g., Amazon Web Services) for large-scale data processing, model training, and deployment
  • Collaborate with cross-functional teams, including engineering and business teams, to align generative AI solutions with business needs and drive impactful applications
  • Mentor and guide team members on advanced machine learning techniques, model architecture design, and problem-solving strategies to elevate the team’s technical capabilities
  • Continuously experiment and iterate on model performance, tuning architectures and parameters to improve accuracy and efficiency in a fast-paced, agile environment
  • Stay updated with the latest research in generative AI, deep learning, and multimodal data processing, incorporating best practices and advancements into model development
What we offer
What we offer
  • 401(k) match
  • medical, dental and vision insurance
  • life and disability insurance
  • generous paid time off including vacation, sick leave, floating and fixed holidays, maternity and bonding leave
  • EAP, other wellbeing resources
  • Fulltime
Read More
Arrow Right