Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Will I earn university credit for completing the Specialization?

This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

Enterprise AI and Data Engineering with Databricks Specialization

Limited time! Save 40% on 3 months of Coursera Plus and full access to thousands of courses.

Enterprise AI and Data Engineering with Databricks Specialization

Build Production Data and AI on Databricks.

Master lakehouse architecture, Delta Live Tables, ML, GenAI, and MLOps in five hands-on courses.

Instructors: Noah Gift

Included with

Learn more

5 course series

Get in-depth knowledge of a subject

Beginner level

Recommended experience

4 weeks to complete

at 5 hours a week

Flexible schedule

Learn at your own pace

5 course series

Get in-depth knowledge of a subject

Beginner level

Recommended experience

4 weeks to complete

at 5 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Architect and build medallion data pipelines (bronze, silver, gold) using Apache Spark, Delta Lake, and Databricks Workflows
Implement declarative ETL with Delta Live Tables including data quality expectations, streaming ingestion via Auto Loader, and Change Data Capture
Train, track, and register machine learning models using MLflow on Databricks with hyperparameter tuning and the Model Registry
Build generative AI applications with LLM fine-tuning, Vector Search, and retrieval-augmented generation on the Databricks platform

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Advance your subject-matter expertise

Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects
Develop a deep understanding of key concepts
Earn a career certificate from Pragmatic AI Labs

Specialization - 5 course series

This five-course specialization takes you from lakehouse fundamentals to production-grade AI systems on the Databricks platform. You begin by building data pipelines with Apache Spark and Delta Lake, learning medallion architecture (bronze, silver, gold) and Unity Catalog governance. You then advance to Delta Live Tables for declarative ETL with built-in data quality expectations, streaming ingestion with Auto Loader, and Change Data Capture with APPLY CHANGES. The specialization progresses into machine learning engineering with MLflow tracking and the Databricks Model Registry, generative AI with LLM fine-tuning and RAG pipelines using Vector Search, and concludes with production governance — model serving, A/B testing, monitoring, and CI/CD for ML systems. Every course includes hands-on labs on the Databricks platform using real-world datasets and production patterns.

Applied Learning Project

Across the specialization, you build progressively complex systems on Databricks. In Course 1, you construct an end-to-end medallion pipeline (bronze to silver to gold) with Delta Lake MERGE operations and Databricks Workflows orchestration. In Course 2, you build a production Delta Live Tables pipeline with expectations-based data quality, streaming ingestion via Auto Loader, and Change Data Capture for an inventory management system. Later courses extend this foundation with MLflow experiment tracking, model registration, LLM fine-tuning, retrieval-augmented generation, and automated model serving with governance controls. Each project uses the Databricks Community Edition or workspace — no cloud billing required for the labs.

Databricks Lakehouse Fundamentals

Course 1 5 hours

What you'll learn

Write PySpark and SparkSQL queries using lazy evaluation, the Catalyst optimizer, and broadcast join optimization
Schedule end-to-end data pipelines as multi-task Databricks Jobs with dashboards and alerting
Build and query Delta Lake tables with ACID transactions, schema enforcement, time travel, and MERGE-based incremental ETL

Data Engineering with Delta Lake on Databricks

Course 2 5 hours

What you'll learn

Build declarative ETL pipelines with Delta Live Tables using both SQL and Python, including streaming ingestion with Auto Loader and schema evolution
Implement the Medallion Architecture (bronze, silver, gold) with expectations-based data quality enforcement at each layer
Design production pipelines with Change Data Capture, incremental processing, and performance optimization using Z-ordering and partitioning

Machine Learning with Databricks and MLflow

Course 3 5 hours

What you'll learn

This course teaches you to build, track, and deploy machine learning models on the Databricks platform using MLflow. You

start with the reproducibility crisis in ML — understanding why untracked experiments, scattered notebooks, and missing version control create production failures — and learn how MLflow solves these problems with structured experiment tracking, model versioning, and artifact management. You then explore MLflow's architecture in depth: the Tracking layer for logging parameters, metrics, and artifacts; the Model Registry for governance and stage gates; and the Projects layer for reproducible environments. The course covers Feature Store architecture for eliminating training/serving skew, where features are computed once and served two ways — batch for training and real-time for inference. You progress through the ML algorithm spectrum from manual implementations to AutoML, learning when to choose transparency over automation for regulated industries. The second module focuses on production deployment: the MLOps maturity staircase (L0 through L3), inference patterns for batch and real-time serving, and the infrastructure decisions that separate prototype ML from production ML. Hands-on labs on Databricks reinforce every concept.

Generative AI and LLMs on Databricks

Course 4 4 hours

What you'll learn

Apply prompt engineering patterns (CoT, ReAct, few-shot) and sampling parameters to control LLM output for production systems
Design and evaluate hybrid RAG pipelines using embeddings, BM25, and Reciprocal Rank Fusion with six standard retrieval metrics
Implement model security through cryptographic chain-of-trust signing, AI Gateway governance, and Unity Catalog model registry workflows

Production Governance and MLOps on Databricks

Course 5 7 hours

What you'll learn

Navigate and manage the Unity Catalog hierarchy (metastores, catalogs, schemas, tables) using the SDK, CLI, and VS Code
Implement access control by creating service principals and writing GRANT/REVOKE statements in SQL
Implement access control by creating service principals and writing GRANT/REVOKE statements in SQL

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Noah Gift

Pragmatic AI Labs

10 Courses 330 learners

Alfredo Deza

Pragmatic AI Labs

4 Courses 134 learners

Offered by

Pragmatic AI Labs

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.