Online Course – Certified Professional Internship in Data Engineering in Python from Google and Duke University

Improve your programming skills with data engineering. Use big data to make decisions, perform analytics, and develop artificial intelligence and machine learning applications.

Suggested by: Coursera (What is Coursera?)

Professional Certificate

Intermediate level

No prior knowledge required

Time to complete the course

7-day free trial

No unnecessary risks

Skills you will acquire in the course

  • Data Science
  • Big Data
  • Python Programming
  • Information Engineering
  • Kubernetes
  • Data Visualization
  • Apache Hadoop
  • Docker Container
  • Apache Spark
  • Snowflake (Data Warehouse)

What you will learn in the course

Courses for which the course is suitable

  • Software Engineer
  • key
  • researcher
  • Data Scientist
  • Data Engineer
  • Machine Learning Expert
  • Business Information Specialist

Internship – Course Series Three

Learn how to use data engineering to leverage big data for business strategy, data analytics, or machine learning and artificial intelligence. By completing this course series, you will equip yourself with the knowledge and skills required to build effective data pipelines, manage advanced platforms such as:

  • Hadoop
  • Spark
  • Snowflake
  • Databricks
  • Kubernetes

and tell stories with data through visualization. Delve into fundamental big data concepts, distributed computing with Spark, the Snowflake architecture, Databricks machine learning capabilities, Python techniques for data visualization, and critical practices like DataOps.

Target audience

This course series is intended for:

  • Software engineers
  • Developers
  • Researchers
  • Data Scientists

who want to strengthen their expertise in data science or machine learning, as well as for professionals interested in developing a career as a data-focused software engineer, data scientist, or data engineer working in the cloud, machine learning, business intelligence, or other fields.

The Practical Learning Project

The internship includes a capstone project that focuses on using the Databricks API to replicate an existing project. This provides hands-on experience working with Databricks to build a portfolio-ready data solution. You will apply Python to a variety of data engineering tasks.

Details of the courses that make up the specialization

Spark, Hadoop, and Snowflake for Data Engineering

  • Course 1 • 29 hours • 3.9 (40 ratings)

Course Details

What you’ll learn
  • Create scalable data pipelines (Hadoop, Spark, Snowflake, Databricks) for efficient data management.
  • Optimize data engineering with clustering and scaling to improve performance and resource utilization.
  • Build ML solutions (PySpark, MLFlow) on Databricks for partial model development and deployment.
  • Implement DataOps and DevOps practices for continuous integration and deployment (CI/CD) of data-driven applications, including process automation.
Skills you will gain
  • Category: Big Data
  • Category: Python Programming
  • Category: Information Engineering
  • Category: Apache Hadoop
  • Category: Apache Spark

Virtualization, Docker, and Kubernetes for Data Engineering

  • Course 2 • 27 hours • 3.8 (23 ratings)

Course Details

What you’ll learn
  • Master virtualization, containerization, and Docker, including Dockerfile creation and coordination between multiple containers with Compose and Airflow.
  • Develop expertise in core Kubernetes concepts, cluster architecture, and implementation using cloud environments, GitHub Codespaces, and AI-driven tools.
  • Navigate data scenarios while mastering containerization, deploying applications, and addressing production issues with cloud orchestration and SRE practices.
Skills you will gain
  • Category: Cloud-based integration
  • Category: Containerization
  • Category: Virtualization
  • Category: Kubernetes
  • Category: Docker (software)

Data visualization with Python

  • Course 3 • 9 hours • 4.2 (15 ratings)

Course Details

What you’ll learn
  • Use Python, spreadsheets, and BI tools professionally to create impressive and interactive data visualizations.
  • Convey and communicate data-driven insights and stories through impressive visualizations and data storytelling.
  • Conduct an assessment and select the most appropriate visualization tools and techniques to meet the organization’s needs and goals.
Skills you will gain
  • Category: Business Communication
  • Category: Data Analysis
  • Category: Python Programming
  • Category: Cloud Applications
  • Category: Data Visualization