Online Course – IBM Certified Professional Certificate in Data Engineering

Prepare yourself for a career as an engineer. Acquire skills that are needed in the job market and train yourself in the field of artificial intelligence. Get certified from IBM. No prior experience required.

Suggested by: Coursera (What is Coursera?)

Professional Certificate

Beginners

No prior knowledge required

Time to complete the course

7-day free trial

No unnecessary risks

Skills you will acquire in the course

  • Python
  • SQL
  • Relational Databases (RDBMS)
  • NoSQL
  • ETL
  • Data Warehouses
  • Apache Spark
  • Database Management
  • Working with unstructured data
  • Generative AI tools and techniques
  • Building data slides
  • Practical experience in projects
  • Backing up files in Linux
  • Traffic data analysis
  • Design and establishment of a data warehouse
  • Data migration in NoSQL databases
  • Machine learning model training
  • End-to-end data engineering platform management
  • Portfolio of projects
  • Professional certificate from IBM
  • Digital badge from IBM
  • Access to career resources
  • Ramadame interviews
  • Resume support
  • Up to 12 academic credits

What you will learn in the course

Courses for which the course is suitable

  • Data Engineer
  • Data Scientist
  • Business Intelligence Analyst
  • ETL key
  • Database Administrator
  • Python developer
  • Data Analyst
  • Big Data Expert
  • Data warehouse developer
  • NoSQL developer
  • Apache Spark Expert
  • Data Application Developer
  • Data Project Manager
## Professional Certificate – 16-Course Series Prepare for a career in the emerging field of data engineering. In this program, you can learn essential skills like Python, SQL, and databases to be **job-ready in less than 5 months**. Data engineering is about building systems to collect, process, and **organize raw data into useful information**. Data engineers provide the foundational knowledge that data scientists and business intelligence analysts need to make decisions. This program will teach you the fundamentals of data engineering that employers are looking for in industry roles, including Python, one of the most widely used programming languages. In addition, you will master **SQL, RDBMS, ETL, data warehouses, NoSQL, big data, and Spark** through hands-on labs and projects. **You will learn to use the Python programming language** and Linux/UNIX scripts to extract, transform, and load (ETL) data. You’ll work with relational databases (RDBMS) and query data using SQL commands, as well as NoSQL databases and unstructured data. You’ll also learn how generative AI tools and techniques are used in data engineering. Upon completion of the program, you’ll have a **project portfolio** and a professional certificate from IBM to showcase your expertise. Additionally, you’ll earn a digital badge from IBM and **gain access to career resources** to help you with your job search, including mock interviews and resume support. This program is recommended by ACE®—when you graduate, **you can earn up to 12 academic credits**. ## Hands-on Learning Project During this professional certificate, you’ll complete labs and hands-on projects to gain hands-on experience in Python, SQL, relational databases, NoSQL databases, Apache Spark, building data pipelines, managing databases, and working with data warehouses. ### Projects: – Design a relational database to help a coffee chain improve its operations. – Use SQL to query demographic data on population, crime, and school surveys. – Write a Linux Bash script that backs up changed files. – Set up, test, and optimize a data platform containing MySQL, PostgreSQL, and IBM Db2 databases. – Analyze road traffic data to perform ETL and create pipelines using Airflow and Kafka. – Design and build a data warehouse for a solid waste management company. – Transfer, query, and analyze data in MongoDB, Cassandra, and Cloudant NoSQL databases. – Train a machine learning model by creating an Apache Spark application. – Design, install, and manage an end-to-end data engineering platform.

Details of the courses that make up the specialization

Introduction to Data Engineering

Course 1

  • 12 hours
  • 4.7

Course Details

What you’ll learn

  • Develop a list of basic skills required for an entry-level data engineering role.
  • Learn about different stages and concepts in the data engineering lifecycle.
  • Describe data engineering technologies such as relational databases, NoSQL databases, and Big Data Engines.
  • Summary of information security, governance, and compliance concepts.

Skills you will acquire

  • Category: Shell Script
  • Category: Bash (Unix Shell)
  • Category: Extraction, Transformation, and Loading (ETL)
  • Category: Linux
  • Category: Linux commands

Python for Data Science, AI, and Development

Course 2

  • 25 hours
  • 4.6

Course Details

What you’ll learn

  • Learn Python – the most popular programming language for data science and software development.
  • Transform the logic of Python programming into variables, data structures, branching, loops, functions, objects, and classes.
  • Demonstrate the ability to use Python libraries such as Pandas and Numpy, and develop code using Jupyter Notebooks.
  • Access and scrape data from the web using APIs and Python libraries like Beautiful Soup.

Skills you will acquire

  • Category: Cloud Databases
  • Category: Mongodb
  • Category: Cassandra
  • Category: NoSQL
  • Category: Cloudant

Python Project for Data Engineering

Course 3

  • 9 hours
  • 4.6

Course Details

What you’ll learn

  • Show your skills in Python for working and processing data.
  • Implement web data scraping and use APIs to extract data with Python.
  • Play the role of a data engineer in a real project to extract, transform, and load data.
  • Use Jupyter Notebooks and IDEs to complete your project.

Introduction to Relational Databases (RDBMS)

Course 4

  • 15 hours
  • 4.6

Course Details

What you’ll learn

  • Describe data, databases, relational databases, and cloud databases.
  • Explain information and data models, relational databases, and relational model concepts (including schemas and tables).
  • Explain a relationship diagram and the design of a relational database for a specific use case.
  • Develop a working knowledge of popular DBMSs including MySQL, PostgreSQL, and IBM DB2.

Skills you will acquire

  • Category: Data Science
  • Category: Data Analysis
  • Category: Python Programming
  • Category: Numpy
  • Category: Pandas

Databases and SQL for Data Science with Python

Course 5

  • 20 hours
  • 4.7

Course Details

What you’ll learn

  • Analyze data within a database using SQL and Python.
  • Create a relational database and work with multiple tables using DDL commands.
  • Build basic to intermediate SQL queries using DML commands.
  • Build more powerful queries with advanced SQL techniques like Views, Transactions, Stored Procedures, and Joins.

Skills you will acquire

  • Category: Machine Learning
  • Category: Machine Learning Pipelines
  • Category: Data Engineer
  • Category: SparkML
  • Category: Apache Spark

A practical introduction to Linux commands and shell scripting

Course 6

  • 14 hours
  • 4.6

Course Details

What you’ll learn

  • Describe the Linux architecture and common Linux distributions and install software on a Linux system.
  • Execute information, file, content, navigation, compression, and network commands in the Bash shell.
  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.
  • Schedule cron jobs in Linux using crontab and explain the syntax of cron.

Skills you will acquire

  • Category: Python Programming
  • Category: Relational databases
  • Category: SQL
  • Category: NoSQL
  • Category: Data Pipes

Database Administration (DBA)

Course 7

  • 21 hours
  • 4.4

Course Details

What you’ll learn

  • Create, query, and configure databases and access and construct system objects such as tables.
  • Perform basic database management including database backup and recovery as well as managing user roles and permissions.
  • Netro and its optimization are important aspects of database performance.
  • Solve database issues like connection, login, and configuration and automate functions like reports, notifications, and alerts.

Skills you will acquire

  • Category: Python Programming
  • Category: Information Engineering
  • Category: Extraction, Transformation, and Loading (ETL)
  • Category: Data Engineer
  • Category: Data Scraping

ETL and data pipelines with Shell, Airflow, and Kafka

Course 8

  • 17 hours
  • 4.5

Course Details

What you’ll learn

  • Describe and differentiate between retrieval, transformation, loading (ETL) processes and extraction, loading, transformation (ELT) processes.
  • Explain concurrent vs. parallel execution modes.
  • Run the ETL process using Bash commands and Python functions.
  • Describe the components of data pipelines, processes, tools, and technologies.

Skills you will acquire

  • Category: Extraction, Transformation, and Loading (ETL)
  • Category: Data Engineer
  • Category: Apache Kafka
  • Category: Apache Airflow
  • Category: Data Pipes

Data warehouse fundamentals

Course 9

  • 15 hours
  • 4.4

Course Details

What you’ll learn

  • Job-ready skills in data warehousing in 6 weeks, supported by hands-on experience and IBM certification.
  • Design and populate a data warehouse, and sample and query data using CUBE, ROLLUP, and materialized views.
  • Identify popular data analysis and business intelligence tools and create data visualizations using IBM Cognos Analytics.
  • How to format and load data into a data warehouse, write summary queries, create functional query tables, and create an analytical dashboard.

Skills you will acquire

  • Category: Big Data
  • Category: SparkSQL
  • Category: SparkML
  • Category: Apache Hadoop
  • Category: Apache Spark

BI dashboards with IBM Cognos Analytics and Google Looker

Course 10

  • 11 hours

Course Details

What you’ll learn

  • Explore the purpose of analytics and business intelligence (BI) tools.
  • Leverage the capabilities of IBM Cognos Analytics and Google Looker Studio.
  • Demonstrate your ability to analyze DB2 data with IBM Cognos Analytics.
  • Create and share interactive dashboards using IBM Cognos Analytics and Google Looker Studio.

Skills you will acquire

  • Category: Python Programming
  • Category: Cloud Databases
  • Category: Relational Database Management System (RDBMS)
  • Category: SQL
  • Category: Jupyter notebooks

Introduction to NoSQL databases

Course 11

  • 18 hours
  • 4.6

Course Details

What you’ll learn

  • Distinguish between the four main categories of NoSQL databases.
  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.
  • Perform common tasks using MongoDB including create, read, update, and delete (CRUD).
  • Run Keyspace, Table, and CRUD operations in Cassandra.

Skills you will acquire

  • Category: Database Security
  • Category: Database Management System (DBMS)
  • Category: Database Servers
  • Category: Database Management
  • Category: Relational databases

Introduction to Big Data with Spark and Hadoop

Course 12

  • 19 hours
  • 4.4

Course Details

What you’ll learn

  • Explain the impact of Big Data, including use cases, processing tools, and methods.
  • Describe the Apache Hadoop architecture, system, methods, and related user applications, including Hive, HDFS, HBase, Spark, and MapReduce.
  • Apply the fundamental concepts of Spark programming, including parallel programming fundamentals for DataFrames, data systems, and Spark SQL.
  • Use Spark RDDs and data sets, optimize Spark SQL with Catalyst and Tungsten, and use Spark development and runtime options.

Skills you will acquire

  • Category: Data Science
  • Category: Database Management System (DBMS)
  • Category: Information Engineering
  • Category: SQL
  • Category: NoSQL

Machine Learning with Apache Spark

Course 13

  • 15 hours
  • 4.6

Course Details

What you’ll learn

  • Describe ML, explain its role in data engineering, summarize generative AI, discuss the uses of Spark, and analyze ML pipelines and the persistence model.
  • Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines to ML pipelines.
  • Build data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.
  • Show connection to Spark clusters, build ML pipelines, perform feature extraction and transformation, and report model persistence.

Skills you will acquire

  • Category: Business Intelligence
  • Category: Data Visualization
  • Category: IBM Cognos Analytics
  • Category: Google Looker Studio
  • Category: Dashboards

Final project in data engineering

Course 14

  • 16 hours
  • 4.7

Course Details

What you’ll learn

  • Demonstrate proficiency in the skills needed for an entry-level data engineering role.
  • Designed and implemented various concepts and components in the data engineering lifecycle such as databases.
  • Demonstrate practical knowledge with relational databases, NoSQL databases, Big Data engines, data warehouses, and data pipelines.
  • Use your Linux, SQL, and Python scripting skills to solve data engineering problems.

Skills you will acquire

  • Category: Convolutional Neural Network
  • Category: Information Engineering
  • Category: Database query
  • Category: Data Creation
  • Category: Generative AI

Generative AI: Advancing Your Data Engineering Career

Course 15

  • 12 hours
  • 4.9

Course Details

What you’ll learn

  • Utilize various generative tools and techniques in data engineering processes in various industries.
  • Implement a variety of data engineering processes such as data creation, augmentation, and anonymization using generative tools.
  • Practice generative AI skills in labs and hands-on projects to design a data warehouse schema and establish infrastructure.
  • Evaluate real-world cases that illustrate the successful use of generative AI for ETL and data warehouses.

Skills you will acquire

  • Category: Database Design (DB)
  • Category: PostgreSQL
  • Category: Relational Database Management System (RDBMS)
  • Category: Database Architecture
  • Category: MySQL

Data Engineering Career Guide and Interview Preparation

Course 16

  • 11 hours
  • 4.8

Course Details

What you’ll learn

  • Describe the role of a data engineer and some career options, as well as future opportunities in the field.
  • Explain how to build the foundations for a job search, including researching job listings, writing a resume, and preparing a portfolio.
  • Summarized what a candidate can expect during a typical job interview cycle, different types of interviews, and how to prepare for them.
  • Explain how to give an effective interview, including techniques for answering questions and how to present a professional personal presentation.

Skills you will acquire

  • Category: Dice
  • Category: Data warehouses
  • Category: Snowflake Schemas
  • Category: Data lakes
  • Category: ROLLUPS
  • Category: Data Marts
  • Category: Star Schemas