Python Data Engineering - Learn Data Engineering with Python: visualize, manage and manipulate data at scale
Log InTry Anthropos
Skill Path

Python Data Engineering - Learn Data Engineering with Python: visualize, manage and manipulate data at scale

Intermediate18h 44min7 chapters1 simulationsLast updated 10/2025
Embark on the Python Data Engineering skill path, tailored for software engineers aiming to master data management. This course guides you from the fundamentals of Python to advanced techniques in data manipulation and management. You'll learn to efficiently process and analyze large datasets using pandas and NumPy, and visualize your insights with Matplotlib. The curriculum covers all phases of data treatment—from initial data wrangling to the final presentation of your findings. Ideal for software engineers seeking to enhance their data engineering skills and work effectively with data at scale.
Chris KochBy Chris Koch · Instructional designer, Instructor and Sr. Software Engineer
Skill objectives
Chapter 01
What is Data Engineering
In this chapter we are going to learn more about the definition of data engineering and what that really takes into account. You will learn the tasks and core competencies of data engineers and we will introduce you to a few key concepts like ETL
Skills: Data engineering skills
Chapter 02
Python Data Tools: how to manage data at scale with Python
This chapter is fully focused on the tools you can use with Python to manage large datasets of data. You will learn how to use Jupyter notebooks, Numpy and Pandas.
Skills: Pandas, Python, NumPy, Jupyter Notebook
Chapter 03
Extracting data with Python and other tools
In this chapter, we dive deep into data extraction using various techniques. We will use Python and SQL in a variety of environments and use cases.
Skills: Data Mapping and Extraction, Microsoft Excel, Selenium, SQL
Chapter 07
Building a data pipeline
A data pipeline automates the flow of data from sources to storage and analysis. It is the final result of everything we have seen in this path: It involves extraction, transformation, and loading (ETL) processes to prepare and move data efficiently.
Skills: Data Pipelines, ETL Pipeline Development, ETL (Extract, Transform, Load), Data engineering skills, Dataset Validation, Data Manipulation, Data Mapping, Data Modeling, Data Science Fundamentals