Skip to content

imjbmkz/Building-Your-First-End-to-End-Data-Portfolio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Building Your First End-to-End Data Portfolio

Hello! This is Josh Dev. In this series, we created an end-to-end data project that can be a good start for a data portfolio regardless if you are a beginner or an experienced data professional. This end-to-end project covers the major phases of a data project: from creation of data pipelines, visualizing and reporting data, and acquiring deeper insights.

Overview of the Project

Alt text This project covers the end-to-end business intelligence cycle.

  • The data engineering side covered development of a two-step data pipeline:
    • First step covers the extraction of data from online sources using Python and uploads the extracted files to an SFTP server
    • In the second step, the uploaded files in SFTP will be downloaded and loaded to the data warehouse
  • An API has been developed in order to use the data extracted from the web to support compliance requirement for AML screening
  • In the data analysis side of the project, an exploratory analysis has been conducted to the car sales dataset and visualized the results using Power BI
  • In the data science part of the project, a simple linear regression model has been built to analyze the mtcars dataset and to derive relevant insights

Episodes

  1. Introduction and project overview (recording)
  2. Version control and virtual environment essentials for data professionals (recording)
  3. Extracting data to FTP using Python (recording | project)
  4. Loading CSV files from FTP to PostgreSQL using SSIS (recording | project)
  5. Developing screening API using FastAPI (recording | | project)
  6. Data modeling and visualization using Power BI (recording)
  7. Supervised machine learning and regression analysis primer (recording)
  8. Creating a machine learning pipeline on house price dataset using sklearn

About

Let's make our first end-to-end data portfolio!

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages