Course overview
The aim of this course is to design ingestion mechanisms from existing databases that prepare data for further analysis. The aim of this course is topics covered in this course include: different data types and formats; regular expressions; web scraping; data exploration, visualisation and preliminary analysis, data profiling and exploratory data analysis; data quality verification, missing data analysis; data wrangling; data cleansing; filtering; sampling; normalisation; visualisations.
Course learning outcomes
- Assemble requirements and implement processes for data structuring, cleaning, and enrichment via utilising multiple tools.
- Identify the alignment required between data collection and analysis.
- Design sustainable data management processes that support the analysis and report phases.
- Create effective documentation reflecting the structure of data sets prepared for further analysis.
- Compare the privacy and ethical implications of sourcing internal and external data.
- Perform self-service data preparation.
Degree list
The following degrees include this course