Overview
Real-world data are often messy, distributed, and heterogeneous. This course introduces core concepts of data cleaning, standardization, and integration, which aim to convert and map raw data into formats that allow more efficient and convenient use and analysis. The course also discusses data quality, management, and storage issues relevant to data analytics.
Due to the current situation, the lecture will be held online in Moodle. It consists of recorded videos as well as interactive lectures. There will also be practical exercises.