Glossary
Semester (Sem)
1 First Semester
2 Second Semester
A Annual course
Educational activities
B Identifying activities
Language
Course completely offered in Italian
Course completely offered in English
-- Not available
Innovative teaching
The credits shown next to this symbol indicate the part of the course CFUs provided with Innovative teaching.
These CFUs include:
  • Subject taught jointly with companies or organizations
  • Blended Learning & Flipped Classroom
  • Massive Open Online Courses (MOOC)
  • Soft Skills
Course Details
Context
Academic Year: 2014/2015
School: School of Industrial and Information Engineering
Name: (Master of Science degree)(ord. 270) - MI (434) Engineering of Computing Systems
Track: T2B - Engineering of computing systems
Programme Year: 2

Course Details
ID Code: 096879
Course Title: DATA QUALITY: MAXIMIZING VALUE THROUGH MODELING, ASSESSMENT AND IMPROVEMENT
Course Type: Mono-Disciplinary Course
Credits (CFU / ECTS): 5.0
Semester: --
Course Description The course introduces the basic concepts, models and techniques of data quality. It aims to provide the tools to assess and improve the quality of the data used in different processes, in order to avoid errors and inefficiencies.

Data errors, inconsistencies or delays usually degrade the output of a process, whether a business process or a purely computational one. In most cases these problems stem from the poor quality of the data used. The issue is perceived as important in different fields and for different data sources (e.g., structured databases, logs, social media content, sensor values). One of the main goals of Data Quality research is to assess and ultimately increase the reliability and value of the data in use.

In recent years, several comprehensive methodologies for Data Quality management have been proposed. They include techniques and procedures to analyze data quality problems, define Data Quality dimensions, and measure and improve data quality levels.

This course aims to:
- introduce the basic elements of Data Quality management;
- provide an overview of the current techniques used to assess the most common data quality dimensions in different data sources, i.e., accuracy, precision, completeness, timeliness and consistency. The course shows how the formulas and methods used for assessment vary with the type of data source and, consequently, with the type of data, e.g., numerical vs. text values, structured vs. unstructured data;
- discuss the main data quality issues in data fusion: duplicate detection and conflict resolution;
- illustrate the techniques to improve data quality levels. The course presents both value-based improvement techniques (e.g., data cleaning) and process-based improvement techniques; and
- discuss the main open data quality issues in new fields such as the Internet of Things (IoT) and big data.
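As an illustration of two topics named in the description, the sketch below computes a completeness score for a column and groups candidate duplicate records. It is not taken from the course material. It assumes the common definition of completeness as the fraction of non-missing values, and uses exact matching on a normalized key as a deliberately naive stand-in for duplicate detection. The function names and the sample records are hypothetical.

```python
from typing import Optional, Sequence


def completeness(values: Sequence[Optional[object]]) -> float:
    """Fraction of values that are present (not None).

    Assumption of this sketch: an empty column counts as fully complete.
    """
    if not values:
        return 1.0
    present = sum(1 for v in values if v is not None)
    return present / len(values)


def duplicate_groups(records, key):
    """Group record indices whose normalized key values match exactly.

    Normalization here is only lowercasing and whitespace collapsing;
    real duplicate detection typically uses similarity measures instead.
    """
    groups = {}
    for i, rec in enumerate(records):
        norm = " ".join(str(rec[key]).lower().split())
        groups.setdefault(norm, []).append(i)
    return [idx for idx in groups.values() if len(idx) > 1]


# Hypothetical sample data, for illustration only.
records = [
    {"name": "Ada Lovelace", "city": "London"},
    {"name": "ada  lovelace", "city": None},
    {"name": "Alan Turing", "city": None},
    {"name": "Grace Hopper", "city": "New York"},
]

city_completeness = completeness([r["city"] for r in records])
dups = duplicate_groups(records, "name")
print(city_completeness, dups)
```

In this sample two of the four city values are missing, and the first two records share a name once case and spacing are normalized. Deciding which city value to keep for such a pair is the conflict-resolution step that the description mentions alongside duplicate detection.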
Scientific-Disciplinary Sector (SSD)
Educational activities | SSD Code | SSD Description | CFU
B | ING-INF/05 | INFORMATION PROCESSING SYSTEMS | 5.0

Alphabetical group: from A (included) to ZZZZ (excluded)
Lecturer(s): Cappiello Cinzia
18/02/2025