Data Quality for streaming applications
Andrei Robert Zannelli
Data Quality for streaming applications.
Rel. Paolo Garza. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2021
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (9MB) | Preview |
Abstract
The topic of big data has become highly sought after in recent years and with it all the problems that they entail. The ability to analyze large amounts of data in an innovative way has made it possible to facilitate the development and the enormous production of data from countless sources such as social media, sensors, industrial machines or simply server logs has certainly encouraged the development of the big data field. With the increase in the production speed of all these types of data, we have begun to speak of streaming data, to indicate the production of data in near real time.
Of course, with the acceleration of production, the need to hasten their analysis rose too and there were many answers proposed by the top players in the sector, such as Apache Spark and its two components dedicated to streaming, DStreams and Structured Streaming
Relatori
Tipo di pubblicazione
URI
![]() |
Modifica (riservato agli operatori) |
