[BOOK] Data cleaning
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …
A survey on truth discovery
Thanks to information explosion, data for the objects of interest can be collected from
increasingly more sources. However, for the same object, there usually exist conflicts among …
[BOOK] Knowledge graphs
Smart speakers such as Alexa and Google Home introduced Artificial Intelligence (AI) in
millions soon billions of households, making AI an everyday experience. We can now look …
Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation
In many applications, one can obtain descriptions about the same objects or events from a
variety of sources. As a result, this will inevitably lead to data or information conflicts. One …
A confidence-aware approach for truth discovery on long-tail data
In many real world applications, the same item may be described by multiple sources. As a
consequence, conflicts among these sources are inevitable, which leads to an important …
Where the truth lies: Explaining the credibility of emerging claims on the web and social media
The web is a huge source of valuable information. However, in recent times, there is an
increasing trend towards false claims in social media, other web-sources, and even in news …