Administrative data linkage in Brazil: potentials for health technology assessment
Health technology assessment (HTA) is the systematic evaluation of the properties and
impacts of health technologies and interventions. In this article, we presented a discussion of …
Cloud-scale entity resolution: current state and open challenges
Entity resolution (ER) is a process to identify records in information systems, which refer to
the same real-world entity. Because in the two recent decades the data volume has grown …
Building the national database of health centred on the individual: administrative and epidemiological record linkage-Brazil, 2000-2015
AAG Junior, RG Pereira, EI Gurgel… - … Journal of Population …, 2018 - ncbi.nlm.nih.gov
Objective To describe the methods and results of parameter setting that are needed to
execute the probabilistic deduplication of large administrative and epidemiological …
End-to-end task based parallelization for entity resolution on dynamic data
L Gazzarri, M Herschel - 2021 IEEE 37th International …, 2021 - ieeexplore.ieee.org
Entity resolution (ER) is the problem of finding which digital representations of entities
correspond to the same real-world entity. In many Big Data scenarios, in addition to the …
Large-scale schema-free data deduplication approach with adaptive sliding window using mapreduce
K Ma, F Dong, B Yang - The Computer Journal, 2015 - ieeexplore.ieee.org
Data deduplication is the task of identifying all groups of objects within one or several data
sets, respectively. However, this task will become difficult in the context of big data. To …
A fast approach for parallel deduplication on multicore processors
In this paper, we propose a fast approach that parallelizes the deduplication process on
multicore processors. Our approach, named MD-Approach, combines an efficient blocking …
An efficient indexing mechanism for data deduplication
TT Thwel, NL Thein - … Conference on the Current Trends in …, 2009 - ieeexplore.ieee.org
At present, there is a vast amount of duplicated data or redundant data in storage systems.
Data de-duplication can eliminate multiple copies of the same file and duplicated segments …
Efficient Cross User Client Side Data Deduplication in Hadoop.
Hadoop is widely used for applications like Aadhaar card, Healthcare, Media, Ad Platform,
Fraud Detection & Crime, and Education etc. However, it does not provide efficient and …
Towards efficient and effective entity resolution for high-volume and variable data
X Chen - 2020 - repo.bibliothek.uni-halle.de
Entity Resolution (ER), as a process to identify records that refer to the same real-world entity,
faces challenges that big data has brought to it. On the one hand, high-volume data forces …
A method of object-based de-duplication
F Yan, YA Tan - Journal of Networks, 2011 - Citeseer
Today, the world is increasingly awash in more and more unstructured data, not only
because of the Internet, but also because data that used to be collected on paper or media …