A Domain-Independent Data Cleaning Algorithm for Detecting Similar-Duplicates.

AllImages Books Videos Maps News Shopping

Scholarly articles for A Domain-Independent Data Cleaning Algorithm for Detecting Similar-Duplicates.

scholar.google.com › citations

… Cleaning Algorithm for Detecting Similar-Duplicates.
Ripon · Cited by 28

A Domain-Independent Data Cleaning Algorithm for Detecting Similar ...

www.researchgate.net › publication › 22...

Oct 22, 2024 · In this paper, we propose a novel domain-independent technique for better reconciling the similar-duplicate records. We also introduce new ideas ...

[PDF] A Domain-Independent Data Cleaning Algorithm for Detecting Similar ...

citeseerx.ist.psu.edu › document

The standard method for detecting exact duplicates is to sort the dataset and then to perform an exact matching with the neighbor records to determine whether ...

A Domain-Independent Data Cleaning Algorithm for Detecting ...

www.jcomputers.us › ...

In this paper, we propose a novel domain-independent technique for better reconciling the similar-duplicate records. We also introduce new ideas for making ...

A Domain-Independent Data Cleaning Algorithm for Detecting Similar ...

www.semanticscholar.org › paper › A-D...

A novel domain-independent technique for better reconciling the similar-duplicate records is proposed and new ideas for making similar- DUplicate detection ...

A domain independent similar-duplicate detection algorithm for data ...

www.researchgate.net › ... › Data Cleaning

PDF | On Dec 21, 2009, Kazi Shah Nawaz Ripon and others published A domain independent similar-duplicate detection algorithm for data cleaning | Find, ...

A Domain-Independent Data Cleaning Algorithm for Detecting Similar ...

www.scilit.net › publications

The detection of similar-duplicate records is a difficult task, especially when the records are domain-independent. In this paper, we propose a novel domain- ...

[PDF] An efficient domain-independent algorithm for detecting - UCSD CSE

cseweb.ucsd.edu › approxdup

In this paper we study the problem of detecting records in a database that are duplicates of each other, but not necessarily textually identical. This is a ...

Missing: Cleaning | Show results with:Cleaning

A Domain-Independent Data Cleaning Algorithm For Detecting ...

fr.scribd.com › document

Data mining algorithms generally assume that data will be clean and consistent. The detection of similar-duplicate records is a difficult task, ...

All-Three: Near-optimal and domain-independent algorithms for ...

www.sciencedirect.com › article › pii

The most predominant domain-independent algorithm for near-duplicate detection is that of Monge-Elkan (ME) [4,14]. This seminal work is based on stretching ...

(PDF) An Efficient Domain-Independent Algorithm for Detecting ...

www.academia.edu › An_Efficient_Dom...

1 Introduction In this paper we study the problem of detecting records in a database that are duplicates of each other, but not necessarily textually identical.