find duplicates problem

T

tw

Hi,

My scenario is that i have 2 system with name and adress (100.000 names),
that have to be merged into 1 system without any duplicates.

The problem is that the spelling is not 100% between the system.

One way to find duplicate is to group name,adress and count > 1.

My dream is to use the sound index "Difference" so can i get around the
spelling problem.

DIFFERENCE
Returns the difference between the SOUNDEX values of two character
expressions as an integer.
Syntax
DIFFERENCE ( character_expression , character_expression )

Is that possible to use DIFFERENCE to find duplicates?

And how should the t-sql look like?

Example
name adress city
charles way1 state1
charle waj1 stat1
charlez vay1 stat1

I want to find this example, that this 3 is duplicates.

Should i use ordinary way with group and count >1, this would not be
duplicates.

Help

Thanx

Tw
 
K

KARL DEWEY

I do not think Access has a Soundex function. But if it does then I would
use groupng and sum <>0.
 
Top