How do I compare two columns where 1 is used to find errors in the 2nd

RaymondC

I have two lists of names in 2 separate columns.

1st list:
Column of 50 correct names

2nd list:
Has a column with 2000 names. The names are repeated and may not be
spelled correctly.

I need to compare the 2000 names in the 2nd list against the 50 names
in the 1st list and generate a 3rd list telling me of the errors in the
2nd list when compared against the 1st list.
[ie: if the 1st list has a correct name of Joe Jones, but the 2nd list
has a name entered as Jon Jones, how do I get the name Jon Jones to
show in the 3rd list?]

Thanks for your soonest reply.
 
Daniel.M

Hi,

Do a Google Groups search in the *excel* groups for "fuzzy match" and/or
"approximate string match" and/or "edit distance" and you should find a couple
of implementations. Or at least a couple of functions that express the 'edit
distance' as a similarity percentage (0 = very far apart, 100 = identical).

Warning: there are quite a few algorithms that do that. Each one has its
strengths and weaknesses, so you should select one that suits your needs closely.
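
To make the idea concrete, here is a rough sketch in Python (not Excel) of one such approach: Levenshtein edit distance turned into a 0-100 similarity score. The function names (edit_distance, similarity, find_errors) and the sample names other than Joe Jones / Jon Jones are made up for illustration, and Levenshtein is only one of the possible algorithms. It assumes a name counts as an error when it does not match any correct name exactly, and it reports the closest correct name as a suggestion.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: fewest single-character inserts,
    deletes, or substitutions needed to turn a into b."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # delete from a
                               current[j - 1] + 1,       # insert into a
                               previous[j - 1] + cost))  # substitute
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Similarity as a percentage: 0 = very far apart, 100 = identical."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100.0
    return 100.0 * (1 - edit_distance(a, b) / longest)


def find_errors(correct_names, entered_names):
    """Return (entered name, closest correct name, similarity %) for each
    distinct entered name that is not an exact match in the correct list."""
    correct = set(correct_names)
    seen = set()
    errors = []
    for name in entered_names:
        if name in correct or name in seen:
            continue  # exact match, or this misspelling was already reported
        seen.add(name)
        best = max(correct_names,
                   key=lambda c: similarity(name.lower(), c.lower()))
        score = round(similarity(name.lower(), best.lower()), 1)
        errors.append((name, best, score))
    return errors


correct_list = ["Joe Jones", "Mary Smith"]                           # 1st list
second_list = ["Joe Jones", "Jon Jones", "Jon Jones", "Mary Smyth"]  # 2nd list
for entered, suggestion, score in find_errors(correct_list, second_list):
    print(entered, "->", suggestion, score)
```

With the sample data above, Jon Jones and Mary Smyth end up in the 3rd list, each reported once even though Jon Jones is repeated. The exact comparison is case-sensitive and does not trim stray spaces, so you may want to normalise the names first if that matters for your data.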

Regards,

Daniel M.
 
