Limit on organization comparison tokens
Posted: Thu Dec 17, 2015 12:02 pm
It seems that when organization names are derived they produce a compare string that contains a maximum of four non-anonymous tokens. For example, let's say we have these two records:
ACME AIRCRAFT PARTS CO DBA BIG DOG MECHANICS
and
BIG DOG MECHANICS
then, after elimination of anonymous values like CO and DBA, the two comparison strings are:
ACME:AIRCRAFT:PARTS:BIG
and
BIG:DOG:MECHANICS
Notice that the first comparison string only contains the first four non-anonymous values. Hence the matching is less than desirable.
How can we overcome this limit of four tokens? The Max Bucket Tokens limit is obviously not the problem as we have that set at 6.
ACME AIRCRAFT PARTS CO DBA BIG DOG MECHANICS
and
BIG DOG MECHANICS
then, after elimination of anonymous values like CO and DBA, the two comparison strings are:
ACME:AIRCRAFT:PARTS:BIG
and
BIG:DOG:MECHANICS
Notice that the first comparison string only contains the first four non-anonymous values. Hence the matching is less than desirable.
How can we overcome this limit of four tokens? The Max Bucket Tokens limit is obviously not the problem as we have that set at 6.