Hi,

I have two data sets, set A and set B of numberical identifiers. B is a subset of A. Set A is about 20,000 lines and set B is a bit less. What I need to do is find all the identifiers in A that are not in B. The second part of this problem is that the identifiers of B are not the same in A, however the B identifier is embedded within the A identifier. So if the identifier for B is 234 the identifier for A is probably something like 000234000.

Solving the problem with sorting doesn't help much because I have to review about 20,000 lines.

Any ideas are appreciated.