Channel: voidtools forum

Everything 1.5 Alpha • Re: soundex

Looking at that article, you would need to solve word rearrangement.
For example, "Edinburgh Tasks" should be viewed as almost matching "Tasks Edinburgh".
One of the reasons I adopted a simple word pair count for my own searching is that rearranged blocks come out with a score close to the original, as do small drops, changes and insertions. The price you pay is that some surprises get included, but I've always thought it's better to have a few false inclusions than to have some good matches not show up.
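To make the idea concrete, here is a rough Python sketch of one possible reading of a "word pair count". The post doesn't spell out the exact method, so this is an assumption: it counts adjacent letter pairs inside each word and scores two names by how many pairs they share (a Dice-style ratio). The names letter_pairs and pair_score are made up for illustration.

```python
import re
from collections import Counter


def letter_pairs(text):
    """Count adjacent letter pairs within each word (case-insensitive).

    Assumption: pairs never span a word boundary, so word order
    does not affect the counts.
    """
    pairs = Counter()
    for word in re.findall(r"\w+", text.lower()):
        for i in range(len(word) - 1):
            pairs[word[i:i + 2]] += 1
    return pairs


def pair_score(a, b):
    """Return a similarity in [0, 1]: twice the shared pairs over all pairs."""
    pa, pb = letter_pairs(a), letter_pairs(b)
    total = sum(pa.values()) + sum(pb.values())
    if total == 0:
        return 0.0
    # Counter & Counter keeps the minimum count of each common pair.
    shared = sum((pa & pb).values())
    return 2.0 * shared / total


print(pair_score("Edinburgh Tasks", "Tasks Edinburgh"))  # rearranged words
print(pair_score("Edinburgh Tasks", "Edinburg Task"))    # small drops
```

Under this reading, rearranged words score the same as the original and small drops only lose a pair or two. The surprises come from unrelated names that happen to share a lot of letter pairs.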
There may be some very optimised processes for tackling block rearrangement, although if a routine is made too intricate it runs the risk of being slower than desirable... possibly not an issue for 500,000 filenames with something written in Assembly.
d

Statistics: Posted by meteorquake — Sat Jun 29, 2024 4:56 pm


