Lower case versus capital

marcus 8 Feb 2012 20:18
For solr, we use http://tartarus.org/martin/PorterStemmer/, with exceptions manually added (e.g don't stem animation and animal to the same word. Porter stems very aggressively, work, works, working, worked all stem to the same root.

On my_uploads, you don't use solr but it should use something close to Porter.
marcus 8 Feb 2012 20:21
Regarding "If the search is the same I wonder why the Keyword input accepts "Home" and "home" as two different words and counts them as two different words?"

Probably the caching layer got confused and didn't think they the same. Thus in one case you were looking at a cached set and in the next case you forced a new cache generation. When I searched for Home and home just now I got the same results.
jason 8 Feb 2012 20:23
And those were the same results I've gotten.
RekindlePhoto 8 Feb 2012 21:01
Thanks Marcus, good to know the "ed", "s" and "ing" are not needed, just the root or main word.
Thanks
wideweb 8 Feb 2012 22:52
What about
man men
child children
etc
RekindlePhoto 8 Feb 2012 23:49
or man, men, and male ... and possibly "jerk" or "stupid" ... ;)
< 1 2
페이지로 이동