Best cross-language analyzer to use with a Lucene index
Posted
by Halirob
on Stack Overflow
Published on 2009-06-16T11:50:08Z
Indexed on
2010/05/03
6:28 UTC
Hello, I'm looking for feedback on which analyzer to use with an index that contains documents in multiple languages. Currently I am using SimpleAnalyzer, as it seems to handle the widest range of languages. Most of the documents to be indexed will be in English, but an occasional document in a double-byte language will be indexed as well.
Are there any other suggestions, or should I just stick with SimpleAnalyzer?
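For context, here is a rough sketch of how I understand SimpleAnalyzer to behave: it splits text at every non-letter character and lower-cases what is left. This is plain Java that only imitates that behaviour and does not call Lucene at all; the class name `SimpleAnalyzerSketch` and the method `tokenize` are made up for illustration, so treat it as an approximation rather than the library's actual code.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for SimpleAnalyzer-style tokenization (not Lucene code).
final class SimpleAnalyzerSketch {

    // Emits maximal runs of letters, lower-cased; everything else is a separator.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (Character.isLetter(cp)) {
                current.appendCodePoint(Character.toLowerCase(cp));
            } else if (current.length() > 0) {
                // A non-letter ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // English: punctuation and digits act as separators.
        System.out.println(tokenize("Hello, Lucene-2009 index!"));
        // Japanese without spaces: every character is a letter,
        // so the whole phrase comes out as a single token.
        System.out.println(tokenize("日本語のテキスト"));
    }
}
```

If this approximation is right, English text is split into sensible words, but an unspaced run of double-byte text ends up as one long token, which is the part I am unsure about.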
Thanks
© Stack Overflow or respective owner