Best cross-language analyzer to use with lucene index

Posted by Halirob on Stack Overflow See other posts from Stack Overflow or by Halirob
Published on 2009-06-16T11:50:08Z Indexed on 2010/05/03 6:28 UTC
Read the original article Hit count: 205

Filed under:
|

Hello, I'm looking for feedback on which analyzer to use with an index that has documents from multiple languages. Currently I am using the simpleanalyzer, as it seems to handle the broadest amount of languages. Most of the documents to be indexed will be english, but there will be the occasional double-byte language indexed as well.

Are there any other suggestions or should I just stick with the simpleanalyzer.

Thanks

© Stack Overflow or respective owner

Related posts about lucene

Related posts about java