Lucene (.NET) Document stucture and performance suggestions.

Posted by Josh Handel on Stack Overflow See other posts from Stack Overflow or by Josh Handel
Published on 2010-05-15T13:39:42Z Indexed on 2010/05/15 13:44 UTC
Read the original article Hit count: 331

Filed under:
|
|
|

Hello, I am indexing about 100M documents that consist of a few string identifiers and a hundred or so numaric terms.. I won't be doing range queries, so I haven't dugg too deep into Numaric Field but I'm not thinking its the right choose here.

My problem is that the query performance degrades quickly when I start adding OR criteria to my query.. All my queries are on specific numaric terms.. So a document looks like StringField:[someString] and N DataField:[someNumber].. I then query it with something like DataField:((+1 +(2 3)) (+75 +(3 5 52)) (+99 +88 +(102 155 199))).

Currently these queries take about 7 to 16 seconds to run on my laptop.. I would like to make sure thats really the best they can do.. I am open to suggestions on field structure and query structure :-).

Thanks

Josh

PS: I have already read over all the other lucene performance discussions on here, and on the Lucene wiki and at lucid imiagination... I'm a bit further down the rabbit hole then that...

© Stack Overflow or respective owner

Related posts about lucene

Related posts about lucene.net