The lack of hermetics in DocumentBatch #172

Open
SOLR4189 opened this issue Jun 21, 2018 · 1 comment

Comments

SOLR4189 commented Jun 21, 2018

Hi, I have another problem:
When I pass my docs through the Monitor in batches (3000 docs per batch), I don't get all matching pairs. When I pass them one doc per batch, I get all results. What could cause this? Does Luwak have a batch size limit? I didn't find one.

I'm using ParallelMatcher with SimpleMatcher inside (scores don't matter to me), and the monitor has only one query loaded.

SOLR4189 (Author) commented

OK, I found the problem. DocumentBatch takes its analyzers from the first document in the batch only (line 187 in DocumentBatch.java). So matching fails whenever another doc in the batch has fields that the first doc doesn't have.

Temporary workaround: when I build a batch, I collect the analyzers from all docs in the batch, so each doc gets every possible analyzer for every possible field (even for fields it doesn't have).

Better solution: DocumentBatch should union the analyzers of all its documents itself. What do you think?
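
To make the difference concrete, here is a minimal sketch of the two strategies in plain Java. It does not use Luwak or Lucene classes: each document is modelled as a hypothetical map from field name to an analyzer name, and `analyzersFromFirstDoc` and `unionAnalyzers` are illustrative names, not Luwak API. It also assumes that when two docs declare different analyzers for the same field, the first one seen wins; the real fix might need a different rule.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class AnalyzerUnionSketch {

    // Mimics the reported behaviour: only the first document's
    // field -> analyzer mapping is used for the whole batch.
    static Map<String, String> analyzersFromFirstDoc(List<Map<String, String>> batch) {
        return batch.isEmpty() ? Map.of() : batch.get(0);
    }

    // Proposed behaviour: union the field -> analyzer mappings of every
    // document, so fields missing from the first doc are still covered.
    // Assumption: on a conflict, the first analyzer seen for a field is kept.
    static Map<String, String> unionAnalyzers(List<Map<String, String>> batch) {
        Map<String, String> union = new LinkedHashMap<>();
        for (Map<String, String> docAnalyzers : batch) {
            for (Map.Entry<String, String> e : docAnalyzers.entrySet()) {
                union.putIfAbsent(e.getKey(), e.getValue());
            }
        }
        return union;
    }

    public static void main(String[] args) {
        // Two hypothetical docs: the second has a "body" field the first lacks.
        List<Map<String, String>> batch = List.of(
                Map.of("title", "standard"),
                Map.of("title", "standard", "body", "whitespace"));

        System.out.println(analyzersFromFirstDoc(batch).containsKey("body")); // false
        System.out.println(unionAnalyzers(batch).containsKey("body"));        // true
    }
}
```

With the first strategy the `body` field of the second doc has no analyzer registered, which matches the missing results I saw with large batches; with the union every field in the batch is covered.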
