parallel_bulk not actually doing anything
See original GitHub issueI’m trying to make reindexing big indices faster.
for response in helpers.parallel_bulk(target_client, _change_doc_index(docs, target_index),
chunk_size=10, thread_count=10, ** kwargs):
print("response", response)
Nothing appears to happen… Switching to regular bulk things work fine…
Also, siginterrupt doesn’t work with parallel bulk running
Issue Analytics
- State:
- Created 8 years ago
- Reactions:1
- Comments:12 (5 by maintainers)
Top Results From Across the Web
Elasticsearch parallel bulk using Python - issue with json
Hi All - I am a newbie with ELasticsearch and I am encountering strange issue . Specifications: I have a Json file of...
Read more >python 2.7 - Connection timeout on Elasticsearch parallel_bulk
Sincerly I never do a bulk import with so many docs to indicize. I don't know why this error appears. In your case...
Read more >Bulk API - Records in different parallel batches not visible in ...
I think the problem is in the query for primary addresses. New address records only become visible in SOQL queries in the After-context....
Read more >FORALL Insert With Parallel DML Enabled - Ask TOM
You an use NOAPPEND in order to change the default, but that will disable the parallel mode. It is a "feature", it is...
Read more >Flow Performance with parallel flows
I don't think SharePoint handles efficiently parallel bulk ... data which doesnt make sense that it would be impacted by other flows doing ......
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Hey, thanks. This solved my problem.
I don’t really need the return results. So I was wondering why
bulk(es, qs)
work andparallel_bulk(es, qs)
does not at first.parallel_bulk
is a generator so you need to iterate over it to get to the results, see https://github.com/elastic/elasticsearch-py/issues/336