elasticsearch.helpers.parallel_bulk() hangs forever on Python 3.7.0
I haven’t had a chance to dig into what’s new in Python 3.7.0, or into how elasticsearch.helpers.parallel_bulk() works, but for some reason it hangs forever on Python 3.7.0. elasticsearch.helpers.streaming_bulk() works fine on Python 3.7.0, and my app isn’t indexing-heavy, so I’ve just switched to streaming for now.
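For readers who want to apply the same workaround, here is a minimal sketch of indexing through streaming_bulk() instead of parallel_bulk(). The index name, the document shape, and the helper names generate_actions and index_all are hypothetical and not taken from the issue; the client is assumed to be an already-configured Elasticsearch instance, and older clusters may also need a `_type` key in each action.

```python
def generate_actions(docs, index_name):
    """Wrap plain dicts in the action format the bulk helpers accept."""
    for doc in docs:
        # Older clusters may additionally require a "_type" key here.
        yield {"_index": index_name, "_source": doc}


def index_all(client, docs, index_name="my-index"):
    """Index docs sequentially and return (succeeded, failed) counts.

    `client` is assumed to be an elasticsearch.Elasticsearch instance.
    """
    # Imported here so this module loads even without the client installed.
    from elasticsearch.helpers import streaming_bulk

    succeeded = failed = 0
    for ok, _result in streaming_bulk(
        client,
        generate_actions(docs, index_name),
        chunk_size=500,
        raise_on_error=False,  # report per-document failures instead of raising
    ):
        if ok:
            succeeded += 1
        else:
            failed += 1
    return succeeded, failed
```

streaming_bulk() sends chunks one after another from the calling thread, so it does not go through the thread pool that parallel_bulk() uses. The trade-off is lower throughput, which matches the "not indexing-heavy" case described above.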
Issue Analytics
- State:
- Created 5 years ago
- Reactions: 2
- Comments: 8 (7 by maintainers)
Top GitHub Comments
I submitted a pull request. With it, I am able to run the tox tests, though with concurrency bugs there is always the chance that a fix doesn’t fully address the underlying problem. I see the symptoms but have not fully root-caused them. It could be that any blocking during teardown triggers the hang, and that increasing the queue size just reduces the chance of that happening.
https://github.com/elastic/elasticsearch-py/pull/894
I’ll be testing out a larger pool size on my installation to see how the change affects real-world runs, and whether hangs still occur occasionally.
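The hypothesis in this comment (a blocking put during teardown, made less likely by a larger queue) can be illustrated with a simplified standard-library model. This is only a sketch of the general failure mode, not the library's code or the contents of the pull request: the function name sentinels_fit is hypothetical, and it assumes teardown enqueues one shutdown sentinel per worker while no worker is draining the queue.

```python
import queue


def sentinels_fit(queue_size, thread_count, timeout=0.1):
    """Return True if one sentinel per worker fits in a bounded queue.

    Models a teardown in which nothing is consuming from the queue.
    A real blocking put() has no timeout, so the False case below
    corresponds to a hang rather than an exception.
    """
    q = queue.Queue(maxsize=queue_size)
    try:
        for _ in range(thread_count):
            q.put(None, timeout=timeout)  # None stands in for a shutdown sentinel
    except queue.Full:
        return False
    return True


if __name__ == "__main__":
    print(sentinels_fit(queue_size=4, thread_count=4))  # queue is large enough
    print(sentinels_fit(queue_size=2, thread_count=4))  # third put would block
```

In this model, sizing the queue to at least the number of workers removes the blocking put. Whether that fully covers the real hang is exactly the open question raised in the comment.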
The deadlock may be a variant of: https://bugs.python.org/issue14976