Assess performance of parallel io.ascii reader with 'spawn' multiprocessing method
See original GitHub issueBased on my understanding of the fast C reader after the work I did in #8853, currently when using the multiprocessing ‘spawn’ method, it might be that the parallel reader is not efficient because it looks like it will pickle the whole contents of the table (via the __reduce__
method). This would mean memory mapping isn’t used. So this issue is a reminder that we should check if this is indeed the case. If so, we might want to consider passing the filename to sub-processes rather than the memory mapped bytes string.
Issue Analytics
- State:
- Created 4 years ago
- Comments:18 (18 by maintainers)
Top Results From Across the Web
Fast reader in io.ascii does not work with multiprocessing ...
The multiprocessing module in Python supports several modes - including fork ... Assess performance of parallel io.ascii reader with 'spawn' ...
Read more >Python Multiprocessing Pool: The Complete Guide
This guide provides a detailed and comprehensive review of the Multiprocessing Pool in Python, including how it works, how to use it, ...
Read more >multiprocessing — Process-based parallelism — Python 3.11 ...
import multiprocessing as mp def foo(q): q.put('hello') if __name__ == '__main__': mp.set_start_method('spawn') q = mp.Queue() p = mp.Process(target=foo ...
Read more >using Python's multiprocessing module - Sebastian Raschka
In this introduction to Python's multiprocessing module, we will see how we can spawn multiple subprocesses to avoid some of the GIL's ...
Read more >Run Code in Parallel Using the Multiprocessing Module
In this video, we will be learning how to use multiprocessing in Python.This video is sponsored by Brilliant.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I think removing support for spawn will also allow us to remove some of the code in the cython code, and removing code is always nice 😄
By default yes, but on Linux or Mac one can easily test both multiprocessing start methods (see #8851 for some example code)