Memory doesn't get freed up after changing reference
See original GitHub issuedf = dd.read_csv('s3://xx/*.csv')
e.persist(df) #first result persists in memory
df = dd.read_csv('s3://yy/*.csv')
e.persist(df)
the memory on the nodes grows and df now references to the new data.However, it appears that original df doesnt get cleared in the worker memory. Is this expected behavior?
Issue Analytics
- State:
- Created 7 years ago
- Comments:10 (10 by maintainers)
Top Results From Across the Web
Why is space not being freed from disk after deleting a file in ...
When deleting a large file or files, the file is deleted successfully but the size of the filesystem does not reflect the change....
Read more >Memory held by the resources is not released even after ...
Hi, I created a WPF application with byte array, memory stream and ... holds a reference to the array so the array won't...
Read more >Do Python dictionaries have all memory freed when ...
No, it will be freed AFTER the new object has been created. In order for the reference count to ...
Read more >Clinging to memory: how Python function calls can increase ...
Solution #1: No local variable at all. If there's no extra reference, the original array can be removed from memory as soon as...
Read more >Memory Management, C++ FAQ - Standard C++
In brief, conceptually malloc and new allocate from different heaps, so can't free or delete each other's memory. They also operate at different...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I disconnected all my Clients but the memory doesnt get freed up all the way. I looked at the scheduler logs and it registers my client disconnect.
If you’re interested in doing even more work,
distributed/http/scheduler.py
is easy to extend. The relevant state here isScheduler.who_wants
andScheduler.wants_what
.