
InterProcessLock files left behind


When trying to use the InterProcessLock class on OS X, it appears that when it releases the lock, it doesn’t clean up after itself. It would be great if it could do this, to ensure that we don’t leave a bunch of leftover files lying around.
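
For context, a minimal sketch of the usage pattern being described, assuming the standard fasteners API; the lock path and the final print are illustrative, not part of the report:

import os
import fasteners

lock_path = "/tmp/example.lock"  # hypothetical path for illustration

lock = fasteners.InterProcessLock(lock_path)
with lock:
    pass  # critical section; the lock file is created on acquire

# The lock has been released, but the file it used is still on disk.
print(os.path.exists(lock_path))  # True -- the behaviour this issue reports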

Issue Analytics

  • State: open
  • Created: 7 years ago
  • Reactions: 9
  • Comments: 27 (16 by maintainers)

Top GitHub Comments

4 reactions
tgamblin commented, Oct 13, 2016

I was able to find a decent solution for this for my use case by using byte-range locking with Python’s lockf (which, confusingly, is fcntl underneath).

In particular, I have a bunch of install prefixes for different packages in a package manager, and each needs to be write-locked while an install is happening and read-locked while a dependency’s install is happening. Lots of installs can happen in parallel.

I use a single lock file, and for each directory, I lock a single byte in the lock file identified by the 63-bit prefix of the SHA-1 of the directory name.
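
For illustration, a rough sketch of that scheme using Python’s fcntl.lockf directly; the lock-file path and helper names here are hypothetical and not part of fasteners:

import fcntl
import hashlib
import os

LOCK_FILE = "/var/tmp/packages.lock"  # hypothetical shared lock file

def _offset_for(name):
    # 63-bit prefix of the SHA-1 of the directory name, so the byte
    # offset stays within the signed 64-bit range lockf expects.
    digest = hashlib.sha1(name.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") >> 1

def lock_directory(fd, name, exclusive=True):
    # Lock a single byte at the hashed offset.  The lock file stays
    # zero bytes long, since fcntl allows locking past the end of the file.
    lock_type = fcntl.LOCK_EX if exclusive else fcntl.LOCK_SH
    fcntl.lockf(fd, lock_type, 1, _offset_for(name), os.SEEK_SET)

def unlock_directory(fd, name):
    fcntl.lockf(fd, fcntl.LOCK_UN, 1, _offset_for(name), os.SEEK_SET)

fd = os.open(LOCK_FILE, os.O_RDWR | os.O_CREAT, 0o644)
lock_directory(fd, "/opt/packages/zlib-1.2.11", exclusive=True)  # write lock
# ... perform the install ...
unlock_directory(fd, "/opt/packages/zlib-1.2.11")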

Yes, that is complicated, but it has some advantages:

  1. Uses fcntl locks, which seem to be reasonably well supported across the distributed filesystems I care about (NFS v2.6+, Lustre, GPFS).
  2. Only requires one lock file, whose actual size is zero since you can lock past the end of the file.
  3. No need to clean up the one lock file (cleaning up fcntl locks gives you race conditions anyway).
  4. Interprocess reader-writer locks.

Disadvantages:

  1. This approach is not “perfect”, in the sense that it can result in false positives – if there is a hash collision you may have to wait for an install to finish when you do not need to wait. But the likelihood of a collision with two locking processes and 64-bit offsets (most machines) is 1e-19, and the likelihood of a collision doesn’t reach 1% until you have ~430 million processes (at which point you probably have other problems).
  2. I think if you adapted this to also support msvcrt on Windows, you’d have to make all the locks exclusive, or maybe UNIX clients could use the read locks and they would appear to be exclusive locks to a Windows client. I haven’t looked into how different NFS implementations handle this.

At least for my use case, the advantages outweigh the disadvantages, and I couldn’t come up with a better solution that didn’t require me to deploy a distributed lock server or some other coordination service.

Anyway, an interprocess reader-writer lock with byte range support is implemented here:

There is some relevant discussion here. The SHA stuff is outside the lock class, so it probably doesn’t need to go in fasteners, unless you like the idea and want some kind of lock_by_name_sha1() method.

Just wanted to throw a potential solution in the thread. Let me know if you’d have a use for this in fasteners.

3 reactions
vstinner commented, Nov 15, 2018

Would it make sense to use a file descriptor rather than a filename, with O_TMPFILE, to ensure that the file is destroyed as soon as the file is closed in all processes (or when the processes are killed)? O_TMPFILE requires Linux 3.11 or newer and is not supported by all filesystems, but more and more filesystems support it.

See how the tempfile module of the Python stdlib uses O_TMPFILE:

I just tested on Fedora 29: btrfs, ext4 and tmpfs support O_TMPFILE. (I only tested these filesystems.)

$ cat /etc/fedora-release 
Fedora release 29 (Twenty Nine)

$ uname -r
4.18.16-300.fc29.x86_64

$ python3
Python 3.7.1 (default, Nov  5 2018, 14:07:04) 
>>> import os

>>> fd=os.open(".", os.O_WRONLY | os.O_TMPFILE) # brtfs
>>> os.close(fd)

>>> fd=os.open("/tmp", os.O_WRONLY | os.O_TMPFILE) # tmpfs
>>> os.close(fd)

>>> fd=os.open("/boot/test", os.O_WRONLY | os.O_TMPFILE) # ext4
>>> os.close(fd)

Problem: who creates the FD? How do you pass the FD to other processes? A UNIX socket with sendmsg()? …
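
For what it’s worth, one possible way to hand such an FD to another process is SCM_RIGHTS over a UNIX socket; below is a rough sketch using socket.send_fds / socket.recv_fds (Python 3.9+), with a hypothetical socket path:

import os
import socket

SOCKET_PATH = "/tmp/lock-fd.sock"  # hypothetical rendezvous path

def serve_lock_fd(directory="/tmp"):
    # Process A: create the anonymous (O_TMPFILE) lock file and offer its FD.
    lock_fd = os.open(directory, os.O_RDWR | os.O_TMPFILE, 0o600)
    srv = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    srv.bind(SOCKET_PATH)
    srv.listen(1)
    conn, _ = srv.accept()
    # SCM_RIGHTS ancillary data duplicates the FD into the peer process.
    socket.send_fds(conn, [b"lock"], [lock_fd])
    conn.close()

def receive_lock_fd():
    # Process B: receive the FD and use it with fcntl locks as usual.
    cli = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    cli.connect(SOCKET_PATH)
    msg, fds, flags, addr = socket.recv_fds(cli, 1024, 1)
    cli.close()
    return fds[0]

Because the file is anonymous, it never shows up in a directory listing, and the kernel reclaims it once every process holding the FD has closed it, so nothing is left behind to clean up.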
