Package source priority (HPC cluster use case)
_Originally posted by @mboisson in https://github.com/pypa/pip/issues/8606#issuecomment-877431454_
Here is what we can and cannot do in our environment (HPC clusters).
- We can’t run a server. This is for HPC clusters which are not under our control. The only thing we distribute is a filesystem. We can’t run long-running processes and we don’t control the network.
- We can’t use binary (manylinux or other) wheels from PyPI, for two primary reasons: libraries that are missing or installed in non-standard locations, and CPU instructions that are not optimized for our hardware.
- We can’t host all of PyPI; that’s just too much, and pure-Python wheels work just fine.
- We can’t change requirements files or install differently, because we aren’t the ones doing the installation. We support end users (researchers); we don’t do their work. They install whatever packages they need.
What we can do, and have been doing:
- Provide users a directory with wheels that we compile from source
- Define a `PIP_CONFIG_FILE` in the user’s environment to make sure their `pip install ...` commands use some settings. We have been using this to point their `pip` to our wheel directory, which used to be preferred over PyPI. We are now also using that to put a constraint on pip < 21, since that’s now broken for us.
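
As a rough illustration of the setup described above, the sketch below writes a pip configuration file and a constraints file, then builds an environment in which `PIP_CONFIG_FILE` points at that configuration. The directory `/hypothetical/wheelhouse`, the file names, and the helper functions are assumptions made for the example, not the actual cluster layout. It relies on pip reading long option names such as `find-links` and `constraint` from a per-command section of its config file.

```python
import configparser
import os
import tempfile
from pathlib import Path


def write_pip_config(config_dir: Path, wheelhouse: Path) -> Path:
    """Write a constraints file and a pip config file; return the config path."""
    # Constraint mentioned in the post: keep pip itself below version 21.
    constraints = config_dir / "constraints.txt"
    constraints.write_text("pip<21\n")

    config = configparser.ConfigParser()
    # Settings in the [install] section apply to `pip install`.
    config["install"] = {
        "find-links": str(wheelhouse),    # directory of locally built wheels
        "constraint": str(constraints),   # constraints applied to every install
    }
    config_path = config_dir / "pip.conf"
    with config_path.open("w") as handle:
        config.write(handle)
    return config_path


def user_environment(config_path: Path) -> dict:
    """Return a copy of the current environment with PIP_CONFIG_FILE set."""
    env = dict(os.environ)
    env["PIP_CONFIG_FILE"] = str(config_path)
    return env


# Hypothetical locations, for demonstration only.
demo_dir = Path(tempfile.mkdtemp())
demo_config = write_pip_config(demo_dir, Path("/hypothetical/wheelhouse"))
demo_env = user_environment(demo_config)
print(demo_config.read_text())
```

Note that `find-links` only adds the directory as an extra source of candidates; it does not express any priority between that directory and PyPI, which is the behaviour this issue is asking about.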
Issue Analytics
- State:
- Created 2 years ago
- Comments:20 (11 by maintainers)
Top GitHub Comments
I believe build tags are possible on PyPI, although practically nobody uses them.
By two packages with the same label, do you mean e.g. there are two pages for package `foo`, or there are two entries for file `foo-1.0-none-any.whl`? Because the former is an absolutely valid strategy to support architectures that PyPI doesn’t and potentially can’t host, and is used by many projects like piwheels and pytorch. Showing a warning would cause great disruption to those projects (because unassuming users would bug them thinking they are to be faulted), who have been meticulously following community guidelines and doing things “right”. The latter is likely harmless, but probably also not very useful?