Switch all dependencies to pure python and release ArchiveBox pip package
See original GitHub issueI originally thought moving to Python-only dependencies would be intractable, but after some more research I now realize this is quite straightforward.
-
apt install curl
->pip install requests archivenow
(requests docs, archivenow docs) -
apt install wget
->pip install wpull pywb
(wpull docs, pywb docs) -
apt install git
->pip install GitPython
(docs) -
apt install youtube-dl
->pip install youtube-dl
(docs) -
apt install chromium-browser
->pip install pyppeteer
(docs)
Then we wont need users to install any system dependencies anymore, and we can move to using only requirements.txt
and setup.py
to install ArchiveBox via pip
.
Issue Analytics
- State:
- Created 5 years ago
- Reactions:8
- Comments:5 (1 by maintainers)
Top Results From Across the Web
archivebox - PyPI
ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view sites you want to preserve offline.
Read more >Release 0.6.3 - ArchiveBox
Python Module Index ... You don't need to install every dependency to use ArchiveBox. ... Build the docs, pip package, and docker image....
Read more >Release 0.4.0 - ArchiveBox
After installing the dependencies, just pipe some new links into the ./archive command to start your archive.
Read more >Release 0.4.18 - ArchiveBox
ArchiveBox only has a few main dependencies apart from python3, and they can all be installed using your normal package manager.
Read more >Release 0.4.24 - ArchiveBox
You don't need to install all the dependencies, ArchiveBox will automatically enable the ... Build the docs, pip package, and docker image.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Yeah I looked at wpull recently and came to the same conclusion. Wget2 looks more promising than wpull.
I think I’m going to close this issue for now, as we start to expand the suite of archiving methods it’s looking more and more like many of them will be node-based. Considering we already support
pip install archivebox
now to get the bulk of archivebox’s functionality, and we offer all the methods out-of-the-box via docker, making everything python-only is no longer a priority.Anything you’re fetching with
curl
should be replaced withwget
or vice versa, and that’ll cut down on some dependencies in thepip
translation.