question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Bugfix: ... Changed timeout to 120 seconds in conf file - still timing out after 60 seconds using docker-compose

See original GitHub issue

Describe the bug

I changed the timeout to 120 seconds using the ArchiveBox.conf file and pointed my docker-compose file to that env file, however it is still timing out after 60 seconds.

Conf file:

################################################################################
## General Settings
################################################################################

#OUTPUT_DIR="output"
OUTPUT_PERMISSIONS=755
#ONLY_NEW=False
TIMEOUT=120
MEDIA_TIMEOUT=3600
#TEMPLATES_DIR="archivebox/templates"
FOOTER_INFO="Content is hosted for personal archiving purposes only."

Docker-compose file:

version: '3'

services:
    archivebox:
        container_name: archivebox
        build: .                                   # replace this with nikisweeting/archivebox to use the docker-compose.yml file as a standalone file without avoid having to clone the repo
        stdin_open: true                           # needed to be able to input URLs directly after `docker-compose up`
        tty: true                                  # needed to be able to pipe in URLs via stdin to `docker-compose exec ...`
        env_file: /home/taubin/ArchiveBox/archivebox.conf   # this feature is available starting >v0.4
        # environment:
        #     - SHOW_PROGRESS=False                  # make docker logs nicer by not writing lots of progress bar lines
        #     - MEDIA_TIMEOUT=60                     # Change media timeout
        #     - TIMEOUT=120                          # Change timeout to 2 minutes
        # volumes:
            # - ./data:/data
        command: bash -c 'echo "https://github.com/pirate/ArchiveBox" | /bin/archive; tail -f /dev/null'  # archive the Github repo homepage as a starting point so the index doesn't just show an empty list to new users
        restart: unless-stopped
        volumes:
            - ./data:/data
            - /home/taubin/ArchiveBox:/archive


    nginx:
        container_name: archivebox-nginx
        image: 'nginx'
        ports:
            - '8098:80'
        volumes:
            - ./etc/nginx/nginx.conf:/etc/nginx/nginx.conf
            - ./data:/var/www
        restart: unless-stopped

Output:

[*] [2020-07-15 21:46:42] "https://www.redditstatic.com/desktop2x/fonts/redesignIcon/redesignFont.49673a028235b94b800c5f37667963e5.woff"
    https://www.redditstatic.com/desktop2x/fonts/redesignIcon/redesignFont.49673a028235b94b800c5f37667963e5.woff
    √ output/archive/1594849026.0
      > title
        Failed: Unable to detect page title
        Run to see full output:
            cd /home/taubin/ArchiveBox/output/archive/1594849026.0;
            curl https://www.redditstatic.com/desktop2x/fonts/redesignIcon/redesignFont.49673a028235b94b800c5f37667963e5.woff | grep <title>
      > wget
      > pdf
        Failed:TimeoutExpired Command 'chromium-browser' timed out after 60 seconds
        Run to see full output:
            cd /home/taubin/ArchiveBox/output/archive/1594849026.0;
            chromium-browser --headless "--user-agent=Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/73.0.3683.75 Safari/537.36" --window-size=1440,2000 --timeout=60000 --print-to-pdf https://www.redditstatic.com/desktop2x/fonts/redesignIcon/redesignFont.49673a028235b94b800c5f37667963e5.woff
      > screenshot
        Failed:TimeoutExpired Command 'chromium-browser' timed out after 60 seconds
        Run to see full output:
            cd /home/taubin/ArchiveBox/output/archive/1594849026.0;
            chromium-browser --headless "--user-agent=Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/73.0.3683.75 Safari/537.36" --window-size=1440,2000 --timeout=60000 --screenshot https://www.redditstatic.com/desktop2x/fonts/redesignIcon/redesignFont.49673a028235b94b800c5f37667963e5.woff
      > dom
        Failed:TimeoutExpired Command 'chromium-browser' timed out after 60 seconds
        Run to see full output:
            cd /home/taubin/ArchiveBox/output/archive/1594849026.0;
            chromium-browser --headless "--user-agent=Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/73.0.3683.75 Safari/537.36" --window-size=1440,2000 --timeout=60000 --dump-dom https://www.redditstatic.com/desktop2x/fonts/redesignIcon/redesignFont.49673a028235b94b800c5f37667963e5.woff
      > media
      > archive_org
        Failed: Failed to find "content-location" URL header in Archive.org response.
        Run to see full output:
            cd /home/taubin/ArchiveBox/output/archive/1594849026.0;
            curl --location --head --user-agent "ArchiveBox/6c4c6862e (+https://github.com/pirate/ArchiveBox/)" --max-time 60 https://web.archive.org/save/https://www.redditstatic.com/desktop2x/fonts/redesignIcon/redesignFont.49673a028235b94b800c5f37667963e5.woff

Steps to reproduce

  1. Installed archivebox using docker-compose per the instructions on website.

  2. Copied conf.default file to .conf

  3. Ran archivebox with the following (for testing)

taubin@taubinserver:~$ archivebox https://www.reddit.com/r/AskUbuntu/comments/hhtoay/ubuntu_20_cpu_threads_hit_100_network_drops/

Screenshots or log output

Output still showing 60 second timeout


[+] [2020-07-15 22:01:08] "https://www.reddit.com/register/?dest=https%3A%2F%2Fwww.reddit.com%2Fr%2FAskUbuntu%2Fcomments%2Fhhtoay%2Fubuntu_20_cpu_threads_hit_100_network_drops%2F"
    https://www.reddit.com/register/?dest=https%3A%2F%2Fwww.reddit.com%2Fr%2FAskUbuntu%2Fcomments%2Fhhtoay%2Fubuntu_20_cpu_threads_hit_100_network_drops%2F
    > output/archive/1594850467
      > title
      > favicon
      > wget
      > pdf
      > screenshot
      > dom
      > media
      > archive_org
        Failed:TimeoutExpired Command 'curl' timed out after 60 seconds
        Run to see full output:
            cd /home/taubin/ArchiveBox/output/archive/1594850467;
            curl --location --head --user-agent "ArchiveBox/6c4c6862e (+https://github.com/pirate/ArchiveBox/)" --max-time 60 https://web.archive.org/save/https://www.reddit.com/register/?dest=https%3A%2F%2Fwww.reddit.com%2Fr%2FAskUbuntu%2Fcomments%2Fhhtoay%2Fubuntu_20_cpu_threads_hit_100_network_drops%2F

Software versions

  • OS: Ubuntu 20.04
  • ArchiveBox version: 10799e4
  • Python version: Python 3.8.2
  • Chrome version: Chromium 84.0.4147.89 snap

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:6 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
Taubincommented, Jul 16, 2020

That worked, thank you very much!

0reactions
piratecommented, Jul 16, 2020

Ok good, and after this?

archivebox config --set TIMEOUT=3600
archivebox config --get TIMEOUT
Read more comments on GitHub >

github_iconTop Results From Across the Web

docker-compose up timeout · Issue #3851 - GitHub
When attempting to use docker-compose on Docker for Windows, when I run docker-compose up I get the following error: Creating network ...
Read more >
Frequently asked questions - Docker Documentation
Compose stop attempts to stop a container by sending a SIGTERM . It then waits for a default timeout of 10 seconds. After...
Read more >
`docker-compose up` times out with UnixHTTPConnectionPool
ERROR: for testdb-data UnixHTTPConnectionPool(host='localhost', port=None): Read timed out. (read timeout=60) An HTTP request took too long to complete. Retry ...
Read more >
The Reason Why You Need to Wait When Stopping Your ...
If the timeout period passes, and the container still hasn't managed to exit on its own, Docker sends a SIGKILL signal, which causes...
Read more >
How to increase Server Gateway Timeout more then 1 minute ...
Deployed Django, Gunicorn, NGINX, Postgresql using Docker and GitLab CI/CD on DO droplet. Below are configuration files: nginx.conf
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found