question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Timeout: Incomplete Dataset Export Download

See original GitHub issue

My actions before raising this issue

  • Read/searched the docs
  • Searched past issues

So I’ve been trying to export one of my datasets for the past several days but in vain. Dumping annotations seems to be working fine, but exporting the dataset always results in a timeout halfway through the download. I’ve attempted this on two different networks and browsers with no success.

Expected Behaviour

Trying to export a dataset (4.1 GB) should download successfully, without any timeouts or errors.

Current Behaviour

Download timeouts after 2GB have downloaded and the resulting file is unusable.

Possible Solution

I’ve tried circumventing the timeout in multiple ways, which have all resulted in failure. 1 - I tried using wget in Colab to get around what I thought was slow internet, but the downloaded file is empty and of Length = 0 .

!wget --no-check-certificate  --load-cookies cookies.txt \
'https://cvat.org/api/v1/tasks/###/dataset?format=TFRecord%201.0&action=download'

2- Update: This only worked once and now times out at 2GB as well.

I tried using Selenium on Colab and it was successful:

from selenium import webdriver

options = webdriver.ChromeOptions()
options.add_argument('--headless')
options.add_argument('--no-sandbox')
options.add_argument('--disable-dev-shm-usage')

options.add_experimental_option("prefs", {
  "download.default_directory": r"/content/",
  "download.prompt_for_download": False,
  "download.directory_upgrade": True,
  "safebrowsing.enabled": True
})

wd = webdriver.Chrome('chromedriver',options=options)
wd.get("https://cvat.org/auth/login")

username = wd.find_element_by_id("username")
password = wd.find_element_by_id("password")

username.send_keys("username")
password.send_keys("password")
element = wd.find_element_by_css_selector('.login-form-button')
wd.execute_script("arguments[0].click();", element)

from selenium.webdriver import ActionChains
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.by import By

wait = WebDriverWait(wd, 10)
actions = ActionChains(wd)

open = wd.find_element_by_css_selector("#root > section > main > div > div:nth-child(3) > div > div:nth-child(1) > div:nth-child(4) > div:nth-child(1) > div > a")
wd.execute_script("arguments[0].click();", open)

action_button = wd.find_element_by_css_selector("#root > section > main > div > div > div.ant-row-flex.ant-row-flex-space-between.ant-row-flex-middle.cvat-task-top-bar > div:nth-child(2) > button")
actions.move_to_element(action_button).perform()

export_data = wd.find_element_by_css_selector("body > div:nth-child(9) > div > div > ul > li:nth-child(3) > div")
actions.move_to_element(export_data).perform()

data = wd.find_element_by_xpath("//*[@id='export_task_dataset$Menu']/li[1]")
wd.execute_script("arguments[0].click();", data)

Steps to Reproduce (for bugs)

  1. Create Task
  2. Annotate a few things
  3. Export dataset
  4. Download timeouts before successful download.

Context

I can’t export my data for use in research.

Your Environment

CVAT.org

Issue Analytics

  • State:open
  • Created 3 years ago
  • Reactions:3
  • Comments:30 (5 by maintainers)

github_iconTop GitHub Comments

6reactions
cyrilzakkacommented, Aug 10, 2021

@memirerdol @ConstantSun For some odd reason, curl worked for me today - I was able to download 64 GB worth of my annotated data. Regular downloads using the GUI still times out after 1GB

3reactions
azhavorocommented, May 31, 2021

@lgg, we discussed internally and I’m going to disable this limit for cvat.org today

Read more comments on GitHub >

github_iconTop Results From Across the Web

Developers - Timeout: Incomplete Dataset Export Download -
Dumping annotations seems to be working fine, but exporting the dataset always results in a timeout halfway through the download. I've attempted this...
Read more >
Cannot export the whole search result to csv - Splunk Answers
Solved: I am using Splunk 5.0.4 and accessing splunkweb with IE8. I am trying to export my search results to csv but every...
Read more >
When exporting large files from the Web Player, the resulting ...
Exporting large data sets or downloading large files from the Web Player can result in a timeout which leaves the downloaded data set...
Read more >
How to address the time out issue on "Export to Excel ...
Use a direct link to the SQL database, via a dts package, or by using the native Excel functionality to connect to a...
Read more >
Why do I see "Download failed" when exporting a report in ...
If the report contains a large amount of data, the export service may timeout during the process and result in a failure.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found