Crash at shape pre-training

I am trying to reproduce the results but ran into a problem at ‘I. Shape Pre-Training’. The script crashes during the validation step of shape pre-training. It looks like an OOM issue: the log says “killed”, and the crash goes away if I set shuffle_buffer_size=False in shape.ini. Any suggestions would help!

I am using a machine with four 3090 GPUs, 12 CPU cores and 60 GB of memory. My dataset has 100 training samples and 7 validation samples. The surf_root directory contains 120 test samples, 99 training samples and 99 validation samples.

Issue Analytics

  • State: closed
  • Created 10 months ago
  • Comments: 7

Top GitHub Comments

1 reaction
Woolseyyy commented, Nov 30, 2022

Does this setting affect the overall experiment?

Modify the code so that the setting only affects validation.
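
One way to read the suggestion above is to apply the shuffle buffer only when building the training pipeline and to leave the validation pipeline unshuffled. The sketch below is a guess at that idea, not the repository's actual code: `maybe_shuffle` and `ListDataset` are hypothetical names, and the only assumption about the real dataset object is that it exposes a `shuffle(buffer_size)` method, as `tf.data.Dataset` does.

```python
import random


def maybe_shuffle(dataset, buffer_size, is_training):
    """Shuffle only the training split; return other splits unchanged.

    A missing, False, zero, or negative buffer_size also skips shuffling,
    which avoids calling shuffle() with an invalid size.
    """
    if is_training and buffer_size and buffer_size > 0:
        return dataset.shuffle(buffer_size)
    return dataset


class ListDataset:
    """Hypothetical stand-in for a real dataset, used only for this demo."""

    def __init__(self, items):
        self.items = list(items)

    def shuffle(self, buffer_size):
        # The stand-in shuffles everything at once; a real pipeline would
        # keep up to buffer_size elements in memory while shuffling.
        shuffled = self.items[:]
        random.shuffle(shuffled)
        return ListDataset(shuffled)


val = ListDataset(range(7))
val_out = maybe_shuffle(val, buffer_size=1000, is_training=False)
print(val_out is val)  # validation data is passed through untouched
```

With this split, the memory cost of the shuffle buffer is paid only during training, while validation reads samples in their stored order.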

0 reactions
wangmingyang4 commented, Nov 30, 2022

I found no shuffle_buffer_size in shape.ini. Do I need to add shuffle_buffer_size = 0, or set no_shuffle = True by modifying base.py? @Woolseyyy Setting shuffle_buffer_size = 0 raised an error: buffer_size must be greater than 0.
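
The error quoted above suggests that a zero buffer size is passed straight to the shuffle call. A possible workaround is to normalise the config value first and skip shuffling when it is missing, false-like, or not positive. This is a minimal sketch using Python's standard `configparser`; the function name `read_shuffle_buffer_size` and the section name `data` are assumptions, since the issue does not show the layout of shape.ini.

```python
import configparser


def read_shuffle_buffer_size(ini_text, section="data",
                             option="shuffle_buffer_size"):
    """Return a positive buffer size, or None when shuffling should be skipped."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    # fallback=None covers both a missing section and a missing option.
    raw = parser.get(section, option, fallback=None)
    if raw is None:
        return None
    if raw.strip().lower() in {"", "0", "false", "no", "off", "none"}:
        return None
    size = int(raw)  # raises ValueError for values that are not integers
    return size if size > 0 else None


# "data" is a hypothetical section name, not taken from the real shape.ini.
print(read_shuffle_buffer_size("[data]\nshuffle_buffer_size = 0\n"))
print(read_shuffle_buffer_size("[data]\nshuffle_buffer_size = 256\n"))
```

A caller can then shuffle only when the returned value is not None, so that neither 0 nor False ever reaches the shuffle call.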

