[Question] After updating to 0.5.1, multi-GPU support has not been working
Hi! Thank you for the great lib!
After updating from 0.2.4 to 0.5.1, only one GPU out of two is used. The code is almost the same:
hyper_parameters = BLSTMModel.get_default_hyper_parameters()
hyper_parameters['layer_bi_lstm']['units'] = 1024
model = BLSTMModel(embedding, hyper_parameters=hyper_parameters)
model.build_model(X_train, y_train)
model.build_multi_gpu_model(gpus=2, x_train=X_train, y_train=y_train, x_validate=X_valid, y_validate=y_valid)
model.fit(X_train, y_train, epochs=15, batch_size=512, x_validate=X_valid, y_validate=y_valid, callbacks=[tf_board_callback, checkpoint_callback])
NVtop and nvidia-smi show that only one GPU is working. In the previous version, both of my GPUs were used.
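For anyone who wants to check this from a script rather than by watching nvidia-smi, here is a minimal sketch. It assumes `nvidia-smi` is on the PATH and uses its `--query-gpu` CSV output. The helper names and the 5% "busy" threshold are my own assumptions, not part of Kashgari.

```python
import subprocess


def parse_gpu_utilization(csv_text):
    """Parse 'index, utilization' CSV lines into {gpu_index: percent}."""
    usage = {}
    for line in csv_text.strip().splitlines():
        fields = [field.strip() for field in line.split(",")]
        if len(fields) != 2:
            continue
        try:
            usage[int(fields[0])] = int(fields[1])
        except ValueError:
            # Some devices report a non-numeric value; skip those rows.
            continue
    return usage


def busy_gpus(usage, threshold=5):
    """Return the indices of GPUs above the (arbitrary) utilization threshold."""
    return sorted(index for index, percent in usage.items() if percent > threshold)


def query_gpu_utilization():
    """Ask nvidia-smi for per-GPU utilization in percent."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=index,utilization.gpu",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpu_utilization(result.stdout)
```

Calling `busy_gpus(query_gpu_utilization())` while training is running should list both GPU indices if multi-GPU training is actually active. A single sample can mislead, so it is worth polling a few times.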
Issue Analytics
- State:
- Created 4 years ago
- Comments:7 (4 by maintainers)
Top GitHub Comments
I am also facing the same issue. Would be great if someone could help.
UPDATE:
Try without calling the build_model() method. It works!
Thanks! This https://github.com/BrikerMan/Kashgari/issues/170#issuecomment-513875786 worked!
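For reference, the fix suggested above amounts to the call order below. This is only a sketch: the wrapper function `train_on_multiple_gpus` is hypothetical, and the method names and keyword arguments are copied from the code in the question rather than checked against the Kashgari 0.5.1 API.

```python
def train_on_multiple_gpus(model, x_train, y_train, x_valid, y_valid,
                           gpus=2, epochs=15, batch_size=512, callbacks=None):
    """Build the multi-GPU model and train it, without calling build_model()."""
    # Note: model.build_model(x_train, y_train) is deliberately NOT called here.
    # According to the comment above, skipping it is what restores multi-GPU use.
    model.build_multi_gpu_model(gpus=gpus,
                                x_train=x_train,
                                y_train=y_train,
                                x_validate=x_valid,
                                y_validate=y_valid)
    return model.fit(x_train,
                     y_train,
                     epochs=epochs,
                     batch_size=batch_size,
                     x_validate=x_valid,
                     y_validate=y_valid,
                     callbacks=callbacks or [])
```

With the objects from the question, this would be called as `train_on_multiple_gpus(model, X_train, y_train, X_valid, y_valid, callbacks=[tf_board_callback, checkpoint_callback])`, where `model` is the `BLSTMModel` created with the custom hyper-parameters.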