You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
thanks for the reply in the previous issue, but now I have another question.
I'm trying alexnet and the training seems to be working but when I should save the net (in SequentialTrainer.lua) in this way:
THCudaCheck FAIL file=/tmp/luarocks_cutorch-scm-1-T1qml2/cutorch/lib/THC/generic/THCStorage.cu line=66 error=2 : out of memory
/home/torch/install/bin/luajit: /home/torch/install/share/lua/5.1/torch/File.lua:351: cuda runtime error (2) : out of memory at /tmp/luarocks_cutorch-scm-1-T1qml2/cutorch/lib/THC/generic/THCStorage.cu:66
stack traceback:
[C]: in function 'read'
/home/torch/install/share/lua/5.1/torch/File.lua:351: in function </home/torch/install/share/lua/5.1/torch/File.lua:245>
[C]: in function 'read'
/home/torch/install/share/lua/5.1/torch/File.lua:351: in function 'readObject'
/home/torch/install/share/lua/5.1/torch/File.lua:369: in function 'readObject'
/home/torch/install/share/lua/5.1/nn/Module.lua:193: in function 'read'
/home/torch/install/share/lua/5.1/torch/File.lua:351: in function 'readObject'
/home/torch/install/share/lua/5.1/nn/Module.lua:141: in function 'clone'
./network/Net.lua:125: in function 'save'
./train/SequentialTrainer.lua:150: in function '_trainBatch'
./train/SequentialTrainer.lua:97: in function 'train'
./network/NetworkWrapper.lua:40: in function 'trainNetwork'
main_train.lua:48: in main chunk
[C]: in function 'dofile'
.../torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:150: in main chunk
[C]: at 0x00405d50
The problem seem to be in this line in Net.lua:
tmp_regressor = self.regressor:clone()
but I don't know why.
I tried to reduce the size of the dataset thinking it was a GPU problem but the error persists.
The text was updated successfully, but these errors were encountered:
Hi,
thanks for the reply in the previous issue, but now I have another question.
I'm trying alexnet and the training seems to be working but when I should save the net (in SequentialTrainer.lua) in this way:
I have this error:
The problem seem to be in this line in Net.lua:
tmp_regressor = self.regressor:clone()
but I don't know why.
I tried to reduce the size of the dataset thinking it was a GPU problem but the error persists.
The text was updated successfully, but these errors were encountered: