add grid lstm #345

christopher5106 · 2016-09-26T19:41:07Z

I added 2D Grid LSTM following https://github.com/coreylynch/grid-lstm
It is pretty slow and I get an out of memory error. Sounds I'm not sure to understand all aspects of Element-research/rnn...

christopher5106 · 2016-09-26T19:46:06Z

An example of usage of the layer nn.Grid2DLSTM https://github.com/christopher5106/grid-lstm

christopher5106 · 2016-09-26T19:58:24Z

But it works.

It is more a question on what is happening behind the element research framework... to get it work as fast as the original version, without memory leak.

Could someone advise me on these questions :

1° how does garbage collection work ? training multiple forward / backwards does delete tensors ? shall I call forget method during each step of the training ?

2° if I want to initialize the parameters (https://github.com/christopher5106/grid-lstm/blob/master/train.lua#L163-L180) inside the layer, in which function shall I put it ?

Thanks a lot

christopher5106 · 2016-09-26T21:13:39Z

Using rnn:forget() solves the memory and speed issue.

christopher5106 · 2016-09-27T09:02:59Z

Small corrections. Works perfectly now.

nicholas-leonard · 2017-01-26T16:05:49Z

@christopher5106 Is it too l ate for me to ask you to include documentation and unit test? Sorry for the delay.

ddovod · 2017-02-03T20:25:44Z

Hello guys! Any updates on this PR? Looks tasty!

christopher5106 · 2017-02-05T17:59:15Z

What would you like exactly for this?

nicholas-leonard · 2017-02-17T19:51:40Z

@christopher5106 For documentation, adding a section to README.md with link to paper and brief explanation should do the trick. For unit tests, add a function to test.lua to make sure GridLSTM behaves as expected. Doesn't have to be extensive.

kenkit · 2017-09-29T09:42:38Z

Grid2DLSTM.lua

+      self.cells = {[0] = {}}
+
+      for L=1,self.nb_layers do
+        local h_init = torch.zeros(input:size(1), self.outputSize):cuda()


You will have to fix this option so it can work without a gpu, i'm getting errors on cpu version

I tried removing cuda() references but could not get it to work on the rnn-sin demo with 4x2 tensor/table

add grid lstm

344c155

some corrections

32bd83e

christopher5106 added 5 commits September 28, 2016 15:12

n

a89a728

initialization of hidden and cell states in the grid module

61ae656

taking lookup table out of module to be more generic

97da470

for printing

05c188f

using userPrevCell to initialize cells after forget

6065689

kenkit reviewed Sep 29, 2017

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add grid lstm #345

add grid lstm #345

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 27, 2016

nicholas-leonard commented Jan 26, 2017

ddovod commented Feb 3, 2017 •

edited

Loading

christopher5106 commented Feb 5, 2017

nicholas-leonard commented Feb 17, 2017

kenkit Sep 29, 2017

kenkit Sep 29, 2017 •

edited

Loading

add grid lstm #345

Are you sure you want to change the base?

add grid lstm #345

Conversation

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 26, 2016

christopher5106 commented Sep 27, 2016

nicholas-leonard commented Jan 26, 2017

ddovod commented Feb 3, 2017 • edited Loading

christopher5106 commented Feb 5, 2017

nicholas-leonard commented Feb 17, 2017

kenkit Sep 29, 2017

Choose a reason for hiding this comment

kenkit Sep 29, 2017 • edited Loading

Choose a reason for hiding this comment

ddovod commented Feb 3, 2017 •

edited

Loading

kenkit Sep 29, 2017 •

edited

Loading