Add part: lstm block #66
base: main
Conversation
One minor Q.
i6_models/parts/lstm.py
Outdated
    enforce_sorted: bool

    @classmethod
    def from_dict(cls, model_cfg_dict: Dict):
Same Q as in the other PR: why is this necessary now, and why hasn't it been for the other assemblies?
Co-authored-by: Albert Zeyer <[email protected]>
            if seq_len.get_device() >= 0:
                seq_len = seq_len.cpu()
Suggested change:
-            if seq_len.get_device() >= 0:
-                seq_len = seq_len.cpu()
+            seq_len = seq_len.cpu()
        )

    def forward(self, x: torch.Tensor, seq_len: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]:
        if not torch.jit.is_scripting() and not torch.jit.is_tracing():
Why only when not scripting? Don't you want seq_len to always be on CPU?
I followed the example in the blstm part.
if not torch.jit.is_scripting() and not torch.jit.is_tracing():
    # during graph mode we have to assume all Tensors are on the correct device,
    # otherwise move lengths to the CPU if they are on GPU
    if seq_len.get_device() >= 0:
        seq_len = seq_len.cpu()
I did not copy the comment over... I have not yet gotten around to looking into why this is necessary.
@JackTemaki you implemented the BLSTM IIRC. Do you remember why it was done this way?
The question is: is this still relevant? It was something I added at some point, but if it is not needed for ONNX export, it should be removed until there is actually a reason for it.
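For reference, a minimal sketch of what the simplified forward would look like without the device guard (pack_padded_sequence expects the lengths tensor on CPU in any case; the helper name and enforce_sorted=False are assumptions, not the exact PR code):

from typing import Tuple

import torch
from torch import nn


def lstm_forward(lstm_stack: nn.LSTM, x: torch.Tensor, seq_len: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]:
    # pack_padded_sequence expects the lengths tensor on CPU, so move it unconditionally
    seq_len = seq_len.cpu()
    packed = nn.utils.rnn.pack_padded_sequence(x, seq_len, batch_first=True, enforce_sorted=False)
    out, _ = lstm_stack(packed)
    out, _ = nn.utils.rnn.pad_packed_sequence(out, batch_first=True, padding_value=0.0)
    return out, seq_len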
Co-authored-by: Albert Zeyer <[email protected]>
    enforce_sorted: bool

    @classmethod
    def from_dict(cls, model_cfg_dict: Dict[str, Any]):
I don't see this for other part configs; why do we need it here?
I use it in the model definition to convert a dict to the config class. "Need" might be a bit strong, but I like it :D
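For context, a minimal sketch of what such a from_dict helper does on a dataclass config (input_dim and hidden_dim are assumed field names; the others are taken from the diff above):

from dataclasses import dataclass
from typing import Any, Dict


@dataclass
class LstmBlockV1Config:
    input_dim: int    # assumed field name
    hidden_dim: int   # assumed field name
    num_layers: int
    bias: bool
    dropout: float
    enforce_sorted: bool

    @classmethod
    def from_dict(cls, model_cfg_dict: Dict[str, Any]) -> "LstmBlockV1Config":
        # turn a plain dict (e.g. from a serialized model config) into the typed config class
        return cls(**model_cfg_dict)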
i6_models/parts/lstm.py
Outdated
    num_layers: int
    bias: bool
    dropout: float
    bidirectional: bool
Should we allow this? I feel like if we have bidirectional here, the BLSTM part becomes redundant, which is maybe okay, but it might also lead to two different branches that do the same thing, which I am not sure we want (if there are potential extensions later). We could maybe also just deprecate the BLSTM block?
Good point, I'll just remove the flag. It is maybe a bit more readable to have two classes?!
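Sketched out, the resulting LSTM construction would then hardcode the direction (a hypothetical helper; field names not visible in the diff are assumptions), leaving the bidirectional case to the existing BLSTM part:

from torch import nn


def build_lstm_stack(cfg: "LstmBlockV1Config") -> nn.LSTM:
    # bidirectional is fixed to False instead of being a config flag,
    # so the BLSTM part remains the single place handling the bidirectional case
    return nn.LSTM(
        input_size=cfg.input_dim,    # assumed field name
        hidden_size=cfg.hidden_dim,  # assumed field name
        num_layers=cfg.num_layers,
        bias=cfg.bias,
        dropout=cfg.dropout,
        batch_first=True,
        bidirectional=False,
    )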
i6_models/parts/lstm.py
Outdated
class LstmBlockV1(nn.Module):
    def __init__(self, model_cfg: Union[LstmBlockV1Config, Dict[str, Any]], **kwargs):
        """
        Model definition of LSTM block. Contains single lstm stack and padding sequence in forward call.
Also add the "including dropout, batch-first variant, hardcoded to use B,T,F input" part please.
Could also add the supports scripting part.
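Putting both requests together, the class docstring could read roughly like this (a sketch of the wording, not the actual PR text):

from torch import nn


class LstmBlockV1(nn.Module):
    """
    Model definition of an LSTM block: a single uni-directional LSTM stack including dropout,
    batch-first variant, hardcoded to use [B, T, F] input. The sequence is packed and padded
    inside the forward call; scripting is supported.
    """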
            bidirectional=self.cfg.bidirectional,
        )

    def forward(self, x: torch.Tensor, seq_len: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]:
doc pls.
i6_models/parts/lstm.py
Outdated
        )

        lstm_out, _ = self.lstm_stack(lstm_packed_in)
        lstm_out, _ = nn.utils.rnn.pad_packed_sequence(
Just out of curiosity: why does black force the new lines here but not for the blstm? Shouldn't it be the same line length?
Well, this depends on whether you manually set the trailing comma. If it is set, black will force the new lines.
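This is black's "magic trailing comma" behavior. A minimal illustration using the call from the diff above (the surrounding setup is only there to make the snippet self-contained):

import torch
from torch import nn

lengths = torch.tensor([3, 2])
packed = nn.utils.rnn.pack_padded_sequence(torch.zeros(2, 3, 4), lengths, batch_first=True)

# trailing comma present -> black keeps one argument per line
lstm_out, seq_len = nn.utils.rnn.pad_packed_sequence(
    packed,
    padding_value=0.0,
    batch_first=True,
)

# no trailing comma (and the call fits the line length) -> black collapses it onto one line
lstm_out, seq_len = nn.utils.rnn.pad_packed_sequence(packed, padding_value=0.0, batch_first=True)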
Adds LSTM Block