diff --git a/dev/.documenter-siteinfo.json b/dev/.documenter-siteinfo.json index 27497ef..c52200f 100644 --- a/dev/.documenter-siteinfo.json +++ b/dev/.documenter-siteinfo.json @@ -1 +1 @@ -{"documenter":{"julia_version":"1.11.2","generation_timestamp":"2024-12-22T19:18:52","documenter_version":"1.8.0"}} \ No newline at end of file +{"documenter":{"julia_version":"1.11.2","generation_timestamp":"2024-12-24T16:26:28","documenter_version":"1.8.0"}} \ No newline at end of file diff --git a/dev/api/cells/index.html b/dev/api/cells/index.html index c16cc84..2728a61 100644 --- a/dev/api/cells/index.html +++ b/dev/api/cells/index.html @@ -9,11 +9,11 @@ c_t &= i_t \odot \tilde{c}_t + f_t \odot c_{t-1}, \\ h_t &= g(c_t) \end{aligned}\]

Forward

rancell(inp, (state, cstate))
-rancell(inp)

Arguments

Returns

source
RecurrentLayers.IndRNNCellType
IndRNNCell((input_size => hidden_size)::Pair, σ=relu;
+rancell(inp)

Arguments

  • inp: The input to the rancell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the RANCell. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where output = new_state is the new hidden state and state = (new_state, new_cstate) is the new hidden and cell state. They are tensors of size hidden_size or hidden_size x batch_size.
source
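As a concrete illustration of the forward pass above, here is a minimal sketch of a single-step call to a RANCell; the dimensions (3 => 5), the batch of 8, and the explicit zero states are illustrative choices, not requirements.

using Flux, RecurrentLayers

rancell = RANCell(3 => 5)                      # input_size = 3, hidden_size = 5 (illustrative)
inp = rand(Float32, 3, 8)                      # one time step for a batch of 8
state = zeros(Float32, 5, 8)                   # hidden state
cstate = zeros(Float32, 5, 8)                  # cell state
output, (new_state, new_cstate) = rancell(inp, (state, cstate))
size(output)                                   # (5, 8)

Calling rancell(inp) without the state tuple falls back to zero-initialized states, as described above.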
RecurrentLayers.IndRNNCellType
IndRNNCell((input_size => hidden_size)::Pair, σ=relu;
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Independently recurrent cell. See IndRNN for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • σ: activation function. Default is relu
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\mathbf{h}_{t} = \sigma(\mathbf{W} \mathbf{x}_t + \mathbf{u} \odot \mathbf{h}_{t-1} + \mathbf{b})\]

Forward

indrnncell(inp, state)
-indrnncell(inp)

Arguments

  • inp: The input to the indrnncell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the IndRNNCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.LightRUCellType
LightRUCell((input_size => hidden_size)::Pair;
+indrnncell(inp)

Arguments

  • inp: The input to the indrnncell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the IndRNNCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
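For example, a single unbatched step through an IndRNNCell might look like the sketch below; the sizes and the explicit zero state are illustrative.

using Flux, RecurrentLayers

indrnncell = IndRNNCell(3 => 5, relu)          # input_size = 3, hidden_size = 5 (illustrative), relu activation
inp = rand(Float32, 3)                         # a single input vector
state = zeros(Float32, 5)                      # hidden state
output, new_state = indrnncell(inp, state)
size(output)                                   # (5,)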
RecurrentLayers.LightRUCellType
LightRUCell((input_size => hidden_size)::Pair;
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Light recurrent unit. See LightRU for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -21,7 +21,7 @@ f_t &= \delta(W_f x_t + U_f h_{t-1} + b_f), \\ h_t &= (1 - f_t) \odot h_{t-1} + f_t \odot \tilde{h}_t. \end{aligned}\]

Forward

lightrucell(inp, state)
-lightrucell(inp)

Arguments

  • inp: The input to the lightrucell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the LightRUCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.LiGRUCellType
LiGRUCell((input_size => hidden_size)::Pair;
+lightrucell(inp)

Arguments

  • inp: The input to the lightrucell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the LightRUCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.LiGRUCellType
LiGRUCell((input_size => hidden_size)::Pair;
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Light gated recurrent unit. The implementation does not include the batch normalization as described in the original paper. See LiGRU for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -29,7 +29,7 @@ \tilde{h}_t &= \text{ReLU}(W_h x_t + U_h h_{t-1}), \\ h_t &= z_t \odot h_{t-1} + (1 - z_t) \odot \tilde{h}_t \end{aligned}\]

Forward

ligrucell(inp, state)
-ligrucell(inp)

Arguments

  • inp: The input to the ligrucell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the LiGRUCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.MGUCellType
MGUCell((input_size => hidden_size)::Pair;
+ligrucell(inp)

Arguments

  • inp: The input to the ligrucell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the LiGRUCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.MGUCellType
MGUCell((input_size => hidden_size)::Pair;
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Minimal gated unit. See MGU for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -37,7 +37,7 @@ \tilde{h}_t &= \tanh(W_h x_t + U_h (f_t \odot h_{t-1}) + b_h), \\ h_t &= (1 - f_t) \odot h_{t-1} + f_t \odot \tilde{h}_t \end{aligned}\]

Forward

mgucell(inp, state)
-mgucell(inp)

Arguments

  • inp: The input to the mgucell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MGUCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.NASCellType
NASCell((input_size => hidden_size);
+mgucell(inp)

Arguments

  • inp: The input to the mgucell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MGUCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
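A minimal batched single-step call to an MGUCell, following the forward signature above; all sizes are illustrative.

using Flux, RecurrentLayers

mgucell = MGUCell(3 => 5)                      # input_size = 3, hidden_size = 5 (illustrative)
inp = rand(Float32, 3, 4)                      # one time step, batch of 4
state = zeros(Float32, 5, 4)
output, new_state = mgucell(inp, state)        # both are 5 x 4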
RecurrentLayers.NASCellType
NASCell((input_size => hidden_size);
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Neural Architecture Search unit. See NAS for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -65,7 +65,7 @@ l_5 &= \tanh(l_3 + l_4) \\ h_{\text{new}} &= \tanh(c_{\text{new}} \cdot l_5) \end{aligned}\]

Forward

nascell(inp, (state, cstate))
-nascell(inp)

Arguments

  • inp: The input to the nascell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the NASCell. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where output = new_state is the new hidden state and state = (new_state, new_cstate) is the new hidden and cell state. They are tensors of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.RHNCellType
RHNCell((input_size => hidden_size), depth=3;
+nascell(inp)

Arguments

  • inp: The input to the nascell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the NASCell. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where output = new_state is the new hidden state and state = (new_state, new_cstate) is the new hidden and cell state. They are tensors of size hidden_size or hidden_size x batch_size.
source
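Since NASCell carries both a hidden and a cell state, a sketch of one forward step looks like the following; the dimensions are illustrative.

using Flux, RecurrentLayers

nascell = NASCell(3 => 5)                      # illustrative sizes
inp = rand(Float32, 3, 2)                      # one time step, batch of 2
state = zeros(Float32, 5, 2)
cstate = zeros(Float32, 5, 2)
output, (new_state, new_cstate) = nascell(inp, (state, cstate))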
RecurrentLayers.RHNCellType
RHNCell((input_size => hidden_size), depth=3;
     couple_carry::Bool = true,
     cell_kwargs...)

Recurrent highway network. See RHNCellUnit for the unit component of this layer. See RHN for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • depth: depth of the recurrence. Default is 3
  • couple_carry: couples the carry gate and the transform gate. Default true
  • init_kernel: initializer for the input to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} s_{\ell}^{[t]} &= h_{\ell}^{[t]} \odot t_{\ell}^{[t]} + s_{\ell-1}^{[t]} \odot c_{\ell}^{[t]}, \\ @@ -73,9 +73,9 @@ h_{\ell}^{[t]} &= \tanh(W_h x^{[t]}\mathbb{I}_{\ell = 1} + U_{h_{\ell}} s_{\ell-1}^{[t]} + b_{h_{\ell}}), \\ t_{\ell}^{[t]} &= \sigma(W_t x^{[t]}\mathbb{I}_{\ell = 1} + U_{t_{\ell}} s_{\ell-1}^{[t]} + b_{t_{\ell}}), \\ c_{\ell}^{[t]} &= \sigma(W_c x^{[t]}\mathbb{I}_{\ell = 1} + U_{c_{\ell}} s_{\ell-1}^{[t]} + b_{c_{\ell}}) -\end{aligned}\]

Forward

rnncell(inp, [state])
source
RecurrentLayers.RHNCellUnitType
RHNCellUnit((input_size => hidden_size)::Pair;
+\end{aligned}\]

Forward

rnncell(inp, [state])
source
RecurrentLayers.RHNCellUnitType
RHNCellUnit((input_size => hidden_size)::Pair;
     init_kernel = glorot_uniform,
-    bias = true)
source
RecurrentLayers.MUT1CellType
MUT1Cell((input_size => hidden_size);
+    bias = true)
source
RecurrentLayers.MUT1CellType
MUT1Cell((input_size => hidden_size);
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Mutated unit 1 cell. See MUT1 for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -84,7 +84,7 @@ h_{t+1} &= \tanh(U_h (r \odot h_t) + \tanh(W_h x_t) + b_h) \odot z \\ &\quad + h_t \odot (1 - z). \end{aligned}\]

Forward

mutcell(inp, state)
-mutcell(inp)

Arguments

  • inp: The input to the mutcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MUTCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.

source
RecurrentLayers.MUT2CellType
MUT2Cell((input_size => hidden_size);
+mutcell(inp)

Arguments

  • inp: The input to the mutcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MUTCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.

source
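When no state is passed, the cell starts from a zero state; a minimal sketch with illustrative sizes:

using Flux, RecurrentLayers

mutcell = MUT1Cell(3 => 5)                     # illustrative sizes
inp = rand(Float32, 3)
output, new_state = mutcell(inp)               # zero initial state
output2, new_state2 = mutcell(inp, new_state)  # carry the state into the next step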
RecurrentLayers.MUT2CellType
MUT2Cell((input_size => hidden_size);
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Mutated unit 2 cell. See MUT2 for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -93,7 +93,7 @@ h_{t+1} &= \tanh(U_h (r \odot h_t) + W_h x_t + b_h) \odot z \\ &\quad + h_t \odot (1 - z). \end{aligned}\]

Forward

mutcell(inp, state)
-mutcell(inp)

Arguments

  • inp: The input to the mutcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MUTCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.

source
RecurrentLayers.MUT3CellType
MUT3Cell((input_size => hidden_size);
+mutcell(inp)

Arguments

  • inp: The input to the mutcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MUTCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.

source
RecurrentLayers.MUT3CellType
MUT3Cell((input_size => hidden_size);
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Mutated unit 3 cell. See MUT3 for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -102,7 +102,7 @@ h_{t+1} &= \tanh(U_h (r \odot h_t) + W_h x_t + b_h) \odot z \\ &\quad + h_t \odot (1 - z). \end{aligned}\]

Forward

mutcell(inp, state)
-mutcell(inp)

Arguments

  • inp: The input to the mutcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MUTCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.

source
RecurrentLayers.SCRNCellType
SCRNCell((input_size => hidden_size)::Pair;
+mutcell(inp)

Arguments

  • inp: The input to the mutcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the MUTCell. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.

source
RecurrentLayers.SCRNCellType
SCRNCell((input_size => hidden_size)::Pair;
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true,
@@ -111,7 +111,7 @@
 h_t &= \sigma(W_h s_t + U_h h_{t-1} + b_h), \\
 y_t &= f(U_y h_t + W_y s_t)
 \end{aligned}\]

Forward

scrncell(inp, (state, cstate))
-scrncell(inp)

Arguments

  • inp: The input to the scrncell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the SCRNCell. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where output = new_state is the new hidden state and state = (new_state, new_cstate) is the new hidden and cell state. They are tensors of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.PeepholeLSTMCellType
PeepholeLSTMCell((input_size => hidden_size)::Pair;
+scrncell(inp)

Arguments

  • inp: The input to the scrncell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the SCRNCell. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where output = new_state is the new hidden state and state = (new_state, new_cstate) is the new hidden and cell state. They are tensors of size hidden_size or hidden_size x batch_size.
source
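A sketch of a single step through an SCRNCell with explicit hidden and context states; the sizes and zero initialization are illustrative.

using Flux, RecurrentLayers

scrncell = SCRNCell(3 => 5)                    # illustrative sizes
inp = rand(Float32, 3, 6)                      # one time step, batch of 6
state = zeros(Float32, 5, 6)
cstate = zeros(Float32, 5, 6)
output, (new_state, new_cstate) = scrncell(inp, (state, cstate))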
RecurrentLayers.PeepholeLSTMCellType
PeepholeLSTMCell((input_size => hidden_size)::Pair;
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Peephole long short term memory cell. See PeepholeLSTM for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -121,14 +121,14 @@ c_t &= f_t \odot c_{t-1} + i_t \odot \sigma_c(W_c x_t + b_c), \\ h_t &= o_t \odot \sigma_h(c_t). \end{aligned}\]

Forward

peepholelstmcell(inp, (state, cstate))
-peepholelstmcell(inp)

Arguments

  • inp: The input to the peepholelstmcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the PeepholeLSTMCell. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where output = new_state is the new hidden state and state = (new_state, new_cstate) is the new hidden and cell state. They are tensors of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.FastRNNCellType
FastRNNCell((input_size => hidden_size), [activation];
+peepholelstmcell(inp)

Arguments

  • inp: The input to the peepholelstmcell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the PeepholeLSTMCell. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where output = new_state is the new hidden state and state = (new_state, new_cstate) is the new hidden and cell state. They are tensors of size hidden_size or hidden_size x batch_size.
source
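As with a standard LSTM cell, the forward pass takes and returns a (state, cstate) tuple; a minimal sketch with illustrative sizes:

using Flux, RecurrentLayers

cell = PeepholeLSTMCell(3 => 5)                # illustrative sizes
inp = rand(Float32, 3, 4)                      # one time step, batch of 4
state, cstate = zeros(Float32, 5, 4), zeros(Float32, 5, 4)
output, (new_state, new_cstate) = cell(inp, (state, cstate))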
RecurrentLayers.FastRNNCellType
FastRNNCell((input_size => hidden_size), [activation];
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Fast recurrent neural network cell. See FastRNN for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • activation: the activation function, defaults to tanh_fast
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} \tilde{h}_t &= \sigma(W_h x_t + U_h h_{t-1} + b), \\ h_t &= \alpha \tilde{h}_t + \beta h_{t-1} \end{aligned}\]

Forward

fastrnncell(inp, state)
-fastrnncell(inp)

Arguments

  • inp: The input to the fastrnncell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the FastRNN. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
RecurrentLayers.FastGRNNCellType
FastGRNNCell((input_size => hidden_size), [activation];
+fastrnncell(inp)

Arguments

  • inp: The input to the fastrnncell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the FastRNN. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
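Cells can be unrolled manually over a sequence by feeding the returned state back in; a two-step sketch with illustrative sizes and random data:

using Flux, RecurrentLayers

cell = FastRNNCell(3 => 5)                     # activation defaults to tanh_fast; sizes illustrative
x1, x2 = rand(Float32, 3, 4), rand(Float32, 3, 4)   # two time steps, batch of 4
out1, state1 = cell(x1)                        # zero initial state
out2, state2 = cell(x2, state1)                # carry the state forward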
RecurrentLayers.FastGRNNCellType
FastGRNNCell((input_size => hidden_size), [activation];
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true)

Fast gated recurrent neural network cell. See FastGRNN for a layer that processes entire sequences.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • activation: the activation function, defaults to tanh_fast
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} @@ -136,4 +136,4 @@ \tilde{h}_t &= \tanh(W_h x_t + U_h h_{t-1} + b_h), \\ h_t &= \big((\zeta (1 - z_t) + \nu) \odot \tilde{h}_t\big) + z_t \odot h_{t-1} \end{aligned}\]

Forward

fastgrnncell(inp, state)
-fastgrnncell(inp)

Arguments

  • inp: The input to the fastgrnncell. It should be a vector of size input_size or a matrix of size input_size x batch_size.
  • state: The hidden state of the FastGRNN. It should be a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • A tuple (output, state), where both elements are given by the updated state new_state, a tensor of size hidden_size or hidden_size x batch_size.
source
+fastgrnncell(inp)

Arguments

Returns

source diff --git a/dev/api/layers/index.html b/dev/api/layers/index.html index 6b2c717..f034dd9 100644 --- a/dev/api/layers/index.html +++ b/dev/api/layers/index.html @@ -6,24 +6,24 @@ c_t &= i_t \odot \tilde{c}_t + f_t \odot c_{t-1}, \\ h_t &= g(c_t) \end{aligned}\]

Forward

ran(inp, (state, cstate))
-ran(inp)

Arguments

Returns

source
RecurrentLayers.IndRNNType
IndRNN((input_size => hidden_size)::Pair, σ = relu;
+ran(inp)

Arguments

  • inp: The input to the ran. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the RAN. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
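At the layer level the whole sequence is processed in one call; a minimal sketch, with the feature size, sequence length and batch size chosen purely for illustration:

using Flux, RecurrentLayers

ran = RAN(3 => 5)                              # illustrative sizes
inp = rand(Float32, 3, 10, 8)                  # input_size x len x batch_size
out = ran(inp)                                 # starts from zero states
size(out)                                      # (5, 10, 8)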
RecurrentLayers.IndRNNType
IndRNN((input_size => hidden_size)::Pair, σ = relu;
     kwargs...)

Independently recurrent network. See IndRNNCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • σ: activation function. Default is relu
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\mathbf{h}_{t} = \sigma(\mathbf{W} \mathbf{x}_t + \mathbf{u} \odot \mathbf{h}_{t-1} + \mathbf{b})\]

Forward

indrnn(inp, state)
-indrnn(inp)

Arguments

  • inp: The input to the indrnn. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the IndRNN. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.LightRUType
LightRU((input_size => hidden_size)::Pair; kwargs...)

Light recurrent unit network. See LightRUCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +indrnn(inp)

Arguments

  • inp: The input to the indrnn. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the IndRNN. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
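For a single (unbatched) sequence the input is simply input_size x len; a sketch with illustrative sizes:

using Flux, RecurrentLayers

indrnn = IndRNN(3 => 5)                        # illustrative sizes
inp = rand(Float32, 3, 10)                     # input_size x len
out = indrnn(inp)                              # hidden states for every time step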
RecurrentLayers.LightRUType
LightRU((input_size => hidden_size)::Pair; kwargs...)

Light recurrent unit network. See LightRUCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} \tilde{h}_t &= \tanh(W_h x_t), \\ f_t &= \delta(W_f x_t + U_f h_{t-1} + b_f), \\ h_t &= (1 - f_t) \odot h_{t-1} + f_t \odot \tilde{h}_t. \end{aligned}\]

Forward

lightru(inp, state)
-lightru(inp)

Arguments

  • inp: The input to the lightru. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the LightRU. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.LiGRUType
LiGRU((input_size => hidden_size)::Pair; kwargs...)

Light gated recurrent network. The implementation does not include the batch normalization as described in the original paper. See LiGRUCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +lightru(inp)

Arguments

  • inp: The input to the lightru. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the LightRU. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.LiGRUType
LiGRU((input_size => hidden_size)::Pair; kwargs...)

Light gated recurrent network. The implementation does not include the batch normalization as described in the original paper. See LiGRUCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} z_t &= \sigma(W_z x_t + U_z h_{t-1}), \\ \tilde{h}_t &= \text{ReLU}(W_h x_t + U_h h_{t-1}), \\ h_t &= z_t \odot h_{t-1} + (1 - z_t) \odot \tilde{h}_t \end{aligned}\]

Forward

ligru(inp, state)
-ligru(inp)

Arguments

  • inp: The input to the ligru. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the LiGRU. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.MGUType
MGU((input_size => hidden_size)::Pair; kwargs...)

Minimal gated unit network. See MGUCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +ligru(inp)

Arguments

  • inp: The input to the ligru. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the LiGRU. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.MGUType
MGU((input_size => hidden_size)::Pair; kwargs...)

Minimal gated unit network. See MGUCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f), \\ \tilde{h}_t &= \tanh(W_h x_t + U_h (f_t \odot h_{t-1}) + b_h), \\ h_t &= (1 - f_t) \odot h_{t-1} + f_t \odot \tilde{h}_t \end{aligned}\]

Forward

mgu(inp, state)
-mgu(inp)

Arguments

  • inp: The input to the mgu. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MGU. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.NASType
NAS((input_size => hidden_size)::Pair; kwargs...)

Neural Architecture Search unit. See NASCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +mgu(inp)

Arguments

  • inp: The input to the mgu. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MGU. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
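Because the layer returns the hidden state for every time step, it composes naturally with downstream Flux layers. Below is a sketch of a small sequence classifier; keeping only the last time step, the two output classes, and all sizes are illustrative assumptions.

using Flux, RecurrentLayers

model = Chain(
    MGU(3 => 5),                  # illustrative sizes
    x -> x[:, end, :],            # keep only the last hidden state of each sequence
    Dense(5 => 2),                # map to 2 output classes
    softmax)
inp = rand(Float32, 3, 10, 8)     # input_size x len x batch_size
out = model(inp)                  # 2 x 8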
RecurrentLayers.NASType
NAS((input_size => hidden_size)::Pair; kwargs...)

Neural Architecture Search unit. See NASCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} \text{First Layer Outputs:} & \\ o_1 &= \sigma(W_i^{(1)} x_t + W_h^{(1)} h_{t-1} + b^{(1)}), \\ o_2 &= \text{ReLU}(W_i^{(2)} x_t + W_h^{(2)} h_{t-1} + b^{(2)}), \\ @@ -48,31 +48,31 @@ l_5 &= \tanh(l_3 + l_4) \\ h_{\text{new}} &= \tanh(c_{\text{new}} \cdot l_5) \end{aligned}\]

Forward

nas(inp, (state, cstate))
-nas(inp)

Arguments

  • inp: The input to the nas. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the NAS. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.RHNType
RHN((input_size => hidden_size), depth=3; kwargs...)

Recurrent highway network. See RHNCellUnit for the unit component of this layer. See RHNCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • depth: depth of the recurrence. Default is 3
  • couple_carry: couples the carry gate and the transform gate. Default true
  • init_kernel: initializer for the input to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +nas(inp)

Arguments

  • inp: The input to the nas. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the NAS. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.RHNType
RHN((input_size => hidden_size), depth=3; kwargs...)

Recurrent highway network. See RHNCellUnit for the unit component of this layer. See RHNCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • depth: depth of the recurrence. Default is 3
  • couple_carry: couples the carry gate and the transform gate. Default true
  • init_kernel: initializer for the input to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} s_{\ell}^{[t]} &= h_{\ell}^{[t]} \odot t_{\ell}^{[t]} + s_{\ell-1}^{[t]} \odot c_{\ell}^{[t]}, \\ \text{where} \\ h_{\ell}^{[t]} &= \tanh(W_h x^{[t]}\mathbb{I}_{\ell = 1} + U_{h_{\ell}} s_{\ell-1}^{[t]} + b_{h_{\ell}}), \\ t_{\ell}^{[t]} &= \sigma(W_t x^{[t]}\mathbb{I}_{\ell = 1} + U_{t_{\ell}} s_{\ell-1}^{[t]} + b_{t_{\ell}}), \\ c_{\ell}^{[t]} &= \sigma(W_c x^{[t]}\mathbb{I}_{\ell = 1} + U_{c_{\ell}} s_{\ell-1}^{[t]} + b_{c_{\ell}}) -\end{aligned}\]

source
RecurrentLayers.MUT1Type
MUT1((input_size => hidden_size); kwargs...)

Mutated unit 1 network. See MUT1Cell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +\end{aligned}\]

source
RecurrentLayers.MUT1Type
MUT1((input_size => hidden_size); kwargs...)

Mutated unit 1 network. See MUT1Cell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} z &= \sigma(W_z x_t + b_z), \\ r &= \sigma(W_r x_t + U_r h_t + b_r), \\ h_{t+1} &= \tanh(U_h (r \odot h_t) + \tanh(W_h x_t) + b_h) \odot z \\ &\quad + h_t \odot (1 - z). \end{aligned}\]

Forward

mut(inp, state)
-mut(inp)

Arguments

  • inp: The input to the mut. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MUT. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.MUT2Type
MUT2((input_size => hidden_size); kwargs...)

Mutated unit 2 network. See MUT2Cell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +mut(inp)

Arguments

  • inp: The input to the mut. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MUT. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.MUT2Type
MUT2((input_size => hidden_size); kwargs...)

Mutated unit 2 network. See MUT2Cell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} z &= \sigma(W_z x_t + U_z h_t + b_z), \\ r &= \sigma(x_t + U_r h_t + b_r), \\ h_{t+1} &= \tanh(U_h (r \odot h_t) + W_h x_t + b_h) \odot z \\ &\quad + h_t \odot (1 - z). \end{aligned}\]

Forward

mut(inp, state)
-mut(inp)

Arguments

  • inp: The input to the mut. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MUT. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.MUT3Type
MUT3((input_size => hidden_size); kwargs...)

Mutated unit 3 network. See MUT3Cell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +mut(inp)

Arguments

  • inp: The input to the mut. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MUT. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.MUT3Type
MUT3((input_size => hidden_size); kwargs...)

Mutated unit 3 network. See MUT3Cell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} z &= \sigma(W_z x_t + U_z \tanh(h_t) + b_z), \\ r &= \sigma(W_r x_t + U_r h_t + b_r), \\ h_{t+1} &= \tanh(U_h (r \odot h_t) + W_h x_t + b_h) \odot z \\ &\quad + h_t \odot (1 - z). \end{aligned}\]

Forward

mut(inp, state)
-mut(inp)

Arguments

  • inp: The input to the mut. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MUT. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.SCRNType
SCRN((input_size => hidden_size)::Pair;
+mut(inp)

Arguments

  • inp: The input to the mut. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the MUT. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.SCRNType
SCRN((input_size => hidden_size)::Pair;
     init_kernel = glorot_uniform,
     init_recurrent_kernel = glorot_uniform,
     bias = true,
@@ -81,20 +81,20 @@
 h_t &= \sigma(W_h s_t + U_h h_{t-1} + b_h), \\
 y_t &= f(U_y h_t + W_y s_t)
 \end{aligned}\]

Forward

scrn(inp, (state, cstate))
-scrn(inp)

Arguments

  • inp: The input to the scrn. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the SCRN. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.PeepholeLSTMType
PeepholeLSTM((input_size => hidden_size)::Pair; kwargs...)

Peephole long short term memory network. See PeepholeLSTMCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{align} +scrn(inp)

Arguments

  • inp: The input to the scrn. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the SCRN. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.PeepholeLSTMType
PeepholeLSTM((input_size => hidden_size)::Pair; kwargs...)

Peephole long short term memory network. See PeepholeLSTMCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{align} f_t &= \sigma_g(W_f x_t + U_f c_{t-1} + b_f), \\ i_t &= \sigma_g(W_i x_t + U_i c_{t-1} + b_i), \\ o_t &= \sigma_g(W_o x_t + U_o c_{t-1} + b_o), \\ c_t &= f_t \odot c_{t-1} + i_t \odot \sigma_c(W_c x_t + b_c), \\ h_t &= o_t \odot \sigma_h(c_t). \end{align}\]

Forward

peepholelstm(inp, (state, cstate))
-peepholelstm(inp)

Arguments

  • inp: The input to the peepholelstm. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the PeepholeLSTM. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.FastRNNType
FastRNN((input_size => hidden_size), [activation]; kwargs...)

Fast recurrent neural network. See FastRNNCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • activation: the activation function, defaults to tanh_fast
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +peepholelstm(inp)

Arguments

  • inp: The input to the peepholelstm. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • (state, cstate): A tuple containing the hidden and cell states of the PeepholeLSTM. They should be vectors of size hidden_size or matrices of size hidden_size x batch_size. If not provided, they are assumed to be vectors of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
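A sketch of running a batch of sequences through PeepholeLSTM with explicitly provided initial states; all sizes are illustrative.

using Flux, RecurrentLayers

peepholelstm = PeepholeLSTM(3 => 5)            # illustrative sizes
inp = rand(Float32, 3, 10, 4)                  # input_size x len x batch_size
state = zeros(Float32, 5, 4)
cstate = zeros(Float32, 5, 4)
out = peepholelstm(inp, (state, cstate))       # hidden_size x len x batch_size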
RecurrentLayers.FastRNNType
FastRNN((input_size => hidden_size), [activation]; kwargs...)

Fast recurrent neural network. See FastRNNCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • activation: the activation function, defaults to tanh_fast
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} \tilde{h}_t &= \sigma(W_h x_t + U_h h_{t-1} + b), \\ h_t &= \alpha \tilde{h}_t + \beta h_{t-1} \end{aligned}\]

Forward

fastrnn(inp, state)
-fastrnn(inp)

Arguments

  • inp: The input to the fastrnn. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the FastRNN. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
RecurrentLayers.FastGRNNType
FastGRNN((input_size => hidden_size), [activation]; kwargs...)

Fast gated recurrent neural network. See FastGRNNCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • activation: the activation function, defaults to tanh_fast
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} +fastrnn(inp)

Arguments

  • inp: The input to the fastrnn. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the FastRNN. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
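Like any other Flux layer, the network is differentiable end to end. The following is a hedged sketch of a single gradient step: the random data, the mean-squared-error objective, and the Adam learning rate are illustrative stand-ins for a real training setup.

using Flux, RecurrentLayers

model = FastRNN(3 => 5)                        # illustrative sizes
inp = rand(Float32, 3, 10, 4)                  # input_size x len x batch_size
target = rand(Float32, 5, 10, 4)               # dummy targets with the output shape
opt_state = Flux.setup(Adam(1e-2), model)
loss, grads = Flux.withgradient(m -> Flux.mse(m(inp), target), model)
Flux.update!(opt_state, model, grads[1])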
RecurrentLayers.FastGRNNType
FastGRNN((input_size => hidden_size), [activation]; kwargs...)

Fast gated recurrent neural network. See FastGRNNCell for a layer that processes a single sequence.

Arguments

  • input_size => hidden_size: input and inner dimension of the layer
  • activation: the activation function, defaults to tanh_fast
  • init_kernel: initializer for the input to hidden weights
  • init_recurrent_kernel: initializer for the hidden to hidden weights
  • bias: include a bias or not. Default is true

Equations

\[\begin{aligned} z_t &= \sigma(W_z x_t + U_z h_{t-1} + b_z), \\ \tilde{h}_t &= \tanh(W_h x_t + U_h h_{t-1} + b_h), \\ h_t &= \big((\zeta (1 - z_t) + \nu) \odot \tilde{h}_t\big) + z_t \odot h_{t-1} \end{aligned}\]

Forward

fastgrnn(inp, state)
-fastgrnn(inp)

Arguments

  • inp: The input to the fastgrnn. It should be a vector of size input_size x len or a matrix of size input_size x len x batch_size.
  • state: The hidden state of the FastGRNN. If given, it is a vector of size hidden_size or a matrix of size hidden_size x batch_size. If not provided, it is assumed to be a vector of zeros, initialized by Flux.initialstates.

Returns

  • New hidden states new_states as an array of size hidden_size x len x batch_size.
source
+fastgrnn(inp)

Arguments

Returns

source diff --git a/dev/api/wrappers/index.html b/dev/api/wrappers/index.html index 4956a4a..9e1e98c 100644 --- a/dev/api/wrappers/index.html +++ b/dev/api/wrappers/index.html @@ -1,3 +1,3 @@ Wrappers · RecurrentLayers.jl

Wrappers

RecurrentLayers.StackedRNNType
StackedRNN(rlayer, (input_size, hidden_size), args...;
-    num_layers = 1, kwargs...)

Constructs a stack of recurrent layers given the recurrent layer type.

Arguments:

  • rlayer: Any recurrent layer such as MGU, RHN, etc., or Flux.RNN, Flux.LSTM, etc.
  • input_size: Defines the input dimension for the first layer.
  • hidden_size: Defines the dimension of the hidden layer.
  • num_layers: The number of layers to stack. Default is 1.
  • args...: Additional positional arguments passed to the recurrent layer.
  • kwargs...: Additional keyword arguments passed to the recurrent layers.

Returns: A StackedRNN instance containing the specified number of RNN layers and their initial states.

source
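A minimal sketch of building and calling a two-layer stack, assuming the wrapper accepts the same sequence input as the layers it wraps; the choice of MGU, the sizes, and num_layers = 2 are illustrative.

using Flux, RecurrentLayers

stack = StackedRNN(MGU, (3, 5); num_layers = 2)   # two stacked MGU layers (illustrative)
inp = rand(Float32, 3, 10, 4)                     # input_size x len x batch_size
out = stack(inp)                                  # hidden states from the top layer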
+ num_layers = 1, kwargs...)

Constructs a stack of recurrent layers given the recurrent layer type.

Arguments:

Returns: A StackedRNN instance containing the specified number of RNN layers and their initial states.

source diff --git a/dev/index.html b/dev/index.html index 8587eee..84d7fe5 100644 --- a/dev/index.html +++ b/dev/index.html @@ -1,2 +1,2 @@ -Home · RecurrentLayers.jl

RecurrentLayers

RecurrentLayers.jl extends Flux.jl's recurrent layer offering by providing implementations of bleeding-edge recurrent layers not commonly available in base deep learning libraries. It is designed for seamless integration with the larger Flux ecosystem, enabling researchers and practitioners to leverage the latest developments in recurrent neural networks.

Implemented layers

  • Minimal gated unit as MGUCell arxiv
  • Light gated recurrent unit as LiGRUCell arxiv
  • Independently recurrent neural networks as IndRNNCell arxiv
  • Recurrent additive networks as RANCell arxiv
  • Recurrent highway network as RHNCell arxiv
  • Light recurrent unit as LightRUCell pub
  • Neural architecture search unit NASCell arxiv
  • Evolving recurrent neural networks as MUT1Cell, MUT2Cell, MUT3Cell pub
  • Structurally constrained recurrent neural network as SCRNCell arxiv
  • Peephole long short term memory as PeepholeLSTMCell pub
  • FastRNNCell and FastGRNNCell arxiv

Contributing

Contributions are always welcome! We specifically look for:

  • Recurrent cells you would like to see implemented
  • Benchmarks
  • Fixes for any bugs/errors
  • Documentation, in any form: examples, how tos, docstrings
+Home · RecurrentLayers.jl

RecurrentLayers

RecurrentLayers.jl extends Flux.jl's recurrent layer offering by providing implementations of bleeding-edge recurrent layers not commonly available in base deep learning libraries. It is designed for seamless integration with the larger Flux ecosystem, enabling researchers and practitioners to leverage the latest developments in recurrent neural networks.

Implemented layers

  • Minimal gated unit as MGUCell arxiv
  • Light gated recurrent unit as LiGRUCell arxiv
  • Independently recurrent neural networks as IndRNNCell arxiv
  • Recurrent additive networks as RANCell arxiv
  • Recurrent highway network as RHNCell arxiv
  • Light recurrent unit as LightRUCell pub
  • Neural architecture search unit NASCell arxiv
  • Evolving recurrent neural networks as MUT1Cell, MUT2Cell, MUT3Cell pub
  • Structurally constrained recurrent neural network as SCRNCell arxiv
  • Peephole long short term memory as PeepholeLSTMCell pub
  • FastRNNCell and FastGRNNCell arxiv

Contributing

Contributions are always welcome! We specifically look for:

  • Recurrent cells you would like to see implemented
  • Benchmarks
  • Fixes for any bugs/errors
  • Documentation, in any form: examples, how tos, docstrings
diff --git a/dev/roadmap/index.html b/dev/roadmap/index.html index 7e9ce02..2ce5cff 100644 --- a/dev/roadmap/index.html +++ b/dev/roadmap/index.html @@ -1,2 +1,2 @@ -Roadmap · RecurrentLayers.jl

Roadmap

This page documents some planned work for RecurrentLayers.jl. Future work for this library includes additional cells such as:

  • FastRNNs and FastGRNNs (current focus) arxiv
  • Unitary recurrent neural networks arxiv
  • Modern recurrent neural networks such as LRU and minLSTM/minGRU
  • Quasi recurrent neural networks arxiv

Additionally, some cell-independent architectures are planned that expand the capabilities of recurrent models and could theoretically take any cell:

An implementation of these would ideally look like, for example, FastSlow(RNNCell, input_size => hidden_size). More details on this soon!

+Roadmap · RecurrentLayers.jl

Roadmap

This page documents some planned work for RecurrentLayers.jl. Future work for this library includes additional cells such as:

  • FastRNNs and FastGRNNs (current focus) arxiv
  • Unitary recurrent neural networks arxiv
  • Modern recurrent neural networks such as LRU and minLSTM/minGRU
  • Quasi recurrent neural networks arxiv

Additionally, some cell-independent architectures are planned that expand the capabilities of recurrent models and could theoretically take any cell:

An implementation of these would ideally look like, for example, FastSlow(RNNCell, input_size => hidden_size). More details on this soon!