diff --git a/previews/PR613/.documenter-siteinfo.json b/previews/PR613/.documenter-siteinfo.json
index 8674dd89..a9975401 100644
--- a/previews/PR613/.documenter-siteinfo.json
+++ b/previews/PR613/.documenter-siteinfo.json
@@ -1 +1 @@
-{"documenter":{"julia_version":"1.9.4","generation_timestamp":"2024-11-16T18:44:18","documenter_version":"1.7.0"}}
\ No newline at end of file
+{"documenter":{"julia_version":"1.9.4","generation_timestamp":"2024-11-16T20:23:28","documenter_version":"1.7.0"}}
\ No newline at end of file
diff --git a/previews/PR613/audio/index.html b/previews/PR613/audio/index.html
index 429bdce5..5f57b7ce 100644
--- a/previews/PR613/audio/index.html
+++ b/previews/PR613/audio/index.html
@@ -20,7 +20,7 @@
 true
 
 julia&gt; hann_window(N) ≈ hamming_window(N; α=0.5f0, β=0.5f0)
-true</code></pre><p><strong>Returns:</strong></p><p>Vector of length <code>window_length</code> and eltype <code>T</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/stft.jl#L69-L123">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hamming_window" href="#NNlib.hamming_window"><code>NNlib.hamming_window</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hamming_window(
+true</code></pre><p><strong>Returns:</strong></p><p>Vector of length <code>window_length</code> and eltype <code>T</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/stft.jl#L69-L123">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hamming_window" href="#NNlib.hamming_window"><code>NNlib.hamming_window</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hamming_window(
     window_length::Int, ::Type{T} = Float32; periodic::Bool = true,
     α::T = T(0.54), β::T = T(0.46),
 ) where T &lt;: Real</code></pre><p>Hamming window function (ref: <a href="https://en.wikipedia.org/wiki/Window_function#Hann_and_Hamming_windows">Window function § Hann and Hamming windows - Wikipedia</a>). Generalized version of <code>hann_window</code>.</p><p><span>$w[n] = \alpha - \beta \cos(\frac{2 \pi n}{N - 1})$</span></p><p>Where <span>$N$</span> is the window length.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(hamming_window(100); width=30, height=10)
@@ -39,15 +39,15 @@
      ⠀0⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀100⠀</code></pre><p><strong>Arguments:</strong></p><ul><li><code>window_length::Int</code>: Size of the window.</li><li><code>::Type{T}</code>: Elemet type of the window.</li></ul><p><strong>Keyword Arguments:</strong></p><ul><li><p><code>periodic::Bool</code>: If <code>true</code> (default), returns a window to be used as   periodic function. If <code>false</code>, return a symmetric window.</p><p>Following always holds:</p></li></ul><pre><code class="language-julia-repl hljs">julia&gt; N = 256;
 
 julia&gt; hamming_window(N; periodic=true) ≈ hamming_window(N + 1; periodic=false)[1:end - 1]
-true</code></pre><ul><li><code>α::Real</code>: Coefficient α in the equation above.</li><li><code>β::Real</code>: Coefficient β in the equation above.</li></ul><p><strong>Returns:</strong></p><p>Vector of length <code>window_length</code> and eltype <code>T</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/stft.jl#L1-L56">source</a></section></article><h2 id="Spectral"><a class="docs-heading-anchor" href="#Spectral">Spectral</a><a id="Spectral-1"></a><a class="docs-heading-anchor-permalink" href="#Spectral" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.stft" href="#NNlib.stft"><code>NNlib.stft</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">stft(x;
+true</code></pre><ul><li><code>α::Real</code>: Coefficient α in the equation above.</li><li><code>β::Real</code>: Coefficient β in the equation above.</li></ul><p><strong>Returns:</strong></p><p>Vector of length <code>window_length</code> and eltype <code>T</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/stft.jl#L1-L56">source</a></section></article><h2 id="Spectral"><a class="docs-heading-anchor" href="#Spectral">Spectral</a><a id="Spectral-1"></a><a class="docs-heading-anchor-permalink" href="#Spectral" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.stft" href="#NNlib.stft"><code>NNlib.stft</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">stft(x;
     n_fft::Int, hop_length::Int = n_fft ÷ 4, window = nothing,
     center::Bool = true, normalized::Bool = false,
-)</code></pre><p>Short-time Fourier transform (STFT).</p><p>The STFT computes the Fourier transform of short overlapping windows of the input, giving frequency components of the signal as they change over time.</p><p><span>$Y[\omega, m] = \sum_{k = 0}^{N - 1} \text{window}[k] \text{input}[m \times \text{hop length} + k] \exp(-j \frac{2 \pi \omega k}{\text{n fft}})$</span></p><p>where <span>$N$</span> is the window length, <span>$\omega$</span> is the frequency <span>$0 \le \omega &lt; \text{n fft}$</span> and <span>$m$</span> is the index of the sliding window.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Input, must be either a 1D time sequence (<code>(L,)</code> shape)   or a 2D batch of time sequence (<code>(L, B)</code> shape).</li></ul><p><strong>Keyword Arguments:</strong></p><ul><li><code>n_fft::Int</code>: Size of Fourier transform.</li><li><code>hop_length::Int</code>: Distance between neighboring sliding window frames.</li><li><code>window</code>: Optional window function to apply.   Must be 1D vector <code>0 &lt; length(window) ≤ n_fft</code>.   If window is shorter than <code>n_fft</code>, it is padded with zeros on both sides.   If <code>nothing</code> (default), then no window is applied.</li><li><code>center::Bool</code>: Whether to pad input on both sides so that <span>$t$</span>-th frame   is centered at time <span>$t \times \text{hop length}$</span>.   Padding is done with <code>pad_reflect</code> function.</li><li><code>normalized::Bool</code>: Whether to return normalized STFT,   i.e. multiplied with <span>$\text{n fft}^{-0.5}$</span>.</li></ul><p><strong>Returns:</strong></p><p>Complex array of shape <code>(n_fft, n_frames, B)</code>, where <code>B</code> is the optional batch dimension.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/stft.jl#L130-L170">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.istft" href="#NNlib.istft"><code>NNlib.istft</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">istft(y;
+)</code></pre><p>Short-time Fourier transform (STFT).</p><p>The STFT computes the Fourier transform of short overlapping windows of the input, giving frequency components of the signal as they change over time.</p><p><span>$Y[\omega, m] = \sum_{k = 0}^{N - 1} \text{window}[k] \text{input}[m \times \text{hop length} + k] \exp(-j \frac{2 \pi \omega k}{\text{n fft}})$</span></p><p>where <span>$N$</span> is the window length, <span>$\omega$</span> is the frequency <span>$0 \le \omega &lt; \text{n fft}$</span> and <span>$m$</span> is the index of the sliding window.</p><p><strong>Arguments:</strong></p><ul><li><code>x</code>: Input, must be either a 1D time sequence (<code>(L,)</code> shape)   or a 2D batch of time sequence (<code>(L, B)</code> shape).</li></ul><p><strong>Keyword Arguments:</strong></p><ul><li><code>n_fft::Int</code>: Size of Fourier transform.</li><li><code>hop_length::Int</code>: Distance between neighboring sliding window frames.</li><li><code>window</code>: Optional window function to apply.   Must be 1D vector <code>0 &lt; length(window) ≤ n_fft</code>.   If window is shorter than <code>n_fft</code>, it is padded with zeros on both sides.   If <code>nothing</code> (default), then no window is applied.</li><li><code>center::Bool</code>: Whether to pad input on both sides so that <span>$t$</span>-th frame   is centered at time <span>$t \times \text{hop length}$</span>.   Padding is done with <code>pad_reflect</code> function.</li><li><code>normalized::Bool</code>: Whether to return normalized STFT,   i.e. multiplied with <span>$\text{n fft}^{-0.5}$</span>.</li></ul><p><strong>Returns:</strong></p><p>Complex array of shape <code>(n_fft, n_frames, B)</code>, where <code>B</code> is the optional batch dimension.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/stft.jl#L130-L170">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.istft" href="#NNlib.istft"><code>NNlib.istft</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">istft(y;
     n_fft::Int, hop_length::Int = n_fft ÷ 4, window = nothing,
     center::Bool = true, normalized::Bool = false,
     return_complex::Bool = false,
     original_length::Union{Nothing, Int} = nothing,
-)</code></pre><p>Inverse Short-time Fourier Transform.</p><p>Return the least squares estimation of the original signal</p><p><strong>Arguments:</strong></p><ul><li><code>y</code>: Input complex array in the <code>(n_fft, n_frames, B)</code> shape.   Where <code>B</code> is the optional batch dimension.</li></ul><p><strong>Keyword Arguments:</strong></p><ul><li><code>n_fft::Int</code>: Size of Fourier transform.</li><li><code>hop_length::Int</code>: Distance between neighboring sliding window frames.</li><li><code>window</code>: Window function that was applied to the input of <code>stft</code>.   If <code>nothing</code> (default), then no window was applied.</li><li><code>center::Bool</code>: Whether input to <code>stft</code> was padded on both sides   so that <span>$t$</span>-th frame is centered at time <span>$t \times \text{hop length}$</span>.   Padding is done with <code>pad_reflect</code> function.</li><li><code>normalized::Bool</code>: Whether input to <code>stft</code> was normalized.</li><li><code>return_complex::Bool</code>: Whether the output should be complex,   or if the input should be assumed to derive from a real signal and window.</li><li><code>original_length::Union{Nothing, Int}</code>: Optional size of the first dimension   of the input to <code>stft</code>. Helps restoring the exact <code>stft</code> input size.   Otherwise, the array might be a bit shorter.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/stft.jl#L173-L205">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.power_to_db" href="#NNlib.power_to_db"><code>NNlib.power_to_db</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">power_to_db(s; ref::Real = 1f0, amin::Real = 1f-10, top_db::Real = 80f0)</code></pre><p>Convert a power spectrogram (amplitude squared) to decibel (dB) units.</p><p><strong>Arguments</strong></p><ul><li><code>s</code>: Input power.</li><li><code>ref</code>: Scalar w.r.t. which the input is scaled.</li><li><code>amin</code>: Minimum threshold for <code>s</code>.</li><li><code>top_db</code>: Threshold the output at <code>top_db</code> below the peak:   <code>max.(s_db, maximum(s_db) - top_db)</code>.</li></ul><p><strong>Returns</strong></p><p><code>s_db ~= 10 * log10(s) - 10 * log10(ref)</code></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/spectrogram.jl#L50-L66">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.db_to_power" href="#NNlib.db_to_power"><code>NNlib.db_to_power</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">db_to_power(s_db; ref::Real = 1f0)</code></pre><p>Inverse of <a href="#NNlib.power_to_db"><code>power_to_db</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/spectrogram.jl#L72-L76">source</a></section></article><h2 id="Spectrogram"><a class="docs-heading-anchor" href="#Spectrogram">Spectrogram</a><a id="Spectrogram-1"></a><a class="docs-heading-anchor-permalink" href="#Spectrogram" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.melscale_filterbanks" href="#NNlib.melscale_filterbanks"><code>NNlib.melscale_filterbanks</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">melscale_filterbanks(;
+)</code></pre><p>Inverse Short-time Fourier Transform.</p><p>Return the least squares estimation of the original signal</p><p><strong>Arguments:</strong></p><ul><li><code>y</code>: Input complex array in the <code>(n_fft, n_frames, B)</code> shape.   Where <code>B</code> is the optional batch dimension.</li></ul><p><strong>Keyword Arguments:</strong></p><ul><li><code>n_fft::Int</code>: Size of Fourier transform.</li><li><code>hop_length::Int</code>: Distance between neighboring sliding window frames.</li><li><code>window</code>: Window function that was applied to the input of <code>stft</code>.   If <code>nothing</code> (default), then no window was applied.</li><li><code>center::Bool</code>: Whether input to <code>stft</code> was padded on both sides   so that <span>$t$</span>-th frame is centered at time <span>$t \times \text{hop length}$</span>.   Padding is done with <code>pad_reflect</code> function.</li><li><code>normalized::Bool</code>: Whether input to <code>stft</code> was normalized.</li><li><code>return_complex::Bool</code>: Whether the output should be complex,   or if the input should be assumed to derive from a real signal and window.</li><li><code>original_length::Union{Nothing, Int}</code>: Optional size of the first dimension   of the input to <code>stft</code>. Helps restoring the exact <code>stft</code> input size.   Otherwise, the array might be a bit shorter.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/stft.jl#L173-L205">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.power_to_db" href="#NNlib.power_to_db"><code>NNlib.power_to_db</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">power_to_db(s; ref::Real = 1f0, amin::Real = 1f-10, top_db::Real = 80f0)</code></pre><p>Convert a power spectrogram (amplitude squared) to decibel (dB) units.</p><p><strong>Arguments</strong></p><ul><li><code>s</code>: Input power.</li><li><code>ref</code>: Scalar w.r.t. which the input is scaled.</li><li><code>amin</code>: Minimum threshold for <code>s</code>.</li><li><code>top_db</code>: Threshold the output at <code>top_db</code> below the peak:   <code>max.(s_db, maximum(s_db) - top_db)</code>.</li></ul><p><strong>Returns</strong></p><p><code>s_db ~= 10 * log10(s) - 10 * log10(ref)</code></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/spectrogram.jl#L50-L66">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.db_to_power" href="#NNlib.db_to_power"><code>NNlib.db_to_power</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">db_to_power(s_db; ref::Real = 1f0)</code></pre><p>Inverse of <a href="#NNlib.power_to_db"><code>power_to_db</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/spectrogram.jl#L72-L76">source</a></section></article><h2 id="Spectrogram"><a class="docs-heading-anchor" href="#Spectrogram">Spectrogram</a><a id="Spectrogram-1"></a><a class="docs-heading-anchor-permalink" href="#Spectrogram" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.melscale_filterbanks" href="#NNlib.melscale_filterbanks"><code>NNlib.melscale_filterbanks</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">melscale_filterbanks(;
     n_freqs::Int, n_mels::Int, sample_rate::Int,
     fmin::Float32 = 0f0, fmax::Float32 = Float32(sample_rate ÷ 2))</code></pre><p>Create triangular Mel scale filter banks (ref: <a href="https://en.wikipedia.org/wiki/Mel_scale">Mel scale - Wikipedia</a>). Each column is a filterbank that highlights its own frequency.</p><p><strong>Arguments:</strong></p><ul><li><code>n_freqs::Int</code>: Number of frequencies to highlight.</li><li><code>n_mels::Int</code>: Number of mel filterbanks.</li><li><code>sample_rate::Int</code>: Sample rate of the audio waveform.</li><li><code>fmin::Float32</code>: Minimum frequency in Hz.</li><li><code>fmax::Float32</code>: Maximum frequency in Hz.</li></ul><p><strong>Returns:</strong></p><p>Filterbank matrix of shape <code>(n_freqs, n_mels)</code> where each column is a filterbank.</p><pre><code class="language-julia-repl hljs">julia&gt; n_mels = 8;
 
@@ -77,11 +77,11 @@
      │⡇⡇⢸⠇⢸⡇⠀⣿⠀⠀⢣⡇⠀⠀⠸⣄⠇⠀⠀⠀⠸⡀⡇⠀⠀⠀⠀⠀⢱⠀⠀⠀⠀⠀⠀⠀⠀⠀⠸⡄│
    0 │⣇⣇⣸⣀⣸⣀⣀⣟⣀⣀⣸⣃⣀⣀⣀⣿⣀⣀⣀⣀⣀⣿⣀⣀⣀⣀⣀⣀⣈⣇⣀⣀⣀⣀⣀⣀⣀⣀⣀⣱│
      └────────────────────────────────────────┘
-     ⠀0⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀200⠀</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/mel.jl#L1-L53">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.spectrogram" href="#NNlib.spectrogram"><code>NNlib.spectrogram</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">spectrogram(waveform;
+     ⠀0⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀200⠀</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/mel.jl#L1-L53">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.spectrogram" href="#NNlib.spectrogram"><code>NNlib.spectrogram</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">spectrogram(waveform;
     pad::Int = 0, n_fft::Int, hop_length::Int, window,
     center::Bool = true, power::Real = 2.0,
     normalized::Bool = false, window_normalized::Bool = false,
-)</code></pre><p>Create a spectrogram or a batch of spectrograms from a raw audio signal.</p><p><strong>Arguments</strong></p><ul><li><code>pad::Int</code>:   Then amount of padding to apply on both sides.</li><li><code>window_normalized::Bool</code>:   Whether to normalize the waveform by the window’s L2 energy.</li><li><code>power::Real</code>:   Exponent for the magnitude spectrogram (must be ≥ 0)   e.g., <code>1</code> for magnitude, <code>2</code> for power, etc.   If <code>0</code>, complex spectrum is returned instead.</li></ul><p>See <a href="#NNlib.stft"><code>stft</code></a> for other arguments.</p><p><strong>Returns</strong></p><p>Spectrogram in the shape <code>(T, F, B)</code>, where <code>T</code> is the number of window hops and <code>F = n_fft ÷ 2 + 1</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/audio/spectrogram.jl#L1-L27">source</a></section></article><p>Example:</p><pre><code class="language-julia hljs">using FFTW # &lt;- required for STFT support.
+)</code></pre><p>Create a spectrogram or a batch of spectrograms from a raw audio signal.</p><p><strong>Arguments</strong></p><ul><li><code>pad::Int</code>:   Then amount of padding to apply on both sides.</li><li><code>window_normalized::Bool</code>:   Whether to normalize the waveform by the window’s L2 energy.</li><li><code>power::Real</code>:   Exponent for the magnitude spectrogram (must be ≥ 0)   e.g., <code>1</code> for magnitude, <code>2</code> for power, etc.   If <code>0</code>, complex spectrum is returned instead.</li></ul><p>See <a href="#NNlib.stft"><code>stft</code></a> for other arguments.</p><p><strong>Returns</strong></p><p>Spectrogram in the shape <code>(T, F, B)</code>, where <code>T</code> is the number of window hops and <code>F = n_fft ÷ 2 + 1</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/audio/spectrogram.jl#L1-L27">source</a></section></article><p>Example:</p><pre><code class="language-julia hljs">using FFTW # &lt;- required for STFT support.
 using NNlib
 using FileIO
 using Makie, CairoMakie
@@ -104,4 +104,4 @@
 fb = melscale_filterbanks(; n_freqs, n_mels=128, sample_rate=Int(sampling_rate))
 mel_spec = permutedims(spec, (2, 1, 3)) ⊠ fb # (time, n_mels)
 fig = heatmap(NNlib.power_to_db(mel_spec)[:, :, 1])
-save(&quot;mel-spectrogram.png&quot;, fig)</code></pre><table><tr><th style="text-align: center">Waveform</th><th style="text-align: center">Spectrogram</th><th style="text-align: center">Mel Spectrogram</th></tr><tr><td style="text-align: center"><img src="../waveform.png" alt/></td><td style="text-align: center"><img src="../spectrogram.png" alt/></td><td style="text-align: center"><img src="../mel-spectrogram.png" alt/></td></tr></table></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../reference/">« Reference</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="catppuccin-latte">catppuccin-latte</option><option value="catppuccin-frappe">catppuccin-frappe</option><option value="catppuccin-macchiato">catppuccin-macchiato</option><option value="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.7.0 on <span class="colophon-date" title="Saturday 16 November 2024 18:44">Saturday 16 November 2024</span>. Using Julia version 1.9.4.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+save(&quot;mel-spectrogram.png&quot;, fig)</code></pre><table><tr><th style="text-align: center">Waveform</th><th style="text-align: center">Spectrogram</th><th style="text-align: center">Mel Spectrogram</th></tr><tr><td style="text-align: center"><img src="../waveform.png" alt/></td><td style="text-align: center"><img src="../spectrogram.png" alt/></td><td style="text-align: center"><img src="../mel-spectrogram.png" alt/></td></tr></table></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../reference/">« Reference</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="catppuccin-latte">catppuccin-latte</option><option value="catppuccin-frappe">catppuccin-frappe</option><option value="catppuccin-macchiato">catppuccin-macchiato</option><option value="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.7.0 on <span class="colophon-date" title="Saturday 16 November 2024 20:23">Saturday 16 November 2024</span>. Using Julia version 1.9.4.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/previews/PR613/index.html b/previews/PR613/index.html
index 4e5e2468..a14a7b88 100644
--- a/previews/PR613/index.html
+++ b/previews/PR613/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Home · NNlib.jl</title><meta name="title" content="Home · NNlib.jl"/><meta property="og:title" content="Home · NNlib.jl"/><meta property="twitter:title" content="Home · NNlib.jl"/><meta name="description" content="Documentation for NNlib.jl."/><meta property="og:description" content="Documentation for NNlib.jl."/><meta property="twitter:description" content="Documentation for NNlib.jl."/><meta property="og:url" content="https://fluxml.ai/NNlib.jl/stable/"/><meta property="twitter:url" content="https://fluxml.ai/NNlib.jl/stable/"/><link rel="canonical" href="https://fluxml.ai/NNlib.jl/stable/"/><script data-outdated-warner src="assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="assets/documenter.js"></script><script src="search_index.js"></script><script src="siteinfo.js"></script><script src="../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-mocha.css" data-theme-name="catppuccin-mocha"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-macchiato.css" data-theme-name="catppuccin-macchiato"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-frappe.css" data-theme-name="catppuccin-frappe"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-latte.css" data-theme-name="catppuccin-latte"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="assets/themeswap.js"></script><link href="assets/flux.css" rel="stylesheet" type="text/css"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href><img src="assets/logo.png" alt="NNlib.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href>NNlib.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li class="is-active"><a class="tocitem" href>Home</a></li><li><a class="tocitem" href="reference/">Reference</a></li><li><a class="tocitem" href="audio/">Audio</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Home</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Home</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/FluxML/NNlib.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/FluxML/NNlib.jl/blob/master/docs/src/index.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="NNlib.jl"><a class="docs-heading-anchor" href="#NNlib.jl">NNlib.jl</a><a id="NNlib.jl-1"></a><a class="docs-heading-anchor-permalink" href="#NNlib.jl" title="Permalink"></a></h1><p><code>NNlib</code> provides a library of functions useful for neural networks, such as softmax, sigmoid, batched multiplication, convolutions and pooling. Many of these are used by <a href="https://github.com/FluxML/Flux.jl">Flux.jl</a>, which loads this package, but they may be used independently.</p><p>For use with automatic differentiation, this package defines gradients using <a href="https://github.com/JuliaDiff/ChainRules.jl">ChainRules.jl</a>. These will be seen by various packages including <a href="https://github.com/FluxML/Zygote.jl">Zygote.jl</a>.</p><p>GPU support is provided as package extensions. In order to load the extensions, use the imports</p><pre><code class="language-julia hljs">using NNlib, CUDA, cuDNN</code></pre><p>for CUDA support, or</p><pre><code class="language-julia hljs">using NNlib, AMDGPU</code></pre><p>for AMDGPU support.</p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="reference/">Reference »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="catppuccin-latte">catppuccin-latte</option><option value="catppuccin-frappe">catppuccin-frappe</option><option value="catppuccin-macchiato">catppuccin-macchiato</option><option value="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.7.0 on <span class="colophon-date" title="Saturday 16 November 2024 18:44">Saturday 16 November 2024</span>. Using Julia version 1.9.4.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Home · NNlib.jl</title><meta name="title" content="Home · NNlib.jl"/><meta property="og:title" content="Home · NNlib.jl"/><meta property="twitter:title" content="Home · NNlib.jl"/><meta name="description" content="Documentation for NNlib.jl."/><meta property="og:description" content="Documentation for NNlib.jl."/><meta property="twitter:description" content="Documentation for NNlib.jl."/><meta property="og:url" content="https://fluxml.ai/NNlib.jl/stable/"/><meta property="twitter:url" content="https://fluxml.ai/NNlib.jl/stable/"/><link rel="canonical" href="https://fluxml.ai/NNlib.jl/stable/"/><script data-outdated-warner src="assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="assets/documenter.js"></script><script src="search_index.js"></script><script src="siteinfo.js"></script><script src="../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-mocha.css" data-theme-name="catppuccin-mocha"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-macchiato.css" data-theme-name="catppuccin-macchiato"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-frappe.css" data-theme-name="catppuccin-frappe"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/catppuccin-latte.css" data-theme-name="catppuccin-latte"/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="assets/themeswap.js"></script><link href="assets/flux.css" rel="stylesheet" type="text/css"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href><img src="assets/logo.png" alt="NNlib.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href>NNlib.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li class="is-active"><a class="tocitem" href>Home</a></li><li><a class="tocitem" href="reference/">Reference</a></li><li><a class="tocitem" href="audio/">Audio</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Home</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Home</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/FluxML/NNlib.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/FluxML/NNlib.jl/blob/master/docs/src/index.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="NNlib.jl"><a class="docs-heading-anchor" href="#NNlib.jl">NNlib.jl</a><a id="NNlib.jl-1"></a><a class="docs-heading-anchor-permalink" href="#NNlib.jl" title="Permalink"></a></h1><p><code>NNlib</code> provides a library of functions useful for neural networks, such as softmax, sigmoid, batched multiplication, convolutions and pooling. Many of these are used by <a href="https://github.com/FluxML/Flux.jl">Flux.jl</a>, which loads this package, but they may be used independently.</p><p>For use with automatic differentiation, this package defines gradients using <a href="https://github.com/JuliaDiff/ChainRules.jl">ChainRules.jl</a>. These will be seen by various packages including <a href="https://github.com/FluxML/Zygote.jl">Zygote.jl</a>.</p><p>GPU support is provided as package extensions. In order to load the extensions, use the imports</p><pre><code class="language-julia hljs">using NNlib, CUDA, cuDNN</code></pre><p>for CUDA support, or</p><pre><code class="language-julia hljs">using NNlib, AMDGPU</code></pre><p>for AMDGPU support.</p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="reference/">Reference »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="catppuccin-latte">catppuccin-latte</option><option value="catppuccin-frappe">catppuccin-frappe</option><option value="catppuccin-macchiato">catppuccin-macchiato</option><option value="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.7.0 on <span class="colophon-date" title="Saturday 16 November 2024 20:23">Saturday 16 November 2024</span>. Using Julia version 1.9.4.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/previews/PR613/mel-spectrogram.png b/previews/PR613/mel-spectrogram.png
index d7b2643f..45350f7a 100644
Binary files a/previews/PR613/mel-spectrogram.png and b/previews/PR613/mel-spectrogram.png differ
diff --git a/previews/PR613/reference/index.html b/previews/PR613/reference/index.html
index 6b89f7c6..f5fee935 100644
--- a/previews/PR613/reference/index.html
+++ b/previews/PR613/reference/index.html
@@ -13,7 +13,7 @@
            ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        
 
 julia&gt; celu(-10f0)
--0.9999546f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L505-L527">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.elu" href="#NNlib.elu"><code>NNlib.elu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">elu(x, α=1) = x &gt; 0 ? x : α * (exp(x) - 1)</code></pre><p>Exponential Linear Unit activation function. See <a href="https://arxiv.org/abs/1511.07289">&quot;Fast and Accurate Deep Network Learning by Exponential Linear Units&quot;</a>. You can also specify the coefficient explicitly, e.g. <code>elu(x, 1)</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(elu, -2, 2, height=7)
+-0.9999546f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L505-L527">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.elu" href="#NNlib.elu"><code>NNlib.elu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">elu(x, α=1) = x &gt; 0 ? x : α * (exp(x) - 1)</code></pre><p>Exponential Linear Unit activation function. See <a href="https://arxiv.org/abs/1511.07289">&quot;Fast and Accurate Deep Network Learning by Exponential Linear Units&quot;</a>. You can also specify the coefficient explicitly, e.g. <code>elu(x, 1)</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(elu, -2, 2, height=7)
            ┌────────────────────────────────────────┐       
          2 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⠤⠒⠉│ elu(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⡠⠔⠊⠉⠀⠀⠀⠀│       
@@ -30,7 +30,7 @@
 -0.9999546f0
 
 julia&gt; elu(-10f0, 2)
--1.9999092f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L271-L298">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.gelu" href="#NNlib.gelu"><code>NNlib.gelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">gelu(x) = 0.5x * (1 + tanh(√(2/π) * (x + 0.044715x^3)))</code></pre><p>Activation function from <a href="https://arxiv.org/abs/1606.08415">&quot;Gaussian Error Linear Units&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(gelu, -2, 2, height=7)
+-1.9999092f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L271-L298">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.gelu" href="#NNlib.gelu"><code>NNlib.gelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">gelu(x) = 0.5x * (1 + tanh(√(2/π) * (x + 0.044715x^3)))</code></pre><p>Activation function from <a href="https://arxiv.org/abs/1606.08415">&quot;Gaussian Error Linear Units&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(gelu, -2, 2, height=7)
            ┌────────────────────────────────────────┐        
          2 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⡠⠔⠊│ gelu(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⠔⠊⠁⠀⠀⠀│        
@@ -56,7 +56,7 @@
         -0.2 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠈⠓⣄⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢠⠇⠀⠀⠀│         
              └────────────────────────────────────────┘         
              ⠀-5⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀0⠀         
-             ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L303-L337">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hardsigmoid" href="#NNlib.hardsigmoid"><code>NNlib.hardsigmoid</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hardσ(x) = max(0, min(1, (x + 3) / 6))</code></pre><p>Piecewise linear approximation of <a href="#NNlib.sigmoid"><code>sigmoid</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(hardsigmoid, -5, 5, height=7)
+             ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L303-L337">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hardsigmoid" href="#NNlib.hardsigmoid"><code>NNlib.hardsigmoid</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hardσ(x) = max(0, min(1, (x + 3) / 6))</code></pre><p>Piecewise linear approximation of <a href="#NNlib.sigmoid"><code>sigmoid</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(hardsigmoid, -5, 5, height=7)
           ┌────────────────────────────────────────┐         
         1 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⢀⡠⠖⠋⠉⠉⠉⠉⠉⠉⠉⠉│ hardσ(x)
           │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⣀⡤⠒⠁⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│         
@@ -80,14 +80,14 @@
         0 │⣀⣀⣀⣀⣀⣀⣀⠤⠤⠤⠒⠊⠉⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│     
           └────────────────────────────────────────┘     
           ⠀-5⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀5⠀     
-          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀     </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L61-L93">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.sigmoid_fast" href="#NNlib.sigmoid_fast"><code>NNlib.sigmoid_fast</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">sigmoid_fast(x)</code></pre><p>This is a faster, and very slightly less accurate, version of <code>sigmoid</code>. For <code>x::Float32</code>, perhaps 3 times faster, and maximum errors 2 eps instead of 1.</p><p>See also <a href="#NNlib.tanh_fast"><code>tanh_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; sigmoid(0.2f0)
+          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀     </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L61-L93">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.sigmoid_fast" href="#NNlib.sigmoid_fast"><code>NNlib.sigmoid_fast</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">sigmoid_fast(x)</code></pre><p>This is a faster, and very slightly less accurate, version of <code>sigmoid</code>. For <code>x::Float32</code>, perhaps 3 times faster, and maximum errors 2 eps instead of 1.</p><p>See also <a href="#NNlib.tanh_fast"><code>tanh_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; sigmoid(0.2f0)
 0.54983395f0
 
 julia&gt; sigmoid_fast(0.2f0)
 0.54983395f0
 
 julia&gt; hardσ(0.2f0)
-0.53333336f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L807-L825">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hardtanh" href="#NNlib.hardtanh"><code>NNlib.hardtanh</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hardtanh(x) = max(-1, min(1, x))</code></pre><p>Segment-wise linear approximation of <code>tanh</code>, much cheaper to compute. See <a href="https://ronan.collobert.com/pub/matos/2004_phdthesis_lip6.pdf">&quot;Large Scale Machine Learning&quot;</a>.</p><p>See also <a href="#NNlib.tanh_fast"><code>tanh_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(hardtanh, -2, 2, height=7)
+0.53333336f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L807-L825">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hardtanh" href="#NNlib.hardtanh"><code>NNlib.hardtanh</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hardtanh(x) = max(-1, min(1, x))</code></pre><p>Segment-wise linear approximation of <code>tanh</code>, much cheaper to compute. See <a href="https://ronan.collobert.com/pub/matos/2004_phdthesis_lip6.pdf">&quot;Large Scale Machine Learning&quot;</a>.</p><p>See also <a href="#NNlib.tanh_fast"><code>tanh_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(hardtanh, -2, 2, height=7)
            ┌────────────────────────────────────────┐            
          1 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⣀⠔⠋⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉│ hardtanh(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⣀⡤⠊⠁⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│            
@@ -111,14 +111,14 @@
         -1 │⣀⣀⣀⡠⠤⠤⠤⠖⠒⠊⠉⠁⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│        
            └────────────────────────────────────────┘        
            ⠀-2⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀2⠀        
-           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L124-L158">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.tanh_fast" href="#NNlib.tanh_fast"><code>NNlib.tanh_fast</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">tanh_fast(x)</code></pre><p>This is a faster but slighly less accurate version of <code>tanh</code>.</p><p>Where Julia&#39;s <code>tanh</code> function has an error under 2 eps, this may be wrong by 5 eps, a reduction by less than one decimal digit. </p><p>For <code>x::Float32</code> this is usually about 10 times faster, with a smaller speedup for <code>x::Float64</code>. For any other number types, it just calls <code>tanh</code>.</p><p>See also <a href="#NNlib.sigmoid_fast"><code>sigmoid_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; tanh(0.5f0)
+           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L124-L158">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.tanh_fast" href="#NNlib.tanh_fast"><code>NNlib.tanh_fast</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">tanh_fast(x)</code></pre><p>This is a faster but slighly less accurate version of <code>tanh</code>.</p><p>Where Julia&#39;s <code>tanh</code> function has an error under 2 eps, this may be wrong by 5 eps, a reduction by less than one decimal digit. </p><p>For <code>x::Float32</code> this is usually about 10 times faster, with a smaller speedup for <code>x::Float64</code>. For any other number types, it just calls <code>tanh</code>.</p><p>See also <a href="#NNlib.sigmoid_fast"><code>sigmoid_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; tanh(0.5f0)
 0.46211717f0
 
 julia&gt; tanh_fast(0.5f0)
 0.46211714f0
 
 julia&gt; hard_tanh(0.5f0)
-0.5f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L759-L783">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.leakyrelu" href="#NNlib.leakyrelu"><code>NNlib.leakyrelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">leakyrelu(x, a=0.01) = max(a*x, x)</code></pre><p>Leaky <a href="https://en.wikipedia.org/wiki/Rectifier_(neural_networks)">Rectified Linear Unit</a> activation function. You can also specify the coefficient explicitly, e.g. <code>leakyrelu(x, 0.01)</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(x -&gt; leakyrelu(x, 0.5), -2, 2, height=7)
+0.5f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L759-L783">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.leakyrelu" href="#NNlib.leakyrelu"><code>NNlib.leakyrelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">leakyrelu(x, a=0.01) = max(a*x, x)</code></pre><p>Leaky <a href="https://en.wikipedia.org/wiki/Rectifier_(neural_networks)">Rectified Linear Unit</a> activation function. You can also specify the coefficient explicitly, e.g. <code>leakyrelu(x, 0.01)</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(x -&gt; leakyrelu(x, 0.5), -2, 2, height=7)
            ┌────────────────────────────────────────┐       
          2 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⠤⠒⠉│ #42(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⡠⠔⠊⠉⠀⠀⠀⠀│       
@@ -135,7 +135,7 @@
 -2.0f0
 
 julia&gt; leakyrelu(-10f0, 0.02)
--0.5f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L184-L211">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.lisht" href="#NNlib.lisht"><code>NNlib.lisht</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">lisht(x) = x * tanh(x)</code></pre><p>Activation function from  <a href="https://arxiv.org/abs/1901.05894">&quot;LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent ...&quot;</a></p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(lisht, -2, 2, height=7)
+-0.5f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L184-L211">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.lisht" href="#NNlib.lisht"><code>NNlib.lisht</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">lisht(x) = x * tanh(x)</code></pre><p>Activation function from  <a href="https://arxiv.org/abs/1901.05894">&quot;LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent ...&quot;</a></p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(lisht, -2, 2, height=7)
           ┌────────────────────────────────────────┐         
         2 │⠢⣄⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⠔│ lisht(x)
           │⠀⠈⠑⢦⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⡤⠊⠁⠀│         
@@ -159,7 +159,7 @@
         0 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠉⠓⠪⠷⣦⣄⣀⣀⣇⣀⣀⣤⠶⠕⠒⠉⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│           
           └────────────────────────────────────────┘           
           ⠀-2⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀2⠀           
-          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀           </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L427-L460">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logcosh" href="#NNlib.logcosh"><code>NNlib.logcosh</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logcosh(x)</code></pre><p>Return <code>log(cosh(x))</code> which is computed in a numerically stable way.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(logcosh, -5, 5, height=7)
+          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀           </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L427-L460">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logcosh" href="#NNlib.logcosh"><code>NNlib.logcosh</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logcosh(x)</code></pre><p>Return <code>log(cosh(x))</code> which is computed in a numerically stable way.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(logcosh, -5, 5, height=7)
           ┌────────────────────────────────────────┐           
         5 │⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│ logcosh(x)
           │⠉⠢⣄⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⠔⠋│           
@@ -170,7 +170,7 @@
         0 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠈⠑⠢⢄⣀⣀⣇⣀⡠⠔⠊⠁⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│           
           └────────────────────────────────────────┘           
           ⠀-5⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀5⠀           
-          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀           </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L638-L657">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logsigmoid" href="#NNlib.logsigmoid"><code>NNlib.logsigmoid</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logσ(x)</code></pre><p>Return <code>log(σ(x))</code> which is computed in a numerically stable way.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(logsigmoid, -5, 5, height=7)
+          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀           </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L638-L657">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logsigmoid" href="#NNlib.logsigmoid"><code>NNlib.logsigmoid</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logσ(x)</code></pre><p>Return <code>log(σ(x))</code> which is computed in a numerically stable way.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(logsigmoid, -5, 5, height=7)
            ┌────────────────────────────────────────┐        
          0 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡧⠤⠔⠒⠒⠒⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉│ logσ(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤⠖⠊⠉⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│        
@@ -181,7 +181,7 @@
         -6 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│        
            └────────────────────────────────────────┘        
            ⠀-5⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀5⠀        
-           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L100-L119">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.mish" href="#NNlib.mish"><code>NNlib.mish</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">mish(x) = x * tanh(softplus(x))</code></pre><p>Activation function from <a href="https://arxiv.org/abs/1908.08681">&quot;Mish: A Self Regularized Non-Monotonic Neural Activation Function&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(mish, -5, 5, height=7)
+           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L100-L119">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.mish" href="#NNlib.mish"><code>NNlib.mish</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">mish(x) = x * tanh(softplus(x))</code></pre><p>Activation function from <a href="https://arxiv.org/abs/1908.08681">&quot;Mish: A Self Regularized Non-Monotonic Neural Activation Function&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(mish, -5, 5, height=7)
            ┌────────────────────────────────────────┐        
          5 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⡠⠖⠋│ mish(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤⠒⠁⠀⠀⠀│        
@@ -192,7 +192,7 @@
         -1 │⠀⠀⠀⠀⠀⠀⠀⠀⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉⠉⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│        
            └────────────────────────────────────────┘        
            ⠀-5⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀5⠀        
-           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L662-L681">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.relu" href="#NNlib.relu"><code>NNlib.relu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">relu(x) = max(0, x)</code></pre><p><a href="https://en.wikipedia.org/wiki/Rectifier_(neural_networks)">Rectified Linear Unit</a> activation function.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(relu, -2, 2, height=7)
+           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L662-L681">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.relu" href="#NNlib.relu"><code>NNlib.relu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">relu(x) = max(0, x)</code></pre><p><a href="https://en.wikipedia.org/wiki/Rectifier_(neural_networks)">Rectified Linear Unit</a> activation function.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(relu, -2, 2, height=7)
           ┌────────────────────────────────────────┐        
         2 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⠔⠋│ relu(x)
           │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤⠊⠁⠀⠀│        
@@ -203,7 +203,7 @@
         0 │⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣇⠔⠋⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│        
           └────────────────────────────────────────┘        
           ⠀-2⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀2⠀        
-          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L161-L181">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.relu6" href="#NNlib.relu6"><code>NNlib.relu6</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">relu6(x) = min(max(0, x), 6)</code></pre><p><a href="https://en.wikipedia.org/wiki/Rectifier_(neural_networks)">Rectified Linear Unit</a> activation function capped at 6. See <a href="https://www.cs.toronto.edu/~kriz/conv-cifar10-aug2010.pdf">&quot;Convolutional Deep Belief Networks&quot;</a> from CIFAR-10.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(relu6, -10, 10, height=7)
+          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L161-L181">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.relu6" href="#NNlib.relu6"><code>NNlib.relu6</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">relu6(x) = min(max(0, x), 6)</code></pre><p><a href="https://en.wikipedia.org/wiki/Rectifier_(neural_networks)">Rectified Linear Unit</a> activation function capped at 6. See <a href="https://www.cs.toronto.edu/~kriz/conv-cifar10-aug2010.pdf">&quot;Convolutional Deep Belief Networks&quot;</a> from CIFAR-10.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(relu6, -10, 10, height=7)
           ┌────────────────────────────────────────┐         
         6 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⠎⠉⠉⠉⠉⠉⠉⠉⠉│ relu6(x)
           │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⢀⡔⠁⠀⠀⠀⠀⠀⠀⠀⠀⠀│         
@@ -214,7 +214,7 @@
         0 │⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⡧⠋⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│         
           └────────────────────────────────────────┘         
           ⠀-10⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀10⠀         
-          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L216-L237">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.rrelu" href="#NNlib.rrelu"><code>NNlib.rrelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">rrelu(x, lo=1/8, hi=1/3) = max(a*x, x)
+          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L216-L237">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.rrelu" href="#NNlib.rrelu"><code>NNlib.rrelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">rrelu(x, lo=1/8, hi=1/3) = max(a*x, x)
 # where `a` is randomly sampled from uniform distribution `U(lo, hi)`</code></pre><p>Randomized Leaky Rectified Linear Unit activation function. See <a href="https://arxiv.org/abs/1505.00853">&quot;Empirical Evaluation of Rectified Activations&quot;</a> You can also specify the bound explicitly, e.g. <code>rrelu(x, 0.0, 1.0)</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(rrelu, -20, 10, height=7)
             ┌────────────────────────────────────────┐         
          10 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢸⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤⠖⠋│ rrelu(x)
@@ -229,7 +229,7 @@
             ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         
 
 julia&gt; extrema(rrelu.(fill(-10f0, 1000)))
-(-3.3316886f0, -1.2548422f0)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L240-L265">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.selu" href="#NNlib.selu"><code>NNlib.selu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">selu(x) = λ * (x ≥ 0 ? x : α * (exp(x) - 1))
+(-3.3316886f0, -1.2548422f0)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L240-L265">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.selu" href="#NNlib.selu"><code>NNlib.selu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">selu(x) = λ * (x ≥ 0 ? x : α * (exp(x) - 1))
 
 λ ≈ 1.05070...
 α ≈ 1.67326...</code></pre><p>Scaled exponential linear units. See <a href="https://arxiv.org/abs/1706.02515">&quot;Self-Normalizing Neural Networks&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(selu, -3, 2, height=7)
@@ -246,7 +246,7 @@
            ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀        
 
 julia&gt; selu(-10f0)
--1.7580194f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L463-L489">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.sigmoid" href="#NNlib.sigmoid"><code>NNlib.sigmoid</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">σ(x) = 1 / (1 + exp(-x))</code></pre><p>Classic <a href="https://en.wikipedia.org/wiki/Sigmoid_function">sigmoid</a> activation function. Unicode <code>σ</code> can be entered as <code>\sigma</code> then tab, in many editors. The ascii name <code>sigmoid</code> is also exported.</p><p>See also <a href="#NNlib.sigmoid_fast"><code>sigmoid_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; using UnicodePlots
+-1.7580194f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L463-L489">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.sigmoid" href="#NNlib.sigmoid"><code>NNlib.sigmoid</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">σ(x) = 1 / (1 + exp(-x))</code></pre><p>Classic <a href="https://en.wikipedia.org/wiki/Sigmoid_function">sigmoid</a> activation function. Unicode <code>σ</code> can be entered as <code>\sigma</code> then tab, in many editors. The ascii name <code>sigmoid</code> is also exported.</p><p>See also <a href="#NNlib.sigmoid_fast"><code>sigmoid_fast</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; using UnicodePlots
 
 julia&gt; lineplot(sigmoid, -5, 5, height=7)
           ┌────────────────────────────────────────┐     
@@ -262,7 +262,7 @@
           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀     
 
 julia&gt; sigmoid === σ
-true</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L24-L53">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softplus" href="#NNlib.softplus"><code>NNlib.softplus</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softplus(x) = log(exp(x) + 1)</code></pre><p>See <a href="http://proceedings.mlr.press/v15/glorot11a/glorot11a.pdf">&quot;Deep Sparse Rectifier Neural Networks&quot;</a>, JMLR 2011.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(softplus, -3, 3, height=7)
+true</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L24-L53">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softplus" href="#NNlib.softplus"><code>NNlib.softplus</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softplus(x) = log(exp(x) + 1)</code></pre><p>See <a href="http://proceedings.mlr.press/v15/glorot11a/glorot11a.pdf">&quot;Deep Sparse Rectifier Neural Networks&quot;</a>, JMLR 2011.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(softplus, -3, 3, height=7)
           ┌────────────────────────────────────────┐            
         4 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│ softplus(x)
           │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⡠│            
@@ -289,7 +289,7 @@
           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀            
 
 julia&gt; softplus(16f0)
-16.0f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L600-L635">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softshrink" href="#NNlib.softshrink"><code>NNlib.softshrink</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softshrink(x, λ=0.5) =
+16.0f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L600-L635">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softshrink" href="#NNlib.softshrink"><code>NNlib.softshrink</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softshrink(x, λ=0.5) =
     (x ≥ λ ? x - λ : (-λ ≥ x ? x + λ : 0))</code></pre><p>See <a href="https://www.gabormelli.com/RKB/Softshrink_Activation_Function">&quot;Softshrink Activation Function&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(softshrink, -2, 2, height=7)
            ┌────────────────────────────────────────┐              
          2 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀│ softshrink(x)
@@ -317,7 +317,7 @@
            ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀
 
 julia&gt; softshrink.((-10f0, 10f0))
-(-9.5f0, 9.5f0)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L709-L745">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softsign" href="#NNlib.softsign"><code>NNlib.softsign</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softsign(x) = x / (1 + |x|)</code></pre><p>See <a href="http://www.iro.umontreal.ca/~lisa/publications2/index.php/attachments/single/205">&quot;Quadratic Polynomials Learn Better Image Features&quot;</a> (2009).</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(softsign, -5, 5, height=7)
+(-9.5f0, 9.5f0)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L709-L745">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softsign" href="#NNlib.softsign"><code>NNlib.softsign</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softsign(x) = x / (1 + |x|)</code></pre><p>See <a href="http://www.iro.umontreal.ca/~lisa/publications2/index.php/attachments/single/205">&quot;Quadratic Polynomials Learn Better Image Features&quot;</a> (2009).</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(softsign, -5, 5, height=7)
            ┌────────────────────────────────────────┐            
          1 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⣀⣀⣀⣀⠤⠤⠤⠤⠤│ softsign(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⣀⡤⠖⠒⠋⠉⠉⠁⠀⠀⠀⠀⠀⠀⠀⠀⠀│            
@@ -347,7 +347,7 @@
 0.5f0
 
 julia&gt; softsign(100f0)
-0.990099f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L557-L595">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.swish" href="#NNlib.swish"><code>NNlib.swish</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">swish(x) = x * σ(x)</code></pre><p>Self-gated activation function. See <a href="https://arxiv.org/abs/1710.05941">&quot;Swish: a Self-Gated Activation Function&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(swish, -2, 2, height=7)
+0.990099f0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L557-L595">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.swish" href="#NNlib.swish"><code>NNlib.swish</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">swish(x) = x * σ(x)</code></pre><p>Self-gated activation function. See <a href="https://arxiv.org/abs/1710.05941">&quot;Swish: a Self-Gated Activation Function&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(swish, -2, 2, height=7)
            ┌────────────────────────────────────────┐         
          2 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤│ swish(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤⠖⠋⠁⠀│         
@@ -358,7 +358,7 @@
         -1 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│         
            └────────────────────────────────────────┘         
            ⠀-2⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀2⠀         
-           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L360-L380">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hardswish" href="#NNlib.hardswish"><code>NNlib.hardswish</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hardswish(x) = x * hardσ(x)</code></pre><p>Hard-Swish activation function. See <a href="https://arxiv.org/abs/1905.02244">&quot;Searching for MobileNetV3&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(hardswish, -2, 5, height = 7)
+           ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L360-L380">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.hardswish" href="#NNlib.hardswish"><code>NNlib.hardswish</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">hardswish(x) = x * hardσ(x)</code></pre><p>Hard-Swish activation function. See <a href="https://arxiv.org/abs/1905.02244">&quot;Searching for MobileNetV3&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(hardswish, -2, 5, height = 7)
            ┌────────────────────────────────────────┐             
          5 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤⠔⠒⠉│ hardswish(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡠⠔⠒⠉⠁⠀⠀⠀⠀│             
@@ -388,7 +388,7 @@
 
 julia&gt; hardswish.(-5:5)&#39;
 1×11 adjoint(::Vector{Float64}) with eltype Float64:
- -0.0  -0.0  -0.0  -0.333333  -0.333333  0.0  0.666667  1.66667  3.0  4.0  5.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L383-L422">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.tanhshrink" href="#NNlib.tanhshrink"><code>NNlib.tanhshrink</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">tanhshrink(x) = x - tanh(x)</code></pre><p>See <a href="https://www.gabormelli.com/RKB/Tanhshrink_Activation_Function">&quot;Tanhshrink Activation Function&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(tanhshrink, -3, 3, height=7)
+ -0.0  -0.0  -0.0  -0.333333  -0.333333  0.0  0.666667  1.66667  3.0  4.0  5.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L383-L422">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.tanhshrink" href="#NNlib.tanhshrink"><code>NNlib.tanhshrink</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">tanhshrink(x) = x - tanh(x)</code></pre><p>See <a href="https://www.gabormelli.com/RKB/Tanhshrink_Activation_Function">&quot;Tanhshrink Activation Function&quot;</a>.</p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(tanhshrink, -3, 3, height=7)
            ┌────────────────────────────────────────┐              
          3 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│ tanhshrink(x)
            │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡠⠤⠖⠊│              
@@ -402,7 +402,7 @@
            ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀              
 
 julia&gt; tanhshrink.((-10f0, 10f0))
-(-9.0f0, 9.0f0)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L684-L706">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.trelu" href="#NNlib.trelu"><code>NNlib.trelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">trelu(x, theta=1) = x &gt; theta ? x : 0</code></pre><p>Threshold gated rectified linear activation function. See <a href="https://arxiv.org/abs/1402.3337">&quot;Zero-bias autoencoders and the benefits of co-adapting features&quot;</a></p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(trelu, -2, 4, height=7)
+(-9.0f0, 9.0f0)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L684-L706">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.trelu" href="#NNlib.trelu"><code>NNlib.trelu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">trelu(x, theta=1) = x &gt; theta ? x : 0</code></pre><p>Threshold gated rectified linear activation function. See <a href="https://arxiv.org/abs/1402.3337">&quot;Zero-bias autoencoders and the benefits of co-adapting features&quot;</a></p><pre><code class="language-julia-repl hljs">julia&gt; lineplot(trelu, -2, 4, height=7)
           ┌────────────────────────────────────────┐         
         4 │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣀⡤⠖⠋│ trelu(x)
           │⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⡠⠖⠋⠁⠀⠀⠀│         
@@ -413,8 +413,8 @@
         0 │⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣇⣀⣀⣀⣀⣀⣀⡇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀│         
           └────────────────────────────────────────┘         
           ⠀-2⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀4⠀         
-          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/activations.jl#L532-L552">source</a></section></article><h2 id="Attention"><a class="docs-heading-anchor" href="#Attention">Attention</a><a id="Attention-1"></a><a class="docs-heading-anchor-permalink" href="#Attention" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.dot_product_attention" href="#NNlib.dot_product_attention"><code>NNlib.dot_product_attention</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">dot_product_attention(query, key, value, [bias]; [fdrop, mask, nheads])</code></pre><p>Multihead dot product attention used in transformer architectures.</p><p>The input arrays must have the first two dimensions given by the number of features and the sequence length, then an arbitrary number of batch dimensions or none.</p><p>Returns the attention output array of size <code>(v_dim, q_len, batch_size...)</code> and the attention scores of size <code>(kv_len, q_len, nheads, batch_size...)</code>.</p><p>See also <a href="#NNlib.dot_product_attention_scores"><code>dot_product_attention_scores</code></a> if you only need the attention scores.</p><p><strong>Arguments</strong></p><ul><li><code>query</code>: Query array of size <code>(qk_dim, q_len, batch_size...)</code>.</li><li><code>key</code>: Key array of size <code>(qk_dim, kv_len, batch_size...)</code>.</li><li><code>value</code>: Value array of size <code>(v_dim, kv_len, batch_size...)</code>.</li><li><code>bias</code>: Either <code>nothing</code> or an array broadcastable to size <code>(kv_len, q_len, nheads, batch_size)</code>.         It will be added to the attention scores before applying the softmax. Default <code>nothing</code>.</li><li><code>fdrop</code>: A dropout function or layer to be applied on the attention scores right after the softmax.          Default <code>identity</code> (no dropout).</li><li><code>mask</code>: Either <code>nothing</code> or a boolean array broadcastable to size <code>(kv_len, q_len, nheads, batch_size)</code>.         The mask is applied to the attention scores just before the softmax.         See <a href="#NNlib.make_causal_mask"><code>make_causal_mask</code></a> fore creating causal masks. Default <code>nothing</code>.</li><li><code>nheads</code>: Number of heads to split the input arrays into. Default <code>1</code>.</li></ul><p><strong>Examples</strong></p><pre><code class="language-julia hljs">q, k, v = rand(10, 20, 2), rand(10, 30, 2), rand(20, 30, 2)
-y, α = dot_product_attention(q, k, v)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/attention.jl#L5-L39">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.dot_product_attention_scores" href="#NNlib.dot_product_attention_scores"><code>NNlib.dot_product_attention_scores</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">dot_product_attention_scores(query, key, [bias]; [fdrop, mask])</code></pre><p>Return the attention scores for the <a href="#NNlib.dot_product_attention"><code>dot_product_attention</code></a>. Input arrays must have dimensions <code>(num_features ÷ nheads, nheads, sequence_length, batch_size)</code>.</p><p>See <a href="#NNlib.dot_product_attention"><code>dot_product_attention</code></a> for more details.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/attention.jl#L103-L111">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.make_causal_mask" href="#NNlib.make_causal_mask"><code>NNlib.make_causal_mask</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">make_causal_mask(x, dims=2)</code></pre><p>Return a boolean square matrix <code>m</code> of the same type as <code>x</code> and of side <code>size(x, dims)</code>. Its elements are set such that <code>m[i, j] == i ≤ j</code>.</p><p>Can be used to mask the attention scores in <a href="#NNlib.dot_product_attention"><code>dot_product_attention</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/attention.jl#L141-L148">source</a></section></article><h2 id="Softmax"><a class="docs-heading-anchor" href="#Softmax">Softmax</a><a id="Softmax-1"></a><a class="docs-heading-anchor-permalink" href="#Softmax" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>logitcrossentropy</code> uses <code>NNlib.softmax</code> internally.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softmax" href="#NNlib.softmax"><code>NNlib.softmax</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softmax(x; dims = 1)</code></pre><p><a href="https://en.wikipedia.org/wiki/Softmax_function">Softmax</a> turns input array <code>x</code> into probability distributions that sum to 1 along the dimensions specified by <code>dims</code>. It is semantically equivalent to the following:</p><pre><code class="nohighlight hljs">softmax(x; dims = 1) = exp.(x) ./ sum(exp.(x), dims = dims)</code></pre><p>with additional manipulations enhancing numerical stability.</p><p>For a matrix input <code>x</code> it will by default (<code>dims = 1</code>) treat it as a batch of vectors, with each column independent. Keyword <code>dims = 2</code> will instead treat rows independently, and so on.</p><p>See also <a href="#NNlib.logsoftmax"><code>logsoftmax</code></a>.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; softmax([1, 2, 3])
+          ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀x⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀         </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/activations.jl#L532-L552">source</a></section></article><h2 id="Attention"><a class="docs-heading-anchor" href="#Attention">Attention</a><a id="Attention-1"></a><a class="docs-heading-anchor-permalink" href="#Attention" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.dot_product_attention" href="#NNlib.dot_product_attention"><code>NNlib.dot_product_attention</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">dot_product_attention(query, key, value, [bias]; [fdrop, mask, nheads])</code></pre><p>Multihead dot product attention used in transformer architectures.</p><p>The input arrays must have the first two dimensions given by the number of features and the sequence length, then an arbitrary number of batch dimensions or none.</p><p>Returns the attention output array of size <code>(v_dim, q_len, batch_size...)</code> and the attention scores of size <code>(kv_len, q_len, nheads, batch_size...)</code>.</p><p>See also <a href="#NNlib.dot_product_attention_scores"><code>dot_product_attention_scores</code></a> if you only need the attention scores.</p><p><strong>Arguments</strong></p><ul><li><code>query</code>: Query array of size <code>(qk_dim, q_len, batch_size...)</code>.</li><li><code>key</code>: Key array of size <code>(qk_dim, kv_len, batch_size...)</code>.</li><li><code>value</code>: Value array of size <code>(v_dim, kv_len, batch_size...)</code>.</li><li><code>bias</code>: Either <code>nothing</code> or an array broadcastable to size <code>(kv_len, q_len, nheads, batch_size)</code>.         It will be added to the attention scores before applying the softmax. Default <code>nothing</code>.</li><li><code>fdrop</code>: A dropout function or layer to be applied on the attention scores right after the softmax.          Default <code>identity</code> (no dropout).</li><li><code>mask</code>: Either <code>nothing</code> or a boolean array broadcastable to size <code>(kv_len, q_len, nheads, batch_size)</code>.         The mask is applied to the attention scores just before the softmax.         See <a href="#NNlib.make_causal_mask"><code>make_causal_mask</code></a> fore creating causal masks. Default <code>nothing</code>.</li><li><code>nheads</code>: Number of heads to split the input arrays into. Default <code>1</code>.</li></ul><p><strong>Examples</strong></p><pre><code class="language-julia hljs">q, k, v = rand(10, 20, 2), rand(10, 30, 2), rand(20, 30, 2)
+y, α = dot_product_attention(q, k, v)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/attention.jl#L5-L39">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.dot_product_attention_scores" href="#NNlib.dot_product_attention_scores"><code>NNlib.dot_product_attention_scores</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">dot_product_attention_scores(query, key, [bias]; [fdrop, mask])</code></pre><p>Return the attention scores for the <a href="#NNlib.dot_product_attention"><code>dot_product_attention</code></a>. Input arrays must have dimensions <code>(num_features ÷ nheads, nheads, sequence_length, batch_size)</code>.</p><p>See <a href="#NNlib.dot_product_attention"><code>dot_product_attention</code></a> for more details.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/attention.jl#L103-L111">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.make_causal_mask" href="#NNlib.make_causal_mask"><code>NNlib.make_causal_mask</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">make_causal_mask(x, dims=2)</code></pre><p>Return a boolean square matrix <code>m</code> of the same type as <code>x</code> and of side <code>size(x, dims)</code>. Its elements are set such that <code>m[i, j] == i ≤ j</code>.</p><p>Can be used to mask the attention scores in <a href="#NNlib.dot_product_attention"><code>dot_product_attention</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/attention.jl#L141-L148">source</a></section></article><h2 id="Softmax"><a class="docs-heading-anchor" href="#Softmax">Softmax</a><a id="Softmax-1"></a><a class="docs-heading-anchor-permalink" href="#Softmax" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>logitcrossentropy</code> uses <code>NNlib.softmax</code> internally.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.softmax" href="#NNlib.softmax"><code>NNlib.softmax</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">softmax(x; dims = 1)</code></pre><p><a href="https://en.wikipedia.org/wiki/Softmax_function">Softmax</a> turns input array <code>x</code> into probability distributions that sum to 1 along the dimensions specified by <code>dims</code>. It is semantically equivalent to the following:</p><pre><code class="nohighlight hljs">softmax(x; dims = 1) = exp.(x) ./ sum(exp.(x), dims = dims)</code></pre><p>with additional manipulations enhancing numerical stability.</p><p>For a matrix input <code>x</code> it will by default (<code>dims = 1</code>) treat it as a batch of vectors, with each column independent. Keyword <code>dims = 2</code> will instead treat rows independently, and so on.</p><p>See also <a href="#NNlib.logsoftmax"><code>logsoftmax</code></a>.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; softmax([1, 2, 3])
 3-element Vector{Float64}:
  0.09003057317038046
  0.24472847105479764
@@ -438,8 +438,8 @@
 (7, 13)
 
 julia&gt; Dense(4 =&gt; 7, softmax)(x)
-ERROR: `softmax(x)` called with a number, but it expects an array. </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/softmax.jl#L2-L55">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logsoftmax" href="#NNlib.logsoftmax"><code>NNlib.logsoftmax</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logsoftmax(x; dims = 1)</code></pre><p>Computes the log of softmax in a more numerically stable way than directly taking <code>log.(softmax(xs))</code>. Commonly used in computing cross entropy loss.</p><p>It is semantically equivalent to the following:</p><pre><code class="nohighlight hljs">logsoftmax(x; dims = 1) = x .- log.(sum(exp.(x), dims = dims))</code></pre><p>See also <a href="#NNlib.softmax"><code>softmax</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/softmax.jl#L94-L106">source</a></section></article><h2 id="Pooling"><a class="docs-heading-anchor" href="#Pooling">Pooling</a><a id="Pooling-1"></a><a class="docs-heading-anchor-permalink" href="#Pooling" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>AdaptiveMaxPool</code>, <code>AdaptiveMeanPool</code>, <code>GlobalMaxPool</code>, <code>GlobalMeanPool</code>, <code>MaxPool</code>, <code>MeanPool</code> and <code>lpnormpool</code> use <code>NNlib.PoolDims</code>, <code>NNlib.maxpool</code>, <code>NNlib.meanpool</code> and <code>NNlib.lpnormpool</code> as their backend.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.PoolDims" href="#NNlib.PoolDims"><code>NNlib.PoolDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">PoolDims(x_size::NTuple{M}, k::Union{NTuple{L, Int}, Int};
-         stride=k, padding=0, dilation=1)  where {M, L}</code></pre><p>Dimensions for a &quot;pooling&quot; operation that can have an arbitrary input size, kernel size, stride, dilation, and channel count.  Used to dispatch onto efficient implementations at compile-time.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/dim_helpers/PoolDims.jl#L1-L8">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.maxpool" href="#NNlib.maxpool"><code>NNlib.maxpool</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">maxpool(x, k::NTuple{N, Integer}; pad=0, stride=k)</code></pre><p>Perform max pool operation with window size <code>k</code> on input tensor <code>x</code>.</p><p>Arguments:</p><ul><li><code>x</code> and <code>k</code>: Expects <code>ndim(x) ∈ 3:5</code>, and always <code>length(k) == ndim(x) - 2</code></li><li><code>pad</code>: See <a href="#NNlib.pad_zeros"><code>pad_zeros</code></a> for details.</li><li><code>stride</code>: Either a tuple with the same length as <code>k</code>, or one integer for all directions. Default is <code>k</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/pooling.jl#L139-L149">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.meanpool" href="#NNlib.meanpool"><code>NNlib.meanpool</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">meanpool(x, k::NTuple{N, Integer}; pad=0, stride=k)</code></pre><p>Perform mean pool operation with window size <code>k</code> on input tensor <code>x</code>.</p><p>Arguments:</p><ul><li><code>x</code> and <code>k</code>: Expects <code>ndim(x) ∈ 3:5</code>, and always <code>length(k) == ndim(x) - 2</code></li><li><code>pad</code>: See <a href="#NNlib.pad_zeros"><code>pad_zeros</code></a> for details.</li><li><code>stride</code>: Either a tuple with the same length as <code>k</code>, or one integer for all directions. Default is <code>k</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/pooling.jl#L158-L168">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.lpnormpool" href="#NNlib.lpnormpool"><code>NNlib.lpnormpool</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">lpnormpool(x, p::Real, k::NTuple{N, Integer}; pad=0, stride=k)</code></pre><p>Perform Lp pool operation with value of the Lp norm <code>p</code> and window size <code>k</code> on input tensor <code>x</code>, also known as LPPool in pytorch. This pooling operator from <a href="https://arxiv.org/abs/1311.1780">Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks</a>.</p><p>Arguments:</p><ul><li><code>x</code> and <code>k</code>: Expects <code>ndim(x) ∈ 3:5</code>, and always <code>length(k) == ndim(x) - 2</code></li><li><code>p</code> is restricted to <code>0 &lt; p &lt; Inf</code>.</li><li><code>pad</code>: See <a href="#NNlib.pad_zeros"><code>pad_zeros</code></a> for details.</li><li><code>stride</code>: Either a tuple with the same length as <code>k</code>, or one integer for all directions. Default is <code>k</code>.</li></ul><p>For all elements <code>x</code> in a size <code>k</code> window, lpnormpool computes <code>(∑ᵢ xᵢ^p)^(1 / p)</code> as an element of the output.</p><p>Thus <code>lpnormpool(x, 1, k) ./ prod(k) ≈ meanpool(x, k)</code> and <code>lpnormpool(x, 2, k).^2 ./ prod(k) ≈ meanpool(x.^2, k)</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/pooling.jl#L177-L193">source</a></section></article><h2 id="Padding"><a class="docs-heading-anchor" href="#Padding">Padding</a><a id="Padding-1"></a><a class="docs-heading-anchor-permalink" href="#Padding" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_reflect" href="#NNlib.pad_reflect"><code>NNlib.pad_reflect</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_reflect(x, pad::Tuple; [dims])
+ERROR: `softmax(x)` called with a number, but it expects an array. </code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/softmax.jl#L2-L55">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logsoftmax" href="#NNlib.logsoftmax"><code>NNlib.logsoftmax</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logsoftmax(x; dims = 1)</code></pre><p>Computes the log of softmax in a more numerically stable way than directly taking <code>log.(softmax(xs))</code>. Commonly used in computing cross entropy loss.</p><p>It is semantically equivalent to the following:</p><pre><code class="nohighlight hljs">logsoftmax(x; dims = 1) = x .- log.(sum(exp.(x), dims = dims))</code></pre><p>See also <a href="#NNlib.softmax"><code>softmax</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/softmax.jl#L94-L106">source</a></section></article><h2 id="Pooling"><a class="docs-heading-anchor" href="#Pooling">Pooling</a><a id="Pooling-1"></a><a class="docs-heading-anchor-permalink" href="#Pooling" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>AdaptiveMaxPool</code>, <code>AdaptiveMeanPool</code>, <code>GlobalMaxPool</code>, <code>GlobalMeanPool</code>, <code>MaxPool</code>, <code>MeanPool</code> and <code>lpnormpool</code> use <code>NNlib.PoolDims</code>, <code>NNlib.maxpool</code>, <code>NNlib.meanpool</code> and <code>NNlib.lpnormpool</code> as their backend.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.PoolDims" href="#NNlib.PoolDims"><code>NNlib.PoolDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">PoolDims(x_size::NTuple{M}, k::Union{NTuple{L, Int}, Int};
+         stride=k, padding=0, dilation=1)  where {M, L}</code></pre><p>Dimensions for a &quot;pooling&quot; operation that can have an arbitrary input size, kernel size, stride, dilation, and channel count.  Used to dispatch onto efficient implementations at compile-time.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/dim_helpers/PoolDims.jl#L1-L8">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.maxpool" href="#NNlib.maxpool"><code>NNlib.maxpool</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">maxpool(x, k::NTuple{N, Integer}; pad=0, stride=k)</code></pre><p>Perform max pool operation with window size <code>k</code> on input tensor <code>x</code>.</p><p>Arguments:</p><ul><li><code>x</code> and <code>k</code>: Expects <code>ndim(x) ∈ 3:5</code>, and always <code>length(k) == ndim(x) - 2</code></li><li><code>pad</code>: See <a href="#NNlib.pad_zeros"><code>pad_zeros</code></a> for details.</li><li><code>stride</code>: Either a tuple with the same length as <code>k</code>, or one integer for all directions. Default is <code>k</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/pooling.jl#L139-L149">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.meanpool" href="#NNlib.meanpool"><code>NNlib.meanpool</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">meanpool(x, k::NTuple{N, Integer}; pad=0, stride=k)</code></pre><p>Perform mean pool operation with window size <code>k</code> on input tensor <code>x</code>.</p><p>Arguments:</p><ul><li><code>x</code> and <code>k</code>: Expects <code>ndim(x) ∈ 3:5</code>, and always <code>length(k) == ndim(x) - 2</code></li><li><code>pad</code>: See <a href="#NNlib.pad_zeros"><code>pad_zeros</code></a> for details.</li><li><code>stride</code>: Either a tuple with the same length as <code>k</code>, or one integer for all directions. Default is <code>k</code>.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/pooling.jl#L158-L168">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.lpnormpool" href="#NNlib.lpnormpool"><code>NNlib.lpnormpool</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">lpnormpool(x, p::Real, k::NTuple{N, Integer}; pad=0, stride=k)</code></pre><p>Perform Lp pool operation with value of the Lp norm <code>p</code> and window size <code>k</code> on input tensor <code>x</code>, also known as LPPool in pytorch. This pooling operator from <a href="https://arxiv.org/abs/1311.1780">Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks</a>.</p><p>Arguments:</p><ul><li><code>x</code> and <code>k</code>: Expects <code>ndim(x) ∈ 3:5</code>, and always <code>length(k) == ndim(x) - 2</code></li><li><code>p</code> is restricted to <code>0 &lt; p &lt; Inf</code>.</li><li><code>pad</code>: See <a href="#NNlib.pad_zeros"><code>pad_zeros</code></a> for details.</li><li><code>stride</code>: Either a tuple with the same length as <code>k</code>, or one integer for all directions. Default is <code>k</code>.</li></ul><p>For all elements <code>x</code> in a size <code>k</code> window, lpnormpool computes <code>(∑ᵢ xᵢ^p)^(1 / p)</code> as an element of the output.</p><p>Thus <code>lpnormpool(x, 1, k) ./ prod(k) ≈ meanpool(x, k)</code> and <code>lpnormpool(x, 2, k).^2 ./ prod(k) ≈ meanpool(x.^2, k)</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/pooling.jl#L177-L193">source</a></section></article><h2 id="Padding"><a class="docs-heading-anchor" href="#Padding">Padding</a><a id="Padding-1"></a><a class="docs-heading-anchor-permalink" href="#Padding" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_reflect" href="#NNlib.pad_reflect"><code>NNlib.pad_reflect</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_reflect(x, pad::Tuple; [dims])
 pad_reflect(x, pad::Int; [dims])</code></pre><p>Pad the array <code>x</code> reflecting its values across the border.</p><p><code>pad</code> can a tuple of integers <code>(l1, r1, ..., ln, rn)</code> of some length <code>2n</code> that specifies the left and right padding size for each of the dimensions in <code>dims</code>. If <code>dims</code> is not given,  it defaults to the first <code>n</code> dimensions.</p><p>If <code>pad</code> is an integer, it is applied on both sides on every dimension in <code>dims</code>. In this case, <code>dims</code>  defaults to the first <code>ndims(x)-2</code> dimensions  (i.e. excludes the channel and batch dimension). </p><p>See also <a href="#NNlib.pad_repeat"><code>pad_repeat</code></a>, <a href="#NNlib.pad_symmetric"><code>pad_symmetric</code></a>, <a href="#NNlib.pad_circular"><code>pad_circular</code></a>, and <a href="#NNlib.pad_constant"><code>pad_constant</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; r = reshape(1:9, 3, 3)
 3×3 reshape(::UnitRange{Int64}, 3, 3) with eltype Int64:
  1  4  7
@@ -453,7 +453,7 @@
  5  2  5  8  5  2
  6  3  6  9  6  3
  5  2  5  8  5  2
- 4  1  4  7  4  1</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/padding.jl#L223-L257">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_symmetric" href="#NNlib.pad_symmetric"><code>NNlib.pad_symmetric</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_symmetric(x, pad::Tuple; [dims])
+ 4  1  4  7  4  1</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/padding.jl#L223-L257">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_symmetric" href="#NNlib.pad_symmetric"><code>NNlib.pad_symmetric</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_symmetric(x, pad::Tuple; [dims])
 pad_symmetric(x, pad::Int; [dims])</code></pre><p>Pad the array <code>x</code> reflecting its values symmetrically across the border, i.e. the border values of <code>x</code> are present in the padding values, in contrast to <a href="#NNlib.pad_reflect"><code>pad_reflect</code></a>.</p><p><code>pad</code> can a tuple of integers <code>(l1, r1, ..., ln, rn)</code> of some length <code>2n</code> that specifies the left and right padding size for each of the dimensions in <code>dims</code>. If <code>dims</code> is not given,  it defaults to the first <code>n</code> dimensions.</p><p>If <code>pad</code> is an integer, it is applied on both sides on every dimension in <code>dims</code>. In this case, <code>dims</code>  defaults to the first <code>ndims(x)-2</code> dimensions  (i.e. excludes the channel and batch dimension). </p><p>See also <a href="#NNlib.pad_repeat"><code>pad_repeat</code></a>, <a href="#NNlib.pad_reflect"><code>pad_reflect</code></a>, <a href="#NNlib.pad_circular"><code>pad_circular</code></a>, and <a href="#NNlib.pad_constant"><code>pad_constant</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; r = reshape(1:9, 3, 3)
 3×3 reshape(::UnitRange{Int64}, 3, 3) with eltype Int64:
  1  4  7
@@ -467,7 +467,7 @@
  2  2  5  8  8  5
  3  3  6  9  9  6
  3  3  6  9  9  6
- 2  2  5  8  8  5</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/padding.jl#L282-L316">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_circular" href="#NNlib.pad_circular"><code>NNlib.pad_circular</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_circular(x, pad::Tuple; [dims])
+ 2  2  5  8  8  5</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/padding.jl#L282-L316">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_circular" href="#NNlib.pad_circular"><code>NNlib.pad_circular</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_circular(x, pad::Tuple; [dims])
 pad_circular(x, pad::Int; [dims])</code></pre><p>Pad the array <code>x</code> &quot;circularly&quot; across the border by wrapping around values from the opposite side of <code>x</code>. </p><p><code>pad</code> can a tuple of integers <code>(l1, r1, ..., ln, rn)</code> of some length <code>2n</code> that specifies the left and right padding size for each of the dimensions in <code>dims</code>. If <code>dims</code> is not given,  it defaults to the first <code>n</code> dimensions.</p><p>If <code>pad</code> is an integer, it is applied on both sides on every dimension in <code>dims</code>. In this case, <code>dims</code>  defaults to the first <code>ndims(x)-2</code> dimensions  (i.e. excludes the channel and batch dimension). </p><p>The pad length on either side in any dimension must not exceed the size of <code>x</code> in that dimension, i.e. <code>pad_circular</code> is not able to create abitrary sized tilings of <code>x</code>.</p><p>See also <a href="#NNlib.pad_repeat"><code>pad_repeat</code></a>, <a href="#NNlib.pad_reflect"><code>pad_reflect</code></a>, <a href="#NNlib.pad_symmetric"><code>pad_symmetric</code></a>, and <a href="#NNlib.pad_constant"><code>pad_constant</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; r = reshape(1:9, 3, 3)
 3×3 reshape(::UnitRange{Int64}, 3, 3) with eltype Int64:
  1  4  7
@@ -481,7 +481,7 @@
  8  2  5  8  2  5
  9  3  6  9  3  6
  7  1  4  7  1  4
- 8  2  5  8  2  5</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/padding.jl#L342-L379">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_repeat" href="#NNlib.pad_repeat"><code>NNlib.pad_repeat</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_repeat(x, pad::Tuple; [dims])
+ 8  2  5  8  2  5</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/padding.jl#L342-L379">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_repeat" href="#NNlib.pad_repeat"><code>NNlib.pad_repeat</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_repeat(x, pad::Tuple; [dims])
 pad_repeat(x, pad::Int; [dims])</code></pre><p>Pad the array <code>x</code> repeating the values on the border.</p><p><code>pad</code> can a tuple of integers <code>(l1, r1, ..., ln, rn)</code> of some length <code>2n</code> that specifies the left and right padding size for each of the dimensions in <code>dims</code>. If <code>dims</code> is not given,  it defaults to the first <code>n</code> dimensions.</p><p>If <code>pad</code> is an integer, it is applied on both sides on every dimension in <code>dims</code>. In this case, <code>dims</code>  defaults to the first <code>ndims(x)-2</code> dimensions  (i.e. excludes the channel and batch dimension). </p><p>See also <a href="#NNlib.pad_reflect"><code>pad_reflect</code></a>, <a href="#NNlib.pad_symmetric"><code>pad_symmetric</code></a>, <a href="#NNlib.pad_circular"><code>pad_circular</code></a>, and <a href="#NNlib.pad_constant"><code>pad_constant</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; r = reshape(1:9, 3, 3)
 3×3 reshape(::UnitRange{Int64}, 3, 3) with eltype Int64:
  1  4  7
@@ -495,7 +495,7 @@
  2  2  2  2  5  8  8  8  8  8
  3  3  3  3  6  9  9  9  9  9
  3  3  3  3  6  9  9  9  9  9
- 3  3  3  3  6  9  9  9  9  9</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/padding.jl#L162-L196">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_constant" href="#NNlib.pad_constant"><code>NNlib.pad_constant</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_constant(x, pad::Tuple, val = 0; [dims = :])
+ 3  3  3  3  6  9  9  9  9  9</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/padding.jl#L162-L196">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_constant" href="#NNlib.pad_constant"><code>NNlib.pad_constant</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_constant(x, pad::Tuple, val = 0; [dims = :])
 pad_constant(x, pad::Int, val = 0; [dims = :])</code></pre><p>Pad the array <code>x</code> with the constant value <code>val</code>.</p><p><code>pad</code> can be a tuple of integers. If it is of some length <code>2 * length(dims)</code> that specifies the left and right padding size for each of the dimensions in <code>dims</code> as <code>(l1, r1, ..., ln, rn)</code>.  If supplied with a tuple of length <code>length(dims)</code> instead, it applies symmetric padding. If <code>dims</code> is not given, it defaults to all dimensions.</p><p>For integer <code>pad</code> input, it is applied on both sides on every dimension in <code>dims</code>.</p><p>See also <a href="#NNlib.pad_zeros"><code>pad_zeros</code></a>, <a href="#NNlib.pad_repeat"><code>pad_repeat</code></a>, <a href="#NNlib.pad_reflect"><code>pad_reflect</code></a>, <a href="#NNlib.pad_symmetric"><code>pad_symmetric</code></a>, and <a href="#NNlib.pad_circular"><code>pad_circular</code></a>.</p><pre><code class="language-julia-repl hljs">julia&gt; r = reshape(1:4, 2, 2)
 2×2 reshape(::UnitRange{Int64}, 2, 2) with eltype Int64:
  1  3
@@ -562,8 +562,8 @@
 julia&gt; pad_constant(r, (2,1, 3), dims = (1,2)) # padding must always be either the same length as dims, or double it
 ERROR: ArgumentError: Could not parse padding (2, 1, 3) and dims (1, 2)
 Stacktrace:
-[...]</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/padding.jl#L11-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_zeros" href="#NNlib.pad_zeros"><code>NNlib.pad_zeros</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_zeros(x, pad::Tuple; [dims])
-pad_zeros(x, pad::Int; [dims])</code></pre><p>Pad the array <code>x</code> with zeros. Equivalent to <a href="#NNlib.pad_constant"><code>pad_constant</code></a> with the constant equal to 0. </p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/padding.jl#L1-L7">source</a></section></article><h2 id="Convolution"><a class="docs-heading-anchor" href="#Convolution">Convolution</a><a id="Convolution-1"></a><a class="docs-heading-anchor-permalink" href="#Convolution" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Conv</code> and <code>CrossCor</code> layers use <code>NNlib.DenseConvDims</code> and <code>NNlib.conv</code> internally.</p><p><code>NNlib.conv</code> supports complex datatypes on CPU and CUDA devices.</p><div class="admonition is-info"><header class="admonition-header">AMDGPU MIOpen supports only cross-correlation (`flipkernel=true`).</header><div class="admonition-body"><p>Therefore for every regular convolution (<code>flipkernel=false</code>) kernel is flipped before calculation. For better performance, use cross-correlation (<code>flipkernel=true</code>) and manually flip the kernel before <code>NNlib.conv</code> call. <code>Flux</code> handles this automatically, this is only required for direct calls.</p></div></div><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.conv" href="#NNlib.conv"><code>NNlib.conv</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">conv(x, w; stride = 1, pad = 0, dilation = 1, flipped = false, groups = 1)</code></pre><p>Apply convolution filter <code>w</code> to input <code>x</code>. <code>x</code> and <code>w</code> are 3d/4d/5d tensors in 1d/2d/3d convolutions respectively. <code>x</code> and <code>w</code> may have real or complex element types.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/conv.jl#L44-L49">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.ConvDims" href="#NNlib.ConvDims"><code>NNlib.ConvDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">ConvDims</code></pre><p>Type system-level information about convolution dimensions. Critical for things like <code>im2col!()</code> to generate efficient code, and helpful to reduce the number of kwargs getting passed around.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/dim_helpers/ConvDims.jl#L1-L7">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.depthwiseconv" href="#NNlib.depthwiseconv"><code>NNlib.depthwiseconv</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">depthwiseconv(x, w; stride=1, pad=0, dilation=1, flipped=false)</code></pre><p>Depthwise convolution operation with filter <code>w</code> on input <code>x</code>. <code>x</code> and <code>w</code> are 3d/4d/5d tensors in 1d/2d/3d convolutions respectively.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/conv.jl#L59-L64">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.DepthwiseConvDims" href="#NNlib.DepthwiseConvDims"><code>NNlib.DepthwiseConvDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">DepthwiseConvDims</code></pre><p>Concrete subclass of <code>ConvDims</code> for a depthwise convolution.  Differs primarily due to characterization by <code>C_in</code>, <code>C_mult</code>, rather than <code>C_in</code>, <code>C_out</code>.  Useful to be separate from DenseConvDims primarily for channel calculation differences.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/dim_helpers/DepthwiseConvDims.jl#L1-L7">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.DenseConvDims" href="#NNlib.DenseConvDims"><code>NNlib.DenseConvDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">DenseConvDims</code></pre><p>Concrete subclass of <code>ConvDims</code> for a normal, dense, conv2d/conv3d.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/dim_helpers/DenseConvDims.jl#L1-L5">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.unfold" href="#NNlib.unfold"><code>NNlib.unfold</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">unfold(x, kernel_size; stride = 1, pad = 0, dilation = 0, flipped = true)</code></pre><p>Places sliding windows of x into a container tensor of size <code>(num_windows, window_size, batchsize)</code>. The window size is determined by the <code>prod(spatial dims of kernel)*input_channels</code>. The number of sliding windows will match those of convolution (<code>conv</code>) with the same kernel_size and arguments. Note that by default <code>conv</code> flips the spatial dimensions of its kernel (default <code>flipped=false</code>), whereas <code>unfold</code> does not (default <code>flipped=true</code>). Uses <code>NNlib.im2col!</code> as backend.</p><p>See also <a href="#NNlib.fold"><code>fold</code></a>, the adjoint/transpose operator and a potential inverse of <code>unfold</code>.</p><p><strong>Example</strong></p><p>The below example demonstrates that <code>unfold</code> uses the same sliding windows as <code>conv</code>. In general <a href="#NNlib.batched_mul"><code>batched_mul</code></a> + <code>unfold</code> should not be used to achieve convolution.</p><pre><code class="language-julia-repl hljs">julia&gt; x = reshape([100 2 3 40 5 6 700], 7, 1, 1);  # 1D data, 1 channel, batch of 1
+[...]</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/padding.jl#L11-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pad_zeros" href="#NNlib.pad_zeros"><code>NNlib.pad_zeros</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pad_zeros(x, pad::Tuple; [dims])
+pad_zeros(x, pad::Int; [dims])</code></pre><p>Pad the array <code>x</code> with zeros. Equivalent to <a href="#NNlib.pad_constant"><code>pad_constant</code></a> with the constant equal to 0. </p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/padding.jl#L1-L7">source</a></section></article><h2 id="Convolution"><a class="docs-heading-anchor" href="#Convolution">Convolution</a><a id="Convolution-1"></a><a class="docs-heading-anchor-permalink" href="#Convolution" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Conv</code> and <code>CrossCor</code> layers use <code>NNlib.DenseConvDims</code> and <code>NNlib.conv</code> internally.</p><p><code>NNlib.conv</code> supports complex datatypes on CPU and CUDA devices.</p><div class="admonition is-info"><header class="admonition-header">AMDGPU MIOpen supports only cross-correlation (`flipkernel=true`).</header><div class="admonition-body"><p>Therefore for every regular convolution (<code>flipkernel=false</code>) kernel is flipped before calculation. For better performance, use cross-correlation (<code>flipkernel=true</code>) and manually flip the kernel before <code>NNlib.conv</code> call. <code>Flux</code> handles this automatically, this is only required for direct calls.</p></div></div><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.conv" href="#NNlib.conv"><code>NNlib.conv</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">conv(x, w; stride = 1, pad = 0, dilation = 1, flipped = false, groups = 1)</code></pre><p>Apply convolution filter <code>w</code> to input <code>x</code>. <code>x</code> and <code>w</code> are 3d/4d/5d tensors in 1d/2d/3d convolutions respectively. <code>x</code> and <code>w</code> may have real or complex element types.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/conv.jl#L44-L49">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.ConvDims" href="#NNlib.ConvDims"><code>NNlib.ConvDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">ConvDims</code></pre><p>Type system-level information about convolution dimensions. Critical for things like <code>im2col!()</code> to generate efficient code, and helpful to reduce the number of kwargs getting passed around.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/dim_helpers/ConvDims.jl#L1-L7">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.depthwiseconv" href="#NNlib.depthwiseconv"><code>NNlib.depthwiseconv</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">depthwiseconv(x, w; stride=1, pad=0, dilation=1, flipped=false)</code></pre><p>Depthwise convolution operation with filter <code>w</code> on input <code>x</code>. <code>x</code> and <code>w</code> are 3d/4d/5d tensors in 1d/2d/3d convolutions respectively.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/conv.jl#L59-L64">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.DepthwiseConvDims" href="#NNlib.DepthwiseConvDims"><code>NNlib.DepthwiseConvDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">DepthwiseConvDims</code></pre><p>Concrete subclass of <code>ConvDims</code> for a depthwise convolution.  Differs primarily due to characterization by <code>C_in</code>, <code>C_mult</code>, rather than <code>C_in</code>, <code>C_out</code>.  Useful to be separate from DenseConvDims primarily for channel calculation differences.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/dim_helpers/DepthwiseConvDims.jl#L1-L7">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.DenseConvDims" href="#NNlib.DenseConvDims"><code>NNlib.DenseConvDims</code></a> — <span class="docstring-category">Type</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">DenseConvDims</code></pre><p>Concrete subclass of <code>ConvDims</code> for a normal, dense, conv2d/conv3d.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/dim_helpers/DenseConvDims.jl#L1-L5">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.unfold" href="#NNlib.unfold"><code>NNlib.unfold</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">unfold(x, kernel_size; stride = 1, pad = 0, dilation = 0, flipped = true)</code></pre><p>Places sliding windows of x into a container tensor of size <code>(num_windows, window_size, batchsize)</code>. The window size is determined by the <code>prod(spatial dims of kernel)*input_channels</code>. The number of sliding windows will match those of convolution (<code>conv</code>) with the same kernel_size and arguments. Note that by default <code>conv</code> flips the spatial dimensions of its kernel (default <code>flipped=false</code>), whereas <code>unfold</code> does not (default <code>flipped=true</code>). Uses <code>NNlib.im2col!</code> as backend.</p><p>See also <a href="#NNlib.fold"><code>fold</code></a>, the adjoint/transpose operator and a potential inverse of <code>unfold</code>.</p><p><strong>Example</strong></p><p>The below example demonstrates that <code>unfold</code> uses the same sliding windows as <code>conv</code>. In general <a href="#NNlib.batched_mul"><code>batched_mul</code></a> + <code>unfold</code> should not be used to achieve convolution.</p><pre><code class="language-julia-repl hljs">julia&gt; x = reshape([100 2 3 40 5 6 700], 7, 1, 1);  # 1D data, 1 channel, batch of 1
 
 julia&gt; w = reshape([1 0 -1], 3, 1, 1);  # 1D conv kernel of length 3
 
@@ -591,7 +591,7 @@
   -2
  -38
   34
-   6</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/fold.jl#L1-L49">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.fold" href="#NNlib.fold"><code>NNlib.fold</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">fold(y, output_size, kernel_size; stride = 1, pad = 0, dilation = 0, flipped = true)</code></pre><p>The adjoint/transpose operator of <code>unfold</code>. It accumulates sliding windows from the output of <code>unfold</code> into a container tensor of size <code>output_size</code>. An inverse to <code>unfold</code> may be obtained (in some cases) by using <code>fold</code> and accounting for scaling issues with a divisor (see example). Uses <code>NNlib.col2im!</code> as backend.</p><p>See also <a href="#NNlib.unfold"><code>unfold</code></a>.</p><p><strong>Example</strong></p><pre><code class="language-julia-repl hljs">julia&gt; x = reshape([100 2 3 40 5 6 700], 7, 1, 1);  # 1D data, 1 channel, batch of 1
+   6</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/fold.jl#L1-L49">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.fold" href="#NNlib.fold"><code>NNlib.fold</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">fold(y, output_size, kernel_size; stride = 1, pad = 0, dilation = 0, flipped = true)</code></pre><p>The adjoint/transpose operator of <code>unfold</code>. It accumulates sliding windows from the output of <code>unfold</code> into a container tensor of size <code>output_size</code>. An inverse to <code>unfold</code> may be obtained (in some cases) by using <code>fold</code> and accounting for scaling issues with a divisor (see example). Uses <code>NNlib.col2im!</code> as backend.</p><p>See also <a href="#NNlib.unfold"><code>unfold</code></a>.</p><p><strong>Example</strong></p><pre><code class="language-julia-repl hljs">julia&gt; x = reshape([100 2 3 40 5 6 700], 7, 1, 1);  # 1D data, 1 channel, batch of 1
 
 julia&gt; y = NNlib.unfold(x, (3,1,1))  # sliding window of size 3
 5×3×1 Array{Int64, 3}:
@@ -633,7 +633,7 @@
   40.0
    5.0
    6.0
- 700.0</code></pre><p>In general, an inverse to <code>unfold</code> does not exist if <code>divisor</code> contains zeros.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/fold.jl#L58-L115">source</a></section></article><h2 id="Upsampling"><a class="docs-heading-anchor" href="#Upsampling">Upsampling</a><a id="Upsampling-1"></a><a class="docs-heading-anchor-permalink" href="#Upsampling" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Upsample</code> layer uses <code>NNlib.upsample_nearest</code>, <code>NNlib.upsample_bilinear</code>, and <code>NNlib.upsample_trilinear</code> as its backend. Additionally, <code>Flux</code>&#39;s <code>PixelShuffle</code> layer uses <code>NNlib.pixel_shuffle</code> as its backend.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_nearest" href="#NNlib.upsample_nearest"><code>NNlib.upsample_nearest</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_nearest(x, scale::NTuple{S,Int})
+ 700.0</code></pre><p>In general, an inverse to <code>unfold</code> does not exist if <code>divisor</code> contains zeros.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/fold.jl#L58-L115">source</a></section></article><h2 id="Upsampling"><a class="docs-heading-anchor" href="#Upsampling">Upsampling</a><a id="Upsampling-1"></a><a class="docs-heading-anchor-permalink" href="#Upsampling" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Upsample</code> layer uses <code>NNlib.upsample_nearest</code>, <code>NNlib.upsample_bilinear</code>, and <code>NNlib.upsample_trilinear</code> as its backend. Additionally, <code>Flux</code>&#39;s <code>PixelShuffle</code> layer uses <code>NNlib.pixel_shuffle</code> as its backend.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_nearest" href="#NNlib.upsample_nearest"><code>NNlib.upsample_nearest</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_nearest(x, scale::NTuple{S,Int})
 upsample_nearest(x; size::NTuple{S,Int})</code></pre><p>Upsamples the array <code>x</code> by integer multiples along the first <code>S</code> dimensions. Subsequent dimensions of <code>x</code> are not altered.</p><p>Either the <code>scale</code> factors or the final output <code>size</code> can be specified.</p><p>See also <a href="#NNlib.upsample_bilinear"><code>upsample_bilinear</code></a>, for two dimensions of an <code>N=4</code> array.</p><p><strong>Example</strong></p><pre><code class="language-julia-repl hljs">julia&gt; upsample_nearest([1 2 3; 4 5 6], (2, 3))
 4×9 Matrix{Int64}:
  1  1  1  2  2  2  3  3  3
@@ -652,8 +652,8 @@
  4  5  6
 
 julia&gt; ans == upsample_nearest([1 2 3; 4 5 6], size=(4,))
-true</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L120-L153">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_nearest" href="#NNlib.∇upsample_nearest"><code>NNlib.∇upsample_nearest</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_nearest(Δ::AbstractArray{T,3}, scales::NTuple{S, &lt;:Integer}) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>scales</code>: scales by which the image was upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L176-L185">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_linear" href="#NNlib.upsample_linear"><code>NNlib.upsample_linear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_linear(x::AbstractArray{T,3}, scale::Real; align_corners::Bool = true)
-upsample_linear(x::AbstractArray{T,3}; size::Integer, align_corners::Bool = true)</code></pre><p>Upsamples the first dimension of the array <code>x</code> by the upsample provided <code>scale</code>, using linear interpolation. As an alternative to using <code>scale</code>, the resulting array <code>size</code> can be directly specified with a keyword argument.</p><p>The size of the output is equal to <code>(scale*S1, S2, S3)</code>, where <code>S1, S2, S3 = size(x)</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L207-L217">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_linear" href="#NNlib.∇upsample_linear"><code>NNlib.∇upsample_linear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_linear(Δ::AbstractArray{T,3}; size::Integer, align_corners::Bool = true) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>size</code>: Size of the image upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L247-L256">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_bilinear" href="#NNlib.upsample_bilinear"><code>NNlib.upsample_bilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_bilinear(x::AbstractArray{T,4}, scale::NTuple{2,Real}; align_corners::Bool = true)
+true</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L120-L153">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_nearest" href="#NNlib.∇upsample_nearest"><code>NNlib.∇upsample_nearest</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_nearest(Δ::AbstractArray{T,3}, scales::NTuple{S, &lt;:Integer}) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>scales</code>: scales by which the image was upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L176-L185">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_linear" href="#NNlib.upsample_linear"><code>NNlib.upsample_linear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_linear(x::AbstractArray{T,3}, scale::Real; align_corners::Bool = true)
+upsample_linear(x::AbstractArray{T,3}; size::Integer, align_corners::Bool = true)</code></pre><p>Upsamples the first dimension of the array <code>x</code> by the upsample provided <code>scale</code>, using linear interpolation. As an alternative to using <code>scale</code>, the resulting array <code>size</code> can be directly specified with a keyword argument.</p><p>The size of the output is equal to <code>(scale*S1, S2, S3)</code>, where <code>S1, S2, S3 = size(x)</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L207-L217">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_linear" href="#NNlib.∇upsample_linear"><code>NNlib.∇upsample_linear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_linear(Δ::AbstractArray{T,3}; size::Integer, align_corners::Bool = true) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>size</code>: Size of the image upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L247-L256">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_bilinear" href="#NNlib.upsample_bilinear"><code>NNlib.upsample_bilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_bilinear(x::AbstractArray{T,4}, scale::NTuple{2,Real}; align_corners::Bool = true)
 upsample_bilinear(x::AbstractArray{T,4}; size::NTuple{2,Integer}, align_corners::Bool = true)</code></pre><p>Upsamples the first 2 dimensions of the array <code>x</code> by the upsample factors stored in <code>scale</code>, using bilinear interpolation. As an alternative to using <code>scale</code>, the resulting image <code>size</code> can be directly specified with a keyword argument.</p><p>The size of the output is equal to <code>(scale[1]*S1, scale[2]*S2, S3, S4)</code>, where <code>S1, S2, S3, S4 = size(x)</code>.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; x = reshape(Float32[1 2 3; 4 5 6], (2,3,1,1))
 2×3×1×1 Array{Float32, 4}:
 [:, :, 1, 1] =
@@ -678,10 +678,10 @@
  1.75  1.97222  2.19444  2.41667  2.63889     3.08333  3.30556  3.52778  3.75
  2.5   2.72222  2.94444  3.16667  3.38889     3.83333  4.05556  4.27778  4.5
  3.25  3.47222  3.69444  3.91667  4.13889     4.58333  4.80556  5.02778  5.25
- 4.0   4.22222  4.44444  4.66667  4.88889     5.33333  5.55556  5.77778  6.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L274-L314">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_bilinear" href="#NNlib.∇upsample_bilinear"><code>NNlib.∇upsample_bilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_bilinear(Δ::AbstractArray{T,4}; size::NTuple{2,Integer}, align_corners::Bool = true) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>size</code>: Lateral (W,H) size of the image upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L319-L328">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_trilinear" href="#NNlib.upsample_trilinear"><code>NNlib.upsample_trilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_trilinear(x::AbstractArray{T,5}, scale::NTuple{3,Real}; align_corners::Bool = true)
+ 4.0   4.22222  4.44444  4.66667  4.88889     5.33333  5.55556  5.77778  6.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L274-L314">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_bilinear" href="#NNlib.∇upsample_bilinear"><code>NNlib.∇upsample_bilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_bilinear(Δ::AbstractArray{T,4}; size::NTuple{2,Integer}, align_corners::Bool = true) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>size</code>: Lateral (W,H) size of the image upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L319-L328">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.upsample_trilinear" href="#NNlib.upsample_trilinear"><code>NNlib.upsample_trilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">upsample_trilinear(x::AbstractArray{T,5}, scale::NTuple{3,Real}; align_corners::Bool = true)
 upsample_trilinear(x::AbstractArray{T,5}; size::NTuple{3,Integer}, align_corners::Bool = true)</code></pre><p>Upsamples the first 3 dimensions of the array <code>x</code> by the upsample factors stored in <code>scale</code>, using trilinear interpolation. As an alternative to using <code>scale</code>, the resulting image <code>size</code> can be directly specified with a keyword argument.</p><p>The size of the output is equal to <code>(scale[1]*S1, scale[2]*S2, scale[3]*S3, S4, S5)</code>, where <code>S1, S2, S3, S4, S5 = size(x)</code>.</p><p><strong>Examples</strong></p><pre><code class="language-julia hljs">upsample_trilinear(x, (2, 3, 4))
 upsample_trilinear(x; size=(4, 9, 11))  # specify ouput size instead
-upsample_trilinear(x, (2.5, 3.5, pi))  # non-integer scaling factors are allowed</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L331-L349">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_trilinear" href="#NNlib.∇upsample_trilinear"><code>NNlib.∇upsample_trilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_trilinear(Δ::AbstractArray{T,5}; size::NTuple{3,Integer}, align_corners::Bool = true) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>size</code>: Lateral size &amp; depth (W,H,D) of the image upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L353-L362">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pixel_shuffle" href="#NNlib.pixel_shuffle"><code>NNlib.pixel_shuffle</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pixel_shuffle(x, r::Integer)</code></pre><p>Pixel shuffling operation, upscaling by a factor <code>r</code>.</p><p>For 4-arrays representing <code>N</code> images, the operation converts input <code>size(x) == (W, H, r^2*C, N)</code> to output of size <code>(r*W, r*H, C, N)</code>. For <code>D</code>-dimensional data, it expects <code>ndims(x) == D+2</code> with channel and batch dimensions, and divides the number of channels by <code>r^D</code>.</p><p>Used in super-resolution networks to upsample towards high resolution features. Reference: Shi et. al., &quot;Real-Time Single Image and Video Super-Resolution ...&quot;, CVPR 2016, https://arxiv.org/abs/1609.05158</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; x = [10i + j + channel/10 for i in 1:2, j in 1:3, channel in 1:4, batch in 1:1]
+upsample_trilinear(x, (2.5, 3.5, pi))  # non-integer scaling factors are allowed</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L331-L349">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇upsample_trilinear" href="#NNlib.∇upsample_trilinear"><code>NNlib.∇upsample_trilinear</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇upsample_trilinear(Δ::AbstractArray{T,5}; size::NTuple{3,Integer}, align_corners::Bool = true) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Incoming gradient array, backpropagated from downstream layers</li><li><code>size</code>: Lateral size &amp; depth (W,H,D) of the image upsampled in the first place</li></ul><p><strong>Outputs</strong></p><ul><li><code>dx</code>: Downsampled version of <code>Δ</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L353-L362">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.pixel_shuffle" href="#NNlib.pixel_shuffle"><code>NNlib.pixel_shuffle</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">pixel_shuffle(x, r::Integer)</code></pre><p>Pixel shuffling operation, upscaling by a factor <code>r</code>.</p><p>For 4-arrays representing <code>N</code> images, the operation converts input <code>size(x) == (W, H, r^2*C, N)</code> to output of size <code>(r*W, r*H, C, N)</code>. For <code>D</code>-dimensional data, it expects <code>ndims(x) == D+2</code> with channel and batch dimensions, and divides the number of channels by <code>r^D</code>.</p><p>Used in super-resolution networks to upsample towards high resolution features. Reference: Shi et. al., &quot;Real-Time Single Image and Video Super-Resolution ...&quot;, CVPR 2016, https://arxiv.org/abs/1609.05158</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; x = [10i + j + channel/10 for i in 1:2, j in 1:3, channel in 1:4, batch in 1:1]
 2×3×4×1 Array{Float64, 4}:
 [:, :, 1, 1] =
  11.1  12.1  13.1
@@ -722,7 +722,7 @@
  2.1  2.3  2.5
  2.2  2.4  2.6
  3.1  3.3  3.5
- 3.2  3.4  3.6</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/upsample.jl#L1-L59">source</a></section></article><h2 id="Rotation"><a class="docs-heading-anchor" href="#Rotation">Rotation</a><a id="Rotation-1"></a><a class="docs-heading-anchor-permalink" href="#Rotation" title="Permalink"></a></h2><p>Rotate images in the first two dimensions of an array.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.imrotate" href="#NNlib.imrotate"><code>NNlib.imrotate</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">imrotate(arr::AbstractArray{T, 4}, θ; method=:bilinear, rotation_center=size(arr) .÷ 2 .+ 1)</code></pre><p>Rotates an array in the first two dimensions around the center pixel <code>rotation_center</code>.  The default value of <code>rotation_center</code> is defined such that there is a integer center pixel for even and odd sized arrays which it is rotated around. For an even sized array of size <code>(4,4)</code> this would be <code>(3,3)</code>, for an odd array of size <code>(3,3)</code> this would be <code>(2,2)</code> However, <code>rotation_center</code> can be also non-integer numbers if specified.</p><p>The angle <code>θ</code> is interpreted in radians.</p><p>The adjoint is defined with ChainRulesCore.jl. This method also runs with CUDA (and in principle all KernelAbstractions.jl supported backends).</p><p><strong>Keywords</strong></p><ul><li><code>method=:bilinear</code> for bilinear interpolation or <code>method=:nearest</code> for nearest neighbour</li><li><code>rotation_center=size(arr) .÷ 2 .+ 1</code> means there is a real center pixel around it is rotated.</li></ul><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; arr = zeros((4,4,1,1)); arr[2,2,1,1] = 1;
+ 3.2  3.4  3.6</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/upsample.jl#L1-L59">source</a></section></article><h2 id="Rotation"><a class="docs-heading-anchor" href="#Rotation">Rotation</a><a id="Rotation-1"></a><a class="docs-heading-anchor-permalink" href="#Rotation" title="Permalink"></a></h2><p>Rotate images in the first two dimensions of an array.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.imrotate" href="#NNlib.imrotate"><code>NNlib.imrotate</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">imrotate(arr::AbstractArray{T, 4}, θ; method=:bilinear, rotation_center=size(arr) .÷ 2 .+ 1)</code></pre><p>Rotates an array in the first two dimensions around the center pixel <code>rotation_center</code>.  The default value of <code>rotation_center</code> is defined such that there is a integer center pixel for even and odd sized arrays which it is rotated around. For an even sized array of size <code>(4,4)</code> this would be <code>(3,3)</code>, for an odd array of size <code>(3,3)</code> this would be <code>(2,2)</code> However, <code>rotation_center</code> can be also non-integer numbers if specified.</p><p>The angle <code>θ</code> is interpreted in radians.</p><p>The adjoint is defined with ChainRulesCore.jl. This method also runs with CUDA (and in principle all KernelAbstractions.jl supported backends).</p><p><strong>Keywords</strong></p><ul><li><code>method=:bilinear</code> for bilinear interpolation or <code>method=:nearest</code> for nearest neighbour</li><li><code>rotation_center=size(arr) .÷ 2 .+ 1</code> means there is a real center pixel around it is rotated.</li></ul><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; arr = zeros((4,4,1,1)); arr[2,2,1,1] = 1;
 
 julia&gt; arr
 4×4×1×1 Array{Float64, 4}:
@@ -770,8 +770,8 @@
 [:, :, 1, 1] =
  0.0  0.0  1.0
  0.0  0.0  0.0
- 0.0  0.0  0.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/rotation.jl#L96-L164">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇imrotate" href="#NNlib.∇imrotate"><code>NNlib.∇imrotate</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇imrotate(dy, arr::AbstractArray{T, 4}, θ; method=:bilinear,
-                                           rotation_center=size(arr) .÷ 2 .+ 1)</code></pre><p>Adjoint for <code>imrotate</code>. Gradient only with respect to <code>arr</code> and not <code>θ</code>.</p><p><strong>Arguments</strong></p><ul><li><code>dy</code>: input gradient </li><li><code>arr</code>: Input from primal computation</li><li><code>θ</code>: rotation angle in radians</li><li><code>method=:bilinear</code> or <code>method=:nearest</code></li><li><code>rotation_center=size(arr) .÷ 2 .+ 1</code> rotates around a real center pixel for even and odd sized arrays</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/rotation.jl#L189-L201">source</a></section></article><h2 id="Batched-Operations"><a class="docs-heading-anchor" href="#Batched-Operations">Batched Operations</a><a id="Batched-Operations-1"></a><a class="docs-heading-anchor-permalink" href="#Batched-Operations" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Bilinear</code> layer uses <code>NNlib.batched_mul</code> internally.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_mul" href="#NNlib.batched_mul"><code>NNlib.batched_mul</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_mul(A, B) -&gt; C
+ 0.0  0.0  0.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/rotation.jl#L96-L164">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇imrotate" href="#NNlib.∇imrotate"><code>NNlib.∇imrotate</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇imrotate(dy, arr::AbstractArray{T, 4}, θ; method=:bilinear,
+                                           rotation_center=size(arr) .÷ 2 .+ 1)</code></pre><p>Adjoint for <code>imrotate</code>. Gradient only with respect to <code>arr</code> and not <code>θ</code>.</p><p><strong>Arguments</strong></p><ul><li><code>dy</code>: input gradient </li><li><code>arr</code>: Input from primal computation</li><li><code>θ</code>: rotation angle in radians</li><li><code>method=:bilinear</code> or <code>method=:nearest</code></li><li><code>rotation_center=size(arr) .÷ 2 .+ 1</code> rotates around a real center pixel for even and odd sized arrays</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/rotation.jl#L189-L201">source</a></section></article><h2 id="Batched-Operations"><a class="docs-heading-anchor" href="#Batched-Operations">Batched Operations</a><a id="Batched-Operations-1"></a><a class="docs-heading-anchor-permalink" href="#Batched-Operations" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Bilinear</code> layer uses <code>NNlib.batched_mul</code> internally.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_mul" href="#NNlib.batched_mul"><code>NNlib.batched_mul</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_mul(A, B) -&gt; C
 A ⊠ B  # \boxtimes</code></pre><p>Batched matrix multiplication. Result has <code>C[:,:,k...] == A[:,:,k...] * B[:,:,k...]</code> where <code>k...</code> represent  any indices in the last dimensions.</p><p>If <code>ndims(A) == ndims(B) == 3</code> and <code>size(B,3) == 1</code> then instead <code>C[:,:,k] == A[:,:,k] * B[:,:,1]</code>, and similarly for <code>A</code>.</p><p>To transpose each matrix, apply <code>batched_transpose</code> to the array, or <code>batched_adjoint</code> for conjugate-transpose:</p><pre><code class="language-julia-repl hljs">julia&gt; A, B = randn(2,5,17), randn(5,9,17);
 
 julia&gt; A ⊠ B |&gt; size
@@ -787,7 +787,7 @@
 (2, 9, 17)
 
 julia&gt; batched_transpose(A) == PermutedDimsArray(A, (2,1,3))
-true</code></pre><p>The equivalent <code>PermutedDimsArray</code> may be used in place of <code>batched_transpose</code>. Other permutations are also handled by BLAS, provided that the batch index <code>k</code> is not the first dimension of the underlying array. Thus <code>PermutedDimsArray(::Array, (1,3,2))</code> and <code>PermutedDimsArray(::Array, (3,1,2))</code> are fine.</p><p>However, <code>A = PermutedDimsArray(::Array, (3,2,1))</code> is not acceptable to BLAS, since the batch dimension is the contiguous one: <code>stride(A,3) == 1</code>. This will be copied, as doing so is faster than <code>batched_mul_generic!</code>.</p><p>Both this <code>copy</code> and <code>batched_mul_generic!</code> produce <code>@debug</code> messages, and setting for instance <code>ENV[&quot;JULIA_DEBUG&quot;] = NNlib</code> will display them.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/batched/batchedmul.jl#L4-L46">source</a></section><section><div><pre><code class="language-julia hljs">batched_mul(A::Array{T,3}, B::Matrix)
+true</code></pre><p>The equivalent <code>PermutedDimsArray</code> may be used in place of <code>batched_transpose</code>. Other permutations are also handled by BLAS, provided that the batch index <code>k</code> is not the first dimension of the underlying array. Thus <code>PermutedDimsArray(::Array, (1,3,2))</code> and <code>PermutedDimsArray(::Array, (3,1,2))</code> are fine.</p><p>However, <code>A = PermutedDimsArray(::Array, (3,2,1))</code> is not acceptable to BLAS, since the batch dimension is the contiguous one: <code>stride(A,3) == 1</code>. This will be copied, as doing so is faster than <code>batched_mul_generic!</code>.</p><p>Both this <code>copy</code> and <code>batched_mul_generic!</code> produce <code>@debug</code> messages, and setting for instance <code>ENV[&quot;JULIA_DEBUG&quot;] = NNlib</code> will display them.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/batched/batchedmul.jl#L4-L46">source</a></section><section><div><pre><code class="language-julia hljs">batched_mul(A::Array{T,3}, B::Matrix)
 batched_mul(A::Matrix, B::Array{T,3})
 A ⊠ B</code></pre><p>This is always matrix-matrix multiplication, but either <code>A</code> or <code>B</code> may lack a batch index.</p><ul><li><p>When <code>B</code> is a matrix, result has <code>C[:,:,k] == A[:,:,k] * B[:,:]</code> for all <code>k</code>.</p></li><li><p>When <code>A</code> is a matrix, then <code>C[:,:,k] == A[:,:] * B[:,:,k]</code>. This can also be done by reshaping and calling <code>*</code>, for instance <code>A ⊡ B</code> using TensorCore.jl, but is implemented here using <code>batched_gemm</code> instead of <code>gemm</code>.</p></li></ul><pre><code class="language-julia-repl hljs">julia&gt; randn(16,8,32) ⊠ randn(8,4) |&gt; size
 (16, 4, 32)
@@ -796,19 +796,19 @@
 (16, 4, 32)
 
 julia&gt; randn(16,8) ⊠ randn(8,4,32) |&gt; size
-(16, 4, 32)</code></pre><p>See also <code>batched_vec</code> to regard <code>B</code> as a batch of vectors, <code>A[:,:,k] * B[:,k]</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/batched/batchedmul.jl#L112-L139">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_mul!" href="#NNlib.batched_mul!"><code>NNlib.batched_mul!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_mul!(C, A, B) -&gt; C
-batched_mul!(C, A, B, α=1, β=0)</code></pre><p>In-place batched matrix multiplication, equivalent to <code>mul!(C[:,:,k], A[:,:,k], B[:,:,k], α, β)</code> for all <code>k</code>. If <code>size(B,3) == 1</code> then every batch uses <code>B[:,:,1]</code> instead.</p><p>This will call <code>batched_gemm!</code> whenever possible. For real arrays this means that, for <code>X ∈ [A,B,C]</code>, either <code>stride(X,1)==1</code> or <code>stride(X,2)==1</code>, the latter may be caused by <code>batched_transpose</code> or by for instance <code>PermutedDimsArray(::Array, (3,1,2))</code>. Unlike <code>batched_mul</code> this will never make a copy.</p><p>For complex arrays, the wrapper made by <code>batched_adjoint</code> must be outermost to be seen. In this case the strided accepted by BLAS are more restricted, if <code>stride(C,1)==1</code> then only <code>stride(AorB::BatchedAdjoint,2) == 1</code> is accepted.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/batched/batchedmul.jl#L197-L213">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_adjoint" href="#NNlib.batched_adjoint"><code>NNlib.batched_adjoint</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_transpose(A::AbstractArray{T,3})
+(16, 4, 32)</code></pre><p>See also <code>batched_vec</code> to regard <code>B</code> as a batch of vectors, <code>A[:,:,k] * B[:,k]</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/batched/batchedmul.jl#L112-L139">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_mul!" href="#NNlib.batched_mul!"><code>NNlib.batched_mul!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_mul!(C, A, B) -&gt; C
+batched_mul!(C, A, B, α=1, β=0)</code></pre><p>In-place batched matrix multiplication, equivalent to <code>mul!(C[:,:,k], A[:,:,k], B[:,:,k], α, β)</code> for all <code>k</code>. If <code>size(B,3) == 1</code> then every batch uses <code>B[:,:,1]</code> instead.</p><p>This will call <code>batched_gemm!</code> whenever possible. For real arrays this means that, for <code>X ∈ [A,B,C]</code>, either <code>stride(X,1)==1</code> or <code>stride(X,2)==1</code>, the latter may be caused by <code>batched_transpose</code> or by for instance <code>PermutedDimsArray(::Array, (3,1,2))</code>. Unlike <code>batched_mul</code> this will never make a copy.</p><p>For complex arrays, the wrapper made by <code>batched_adjoint</code> must be outermost to be seen. In this case the strided accepted by BLAS are more restricted, if <code>stride(C,1)==1</code> then only <code>stride(AorB::BatchedAdjoint,2) == 1</code> is accepted.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/batched/batchedmul.jl#L197-L213">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_adjoint" href="#NNlib.batched_adjoint"><code>NNlib.batched_adjoint</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_transpose(A::AbstractArray{T,3})
 batched_adjoint(A)</code></pre><p>Equivalent to applying <code>transpose</code> or <code>adjoint</code> to each matrix <code>A[:,:,k]</code>.</p><p>These exist to control how <code>batched_mul</code> behaves, as it operates on such matrix slices of an array with <code>ndims(A)==3</code>.</p><p><code>PermutedDimsArray(A, (2,1,3))</code> is equivalent to <code>batched_transpose(A)</code>, and is also understood by <code>batched_mul</code> (and more widely supported elsewhere).</p><pre><code class="nohighlight hljs">BatchedTranspose{T, S} &lt;: AbstractBatchedMatrix{T, 3}
-BatchedAdjoint{T, S}</code></pre><p>Lazy wrappers analogous to <code>Transpose</code> and <code>Adjoint</code>, returned by <code>batched_transpose</code> etc.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/batched/batchedadjtrans.jl#L38-L54">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_transpose" href="#NNlib.batched_transpose"><code>NNlib.batched_transpose</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_transpose(A::AbstractArray{T,3})
+BatchedAdjoint{T, S}</code></pre><p>Lazy wrappers analogous to <code>Transpose</code> and <code>Adjoint</code>, returned by <code>batched_transpose</code> etc.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/batched/batchedadjtrans.jl#L38-L54">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_transpose" href="#NNlib.batched_transpose"><code>NNlib.batched_transpose</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_transpose(A::AbstractArray{T,3})
 batched_adjoint(A)</code></pre><p>Equivalent to applying <code>transpose</code> or <code>adjoint</code> to each matrix <code>A[:,:,k]</code>.</p><p>These exist to control how <code>batched_mul</code> behaves, as it operates on such matrix slices of an array with <code>ndims(A)==3</code>.</p><p><code>PermutedDimsArray(A, (2,1,3))</code> is equivalent to <code>batched_transpose(A)</code>, and is also understood by <code>batched_mul</code> (and more widely supported elsewhere).</p><pre><code class="nohighlight hljs">BatchedTranspose{T, S} &lt;: AbstractBatchedMatrix{T, 3}
-BatchedAdjoint{T, S}</code></pre><p>Lazy wrappers analogous to <code>Transpose</code> and <code>Adjoint</code>, returned by <code>batched_transpose</code> etc.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/batched/batchedadjtrans.jl#L28-L44">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_vec" href="#NNlib.batched_vec"><code>NNlib.batched_vec</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_vec(A::Array{T,3}, B::Matrix)
+BatchedAdjoint{T, S}</code></pre><p>Lazy wrappers analogous to <code>Transpose</code> and <code>Adjoint</code>, returned by <code>batched_transpose</code> etc.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/batched/batchedadjtrans.jl#L28-L44">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.batched_vec" href="#NNlib.batched_vec"><code>NNlib.batched_vec</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">batched_vec(A::Array{T,3}, B::Matrix)
 batched_vec(A::Array{T,3}, b::Vector)</code></pre><p>Batched matrix-vector multiplication: the result has <code>C[:,:,k] == A[:,:,k] * B[:,k]</code> for all <code>k</code>, or else <code>C[:,:,k] == A[:,:,k] * b</code> for <code>b::Vector</code>.</p><p>With the same argument types, <code>batched_mul(A, B)</code> would regard <code>B</code> as a fixed matrix, not a batch of vectors. Both reshape and then call <code>batched_mul(::Array{T,3}, ::Array{T,3})</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; A, B, b = randn(16,8,32), randn(8,32), randn(8);
 
 julia&gt; batched_vec(A,B) |&gt; size
 (16, 32)
 
 julia&gt; batched_vec(A,b) |&gt; size
-(16, 32)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/batched/batchedmul.jl#L164-L185">source</a></section></article><h2 id="Gather-and-Scatter"><a class="docs-heading-anchor" href="#Gather-and-Scatter">Gather and Scatter</a><a id="Gather-and-Scatter-1"></a><a class="docs-heading-anchor-permalink" href="#Gather-and-Scatter" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Embedding</code> layer uses <code>NNlib.gather</code> as its backend.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.gather" href="#NNlib.gather"><code>NNlib.gather</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.gather(src, idx) -&gt; dst</code></pre><p>Reverse operation of <a href="#NNlib.scatter"><code>scatter</code></a>. Gathers data from source <code>src</code> and writes it in a destination <code>dst</code> according to the index array <code>idx</code>. For each <code>k</code> in <code>CartesianIndices(idx)</code>, assign values to <code>dst</code> according to</p><pre><code class="nohighlight hljs">dst[:, ... , k] .= src[:, ... , idx[k]...]</code></pre><p>Notice that if <code>idx</code> is a vector containing integers and <code>src</code> is a matrix, previous expression simplifies to</p><pre><code class="nohighlight hljs">dst[:, k] .= src[:, idx[k]]</code></pre><p>and <code>k</code> will run over <code>1:length(idx)</code>.</p><p>The elements of <code>idx</code> can be integers or integer tuples and may be repeated. A single <code>src</code> column can end up being copied into zero, one, or multiple <code>dst</code> columns.</p><p>See <a href="#NNlib.gather!"><code>gather!</code></a> for an in-place version.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; NNlib.gather([1,20,300,4000], [2,4,2])
+(16, 32)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/batched/batchedmul.jl#L164-L185">source</a></section></article><h2 id="Gather-and-Scatter"><a class="docs-heading-anchor" href="#Gather-and-Scatter">Gather and Scatter</a><a id="Gather-and-Scatter-1"></a><a class="docs-heading-anchor-permalink" href="#Gather-and-Scatter" title="Permalink"></a></h2><p><code>Flux</code>&#39;s <code>Embedding</code> layer uses <code>NNlib.gather</code> as its backend.</p><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.gather" href="#NNlib.gather"><code>NNlib.gather</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.gather(src, idx) -&gt; dst</code></pre><p>Reverse operation of <a href="#NNlib.scatter"><code>scatter</code></a>. Gathers data from source <code>src</code> and writes it in a destination <code>dst</code> according to the index array <code>idx</code>. For each <code>k</code> in <code>CartesianIndices(idx)</code>, assign values to <code>dst</code> according to</p><pre><code class="nohighlight hljs">dst[:, ... , k] .= src[:, ... , idx[k]...]</code></pre><p>Notice that if <code>idx</code> is a vector containing integers and <code>src</code> is a matrix, previous expression simplifies to</p><pre><code class="nohighlight hljs">dst[:, k] .= src[:, idx[k]]</code></pre><p>and <code>k</code> will run over <code>1:length(idx)</code>.</p><p>The elements of <code>idx</code> can be integers or integer tuples and may be repeated. A single <code>src</code> column can end up being copied into zero, one, or multiple <code>dst</code> columns.</p><p>See <a href="#NNlib.gather!"><code>gather!</code></a> for an in-place version.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; NNlib.gather([1,20,300,4000], [2,4,2])
 3-element Vector{Int64}:
    20
  4000
@@ -817,7 +817,7 @@
 julia&gt; NNlib.gather([1 2 3; 4 5 6], [1,3,1,3,1])
 2×5 Matrix{Int64}:
  1  3  1  3  1
- 4  6  4  6  4</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/gather.jl#L1-L39">source</a></section><section><div><pre><code class="language-julia hljs">gather(src, IJK...)</code></pre><p>Convert the tuple of integer vectors <code>IJK</code> to a tuple of <code>CartesianIndex</code> and call <code>gather</code> on it: <code>gather(src, CartesianIndex.(IJK...))</code>.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; src = reshape([1:15;], 3, 5)
+ 4  6  4  6  4</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/gather.jl#L1-L39">source</a></section><section><div><pre><code class="language-julia hljs">gather(src, IJK...)</code></pre><p>Convert the tuple of integer vectors <code>IJK</code> to a tuple of <code>CartesianIndex</code> and call <code>gather</code> on it: <code>gather(src, CartesianIndex.(IJK...))</code>.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; src = reshape([1:15;], 3, 5)
 3×5 Matrix{Int64}:
  1  4  7  10  13
  2  5  8  11  14
@@ -826,7 +826,7 @@
 julia&gt; NNlib.gather(src, [1, 2], [2, 4])
 2-element Vector{Int64}:
   4
- 11</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/gather.jl#L49-L69">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.gather!" href="#NNlib.gather!"><code>NNlib.gather!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.gather!(dst, src, idx)</code></pre><p>Reverse operation of <a href="#NNlib.scatter!"><code>scatter!</code></a>. Gathers data from source <code>src</code> and writes it in destination <code>dst</code> according to the index array <code>idx</code>. For each <code>k</code> in <code>CartesianIndices(idx)</code>, assign values to <code>dst</code> according to</p><pre><code class="nohighlight hljs">dst[:, ... , k] .= src[:, ... , idx[k]...]</code></pre><p>Notice that if <code>idx</code> is a vector containing integers, and both <code>dst</code> and <code>src</code> are matrices, previous expression simplifies to</p><pre><code class="nohighlight hljs">dst[:, k] .= src[:, idx[k]]</code></pre><p>and <code>k</code> will run over <code>1:length(idx)</code>.</p><p>The elements of <code>idx</code> can be integers or integer tuples and may be repeated. A single <code>src</code> column can end up being copied into zero, one, or multiple <code>dst</code> columns.</p><p>See <a href="#NNlib.gather"><code>gather</code></a> for an allocating version.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/gather.jl#L81-L102">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.scatter" href="#NNlib.scatter"><code>NNlib.scatter</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.scatter(op, src, idx; [init, dstsize])</code></pre><p>Scatter operation allocating a destination array <code>dst</code> and calling <code>scatter!(op, dst, src, idx)</code> on it.</p><ul><li><p>If keyword <code>init</code> is provided, it is used to initialize the content of <code>dst</code>. Otherwise, the init values is inferred from the reduction operator <code>op</code> for some common operators (e.g. <code>init = 0</code> for <code>op = +</code>).</p></li><li><p>If <code>dstsize</code> is provided, it will be used to define the size of destination array, otherwise it will be inferred by <code>src</code> and <code>idx</code>.</p></li></ul><p>See <a href="#NNlib.scatter!"><code>scatter!</code></a> for full details on how <code>idx</code> works.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; NNlib.scatter(+, [10,100,1000], [3,1,2])
+ 11</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/gather.jl#L49-L69">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.gather!" href="#NNlib.gather!"><code>NNlib.gather!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.gather!(dst, src, idx)</code></pre><p>Reverse operation of <a href="#NNlib.scatter!"><code>scatter!</code></a>. Gathers data from source <code>src</code> and writes it in destination <code>dst</code> according to the index array <code>idx</code>. For each <code>k</code> in <code>CartesianIndices(idx)</code>, assign values to <code>dst</code> according to</p><pre><code class="nohighlight hljs">dst[:, ... , k] .= src[:, ... , idx[k]...]</code></pre><p>Notice that if <code>idx</code> is a vector containing integers, and both <code>dst</code> and <code>src</code> are matrices, previous expression simplifies to</p><pre><code class="nohighlight hljs">dst[:, k] .= src[:, idx[k]]</code></pre><p>and <code>k</code> will run over <code>1:length(idx)</code>.</p><p>The elements of <code>idx</code> can be integers or integer tuples and may be repeated. A single <code>src</code> column can end up being copied into zero, one, or multiple <code>dst</code> columns.</p><p>See <a href="#NNlib.gather"><code>gather</code></a> for an allocating version.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/gather.jl#L81-L102">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.scatter" href="#NNlib.scatter"><code>NNlib.scatter</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.scatter(op, src, idx; [init, dstsize])</code></pre><p>Scatter operation allocating a destination array <code>dst</code> and calling <code>scatter!(op, dst, src, idx)</code> on it.</p><ul><li><p>If keyword <code>init</code> is provided, it is used to initialize the content of <code>dst</code>. Otherwise, the init values is inferred from the reduction operator <code>op</code> for some common operators (e.g. <code>init = 0</code> for <code>op = +</code>).</p></li><li><p>If <code>dstsize</code> is provided, it will be used to define the size of destination array, otherwise it will be inferred by <code>src</code> and <code>idx</code>.</p></li></ul><p>See <a href="#NNlib.scatter!"><code>scatter!</code></a> for full details on how <code>idx</code> works.</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; NNlib.scatter(+, [10,100,1000], [3,1,2])
 3-element Vector{Int64}:
   100
  1000
@@ -844,7 +844,7 @@
     10
   2000
     10
-    10</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/scatter.jl#L136-L173">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.scatter!" href="#NNlib.scatter!"><code>NNlib.scatter!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.scatter!(op, dst, src, idx)</code></pre><p>Scatter operation, which writes data in <code>src</code> into <code>dst</code> at locations <code>idx</code>. A binary reduction operator <code>op</code> is applied during the scatter. For each index <code>k</code> in <code>idx</code>, accumulates values in <code>dst</code> according to</p><pre><code class="nohighlight hljs">dst[:, ..., idx[k]...] = (op).(dst[:, ..., idx[k]...], src[:, ..., k...])</code></pre><p>See also <a href="#NNlib.scatter"><code>scatter</code></a>, <a href="#NNlib.gather"><code>gather</code></a>.</p><p><strong>Arguments</strong></p><ul><li><code>op</code>: Operations to be applied on <code>dst</code> and <code>src</code>, e.g. <code>+</code>, <code>-</code>, <code>*</code>, <code>/</code>, <code>max</code>, <code>min</code> and <code>mean</code>.</li><li><code>dst</code>: The destination for <code>src</code> to aggregate to. This argument will be mutated.</li><li><code>src</code>: The source data for aggregating.</li><li><code>idx</code>: The mapping for aggregation from source (index) to destination (value).        The <code>idx</code> array can contain either integers or tuples.</li></ul><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; NNlib.scatter!(+, ones(3), [10,100], [1,3])
+    10</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/scatter.jl#L136-L173">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.scatter!" href="#NNlib.scatter!"><code>NNlib.scatter!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">NNlib.scatter!(op, dst, src, idx)</code></pre><p>Scatter operation, which writes data in <code>src</code> into <code>dst</code> at locations <code>idx</code>. A binary reduction operator <code>op</code> is applied during the scatter. For each index <code>k</code> in <code>idx</code>, accumulates values in <code>dst</code> according to</p><pre><code class="nohighlight hljs">dst[:, ..., idx[k]...] = (op).(dst[:, ..., idx[k]...], src[:, ..., k...])</code></pre><p>See also <a href="#NNlib.scatter"><code>scatter</code></a>, <a href="#NNlib.gather"><code>gather</code></a>.</p><p><strong>Arguments</strong></p><ul><li><code>op</code>: Operations to be applied on <code>dst</code> and <code>src</code>, e.g. <code>+</code>, <code>-</code>, <code>*</code>, <code>/</code>, <code>max</code>, <code>min</code> and <code>mean</code>.</li><li><code>dst</code>: The destination for <code>src</code> to aggregate to. This argument will be mutated.</li><li><code>src</code>: The source data for aggregating.</li><li><code>idx</code>: The mapping for aggregation from source (index) to destination (value).        The <code>idx</code> array can contain either integers or tuples.</li></ul><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; NNlib.scatter!(+, ones(3), [10,100], [1,3])
 3-element Vector{Float64}:
   11.0
    1.0
@@ -853,7 +853,7 @@
 julia&gt; NNlib.scatter!(*, fill(0.5, 2, 4), [1 10; 100 1000], [3,2])
 2×4 Matrix{Float64}:
  0.5    5.0   0.5  0.5
- 0.5  500.0  50.0  0.5</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/scatter.jl#L40-L72">source</a></section></article><h2 id="Sampling"><a class="docs-heading-anchor" href="#Sampling">Sampling</a><a id="Sampling-1"></a><a class="docs-heading-anchor-permalink" href="#Sampling" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.grid_sample" href="#NNlib.grid_sample"><code>NNlib.grid_sample</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">grid_sample(input::AbstractArray{T, 4}, grid::AbstractArray{T, 4}; padding_mode = :zeros)</code></pre><p>Given <code>input</code>, compute output by sampling <code>input</code> values at pixel locations from <code>grid</code>. Uses bilinear interpolation to calculate output values.</p><p>This implementation assumes the extrema (<code>-1</code> and <code>1</code>) are considered as referring to the center points of the input’s corner pixels (i.e. align corners is <code>true</code>).</p><p><strong>Arguments</strong></p><ul><li><p><code>input</code>: Input array in <code>(W_in, H_in, C, N)</code> shape.</p></li><li><p><code>grid</code>: Input grid in <code>(2, W_out, H_out, N)</code> shape.   Where for each <code>(W_out, H_out, N)</code> grid contains <code>(x, y)</code>   coordinates that specify sampling locations normalized by the <code>input</code> shape.</p><p>Therefore, <code>x</code> and <code>y</code> should have values in <code>[-1, 1]</code> range.   For example, <code>(x = -1, y = -1)</code> is the left-top pixel of <code>input</code>,   and <code>(x = 1, y = 1)</code> is the right-bottom pixel of <code>input</code>.</p><p>Out-of-bound values are handled according to the <code>padding_mode</code>.</p></li><li><p><code>padding_mode</code>: Out-of-bound padding.   <code>:zeros</code> to use <code>0</code> for out-of-bound grid locations.   <code>:border</code> to use border values for out-of-bound grid locations.   Default is <code>:zeros</code>.</p></li></ul><p><strong>Returns</strong></p><p><code>(W_out, H_out, C, N)</code> sampled grid from <code>input</code>.</p><p><strong>Examples</strong></p><p>In the example below, grid contains two out-of-bound sampling locations, which are handled differently, depending on the <code>padding_mode</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; x = reshape(collect(1.0:4.0), (2, 2, 1, 1))
+ 0.5  500.0  50.0  0.5</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/scatter.jl#L40-L72">source</a></section></article><h2 id="Sampling"><a class="docs-heading-anchor" href="#Sampling">Sampling</a><a id="Sampling-1"></a><a class="docs-heading-anchor-permalink" href="#Sampling" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.grid_sample" href="#NNlib.grid_sample"><code>NNlib.grid_sample</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">grid_sample(input::AbstractArray{T, 4}, grid::AbstractArray{T, 4}; padding_mode = :zeros)</code></pre><p>Given <code>input</code>, compute output by sampling <code>input</code> values at pixel locations from <code>grid</code>. Uses bilinear interpolation to calculate output values.</p><p>This implementation assumes the extrema (<code>-1</code> and <code>1</code>) are considered as referring to the center points of the input’s corner pixels (i.e. align corners is <code>true</code>).</p><p><strong>Arguments</strong></p><ul><li><p><code>input</code>: Input array in <code>(W_in, H_in, C, N)</code> shape.</p></li><li><p><code>grid</code>: Input grid in <code>(2, W_out, H_out, N)</code> shape.   Where for each <code>(W_out, H_out, N)</code> grid contains <code>(x, y)</code>   coordinates that specify sampling locations normalized by the <code>input</code> shape.</p><p>Therefore, <code>x</code> and <code>y</code> should have values in <code>[-1, 1]</code> range.   For example, <code>(x = -1, y = -1)</code> is the left-top pixel of <code>input</code>,   and <code>(x = 1, y = 1)</code> is the right-bottom pixel of <code>input</code>.</p><p>Out-of-bound values are handled according to the <code>padding_mode</code>.</p></li><li><p><code>padding_mode</code>: Out-of-bound padding.   <code>:zeros</code> to use <code>0</code> for out-of-bound grid locations.   <code>:border</code> to use border values for out-of-bound grid locations.   Default is <code>:zeros</code>.</p></li></ul><p><strong>Returns</strong></p><p><code>(W_out, H_out, C, N)</code> sampled grid from <code>input</code>.</p><p><strong>Examples</strong></p><p>In the example below, grid contains two out-of-bound sampling locations, which are handled differently, depending on the <code>padding_mode</code>.</p><pre><code class="language-julia-repl hljs">julia&gt; x = reshape(collect(1.0:4.0), (2, 2, 1, 1))
 2×2×1×1 Array{Float64, 4}:
 [:, :, 1, 1] =
  1.0  3.0
@@ -885,7 +885,7 @@
 [:, :, 1, 1] =
  1.0  3.0
  1.5  3.5
- 2.0  4.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/sampling.jl#L26-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇grid_sample" href="#NNlib.∇grid_sample"><code>NNlib.∇grid_sample</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇grid_sample(Δ::AbstractArray{T, 4}, input::AbstractArray{T, 4}, grid::AbstractArray{T, 4}; padding_mode = :zeros) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Input gradient in <code>(W_out, H_out, C, N)</code> shape   (same as output of the primal computation).</li><li><code>input</code>: Input from primal computation in <code>(W_in, H_in, C, N)</code> shape.</li><li><code>grid</code>: Grid from primal computation in <code>(2, W_out, H_out, N)</code> shape.</li><li><code>padding_mode</code>: Out-of-bound padding.   <code>:zeros</code> to use <code>0</code> for out-of-bound grid locations.   <code>:border</code> to use border values for out-of-bound grid locations.   Should be the same as in primal computation.   Default is <code>:zeros</code>.</li></ul><p><strong>Returns</strong></p><p><code>dinput</code> (same shape as <code>input</code>) and <code>dgrid</code> (same shape as <code>grid</code>) gradients.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/sampling.jl#L152-L170">source</a></section></article><h2 id="Losses"><a class="docs-heading-anchor" href="#Losses">Losses</a><a id="Losses-1"></a><a class="docs-heading-anchor-permalink" href="#Losses" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.ctc_loss" href="#NNlib.ctc_loss"><code>NNlib.ctc_loss</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">ctc_loss(ŷ, y)</code></pre><p>Computes the connectionist temporal classification loss between <code>ŷ</code> and <code>y</code>. <code>ŷ</code> must be a classes-by-time matrices, i.e., each row represents a class and each column represents a time step. Additionally, the <code>logsoftmax</code> function will be applied to <code>ŷ</code>, so <code>ŷ</code> must be the raw activation values from the neural network and not, for example, the activations after being passed through a <code>softmax</code> activation function. <code>y</code> must be a 1D array of the labels associated with <code>ŷ</code>. The blank label is assumed to be the last label category in <code>ŷ</code>, so it is equivalent to <code>size(ŷ, 1)</code>. Used for sequence-to-sequence classification problems such as speech recognition and handwriting recognition where the exact time-alignment of the output (e.g., letters) is not needed to solve the problem. See <a href="https://www.cs.toronto.edu/~graves/icml_2006.pdf">Graves et al. (2006)</a> or <a href="https://www.cs.toronto.edu/~graves/preprint.pdf#chapter.7">Graves (2012)</a> for mathematical details.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/ctc.jl#L109-L128">source</a></section></article><h2 id="Miscellaneous"><a class="docs-heading-anchor" href="#Miscellaneous">Miscellaneous</a><a id="Miscellaneous-1"></a><a class="docs-heading-anchor-permalink" href="#Miscellaneous" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logsumexp" href="#NNlib.logsumexp"><code>NNlib.logsumexp</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logsumexp(x; dims = :)</code></pre><p>Computes <code>log.(sum(exp.(x); dims))</code> in a numerically stable way. Without <code>dims</code> keyword this returns a scalar.</p><p>See also <a href="#NNlib.logsoftmax"><code>logsoftmax</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/softmax.jl#L134-L141">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.glu" href="#NNlib.glu"><code>NNlib.glu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">glu(x, dim = 1)</code></pre><p>The gated linear unit from the <a href="https://arxiv.org/abs/1612.08083">&quot;Language Modeling with Gated Convolutional Networks&quot;</a> paper.</p><p>Calculates <code>a .* sigmoid(b)</code>, where <code>x</code> is split in half along given dimension <code>dim</code> to form <code>a</code> and <code>b</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/functions.jl#L1-L7">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.within_gradient" href="#NNlib.within_gradient"><code>NNlib.within_gradient</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">within_gradient(x) --&gt; Bool</code></pre><p>Returns <code>false</code> except when used inside a <code>gradient</code> call, when it returns <code>true</code>. Useful for Flux regularisation layers which behave differently during training and inference.</p><p>This should work with any ChainRules-based differentiation package, in which case <code>x</code> is ignored. But Tracker.jl overloads <code>with_gradient(x::TrackedArray)</code>, thus for widest use you should pass it an array whose gradient is of interest. There is also an overload for ForwardDiff.jl&#39;s <code>Dual</code> types (and arrays of them).</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; using ForwardDiff, Zygote, NNlib
+ 2.0  4.0</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/sampling.jl#L26-L97">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.∇grid_sample" href="#NNlib.∇grid_sample"><code>NNlib.∇grid_sample</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">∇grid_sample(Δ::AbstractArray{T, 4}, input::AbstractArray{T, 4}, grid::AbstractArray{T, 4}; padding_mode = :zeros) where T</code></pre><p><strong>Arguments</strong></p><ul><li><code>Δ</code>: Input gradient in <code>(W_out, H_out, C, N)</code> shape   (same as output of the primal computation).</li><li><code>input</code>: Input from primal computation in <code>(W_in, H_in, C, N)</code> shape.</li><li><code>grid</code>: Grid from primal computation in <code>(2, W_out, H_out, N)</code> shape.</li><li><code>padding_mode</code>: Out-of-bound padding.   <code>:zeros</code> to use <code>0</code> for out-of-bound grid locations.   <code>:border</code> to use border values for out-of-bound grid locations.   Should be the same as in primal computation.   Default is <code>:zeros</code>.</li></ul><p><strong>Returns</strong></p><p><code>dinput</code> (same shape as <code>input</code>) and <code>dgrid</code> (same shape as <code>grid</code>) gradients.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/sampling.jl#L152-L170">source</a></section></article><h2 id="Losses"><a class="docs-heading-anchor" href="#Losses">Losses</a><a id="Losses-1"></a><a class="docs-heading-anchor-permalink" href="#Losses" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.ctc_loss" href="#NNlib.ctc_loss"><code>NNlib.ctc_loss</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">ctc_loss(ŷ, y)</code></pre><p>Computes the connectionist temporal classification loss between <code>ŷ</code> and <code>y</code>. <code>ŷ</code> must be a classes-by-time matrices, i.e., each row represents a class and each column represents a time step. Additionally, the <code>logsoftmax</code> function will be applied to <code>ŷ</code>, so <code>ŷ</code> must be the raw activation values from the neural network and not, for example, the activations after being passed through a <code>softmax</code> activation function. <code>y</code> must be a 1D array of the labels associated with <code>ŷ</code>. The blank label is assumed to be the last label category in <code>ŷ</code>, so it is equivalent to <code>size(ŷ, 1)</code>. Used for sequence-to-sequence classification problems such as speech recognition and handwriting recognition where the exact time-alignment of the output (e.g., letters) is not needed to solve the problem. See <a href="https://www.cs.toronto.edu/~graves/icml_2006.pdf">Graves et al. (2006)</a> or <a href="https://www.cs.toronto.edu/~graves/preprint.pdf#chapter.7">Graves (2012)</a> for mathematical details.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/ctc.jl#L109-L128">source</a></section></article><h2 id="Miscellaneous"><a class="docs-heading-anchor" href="#Miscellaneous">Miscellaneous</a><a id="Miscellaneous-1"></a><a class="docs-heading-anchor-permalink" href="#Miscellaneous" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.logsumexp" href="#NNlib.logsumexp"><code>NNlib.logsumexp</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">logsumexp(x; dims = :)</code></pre><p>Computes <code>log.(sum(exp.(x); dims))</code> in a numerically stable way. Without <code>dims</code> keyword this returns a scalar.</p><p>See also <a href="#NNlib.logsoftmax"><code>logsoftmax</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/softmax.jl#L134-L141">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.glu" href="#NNlib.glu"><code>NNlib.glu</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">glu(x, dim = 1)</code></pre><p>The gated linear unit from the <a href="https://arxiv.org/abs/1612.08083">&quot;Language Modeling with Gated Convolutional Networks&quot;</a> paper.</p><p>Calculates <code>a .* sigmoid(b)</code>, where <code>x</code> is split in half along given dimension <code>dim</code> to form <code>a</code> and <code>b</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/functions.jl#L1-L7">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.within_gradient" href="#NNlib.within_gradient"><code>NNlib.within_gradient</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">within_gradient(x) --&gt; Bool</code></pre><p>Returns <code>false</code> except when used inside a <code>gradient</code> call, when it returns <code>true</code>. Useful for Flux regularisation layers which behave differently during training and inference.</p><p>This should work with any ChainRules-based differentiation package, in which case <code>x</code> is ignored. But Tracker.jl overloads <code>with_gradient(x::TrackedArray)</code>, thus for widest use you should pass it an array whose gradient is of interest. There is also an overload for ForwardDiff.jl&#39;s <code>Dual</code> types (and arrays of them).</p><p><strong>Examples</strong></p><pre><code class="language-julia-repl hljs">julia&gt; using ForwardDiff, Zygote, NNlib
 
 julia&gt; f_good(x) = if NNlib.within_gradient(x)
                      @show 10x
@@ -912,4 +912,4 @@
 
 julia&gt; ForwardDiff.derivative(x -&gt; f_bad(x, 3.0), 2.0)
 x * y = Dual{ForwardDiff.Tag{var&quot;#9#10&quot;, Float64}}(6.0,3.0)
-3.0</code></pre><p>What goes wrong in <code>f_bad</code> is that Zygote knows <code>any</code> to be non-differentiable, and thus completely ignores its contents. This is not a perfect mechanism, and the only style recommended is precisely that of <code>f_good</code> above.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/utils.jl#L1-L47">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.bias_act!" href="#NNlib.bias_act!"><code>NNlib.bias_act!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">bias_act!(σ, x, b)</code></pre><p>This is equivalent to <code>x .= σ.(x .+ b)</code>, also replacing <code>sigmoid</code> &amp; <code>tanh</code> with <code>sigmoid_fast</code> &amp; <code>tanh_fast</code>. It will only overwrite <code>x</code> when <code>x isa StridedArray{&lt;:AbstractFloat}</code>.</p><p>When used within a gradient, it will overwrite only when <code>σ</code> has a method of <code>derivatives_given_output</code> which does not need the input at all. Such methods are defined by e.g. <code>@scalar_rule relu(x) Ω &gt; 0</code> where the derivative contains only <code>Ω</code> (the output) not <code>x</code>.</p><div class="admonition is-warning"><header class="admonition-header">Warning</header><div class="admonition-body"><p>This is not safe to use if <code>x</code> is still needed for the gradient of some other function. Incorrect use will give silently wrong answers. It is intended mainly for Flux layers, in which the previous operation is known to be safe, e.g. <code>bias_act!(σ, weight * input, bias)</code> for a <code>Dense</code> layer.</p></div></div></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9488d77b10f80acce532713ddcdef1431cb60622/src/bias_act.jl#L14-L31">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../">« Home</a><a class="docs-footer-nextpage" href="../audio/">Audio »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="catppuccin-latte">catppuccin-latte</option><option value="catppuccin-frappe">catppuccin-frappe</option><option value="catppuccin-macchiato">catppuccin-macchiato</option><option value="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.7.0 on <span class="colophon-date" title="Saturday 16 November 2024 18:44">Saturday 16 November 2024</span>. Using Julia version 1.9.4.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+3.0</code></pre><p>What goes wrong in <code>f_bad</code> is that Zygote knows <code>any</code> to be non-differentiable, and thus completely ignores its contents. This is not a perfect mechanism, and the only style recommended is precisely that of <code>f_good</code> above.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/utils.jl#L1-L47">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="NNlib.bias_act!" href="#NNlib.bias_act!"><code>NNlib.bias_act!</code></a> — <span class="docstring-category">Function</span><span class="is-flex-grow-1 docstring-article-toggle-button" title="Collapse docstring"></span></header><section><div><pre><code class="language-julia hljs">bias_act!(σ, x, b)</code></pre><p>This is equivalent to <code>x .= σ.(x .+ b)</code>, also replacing <code>sigmoid</code> &amp; <code>tanh</code> with <code>sigmoid_fast</code> &amp; <code>tanh_fast</code>. It will only overwrite <code>x</code> when <code>x isa StridedArray{&lt;:AbstractFloat}</code>.</p><p>When used within a gradient, it will overwrite only when <code>σ</code> has a method of <code>derivatives_given_output</code> which does not need the input at all. Such methods are defined by e.g. <code>@scalar_rule relu(x) Ω &gt; 0</code> where the derivative contains only <code>Ω</code> (the output) not <code>x</code>.</p><div class="admonition is-warning"><header class="admonition-header">Warning</header><div class="admonition-body"><p>This is not safe to use if <code>x</code> is still needed for the gradient of some other function. Incorrect use will give silently wrong answers. It is intended mainly for Flux layers, in which the previous operation is known to be safe, e.g. <code>bias_act!(σ, weight * input, bias)</code> for a <code>Dense</code> layer.</p></div></div></div><a class="docs-sourcelink" target="_blank" href="https://github.com/FluxML/NNlib.jl/blob/9fe2aae5a911661fa86579205cb52e6d43627fc2/src/bias_act.jl#L14-L31">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../">« Home</a><a class="docs-footer-nextpage" href="../audio/">Audio »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option><option value="catppuccin-latte">catppuccin-latte</option><option value="catppuccin-frappe">catppuccin-frappe</option><option value="catppuccin-macchiato">catppuccin-macchiato</option><option value="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.7.0 on <span class="colophon-date" title="Saturday 16 November 2024 20:23">Saturday 16 November 2024</span>. Using Julia version 1.9.4.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/previews/PR613/spectrogram.png b/previews/PR613/spectrogram.png
index be19a26f..e764fe97 100644
Binary files a/previews/PR613/spectrogram.png and b/previews/PR613/spectrogram.png differ
diff --git a/previews/PR613/waveform.png b/previews/PR613/waveform.png
index 547b984c..c10382c9 100644
Binary files a/previews/PR613/waveform.png and b/previews/PR613/waveform.png differ