diff --git a/_sources/book/topology/002-topological-spaces.md b/_sources/book/topology/002-topological-spaces.md
index 1422c22d..4efff7f6 100644
--- a/_sources/book/topology/002-topological-spaces.md
+++ b/_sources/book/topology/002-topological-spaces.md
@@ -32,7 +32,7 @@ We refer to the topology associated with a given metric as the induced topology.
 
 :::{prf:definition} Induced topology
 :label: topology:def-induced-topology
-Let $(X, d)$ be a metric space.
+Let $(X, d)$ be a {prf:ref}`metric space<topology:def-metric-space>`.
 Then, the topology induced by $d$ is the set of all open sets in $X$ with respect to the metric $d.$
 :::
 
@@ -45,3 +45,99 @@ Let $f: X \to Y$ be a function between topological spaces.
 Then, $f$ is continuous if for every open set $U \subseteq Y,$ the pre-image $f^{-1}(U)$ is an open set in $X.$
 :::
 
+
+:::{prf:lemma} Composition preserves continuity
+:label: topology:lemma-composition-preserves-continuity
+If $f: X \to Y$ and $g: Y \to Z$ are {prf:ref}`continuous functions<topology:def-continuous-function-topology>` between {prf:ref}`topological spaces<topology:def-topological-space>`, then the composition $g \circ f: X \to Z$ is continuous.
+:::
+
+
+In topology, we are interested in studying the properties of spaces that are preserved under continuous deformations.
+Therefore, from a topology perspective, two spaces are considered essentially the same up to a continuous bijection.
+This is captured by the notion of homeomorphism.
+
+:::{prf:definition} Homeomorphism
+:label: topology:def-homeomorphism
+A function $f: X \to Y$ between {prf:ref}`topological spaces<topology:def-topological-space>` is a {prf:ref}`homeomorphism<topology:def-homeomorphism>` if it is bijective, {prf:ref}`continuous<topology:def-continuous-function-topology>`, and its inverse $f^{-1}$ is also continuous.
+Equivalently, $f$ is a homeomorphism if $f$ is a bijection and $U \subseteq X$ is {prf:ref}`open<topology:def-topological-space>` if and only if $f(U) \subseteq Y$ is open.
+We say two spaces are homeomorphic if there exists a homeomorphism between them.
+:::
+
+:::{prf:lemma} Homeomorphism is an equivalence relation
+:label: topology:lemma-homeomorphism-equivalence-relation
+{prf:ref}`Homeomorphism<topology:def-homeomorphism>` is an equivalence relation between topological spaces.
+:::
+
+:::{dropdown} Proof: Homeomorphism is an equivalence relation
+__Reflexivity:__
+The identity map $I_X: X \to X$ is a homeomorphism, because it is bijective, continuous, and its inverse is itself.
+Therefore $X \equiv X.$
+
+__Symmetry:__
+If $f: X \to Y$ is a homeomorphism, then $f^{-1}: Y \to X$ is also a homeomorphism.
+Therefore $X \equiv Y$ implies $Y \equiv X.$
+
+__Transitivity:__
+If $f: X \to Y$ and $g: Y \to Z$ are homeomorphisms, then $g \circ f: X \to Z$ is a homeomorphism.
+Therefore $X \equiv Y$ and $Y \equiv Z$ implies $X \equiv Z.$
+:::
+
+In general, the approach for showing that two spaces are homeomorphic is to find a homeomorphism between them.
+However, showing that two spaces are _not_ homeomorphic is more difficult.
+In particular, there is no simple recipe for showing that two spaces are not homeomorphic.
+Instead, we resort to certain topological properties that are preserved under homeomorphisms.
+Whenever two spaces have different such properties, we can conclude that they are not homeomorphic.
+Two such properties are connectedness and compactness.
+In the remainder of this chapter we give definitions and results building up to these properties.
+
+
+## Sequences
+
+We now turn to re-defining concepts from metric spaces in terms of topological spaces, starting with sequences.
+First we re-define the following shorthand for open sets.
+
+:::{prf:definition} Open neighbourhood
+:label: topology:def-open-neighbourhood-topology
+An open neighbourhood of a point $x \in X$ in a {prf:ref}`topological space<topology:def-topological-space>` $(X, \mathcal{U})$ is an open set $U \in \mathcal{U}$ such that $x \in U.$
+:::
+
+
+In topological spaces, convergent sequences are defined directly in terms of open neighbourhoods, rather than using open balls.
+
+:::{prf:definition} Convergent sequence
+:label: topology:def-convergent-sequence-topology
+A sequence $x_n \to x$ if for every open neighbourhood $U$ of $x,$ there exists $N \in \mathbb{N}$ such that $x_n \in U$ for all $n > N.$
+:::
+
+
+We now turn to uniqueness of limits.
+In general, in a topological space limits need not be unique.
+For example, given a set $X$ with the coarse topology $\mathcal{U} = \{\emptyset, X\},$ every sequence converges to every point.
+However, further assumptions on the topology can result into unique limits.
+
+:::{prf:definition} Hausdorff space
+:label: topology:def-hausdorff-space
+A topological space $(X, \mathcal{U})$ is Hausdorff if for every pair of distinct points $x_1, x_2 \in X,$ there exist open neighbourhoods $U_1, U_2$ of $x_1, x_2$ respectively such that $U_1 \cap U_2 = \emptyset.$
+:::
+
+:::{margin}
+Earlier, we proved that {prf:ref}`limits in metric spaces are unique<topology:lemma-limits-in-metric-spaces-are-unique>`.
+The property we used in that proof was that, in a {prf:ref}`metric space <topology:def-metric-space>`, open balls centered around distinct points are disjoint if their radii are small enough.
+This was the Hausdorff property in disguise.
+Metric spaces are always {prf:ref}`Hausdorff<topology:def-hausdorff-space>`, and therefore have unique limits.
+:::
+
+:::{prf:lemma} Limits are unique in Hausdorff spaces
+:label: topology:lemma-limits-unique-hausdorff
+If $X$ is {prf:ref}`Hausdorff<topology:def-hausdorff-space>` and $(x_n)$ is a sequence in $X$ such that $x_n \to x$ and $x_n \to x',$ then $x = x'.$
+:::
+
+:::{dropdown} Proof: Limits are unique in Hausdorff spaces
+Let $(x_n)$ be a sequence in $X$ such that $x_n \to x$ and $x_n \to x'.$
+Suppose $x \neq x'.$
+Since $X$ is Hausdorff, there exist open neighbourhoods $U, U'$ of $x, x'$ respectively such that $U \cap U' = \emptyset.$
+Since $x_n \to x,$ there exists $N \in \mathbb{N}$ such that $x_n \in U$ for all $n > N.$
+Similarly, since $x_n \to x',$ there exists $N' \in \mathbb{N}$ such that $x_n \in U'$ for all $n > N'.$
+Then, for all $n > \max(N, N'),$ we have $x_n \in U \cap U' = \emptyset,$ which is a contradiction.
+Therefore, $x = x'.$
+:::
\ No newline at end of file
diff --git a/book/papers/ais/ais.html b/book/papers/ais/ais.html
index 6d509c63..b12c07c2 100644
--- a/book/papers/ais/ais.html
+++ b/book/papers/ais/ais.html
@@ -591,7 +591,7 @@ <h2>Importance sampling<a class="headerlink" href="#importance-sampling" title="
 It is reasonable to expect that the more dissimilar <span class="math notranslate nohighlight">\(q\)</span> and <span class="math notranslate nohighlight">\(p\)</span> are, the larger the variance will be.
 In partricular, we can show that the variance of the importance weights can be lower bounded by a quantity that scales exponentially with the KL divergence.</p>
 <div class="proof lemma admonition" id="lemma-0">
-<p class="admonition-title"><span class="caption-number">Lemma 15 </span> (Lower bound to importance weight variance)</p>
+<p class="admonition-title"><span class="caption-number">Lemma 18 </span> (Lower bound to importance weight variance)</p>
 <section class="lemma-content" id="proof-content">
 <p>Given distributions <span class="math notranslate nohighlight">\(p\)</span> and <span class="math notranslate nohighlight">\(q\)</span>, it holds that</p>
 <div class="math notranslate nohighlight">
@@ -649,7 +649,7 @@ <h2>Importance-weighted MCMC<a class="headerlink" href="#importance-weighted-mcm
 So we could, in principle, use MCMC within an importance-weighted estimator to reduce its variance.
 The following algorithm is based on this intuition.</p>
 <div class="proof definition admonition" id="definition-1">
-<p class="admonition-title"><span class="caption-number">Definition 83 </span> (Importance weighted MCMC algorithm)</p>
+<p class="admonition-title"><span class="caption-number">Definition 87 </span> (Importance weighted MCMC algorithm)</p>
 <section class="definition-content" id="proof-content">
 <p>Given a proposal density <span class="math notranslate nohighlight">\(q\)</span>, a target density <span class="math notranslate nohighlight">\(p\)</span> and a sequence of transition kernels <span class="math notranslate nohighlight">\(T_1(x, x'), \dots, T_K(x, x')\)</span> be a sequence of transition kernels such that <span class="math notranslate nohighlight">\(T_k\)</span> leaves <span class="math notranslate nohighlight">\(p\)</span> invariant.
 Sampling <span class="math notranslate nohighlight">\(x_0 \sim q(x)\)</span> followed by</p>
@@ -737,7 +737,7 @@ <h2>Annealed Importance Sampling<a class="headerlink" href="#id2" title="Link to
 <p>These distributions interpolate between <span class="math notranslate nohighlight">\(q\)</span> and <span class="math notranslate nohighlight">\(p\)</span> as we vary <span class="math notranslate nohighlight">\(\beta\)</span>.
 AIS then proceeds in a similar way to the importance weighted MCMC algorithm we highlighted above, except that it requires that each <span class="math notranslate nohighlight">\(T_k\)</span> leaves <span class="math notranslate nohighlight">\(\pi_k\)</span>, instead of <span class="math notranslate nohighlight">\(p\)</span>, invariant.</p>
 <div class="proof definition admonition" id="definition-2">
-<p class="admonition-title"><span class="caption-number">Definition 84 </span> (Annealed Importance Sampling)</p>
+<p class="admonition-title"><span class="caption-number">Definition 88 </span> (Annealed Importance Sampling)</p>
 <section class="definition-content" id="proof-content">
 <p>Given a target density <span class="math notranslate nohighlight">\(p\)</span>, a proposal density <span class="math notranslate nohighlight">\(q\)</span> and a sequence <span class="math notranslate nohighlight">\(0 = \beta_0 \leq \dots \leq \beta_K = 1\)</span>, define</p>
 <div class="math notranslate nohighlight">
diff --git a/book/papers/num-sde/num-sde.html b/book/papers/num-sde/num-sde.html
index 704c3cb3..3c230c2e 100644
--- a/book/papers/num-sde/num-sde.html
+++ b/book/papers/num-sde/num-sde.html
@@ -467,7 +467,7 @@ <h2>Why stochastic differential equations<a class="headerlink" href="#why-stocha
 <h2>The Wiener process<a class="headerlink" href="#the-wiener-process" title="Link to this heading">#</a></h2>
 <p>In order to define the stochastic component of the transition rule of a stochastic system, we must define an appropriate noise model. The Wiener process is a stochastic process that is commonly used for this purpose.</p>
 <div class="proof definition admonition" id="definition-0">
-<p class="admonition-title"><span class="caption-number">Definition 88 </span> (Wiener process)</p>
+<p class="admonition-title"><span class="caption-number">Definition 92 </span> (Wiener process)</p>
 <section class="definition-content" id="proof-content">
 <p>A standard Wiener process over [0, T] is a random variable <span class="math notranslate nohighlight">\(W(t)\)</span> that depends continuously on <span class="math notranslate nohighlight">\(t \in [0, T]\)</span> and satisfies:</p>
 <ol class="arabic simple">
@@ -650,7 +650,7 @@ <h2>Evaluating a stochastic integral<a class="headerlink" href="#evaluating-a-st
 <h2>Euler-Maruyama method<a class="headerlink" href="#euler-maruyama-method" title="Link to this heading">#</a></h2>
 <p>The Euler-Maruyama method is the analoge of the Euler method for deterministic integrals, applied to the stochastic case.</p>
 <div class="proof definition admonition" id="definition-1">
-<p class="admonition-title"><span class="caption-number">Definition 89 </span> (Euler-Maruyama method)</p>
+<p class="admonition-title"><span class="caption-number">Definition 93 </span> (Euler-Maruyama method)</p>
 <section class="definition-content" id="proof-content">
 <p>Given a scalar SDE with drift and diffusion functions <span class="math notranslate nohighlight">\(f\)</span> and <span class="math notranslate nohighlight">\(g\)</span></p>
 <div class="math notranslate nohighlight">
@@ -770,7 +770,7 @@ <h2>Euler-Maruyama method<a class="headerlink" href="#euler-maruyama-method" tit
 <h2>Strong and weak convergence<a class="headerlink" href="#strong-and-weak-convergence" title="Link to this heading">#</a></h2>
 <p>Since the choice of the number of bins <span class="math notranslate nohighlight">\(N\)</span> of the discretisation affects the accuracy of our method, we are interested in how quickly the approximation converges to the exact solution as a function of <span class="math notranslate nohighlight">\(N\)</span>. To do so, we must first define <em>what convergence means</em> in the stochastic case, which leads us to two disctinct notions of convergence, the strong sence and the weak sense.</p>
 <div class="proof definition admonition" id="definition-2">
-<p class="admonition-title"><span class="caption-number">Definition 90 </span> (Strong convergence)</p>
+<p class="admonition-title"><span class="caption-number">Definition 94 </span> (Strong convergence)</p>
 <section class="definition-content" id="proof-content">
 <p>A method for approximating a stochastic process <span class="math notranslate nohighlight">\(X(t)\)</span> is said to have strong order of convergence <span class="math notranslate nohighlight">\(\gamma\)</span> if there exists a constant such that</p>
 <div class="math notranslate nohighlight">
@@ -780,7 +780,7 @@ <h2>Strong and weak convergence<a class="headerlink" href="#strong-and-weak-conv
 </section>
 </div><p>Strong convergence refers to the rate of convergence of the approximation <span class="math notranslate nohighlight">\(X_n\)</span> to the exact solution <span class="math notranslate nohighlight">\(X(\tau_n)\)</span> as <span class="math notranslate nohighlight">\(\Delta t \to 0\)</span>, in expectation. A weaker condition for convergence is rate at which the expected value of the approximation converges to the true expected value, as <span class="math notranslate nohighlight">\(\Delta t \to 0\)</span>, as given below.</p>
 <div class="proof definition admonition" id="definition-3">
-<p class="admonition-title"><span class="caption-number">Definition 91 </span> (Weak convergence)</p>
+<p class="admonition-title"><span class="caption-number">Definition 95 </span> (Weak convergence)</p>
 <section class="definition-content" id="proof-content">
 <p>A method for approximating a stochastic process <span class="math notranslate nohighlight">\(X(t)\)</span> is said to have weak order of convergence <span class="math notranslate nohighlight">\(\gamma\)</span> if there exists a constant such that</p>
 <div class="math notranslate nohighlight">
@@ -799,7 +799,7 @@ <h2>Strong and weak convergence<a class="headerlink" href="#strong-and-weak-conv
 <h2>Milstein’s higher order method<a class="headerlink" href="#milstein-s-higher-order-method" title="Link to this heading">#</a></h2>
 <p>Just as higher order methods for ODEs exist for obtaining refined estimates of the solution, so do methods for SDEs, such as Milstein’s higher order method.</p>
 <div class="proof definition admonition" id="definition-4">
-<p class="admonition-title"><span class="caption-number">Definition 92 </span> (Milstein’s method)</p>
+<p class="admonition-title"><span class="caption-number">Definition 96 </span> (Milstein’s method)</p>
 <section class="definition-content" id="proof-content">
 <p>Given a scalar SDE with drift and diffusion functions <span class="math notranslate nohighlight">\(f\)</span> and <span class="math notranslate nohighlight">\(g\)</span></p>
 <div class="math notranslate nohighlight">
diff --git a/book/papers/rff/rff.html b/book/papers/rff/rff.html
index f38156c8..e856ab93 100644
--- a/book/papers/rff/rff.html
+++ b/book/papers/rff/rff.html
@@ -514,7 +514,7 @@ <h2>The RFF approximation<a class="headerlink" href="#the-rff-approximation" tit
 <p>This is also an unbiased estimate of the kernel, however its variance is lower than in the <span class="math notranslate nohighlight">\(M = 1\)</span> case, since the variance of the average of the sum of <span class="math notranslate nohighlight">\(K\)</span> i.i.d. random variables is lower than the variance of a single one of the variables.
 We therefore arrive at the following algorithm for estimating <span class="math notranslate nohighlight">\(k\)</span>.</p>
 <div class="proof definition admonition" id="definition-1">
-<p class="admonition-title"><span class="caption-number">Definition 85 </span> (Random Fourier Features)</p>
+<p class="admonition-title"><span class="caption-number">Definition 89 </span> (Random Fourier Features)</p>
 <section class="definition-content" id="proof-content">
 <p>Given a translation invariant kernel <span class="math notranslate nohighlight">\(k\)</span> that is the Fourier transform of a probability measure <span class="math notranslate nohighlight">\(p\)</span>, we have the unbiased real-valued estimator</p>
 <div class="math notranslate nohighlight">
@@ -536,7 +536,7 @@ <h3>RFF and Bayesian regression<a class="headerlink" href="#rff-and-bayesian-reg
 <h3>Rates of convergence<a class="headerlink" href="#rates-of-convergence" title="Link to this heading">#</a></h3>
 <p>Now there remains the question of how large the error of the RFF estimator is. In other words, how closely does RFF estimate the exact kernel <span class="math notranslate nohighlight">\(k\)</span>? Since <span class="math notranslate nohighlight">\(-\sqrt{2} \leq z_{\omega, \phi} \leq \sqrt{2}\)</span>, we can use Hoeffding’s inequality<span id="id2">[<a class="reference internal" href="#id10" title="David Grimmett, Geoffrey Stirzaker. Probability and random processes. Oxford university press, 2020.">Grimmett, 2020</a>]</span> to obtain the following high-probability bound on the absolute error on our estimate of <span class="math notranslate nohighlight">\(k\)</span>.</p>
 <div class="proof lemma admonition" id="lemma-2">
-<p class="admonition-title"><span class="caption-number">Lemma 16 </span> (Hoeffding for RFF)</p>
+<p class="admonition-title"><span class="caption-number">Lemma 19 </span> (Hoeffding for RFF)</p>
 <section class="lemma-content" id="proof-content">
 <p>The RFF estimator of <span class="math notranslate nohighlight">\(k\)</span>, using <span class="math notranslate nohighlight">\(M\)</span> pairs of <span class="math notranslate nohighlight">\(\omega, \phi\)</span>, obeys</p>
 <div class="math notranslate nohighlight">
@@ -548,7 +548,7 @@ <h3>Rates of convergence<a class="headerlink" href="#rates-of-convergence" title
 Note that this is a statement about the closeness of <span class="math notranslate nohighlight">\(z^\top(x)z(y)\)</span> and <span class="math notranslate nohighlight">\(k(x, y)\)</span> for any two input pairs, rather than the closeness of these functions over the whole input space.
 In fact, it is possible<span id="id3">[<a class="reference internal" href="#id11" title="Ali Rahimi, Benjamin Recht, and others. Random features for large-scale kernel machines. In NIPS. 2007.">Rahimi <em>et al.</em>, 2007</a>]</span> to make a stronger statement about the uniform convergence of the estimator.</p>
 <div class="proof lemma admonition" id="lemma-3">
-<p class="admonition-title"><span class="caption-number">Lemma 17 </span> (Uniform convergence of RFF)</p>
+<p class="admonition-title"><span class="caption-number">Lemma 20 </span> (Uniform convergence of RFF)</p>
 <section class="lemma-content" id="proof-content">
 <p>Let <span class="math notranslate nohighlight">\(\mathcal{M}\)</span> be a compact subset of <span class="math notranslate nohighlight">\(\mathbb{R}^D\)</span>. Then the RFF estimator of <span class="math notranslate nohighlight">\(k\)</span>, using <span class="math notranslate nohighlight">\(M\)</span> pairs of <span class="math notranslate nohighlight">\(\omega, \phi\)</span> converges uniformly to <span class="math notranslate nohighlight">\(k\)</span> according to</p>
 <div class="math notranslate nohighlight">
diff --git a/book/papers/score-matching/score-matching.html b/book/papers/score-matching/score-matching.html
index 849b9d22..d1c39621 100644
--- a/book/papers/score-matching/score-matching.html
+++ b/book/papers/score-matching/score-matching.html
@@ -471,7 +471,7 @@ <h2>The score matching trick<a class="headerlink" href="#the-score-matching-tric
 <p>The second step is to find a way to use the score function <span class="math notranslate nohighlight">\(\psi_\theta(x)\)</span> along with some observed data, to estimate the parameters <span class="math notranslate nohighlight">\(\theta\)</span>.
 We can achieve this by defining the following score matching objective.</p>
 <div class="proof definition admonition" id="definition-0">
-<p class="admonition-title"><span class="caption-number">Definition 86 </span> (Score matching objective)</p>
+<p class="admonition-title"><span class="caption-number">Definition 90 </span> (Score matching objective)</p>
 <section class="definition-content" id="proof-content">
 <p>Given a data distribution <span class="math notranslate nohighlight">\(p_d(x)\)</span> and an approximating distribution <span class="math notranslate nohighlight">\(p_\theta(x)\)</span> with parameters <span class="math notranslate nohighlight">\(\theta\)</span>, we define the score matching objective as</p>
 <div class="math notranslate nohighlight">
diff --git a/book/papers/svgd/svgd.html b/book/papers/svgd/svgd.html
index e89beeac..7b462cdb 100644
--- a/book/papers/svgd/svgd.html
+++ b/book/papers/svgd/svgd.html
@@ -583,7 +583,7 @@ <h3>Direction of steepest descent<a class="headerlink" href="#direction-of-steep
 <h3>Empirical approximation<a class="headerlink" href="#empirical-approximation" title="Link to this heading">#</a></h3>
 <p>Now, if we approximate <span class="math notranslate nohighlight">\(q\)</span> by a finite set of <span class="math notranslate nohighlight">\(N\)</span> particles at locations <span class="math notranslate nohighlight">\(x_n^{(i)}, n = 1, ..., N\)</span>, at the <span class="math notranslate nohighlight">\(i^{th}\)</span> iteration, we obtain at the following iterative algorithm.</p>
 <div class="proof definition admonition" id="definition-2">
-<p class="admonition-title"><span class="caption-number">Definition 87 </span> (Stein variational gradient descent)</p>
+<p class="admonition-title"><span class="caption-number">Definition 91 </span> (Stein variational gradient descent)</p>
 <section class="definition-content" id="proof-content">
 <p>Given a distribution <span class="math notranslate nohighlight">\(p(x)\)</span>, a postive definite kernel <span class="math notranslate nohighlight">\(k(x, x')\)</span> and a set of particles with initial positions <span class="math notranslate nohighlight">\(\{x_n^{(0)}\}_{n=1}^N\)</span>, Stein variational gradient descent evolves the particles according to</p>
 <div class="math notranslate nohighlight">
diff --git a/book/topology/002-topological-spaces.html b/book/topology/002-topological-spaces.html
index 9f941124..19fcc2f4 100644
--- a/book/topology/002-topological-spaces.html
+++ b/book/topology/002-topological-spaces.html
@@ -428,6 +428,7 @@ <h2> Contents </h2>
             <nav aria-label="Page">
                 <ul class="visible nav section-nav flex-column">
 <li class="toc-h2 nav-item toc-entry"><a class="reference internal nav-link" href="#topologies">Topologies</a></li>
+<li class="toc-h2 nav-item toc-entry"><a class="reference internal nav-link" href="#sequences">Sequences</a></li>
 </ul>
             </nav>
         </div>
@@ -470,7 +471,7 @@ <h2>Topologies<a class="headerlink" href="#topologies" title="Link to this headi
 <div class="proof definition admonition" id="topology:def-induced-topology">
 <p class="admonition-title"><span class="caption-number">Definition 81 </span> (Induced topology)</p>
 <section class="definition-content" id="proof-content">
-<p>Let <span class="math notranslate nohighlight">\((X, d)\)</span> be a metric space.
+<p>Let <span class="math notranslate nohighlight">\((X, d)\)</span> be a <a class="reference internal" href="001-metric-spaces.html#topology:def-metric-space">metric space</a>.
 Then, the topology induced by <span class="math notranslate nohighlight">\(d\)</span> is the set of all open sets in <span class="math notranslate nohighlight">\(X\)</span> with respect to the metric <span class="math notranslate nohighlight">\(d.\)</span></p>
 </section>
 </div><p>We now also re-define continuity in terms of open sets.</p>
@@ -480,7 +481,103 @@ <h2>Topologies<a class="headerlink" href="#topologies" title="Link to this headi
 <p>Let <span class="math notranslate nohighlight">\(f: X \to Y\)</span> be a function between topological spaces.
 Then, <span class="math notranslate nohighlight">\(f\)</span> is continuous if for every open set <span class="math notranslate nohighlight">\(U \subseteq Y,\)</span> the pre-image <span class="math notranslate nohighlight">\(f^{-1}(U)\)</span> is an open set in <span class="math notranslate nohighlight">\(X.\)</span></p>
 </section>
-</div></section>
+</div><div class="proof lemma admonition" id="topology:lemma-composition-preserves-continuity">
+<p class="admonition-title"><span class="caption-number">Lemma 15 </span> (Composition preserves continuity)</p>
+<section class="lemma-content" id="proof-content">
+<p>If <span class="math notranslate nohighlight">\(f: X \to Y\)</span> and <span class="math notranslate nohighlight">\(g: Y \to Z\)</span> are <a class="reference internal" href="#topology:def-continuous-function-topology">continuous functions</a> between <a class="reference internal" href="#topology:def-topological-space">topological spaces</a>, then the composition <span class="math notranslate nohighlight">\(g \circ f: X \to Z\)</span> is continuous.</p>
+</section>
+</div><p>In topology, we are interested in studying the properties of spaces that are preserved under continuous deformations.
+Therefore, from a topology perspective, two spaces are considered essentially the same up to a continuous bijection.
+This is captured by the notion of homeomorphism.</p>
+<div class="proof definition admonition" id="topology:def-homeomorphism">
+<p class="admonition-title"><span class="caption-number">Definition 83 </span> (Homeomorphism)</p>
+<section class="definition-content" id="proof-content">
+<p>A function <span class="math notranslate nohighlight">\(f: X \to Y\)</span> between <a class="reference internal" href="#topology:def-topological-space">topological spaces</a> is a <a class="reference internal" href="#topology:def-homeomorphism">homeomorphism</a> if it is bijective, <a class="reference internal" href="#topology:def-continuous-function-topology">continuous</a>, and its inverse <span class="math notranslate nohighlight">\(f^{-1}\)</span> is also continuous.
+Equivalently, <span class="math notranslate nohighlight">\(f\)</span> is a homeomorphism if <span class="math notranslate nohighlight">\(f\)</span> is a bijection and <span class="math notranslate nohighlight">\(U \subseteq X\)</span> is <a class="reference internal" href="#topology:def-topological-space">open</a> if and only if <span class="math notranslate nohighlight">\(f(U) \subseteq Y\)</span> is open.
+We say two spaces are homeomorphic if there exists a homeomorphism between them.</p>
+</section>
+</div><div class="proof lemma admonition" id="topology:lemma-homeomorphism-equivalence-relation">
+<p class="admonition-title"><span class="caption-number">Lemma 16 </span> (Homeomorphism is an equivalence relation)</p>
+<section class="lemma-content" id="proof-content">
+<p><a class="reference internal" href="#topology:def-homeomorphism">Homeomorphism</a> is an equivalence relation between topological spaces.</p>
+</section>
+</div><details class="sd-sphinx-override sd-dropdown sd-card sd-mb-3">
+<summary class="sd-summary-title sd-card-header">
+Proof: Homeomorphism is an equivalence relation<div class="sd-summary-down docutils">
+<svg version="1.1" width="1.5em" height="1.5em" class="sd-octicon sd-octicon-chevron-down" viewBox="0 0 24 24" aria-hidden="true"><path fill-rule="evenodd" d="M5.22 8.72a.75.75 0 000 1.06l6.25 6.25a.75.75 0 001.06 0l6.25-6.25a.75.75 0 00-1.06-1.06L12 14.44 6.28 8.72a.75.75 0 00-1.06 0z"></path></svg></div>
+<div class="sd-summary-up docutils">
+<svg version="1.1" width="1.5em" height="1.5em" class="sd-octicon sd-octicon-chevron-up" viewBox="0 0 24 24" aria-hidden="true"><path fill-rule="evenodd" d="M18.78 15.28a.75.75 0 000-1.06l-6.25-6.25a.75.75 0 00-1.06 0l-6.25 6.25a.75.75 0 101.06 1.06L12 9.56l5.72 5.72a.75.75 0 001.06 0z"></path></svg></div>
+</summary><div class="sd-summary-content sd-card-body docutils">
+<p class="sd-card-text"><strong>Reflexivity:</strong>
+The identity map <span class="math notranslate nohighlight">\(I_X: X \to X\)</span> is a homeomorphism, because it is bijective, continuous, and its inverse is itself.
+Therefore <span class="math notranslate nohighlight">\(X \equiv X.\)</span></p>
+<p class="sd-card-text"><strong>Symmetry:</strong>
+If <span class="math notranslate nohighlight">\(f: X \to Y\)</span> is a homeomorphism, then <span class="math notranslate nohighlight">\(f^{-1}: Y \to X\)</span> is also a homeomorphism.
+Therefore <span class="math notranslate nohighlight">\(X \equiv Y\)</span> implies <span class="math notranslate nohighlight">\(Y \equiv X.\)</span></p>
+<p class="sd-card-text"><strong>Transitivity:</strong>
+If <span class="math notranslate nohighlight">\(f: X \to Y\)</span> and <span class="math notranslate nohighlight">\(g: Y \to Z\)</span> are homeomorphisms, then <span class="math notranslate nohighlight">\(g \circ f: X \to Z\)</span> is a homeomorphism.
+Therefore <span class="math notranslate nohighlight">\(X \equiv Y\)</span> and <span class="math notranslate nohighlight">\(Y \equiv Z\)</span> implies <span class="math notranslate nohighlight">\(X \equiv Z.\)</span></p>
+</div>
+</details><p>In general, the approach for showing that two spaces are homeomorphic is to find a homeomorphism between them.
+However, showing that two spaces are <em>not</em> homeomorphic is more difficult.
+In particular, there is no simple recipe for showing that two spaces are not homeomorphic.
+Instead, we resort to certain topological properties that are preserved under homeomorphisms.
+Whenever two spaces have different such properties, we can conclude that they are not homeomorphic.
+Two such properties are connectedness and compactness.
+In the remainder of this chapter we give definitions and results building up to these properties.</p>
+</section>
+<section id="sequences">
+<h2>Sequences<a class="headerlink" href="#sequences" title="Link to this heading">#</a></h2>
+<p>We now turn to re-defining concepts from metric spaces in terms of topological spaces, starting with sequences.
+First we re-define the following shorthand for open sets.</p>
+<div class="proof definition admonition" id="topology:def-open-neighbourhood-topology">
+<p class="admonition-title"><span class="caption-number">Definition 84 </span> (Open neighbourhood)</p>
+<section class="definition-content" id="proof-content">
+<p>An open neighbourhood of a point <span class="math notranslate nohighlight">\(x \in X\)</span> in a <a class="reference internal" href="#topology:def-topological-space">topological space</a> <span class="math notranslate nohighlight">\((X, \mathcal{U})\)</span> is an open set <span class="math notranslate nohighlight">\(U \in \mathcal{U}\)</span> such that <span class="math notranslate nohighlight">\(x \in U.\)</span></p>
+</section>
+</div><p>In topological spaces, convergent sequences are defined directly in terms of open neighbourhoods, rather than using open balls.</p>
+<div class="proof definition admonition" id="topology:def-convergent-sequence-topology">
+<p class="admonition-title"><span class="caption-number">Definition 85 </span> (Convergent sequence)</p>
+<section class="definition-content" id="proof-content">
+<p>A sequence <span class="math notranslate nohighlight">\(x_n \to x\)</span> if for every open neighbourhood <span class="math notranslate nohighlight">\(U\)</span> of <span class="math notranslate nohighlight">\(x,\)</span> there exists <span class="math notranslate nohighlight">\(N \in \mathbb{N}\)</span> such that <span class="math notranslate nohighlight">\(x_n \in U\)</span> for all <span class="math notranslate nohighlight">\(n &gt; N.\)</span></p>
+</section>
+</div><p>We now turn to uniqueness of limits.
+In general, in a topological space limits need not be unique.
+For example, given a set <span class="math notranslate nohighlight">\(X\)</span> with the coarse topology <span class="math notranslate nohighlight">\(\mathcal{U} = \{\emptyset, X\},\)</span> every sequence converges to every point.
+However, further assumptions on the topology can result into unique limits.</p>
+<div class="proof definition admonition" id="topology:def-hausdorff-space">
+<p class="admonition-title"><span class="caption-number">Definition 86 </span> (Hausdorff space)</p>
+<section class="definition-content" id="proof-content">
+<p>A topological space <span class="math notranslate nohighlight">\((X, \mathcal{U})\)</span> is Hausdorff if for every pair of distinct points <span class="math notranslate nohighlight">\(x_1, x_2 \in X,\)</span> there exist open neighbourhoods <span class="math notranslate nohighlight">\(U_1, U_2\)</span> of <span class="math notranslate nohighlight">\(x_1, x_2\)</span> respectively such that <span class="math notranslate nohighlight">\(U_1 \cap U_2 = \emptyset.\)</span></p>
+</section>
+</div><aside class="margin sidebar">
+<p class="sidebar-title"></p>
+<p>Earlier, we proved that <a class="reference internal" href="001-metric-spaces.html#topology:lemma-limits-in-metric-spaces-are-unique">limits in metric spaces are unique</a>.
+The property we used in that proof was that, in a <a class="reference internal" href="001-metric-spaces.html#topology:def-metric-space">metric space</a>, open balls centered around distinct points are disjoint if their radii are small enough.
+This was the Hausdorff property in disguise.
+Metric spaces are always <a class="reference internal" href="#topology:def-hausdorff-space">Hausdorff</a>, and therefore have unique limits.</p>
+</aside>
+<div class="proof lemma admonition" id="topology:lemma-limits-unique-hausdorff">
+<p class="admonition-title"><span class="caption-number">Lemma 17 </span> (Limits are unique in Hausdorff spaces)</p>
+<section class="lemma-content" id="proof-content">
+<p>If <span class="math notranslate nohighlight">\(X\)</span> is <a class="reference internal" href="#topology:def-hausdorff-space">Hausdorff</a> and <span class="math notranslate nohighlight">\((x_n)\)</span> is a sequence in <span class="math notranslate nohighlight">\(X\)</span> such that <span class="math notranslate nohighlight">\(x_n \to x\)</span> and <span class="math notranslate nohighlight">\(x_n \to x',\)</span> then <span class="math notranslate nohighlight">\(x = x'.\)</span></p>
+</section>
+</div><details class="sd-sphinx-override sd-dropdown sd-card sd-mb-3">
+<summary class="sd-summary-title sd-card-header">
+Proof: Limits are unique in Hausdorff spaces<div class="sd-summary-down docutils">
+<svg version="1.1" width="1.5em" height="1.5em" class="sd-octicon sd-octicon-chevron-down" viewBox="0 0 24 24" aria-hidden="true"><path fill-rule="evenodd" d="M5.22 8.72a.75.75 0 000 1.06l6.25 6.25a.75.75 0 001.06 0l6.25-6.25a.75.75 0 00-1.06-1.06L12 14.44 6.28 8.72a.75.75 0 00-1.06 0z"></path></svg></div>
+<div class="sd-summary-up docutils">
+<svg version="1.1" width="1.5em" height="1.5em" class="sd-octicon sd-octicon-chevron-up" viewBox="0 0 24 24" aria-hidden="true"><path fill-rule="evenodd" d="M18.78 15.28a.75.75 0 000-1.06l-6.25-6.25a.75.75 0 00-1.06 0l-6.25 6.25a.75.75 0 101.06 1.06L12 9.56l5.72 5.72a.75.75 0 001.06 0z"></path></svg></div>
+</summary><div class="sd-summary-content sd-card-body docutils">
+<p class="sd-card-text">Let <span class="math notranslate nohighlight">\((x_n)\)</span> be a sequence in <span class="math notranslate nohighlight">\(X\)</span> such that <span class="math notranslate nohighlight">\(x_n \to x\)</span> and <span class="math notranslate nohighlight">\(x_n \to x'.\)</span>
+Suppose <span class="math notranslate nohighlight">\(x \neq x'.\)</span>
+Since <span class="math notranslate nohighlight">\(X\)</span> is Hausdorff, there exist open neighbourhoods <span class="math notranslate nohighlight">\(U, U'\)</span> of <span class="math notranslate nohighlight">\(x, x'\)</span> respectively such that <span class="math notranslate nohighlight">\(U \cap U' = \emptyset.\)</span>
+Since <span class="math notranslate nohighlight">\(x_n \to x,\)</span> there exists <span class="math notranslate nohighlight">\(N \in \mathbb{N}\)</span> such that <span class="math notranslate nohighlight">\(x_n \in U\)</span> for all <span class="math notranslate nohighlight">\(n &gt; N.\)</span>
+Similarly, since <span class="math notranslate nohighlight">\(x_n \to x',\)</span> there exists <span class="math notranslate nohighlight">\(N' \in \mathbb{N}\)</span> such that <span class="math notranslate nohighlight">\(x_n \in U'\)</span> for all <span class="math notranslate nohighlight">\(n &gt; N'.\)</span>
+Then, for all <span class="math notranslate nohighlight">\(n &gt; \max(N, N'),\)</span> we have <span class="math notranslate nohighlight">\(x_n \in U \cap U' = \emptyset,\)</span> which is a contradiction.
+Therefore, <span class="math notranslate nohighlight">\(x = x'.\)</span></p>
+</div>
+</details></section>
 </section>
 
     <script type="text/x-thebe-config">
@@ -548,6 +645,7 @@ <h2>Topologies<a class="headerlink" href="#topologies" title="Link to this headi
   <nav class="bd-toc-nav page-toc">
     <ul class="visible nav section-nav flex-column">
 <li class="toc-h2 nav-item toc-entry"><a class="reference internal nav-link" href="#topologies">Topologies</a></li>
+<li class="toc-h2 nav-item toc-entry"><a class="reference internal nav-link" href="#sequences">Sequences</a></li>
 </ul>
   </nav></div>
 
diff --git a/prf-prf.html b/prf-prf.html
index cfc44d0c..801115c9 100644
--- a/prf-prf.html
+++ b/prf-prf.html
@@ -488,6 +488,9 @@ <h1>Proof Index</h1>
    <a href="#cap-topology:cauchy-schwarz"><strong>topology:cauchy-schwarz</strong></a> | 
    <a href="#cap-topology:def-continuous-function"><strong>topology:def-continuous-function</strong></a> | 
    <a href="#cap-topology:def-continuous-function-topology"><strong>topology:def-continuous-function-topology</strong></a> | 
+   <a href="#cap-topology:def-convergent-sequence-topology"><strong>topology:def-convergent-sequence-topology</strong></a> | 
+   <a href="#cap-topology:def-hausdorff-space"><strong>topology:def-hausdorff-space</strong></a> | 
+   <a href="#cap-topology:def-homeomorphism"><strong>topology:def-homeomorphism</strong></a> | 
    <a href="#cap-topology:def-induced-topology"><strong>topology:def-induced-topology</strong></a> | 
    <a href="#cap-topology:def-limit-point"><strong>topology:def-limit-point</strong></a> | 
    <a href="#cap-topology:def-metric-space"><strong>topology:def-metric-space</strong></a> | 
@@ -495,13 +498,17 @@ <h1>Proof Index</h1>
    <a href="#cap-topology:def-open-and-closed-balls"><strong>topology:def-open-and-closed-balls</strong></a> | 
    <a href="#cap-topology:def-open-and-closed-subsets"><strong>topology:def-open-and-closed-subsets</strong></a> | 
    <a href="#cap-topology:def-open-neighbourhood"><strong>topology:def-open-neighbourhood</strong></a> | 
+   <a href="#cap-topology:def-open-neighbourhood-topology"><strong>topology:def-open-neighbourhood-topology</strong></a> | 
    <a href="#cap-topology:def-topological-space"><strong>topology:def-topological-space</strong></a> | 
    <a href="#cap-topology:examples-of-norms"><strong>topology:examples-of-norms</strong></a> | 
    <a href="#cap-topology:inner-product-space"><strong>topology:inner-product-space</strong></a> | 
    <a href="#cap-topology:lemma-closed-set-iff-set-contains-all-its-limit-points"><strong>topology:lemma-closed-set-iff-set-contains-all-its-limit-points</strong></a> | 
+   <a href="#cap-topology:lemma-composition-preserves-continuity"><strong>topology:lemma-composition-preserves-continuity</strong></a> | 
    <a href="#cap-topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood"><strong>topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood</strong></a> | 
+   <a href="#cap-topology:lemma-homeomorphism-equivalence-relation"><strong>topology:lemma-homeomorphism-equivalence-relation</strong></a> | 
    <a href="#cap-topology:lemma-inner-products-induce-norms"><strong>topology:lemma-inner-products-induce-norms</strong></a> | 
    <a href="#cap-topology:lemma-limits-in-metric-spaces-are-unique"><strong>topology:lemma-limits-in-metric-spaces-are-unique</strong></a> | 
+   <a href="#cap-topology:lemma-limits-unique-hausdorff"><strong>topology:lemma-limits-unique-hausdorff</strong></a> | 
    <a href="#cap-topology:lemma-non-constant-positive-continuous-function-has-positive-integral"><strong>topology:lemma-non-constant-positive-continuous-function-has-positive-integral</strong></a> | 
    <a href="#cap-topology:lemma-norms-induce-metrics"><strong>topology:lemma-norms-induce-metrics</strong></a> | 
    <a href="#cap-topology:lemma-open-and-closed-balls-are-open-and-closed"><strong>topology:lemma-open-and-closed-balls-are-open-and-closed</strong></a> | 
@@ -1303,6 +1310,30 @@ <h1>Proof Index</h1>
        <a href="book/topology/002-topological-spaces.html#topology:def-continuous-function-topology"><code class="xref">topology:def-continuous-function-topology</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
        <em>definition</em></td></tr>
      <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
+     <tr class="cap" id="cap-topology:def-convergent-sequence-topology"><td></td><td>
+       <strong>topology:def-convergent-sequence-topology</strong></td><td></td></tr>
+     <tr>
+       <td></td>
+       <td>
+       <a href="book/topology/002-topological-spaces.html#topology:def-convergent-sequence-topology"><code class="xref">topology:def-convergent-sequence-topology</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
+       <em>definition</em></td></tr>
+     <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
+     <tr class="cap" id="cap-topology:def-hausdorff-space"><td></td><td>
+       <strong>topology:def-hausdorff-space</strong></td><td></td></tr>
+     <tr>
+       <td></td>
+       <td>
+       <a href="book/topology/002-topological-spaces.html#topology:def-hausdorff-space"><code class="xref">topology:def-hausdorff-space</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
+       <em>definition</em></td></tr>
+     <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
+     <tr class="cap" id="cap-topology:def-homeomorphism"><td></td><td>
+       <strong>topology:def-homeomorphism</strong></td><td></td></tr>
+     <tr>
+       <td></td>
+       <td>
+       <a href="book/topology/002-topological-spaces.html#topology:def-homeomorphism"><code class="xref">topology:def-homeomorphism</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
+       <em>definition</em></td></tr>
+     <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
      <tr class="cap" id="cap-topology:def-induced-topology"><td></td><td>
        <strong>topology:def-induced-topology</strong></td><td></td></tr>
      <tr>
@@ -1359,6 +1390,14 @@ <h1>Proof Index</h1>
        <a href="book/topology/001-metric-spaces.html#topology:def-open-neighbourhood"><code class="xref">topology:def-open-neighbourhood</code></a> <em>(book/topology/001-metric-spaces)</em></td><td>
        <em>definition</em></td></tr>
      <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
+     <tr class="cap" id="cap-topology:def-open-neighbourhood-topology"><td></td><td>
+       <strong>topology:def-open-neighbourhood-topology</strong></td><td></td></tr>
+     <tr>
+       <td></td>
+       <td>
+       <a href="book/topology/002-topological-spaces.html#topology:def-open-neighbourhood-topology"><code class="xref">topology:def-open-neighbourhood-topology</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
+       <em>definition</em></td></tr>
+     <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
      <tr class="cap" id="cap-topology:def-topological-space"><td></td><td>
        <strong>topology:def-topological-space</strong></td><td></td></tr>
      <tr>
@@ -1391,6 +1430,14 @@ <h1>Proof Index</h1>
        <a href="book/topology/001-metric-spaces.html#topology:lemma-closed-set-iff-set-contains-all-its-limit-points"><code class="xref">topology:lemma-closed-set-iff-set-contains-all-its-limit-points</code></a> <em>(book/topology/001-metric-spaces)</em></td><td>
        <em>lemma</em></td></tr>
      <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
+     <tr class="cap" id="cap-topology:lemma-composition-preserves-continuity"><td></td><td>
+       <strong>topology:lemma-composition-preserves-continuity</strong></td><td></td></tr>
+     <tr>
+       <td></td>
+       <td>
+       <a href="book/topology/002-topological-spaces.html#topology:lemma-composition-preserves-continuity"><code class="xref">topology:lemma-composition-preserves-continuity</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
+       <em>lemma</em></td></tr>
+     <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
      <tr class="cap" id="cap-topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood"><td></td><td>
        <strong>topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood</strong></td><td></td></tr>
      <tr>
@@ -1399,6 +1446,14 @@ <h1>Proof Index</h1>
        <a href="book/topology/001-metric-spaces.html#topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood"><code class="xref">topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood</code></a> <em>(book/topology/001-metric-spaces)</em></td><td>
        <em>lemma</em></td></tr>
      <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
+     <tr class="cap" id="cap-topology:lemma-homeomorphism-equivalence-relation"><td></td><td>
+       <strong>topology:lemma-homeomorphism-equivalence-relation</strong></td><td></td></tr>
+     <tr>
+       <td></td>
+       <td>
+       <a href="book/topology/002-topological-spaces.html#topology:lemma-homeomorphism-equivalence-relation"><code class="xref">topology:lemma-homeomorphism-equivalence-relation</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
+       <em>lemma</em></td></tr>
+     <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
      <tr class="cap" id="cap-topology:lemma-inner-products-induce-norms"><td></td><td>
        <strong>topology:lemma-inner-products-induce-norms</strong></td><td></td></tr>
      <tr>
@@ -1415,6 +1470,14 @@ <h1>Proof Index</h1>
        <a href="book/topology/001-metric-spaces.html#topology:lemma-limits-in-metric-spaces-are-unique"><code class="xref">topology:lemma-limits-in-metric-spaces-are-unique</code></a> <em>(book/topology/001-metric-spaces)</em></td><td>
        <em>lemma</em></td></tr>
      <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
+     <tr class="cap" id="cap-topology:lemma-limits-unique-hausdorff"><td></td><td>
+       <strong>topology:lemma-limits-unique-hausdorff</strong></td><td></td></tr>
+     <tr>
+       <td></td>
+       <td>
+       <a href="book/topology/002-topological-spaces.html#topology:lemma-limits-unique-hausdorff"><code class="xref">topology:lemma-limits-unique-hausdorff</code></a> <em>(book/topology/002-topological-spaces)</em></td><td>
+       <em>lemma</em></td></tr>
+     <tr class="pcap"><td></td><td>&#160;</td><td></td></tr>
      <tr class="cap" id="cap-topology:lemma-non-constant-positive-continuous-function-has-positive-integral"><td></td><td>
        <strong>topology:lemma-non-constant-positive-continuous-function-has-positive-integral</strong></td><td></td></tr>
      <tr>
diff --git a/searchindex.js b/searchindex.js
index d65e9835..b68e57cc 100644
--- a/searchindex.js
+++ b/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"alltitles": {"": [[3, null], [3, null], [3, null], [10, null], [10, null], [11, null], [11, null], [11, null], [11, null], [11, null], [11, null], [17, "theorem-6"], [26, null], [28, "example-3"], [28, "example-5"], [28, "example-6"], [28, null], [28, "example-22"]], " (A function that is not Riemann integrable)": [[2, "mira-eg-not-riemann-integrable"]], " (A non-context-free language)": [[29, "example-12"]], " (Additivity of outer measure if one of the sets is closed)": [[3, "theorem-43"]], " (Additivity of outer measure if one of the sets is open)": [[3, "mira:thm:additivity-of-outer-measure-if-one-set-is-open"]], " (Algebra of inverse images)": [[3, "theorem-21"]], " (Algebraic operations with measurable functions)": [[3, "theorem-29"]], " (Ambiguity)": [[29, "definition-6"]], " (Annealed Importance Sampling)": [[4, "definition-2"]], " (Approximation of Borel sets from below by closed sets)": [[3, "theorem-44"]], " (Bayes\u2019 theorem)": [[13, "theorem-7"]], " (Bochner\u2019s theorem)": [[7, "theorem-0"]], " (Borel measurable function)": [[3, "definition-25"]], " (Borel set)": [[3, "definition-19"]], " (Borel subsets of [-\\infty, \\infty])": [[3, "definition-31"]], " (Bounds on the Riemann integral)": [[2, "mira-bounds-on-riemann-integral"]], " (CFL not closed under intersection)": [[29, "lemma-13"]], " (CFLs generated by CFG in Chomsky normal form)": [[29, "lemma-8"]], " (Cauchy-Schwartz inequality)": [[19, "theorem-4"]], " (Cauchy-Schwarz inequality)": [[31, "topology:cauchy-schwarz"]], " (Central limit theorem)": [[20, "theorem-6"]], " (Characterisation of continuity)": [[31, "topology:theorem-characterisation-of-continuity"]], " (Characteristic function)": [[19, "definition-13"]], " (Chebyshev\u2019s inequality)": [[20, "theorem-4"]], " (Chomsky normal form)": [[29, "definition-7"]], " (Closed set \\iff set contains all its limit points)": [[31, "topology:lemma-closed-set-iff-set-contains-all-its-limit-points"]], " (Closure under concatenation)": [[28, "theorem-14"]], " (Closure under star)": [[28, "theorem-15"]], " (Closure under union)": [[28, "theorem-10"], [28, "theorem-13"]], " (Composition of measurable functions)": [[3, "theorem-28"]], " (Condition for measurable function)": [[3, "mira-thm-condition-measurable"]], " (Conditional density function)": [[18, "definition-5"]], " (Conditional expectation)": [[14, "definition-7"]], " (Conditional probabiility)": [[13, "definition-3"]], " (Conditional probability space)": [[13, "theorem-5"]], " (Context-free grammar)": [[29, "definition-0"]], " (Context-free language)": [[29, "definition-2"]], " (Continuity theorem with characteristic functions)": [[20, "theorem-13"]], " (Continuity theorem with mgfs)": [[20, "theorem-7"]], " (Continuous conditional expectation)": [[18, "definition-8"]], " (Continuous function)": [[31, "topology:def-continuous-function"], [32, "topology:def-continuous-function-topology"]], " (Continuous functions are Riemann integrable)": [[2, "theorem-8"]], " (Continuous random variable)": [[17, "definition-2"]], " (Convergence implies sequence eventually in open neighbourhood)": [[31, "topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood"]], " (Convergence in distribution to c \\implies convergence in probability to c)": [[20, "theorem-12"]], " (Convergence in distribution)": [[20, "definition-10"]], " (Convergence in mean square \\implies convergence in probability)": [[20, "theorem-3"]], " (Convergence in probability (\\implies) convergence in distribution)": [[20, "theorem-11"]], " (Convergence in probability)": [[20, "definition-2"]], " (Convergent sequence)": [[31, "definition-3"]], " (Convex function)": [[19, "definition-10"]], " (Convolution formula)": [[15, "theorem-6"], [18, "theorem-3"]], " (Correlation between -1 and 1)": [[19, "theorem-3"]], " (Correlation coefficient)": [[19, "definition-2"]], " (Countable sets have outer measure zero)": [[3, "mira:thm:countable-sets-have-measure-zero"]], " (Countable subadditivity)": [[3, "theorem-38"]], " (Covariance)": [[19, "definition-1"]], " (Direction of steepest descent)": [[9, "theorem-1"]], " (Discrete random variable)": [[14, "definition-0"]], " (Distribution function)": [[17, "definition-1"]], " (Equivalent form of J)": [[8, "theorem-2"]], " (Euler-Maruyama method)": [[6, "definition-1"]], " (Event space)": [[13, "definition-0"]], " (Every continuous function is Borel measurable)": [[3, "theorem-26"]], " (Every increasing function is Borel measurable)": [[3, "theorem-27"]], " (Example CFG for mathematical expressions)": [[29, "example-4"]], " (Example CFG)": [[29, "example-3"]], " (Examples of metric spaces)": [[31, "example-1"]], " (Examples of norms)": [[31, "topology:examples-of-norms"]], " (Expectation of continuous random variable)": [[17, "definition-5"]], " (Expectation of product of independent variables)": [[15, "theorem-4"]], " (Expectation)": [[14, "definition-3"]], " (FSA accepts a string / recognises a language)": [[28, "definition-2"]], " (Fenchel-Legendre transform)": [[20, "definition-8"]], " (Finite State Automaton;  FSA)": [[28, "finite-state-automaton"]], " (Generalised nondeterministic finite automaton)": [[28, "definition-17"]], " (Generating function)": [[16, "definition-0"]], " (Heine-Borel)": [[3, "mira:thm:heine-borel"]], " (Hoeffding for RFF)": [[7, "lemma-2"]], " (Importance weighted MCMC algorithm)": [[4, "definition-1"]], " (Independence \\iff expectations of products of functions factorise)": [[18, "theorem-7"]], " (Independence \\iff expected product of functions factorises)": [[15, "theorem-5"]], " (Independence \\iff pdf factorises)": [[18, "theorem-2"]], " (Independence \\iff pmf factorises)": [[15, "theorem-3"]], " (Independence \\implies G factorises)": [[16, "theorem-5"]], " (Independence \\implies mgf of sum factorises)": [[19, "theorem-7"]], " (Independence of variables)": [[18, "definition-1"]], " (Independence)": [[13, "definition-4"], [15, "definition-2"]], " (Indicator functions)": [[15, "definition-7"]], " (Induced topology)": [[32, "topology:def-induced-topology"]], " (Inequalities with Riemann sums)": [[2, "mira-thm-refining-partitions"]], " (Infimum and supremum of a sequence of measurable functions is measurable)": [[3, "theorem-34"]], " (Infimum and supremum)": [[2, "definition-1"]], " (Inner product)": [[31, "topology:inner-product-space"]], " (Inner products induce norms)": [[31, "topology:lemma-inner-products-induce-norms"]], " (Interchanging Riemann integral and limit)": [[2, "theorem-14"]], " (Inverse image of a composition)": [[3, "theorem-22"]], " (Inverse image)": [[3, "definition-20"]], " (Inversion theorem)": [[19, "theorem-16"]], " (Ito\u2019s result for one dimension)": [[6, "theorem-5"]], " (Jacobian formula)": [[18, "theorem-4"]], " (Jensen\u2019s inequality)": [[19, "theorem-11"]], " (Joint distribution function)": [[18, "definition-0"]], " (Joint probability mass function)": [[15, "definition-0"]], " (Large deviation theorem)": [[20, "theorem-9"]], " (Law of iterated expectations)": [[18, "theorem-9"]], " (Law of the subconscious statistician - multivariate)": [[15, "theorem-1"]], " (Law of the subconscious statistician)": [[14, "theorem-4"], [18, "theorem-6"]], " (Leftmost derivation)": [[29, "definition-5"]], " (Length of an open interval)": [[3, "definition-0"]], " (Limit point)": [[31, "topology:def-limit-point"]], " (Limits in metric spaces are unique)": [[31, "topology:lemma-limits-in-metric-spaces-are-unique"]], " (Lower Riemann integral \\leq upper Riemann integral)": [[2, "theorem-6"]], " (Lower Riemann sum \\leq upper Riemann sum)": [[2, "theorem-4"]], " (Lower bound to importance weight variance)": [[4, "lemma-0"]], " (Markov inequality)": [[19, "theorem-9"]], " (Matching scores \\iff matching distributions)": [[8, "theorem-1"]], " (Mean square law of large numbers)": [[20, "theorem-1"]], " (Mean-square convergence)": [[20, "definition-0"]], " (Measurable function on [-\\infty, \\infty])": [[3, "mira:def:measurable-function-infinity"]], " (Measurable function)": [[3, "definition-23"]], " (Measurable space, measurable set)": [[3, "definition-17"]], " (Measure of a decreasing intersection)": [[3, "theorem-40"]], " (Measure of an increasing union)": [[3, "mira-thm-measure-increasing-union"]], " (Measure of the union of two sets)": [[3, "theorem-41"]], " (Measure preserves order; measure of a set difference)": [[3, "mira-thm-measure-preserves-order"]], " (Measure space)": [[3, "definition-36"]], " (Measure)": [[3, "definition-35"]], " (Metric space)": [[31, "topology:def-metric-space"]], " (Metric subspace)": [[31, "definition-2"]], " (Milstein\u2019s method)": [[6, "definition-4"]], " (Moment generating function)": [[19, "definition-5"]], " (Moment)": [[16, "definition-3"]], " (Moments equal to derivatives of mgf)": [[19, "theorem-6"]], " (Moments from pgf derivatives)": [[16, "theorem-4"]], " (NFA recognises A \\implies A is regular)": [[28, "toc-dfa-nfa-equivalence"]], " (Non-constant positive continuous function has positive integral)": [[31, "topology:lemma-non-constant-positive-continuous-function-has-positive-integral"]], " (Nonadditivity of the outer measure)": [[3, "mira:thm:non-additivity-of-outer-measure"]], " (Nondeterministic Finite Automaton)": [[28, "definition-11"]], " (Nonexistence of extension of length to all subsets of \\mathbb{R})": [[3, "mira:thm:nonexistence-length"]], " (Nontrivial intervals are uncountable)": [[3, "theorem-10"]], " (Norm)": [[31, "topology:def-norm"]], " (Norms induce metrics)": [[31, "topology:lemma-norms-induce-metrics"]], " (Open (closed) balls are open (closed))": [[31, "topology:lemma-open-and-closed-balls-are-open-and-closed"]], " (Open and closed balls)": [[31, "topology:def-open-and-closed-balls"]], " (Open and closed subsets)": [[31, "topology:def-open-and-closed-subsets"]], " (Open cover, finite subcover)": [[3, "definition-7"]], " (Open neighbourhood)": [[31, "topology:def-open-neighbourhood"]], " (Other properties of \\sigma-algebras)": [[3, "theorem-16"]], " (Outer measure is additive if sets are contained by disjoint open intervals)": [[3, "mira:thm:outer-measure-is-additive-if-sets-are-separable"]], " (Outer measure is countably subadditive)": [[3, "mira:thm:countable-subadditivity-of-outer-measure"]], " (Outer measure is translation invariant)": [[3, "theorem-5"]], " (Outer measure of a closed interval)": [[3, "mira:thm:outer-measure-of-a-closed-interval"]], " (Outer measure preserves order)": [[3, "mira:thm:outer-measure-preserves-order"]], " (Outer measure)": [[3, "definition-1"]], " (PDA accepts)": [[29, "definition-10"]], " (Partition theorem for conditional expectations)": [[14, "theorem-8"]], " (Partition theorem)": [[13, "theorem-6"]], " (Partition)": [[2, "definition-0"]], " (Pdf of a function of a random variable)": [[17, "theorem-4"]], " (Pointwise limit of S-measurable functions is S-measurable)": [[3, "theorem-30"]], " (Precision functions and consistency)": [[12, "remark-0"]], " (Probability density function properties)": [[17, "theorem-3"]], " (Probability generating function)": [[16, "definition-1"]], " (Probability mass function)": [[14, "definition-1"]], " (Probability measure)": [[13, "definition-1"]], " (Probability space)": [[13, "definition-2"]], " (Proof: Gradient of KL is the KSD)": [[9, "theorem-0"]], " (Properties of open sets)": [[31, "topology:lemma-properties-of-open-sets"]], " (Pumping Lemma for CFLs)": [[29, "toc-cfl-pumping-lemma"]], " (Pushdown automaton (PDA))": [[29, "definition-9"]], " (Random Fourier Features)": [[7, "definition-1"]], " (Random sum formula)": [[16, "theorem-6"]], " (Random variable)": [[17, "definition-0"]], " (Rational difference equivalence relation)": [[3, "mira:def:rational-difference-equivalence-relation"]], " (Regular Pumping Lemma)": [[28, "toc-dfa-pumping-lemma"]], " (Regular expressions yield regular languages)": [[28, "lemma-16"]], " (Regular expressions)": [[28, "toc-def-regex"]], " (Regular laguages \\equiv regular expressions)": [[28, "corollary-20"]], " (Regular language)": [[28, "definition-4"]], " (Regular languages can be written as regular expressions)": [[28, "lemma-18"]], " (Regular operations)": [[28, "definition-7"]], " (Riemann integral)": [[2, "definition-5"], [2, "definition-7"]], " (Riemann integration does not work with limits)": [[2, "example-12"]], " (Riemann integration does not work with pointwise limits)": [[2, "example-13"]], " (Riemann integration does not work with unbounded functions)": [[2, "example-11"]], " (Score matching objective)": [[8, "definition-0"]], " (Smallest \\sigma-algebra containing a collection of subsets)": [[3, "theorem-18"]], " (Some regular expressions)": [[28, "example-9"]], " (Stein variational gradient descent)": [[9, "definition-2"]], " (Strings and Languages)": [[28, "definition-1"]], " (Strong convergence)": [[6, "definition-2"]], " (Sufficient condition for measurable function)": [[3, "mira-thm-sufficient-condition-measurable-with-infinity"]], " (Supporting tangent theorem)": [[19, "theorem-12"]], " (Topological space)": [[32, "topology:def-topological-space"]], " (Translation invariance)": [[3, "mira:thm:outer-measure-is-translation-invariant"]], " (Two properties of characteristic functions)": [[19, "theorem-14"]], " (Two results for discrete random variables)": [[14, "theorem-5"]], " (Uniform convergence of RFF)": [[7, "lemma-3"]], " (Uniqueness of characteristic functions)": [[19, "theorem-15"]], " (Uniqueness of mgfs)": [[19, "theorem-8"]], " (Uniqueness of pgfs)": [[16, "theorem-2"]], " (Uniqueness theorem for moments)": [[19, "theorem-0"]], " (Upper and lower Riemann sums)": [[2, "definition-2"]], " (Variance)": [[14, "definition-6"]], " (Weak convergence)": [[6, "definition-3"]], " (Weak law of large numbers)": [[20, "theorem-5"]], " (Wiener process)": [[6, "definition-0"]], " (Yields, derives)": [[29, "definition-1"]], " (\\sigma-algebra)": [[3, "mira:def:sigma-algebra"]], " (\\texttt{Convert})": [[28, "algorithm-19"]], " (pmf \\implies probability space and random variable)": [[14, "theorem-2"]], "(Multi head) self-attention": [[11, "multi-head-self-attention"]], "Additivity of outer measure on Borel sets": [[3, "additivity-of-outer-measure-on-borel-sets"]], "Annealed Importance Sampling": [[4, "id2"]], "Annealed importance sampling": [[4, "annealed-importance-sampling"]], "Arrival and Inter-arrival times": [[23, "arrival-and-inter-arrival-times"]], "Bernoulli": [[14, "bernoulli"], [16, "bernoulli"]], "Beta": [[17, "beta"]], "Binomial": [[14, "binomial"]], "Binomial distribution": [[16, "binomial-distribution"]], "Birth and death process": [[23, "birth-and-death-process"]], "Branching processes": [[21, "branching-processes"]], "Cauchy": [[17, "cauchy"], [19, "cauchy"]], "Central limit theorem": [[20, "central-limit-theorem"]], "Changes of variables": [[18, "changes-of-variables"]], "Chapter 1": [[26, "chapter-1"]], "Chapter 1.A": [[0, "chapter-1-a"]], "Chapter 1.B": [[0, "chapter-1-b"]], "Chapter 2.A": [[0, "chapter-2-a"]], "Chapter 2.C": [[0, "chapter-2-c"]], "Chapter 2.D": [[0, "chapter-2-d"]], "Chapter 4.A": [[0, "chapter-4-a"]], "Chapter 5.A": [[0, "chapter-5-a"]], "Chapter 6.A": [[0, "chapter-6-a"]], "Characteristic functions": [[19, "characteristic-functions"]], "Chi-squared": [[17, "chi-squared"]], "Chomsky normal form": [[29, "chomsky-normal-form"]], "Classification of states": [[24, "classification-of-states"]], "Closure properties": [[28, "closure-properties"]], "Closure properties: union": [[28, "closure-properties-union"]], "Conclusion": [[4, "conclusion"], [7, "conclusion"], [9, "conclusion"], [10, "conclusion"], [11, "conclusion"]], "Conditional density functions": [[18, "conditional-density-functions"]], "Conditional expectations": [[14, "conditional-expectations"]], "Conditional probability and independence": [[13, "conditional-probability-and-independence"]], "Conditions for measurable functions": [[3, "conditions-for-measurable-functions"]], "Context Free Pumping Lemma": [[29, "context-free-pumping-lemma"]], "Context-free grammars": [[29, "context-free-grammars"]], "Continuous random variables": [[17, "continuous-random-variables"]], "Convergence in distribution": [[20, "convergence-in-distribution"]], "Convergence in mean-square": [[20, "convergence-in-mean-square"]], "Convergence in probability": [[20, "convergence-in-probability"]], "Convergence to equilibrium": [[24, "convergence-to-equilibrium"]], "Covariance and correlation": [[19, "covariance-and-correlation"]], "Dataset": [[10, "dataset"], [11, "dataset"]], "Deficiencies of the Riemann integral": [[2, "deficiencies-of-the-riemann-integral"]], "Definition": [[15, "definition"], [16, "definition"]], "Definition of the outer measure": [[3, "definition-of-the-outer-measure"]], "Demo on mixture of Gaussians": [[9, "demo-on-mixture-of-gaussians"]], "Derivation of SVGD": [[9, "derivation-of-svgd"]], "Direction of steepest descent": [[9, "direction-of-steepest-descent"]], "Discrete random variables": [[14, "discrete-random-variables"], [14, "id1"]], "Distribution and density functions": [[17, "distribution-and-density-functions"]], "Distribution functions": [[17, "distribution-functions"]], "Empirical approximation": [[9, "empirical-approximation"]], "Estimation by score matching": [[8, "estimation-by-score-matching"]], "Euler-Maruyama method": [[6, "euler-maruyama-method"]], "Evaluating a stochastic integral": [[6, "evaluating-a-stochastic-integral"]], "Events and Probabilities": [[13, "events-and-probabilities"]], "Example application": [[10, "example-application"]], "Examples of MGFS": [[19, "examples-of-mgfs"]], "Excercise 1.B.3": [[0, null]], "Excercise 2.A.8": [[0, null]], "Excercises": [[26, "excercises"]], "Exercise 1.1": [[26, null]], "Exercise 1.11": [[26, null]], "Exercise 1.2": [[26, null]], "Exercise 1.20": [[26, null]], "Exercise 1.31": [[26, null]], "Exercise 1.32": [[26, null]], "Exercise 1.33": [[26, null]], "Exercise 1.34": [[26, null], [26, null]], "Exercise 1.41": [[26, null]], "Exercise 1.43": [[26, null]], "Exercise 1.44": [[26, null]], "Exercise 1.45": [[26, null]], "Exercise 1.46": [[26, null]], "Exercise 1.47": [[26, null]], "Exercise 1.48": [[26, null]], "Exercise 1.51": [[26, "toc-ex-151"]], "Exercise 1.52 (Myhill-Nerode theorem)": [[26, "toc-myhill-nerode-theorem"]], "Exercise 1.59": [[26, null]], "Exercise 1.63": [[26, null]], "Exercise 1.67": [[26, null]], "Exercise 1.A.1": [[0, null]], "Exercise 1.A.10": [[0, null]], "Exercise 1.A.11": [[0, null]], "Exercise 1.A.12": [[0, null]], "Exercise 1.A.13": [[0, null]], "Exercise 1.A.14": [[0, null]], "Exercise 1.A.2": [[0, null]], "Exercise 1.A.3": [[0, "mira-ex-1a3"]], "Exercise 1.A.4": [[0, "mira-ex-1a4"]], "Exercise 1.A.5": [[0, null]], "Exercise 1.A.6": [[0, null]], "Exercise 1.A.7": [[0, "mira-ex-1a7"]], "Exercise 1.A.8": [[0, null]], "Exercise 1.A.9": [[0, null]], "Exercise 1.B.1": [[0, null]], "Exercise 1.B.2": [[0, null]], "Exercise 1.B.4": [[0, null]], "Exercise 1.B.5": [[0, null]], "Exercise 2.A.1": [[0, null]], "Exercise 2.A.10": [[0, null]], "Exercise 2.A.3": [[0, null]], "Exercise 2.A.4": [[0, null]], "Exercise 2.A.5": [[0, null]], "Exercise 2.A.6": [[0, null]], "Exercise 2.A.7": [[0, null]], "Exercise 2.A.9": [[0, null]], "Exercise 2.C.1": [[0, null]], "Exercise 2.C.10": [[0, null]], "Exercise 2.C.11": [[0, null]], "Exercise 2.C.12": [[0, null]], "Exercise 2.C.2": [[0, null]], "Exercise 2.C.3": [[0, null]], "Exercise 2.C.4": [[0, null]], "Exercise 2.C.5": [[0, null]], "Exercise 2.C.6": [[0, null]], "Exercise 2.C.7": [[0, null]], "Exercise 2.C.8": [[0, null]], "Exercise 2.C.9": [[0, null]], "Exercise 2.D.1": [[0, null]], "Exercise 2.D.10": [[0, null]], "Exercise 2.D.11": [[0, null]], "Exercise 2.D.2": [[0, null]], "Exercise 2.D.3": [[0, null]], "Exercise 2.D.4": [[0, null]], "Exercise 2.D.5": [[0, null]], "Exercise 2.D.6": [[0, null]], "Exercise 2.D.7": [[0, null]], "Exercise 2.D.8": [[0, null]], "Exercise 2.D.9": [[0, null]], "Exercise 4.A.1": [[0, null]], "Exercise 4.A.10": [[0, null]], "Exercise 4.A.11": [[0, null]], "Exercise 4.A.2 (Chebyshev\u2019s inequality)": [[0, null]], "Exercise 4.A.3": [[0, null]], "Exercise 4.A.4": [[0, null]], "Exercise 4.A.9": [[0, null]], "Exercise 5.A.1": [[0, null]], "Exercise 5.A.10": [[0, null]], "Exercise 6.A.1": [[0, null]], "Exercise 6.A.10": [[0, null]], "Exercise 6.A.11": [[0, null]], "Exercise 6.A.12": [[0, null]], "Exercise 6.A.14": [[0, null]], "Exercise 6.A.16": [[0, null]], "Exercise 6.A.2": [[0, null]], "Exercise 6.A.3": [[0, null]], "Exercise 6.A.4": [[0, null]], "Exercise 6.A.5": [[0, null]], "Exercise 6.A.6": [[0, null]], "Exercise 6.A.7": [[0, null]], "Exercise 6.A.8": [[0, null]], "Exercises": [[0, "exercises"]], "Exercises 2.A.2": [[0, null]], "Expectation and independence": [[15, "expectation-and-independence"]], "Expectations": [[14, "expectations"], [17, "expectations"], [18, "expectations"]], "Exponential": [[17, "exponential"], [19, "exponential"]], "Failure mode on mixture of Gaussians": [[9, "failure-mode-on-mixture-of-gaussians"]], "Finite Automata and Regular Expressions": [[28, "finite-automata-and-regular-expressions"]], "Finite State Automata": [[28, "finite-state-automata"]], "First come, first served queue": [[23, "first-come-first-served-queue"]], "Function of a Wiener process": [[6, "function-of-a-wiener-process"]], "Functions of random variables": [[17, "functions-of-random-variables"]], "Fundamental discrete distributions": [[14, "fundamental-discrete-distributions"]], "Gamma": [[17, "gamma"], [19, "gamma"]], "Geometric": [[14, "geometric"]], "Geometric distribution": [[16, "geometric-distribution"]], "Good properties": [[3, "good-properties"]], "Heine-Borel theorem": [[3, "heine-borel-theorem"]], "Home": [[33, "home"]], "Implementation": [[4, "implementation"], [7, "implementation"], [9, "implementation"], [10, "implementation"], [11, "implementation"]], "Importance sampling": [[4, "importance-sampling"]], "Importance-weighted MCMC": [[4, "importance-weighted-mcmc"]], "Independence and sums": [[18, "independence-and-sums"]], "Indicator functions": [[15, "indicator-functions"]], "Inner products": [[31, "inner-products"]], "Introduction to transformers": [[11, "introduction-to-transformers"]], "Invariant distributions": [[24, "invariant-distributions"]], "Invertible transformations": [[9, "invertible-transformations"]], "Joint distributions": [[18, "joint-distributions"]], "Lack of memory property": [[23, "lack-of-memory-property"]], "Large deviations": [[20, "large-deviations"]], "Lebesgue measure": [[3, "lebesgue-measure"]], "Limits of characteristic functions": [[20, "limits-of-characteristic-functions"]], "Main limit theorems": [[20, "main-limit-theorems"]], "Markov and Jensen inequalities": [[19, "markov-and-jensen-inequalities"]], "Markov chain and property": [[24, "markov-chain-and-property"]], "Markov chains": [[24, "markov-chains"]], "Masure, integration and real analysis": [[1, "masure-integration-and-real-analysis"]], "Mean population": [[21, "mean-population"]], "Measurable functions": [[3, "measurable-functions"]], "Measurable spaces and functions": [[3, "measurable-spaces-and-functions"]], "Measures": [[3, "measures"]], "Measures and their properties": [[3, "measures-and-their-properties"]], "Metric and topological spaces": [[30, "metric-and-topological-spaces"]], "Metric spaces": [[31, "metric-spaces"], [31, "id1"]], "Milstein\u2019s higher order method": [[6, "milstein-s-higher-order-method"]], "Modelling with tokens": [[11, "modelling-with-tokens"]], "Moment generating functions": [[19, "moment-generating-functions"], [19, "id1"]], "Moments": [[16, "moments"], [19, "moments"]], "Multi-layer perceptron": [[11, "multi-layer-perceptron"], [11, "id10"]], "Multivariate discrete distributions": [[15, "multivariate-discrete-distributions"]], "Multivariate distributions": [[18, "multivariate-distributions"]], "Negative binomial": [[14, "negative-binomial"]], "Negative binomial distribution": [[16, "negative-binomial-distribution"]], "Nonadditivity of the outer measure": [[3, "nonadditivity-of-the-outer-measure"]], "Nondeterminism": [[28, "nondeterminism"]], "Normal": [[17, "normal"], [19, "normal"]], "Norms": [[31, "norms"]], "Numerical simulation of SDEs": [[6, "numerical-simulation-of-sdes"]], "Open and closed sets": [[31, "open-and-closed-sets"]], "Outer measure": [[3, "outer-measure"]], "PDAs and context-free grammars": [[29, "pdas-and-context-free-grammars"]], "Partition theorem and Bayes\u2019 rule": [[13, "partition-theorem-and-bayes-rule"]], "Poisson": [[14, "poisson"]], "Poisson distribution": [[16, "poisson-distribution"]], "Probability generating functions": [[16, "probability-generating-functions"], [21, "probability-generating-functions"]], "Probability mass functions": [[14, "probability-mass-functions"]], "Probability measures": [[13, "probability-measures"]], "Probability spaces": [[13, "probability-spaces"]], "Probability: An introduction": [[25, "probability-an-introduction"]], "Processes in continuous time": [[23, "processes-in-continuous-time"]], "Properties of measurable functions": [[3, "properties-of-measurable-functions"]], "Properties of measures": [[3, "properties-of-measures"]], "Pushdown Automata": [[29, "pushdown-automata"]], "Putting it together": [[10, "putting-it-together"], [11, "putting-it-together"], [11, "id14"]], "RFF and Bayesian regression": [[7, "rff-and-bayesian-regression"]], "Random Fourier features": [[7, "random-fourier-features"]], "Random walk on a graph": [[24, "random-walk-on-a-graph"]], "Random walks": [[22, "random-walks"]], "Random walks with absorption": [[22, "random-walks-with-absorption"]], "Rates of convergence": [[7, "rates-of-convergence"]], "Recurrence and transience": [[22, "recurrence-and-transience"]], "References": [[4, "references"], [6, "references"], [7, "references"], [8, "references"], [9, "references"], [10, "references"], [11, "references"], [20, "references"], [21, "references"], [23, "references"], [24, "references"], [25, "references"]], "Regression with RFF features": [[7, "regression-with-rff-features"]], "Regular languages \\equiv Regular expressions": [[28, "regular-languages-equiv-regular-expressions"]], "Regular operations": [[28, "regular-operations"]], "Regular pumping lemma": [[28, "regular-pumping-lemma"]], "Residuals and normalisation": [[11, "residuals-and-normalisation"]], "Riemann integral": [[2, "riemann-integral"]], "Riemann integration": [[2, "riemann-integration"]], "Sample and event spaces": [[13, "sample-and-event-spaces"]], "Sampling from a Wiener process": [[6, "sampling-from-a-wiener-process"]], "Sampling from the prior": [[7, "sampling-from-the-prior"]], "Self-attention": [[11, "self-attention"]], "Shifted window transformers": [[10, "shifted-window-transformers"]], "Shifted windows": [[10, "shifted-windows"]], "Sigma algebras": [[3, "sigma-algebras"]], "Simple birth process": [[23, "simple-birth-process"]], "Simple random walk": [[22, "simple-random-walk"]], "Stein variational gradient descent": [[9, "stein-variational-gradient-descent"]], "Stochastic and brancing processes": [[21, "stochastic-and-brancing-processes"]], "Stochastic chain rule": [[6, "stochastic-chain-rule"]], "Stream of papers": [[5, "stream-of-papers"]], "Strong Markov property": [[24, "strong-markov-property"]], "Strong and weak convergence": [[6, "strong-and-weak-convergence"]], "Sums of discrete random variables": [[15, "sums-of-discrete-random-variables"]], "Sums of independent variables": [[16, "sums-of-independent-variables"]], "Swin transformer block": [[10, "swin-transformer-block"]], "The Poisson process": [[23, "the-poisson-process"]], "The RFF approximation": [[7, "the-rff-approximation"]], "The Wiener process": [[6, "the-wiener-process"]], "The score matching trick": [[8, "the-score-matching-trick"]], "Theory of Computation": [[27, "theory-of-computation"]], "Time reversal": [[24, "time-reversal"]], "Tokens and embeddings": [[11, "tokens-and-embeddings"], [11, "id12"]], "Topological Spaces": [[32, "topological-spaces"]], "Topologies": [[32, "topologies"]], "Toy experiment": [[4, "toy-experiment"]], "Training": [[10, "training"], [11, "training"]], "Transformer block": [[11, "transformer-block"], [11, "id11"]], "Ultimate extinction": [[21, "ultimate-extinction"]], "Uniform": [[17, "uniform"], [19, "uniform"]], "Uniqueness of PGFs and examples": [[16, "uniqueness-of-pgfs-and-examples"]], "Variance starvation": [[7, "variance-starvation"]], "Why covariance functions?": [[12, "why-covariance-functions"]], "Why stochastic differential equations": [[6, "why-stochastic-differential-equations"]], "Window extraction": [[10, "window-extraction"]], "Window shifting": [[10, "window-shifting"]], "Windowed self-attention": [[10, "windowed-self-attention"]]}, "docnames": ["book/mira/000-exercises", "book/mira/000-intro", "book/mira/001-riemann", "book/mira/002-measures", "book/papers/ais/ais", "book/papers/intro", "book/papers/num-sde/num-sde", "book/papers/rff/rff", "book/papers/score-matching/score-matching", "book/papers/svgd/svgd", "book/papers/swin/swin", "book/papers/transformers/transformers", "book/papers/why-covariances/why-covariances", "book/prob-intro/ch01/content", "book/prob-intro/ch02/content", "book/prob-intro/ch03/content", "book/prob-intro/ch04/content", "book/prob-intro/ch05/content", "book/prob-intro/ch06/content", "book/prob-intro/ch07/content", "book/prob-intro/ch08/content", "book/prob-intro/ch09/content", "book/prob-intro/ch10/content", "book/prob-intro/ch11/content", "book/prob-intro/ch12/content", "book/prob-intro/intro", "book/toc/000-exercises", "book/toc/000-intro", "book/toc/001-fsa", "book/toc/002-cfl", "book/topology/000-intro", "book/topology/001-metric-spaces", "book/topology/002-topological-spaces", "intro"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.intersphinx": 1, "sphinxcontrib.bibtex": 9}, "filenames": ["book/mira/000-exercises.md", "book/mira/000-intro.md", "book/mira/001-riemann.md", "book/mira/002-measures.md", "book/papers/ais/ais.ipynb", "book/papers/intro.md", "book/papers/num-sde/num-sde.ipynb", "book/papers/rff/rff.ipynb", "book/papers/score-matching/score-matching.md", "book/papers/svgd/svgd.ipynb", "book/papers/swin/swin.ipynb", "book/papers/transformers/transformers.ipynb", "book/papers/why-covariances/why-covariances.md", "book/prob-intro/ch01/content.ipynb", "book/prob-intro/ch02/content.ipynb", "book/prob-intro/ch03/content.ipynb", "book/prob-intro/ch04/content.ipynb", "book/prob-intro/ch05/content.ipynb", "book/prob-intro/ch06/content.ipynb", "book/prob-intro/ch07/content.ipynb", "book/prob-intro/ch08/content.ipynb", "book/prob-intro/ch09/content.md", "book/prob-intro/ch10/content.md", "book/prob-intro/ch11/content.md", "book/prob-intro/ch12/content.md", "book/prob-intro/intro.md", "book/toc/000-exercises.md", "book/toc/000-intro.md", "book/toc/001-fsa.ipynb", "book/toc/002-cfl.ipynb", "book/topology/000-intro.md", "book/topology/001-metric-spaces.md", "book/topology/002-topological-spaces.md", "intro.md"], "indexentries": {}, "objects": {}, "objnames": {}, "objtypes": {}, "terms": {"": [2, 4, 9, 10, 11, 12, 14, 16, 17, 18, 21, 22, 23, 24, 26, 27, 28, 29, 30, 31], "0": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "0011": 29, "01": 26, "02": 4, "03167": 11, "03385": 11, "03762": 11, "04471": 9, "045": 10, "06": 7, "06450": 11, "07": [4, 7], "08022": 11, "081": 10, "082": 11, "08494": 11, "090": 11, "1": [2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 24, 25, 28, 29, 31, 32], "10": [4, 10, 11, 26], "100": [0, 4, 7], "1000": [4, 6], "10012": 10, "10022": 10, "1024": 10, "10557": 11, "11": [4, 20, 28], "125": 4, "128": 11, "13": [4, 7], "139": 4, "1502": 11, "1512": 11, "16": [4, 10, 11], "1607": 11, "1608": 9, "1654": 26, "16x16": 11, "17": 11, "1706": 11, "18": 4, "1803": 11, "1959": 28, "198": 4, "1986": [20, 25], "1992": 6, "1_": 15, "1_0": [6, 17], "1_a": 15, "1cm": 24, "1e": [4, 10, 11], "2": [2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 28, 29, 31], "20": 7, "200": 7, "2001": [4, 6, 20, 23, 24], "2003": 7, "2005": 8, "2006": 7, "2007": 7, "2015": 11, "2016": [9, 11], "2017": 11, "2018": 11, "2019": 9, "2020": 7, "2021": [10, 11], "2023": 11, "21": [10, 11], "22": 4, "2304": 11, "255": [10, 11], "256": 10, "276": 9, "28": 11, "284": 9, "2b": 7, "2c": 0, "2f": 31, "2j": 0, "2k": 0, "2m": [22, 28], "2n": [0, 26, 28], "2p": 26, "2t": [0, 19, 21], "2u_2": 10, "2z": 26, "3": [3, 4, 6, 7, 10, 11, 13, 14, 20, 24, 26, 29, 31], "309": 4, "31": 11, "32": 10, "3387533354": 6, "391": 20, "3a_1": 26, "3k": 0, "3n": 0, "3rd": 6, "3z": 26, "4": [3, 4, 6, 7, 10, 11, 13, 19, 26, 29, 31], "40": [4, 7], "4096": 10, "415": 6, "419": 6, "43": 6, "44": 6, "46": 4, "48": 6, "4ac": 19, "4k": 0, "4pq": 22, "5": [4, 6, 7, 9, 10, 13, 20, 28, 31], "500": 6, "5000": 7, "525": 6, "53": 7, "546": 6, "55": 7, "6": [3, 4, 10, 13, 29], "60": 4, "61": 7, "63": 7, "64": 10, "69": 3, "71": 7, "74": 31, "8": [4, 6, 10, 11, 31], "806": 7, "848": 4, "9": 11, "915": 6, "917": 6, "956": [10, 11], "957": 11, "959": 10, "99": 0, "A": [3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 15, 17, 19, 20, 21, 22, 23, 24, 25, 26, 31, 32], "And": 0, "As": [0, 2, 3, 4, 6, 11, 13, 14, 18, 19, 23, 26, 28], "At": [11, 21, 26], "But": [0, 10, 11, 28, 31], "By": [0, 3, 4, 6, 7, 8, 9, 11, 14, 16, 19, 20, 23, 24, 26, 28, 29, 31], "For": [0, 2, 3, 4, 6, 7, 9, 10, 11, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "If": [0, 2, 3, 4, 5, 6, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 31, 32, 33], "In": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 16, 18, 19, 20, 21, 22, 24, 26, 28, 29, 31, 32], "It": [0, 2, 3, 4, 6, 10, 11, 12, 18, 19, 21, 23, 24, 28, 29, 31], "Of": 4, "On": [0, 2, 4, 12, 19, 23], "One": [3, 4, 6, 7, 9, 10, 11, 12, 15, 16, 19, 20], "That": [2, 3, 11, 14], "The": [0, 2, 3, 4, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 25, 26, 28, 29, 31, 32], "Then": [0, 2, 3, 6, 7, 8, 9, 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "There": [3, 11, 12, 14, 19, 23, 24, 26, 28], "These": [1, 2, 4, 10, 11, 13, 20, 25, 27, 29, 30, 31, 32], "To": [0, 3, 4, 6, 7, 9, 10, 11, 14, 19, 20, 21, 22, 23, 24, 26, 31], "With": [2, 3, 6, 10, 31], "_": [0, 4, 7, 8, 9, 11, 12, 18, 19, 21, 24, 26, 28, 31, 32], "_1": [0, 4, 31], "_2": [0, 31], "__call__": 4, "__init__": [4, 10, 11], "_a": 0, "_d": 9, "_h": 11, "_i": 0, "_j": 0, "_k": [0, 4], "_log_geometric_mixtur": 4, "_m": 28, "_n": [9, 11, 23, 28], "_p": 9, "_q": 9, "_r": 31, "a_": [0, 11, 15, 22, 29], "a_1": [0, 3, 13, 15, 26, 28, 29], "a_2": [0, 3, 13, 15, 26, 28, 29], "a_3": [3, 26], "a_4": 26, "a_c": 0, "a_h": 11, "a_i": [13, 24, 26, 29], "a_k": [0, 3, 26], "a_n": [0, 3, 6, 13, 15, 22, 31], "aa": [26, 28], "aaa": 26, "aaai": 11, "aab": 28, "aabb": 26, "aapo": 8, "ab": [7, 19, 26, 28], "aba": [26, 28], "abab": 26, "abb": 28, "abba": 26, "abbrevi": 15, "abl": [2, 7, 9, 12], "about": [2, 3, 6, 7, 11, 12, 13, 14, 18, 19, 20, 29, 31], "abov": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 26, 28, 31], "absolut": [0, 7, 14, 15, 16, 17, 18, 19, 31], "absorb": [22, 24], "absorpt": 24, "abstract": [10, 28, 31, 32], "abus": 7, "acc": [10, 11], "acccept": 29, "acccess": 4, "acccord": 4, "acceler": 11, "accept": [4, 26], "access": [4, 6, 8, 33], "accord": [4, 6, 7, 9, 14, 21, 26], "account": [4, 29], "accur": 17, "accuraci": [6, 10, 11], "achiev": [4, 6, 8, 10, 11, 13, 19], "acord": 4, "across": [10, 11, 28], "act": 11, "activ": [10, 11, 23], "actual": [4, 10], "ad": [0, 2, 3, 10, 15, 17, 18, 23, 26, 28, 29], "adam": [10, 11], "adapt": [11, 23], "add": [0, 10, 11, 26, 29], "addit": [0, 4, 6, 11, 13, 15, 20, 22, 26, 28, 29], "additv": 3, "address": [4, 11], "adjust": [6, 13], "adopt": 11, "adrian": [0, 1], "advanc": [6, 11], "advantag": 4, "affect": [4, 6, 9, 10], "afford": 6, "aforement": 28, "after": [3, 4, 10, 11, 12, 21, 26, 28, 29], "again": [0, 2, 6, 11, 18, 21, 28, 29, 31], "against": 16, "aggreg": 11, "aginst": 6, "agre": 0, "agreement": 4, "ai": 4, "aidan": 11, "al": [7, 9, 10, 11, 20, 25], "alemi": 11, "alexand": [7, 11], "alexei": 11, "algebra": 0, "algorithm": [6, 7, 9, 26, 28], "ali": 7, "align": [0, 2, 3, 4, 6, 7, 8, 9, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "all": [0, 2, 4, 6, 7, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20, 21, 23, 24, 25, 26, 28, 29, 32], "allevi": 7, "allow": [3, 7, 9, 10, 11, 13, 17, 22, 28, 29, 31], "almost": [0, 8, 11], "along": [0, 1, 8, 10, 28], "alpha": [4, 19, 22, 26], "alphabet": [26, 28, 29], "alreadi": [11, 22, 26, 28, 29, 32], "also": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 31, 32], "altern": [6, 10, 11, 17, 20, 21, 23], "although": [4, 6, 9, 14, 24], "altogeth": 31, "alwai": [0, 2, 4, 7, 8, 10, 12, 13, 14, 19, 28], "ammen": 6, "amount": [4, 7, 10, 11, 20], "an": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 22, 23, 24, 26, 28, 29, 31, 32, 33], "analog": [0, 3, 6, 17, 18, 29, 31], "analogi": 17, "analogu": [17, 18], "analysi": [0, 2, 31], "analyt": [4, 6, 9], "andd": 29, "andrea": 11, "angl": 31, "ani": [0, 1, 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 23, 24, 26, 27, 28, 29, 31, 33], "annealed_log_prob": 4, "annealedimportancesampl": 4, "annot": 28, "anoth": [0, 2, 3, 4, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 21, 26, 29], "answer": [12, 21, 26], "antisymmetr": 4, "anywher": 33, "aperiod": [24, 26], "appart": 14, "appeal": [11, 14], "appear": [4, 6, 28, 29, 31], "append": [0, 4, 6, 10], "appli": [0, 3, 4, 6, 8, 9, 10, 11, 12, 13, 19, 20, 24, 26, 28, 29], "applic": [4, 6, 7, 9, 11, 19, 29], "apply_gradi": [10, 11], "approach": [2, 3, 9, 10, 11, 12], "appropri": [2, 3, 4, 6, 7, 9, 10, 11, 12, 14, 28], "approx": [4, 7], "approxim": [2, 4, 6, 8, 11], "ar": [0, 1, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 32, 33], "arbitrari": [0, 3, 4, 6, 9], "arbitrarili": [0, 3, 4, 26], "archictectur": 11, "architectur": [10, 11], "archtectur": 11, "area": [2, 3, 20], "arg": [4, 10, 11], "arguabl": 10, "argument": [0, 2, 3, 6, 10, 11, 12, 15, 20, 31], "aris": 4, "around": [4, 6, 10, 11, 12], "arrai": [10, 11], "arriv": [0, 3, 6, 7, 8, 9, 14, 19, 20, 21, 22, 24, 28], "arrow": 28, "arsen": 11, "artifici": 11, "arxiv": [9, 11], "as_supervis": [10, 11], "ashish": 11, "asid": [6, 10], "aspect": [9, 16], "assert": [10, 11], "assign": [13, 14], "associ": [6, 12, 13, 14, 32], "assum": [0, 3, 4, 8, 10, 11, 17, 18, 20, 23, 31], "assumpt": [0, 2, 9, 20, 23, 24, 26, 31], "asymmetr": 22, "atent": 10, "atom": [13, 28], "attain": [0, 14], "attend": [10, 11], "attent": 9, "augment": [4, 10, 11, 12], "autodiff": 10, "automat": 11, "autonom": 6, "autotun": [10, 11], "avail": [1, 4, 8, 27], "averag": [7, 8, 11, 14, 21], "avoid": [20, 23], "awai": [0, 10, 31], "ax": [19, 20], "axi": [4, 6, 7, 9, 10, 11], "axler": [0, 1], "b": [2, 3, 6, 7, 10, 11, 12, 13, 14, 17, 18, 19, 20, 22, 23, 26, 28, 29, 31], "b_": [0, 23, 28, 31], "b_0": 28, "b_1": [0, 13, 14, 22, 26, 28], "b_2": [0, 13, 14, 22], "b_a": [6, 17], "b_i": 26, "b_k": [0, 13, 14, 22, 26], "b_m": 28, "b_n": [13, 22, 28, 31], "b_r": 31, "ba": [11, 26], "bab": 26, "back": 10, "backward": 22, "bain": 10, "bake": 11, "ball": [0, 32], "bar": [4, 11, 19, 28], "barrier": 22, "base": [0, 3, 4, 11, 13, 14, 20, 28, 30], "basi": 7, "basic": [3, 13, 17], "batch": [9, 10, 11], "batch_jacobian": 9, "batch_siz": [10, 11], "batchnorm": 11, "bayesian": [4, 9], "bb": 26, "bc": 29, "beacus": 26, "bear": 19, "beat": [4, 11], "becaus": [0, 2, 3, 4, 6, 8, 9, 10, 11, 12, 14, 17, 18, 19, 24, 26, 28, 29, 31], "becom": [4, 10, 11, 21, 28], "been": [0, 3, 4, 9, 11, 26, 28], "befor": [3, 4, 6, 10, 11, 22, 28], "begin": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "behav": [6, 18], "behaviour": [8, 21, 24], "behind": 10, "being": [6, 7, 8, 11, 15, 18, 29], "belief": [13, 14], "belong": 24, "below": [0, 2, 6, 7, 9, 10, 13, 15, 18, 19, 20, 23, 26, 28, 29], "benjamin": 7, "berlin": 6, "bernoulli": 23, "bernt": 6, "best": [2, 6, 17, 26], "bet": 11, "beta": [4, 9, 22], "beta_0": 4, "beta_d": [9, 11], "beta_k": 4, "beta_n": 4, "better": [4, 12, 28], "between": [0, 3, 4, 6, 7, 9, 10, 11, 17, 18, 23, 24, 28, 31, 32], "bevaus": 3, "beyer": 11, "beyond": [3, 11, 27], "bia": [4, 11], "big": [6, 7, 8, 9, 20, 21], "bigcap": [3, 31], "bigcap_": [0, 3, 32], "bigcap_n": 15, "bigcup": [0, 13, 15, 21, 31], "bigcup_": [0, 3, 31, 32], "bigg": [6, 7, 21], "biject": 18, "bimod": 9, "bin": [4, 6], "binari": [0, 3, 23, 26, 28], "birth": 21, "bis06": 7, "bishop": 7, "bit": [4, 7, 26, 28], "bkh16": 11, "black": 6, "blue": [4, 20], "bmatrix": [0, 10, 12, 26], "boldsymbol": 12, "book": [0, 1, 2, 7, 20, 24, 25, 26], "border": 20, "bore": 3, "borel": 0, "born": 21, "borovitskii": 7, "both": [0, 3, 7, 8, 11, 23, 24, 26, 28, 29, 31], "bottleneck": 10, "bottom": [5, 11, 26], "bound": [0, 3, 7, 19, 22, 26], "boundari": [10, 22, 23], "boundedd": 0, "box": 10, "bracket": [6, 8], "branch": 16, "break": [3, 10, 26, 28, 31], "brief": 21, "bring": [9, 10], "broader": 19, "broadli": 9, "bruno": 0, "bt": 29, "bu": 14, "build": [3, 10, 11, 14, 26, 28, 29], "buse": 14, "c": [3, 6, 7, 10, 11, 12, 13, 19, 21, 22, 23, 24, 26, 29, 31], "c_": [0, 21, 23], "c_1": 0, "c_2": 0, "c_k": 0, "c_n": 0, "calcul": [10, 13, 15], "calculu": 13, "call": [2, 3, 4, 7, 8, 10, 11, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "callabl": 4, "cambridg": 30, "came": 3, "can": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 26, 29, 31], "cancel": [4, 19], "cannot": [0, 2, 3, 4, 6, 8, 9, 12, 23, 24, 26, 28, 29], "cao": 10, "cap": [0, 3, 13, 15, 22, 26, 28, 29, 31], "cap_": [0, 3, 31], "capac": 11, "captur": [2, 4, 9, 14, 19], "care": 24, "carl": 7, "carlo": [4, 7, 9], "carri": 26, "case": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "cast": [4, 10, 11], "categor": 4, "categori": 9, "categoricalaccuraci": [10, 11], "categoricalcrossentropi": [10, 11], "cauchi": [0, 7], "cdot": [0, 3, 6, 9, 14, 18, 23, 31], "center": [0, 6], "central": [3, 4, 7, 9, 12, 14], "certain": [2, 3, 6, 7, 10, 13, 14, 16, 19, 21, 28, 32], "cgg": 29, "chain": [4, 9], "chain_histori": 4, "challeng": [8, 9, 12, 28], "chang": [3, 4, 6, 9, 10, 11, 19, 28], "channel": 10, "channels_last": [10, 11], "chapman": 24, "chapter": [3, 7, 13, 15, 20, 21, 28, 32], "charact": 11, "characterist": [0, 3, 23], "cheap": [7, 9], "chebyshev": 19, "check": [3, 26], "chelsea": 11, "chi_": 0, "choic": [6, 10], "choos": [0, 6, 14, 22, 23, 26, 31], "chosen": [0, 4, 6], "christian": 11, "christoph": 7, "chua": 30, "chunk": 10, "circ": [0, 3, 28], "circl": 28, "circumv": [3, 4, 19], "clarendon": [20, 25], "clarifi": [7, 12], "class": [0, 3, 4, 10, 11, 19, 24, 26, 28, 29], "class_token": 11, "classic": 4, "classif": 11, "classs": 0, "clear": [3, 7, 11], "clearli": [4, 12, 19], "close": [0, 2, 4, 6, 7, 8, 9, 10, 11, 13, 24, 26, 28], "closer": [4, 9], "closur": [0, 6, 26], "clss": 3, "clump": 7, "cnn": [10, 11], "co": [6, 7, 19], "code": 10, "coeffici": [7, 9, 16], "coin": 14, "collect": [0, 5, 10, 11, 13, 21, 26, 31, 32], "collectioin": 14, "color": 4, "colour": 10, "column": [10, 11, 12, 26], "combin": [0, 7, 10, 11, 13, 20, 26], "combine_window": 10, "come": [0, 3, 4, 7, 13, 20, 28], "command": 10, "comment": 0, "common": [0, 4, 19, 20, 21, 31], "commonli": [4, 6], "commun": 24, "compact": [7, 11], "compar": [4, 6, 11, 17, 26], "comparison": 10, "complement": [0, 3, 13, 26, 31], "complet": [0, 3, 21, 26, 31], "complex": [6, 7, 10, 28], "complic": [9, 13], "compon": [6, 11], "compos": 11, "composit": [10, 28], "compris": 11, "comput": [0, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 19, 22, 26, 28, 29], "computation": 9, "concat": 11, "concaten": [6, 7, 10, 11], "concatent": 11, "concav": 19, "concept": [15, 28], "concern": [2, 11, 15], "concis": 11, "conclud": [0, 2, 3, 22, 26, 31], "conclus": [0, 26], "concret": 11, "condens": 25, "condit": [0, 6, 8, 10, 12, 15, 19, 20, 21, 22, 23, 24, 28, 29], "condition": [13, 22], "conert": 11, "confer": [9, 10, 11], "confirm": 8, "confus": 4, "conjectur": 26, "conjug": 4, "connect": [11, 24, 28], "consecut": [0, 10], "consequ": [13, 16, 19], "consid": [0, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 17, 19, 20, 21, 22, 23, 24, 26, 28, 29], "consider": 18, "consist": [0, 11, 28, 29], "const": 8, "constant": [0, 6, 7, 8, 11, 14, 17, 19, 20, 22], "constantli": 31, "constrain": [9, 11, 22], "constraint": [9, 22], "construct": [14, 28, 29], "contain": [0, 2, 4, 10, 11, 12, 13, 14, 19, 20, 24, 25, 26, 28, 29], "context": [3, 7, 31], "continu": [0, 6, 7, 11, 13, 14, 19, 21, 28], "contradict": [0, 3, 20, 23, 24, 26, 28, 29, 31], "contrast": [6, 11, 26], "contribut": [4, 6], "conv": [10, 11], "conv2d": [10, 11], "conveni": [3, 9, 15], "converg": [0, 14, 15, 16, 17, 18, 19, 21, 23], "convers": [0, 3, 4, 10, 11, 26], "convert": [11, 26, 29], "convert_to_tensor": [4, 9], "convex": 21, "convolut": [10, 11, 17], "copi": 26, "core": 11, "correct": [4, 6], "correspond": [0, 4, 6, 7, 9, 10, 11, 12, 13, 19, 20, 23, 24, 26, 28, 29, 31], "correspondng": 28, "cost": [4, 7, 10, 12], "could": [0, 2, 4, 6, 11, 12, 13, 14, 19], "couldn": 12, "count": [3, 10, 14, 15], "countabl": [0, 13, 14, 17], "counterexampl": 0, "counterpart": [3, 17, 18], "coupl": [11, 29], "cours": [4, 27, 28, 30, 31], "cov": [7, 19], "covari": [7, 11], "cover": [0, 11, 27], "coverg": 20, "cpu": 7, "creat": [4, 10, 11, 28], "criterion": [20, 28], "cross": [0, 11], "crucial": [3, 4, 29], "cubic": [7, 26], "cumsum": 6, "cumul": [4, 6], "cup": [0, 2, 3, 13, 24, 26, 28], "cup_": [0, 3, 31], "current": [1, 4, 11], "curv": 2, "custom": 23, "cut": [10, 12], "cvf": 10, "cyan": 20, "cyclic": [10, 26], "d": [3, 4, 6, 7, 9, 10, 11, 12, 14, 16, 17, 18, 20, 21, 23, 24, 25, 26, 31, 32], "d_": [0, 4, 11], "d_1": [0, 3], "d_2": [0, 3], "d_i": 24, "d_k": 3, "d_u": 0, "d_v": [0, 31], "d_w": 0, "d_x": 31, "d_y": 31, "darker": 20, "data": [7, 8, 10, 11, 14], "data_dir": [10, 11], "data_format": [10, 11], "datapoint": 7, "dataset": 7, "david": 7, "dayan": 8, "dbk": 11, "ddot": 0, "deal": [0, 9, 16, 17, 21, 24], "decai": 7, "decent": 9, "decim": 0, "decis": 28, "declar": 14, "decreas": [0, 9, 17, 21, 23, 29], "deep": [10, 11], "deeper": 11, "def": [2, 4, 6, 7, 9, 10, 11], "defeat": 7, "defer": 11, "defin": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 23, 24, 26, 28, 29, 31, 32], "definit": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 21, 22, 23, 24, 25, 28, 29, 31, 32], "defint": 12, "defnit": 31, "degre": [7, 17], "dehghani": 11, "deisenroth": 7, "delet": 29, "delta": [0, 2, 3, 6, 7, 17, 19, 20, 26, 28, 29, 31], "delta_": [0, 24, 26], "delta_1": [26, 28], "delta_2": [26, 28], "delta_a": 26, "delta_b": 26, "delta_t": 0, "delta_x": 3, "demo": [5, 10, 11], "denot": [0, 4, 7, 8, 9, 10, 11, 13, 14, 16, 19, 26, 28], "dens": [10, 11], "densiti": [4, 7, 8, 9, 19], "depend": [6, 8, 9, 10, 11, 12, 15, 24], "deriv": [0, 2, 4, 7, 8, 10], "descend": 21, "describ": [3, 6, 11, 17, 18, 21, 23, 26, 28], "descript": [0, 11, 26], "desir": [3, 10], "despit": 3, "det": 9, "detail": [3, 4, 7, 10, 11, 17], "determin": [0, 9, 11, 14, 16, 17, 19, 22, 23, 26, 31, 32], "determinist": [6, 28], "develop": 2, "deviat": [6, 11], "devis": 9, "dexter": 30, "dfa": [26, 28, 29], "di": 21, "diag": 7, "diagon": 3, "diagram": [20, 28], "did": [0, 6], "die": 13, "diff": [7, 9], "differ": [0, 4, 6, 7, 10, 11, 15, 20, 22, 28, 29, 31], "differenti": [8, 10, 11, 19, 23], "difficult": [9, 10], "difficulti": [7, 19], "diffus": 6, "digit": 0, "digress": 21, "dilat": [0, 10], "dilin": 9, "dimens": [7, 10, 11], "dimension": [9, 10, 11, 12], "diminish": 6, "direct": [0, 3, 4, 8, 12, 16, 20], "directli": [2, 4, 6, 8, 11, 14, 18, 31, 32], "dirk": 11, "discontinu": 2, "discrep": 9, "discret": [11, 16, 17, 18, 21, 31], "discretis": 6, "discrimin": 31, "disctinct": 6, "discuss": [2, 3, 6, 11, 12, 15], "disjoint": [0, 13, 22, 26, 29], "dissimilar": 4, "distanc": [0, 31], "distinct": [3, 26, 28], "distinguish": [10, 11, 26], "distrbut": 23, "distribut": [4, 6, 7, 9, 12, 19, 21, 23], "diver": 4, "diverg": [4, 9, 19, 24], "divers": 11, "divid": [0, 10, 11, 23, 24], "divis": [10, 11, 13, 23, 26], "dk": 9, "dlogp": 9, "dmitri": 11, "do": [0, 2, 3, 4, 6, 8, 9, 10, 11, 12, 19, 26, 28, 29], "doe": [0, 3, 4, 6, 7, 8, 9, 11, 12, 14, 19, 20, 21, 22, 24, 26, 28, 29], "domain": [0, 2, 3, 6, 19, 31], "domin": 4, "don": 12, "done": [0, 28], "dosovitskii": 11, "dot": [0, 2, 3, 4, 11, 12, 19, 20, 26, 28, 29, 32], "dot_product": [10, 11], "doubl": [10, 28], "doubli": 3, "down": [2, 4, 7, 11, 12, 26, 31], "downarrow": 2, "downstream": [4, 15], "downward": 10, "draw": [4, 6, 7, 28], "drawn": [4, 7], "drift": 6, "dropdown": 10, "dropout": [10, 11, 26], "dt": [6, 14, 19], "dtype": [4, 9, 11], "du": 20, "due": [3, 8, 10, 11], "dure": [11, 23, 26], "dv": 6, "dw": 6, "dw_": 6, "dw_t": 6, "dx": [4, 6, 8, 9, 17, 18, 19, 31], "dx_": 4, "dx_0": 4, "dx_k": 4, "dx_t": 6, "dy": [17, 18, 23], "dy_": 12, "dy_t": 6, "dynam": 6, "dz": [9, 17], "e": [0, 2, 3, 4, 5, 6, 7, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29], "e_": [0, 3, 21], "e_0": [3, 21], "e_1": [0, 3, 21], "e_2": [0, 3, 21], "e_j": 3, "e_k": 3, "e_n": [0, 3], "e_t": 21, "each": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 16, 20, 21, 22, 23, 26, 28, 29, 31], "earlier": [0, 3, 4, 13, 28], "earliest": [24, 28], "eas": 11, "easi": [7, 9], "easier": [3, 13, 28], "easili": [4, 8, 9, 10, 16, 17, 28], "ed": 6, "edg": [11, 12, 29], "edward": 7, "effect": 28, "effici": [7, 9, 10], "effort": 0, "einsum": 7, "either": [3, 4, 9, 11, 20, 22, 24, 26, 28], "element": [0, 3, 10, 11, 12, 13, 23, 26, 28, 31, 32], "elementari": [13, 14], "elif": 7, "elimin": [8, 29], "ell": [0, 3, 9], "els": [3, 6], "em": 6, "email": [0, 5, 33], "emb": 11, "embed": 10, "empir": [4, 6, 8, 11], "empti": [3, 13, 26, 28, 31], "emptyset": [0, 3, 13, 17, 28, 31, 32], "en": 0, "enabl": [4, 9, 10, 13], "end": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "endpoint": [0, 2, 3], "enough": 2, "ensur": [0, 4, 9, 11, 13, 29], "enter": [26, 28], "entir": [0, 3, 11, 12, 26, 27, 28], "entireti": [6, 11], "entri": [8, 10, 11, 12, 26], "enumer": 10, "epoch": [10, 11], "eppstein": 26, "epsilon": [0, 2, 3, 7, 9, 20, 21, 26, 28, 29, 31], "eq": [7, 9], "equal": [0, 2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 24, 26, 28, 29], "equat": [0, 3, 4, 10, 11, 13, 19, 20, 21, 22, 23, 24, 26], "equip": [31, 32], "equiv": [0, 29], "equiv_l": 26, "equival": [0, 10, 11, 13, 18, 20, 21, 24, 26, 28, 29, 31], "equivari": 11, "ergod": [4, 24], "error": [0, 1, 4, 5, 7, 33], "especi": 16, "essenti": 26, "establish": [3, 28], "estim": [2, 4, 6, 7], "et": [7, 9, 10, 11, 20, 25], "eta": 21, "euclidean": 31, "euler_maruyama": 6, "evalu": [4, 8, 9, 10, 12, 18], "even": [0, 2, 4, 6, 7, 9, 10, 11, 13, 20, 22, 28, 29, 31], "event": [14, 15, 18, 21, 22, 23, 24], "eventu": 22, "ever": [21, 22], "everi": [0, 2, 6, 8, 10, 11, 14, 17, 19, 21, 24, 26, 28, 29, 31, 32], "everywher": [0, 3], "evolut": [6, 23], "evolv": [9, 21], "exact": [4, 6, 7], "exact_black_schol": 6, "exact_gp_posterior": 7, "exactli": [0, 2, 3, 4, 10, 11, 22, 26, 28, 29], "exampl": [0, 2, 3, 4, 6, 8, 9, 11, 13, 14, 15, 17, 20, 26, 28], "exapl": 10, "excel": 11, "except": [4, 10, 11, 12, 19, 24, 26, 28, 29], "exchang": 2, "exclud": 6, "exclus": [6, 15], "execut": 7, "exercis": [1, 25], "exist": [0, 2, 3, 6, 10, 11, 14, 15, 17, 18, 19, 20, 23, 24, 26, 28, 29, 31], "exp": [4, 6, 7, 9, 11, 17, 19, 20], "expand": [8, 29, 31], "expans": [0, 6, 16, 20], "expect": [2, 3, 4, 6, 7, 8, 9, 19, 21, 22, 24], "expens": [10, 12], "experi": [6, 13], "experiment": [13, 14], "experss": 13, "explain": 0, "explan": 20, "explicitli": [7, 8], "exponenti": [0, 4, 7, 11, 20, 23], "exponeti": 11, "exposit": 11, "express": [0, 2, 6, 7, 8, 9, 10, 11, 13, 14, 15, 17, 18, 21, 22, 26, 31], "extend": [2, 3, 11, 15, 17, 18, 24, 26, 28, 29], "extens": [4, 11, 15, 16], "extent": [11, 19], "extinct": 23, "extract_patch": 10, "extract_window": 10, "extrem": [10, 11], "ey": 7, "f": [0, 2, 3, 4, 6, 7, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 21, 22, 24, 26, 28, 29, 31, 32], "f_": [17, 18, 24, 26], "f_0": 0, "f_1": [0, 2, 3, 9, 26, 28], "f_2": [0, 2, 3, 26, 28], "f_a": 26, "f_b": 26, "f_d": 9, "f_g_black_schol": 6, "f_g_sine": 6, "f_k": [0, 3, 22, 26], "f_n": [0, 2, 22], "f_plot": 4, "f_x": [17, 18], "f_y": [17, 18], "f_z": 18, "facilit": 28, "fact": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 19, 20, 21, 23, 24, 28, 31], "factor": [0, 4, 10, 11, 15, 19], "factoris": 24, "fail": [2, 9], "failur": 14, "faint": 10, "fairli": 11, "fall": [2, 9, 19], "fals": [6, 10, 11], "famili": [9, 21], "familiar": 31, "far": [3, 4, 11, 28], "fast": 11, "faster": [7, 9], "fault": 27, "featur": [10, 11], "feature_dim": [10, 11], "features_data": 7, "features_pr": 7, "fed": 26, "feed": 28, "feedback": 5, "feel": [5, 33], "few": [2, 3, 4, 10, 11, 28], "fewer": [0, 26, 28], "fg": 3, "field": [8, 9], "fig": 10, "figsiz": 4, "figur": 4, "filter": [10, 11], "final": [0, 3, 11, 14, 17, 23, 24, 26, 28], "final_mlp": [10, 11], "find": [0, 8, 9, 16, 19, 26], "fine": 6, "finish": 26, "finit": [0, 2, 4, 6, 7, 9, 13, 20, 24, 26, 29], "fint": 7, "first": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 19, 20, 21, 22, 24, 26, 28, 29, 31, 32], "first_block": 10, "fit": 9, "fix": [0, 3, 6, 10, 11, 26], "flat": 10, "flatten": 10, "flaw": 3, "flexibl": [9, 10, 11], "flip": 19, "flipsid": 4, "float": 4, "float32": [9, 10, 11], "float64": 4, "fn": 7, "focu": [9, 10], "fold": 10, "follow": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 30, 31, 32, 33], "fontsiz": 4, "forget": 14, "form": [0, 2, 4, 6, 7, 9, 11, 17, 18, 19, 20, 26, 28], "formal": [6, 26, 28], "formalis": [8, 19, 29], "format": [25, 33], "former": [9, 20, 29], "formul": 10, "formula": [0, 6, 9, 22], "forward": [4, 10, 11, 22], "found": [6, 10, 11, 20, 25], "four": [2, 3, 10, 31], "fourier": 19, "fourth": [6, 31], "foward": 22, "frac": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 31], "framework": 10, "free": [5, 33], "freedom": [7, 17], "freeli": [1, 27], "frequenc": [7, 11], "frequent": 2, "from": [0, 2, 4, 5, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32, 33], "from_logit": [10, 11], "front": 17, "fsa": 26, "full": 11, "fun": 6, "func": 24, "funciton": 29, "function": [0, 4, 7, 8, 9, 10, 11, 22, 23, 24, 26, 28, 29], "fundament": [3, 13], "funtion": 0, "further": [0, 2, 3, 4, 7, 9, 10, 11, 17, 18, 19, 21, 22, 24, 28], "furthermor": [0, 24], "futur": [6, 21, 23], "g": [0, 3, 4, 6, 9, 10, 11, 14, 15, 17, 18, 19, 20, 21, 23, 24, 25, 26, 28, 29, 31], "g_": [16, 21, 28], "g_0": 21, "g_1": [0, 3, 16, 21, 26, 29], "g_2": [0, 16, 26, 29], "g_d": 9, "g_j": 3, "g_k": [0, 28], "g_n": [0, 3, 16], "g_t": 21, "g_x": 16, "g_y": 16, "gambl": 14, "gambler": 22, "game": 14, "gamma": [6, 29], "gamma_": 29, "gamma_d": 11, "gamma_k": 4, "gap": [0, 7], "gaussian": [4, 6, 7, 8, 12], "gaussiantransitionkernel": 4, "gave": 26, "gca": 4, "gcd": 24, "gear": 4, "gelli": 11, "gelu": [10, 11], "gen": 24, "gener": [0, 2, 3, 4, 7, 8, 9, 10, 11, 12, 13, 15, 17, 18, 20, 22, 23, 24, 28, 31], "generalis": [3, 11, 31], "geoffrei": [7, 11], "geometr": [21, 24], "georg": 11, "geq": [0, 3, 4, 7, 13, 14, 16, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "get": [0, 4, 6, 7, 9, 10, 11, 19, 26, 28, 33], "get_batch": [10, 11], "give": [0, 2, 3, 4, 7, 9, 10, 11, 12, 13, 14, 17, 21, 22, 26, 28, 31], "given": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "global": 12, "glorotnorm": [10, 11], "gloss": 11, "gnfa": 28, "gnn": 11, "go": [0, 1, 4, 7, 8, 11, 23, 24, 25, 26, 31], "goe": [3, 6, 20, 26], "goldwas": [0, 1], "gomez": 11, "good": [2, 9], "got": 4, "govern": 6, "gp": [7, 12], "grad": 6, "grad_i": 10, "gradient": [8, 10, 11], "gradienttap": [9, 10, 11], "gradual": 4, "grai": 10, "grain": 6, "grant": 14, "graph": [10, 11, 12, 21], "great": [9, 25], "greater": [26, 28, 31], "greatli": 4, "green": [4, 20], "gri20": 7, "grid": 10, "grimmett": [7, 20, 23, 24, 25], "group": [1, 10, 11], "groupnorm": 11, "gs01": 20, "guarante": [9, 12, 28, 29], "guo": 10, "gww86": [20, 25], "h": [0, 3, 4, 6, 9, 10, 11, 12, 18, 23, 24, 26], "h_": [0, 6, 11, 12], "h_1": 0, "h_2": 0, "h_i": 24, "h_j": 24, "h_k": 0, "h_t": 6, "h_x": 0, "ha": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 14, 15, 16, 17, 18, 19, 20, 21, 23, 24, 25, 26, 28, 29], "had": [4, 18], "half": [3, 10], "han": 10, "hand": [0, 2, 3, 12, 17, 19, 23, 29, 31], "handi": [11, 31], "handl": [2, 4, 7, 9, 10, 29], "happen": 10, "hast": 4, "hat": [0, 24], "have": [0, 1, 2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 33], "he": 11, "head": [10, 12, 14], "heavi": 9, "heidelberg": 6, "height": [0, 10, 11], "heigold": 11, "hein": 0, "help": [4, 10, 28], "helper": [7, 10, 11], "henc": 22, "here": [2, 4, 5, 6, 9, 10, 11, 12, 14, 19, 26, 31, 32], "hereaft": 4, "hierarch": 10, "hig01": 6, "high": [0, 4, 7], "higham": 6, "higher": 16, "highli": [3, 10], "highlight": [3, 4, 6, 11], "hilbert": 9, "hinton": 11, "hist": 4, "histogram": 4, "histori": 4, "hit": 24, "hold": [0, 2, 3, 4, 6, 8, 10, 14, 17, 19, 20, 21, 23, 24, 28, 29, 31], "holmogorov": 24, "home": 26, "homogen": 24, "homogeneu": 24, "hope": 3, "hopefulli": [9, 20], "horizont": 10, "hot": [10, 11], "houlsbi": 11, "how": [4, 6, 7, 9, 10, 11, 18, 21], "howev": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 22, 24, 26, 29, 31], "hspace": 24, "hu": 10, "hundr": 0, "hwc": 10, "hypothesi": 3, "hypothesis": 8, "hyv": 8, "hyvarinen": 8, "hyvarinend05": 8, "hzrs15": 11, "i": [0, 1, 4, 5, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 29, 31, 32, 33], "i_": [0, 3], "i_1": [0, 3, 24], "i_2": [0, 3, 24], "i_j": 3, "i_k": 3, "i_n": [0, 3, 24], "ib": 30, "idea": [2, 3, 4, 9, 10, 14, 20, 28, 29, 31, 33], "ideal": 2, "idealis": 8, "ident": [7, 9, 11, 14, 16, 20, 31], "ieee": 10, "iff": [3, 13, 16, 20, 23, 24, 26], "ij": [12, 26], "ijk": 10, "illia": 11, "illustr": [2, 4, 7, 9, 10, 20], "im": [0, 14, 15], "imag": [0, 10, 11, 14, 26, 32], "image_tokenis": [10, 11], "imagetokenis": [10, 11], "imagin": [6, 11], "immedi": 3, "impact": [4, 10, 11], "implement": 6, "impli": [0, 2, 3, 4, 7, 8, 10, 18, 21, 22, 23, 24, 26, 29], "implic": [6, 20, 24, 31], "impoprt": 4, "import": [2, 3, 6, 9, 10, 11, 12, 14, 19, 20, 28, 31], "improv": [4, 11, 26, 33], "inaccess": 8, "incept": 11, "includ": [0, 1, 2, 3, 6, 10, 11, 14, 15, 20, 26, 28], "inclus": [3, 15], "inconsist": 12, "incorpor": 11, "increas": [0, 2, 4, 7, 9, 10, 11, 17, 20, 28], "increment": [0, 28], "inde": [0, 31], "independ": [3, 6, 7, 8, 9, 10, 14, 20, 22, 23, 24], "index": [3, 8, 10, 11, 21, 22, 23, 26, 28], "indic": [10, 11, 20, 28], "indirectli": 32, "indiscern": 31, "indistiguish": 26, "indistinguish": 26, "individu": [0, 11, 26], "induc": [4, 7], "induct": [0, 3, 11, 21, 28], "inequ": [3, 4, 7, 21, 24], "inf": [0, 2, 3, 17, 19, 20, 23, 24], "inf_": [0, 2], "inf_p": [0, 2], "infer": 9, "infimum": 0, "infimuma": 3, "infin": 3, "infinit": [0, 2, 6, 12, 19, 20, 24, 26, 28, 29], "infinitesim": [6, 14], "inform": [0, 6, 10, 11, 12, 13, 14, 19], "infrequ": 4, "infti": [0, 2, 6, 8, 14, 17, 18, 19, 20, 21, 22, 23, 24, 31], "infty_": [0, 3, 13, 16, 17, 18, 19, 21, 22, 23, 24], "ingredi": [10, 11], "inher": 29, "inhomogen": 24, "init": 28, "initi": [4, 6, 9, 10, 11, 21, 22, 23, 24, 26, 28], "initial_distribut": 4, "initialis": [4, 9, 11], "inner": [9, 11], "innov": [10, 11], "input": [7, 10, 11, 12, 26, 28, 29], "insid": 0, "inspect": 16, "inst": 4, "instanc": [6, 11], "instancenorm": 11, "instead": [0, 3, 4, 6, 9, 10, 11, 12, 14, 18, 20, 28, 31, 32], "int": [0, 4, 6, 7, 8, 9, 10, 11, 12, 17, 18, 19, 20], "int32": 4, "int_": [0, 31], "int_0": [0, 2, 17, 31], "int_a": [0, 2], "int_b": 0, "int_c": 0, "integ": [0, 2, 11, 16, 22, 23, 26, 28, 29], "integr": [0, 3, 4, 7, 8, 9, 12, 17, 18, 19, 23], "integrand": [4, 6, 7], "intellig": 11, "inter": 11, "interact": 11, "interest": [3, 4, 6, 7, 9, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21], "interestingli": 3, "intergr": 4, "interleav": [10, 11], "intermedi": [2, 3, 31], "intern": [9, 10, 11], "interpol": [4, 19], "interpret": 7, "intersect": [0, 13, 21, 26, 28], "interv": [0, 2, 6, 14, 17, 21, 23], "intesect": 21, "intial": 4, "intialis": 4, "intra": 11, "intract": [4, 8, 9], "introduc": [2, 3, 4, 7, 11, 15, 20, 22, 28, 29, 31], "introduct": [6, 10, 20, 26], "intuit": [3, 4, 8, 15, 19, 20, 28, 31], "invari": [0, 4, 7, 19], "invers": [0, 4, 10, 12], "invert": [10, 12, 18], "inverv": 3, "involv": [4, 6, 9, 11, 15, 19, 20, 29, 31], "ioff": 11, "irrat": 0, "irreduc": 24, "irrelev": 8, "irrespect": [0, 11, 24], "is15": 11, "isbn": 6, "isol": 11, "issu": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 19, 26, 27, 28, 29, 30, 31, 32, 33], "itax": 19, "itb": 19, "iter": [4, 9, 21], "iti": 19, "its": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 26, 28, 29], "itself": [0, 4, 6, 9, 10, 11, 16, 20, 26, 28, 29, 31, 32], "itx": 19, "itz": 19, "iu": 23, "iub_": 23, "ixi": 29, "iz": 29, "j": [0, 2, 3, 6, 10, 18, 20, 24, 25, 26, 28], "j_1": [0, 3], "j_2": [0, 3], "j_n": [0, 3], "jacobian": 9, "jakob": 11, "jame": 7, "jami": 11, "jason": 9, "jensen": 4, "jian": 11, "jimmi": 11, "jit_compil": 4, "job": 11, "join": [26, 28], "joint": [0, 4, 11, 12], "jointli": 18, "jone": 11, "jordan": 9, "journal": 8, "just": [3, 4, 6, 10, 11, 12, 17, 20, 28], "justif": 0, "justifi": 31, "k": [0, 2, 3, 4, 7, 9, 10, 11, 12, 14, 16, 19, 21, 22, 23, 24, 26, 28, 29], "k1": 29, "k2": 29, "k_": [0, 11], "k_1": [0, 3], "k_2": [0, 3], "k_i": 24, "k_j": 24, "k_n": [0, 3], "kaim": 11, "kaiser": 11, "kdd": 7, "keep": [10, 11, 26, 33], "keepdim": [10, 11], "kei": [3, 10, 11, 19, 28, 31], "kera": 10, "kernel": [4, 7, 9, 11], "kernel_initi": [10, 11], "kernel_s": [10, 11], "key1": [10, 11], "key2": [10, 11], "kick": 6, "kind": [3, 4, 10, 11, 28, 29], "kiro": 11, "kl": 4, "know": [0, 26, 29], "known": [6, 10, 11, 14, 16, 26], "kolesnikov": 11, "kolmogorov": 24, "kpd": 7, "kpp": 7, "kullback": 4, "kwarg": [10, 11], "l": [0, 2, 3, 24, 26, 28, 29], "l1": 7, "l2": 7, "l_0": 23, "l_1": 3, "l_2": 3, "l_n": 3, "l_t": 23, "l_w": 19, "label": [4, 10, 11, 28], "lack": 3, "lambda": [6, 14, 16, 17, 19, 20, 22, 23, 24, 31], "lambda_": 24, "lambda_i": 24, "lamda": 6, "langl": [9, 31], "languag": 26, "laplac": 7, "laptop": [10, 11], "larg": [0, 4, 7, 9, 10, 11, 12, 26], "larger": [0, 4, 10, 19, 26, 28], "last": [0, 2, 3, 4, 6, 8, 10, 14, 15, 17, 28, 29], "lastli": [0, 2, 3, 7, 14, 24, 29], "later": [3, 11, 28, 29, 31], "latter": [8, 20, 29], "law": [17, 21], "layer": 10, "layernorm": [10, 11], "lceil": 0, "ldot": [0, 2, 3, 31], "lead": [0, 2, 3, 6, 14, 24, 26], "learn": [4, 7, 8, 9, 10, 11], "learnabl": 11, "learning_r": [10, 11], "learnt": 11, "least": [0, 2, 3, 4, 5, 8, 21, 24, 26, 28], "leav": [0, 4, 14], "lebesgu": [0, 2], "lectur": 27, "lee": 9, "left": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 31], "leftarrow": 31, "leftrightarrow": 24, "legend": 4, "lei": 11, "leibler": 4, "lemma": [0, 4, 24, 26, 31], "lempitski": 11, "len": 10, "lenght": 3, "length": [0, 2, 11, 26, 28, 29, 31], "lengthscal": [7, 9], "leq": [0, 3, 4, 6, 7, 9, 13, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "less": [0, 2, 3, 6, 11, 19], "let": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "level": [0, 6, 19], "leverag": 10, "like": [1, 2, 3, 7, 9, 11, 14, 17, 18, 27], "likelihood": 8, "lim_": [0, 2, 3, 6, 18, 20, 21, 22, 23, 24], "limit": [0, 6, 7, 10, 11, 12, 21, 23, 24, 28], "lin": 10, "linalg": 7, "line": [0, 3, 4, 10, 19, 21, 23, 24, 31], "linear": [6, 7, 10, 11, 19], "linearli": [0, 10, 11], "linspac": [4, 6, 7], "list": [0, 2, 3, 4, 10], "liter": 28, "literatur": 5, "littl": [4, 11, 31], "liu": [9, 10], "ll": [4, 6, 10, 11], "llc": 10, "llion": 11, "llj16": 9, "ln1": [10, 11], "ln2": [10, 11], "load": [4, 10, 11], "loc": [4, 7], "local": [9, 10, 11, 14], "locat": [7, 9, 11], "log": [4, 8, 9, 20, 23], "log_geometric_mixtur": 4, "log_prob": 4, "log_prob_1": 4, "log_prob_2": 4, "log_prob_ratio": 4, "log_w": 4, "logarithm": 20, "logit": [10, 11], "logp": 9, "logprob": 9, "long": [10, 11], "longer": [7, 10, 11, 26], "look": [2, 4, 6, 9, 10, 11, 28, 29, 31], "loop": 26, "loss": [0, 10, 11], "loss_fn": [10, 11], "lot": [3, 4, 25], "low": 7, "lower": [0, 7, 10, 11, 19], "luca": 11, "lukasz": 11, "lw19": 9, "m": [0, 2, 3, 4, 7, 9, 10, 11, 12, 16, 20, 22, 24, 26, 28, 29], "m_": [12, 19, 20], "m_0": 23, "m_1": [20, 26, 28], "m_2": [20, 26, 28], "m_a": 26, "m_b": 26, "m_c": 26, "m_n": 20, "m_t": 23, "m_u": 20, "m_x": 19, "m_y": 19, "machin": [4, 7, 8, 9, 10, 11, 26, 28], "made": [0, 3, 13, 26, 28], "magnitud": 4, "mahtbb": 24, "mai": [3, 4, 6, 7, 9, 10, 13, 14, 16, 17, 18, 19, 20, 26, 28, 29], "main": [3, 10, 11, 28], "maintain": [11, 33], "make": [0, 4, 6, 7, 9, 10, 11, 13, 14, 20, 21, 22, 26, 28, 29, 33], "mani": [0, 2, 4, 6, 9, 10, 11, 14, 17, 19, 26, 31], "manipul": [7, 18], "manner": 23, "map": [0, 9, 10, 11, 12, 13, 14, 18, 19, 26], "map_fn": 4, "marc": 7, "margin": [4, 12, 15, 18, 23], "marginalis": [12, 15], "mark": [10, 28], "marker": 26, "markov": [0, 4, 8, 9, 20], "mass": 16, "master": 12, "match": [7, 16], "math": [4, 10, 11], "mathbb": [0, 2, 4, 6, 7, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 31], "mathbf": [0, 12, 24], "mathcal": [0, 3, 4, 6, 7, 9, 10, 12, 13, 14, 15, 16, 17, 18, 21, 22, 23, 28, 29, 32], "mathemat": [11, 30], "mathtild": 3, "matmul": [10, 11], "matric": [10, 11, 12, 24], "matrix": [0, 10, 11, 12, 24], "matter": [0, 6], "matthia": 11, "max": [0, 4, 31], "max_": 31, "maxima": 31, "maximis": [9, 19], "maximum": [0, 8, 26, 31], "mbc": 12, "mcmc": 9, "me": [0, 5, 27, 33], "mean": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 16, 24, 26, 28, 29], "mean_pr": 7, "meaning": 20, "meaningless": 13, "meant": [10, 11, 25], "measur": [0, 1, 2, 4, 7, 14, 15, 18, 20, 21], "meeasur": 0, "member": [13, 26], "memori": [10, 28], "merg": 10, "method": [4, 7, 9, 10, 11, 15], "metric": [0, 9, 10, 11, 19, 32], "metropoli": 4, "mhsa": [10, 11], "michael": [9, 27], "middl": [2, 4, 8], "midpoint": 6, "might": [0, 8, 10, 11, 13, 14, 19], "milisecond": 7, "min": [0, 7, 24, 31], "mind": [10, 19], "minder": 11, "mine": 25, "minim": [10, 24], "minimis": [9, 20], "minimum": 0, "minu": [0, 3, 20, 28], "miss": [0, 11, 13], "mistak": [25, 26, 27], "mitig": 10, "mixtur": 4, "mlodozeniec": 0, "mlp": [10, 11], "mlp_num_hidden": [10, 11], "mlp_num_lay": [10, 11], "mnist": [10, 11], "mod": 26, "modal": 10, "mode": [4, 10], "model": [4, 6, 7, 8, 9, 10, 12, 14, 28, 29], "modest": 9, "modestli": 7, "modifi": [3, 4, 10, 26, 28], "modul": 11, "modulo": 17, "moment": [6, 11, 17, 20, 23], "monoton": [0, 23], "mont": [4, 7, 9], "moost": 4, "more": [0, 2, 3, 4, 6, 7, 9, 10, 11, 13, 15, 17, 18, 20, 23, 26, 28, 29, 31], "moreov": 26, "most": [0, 1, 3, 4, 5, 9, 10, 17, 19, 26, 31], "mostafa": 11, "mostowski": 7, "motiv": [3, 4, 11, 32], "mould": 4, "move": [0, 3, 26, 29], "mu": [0, 3, 6, 10, 11, 12, 16, 17, 19, 20, 21, 23], "mu_1": 4, "mu_2": 4, "mu_i": 24, "mu_q": 4, "much": [4, 6, 7, 9, 10, 11, 18, 19, 28], "multi": 10, "multi_head_self_attent": [10, 11], "multiheadselfattent": [10, 11], "multipl": [0, 9, 10, 11, 15, 26, 29], "multipli": [10, 19, 24], "multivari": 12, "must": [0, 3, 4, 6, 8, 10, 11, 14, 20, 23, 24, 26, 28, 31], "my": [1, 12, 26, 27], "myhil": 28, "myself": 33, "n": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 19, 20, 21, 22, 23, 24, 26, 28, 31, 32], "n_": [0, 8, 14, 15, 20, 22, 23], "n_0": 23, "n_1": [0, 12, 15], "n_2": 15, "n_i": [0, 12], "n_k": 0, "n_l": 12, "n_t": 23, "nabla": [8, 9], "nabla_": 9, "nabla_x": 9, "naiv": 3, "name": [0, 3, 6, 7, 10, 11, 13, 28, 29], "nat": 4, "natur": [3, 13, 14, 21], "nd": 7, "nea01": 4, "neal": 4, "neat": [10, 21], "necessari": 21, "necessarili": [3, 11], "need": [0, 2, 3, 4, 6, 10, 11, 12, 14, 17, 18, 24, 26, 28, 31], "neg": [3, 4, 7, 9, 17, 19, 20, 21, 24, 31], "neglect": 6, "neighbour": 11, "neighbourhood": [10, 19, 20], "neil": 11, "neither": [0, 26], "neq": [0, 3, 6, 8, 10, 19, 21, 22, 23, 24, 26, 28, 29, 31], "nerod": 28, "nest": 4, "network": [9, 10, 11], "neural": [9, 10, 11], "never": [12, 13], "nevertheless": 12, "new": [6, 21, 26, 28, 29], "next": [2, 3, 4, 6, 7, 10, 11, 15, 20, 29, 31], "next_annealed_log_prob": 4, "next_x": 4, "nfa": [26, 29], "nh": 23, "nice": [3, 6, 33], "niki": 11, "nip": 7, "nk": 26, "nm": 7, "nn": [4, 10, 11], "noam": 11, "node": [11, 12, 29], "nois": [6, 7], "noisi": 7, "nomad": 21, "non": [0, 3, 4, 7, 8, 10, 13, 16, 17, 19, 20, 21, 23, 24], "nonaddit": 0, "nondecreas": 23, "nondetermin": 29, "nondeterminist": 26, "none": [6, 7, 9, 10, 11, 28], "nonempti": [0, 2, 3, 28], "nonlinear": 11, "nontrivi": 0, "nonzero": [0, 3], "norm": 8, "normal": [4, 6, 7, 8, 10, 11, 20], "normalis": [8, 14, 17, 23], "notat": [2, 6, 7, 9, 11, 12, 16, 22, 23, 26, 28], "note": [0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 11, 12, 13, 14, 17, 18, 19, 20, 23, 24, 25, 26, 27, 28, 30, 31], "notic": 8, "notin": [3, 31], "notion": [3, 6, 11, 28, 29, 31, 32], "now": [0, 3, 4, 6, 7, 8, 9, 10, 11, 12, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "np": [4, 6, 7], "nq": 14, "nu": 0, "null": 24, "num_block": 11, "num_block_pair": 10, "num_blocks_per_stag": 10, "num_class": [10, 11], "num_data": 7, "num_epoch": [10, 11], "num_featur": 7, "num_funct": 7, "num_head": [10, 11], "num_hidden": [10, 11], "num_lay": [10, 11], "num_mlp_hidden": [10, 11], "num_mlp_lay": [10, 11], "num_out_featur": 10, "num_output": [10, 11], "num_sampl": 4, "num_stag": 10, "num_step": 4, "number": [0, 2, 3, 4, 6, 7, 9, 10, 12, 14, 18, 19, 22, 23, 24, 26, 28, 29, 31], "nz": [26, 28], "o": [6, 7, 10, 14, 20, 23, 26], "obei": 7, "oberv": 12, "object": [13, 31], "observ": [3, 4, 6, 7, 8, 9, 12, 13, 14, 16], "obtain": [0, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "occcur": 29, "occur": [4, 9, 13, 14, 15, 22, 23, 26, 29], "occurr": 29, "od": 6, "odd": 22, "odot": 11, "off": [4, 11, 30], "often": [3, 4, 6, 13, 14, 15, 16, 17, 18, 19, 20, 32], "oks92": 6, "oksend": 6, "old": 28, "omega": [0, 6, 7, 13, 14, 15, 16, 17, 18, 19, 21, 24], "omega_1": [7, 11], "omega_d": 11, "omega_m": 7, "omega_shap": 7, "ommit": [6, 20], "onc": [0, 2, 3, 4, 11, 13, 14, 19, 23, 26, 29], "one": [0, 2, 4, 7, 8, 9, 10, 11, 12, 13, 14, 18, 19, 20, 21, 22, 24, 26, 28, 29], "one_hot": [10, 11], "ones": [5, 11, 26, 28, 29], "onli": [0, 2, 3, 4, 7, 8, 9, 10, 11, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "onlin": [1, 27, 33], "onto": 29, "onward": 6, "opeart": 10, "open": [0, 2, 32], "oper": [10, 11, 26], "opos": 6, "oppos": 7, "opposit": [0, 3, 11], "optim": [10, 11], "optimis": [8, 9], "optimum": 9, "option": [3, 7, 10, 11], "orang": 20, "order": [0, 2, 10, 11, 16, 20, 26, 28, 29], "ordinari": 6, "organ": 23, "origin": [3, 4, 10, 11, 20, 26, 29], "original_shap": 10, "other": [0, 2, 4, 5, 6, 7, 8, 10, 11, 12, 14, 16, 18, 19, 21, 22, 23, 24, 26, 28, 29], "otherwis": [0, 2, 3, 4, 15, 17, 18, 19, 20, 22, 26, 28, 31], "otim": 0, "our": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 23, 28], "out": [3, 4, 6, 8, 9, 10, 11, 12, 20, 27, 28, 29, 31], "out_feat": [10, 11], "outcom": [13, 14, 17, 18], "outer": [0, 2], "outgo": 28, "outlin": 2, "output": [10, 11, 28], "outsid": [10, 26], "over": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 15, 18, 19, 20, 21, 23, 24, 26, 28, 33], "overal": [4, 10, 11], "overfit": 7, "overlap": 10, "overlin": [0, 31], "overset": 26, "overview": 11, "own": [1, 11, 17, 26], "oxford": [7, 20, 25], "p": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 32], "p_": [0, 8, 9, 15, 23, 24, 26], "p_0": [16, 23], "p_1": [0, 16, 23], "p_2": [0, 16], "p_d": 8, "p_f": 0, "p_g": 0, "p_i": 23, "p_k": [21, 23], "p_n": [0, 16, 23, 26], "p_plot": 4, "p_x": [14, 15, 17], "p_y": 15, "pad": [10, 11], "padhi": [0, 1], "page": [0, 6, 7, 33], "pair": [3, 7, 11, 12, 18, 26, 29, 31], "pairwis": 26, "palindrom": 26, "paper": [4, 6, 7, 11, 28], "parallel": [11, 28], "parallelis": 10, "paramet": [4, 6, 7, 8, 9, 10, 11, 14, 16, 17, 19, 23], "paramt": 14, "parmar": 11, "pars": 29, "parser": 29, "part": [0, 2, 3, 8, 10, 11, 23, 24, 26, 28, 29, 30, 31], "partial": [6, 8, 13, 18, 20, 23, 24], "partial_i": 8, "particl": [6, 9], "particular": [0, 2, 4, 6, 8, 10, 11, 14, 21, 24, 28, 29, 31], "particularli": 14, "partit": [0, 12], "partli": 6, "partricular": 4, "pass": [4, 9, 10, 11, 19, 26], "passag": 24, "past": [6, 23], "patch": [10, 11], "patch_merg": 10, "patch_merging_lay": 10, "patch_siz": [10, 11], "patchmerginglay": 10, "path": [6, 28], "pattern": 7, "pdf": [19, 24], "per": [10, 11], "perfect": 26, "perfectli": 14, "perform": [4, 7, 10, 11, 26], "perhap": [3, 12, 28], "period": 24, "permit": 29, "permut": [10, 11, 26], "perp": [6, 23], "persist": 9, "perturb": 6, "peter": [7, 8], "pgf": [21, 23], "phantom": 12, "phi": [7, 9, 19, 20], "phi_": [19, 23], "phi_1": [7, 20], "phi_2": 20, "phi_i": 19, "phi_m": 7, "phi_n": 20, "phi_x": 19, "phi_z": 19, "phrase": [0, 11], "pi": [4, 7, 17, 19, 20, 22, 24, 26], "pi_": [4, 22, 23], "pi_0": [4, 23], "pi_1": [4, 23], "pi_2": 23, "pi_i": [14, 24], "pi_j": 24, "pi_k": [4, 23], "pi_v": 24, "pick": [0, 4, 9, 26], "piec": 10, "pigeonhol": [26, 28], "pin": 7, "pink": 20, "pixel": 10, "place": [2, 3, 5, 7, 13, 26, 31], "pleas": [0, 5, 27, 33], "plot": [4, 7], "plt": 4, "plu": 11, "plug": 6, "pmatrix": 12, "pmf": [16, 17, 18, 21], "pmlr": 9, "point": [0, 2, 3, 4, 6, 7, 11, 12, 14, 19, 20, 21, 22, 26, 28, 29, 32], "polosukhin": 11, "polya": 24, "polynomi": 26, "pool": [10, 11], "pop": 29, "popul": 23, "posit": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 19, 22, 23, 24, 28], "position_embed": [10, 11], "positionembed": [10, 11], "possibl": [0, 2, 3, 4, 7, 9, 11, 12, 13, 14, 22, 28], "possibli": 19, "posterior": [7, 9], "postiv": 9, "power": [0, 3, 11, 21, 28, 29], "powerset": [3, 14], "pq": [14, 22], "practic": [4, 7], "pre": [0, 10, 32], "preceed": 29, "precis": [0, 6, 10, 11, 17, 28], "prefetch": [10, 11], "prefix": 26, "preimag": 31, "preliminari": 0, "preprint": 11, "preprocess": [10, 11], "preprocess_imag": [10, 11], "presenc": 20, "present": [6, 7, 8, 9, 13, 14, 17, 19, 20, 23], "preserv": [0, 4, 28], "press": [7, 20, 25], "pretend": 4, "prevent": 11, "previou": [0, 2, 3, 12, 24, 26, 28, 32, 33], "previous": [15, 18, 29], "primarili": 33, "principl": [4, 26, 28], "prml": 7, "proabil": 21, "probabilist": 8, "probabilit": 24, "probabl": [4, 6, 7, 8, 11, 18, 19, 22, 23, 24], "problem": [0, 3, 4, 9, 10, 12, 15, 16, 26, 28], "problemat": 11, "proce": [3, 4, 14, 22], "procedur": [4, 10, 26, 28, 29], "proceed": [10, 11, 21, 23, 24], "process": [7, 11, 16, 20, 22], "prod_": [15, 23], "produc": [4, 7, 9, 11, 12, 26], "product": [3, 6, 9, 11, 13, 16, 17, 20, 21], "profit": 14, "program": 29, "progress": [1, 9, 11], "prohibit": 7, "project": [10, 11], "projection_dim": [10, 11], "proof": [0, 2, 3, 6, 8, 14, 15, 19, 20, 21, 22, 23, 24, 28, 29, 31], "propag": 10, "proper": 29, "properti": [0, 2, 11, 12, 13, 15, 26, 32], "proport": [9, 20], "propos": 4, "propto": 8, "protein": 11, "prove": [0, 3, 7, 13, 15, 17, 19, 20, 21, 24, 26, 28, 29, 31], "provid": [2, 3, 6, 11, 13, 20, 29], "psi_": 8, "psi_d": 8, "pump": 26, "purpl": 4, "purpos": [6, 7, 9, 11], "purposefulli": 7, "push": 29, "put": [0, 2, 3, 4, 28], "q": [0, 2, 3, 4, 9, 10, 11, 13, 14, 16, 22, 26, 28, 29], "q_": [8, 9, 11, 26, 28], "q_0": [23, 26, 28, 29], "q_0q_1": 28, "q_1": [26, 28], "q_2": [26, 28], "q_3": 26, "q_4": 26, "q_a": [26, 28], "q_b": 26, "q_e": 26, "q_i": [26, 28], "q_j": [26, 28], "q_k": [4, 28], "q_l": 26, "q_m": 26, "q_n": 26, "q_plot": 4, "q_t": 23, "qiang": 9, "quad": 9, "quadrat": [10, 19, 31], "qualiti": 7, "quantifi": 19, "quantiti": [0, 4, 6, 12, 13, 14, 16, 20, 21], "queri": [11, 12], "question": [3, 7, 9, 10, 12, 14], "queu": 23, "quicker": 7, "quickli": [0, 6, 7, 10, 19, 21], "quit": [7, 9], "r": [0, 2, 6, 7, 9, 11, 12, 13, 14, 15, 17, 18, 19, 20, 24, 26, 28, 29, 31], "r_": [0, 6, 7, 12, 28, 29], "r_0": [28, 29], "r_1": [0, 2, 3, 28, 29, 31], "r_1r_2": 28, "r_2": [0, 2, 3, 28], "r_3": 28, "r_4": 28, "r_i": [0, 28, 29], "r_j": 0, "r_k": [0, 2, 3], "r_m": 29, "r_n": [0, 2, 28, 31], "r_w": 19, "rabin": 28, "radford": 4, "radiu": [0, 21], "radoom": 19, "rahimi": 7, "random": [4, 6, 8, 10, 11, 12, 18, 19, 20, 21, 23, 33], "randomis": 4, "randomli": 7, "rang": [0, 2, 3, 4, 6, 10, 11, 19, 23], "rangl": [9, 31], "rangle_": 9, "rapidli": 9, "ras03": 7, "rasmussen": 7, "rate": [6, 9, 10, 11, 23], "rather": [3, 4, 7, 10, 24, 31], "ratio": [4, 6], "ration": [0, 2, 23], "raw": [4, 11], "rc": 26, "rceil": 0, "re": [7, 10, 11, 29, 31, 32], "reach": [3, 21, 24, 26, 27, 28, 29], "reachabl": 26, "read": [1, 4, 5, 26, 28, 29], "readi": [2, 10, 11, 31], "real": [0, 2, 3, 7, 12, 14, 19, 20, 23, 31], "rearrang": [0, 3, 14, 15, 19, 24, 31], "reason": [0, 4, 11, 12, 13, 14, 19], "recal": [0, 3], "recap": 11, "receiv": [4, 11, 26], "recent": [5, 7, 11], "recht": 7, "recognis": [26, 29], "recognit": [7, 11], "recogns": 28, "record": 27, "recov": 8, "rectangl": [0, 2], "recur": 28, "recurr": 24, "recurs": [0, 21, 22, 23, 24, 28], "red": [4, 10, 20], "reduc": [2, 4, 7, 9, 10, 11], "reduce_logsumexp": [10, 11], "reduce_mean": [10, 11], "reduce_min": 4, "reduce_sum": 9, "refer": [3, 13, 28, 32], "refin": [0, 2, 6, 11], "reflect": 11, "reflex": [3, 24, 26], "reflexivti": 26, "regard": 9, "regardless": [12, 26], "region": 20, "regressor": 7, "regular": [8, 10, 26, 29], "reject": [4, 26, 28], "rel": [4, 7, 10, 11], "relabel": [0, 3], "relat": [0, 7, 11, 13, 15, 17, 18, 19, 20, 22, 23, 24, 26], "relationship": 11, "relax": 3, "relev": 3, "reli": [3, 7], "remain": [0, 3, 4, 7, 10, 14, 29], "remark": 11, "rememb": 6, "remind": 11, "remov": [26, 28, 29], "ren": 11, "repeat": [0, 3, 4, 11, 26, 28, 29], "repeatedli": 28, "repetit": 28, "replac": [0, 3, 8, 26, 29], "repres": [0, 3, 6, 7, 11, 13, 21, 26, 28, 29], "represent": [3, 11], "reproduc": 9, "reproduct": 6, "repsect": 12, "requir": [0, 2, 3, 4, 10, 11, 14, 20, 24, 26, 28, 29, 32], "research": 8, "resembl": 4, "reshap": [10, 11], "resiz": 10, "resnet": 11, "resolv": [7, 9], "resort": [4, 8], "respect": [0, 2, 6, 8, 9, 10, 11, 12, 13, 16, 17, 19, 23, 26, 28, 32], "rest": 26, "restrict": [3, 9, 24], "result": [0, 2, 3, 4, 8, 9, 10, 11, 13, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29, 31], "retain": [10, 11], "return": [4, 6, 7, 9, 10, 11, 22, 24, 28], "reus": 10, "revers": [4, 10, 26], "review": [2, 6, 14], "revisit": 22, "revrs": 10, "rewrit": [0, 4], "rewritten": [8, 19], "rff_posterior": 7, "rho": [19, 23], "rich": 11, "richard": 11, "rieman": [0, 2], "riemann": [0, 3, 6], "right": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 29], "rightarrow": 31, "rightward": 10, "rigor": 14, "rinen": 8, "rise": 31, "rkh": 9, "role": [11, 20], "roll": 10, "root": [19, 21], "rotat": 26, "rough": 17, "roughli": [7, 11], "row": [10, 12, 26], "rr": 7, "ruin": 22, "rule": [4, 12, 17, 28, 29], "rull": 29, "run": [4, 6, 9, 10, 11, 26, 28], "run_chain": 4, "ryan": 11, "s_": [22, 26, 29], "s_0": [22, 29], "s_1": [3, 26, 29], "s_2": [3, 26], "s_i": [14, 26, 29], "s_j": 26, "s_k": [3, 22, 26], "s_m": 29, "s_n": [3, 11, 20, 22, 26, 28], "sa": [10, 11], "sai": [0, 3, 6, 10, 11, 12, 13, 18, 19, 20, 24, 26, 28, 29, 31], "said": [3, 6, 14, 20, 23], "same": [0, 3, 4, 6, 7, 10, 11, 12, 16, 18, 19, 20, 24, 26, 28, 29, 31], "sampl": [8, 9, 12, 17], "sample_rff": 7, "sampler": 4, "samples_and_log_weight": 4, "sandwich": 10, "satisfi": [0, 3, 6, 8, 13, 14, 15, 17, 18, 20, 21, 23, 24, 26, 28, 29, 31, 32], "satisifi": 0, "save": 10, "saw": [2, 32], "scalar": [6, 9, 10], "scale": [0, 4, 7, 9, 10, 11, 17, 19], "schedul": [4, 10, 11], "schole": 6, "school": 7, "scienc": 28, "scott": 28, "script": 6, "second": [0, 2, 3, 6, 8, 10, 11, 19, 20, 21, 23, 24, 26, 28, 29, 31], "second_block": 10, "section": [0, 2, 3, 9, 20, 31], "see": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 14, 16, 19, 20, 21, 22, 23, 24, 26, 28, 31], "seed": [6, 10, 11], "seed1": [10, 11], "seed2": [10, 11], "seek": 9, "seem": 4, "seen": 19, "select": [0, 4, 12, 28], "self": [4, 12, 26], "self_attent": [10, 11], "self_attention_weight": [10, 11], "selfattent": [10, 11], "semidefinit": 9, "senc": 6, "sens": [0, 3, 4, 6, 10, 12, 13, 19, 20, 28, 29], "sensibl": [7, 13, 17], "sentenc": [10, 11], "separ": [3, 7, 9, 11, 14, 26], "sequenc": [0, 2, 4, 9, 10, 11, 14, 16, 20, 22, 23, 24, 26, 28, 29], "sequence_length": 11, "sequnc": 0, "sequnec": [3, 4], "sergei": 11, "seri": [0, 19, 21, 31], "servic": 23, "set": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 32], "setminu": [0, 2, 3, 13, 31], "seven": 4, "sever": [3, 9, 10, 13, 15], "sf": 7, "sfd": 7, "sfn": 7, "shaoq": 11, "shape": [4, 6, 7, 9, 10, 11], "share": [6, 10, 11], "shazeer": 11, "sheldon": [0, 1], "shift": 11, "shift_horizontally_and_vert": 10, "shitov": 26, "short": [2, 3, 6, 10], "shorten": 26, "shorthand": [14, 28, 31], "shortli": [3, 6, 7, 31], "should": [0, 3, 4, 6, 10, 11], "show": [0, 2, 3, 4, 6, 7, 8, 12, 13, 14, 15, 16, 19, 20, 21, 23, 24, 26, 28, 29, 31], "shown": [0, 2, 6, 10, 13, 14, 15, 16, 17, 18, 19, 20, 22, 28], "shreya": [0, 1], "shuffl": 26, "shuffle_fil": [10, 11], "siam": 6, "side": [0, 3, 6, 8, 17, 19, 20, 23, 24, 29, 31], "sigma": [0, 7, 11, 17, 19, 20, 21, 26, 28, 29], "sigma_": [12, 21, 28, 29], "sigma_1": [4, 21], "sigma_2": [4, 21, 26], "sigma_3": 26, "sigma_q": 4, "sigma_t": 21, "sigmoid": 4, "sign": [17, 19], "signific": [4, 10, 11], "significantli": [4, 7, 10], "signiticantli": 7, "sim": [0, 3, 4, 6, 7, 9], "similar": [0, 2, 3, 4, 9, 10, 11, 17, 18, 19, 29, 31], "similarli": [0, 2, 3, 4, 20, 23, 24, 26, 28, 29, 31], "simpl": [4, 10, 11], "simpler": 11, "simplest": 14, "simpli": [3, 4, 7, 10, 11, 12, 16], "simplic": 11, "simplifi": [11, 15, 18, 28], "simul": [4, 9], "simultan": 18, "sin": [6, 11], "sinc": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 15, 17, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "singl": [3, 4, 6, 7, 10, 11, 14, 18, 22, 24, 26, 28], "singla": 26, "singleton": [0, 24, 28], "sinusoid": [7, 11], "sipser": [26, 27], "situat": 7, "siva17": 11, "size": [0, 3, 6, 7, 10, 11, 21, 26], "sketch": 6, "skip": 10, "slightli": [7, 11, 12, 13, 26, 31], "slow": 7, "slowli": 28, "small": [0, 4, 6, 7, 9, 10, 11], "smaller": [0, 4, 7, 10, 11, 21], "smallest": [0, 21, 26, 28], "smooth": 9, "sn": 7, "so": [0, 2, 3, 4, 6, 7, 8, 10, 11, 13, 14, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "softmax": [10, 11], "solut": [0, 1, 3, 6, 22, 24, 26], "solv": [1, 4, 6, 7, 22], "some": [0, 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 29, 31, 33], "someth": [9, 12], "sometim": [3, 4, 10, 11, 20, 29, 31], "somewhat": [3, 10, 29], "soon": 33, "space": [0, 2, 4, 7, 8, 9, 11, 12, 17, 18, 21, 24, 26, 28], "spars": 7, "speak": 17, "special": [3, 11, 17, 18, 20, 21, 26, 28], "specif": [0, 3, 4, 10, 11, 16, 32], "specifi": [4, 6, 11, 14], "speedup": 7, "spirit": 11, "split": [10, 11, 26], "split_se": [10, 11], "spot": [5, 27, 33], "spread": 14, "springer": [6, 7], "sqrt": [2, 7, 9, 10, 11, 17, 19, 20, 31], "squar": [6, 10, 19, 21], "stabil": [6, 11], "stabilis": 11, "stack": [10, 11, 29], "stackrel": [2, 29], "stage": [10, 11, 26], "stai": 26, "standard": [2, 4, 6, 9, 10, 11, 17, 20], "standard_cauchi": 7, "star": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 26, 27, 29, 30, 31, 32, 33], "start": [4, 7, 13, 22, 24, 26, 28, 29], "state": [3, 6, 16, 17, 19, 20, 23, 26, 29, 31], "statement": [0, 3, 6, 7, 13, 14, 26], "statemet": 14, "stationari": 7, "statisfi": 24, "statist": [4, 8, 9, 14], "statistician": 17, "statur": 11, "steadi": 23, "step": [3, 4, 6, 8, 9, 11, 21, 22, 24, 26, 28, 29], "stephen": 10, "stil": 10, "still": [4, 10, 11, 13, 26], "stirzak": [7, 20, 23, 24], "stochast": [22, 23, 24], "stop": [14, 24], "store": 6, "str": [10, 11], "straightforward": [11, 17], "straightforwardli": [10, 16], "stratonovich": 6, "strength": 11, "strict": [21, 29], "strictli": [17, 19, 20], "stride": [10, 11], "string": [26, 29], "stronger": 7, "structur": [12, 31, 32], "stuck": 9, "studi": [3, 16, 19, 26, 33], "style": [5, 11], "styliz": 11, "su": 19, "sub": [10, 11], "subaddit": 0, "subconsci": 17, "subcov": 0, "subinterv": [0, 2], "submatrix": 12, "subscript": [8, 9, 11, 12, 21], "subsequ": [0, 3], "subsert": 0, "subset": [0, 2, 6, 7, 12, 13, 14, 24, 26, 32], "subseteq": [0, 3, 13, 17, 18, 21, 24, 26, 28, 29, 31, 32], "substitut": [0, 8, 19, 21, 22, 29], "substr": [26, 28, 29], "subtler": 12, "subtract": [3, 23], "success": 14, "succinctli": 29, "sucov": 0, "suffer": 2, "suffici": [2, 6, 9, 21], "suffix": 26, "suggest": [3, 4, 13, 28], "sum": [0, 3, 4, 6, 7, 8, 10, 11, 13, 14, 17, 20, 21, 22, 23, 24, 26, 29, 31], "sum_": [0, 2, 3, 4, 6, 7, 9, 11, 14, 15, 19, 20, 22, 23, 24, 31], "sum_k": 14, "sum_n": [13, 15], "summand": [6, 18], "summari": [11, 14], "summaris": 16, "summat": [0, 11, 22], "summer": 7, "sun": 11, "sup": [0, 2, 3, 19, 20], "sup_": [0, 2, 7], "sup_p": [0, 2], "super": [10, 11], "superscript": 11, "superset": [3, 29], "supos": 26, "supplementari": 6, "suppoos": 0, "support": 7, "suppos": [0, 2, 3, 4, 6, 8, 9, 10, 12, 19, 20, 21, 24, 26, 28, 29, 31], "supremum": 0, "supremuma": 3, "supseteq": [0, 3], "surpris": [3, 21], "surprisingli": [9, 28], "surviv": 21, "svg": 9, "svgd_grad": 9, "svm": 7, "swap": 28, "swin_transformer_block": 10, "swintransformerblock": 10, "switch": 20, "sy": 7, "sylvain": 11, "symbol": [7, 11, 26, 28, 29], "symmetr": [0, 3, 4, 11, 22, 24, 26], "symmetri": [11, 26, 31], "synchronis": 26, "system": [6, 23], "szegedi": 11, "t": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 14, 17, 18, 19, 20, 21, 23, 24, 26, 29], "t_": [0, 6, 23], "t_0": [0, 6, 23], "t_1": [4, 6, 23], "t_2": 6, "t_3": 6, "t_4": 6, "t_i": 24, "t_j": 24, "t_k": [4, 23], "t_n": 6, "ta": [0, 19, 20], "tab": 4, "tabl": 11, "tackl": 10, "taht": 0, "take": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 19, 20, 21, 22, 23, 24, 26, 28, 31], "taken": [2, 11, 14, 18, 21, 23], "talk": 31, "tan": 20, "tape": [9, 10, 11], "target": [4, 9], "target_distribut": 4, "task": [3, 11], "tau_j": 6, "tau_n": 6, "taylor": [6, 16, 20], "tb": [0, 19], "technic": [4, 19], "techniqu": [10, 11], "tell": 4, "ten": [0, 6], "tend": [0, 6, 11, 23], "tensor": [4, 10, 11], "tensorflow": [9, 10, 11], "tensorflow_dataset": [10, 11], "tensorflow_prob": 10, "terenin": 7, "term": [0, 3, 4, 6, 7, 8, 9, 11, 13, 15, 16, 17, 19, 21, 24, 28, 31, 32], "termin": [26, 28, 29], "terminologi": [3, 29], "test": [9, 10, 11], "text": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 31], "textbook": [20, 23, 27], "texttt": [10, 11, 26, 29], "tf": [4, 9, 10, 11], "tfd": [4, 10, 11], "tfk": [10, 11], "tfp": [10, 11], "th": [0, 2, 8, 9, 12, 14, 16, 19, 21, 24, 26], "than": [0, 2, 3, 4, 7, 9, 10, 13, 15, 19, 20, 23, 24, 26, 28, 29, 31], "thatt": 0, "thefor": 0, "thei": [0, 3, 4, 5, 7, 11, 14, 16, 18, 19, 24, 25, 26, 28, 29, 31, 32, 33], "them": [0, 4, 6, 10, 11, 16, 18, 25, 26, 28, 29, 31, 32, 33], "theme": 28, "themselv": [0, 10, 11], "theorem": [0, 2, 6, 8, 21, 22, 23, 24, 28], "theoret": 2, "theori": [3, 16, 20, 26, 28], "therefor": [0, 2, 3, 4, 6, 7, 8, 9, 10, 12, 13, 14, 19, 20, 21, 23, 24, 26, 28, 29, 31], "theta": 8, "thi": [0, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32, 33], "thing": [6, 10, 29, 31, 33], "third": [0, 2, 3, 10, 12, 17, 19, 21, 24, 26, 29, 31], "thoma": 11, "those": [0, 10, 26, 28, 31], "though": [4, 6, 7, 9, 10, 28], "thought": [2, 11], "three": [2, 6, 7, 13, 20, 26, 28, 31], "threfor": 26, "threshold": 19, "through": [0, 1, 9, 10, 11, 12, 19, 25, 26], "thu": [0, 3, 4, 7, 11, 12, 21, 24, 26, 29], "ti_n": 0, "tild": [0, 3, 4], "tile": 11, "time": [0, 3, 4, 6, 7, 9, 10, 11, 12, 14, 21, 22, 26, 28, 29, 31], "tini": 10, "tiny_swin_transform": 10, "tiny_vision_transform": 11, "tinyswintransform": 10, "tinyvisiontransform": 11, "titl": 4, "tmp": [10, 11], "togeth": [0, 2, 3, 4, 22, 23, 24, 26, 28, 32], "toi": 7, "token": 10, "token_dim": [10, 11], "token_dimens": [10, 11], "tokenis": [10, 11], "too": [2, 6, 13, 20, 21], "tool": [15, 16, 19, 20], "top": [5, 7, 8, 9, 10, 11, 12, 26, 29], "toplogi": 31, "topologi": 31, "toss": [13, 14], "total": [7, 10, 14, 21], "toward": [3, 4, 11], "trace": 9, "track": [26, 33], "tractabl": [4, 9], "trahtman": 26, "trail": 26, "train": [7, 8], "train_step": [10, 11], "trainable_vari": [10, 11], "transfer": 33, "transform": [4, 7, 19], "transformer_block": 11, "transformerblock": [10, 11], "transienc": 24, "transient": [22, 24], "transit": [3, 4, 6, 22, 24, 26, 28, 29], "transition_kernel": 4, "transition_scal": 4, "transitionkernel": 4, "translat": [0, 7], "transpos": 10, "transpose_a": [10, 11], "transpose_b": [10, 11], "treat": [11, 17, 21], "tree": 29, "trial": [4, 14, 22], "triangl": [0, 31], "trick": [10, 11], "tricki": 3, "tripl": 3, "triplet": 13, "tripo": 30, "trivial": [12, 19, 21], "trou": 10, "true": [3, 4, 6, 8, 9, 10, 11, 17], "try": [3, 6], "ts_n": 20, "tu": 19, "tupl": [23, 28, 29], "tur23": 11, "turn": [0, 3, 4, 7, 9, 10, 11, 15, 28, 29, 31], "turner": 11, "twice": [3, 6, 26], "twin_axi": 4, "twinx": 4, "two": [0, 2, 4, 6, 7, 9, 10, 11, 12, 15, 16, 17, 18, 20, 21, 22, 23, 26, 28, 29, 31], "tx": [19, 20], "type": [4, 10, 21, 29], "typic": [4, 7, 10, 11, 15], "tz_n": 20, "u": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 22, 23, 24, 26, 28, 29, 31, 32], "u_": [0, 11, 22, 29], "u_0": [16, 22], "u_1": [0, 10, 16, 29, 31, 32], "u_2": [0, 10, 16, 29], "u_i": [29, 31, 32], "u_k": [0, 11, 29], "u_m": 22, "u_n": [0, 16, 20, 22, 31, 32], "u_q": 11, "u_r": 0, "u_t": 6, "u_th_t": 6, "uav": 29, "uavaw": 29, "uavw": 29, "ubiquit": 11, "uk": [10, 11], "ultim": 4, "ulyanov": 11, "unari": 28, "unbias": [4, 7], "unbound": [0, 9], "uncertainti": 7, "unchang": 0, "unconstrain": 22, "uncount": 17, "under": [0, 2, 3, 4, 6, 8, 10, 12, 13, 19, 24, 26, 31], "underbrac": [4, 28], "underli": 31, "undesir": 11, "unequ": [0, 26], "unfortun": 4, "uniform": [0, 2], "uniformli": [0, 2, 7, 17, 19], "union": [0, 13, 26], "uniqu": [3, 6, 8, 11, 23, 24, 26], "unit": 29, "univari": 15, "univers": [7, 20, 25], "unless": 29, "unlik": [6, 9, 17, 19, 20, 32], "unrel": 18, "unterthin": 11, "until": [14, 26, 28, 29], "up": [0, 3, 4, 6, 7, 10, 11, 14, 17, 26, 28], "updat": [10, 13, 14], "upgrad": 33, "upper": [0, 4, 26], "uq": [10, 11], "us": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29, 31, 32, 33], "use_bia": [10, 11], "user": [4, 7], "usual": [3, 4, 15, 31], "uszkoreit": 11, "uv": [19, 29], "uvaw": 29, "uvl16": 11, "uvw": 29, "uvxyz": 29, "uwv": 29, "v": [0, 3, 4, 6, 10, 11, 18, 19, 22, 23, 24, 29, 31], "v4": 11, "v_": 0, "v_1": [0, 31], "v_2": [0, 31], "v_h": 11, "v_i": [0, 24], "v_j": 0, "v_k": 0, "v_l": 0, "v_t": 6, "vacuous": [28, 31], "valid": [4, 10, 11, 13, 14, 18, 28, 29], "valu": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 14, 16, 17, 18, 19, 21, 22, 23, 24, 31], "vanhouck": 11, "vanish": 6, "var": [6, 7, 14, 19, 20, 21], "var_pr": 7, "vari": [4, 19], "variabl": [4, 6, 7, 9, 10, 11, 12, 19, 20, 21, 23, 24, 29], "varianc": [6, 19, 20, 21], "variant": 11, "variou": [6, 11], "vaswani": 11, "vcl": 0, "vdot": [0, 21], "ve": [4, 11], "vector": [0, 7, 8, 9, 10, 11, 12, 24, 31], "vedaldi": 11, "veri": [2, 3, 4, 9, 10, 11, 12, 16, 25, 28, 29, 31], "verifi": [0, 4, 6, 14, 15], "verlag": 6, "versa": 16, "versatil": 11, "version": [9, 15, 17, 20, 23, 24, 26], "vert_": [6, 21], "vertic": 10, "vi": 9, "via": [0, 3, 4, 7, 9, 28], "viacheslav": 7, "vice": 16, "victor": 11, "video": [11, 27], "view": [10, 11, 28], "vincent": 11, "violat": 0, "vision": [10, 11], "visit": 24, "visualis": [4, 7, 29], "vit": [10, 11], "vitali": 0, "vmatrix": 18, "volum": [3, 11, 17], "vorel": 3, "vsp": 11, "vxy": 29, "vy": 29, "w": [0, 4, 6, 7, 9, 10, 11, 17, 19, 26, 28, 29, 31], "w_": [6, 29], "w_1": [0, 26, 28], "w_1w_2": 29, "w_2": [0, 26, 28], "w_2w_1": 26, "w_2z": 26, "w_i": [0, 29], "w_j": 6, "w_k": [0, 28], "w_m": 29, "w_n": [0, 4, 6, 26, 28], "w_t": 6, "wa": [0, 2, 3, 4, 10, 11, 12, 14, 20, 26, 28, 29], "wai": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 13, 14, 15, 17, 20, 22, 24, 26, 28, 29], "walk": 33, "wall": 7, "wang": 9, "want": [3, 4, 6, 7, 9, 11, 12, 13, 24, 31], "watch": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 26, 27, 28, 29, 30, 31, 32, 33], "wbt": 7, "we": [0, 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "weak": 8, "weaker": [6, 20], "weakli": 20, "wealth": 7, "weather": 11, "websit": 33, "wei": 10, "weigh": [4, 11], "weight": [7, 10, 11, 14], "weight_decai": [10, 11], "weissenborn": 11, "welcom": 33, "well": [0, 2, 3, 4, 6, 7, 8, 9, 11, 17, 18, 20, 23, 26, 28, 29], "welsh": [20, 25], "were": [3, 6, 7, 10, 28], "wh18": 11, "what": [0, 3, 6, 10, 11, 12, 14, 26, 28, 29], "when": [0, 3, 4, 6, 9, 10, 11, 12, 14, 16, 17, 19, 20, 22, 26, 28, 29, 31, 32], "whenev": [0, 4, 11, 12, 14, 15, 17, 18, 19, 26, 31], "where": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "wherea": [6, 10, 11, 12, 16, 17, 23], "wherebi": 26, "whether": [3, 11, 12, 28, 31], "which": [0, 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 31, 32], "while": [0, 2, 4, 6, 7, 9, 10, 12, 13, 23, 28], "white": 20, "who": 26, "whole": [7, 12, 13, 31], "whose": [0, 3, 4, 6, 8, 14, 17, 21, 28], "why": [0, 4], "wide": 11, "widespread": 6, "width": [10, 11], "wihtout": 9, "wilson": 7, "window_s": 10, "wise": [10, 11, 12], "wish": [4, 6, 20], "within": [4, 6, 8, 10, 11, 14, 19], "without": [0, 6, 7, 8, 11, 26], "won": [10, 11], "wonder": 10, "word": [0, 3, 7, 8, 10, 11, 12], "work": [0, 1, 3, 4, 6, 7, 9, 12, 19, 32], "worri": 18, "worth": [4, 11, 12, 19], "would": [0, 2, 3, 4, 7, 10, 11, 12, 13, 14, 19, 20, 26], "wrap": 12, "write": [0, 2, 3, 6, 7, 8, 12, 17, 20, 22, 23, 24, 26, 28, 29], "written": [0, 4, 11, 14, 15, 17, 18, 19, 24, 26, 29, 31], "writtten": 8, "wrong": 12, "wt": 26, "wtw": 26, "wu": 11, "wx": 26, "wz": 26, "x": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 31, 32], "x0": [4, 6], "x1": 7, "x2": 7, "x_": [0, 2, 4, 6, 9, 10, 11, 12, 17, 21, 24], "x_0": [0, 2, 4, 6, 11, 21, 24, 31], "x_1": [0, 2, 4, 11, 12, 14, 16, 17, 18, 20, 21, 23, 24, 26, 28], "x_1x_2": 0, "x_2": [0, 14, 16, 17, 18, 20, 21, 23, 24, 26], "x_accept": 4, "x_d": 9, "x_data": 7, "x_dim": 7, "x_full": 7, "x_i": [0, 2, 8, 11, 12, 24, 26], "x_j": [0, 2, 6, 11, 12, 24, 26], "x_k": [0, 2, 4, 23, 24, 26], "x_m": [9, 24], "x_mean": 6, "x_n": [0, 2, 4, 6, 9, 11, 12, 14, 16, 20, 24, 28, 31], "x_p": 26, "x_plot": 4, "x_pred": 7, "x_shape": 10, "x_stdev": 6, "x_t": [6, 21, 24], "x_w": 17, "xa": 26, "xf_x": 17, "xiangyu": 11, "xiaohua": 11, "xlabel": 4, "xlim": 4, "xtick": 4, "xw_1": 26, "xy": [15, 26, 28], "xyz": [26, 28], "xz": 26, "y": [0, 3, 6, 7, 10, 11, 14, 15, 16, 17, 18, 19, 21, 22, 24, 26, 28, 31, 32], "y_": [0, 10, 12], "y_0": [0, 24], "y_1": [0, 11, 12, 18, 20, 24, 26], "y_2": [0, 18, 20], "y_data": 7, "y_k": 24, "y_n": [0, 11, 12, 24], "y_q": 26, "y_t": 6, "ya": 26, "yellow": 20, "yet": [31, 33], "yield": [2, 3, 4, 6, 9, 11, 22, 23], "yixuan": 10, "ylabel": 4, "ylim": 4, "you": [0, 5, 6, 11, 26, 27, 33], "ytick": 4, "yue": 10, "yutong": 10, "yuxin": 11, "yx": 26, "yz": 26, "z": [0, 3, 7, 9, 15, 17, 18, 19, 20, 23, 24, 26, 28, 31], "z_": [7, 8, 11, 20], "z_0": [11, 20], "z_1": [0, 23], "z_2": 23, "z_k": 11, "z_n": [0, 6, 20, 23], "ze": 10, "zero": [0, 4, 6, 11, 19, 20, 23, 26, 28, 29], "zeta_": 7, "zhai": 11, "zhang": [10, 11], "zheng": 10, "zip": [10, 11], "zorder": 4, "zx": 26, "zy": 26, "\u00e0": 10, "\u00e4": 8, "\u010dern\u00fd": 26}, "titles": ["Exercises", "Masure, integration and real analysis", "Riemann integration", "Measures", "Annealed importance sampling", "Stream of papers", "Numerical simulation of SDEs", "Random Fourier features", "Estimation by score matching", "Stein variational gradient descent", "Shifted window transformers", "Introduction to transformers", "Why covariance functions?", "Events and Probabilities", "Discrete random variables", "Multivariate discrete distributions", "Probability generating functions", "Distribution and density functions", "Multivariate distributions", "Moment generating functions", "Main limit theorems", "Branching processes", "Random walks", "Processes in continuous time", "Markov chains", "Probability: An introduction", "Excercises", "Theory of Computation", "Finite Automata and Regular Expressions", "PDAs and context-free grammars", "Metric and topological spaces", "Metric spaces", "Topological Spaces", "Home"], "titleterms": {"": [0, 3, 6, 7, 19, 20], "1": [0, 19, 26], "10": 0, "11": [0, 26], "12": 0, "13": 0, "14": 0, "16": 0, "2": [0, 26], "20": 26, "3": 0, "31": 26, "32": 26, "33": 26, "34": 26, "4": 0, "41": 26, "43": 26, "44": 26, "45": 26, "46": 26, "47": 26, "48": 26, "5": 0, "51": 26, "52": 26, "59": 26, "6": 0, "63": 26, "67": 26, "7": 0, "8": 0, "9": 0, "A": [0, 2, 28, 29], "The": [6, 7, 8, 23], "absorpt": 22, "accept": [28, 29], "addit": 3, "algebra": 3, "algorithm": 4, "all": [3, 31], "ambigu": 29, "an": [3, 25], "analysi": 1, "anneal": 4, "applic": 10, "approxim": [3, 7, 9], "ar": [2, 3, 31], "arriv": 23, "attent": [10, 11], "automata": [28, 29], "automaton": [28, 29], "b": 0, "ball": 31, "bay": 13, "bayesian": 7, "below": 3, "bernoulli": [14, 16], "beta": 17, "between": 19, "binomi": [14, 16], "birth": 23, "block": [10, 11], "bochner": 7, "borel": 3, "bound": [2, 4], "branc": 21, "branch": 21, "c": [0, 20], "can": 28, "cauchi": [17, 19, 31], "central": 20, "cfg": 29, "cfl": 29, "chain": [6, 24], "chang": 18, "chapter": [0, 26], "characteris": 31, "characterist": [19, 20], "chebyshev": [0, 20], "chi": 17, "chomski": 29, "classif": 24, "close": [3, 29, 31], "closur": 28, "coeffici": 19, "collect": 3, "come": 23, "composit": 3, "comput": 27, "concaten": 28, "conclus": [4, 7, 9, 10, 11], "condit": [3, 13, 14, 18], "consist": 12, "constant": 31, "contain": [3, 31], "context": 29, "continu": [2, 3, 17, 18, 20, 23, 31, 32], "converg": [6, 7, 20, 24, 31], "convert": 28, "convex": 19, "convolut": [15, 18], "correl": 19, "countabl": 3, "covari": [12, 19], "cover": 3, "d": 0, "dataset": [10, 11], "death": 23, "decreas": 3, "defici": 2, "definit": [3, 15, 16], "demo": 9, "densiti": [17, 18], "deriv": [9, 16, 19, 29], "descent": 9, "deviat": 20, "differ": 3, "differenti": 6, "dimens": 6, "direct": 9, "discret": [14, 15], "disjoint": 3, "distribut": [8, 14, 15, 16, 17, 18, 20, 24], "doe": 2, "embed": 11, "empir": 9, "equal": 19, "equat": 6, "equilibrium": 24, "equiv": 28, "equival": [3, 8], "estim": 8, "euler": 6, "evalu": 6, "event": 13, "eventu": 31, "everi": 3, "exampl": [10, 16, 19, 29, 31], "excercis": [0, 26], "exercis": [0, 26], "expect": [14, 15, 17, 18], "experi": 4, "exponenti": [17, 19], "express": [28, 29], "extens": 3, "extinct": 21, "extract": 10, "factoris": [15, 16, 18, 19], "failur": 9, "featur": 7, "fenchel": 20, "finit": [3, 28], "first": 23, "form": [8, 29], "formula": [15, 16, 18], "fourier": 7, "free": 29, "from": [3, 6, 7, 16], "fsa": 28, "function": [2, 3, 6, 12, 14, 15, 16, 17, 18, 19, 20, 21, 31, 32], "fundament": 14, "g": 16, "gamma": [17, 19], "gaussian": 9, "gener": [16, 19, 21, 29], "generalis": 28, "geometr": [14, 16], "good": 3, "gradient": 9, "grammar": 29, "graph": 24, "ha": 31, "have": 3, "head": 11, "hein": 3, "higher": 6, "hoeffd": 7, "home": 33, "i": [2, 3, 9, 28], "iff": [8, 15, 18, 31], "imag": 3, "implement": [4, 7, 9, 10, 11], "impli": [14, 16, 19, 20, 28, 31], "import": 4, "increas": 3, "independ": [13, 15, 16, 18, 19], "indic": 15, "induc": [31, 32], "inequ": [0, 2, 19, 20, 31], "infimum": [2, 3], "infti": 3, "inner": 31, "integr": [1, 2, 6, 31], "inter": 23, "interchang": 2, "intersect": [3, 29], "interv": 3, "introduct": [11, 25], "invari": [3, 24], "invers": [3, 19], "invert": 9, "iter": 18, "ito": 6, "its": 31, "j": 8, "jacobian": 18, "jensen": 19, "joint": [15, 18], "kl": 9, "ksd": 9, "lack": 23, "laguag": 28, "languag": [28, 29], "larg": 20, "law": [14, 15, 18, 20], "layer": 11, "lebesgu": 3, "leftmost": 29, "legendr": 20, "lemma": [28, 29], "length": 3, "leq": 2, "limit": [2, 3, 20, 31], "lower": [2, 4], "main": 20, "markov": [19, 24], "maruyama": 6, "mass": [14, 15], "masur": 1, "match": 8, "mathbb": 3, "mathemat": 29, "mcmc": 4, "mean": [20, 21], "measur": [3, 13], "memori": 23, "method": 6, "metric": [30, 31], "mgf": [19, 20], "milstein": 6, "mixtur": 9, "mode": 9, "model": 11, "moment": [16, 19], "multi": 11, "multivari": [15, 18], "myhil": 26, "neg": [14, 16], "neighbourhood": 31, "nerod": 26, "nfa": 28, "non": [29, 31], "nonaddit": 3, "nondetermin": 28, "nondeterminist": 28, "nonexist": 3, "nontrivi": 3, "norm": 31, "normal": [17, 19, 29], "normalis": 11, "number": 20, "numer": 6, "object": 8, "one": [3, 6], "open": [3, 31], "oper": [3, 28], "order": [3, 6], "other": 3, "outer": 3, "paper": 5, "partit": [2, 13, 14], "pda": 29, "pdf": [17, 18], "perceptron": 11, "pgf": 16, "pmf": [14, 15], "point": 31, "pointwis": [2, 3], "poisson": [14, 16, 23], "popul": 21, "posit": 31, "precis": 12, "preserv": 3, "prior": 7, "probabiil": 13, "probabl": [13, 14, 15, 16, 17, 20, 21, 25], "process": [6, 21, 23], "product": [15, 18, 31], "proof": 9, "properti": [3, 17, 19, 23, 24, 28, 31], "pump": [28, 29], "pushdown": 29, "put": [10, 11], "queue": 23, "r": 3, "random": [7, 14, 15, 16, 17, 22, 24], "rate": 7, "ration": 3, "real": 1, "recognis": 28, "recurr": 22, "refer": [4, 6, 7, 8, 9, 10, 11, 20, 21, 23, 24, 25], "regress": 7, "regular": 28, "relat": 3, "residu": 11, "result": [6, 14], "revers": 24, "rff": 7, "riemann": 2, "rule": [6, 13], "sampl": [4, 6, 7, 13], "schwartz": 19, "schwarz": 31, "score": 8, "sde": 6, "self": [10, 11], "sequenc": [3, 31], "serv": 23, "set": [3, 31], "shift": 10, "sigma": 3, "simpl": [22, 23], "simul": 6, "smallest": 3, "some": 28, "space": [3, 13, 14, 30, 31, 32], "squar": [17, 20], "star": 28, "starvat": 7, "state": [24, 28], "statistician": [14, 15, 18], "steepest": 9, "stein": 9, "stochast": [6, 21], "stream": 5, "string": 28, "strong": [6, 24], "subaddit": 3, "subconsci": [14, 15, 18], "subcov": 3, "subset": [3, 31], "subspac": 31, "suffici": 3, "sum": [2, 15, 16, 18, 19], "support": 19, "supremum": [2, 3], "svgd": 9, "swin": 10, "tangent": 19, "texttt": 28, "theorem": [3, 7, 13, 14, 19, 20, 26], "theori": 27, "time": [23, 24], "togeth": [10, 11], "toi": 4, "token": 11, "topolog": [30, 32], "topologi": 32, "train": [10, 11], "transform": [9, 10, 11, 20], "transienc": 22, "translat": 3, "trick": 8, "two": [3, 14, 19], "ultim": 21, "unbound": 2, "uncount": 3, "under": [28, 29], "uniform": [7, 17, 19], "union": [3, 28], "uniqu": [16, 19, 31], "upper": 2, "variabl": [14, 15, 16, 17, 18], "varianc": [4, 7, 14], "variat": 9, "walk": [22, 24], "weak": [6, 20], "weight": 4, "why": [6, 12], "wiener": 6, "window": 10, "work": 2, "written": 28, "yield": [28, 29], "zero": 3}})
\ No newline at end of file
+Search.setIndex({"alltitles": {"": [[3, null], [3, null], [3, null], [10, null], [10, null], [11, null], [11, null], [11, null], [11, null], [11, null], [11, null], [17, "theorem-6"], [26, null], [28, "example-3"], [28, "example-5"], [28, "example-6"], [28, null], [28, "example-22"], [32, null]], " (A function that is not Riemann integrable)": [[2, "mira-eg-not-riemann-integrable"]], " (A non-context-free language)": [[29, "example-12"]], " (Additivity of outer measure if one of the sets is closed)": [[3, "theorem-43"]], " (Additivity of outer measure if one of the sets is open)": [[3, "mira:thm:additivity-of-outer-measure-if-one-set-is-open"]], " (Algebra of inverse images)": [[3, "theorem-21"]], " (Algebraic operations with measurable functions)": [[3, "theorem-29"]], " (Ambiguity)": [[29, "definition-6"]], " (Annealed Importance Sampling)": [[4, "definition-2"]], " (Approximation of Borel sets from below by closed sets)": [[3, "theorem-44"]], " (Bayes\u2019 theorem)": [[13, "theorem-7"]], " (Bochner\u2019s theorem)": [[7, "theorem-0"]], " (Borel measurable function)": [[3, "definition-25"]], " (Borel set)": [[3, "definition-19"]], " (Borel subsets of [-\\infty, \\infty])": [[3, "definition-31"]], " (Bounds on the Riemann integral)": [[2, "mira-bounds-on-riemann-integral"]], " (CFL not closed under intersection)": [[29, "lemma-13"]], " (CFLs generated by CFG in Chomsky normal form)": [[29, "lemma-8"]], " (Cauchy-Schwartz inequality)": [[19, "theorem-4"]], " (Cauchy-Schwarz inequality)": [[31, "topology:cauchy-schwarz"]], " (Central limit theorem)": [[20, "theorem-6"]], " (Characterisation of continuity)": [[31, "topology:theorem-characterisation-of-continuity"]], " (Characteristic function)": [[19, "definition-13"]], " (Chebyshev\u2019s inequality)": [[20, "theorem-4"]], " (Chomsky normal form)": [[29, "definition-7"]], " (Closed set \\iff set contains all its limit points)": [[31, "topology:lemma-closed-set-iff-set-contains-all-its-limit-points"]], " (Closure under concatenation)": [[28, "theorem-14"]], " (Closure under star)": [[28, "theorem-15"]], " (Closure under union)": [[28, "theorem-10"], [28, "theorem-13"]], " (Composition of measurable functions)": [[3, "theorem-28"]], " (Composition preserves continuity)": [[32, "topology:lemma-composition-preserves-continuity"]], " (Condition for measurable function)": [[3, "mira-thm-condition-measurable"]], " (Conditional density function)": [[18, "definition-5"]], " (Conditional expectation)": [[14, "definition-7"]], " (Conditional probabiility)": [[13, "definition-3"]], " (Conditional probability space)": [[13, "theorem-5"]], " (Context-free grammar)": [[29, "definition-0"]], " (Context-free language)": [[29, "definition-2"]], " (Continuity theorem with characteristic functions)": [[20, "theorem-13"]], " (Continuity theorem with mgfs)": [[20, "theorem-7"]], " (Continuous conditional expectation)": [[18, "definition-8"]], " (Continuous function)": [[31, "topology:def-continuous-function"], [32, "topology:def-continuous-function-topology"]], " (Continuous functions are Riemann integrable)": [[2, "theorem-8"]], " (Continuous random variable)": [[17, "definition-2"]], " (Convergence implies sequence eventually in open neighbourhood)": [[31, "topology:lemma-convergence-implies-sequence-eventually-in-open-neighbourhood"]], " (Convergence in distribution to c \\implies convergence in probability to c)": [[20, "theorem-12"]], " (Convergence in distribution)": [[20, "definition-10"]], " (Convergence in mean square \\implies convergence in probability)": [[20, "theorem-3"]], " (Convergence in probability (\\implies) convergence in distribution)": [[20, "theorem-11"]], " (Convergence in probability)": [[20, "definition-2"]], " (Convergent sequence)": [[31, "definition-3"], [32, "topology:def-convergent-sequence-topology"]], " (Convex function)": [[19, "definition-10"]], " (Convolution formula)": [[15, "theorem-6"], [18, "theorem-3"]], " (Correlation between -1 and 1)": [[19, "theorem-3"]], " (Correlation coefficient)": [[19, "definition-2"]], " (Countable sets have outer measure zero)": [[3, "mira:thm:countable-sets-have-measure-zero"]], " (Countable subadditivity)": [[3, "theorem-38"]], " (Covariance)": [[19, "definition-1"]], " (Direction of steepest descent)": [[9, "theorem-1"]], " (Discrete random variable)": [[14, "definition-0"]], " (Distribution function)": [[17, "definition-1"]], " (Equivalent form of J)": [[8, "theorem-2"]], " (Euler-Maruyama method)": [[6, "definition-1"]], " (Event space)": [[13, "definition-0"]], " (Every continuous function is Borel measurable)": [[3, "theorem-26"]], " (Every increasing function is Borel measurable)": [[3, "theorem-27"]], " (Example CFG for mathematical expressions)": [[29, "example-4"]], " (Example CFG)": [[29, "example-3"]], " (Examples of metric spaces)": [[31, "example-1"]], " (Examples of norms)": [[31, "topology:examples-of-norms"]], " (Expectation of continuous random variable)": [[17, "definition-5"]], " (Expectation of product of independent variables)": [[15, "theorem-4"]], " (Expectation)": [[14, "definition-3"]], " (FSA accepts a string / recognises a language)": [[28, "definition-2"]], " (Fenchel-Legendre transform)": [[20, "definition-8"]], " (Finite State Automaton;  FSA)": [[28, "finite-state-automaton"]], " (Generalised nondeterministic finite automaton)": [[28, "definition-17"]], " (Generating function)": [[16, "definition-0"]], " (Hausdorff space)": [[32, "topology:def-hausdorff-space"]], " (Heine-Borel)": [[3, "mira:thm:heine-borel"]], " (Hoeffding for RFF)": [[7, "lemma-2"]], " (Homeomorphism is an equivalence relation)": [[32, "topology:lemma-homeomorphism-equivalence-relation"]], " (Homeomorphism)": [[32, "topology:def-homeomorphism"]], " (Importance weighted MCMC algorithm)": [[4, "definition-1"]], " (Independence \\iff expectations of products of functions factorise)": [[18, "theorem-7"]], " (Independence \\iff expected product of functions factorises)": [[15, "theorem-5"]], " (Independence \\iff pdf factorises)": [[18, "theorem-2"]], " (Independence \\iff pmf factorises)": [[15, "theorem-3"]], " (Independence \\implies G factorises)": [[16, "theorem-5"]], " (Independence \\implies mgf of sum factorises)": [[19, "theorem-7"]], " (Independence of variables)": [[18, "definition-1"]], " (Independence)": [[13, "definition-4"], [15, "definition-2"]], " (Indicator functions)": [[15, "definition-7"]], " (Induced topology)": [[32, "topology:def-induced-topology"]], " (Inequalities with Riemann sums)": [[2, "mira-thm-refining-partitions"]], " (Infimum and supremum of a sequence of measurable functions is measurable)": [[3, "theorem-34"]], " (Infimum and supremum)": [[2, "definition-1"]], " (Inner product)": [[31, "topology:inner-product-space"]], " (Inner products induce norms)": [[31, "topology:lemma-inner-products-induce-norms"]], " (Interchanging Riemann integral and limit)": [[2, "theorem-14"]], " (Inverse image of a composition)": [[3, "theorem-22"]], " (Inverse image)": [[3, "definition-20"]], " (Inversion theorem)": [[19, "theorem-16"]], " (Ito\u2019s result for one dimension)": [[6, "theorem-5"]], " (Jacobian formula)": [[18, "theorem-4"]], " (Jensen\u2019s inequality)": [[19, "theorem-11"]], " (Joint distribution function)": [[18, "definition-0"]], " (Joint probability mass function)": [[15, "definition-0"]], " (Large deviation theorem)": [[20, "theorem-9"]], " (Law of iterated expectations)": [[18, "theorem-9"]], " (Law of the subconscious statistician - multivariate)": [[15, "theorem-1"]], " (Law of the subconscious statistician)": [[14, "theorem-4"], [18, "theorem-6"]], " (Leftmost derivation)": [[29, "definition-5"]], " (Length of an open interval)": [[3, "definition-0"]], " (Limit point)": [[31, "topology:def-limit-point"]], " (Limits are unique in Hausdorff spaces)": [[32, "topology:lemma-limits-unique-hausdorff"]], " (Limits in metric spaces are unique)": [[31, "topology:lemma-limits-in-metric-spaces-are-unique"]], " (Lower Riemann integral \\leq upper Riemann integral)": [[2, "theorem-6"]], " (Lower Riemann sum \\leq upper Riemann sum)": [[2, "theorem-4"]], " (Lower bound to importance weight variance)": [[4, "lemma-0"]], " (Markov inequality)": [[19, "theorem-9"]], " (Matching scores \\iff matching distributions)": [[8, "theorem-1"]], " (Mean square law of large numbers)": [[20, "theorem-1"]], " (Mean-square convergence)": [[20, "definition-0"]], " (Measurable function on [-\\infty, \\infty])": [[3, "mira:def:measurable-function-infinity"]], " (Measurable function)": [[3, "definition-23"]], " (Measurable space, measurable set)": [[3, "definition-17"]], " (Measure of a decreasing intersection)": [[3, "theorem-40"]], " (Measure of an increasing union)": [[3, "mira-thm-measure-increasing-union"]], " (Measure of the union of two sets)": [[3, "theorem-41"]], " (Measure preserves order; measure of a set difference)": [[3, "mira-thm-measure-preserves-order"]], " (Measure space)": [[3, "definition-36"]], " (Measure)": [[3, "definition-35"]], " (Metric space)": [[31, "topology:def-metric-space"]], " (Metric subspace)": [[31, "definition-2"]], " (Milstein\u2019s method)": [[6, "definition-4"]], " (Moment generating function)": [[19, "definition-5"]], " (Moment)": [[16, "definition-3"]], " (Moments equal to derivatives of mgf)": [[19, "theorem-6"]], " (Moments from pgf derivatives)": [[16, "theorem-4"]], " (NFA recognises A \\implies A is regular)": [[28, "toc-dfa-nfa-equivalence"]], " (Non-constant positive continuous function has positive integral)": [[31, "topology:lemma-non-constant-positive-continuous-function-has-positive-integral"]], " (Nonadditivity of the outer measure)": [[3, "mira:thm:non-additivity-of-outer-measure"]], " (Nondeterministic Finite Automaton)": [[28, "definition-11"]], " (Nonexistence of extension of length to all subsets of \\mathbb{R})": [[3, "mira:thm:nonexistence-length"]], " (Nontrivial intervals are uncountable)": [[3, "theorem-10"]], " (Norm)": [[31, "topology:def-norm"]], " (Norms induce metrics)": [[31, "topology:lemma-norms-induce-metrics"]], " (Open (closed) balls are open (closed))": [[31, "topology:lemma-open-and-closed-balls-are-open-and-closed"]], " (Open and closed balls)": [[31, "topology:def-open-and-closed-balls"]], " (Open and closed subsets)": [[31, "topology:def-open-and-closed-subsets"]], " (Open cover, finite subcover)": [[3, "definition-7"]], " (Open neighbourhood)": [[31, "topology:def-open-neighbourhood"], [32, "topology:def-open-neighbourhood-topology"]], " (Other properties of \\sigma-algebras)": [[3, "theorem-16"]], " (Outer measure is additive if sets are contained by disjoint open intervals)": [[3, "mira:thm:outer-measure-is-additive-if-sets-are-separable"]], " (Outer measure is countably subadditive)": [[3, "mira:thm:countable-subadditivity-of-outer-measure"]], " (Outer measure is translation invariant)": [[3, "theorem-5"]], " (Outer measure of a closed interval)": [[3, "mira:thm:outer-measure-of-a-closed-interval"]], " (Outer measure preserves order)": [[3, "mira:thm:outer-measure-preserves-order"]], " (Outer measure)": [[3, "definition-1"]], " (PDA accepts)": [[29, "definition-10"]], " (Partition theorem for conditional expectations)": [[14, "theorem-8"]], " (Partition theorem)": [[13, "theorem-6"]], " (Partition)": [[2, "definition-0"]], " (Pdf of a function of a random variable)": [[17, "theorem-4"]], " (Pointwise limit of S-measurable functions is S-measurable)": [[3, "theorem-30"]], " (Precision functions and consistency)": [[12, "remark-0"]], " (Probability density function properties)": [[17, "theorem-3"]], " (Probability generating function)": [[16, "definition-1"]], " (Probability mass function)": [[14, "definition-1"]], " (Probability measure)": [[13, "definition-1"]], " (Probability space)": [[13, "definition-2"]], " (Proof: Gradient of KL is the KSD)": [[9, "theorem-0"]], " (Properties of open sets)": [[31, "topology:lemma-properties-of-open-sets"]], " (Pumping Lemma for CFLs)": [[29, "toc-cfl-pumping-lemma"]], " (Pushdown automaton (PDA))": [[29, "definition-9"]], " (Random Fourier Features)": [[7, "definition-1"]], " (Random sum formula)": [[16, "theorem-6"]], " (Random variable)": [[17, "definition-0"]], " (Rational difference equivalence relation)": [[3, "mira:def:rational-difference-equivalence-relation"]], " (Regular Pumping Lemma)": [[28, "toc-dfa-pumping-lemma"]], " (Regular expressions yield regular languages)": [[28, "lemma-16"]], " (Regular expressions)": [[28, "toc-def-regex"]], " (Regular laguages \\equiv regular expressions)": [[28, "corollary-20"]], " (Regular language)": [[28, "definition-4"]], " (Regular languages can be written as regular expressions)": [[28, "lemma-18"]], " (Regular operations)": [[28, "definition-7"]], " (Riemann integral)": [[2, "definition-5"], [2, "definition-7"]], " (Riemann integration does not work with limits)": [[2, "example-12"]], " (Riemann integration does not work with pointwise limits)": [[2, "example-13"]], " (Riemann integration does not work with unbounded functions)": [[2, "example-11"]], " (Score matching objective)": [[8, "definition-0"]], " (Smallest \\sigma-algebra containing a collection of subsets)": [[3, "theorem-18"]], " (Some regular expressions)": [[28, "example-9"]], " (Stein variational gradient descent)": [[9, "definition-2"]], " (Strings and Languages)": [[28, "definition-1"]], " (Strong convergence)": [[6, "definition-2"]], " (Sufficient condition for measurable function)": [[3, "mira-thm-sufficient-condition-measurable-with-infinity"]], " (Supporting tangent theorem)": [[19, "theorem-12"]], " (Topological space)": [[32, "topology:def-topological-space"]], " (Translation invariance)": [[3, "mira:thm:outer-measure-is-translation-invariant"]], " (Two properties of characteristic functions)": [[19, "theorem-14"]], " (Two results for discrete random variables)": [[14, "theorem-5"]], " (Uniform convergence of RFF)": [[7, "lemma-3"]], " (Uniqueness of characteristic functions)": [[19, "theorem-15"]], " (Uniqueness of mgfs)": [[19, "theorem-8"]], " (Uniqueness of pgfs)": [[16, "theorem-2"]], " (Uniqueness theorem for moments)": [[19, "theorem-0"]], " (Upper and lower Riemann sums)": [[2, "definition-2"]], " (Variance)": [[14, "definition-6"]], " (Weak convergence)": [[6, "definition-3"]], " (Weak law of large numbers)": [[20, "theorem-5"]], " (Wiener process)": [[6, "definition-0"]], " (Yields, derives)": [[29, "definition-1"]], " (\\sigma-algebra)": [[3, "mira:def:sigma-algebra"]], " (\\texttt{Convert})": [[28, "algorithm-19"]], " (pmf \\implies probability space and random variable)": [[14, "theorem-2"]], "(Multi head) self-attention": [[11, "multi-head-self-attention"]], "Additivity of outer measure on Borel sets": [[3, "additivity-of-outer-measure-on-borel-sets"]], "Annealed Importance Sampling": [[4, "id2"]], "Annealed importance sampling": [[4, "annealed-importance-sampling"]], "Arrival and Inter-arrival times": [[23, "arrival-and-inter-arrival-times"]], "Bernoulli": [[14, "bernoulli"], [16, "bernoulli"]], "Beta": [[17, "beta"]], "Binomial": [[14, "binomial"]], "Binomial distribution": [[16, "binomial-distribution"]], "Birth and death process": [[23, "birth-and-death-process"]], "Branching processes": [[21, "branching-processes"]], "Cauchy": [[17, "cauchy"], [19, "cauchy"]], "Central limit theorem": [[20, "central-limit-theorem"]], "Changes of variables": [[18, "changes-of-variables"]], "Chapter 1": [[26, "chapter-1"]], "Chapter 1.A": [[0, "chapter-1-a"]], "Chapter 1.B": [[0, "chapter-1-b"]], "Chapter 2.A": [[0, "chapter-2-a"]], "Chapter 2.C": [[0, "chapter-2-c"]], "Chapter 2.D": [[0, "chapter-2-d"]], "Chapter 4.A": [[0, "chapter-4-a"]], "Chapter 5.A": [[0, "chapter-5-a"]], "Chapter 6.A": [[0, "chapter-6-a"]], "Characteristic functions": [[19, "characteristic-functions"]], "Chi-squared": [[17, "chi-squared"]], "Chomsky normal form": [[29, "chomsky-normal-form"]], "Classification of states": [[24, "classification-of-states"]], "Closure properties": [[28, "closure-properties"]], "Closure properties: union": [[28, "closure-properties-union"]], "Conclusion": [[4, "conclusion"], [7, "conclusion"], [9, "conclusion"], [10, "conclusion"], [11, "conclusion"]], "Conditional density functions": [[18, "conditional-density-functions"]], "Conditional expectations": [[14, "conditional-expectations"]], "Conditional probability and independence": [[13, "conditional-probability-and-independence"]], "Conditions for measurable functions": [[3, "conditions-for-measurable-functions"]], "Context Free Pumping Lemma": [[29, "context-free-pumping-lemma"]], "Context-free grammars": [[29, "context-free-grammars"]], "Continuous random variables": [[17, "continuous-random-variables"]], "Convergence in distribution": [[20, "convergence-in-distribution"]], "Convergence in mean-square": [[20, "convergence-in-mean-square"]], "Convergence in probability": [[20, "convergence-in-probability"]], "Convergence to equilibrium": [[24, "convergence-to-equilibrium"]], "Covariance and correlation": [[19, "covariance-and-correlation"]], "Dataset": [[10, "dataset"], [11, "dataset"]], "Deficiencies of the Riemann integral": [[2, "deficiencies-of-the-riemann-integral"]], "Definition": [[15, "definition"], [16, "definition"]], "Definition of the outer measure": [[3, "definition-of-the-outer-measure"]], "Demo on mixture of Gaussians": [[9, "demo-on-mixture-of-gaussians"]], "Derivation of SVGD": [[9, "derivation-of-svgd"]], "Direction of steepest descent": [[9, "direction-of-steepest-descent"]], "Discrete random variables": [[14, "discrete-random-variables"], [14, "id1"]], "Distribution and density functions": [[17, "distribution-and-density-functions"]], "Distribution functions": [[17, "distribution-functions"]], "Empirical approximation": [[9, "empirical-approximation"]], "Estimation by score matching": [[8, "estimation-by-score-matching"]], "Euler-Maruyama method": [[6, "euler-maruyama-method"]], "Evaluating a stochastic integral": [[6, "evaluating-a-stochastic-integral"]], "Events and Probabilities": [[13, "events-and-probabilities"]], "Example application": [[10, "example-application"]], "Examples of MGFS": [[19, "examples-of-mgfs"]], "Excercise 1.B.3": [[0, null]], "Excercise 2.A.8": [[0, null]], "Excercises": [[26, "excercises"]], "Exercise 1.1": [[26, null]], "Exercise 1.11": [[26, null]], "Exercise 1.2": [[26, null]], "Exercise 1.20": [[26, null]], "Exercise 1.31": [[26, null]], "Exercise 1.32": [[26, null]], "Exercise 1.33": [[26, null]], "Exercise 1.34": [[26, null], [26, null]], "Exercise 1.41": [[26, null]], "Exercise 1.43": [[26, null]], "Exercise 1.44": [[26, null]], "Exercise 1.45": [[26, null]], "Exercise 1.46": [[26, null]], "Exercise 1.47": [[26, null]], "Exercise 1.48": [[26, null]], "Exercise 1.51": [[26, "toc-ex-151"]], "Exercise 1.52 (Myhill-Nerode theorem)": [[26, "toc-myhill-nerode-theorem"]], "Exercise 1.59": [[26, null]], "Exercise 1.63": [[26, null]], "Exercise 1.67": [[26, null]], "Exercise 1.A.1": [[0, null]], "Exercise 1.A.10": [[0, null]], "Exercise 1.A.11": [[0, null]], "Exercise 1.A.12": [[0, null]], "Exercise 1.A.13": [[0, null]], "Exercise 1.A.14": [[0, null]], "Exercise 1.A.2": [[0, null]], "Exercise 1.A.3": [[0, "mira-ex-1a3"]], "Exercise 1.A.4": [[0, "mira-ex-1a4"]], "Exercise 1.A.5": [[0, null]], "Exercise 1.A.6": [[0, null]], "Exercise 1.A.7": [[0, "mira-ex-1a7"]], "Exercise 1.A.8": [[0, null]], "Exercise 1.A.9": [[0, null]], "Exercise 1.B.1": [[0, null]], "Exercise 1.B.2": [[0, null]], "Exercise 1.B.4": [[0, null]], "Exercise 1.B.5": [[0, null]], "Exercise 2.A.1": [[0, null]], "Exercise 2.A.10": [[0, null]], "Exercise 2.A.3": [[0, null]], "Exercise 2.A.4": [[0, null]], "Exercise 2.A.5": [[0, null]], "Exercise 2.A.6": [[0, null]], "Exercise 2.A.7": [[0, null]], "Exercise 2.A.9": [[0, null]], "Exercise 2.C.1": [[0, null]], "Exercise 2.C.10": [[0, null]], "Exercise 2.C.11": [[0, null]], "Exercise 2.C.12": [[0, null]], "Exercise 2.C.2": [[0, null]], "Exercise 2.C.3": [[0, null]], "Exercise 2.C.4": [[0, null]], "Exercise 2.C.5": [[0, null]], "Exercise 2.C.6": [[0, null]], "Exercise 2.C.7": [[0, null]], "Exercise 2.C.8": [[0, null]], "Exercise 2.C.9": [[0, null]], "Exercise 2.D.1": [[0, null]], "Exercise 2.D.10": [[0, null]], "Exercise 2.D.11": [[0, null]], "Exercise 2.D.2": [[0, null]], "Exercise 2.D.3": [[0, null]], "Exercise 2.D.4": [[0, null]], "Exercise 2.D.5": [[0, null]], "Exercise 2.D.6": [[0, null]], "Exercise 2.D.7": [[0, null]], "Exercise 2.D.8": [[0, null]], "Exercise 2.D.9": [[0, null]], "Exercise 4.A.1": [[0, null]], "Exercise 4.A.10": [[0, null]], "Exercise 4.A.11": [[0, null]], "Exercise 4.A.2 (Chebyshev\u2019s inequality)": [[0, null]], "Exercise 4.A.3": [[0, null]], "Exercise 4.A.4": [[0, null]], "Exercise 4.A.9": [[0, null]], "Exercise 5.A.1": [[0, null]], "Exercise 5.A.10": [[0, null]], "Exercise 6.A.1": [[0, null]], "Exercise 6.A.10": [[0, null]], "Exercise 6.A.11": [[0, null]], "Exercise 6.A.12": [[0, null]], "Exercise 6.A.14": [[0, null]], "Exercise 6.A.16": [[0, null]], "Exercise 6.A.2": [[0, null]], "Exercise 6.A.3": [[0, null]], "Exercise 6.A.4": [[0, null]], "Exercise 6.A.5": [[0, null]], "Exercise 6.A.6": [[0, null]], "Exercise 6.A.7": [[0, null]], "Exercise 6.A.8": [[0, null]], "Exercises": [[0, "exercises"]], "Exercises 2.A.2": [[0, null]], "Expectation and independence": [[15, "expectation-and-independence"]], "Expectations": [[14, "expectations"], [17, "expectations"], [18, "expectations"]], "Exponential": [[17, "exponential"], [19, "exponential"]], "Failure mode on mixture of Gaussians": [[9, "failure-mode-on-mixture-of-gaussians"]], "Finite Automata and Regular Expressions": [[28, "finite-automata-and-regular-expressions"]], "Finite State Automata": [[28, "finite-state-automata"]], "First come, first served queue": [[23, "first-come-first-served-queue"]], "Function of a Wiener process": [[6, "function-of-a-wiener-process"]], "Functions of random variables": [[17, "functions-of-random-variables"]], "Fundamental discrete distributions": [[14, "fundamental-discrete-distributions"]], "Gamma": [[17, "gamma"], [19, "gamma"]], "Geometric": [[14, "geometric"]], "Geometric distribution": [[16, "geometric-distribution"]], "Good properties": [[3, "good-properties"]], "Heine-Borel theorem": [[3, "heine-borel-theorem"]], "Home": [[33, "home"]], "Implementation": [[4, "implementation"], [7, "implementation"], [9, "implementation"], [10, "implementation"], [11, "implementation"]], "Importance sampling": [[4, "importance-sampling"]], "Importance-weighted MCMC": [[4, "importance-weighted-mcmc"]], "Independence and sums": [[18, "independence-and-sums"]], "Indicator functions": [[15, "indicator-functions"]], "Inner products": [[31, "inner-products"]], "Introduction to transformers": [[11, "introduction-to-transformers"]], "Invariant distributions": [[24, "invariant-distributions"]], "Invertible transformations": [[9, "invertible-transformations"]], "Joint distributions": [[18, "joint-distributions"]], "Lack of memory property": [[23, "lack-of-memory-property"]], "Large deviations": [[20, "large-deviations"]], "Lebesgue measure": [[3, "lebesgue-measure"]], "Limits of characteristic functions": [[20, "limits-of-characteristic-functions"]], "Main limit theorems": [[20, "main-limit-theorems"]], "Markov and Jensen inequalities": [[19, "markov-and-jensen-inequalities"]], "Markov chain and property": [[24, "markov-chain-and-property"]], "Markov chains": [[24, "markov-chains"]], "Masure, integration and real analysis": [[1, "masure-integration-and-real-analysis"]], "Mean population": [[21, "mean-population"]], "Measurable functions": [[3, "measurable-functions"]], "Measurable spaces and functions": [[3, "measurable-spaces-and-functions"]], "Measures": [[3, "measures"]], "Measures and their properties": [[3, "measures-and-their-properties"]], "Metric and topological spaces": [[30, "metric-and-topological-spaces"]], "Metric spaces": [[31, "metric-spaces"], [31, "id1"]], "Milstein\u2019s higher order method": [[6, "milstein-s-higher-order-method"]], "Modelling with tokens": [[11, "modelling-with-tokens"]], "Moment generating functions": [[19, "moment-generating-functions"], [19, "id1"]], "Moments": [[16, "moments"], [19, "moments"]], "Multi-layer perceptron": [[11, "multi-layer-perceptron"], [11, "id10"]], "Multivariate discrete distributions": [[15, "multivariate-discrete-distributions"]], "Multivariate distributions": [[18, "multivariate-distributions"]], "Negative binomial": [[14, "negative-binomial"]], "Negative binomial distribution": [[16, "negative-binomial-distribution"]], "Nonadditivity of the outer measure": [[3, "nonadditivity-of-the-outer-measure"]], "Nondeterminism": [[28, "nondeterminism"]], "Normal": [[17, "normal"], [19, "normal"]], "Norms": [[31, "norms"]], "Numerical simulation of SDEs": [[6, "numerical-simulation-of-sdes"]], "Open and closed sets": [[31, "open-and-closed-sets"]], "Outer measure": [[3, "outer-measure"]], "PDAs and context-free grammars": [[29, "pdas-and-context-free-grammars"]], "Partition theorem and Bayes\u2019 rule": [[13, "partition-theorem-and-bayes-rule"]], "Poisson": [[14, "poisson"]], "Poisson distribution": [[16, "poisson-distribution"]], "Probability generating functions": [[16, "probability-generating-functions"], [21, "probability-generating-functions"]], "Probability mass functions": [[14, "probability-mass-functions"]], "Probability measures": [[13, "probability-measures"]], "Probability spaces": [[13, "probability-spaces"]], "Probability: An introduction": [[25, "probability-an-introduction"]], "Processes in continuous time": [[23, "processes-in-continuous-time"]], "Properties of measurable functions": [[3, "properties-of-measurable-functions"]], "Properties of measures": [[3, "properties-of-measures"]], "Pushdown Automata": [[29, "pushdown-automata"]], "Putting it together": [[10, "putting-it-together"], [11, "putting-it-together"], [11, "id14"]], "RFF and Bayesian regression": [[7, "rff-and-bayesian-regression"]], "Random Fourier features": [[7, "random-fourier-features"]], "Random walk on a graph": [[24, "random-walk-on-a-graph"]], "Random walks": [[22, "random-walks"]], "Random walks with absorption": [[22, "random-walks-with-absorption"]], "Rates of convergence": [[7, "rates-of-convergence"]], "Recurrence and transience": [[22, "recurrence-and-transience"]], "References": [[4, "references"], [6, "references"], [7, "references"], [8, "references"], [9, "references"], [10, "references"], [11, "references"], [20, "references"], [21, "references"], [23, "references"], [24, "references"], [25, "references"]], "Regression with RFF features": [[7, "regression-with-rff-features"]], "Regular languages \\equiv Regular expressions": [[28, "regular-languages-equiv-regular-expressions"]], "Regular operations": [[28, "regular-operations"]], "Regular pumping lemma": [[28, "regular-pumping-lemma"]], "Residuals and normalisation": [[11, "residuals-and-normalisation"]], "Riemann integral": [[2, "riemann-integral"]], "Riemann integration": [[2, "riemann-integration"]], "Sample and event spaces": [[13, "sample-and-event-spaces"]], "Sampling from a Wiener process": [[6, "sampling-from-a-wiener-process"]], "Sampling from the prior": [[7, "sampling-from-the-prior"]], "Self-attention": [[11, "self-attention"]], "Sequences": [[32, "sequences"]], "Shifted window transformers": [[10, "shifted-window-transformers"]], "Shifted windows": [[10, "shifted-windows"]], "Sigma algebras": [[3, "sigma-algebras"]], "Simple birth process": [[23, "simple-birth-process"]], "Simple random walk": [[22, "simple-random-walk"]], "Stein variational gradient descent": [[9, "stein-variational-gradient-descent"]], "Stochastic and brancing processes": [[21, "stochastic-and-brancing-processes"]], "Stochastic chain rule": [[6, "stochastic-chain-rule"]], "Stream of papers": [[5, "stream-of-papers"]], "Strong Markov property": [[24, "strong-markov-property"]], "Strong and weak convergence": [[6, "strong-and-weak-convergence"]], "Sums of discrete random variables": [[15, "sums-of-discrete-random-variables"]], "Sums of independent variables": [[16, "sums-of-independent-variables"]], "Swin transformer block": [[10, "swin-transformer-block"]], "The Poisson process": [[23, "the-poisson-process"]], "The RFF approximation": [[7, "the-rff-approximation"]], "The Wiener process": [[6, "the-wiener-process"]], "The score matching trick": [[8, "the-score-matching-trick"]], "Theory of Computation": [[27, "theory-of-computation"]], "Time reversal": [[24, "time-reversal"]], "Tokens and embeddings": [[11, "tokens-and-embeddings"], [11, "id12"]], "Topological Spaces": [[32, "topological-spaces"]], "Topologies": [[32, "topologies"]], "Toy experiment": [[4, "toy-experiment"]], "Training": [[10, "training"], [11, "training"]], "Transformer block": [[11, "transformer-block"], [11, "id11"]], "Ultimate extinction": [[21, "ultimate-extinction"]], "Uniform": [[17, "uniform"], [19, "uniform"]], "Uniqueness of PGFs and examples": [[16, "uniqueness-of-pgfs-and-examples"]], "Variance starvation": [[7, "variance-starvation"]], "Why covariance functions?": [[12, "why-covariance-functions"]], "Why stochastic differential equations": [[6, "why-stochastic-differential-equations"]], "Window extraction": [[10, "window-extraction"]], "Window shifting": [[10, "window-shifting"]], "Windowed self-attention": [[10, "windowed-self-attention"]]}, "docnames": ["book/mira/000-exercises", "book/mira/000-intro", "book/mira/001-riemann", "book/mira/002-measures", "book/papers/ais/ais", "book/papers/intro", "book/papers/num-sde/num-sde", "book/papers/rff/rff", "book/papers/score-matching/score-matching", "book/papers/svgd/svgd", "book/papers/swin/swin", "book/papers/transformers/transformers", "book/papers/why-covariances/why-covariances", "book/prob-intro/ch01/content", "book/prob-intro/ch02/content", "book/prob-intro/ch03/content", "book/prob-intro/ch04/content", "book/prob-intro/ch05/content", "book/prob-intro/ch06/content", "book/prob-intro/ch07/content", "book/prob-intro/ch08/content", "book/prob-intro/ch09/content", "book/prob-intro/ch10/content", "book/prob-intro/ch11/content", "book/prob-intro/ch12/content", "book/prob-intro/intro", "book/toc/000-exercises", "book/toc/000-intro", "book/toc/001-fsa", "book/toc/002-cfl", "book/topology/000-intro", "book/topology/001-metric-spaces", "book/topology/002-topological-spaces", "intro"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.intersphinx": 1, "sphinxcontrib.bibtex": 9}, "filenames": ["book/mira/000-exercises.md", "book/mira/000-intro.md", "book/mira/001-riemann.md", "book/mira/002-measures.md", "book/papers/ais/ais.ipynb", "book/papers/intro.md", "book/papers/num-sde/num-sde.ipynb", "book/papers/rff/rff.ipynb", "book/papers/score-matching/score-matching.md", "book/papers/svgd/svgd.ipynb", "book/papers/swin/swin.ipynb", "book/papers/transformers/transformers.ipynb", "book/papers/why-covariances/why-covariances.md", "book/prob-intro/ch01/content.ipynb", "book/prob-intro/ch02/content.ipynb", "book/prob-intro/ch03/content.ipynb", "book/prob-intro/ch04/content.ipynb", "book/prob-intro/ch05/content.ipynb", "book/prob-intro/ch06/content.ipynb", "book/prob-intro/ch07/content.ipynb", "book/prob-intro/ch08/content.ipynb", "book/prob-intro/ch09/content.md", "book/prob-intro/ch10/content.md", "book/prob-intro/ch11/content.md", "book/prob-intro/ch12/content.md", "book/prob-intro/intro.md", "book/toc/000-exercises.md", "book/toc/000-intro.md", "book/toc/001-fsa.ipynb", "book/toc/002-cfl.ipynb", "book/topology/000-intro.md", "book/topology/001-metric-spaces.md", "book/topology/002-topological-spaces.md", "intro.md"], "indexentries": {}, "objects": {}, "objnames": {}, "objtypes": {}, "terms": {"": [2, 4, 9, 10, 11, 12, 14, 16, 17, 18, 21, 22, 23, 24, 26, 27, 28, 29, 30, 31], "0": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "0011": 29, "01": 26, "02": 4, "03167": 11, "03385": 11, "03762": 11, "04471": 9, "045": 10, "06": 7, "06450": 11, "07": [4, 7], "08022": 11, "081": 10, "082": 11, "08494": 11, "090": 11, "1": [2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 24, 25, 28, 29, 31, 32], "10": [4, 10, 11, 26], "100": [0, 4, 7], "1000": [4, 6], "10012": 10, "10022": 10, "1024": 10, "10557": 11, "11": [4, 20, 28], "125": 4, "128": 11, "13": [4, 7], "139": 4, "1502": 11, "1512": 11, "16": [4, 10, 11], "1607": 11, "1608": 9, "1654": 26, "16x16": 11, "17": 11, "1706": 11, "18": 4, "1803": 11, "1959": 28, "198": 4, "1986": [20, 25], "1992": 6, "1_": 15, "1_0": [6, 17], "1_a": 15, "1cm": 24, "1e": [4, 10, 11], "2": [2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 28, 29, 31], "20": 7, "200": 7, "2001": [4, 6, 20, 23, 24], "2003": 7, "2005": 8, "2006": 7, "2007": 7, "2015": 11, "2016": [9, 11], "2017": 11, "2018": 11, "2019": 9, "2020": 7, "2021": [10, 11], "2023": 11, "21": [10, 11], "22": 4, "2304": 11, "255": [10, 11], "256": 10, "276": 9, "28": 11, "284": 9, "2b": 7, "2c": 0, "2f": 31, "2j": 0, "2k": 0, "2m": [22, 28], "2n": [0, 26, 28], "2p": 26, "2t": [0, 19, 21], "2u_2": 10, "2z": 26, "3": [3, 4, 6, 7, 10, 11, 13, 14, 20, 24, 26, 29, 31], "309": 4, "31": 11, "32": 10, "3387533354": 6, "391": 20, "3a_1": 26, "3k": 0, "3n": 0, "3rd": 6, "3z": 26, "4": [3, 4, 6, 7, 10, 11, 13, 19, 26, 29, 31], "40": [4, 7], "4096": 10, "415": 6, "419": 6, "43": 6, "44": 6, "46": 4, "48": 6, "4ac": 19, "4k": 0, "4pq": 22, "5": [4, 6, 7, 9, 10, 13, 20, 28, 31], "500": 6, "5000": 7, "525": 6, "53": 7, "546": 6, "55": 7, "6": [3, 4, 10, 13, 29], "60": 4, "61": 7, "63": 7, "64": 10, "69": 3, "71": 7, "74": 31, "8": [4, 6, 10, 11, 31], "806": 7, "848": 4, "9": 11, "915": 6, "917": 6, "956": [10, 11], "957": 11, "959": 10, "99": 0, "A": [3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 15, 17, 19, 20, 21, 22, 23, 24, 25, 26, 31, 32], "And": 0, "As": [0, 2, 3, 4, 6, 11, 13, 14, 18, 19, 23, 26, 28], "At": [11, 21, 26], "But": [0, 10, 11, 28, 31], "By": [0, 3, 4, 6, 7, 8, 9, 11, 14, 16, 19, 20, 23, 24, 26, 28, 29, 31], "For": [0, 2, 3, 4, 6, 7, 9, 10, 11, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "If": [0, 2, 3, 4, 5, 6, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 31, 32, 33], "In": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 16, 18, 19, 20, 21, 22, 24, 26, 28, 29, 31, 32], "It": [0, 2, 3, 4, 6, 10, 11, 12, 18, 19, 21, 23, 24, 28, 29, 31], "Of": 4, "On": [0, 2, 4, 12, 19, 23], "One": [3, 4, 6, 7, 9, 10, 11, 12, 15, 16, 19, 20], "That": [2, 3, 11, 14], "The": [0, 2, 3, 4, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 25, 26, 28, 29, 31, 32], "Then": [0, 2, 3, 6, 7, 8, 9, 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "There": [3, 11, 12, 14, 19, 23, 24, 26, 28], "These": [1, 2, 4, 10, 11, 13, 20, 25, 27, 29, 30, 31, 32], "To": [0, 3, 4, 6, 7, 9, 10, 11, 14, 19, 20, 21, 22, 23, 24, 26, 31], "With": [2, 3, 6, 10, 31], "_": [0, 4, 7, 8, 9, 11, 12, 18, 19, 21, 24, 26, 28, 31, 32], "_1": [0, 4, 31], "_2": [0, 31], "__call__": 4, "__init__": [4, 10, 11], "_a": 0, "_d": 9, "_h": 11, "_i": 0, "_j": 0, "_k": [0, 4], "_log_geometric_mixtur": 4, "_m": 28, "_n": [9, 11, 23, 28], "_p": 9, "_q": 9, "_r": 31, "a_": [0, 11, 15, 22, 29], "a_1": [0, 3, 13, 15, 26, 28, 29], "a_2": [0, 3, 13, 15, 26, 28, 29], "a_3": [3, 26], "a_4": 26, "a_c": 0, "a_h": 11, "a_i": [13, 24, 26, 29], "a_k": [0, 3, 26], "a_n": [0, 3, 6, 13, 15, 22, 31], "aa": [26, 28], "aaa": 26, "aaai": 11, "aab": 28, "aabb": 26, "aapo": 8, "ab": [7, 19, 26, 28], "aba": [26, 28], "abab": 26, "abb": 28, "abba": 26, "abbrevi": 15, "abl": [2, 7, 9, 12], "about": [2, 3, 6, 7, 11, 12, 13, 14, 18, 19, 20, 29, 31], "abov": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 26, 28, 31], "absolut": [0, 7, 14, 15, 16, 17, 18, 19, 31], "absorb": [22, 24], "absorpt": 24, "abstract": [10, 28, 31, 32], "abus": 7, "acc": [10, 11], "acccept": 29, "acccess": 4, "acccord": 4, "acceler": 11, "accept": [4, 26], "access": [4, 6, 8, 33], "accord": [4, 6, 7, 9, 14, 21, 26], "account": [4, 29], "accur": 17, "accuraci": [6, 10, 11], "achiev": [4, 6, 8, 10, 11, 13, 19], "acord": 4, "across": [10, 11, 28], "act": 11, "activ": [10, 11, 23], "actual": [4, 10], "ad": [0, 2, 3, 10, 15, 17, 18, 23, 26, 28, 29], "adam": [10, 11], "adapt": [11, 23], "add": [0, 10, 11, 26, 29], "addit": [0, 4, 6, 11, 13, 15, 20, 22, 26, 28, 29], "additv": 3, "address": [4, 11], "adjust": [6, 13], "adopt": 11, "adrian": [0, 1], "advanc": [6, 11], "advantag": 4, "affect": [4, 6, 9, 10], "afford": 6, "aforement": 28, "after": [3, 4, 10, 11, 12, 21, 26, 28, 29], "again": [0, 2, 6, 11, 18, 21, 28, 29, 31], "against": 16, "aggreg": 11, "aginst": 6, "agre": 0, "agreement": 4, "ai": 4, "aidan": 11, "al": [7, 9, 10, 11, 20, 25], "alemi": 11, "alexand": [7, 11], "alexei": 11, "algebra": 0, "algorithm": [6, 7, 9, 26, 28], "ali": 7, "align": [0, 2, 3, 4, 6, 7, 8, 9, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "all": [0, 2, 4, 6, 7, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20, 21, 23, 24, 25, 26, 28, 29, 32], "allevi": 7, "allow": [3, 7, 9, 10, 11, 13, 17, 22, 28, 29, 31], "almost": [0, 8, 11], "along": [0, 1, 8, 10, 28], "alpha": [4, 19, 22, 26], "alphabet": [26, 28, 29], "alreadi": [11, 22, 26, 28, 29, 32], "also": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 31, 32], "altern": [6, 10, 11, 17, 20, 21, 23], "although": [4, 6, 9, 14, 24], "altogeth": 31, "alwai": [0, 2, 4, 7, 8, 10, 12, 13, 14, 19, 28, 32], "ammen": 6, "amount": [4, 7, 10, 11, 20], "an": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 22, 23, 24, 26, 28, 29, 31, 33], "analog": [0, 3, 6, 17, 18, 29, 31], "analogi": 17, "analogu": [17, 18], "analysi": [0, 2, 31], "analyt": [4, 6, 9], "andd": 29, "andrea": 11, "angl": 31, "ani": [0, 1, 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 23, 24, 26, 27, 28, 29, 31, 33], "annealed_log_prob": 4, "annealedimportancesampl": 4, "annot": 28, "anoth": [0, 2, 3, 4, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 21, 26, 29], "answer": [12, 21, 26], "antisymmetr": 4, "anywher": 33, "aperiod": [24, 26], "appart": 14, "appeal": [11, 14], "appear": [4, 6, 28, 29, 31], "append": [0, 4, 6, 10], "appli": [0, 3, 4, 6, 8, 9, 10, 11, 12, 13, 19, 20, 24, 26, 28, 29], "applic": [4, 6, 7, 9, 11, 19, 29], "apply_gradi": [10, 11], "approach": [2, 3, 9, 10, 11, 12, 32], "appropri": [2, 3, 4, 6, 7, 9, 10, 11, 12, 14, 28], "approx": [4, 7], "approxim": [2, 4, 6, 8, 11], "ar": [0, 1, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 33], "arbitrari": [0, 3, 4, 6, 9], "arbitrarili": [0, 3, 4, 26], "archictectur": 11, "architectur": [10, 11], "archtectur": 11, "area": [2, 3, 20], "arg": [4, 10, 11], "arguabl": 10, "argument": [0, 2, 3, 6, 10, 11, 12, 15, 20, 31], "aris": 4, "around": [4, 6, 10, 11, 12, 32], "arrai": [10, 11], "arriv": [0, 3, 6, 7, 8, 9, 14, 19, 20, 21, 22, 24, 28], "arrow": 28, "arsen": 11, "artifici": 11, "arxiv": [9, 11], "as_supervis": [10, 11], "ashish": 11, "asid": [6, 10], "aspect": [9, 16], "assert": [10, 11], "assign": [13, 14], "associ": [6, 12, 13, 14, 32], "assum": [0, 3, 4, 8, 10, 11, 17, 18, 20, 23, 31], "assumpt": [0, 2, 9, 20, 23, 24, 26, 31, 32], "asymmetr": 22, "atent": 10, "atom": [13, 28], "attain": [0, 14], "attend": [10, 11], "attent": 9, "augment": [4, 10, 11, 12], "autodiff": 10, "automat": 11, "autonom": 6, "autotun": [10, 11], "avail": [1, 4, 8, 27], "averag": [7, 8, 11, 14, 21], "avoid": [20, 23], "awai": [0, 10, 31], "ax": [19, 20], "axi": [4, 6, 7, 9, 10, 11], "axler": [0, 1], "b": [2, 3, 6, 7, 10, 11, 12, 13, 14, 17, 18, 19, 20, 22, 23, 26, 28, 29, 31], "b_": [0, 23, 28, 31], "b_0": 28, "b_1": [0, 13, 14, 22, 26, 28], "b_2": [0, 13, 14, 22], "b_a": [6, 17], "b_i": 26, "b_k": [0, 13, 14, 22, 26], "b_m": 28, "b_n": [13, 22, 28, 31], "b_r": 31, "ba": [11, 26], "bab": 26, "back": 10, "backward": 22, "bain": 10, "bake": 11, "ball": [0, 32], "bar": [4, 11, 19, 28], "barrier": 22, "base": [0, 3, 4, 11, 13, 14, 20, 28, 30], "basi": 7, "basic": [3, 13, 17], "batch": [9, 10, 11], "batch_jacobian": 9, "batch_siz": [10, 11], "batchnorm": 11, "bayesian": [4, 9], "bb": 26, "bc": 29, "beacus": 26, "bear": 19, "beat": [4, 11], "becaus": [0, 2, 3, 4, 6, 8, 9, 10, 11, 12, 14, 17, 18, 19, 24, 26, 28, 29, 31, 32], "becom": [4, 10, 11, 21, 28], "been": [0, 3, 4, 9, 11, 26, 28], "befor": [3, 4, 6, 10, 11, 22, 28], "begin": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "behav": [6, 18], "behaviour": [8, 21, 24], "behind": 10, "being": [6, 7, 8, 11, 15, 18, 29], "belief": [13, 14], "belong": 24, "below": [0, 2, 6, 7, 9, 10, 13, 15, 18, 19, 20, 23, 26, 28, 29], "benjamin": 7, "berlin": 6, "bernoulli": 23, "bernt": 6, "best": [2, 6, 17, 26], "bet": 11, "beta": [4, 9, 22], "beta_0": 4, "beta_d": [9, 11], "beta_k": 4, "beta_n": 4, "better": [4, 12, 28], "between": [0, 3, 4, 6, 7, 9, 10, 11, 17, 18, 23, 24, 28, 31, 32], "bevaus": 3, "beyer": 11, "beyond": [3, 11, 27], "bia": [4, 11], "big": [6, 7, 8, 9, 20, 21], "bigcap": [3, 31], "bigcap_": [0, 3, 32], "bigcap_n": 15, "bigcup": [0, 13, 15, 21, 31], "bigcup_": [0, 3, 31, 32], "bigg": [6, 7, 21], "biject": [18, 32], "bimod": 9, "bin": [4, 6], "binari": [0, 3, 23, 26, 28], "birth": 21, "bis06": 7, "bishop": 7, "bit": [4, 7, 26, 28], "bkh16": 11, "black": 6, "blue": [4, 20], "bmatrix": [0, 10, 12, 26], "boldsymbol": 12, "book": [0, 1, 2, 7, 20, 24, 25, 26], "border": 20, "bore": 3, "borel": 0, "born": 21, "borovitskii": 7, "both": [0, 3, 7, 8, 11, 23, 24, 26, 28, 29, 31], "bottleneck": 10, "bottom": [5, 11, 26], "bound": [0, 3, 7, 19, 22, 26], "boundari": [10, 22, 23], "boundedd": 0, "box": 10, "bracket": [6, 8], "branch": 16, "break": [3, 10, 26, 28, 31], "brief": 21, "bring": [9, 10], "broader": 19, "broadli": 9, "bruno": 0, "bt": 29, "bu": 14, "build": [3, 10, 11, 14, 26, 28, 29, 32], "buse": 14, "c": [3, 6, 7, 10, 11, 12, 13, 19, 21, 22, 23, 24, 26, 29, 31], "c_": [0, 21, 23], "c_1": 0, "c_2": 0, "c_k": 0, "c_n": 0, "calcul": [10, 13, 15], "calculu": 13, "call": [2, 3, 4, 7, 8, 10, 11, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "callabl": 4, "cambridg": 30, "came": 3, "can": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 26, 29, 31, 32], "cancel": [4, 19], "cannot": [0, 2, 3, 4, 6, 8, 9, 12, 23, 24, 26, 28, 29], "cao": 10, "cap": [0, 3, 13, 15, 22, 26, 28, 29, 31, 32], "cap_": [0, 3, 31], "capac": 11, "captur": [2, 4, 9, 14, 19, 32], "care": 24, "carl": 7, "carlo": [4, 7, 9], "carri": 26, "case": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "cast": [4, 10, 11], "categor": 4, "categori": 9, "categoricalaccuraci": [10, 11], "categoricalcrossentropi": [10, 11], "cauchi": [0, 7], "cdot": [0, 3, 6, 9, 14, 18, 23, 31], "center": [0, 6, 32], "central": [3, 4, 7, 9, 12, 14], "certain": [2, 3, 6, 7, 10, 13, 14, 16, 19, 21, 28, 32], "cgg": 29, "chain": [4, 9], "chain_histori": 4, "challeng": [8, 9, 12, 28], "chang": [3, 4, 6, 9, 10, 11, 19, 28], "channel": 10, "channels_last": [10, 11], "chapman": 24, "chapter": [3, 7, 13, 15, 20, 21, 28, 32], "charact": 11, "characterist": [0, 3, 23], "cheap": [7, 9], "chebyshev": 19, "check": [3, 26], "chelsea": 11, "chi_": 0, "choic": [6, 10], "choos": [0, 6, 14, 22, 23, 26, 31], "chosen": [0, 4, 6], "christian": 11, "christoph": 7, "chua": 30, "chunk": 10, "circ": [0, 3, 28, 32], "circl": 28, "circumv": [3, 4, 19], "clarendon": [20, 25], "clarifi": [7, 12], "class": [0, 3, 4, 10, 11, 19, 24, 26, 28, 29], "class_token": 11, "classic": 4, "classif": 11, "classs": 0, "clear": [3, 7, 11], "clearli": [4, 12, 19], "close": [0, 2, 4, 6, 7, 8, 9, 10, 11, 13, 24, 26, 28], "closer": [4, 9], "closur": [0, 6, 26], "clss": 3, "clump": 7, "cnn": [10, 11], "co": [6, 7, 19], "coars": 32, "code": 10, "coeffici": [7, 9, 16], "coin": 14, "collect": [0, 5, 10, 11, 13, 21, 26, 31, 32], "collectioin": 14, "color": 4, "colour": 10, "column": [10, 11, 12, 26], "combin": [0, 7, 10, 11, 13, 20, 26], "combine_window": 10, "come": [0, 3, 4, 7, 13, 20, 28], "command": 10, "comment": 0, "common": [0, 4, 19, 20, 21, 31], "commonli": [4, 6], "commun": 24, "compact": [7, 11, 32], "compar": [4, 6, 11, 17, 26], "comparison": 10, "complement": [0, 3, 13, 26, 31], "complet": [0, 3, 21, 26, 31], "complex": [6, 7, 10, 28], "complic": [9, 13], "compon": [6, 11], "compos": 11, "composit": [10, 28], "compris": 11, "comput": [0, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 19, 22, 26, 28, 29], "computation": 9, "concat": 11, "concaten": [6, 7, 10, 11], "concatent": 11, "concav": 19, "concept": [15, 28, 32], "concern": [2, 11, 15], "concis": 11, "conclud": [0, 2, 3, 22, 26, 31, 32], "conclus": [0, 26], "concret": 11, "condens": 25, "condit": [0, 6, 8, 10, 12, 15, 19, 20, 21, 22, 23, 24, 28, 29], "condition": [13, 22], "conert": 11, "confer": [9, 10, 11], "confirm": 8, "confus": 4, "conjectur": 26, "conjug": 4, "connect": [11, 24, 28], "connected": 32, "consecut": [0, 10], "consequ": [13, 16, 19], "consid": [0, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 17, 19, 20, 21, 22, 23, 24, 26, 28, 29, 32], "consider": 18, "consist": [0, 11, 28, 29], "const": 8, "constant": [0, 6, 7, 8, 11, 14, 17, 19, 20, 22], "constantli": 31, "constrain": [9, 11, 22], "constraint": [9, 22], "construct": [14, 28, 29], "contain": [0, 2, 4, 10, 11, 12, 13, 14, 19, 20, 24, 25, 26, 28, 29], "context": [3, 7, 31], "continu": [0, 6, 7, 11, 13, 14, 19, 21, 28], "contradict": [0, 3, 20, 23, 24, 26, 28, 29, 31, 32], "contrast": [6, 11, 26], "contribut": [4, 6], "conv": [10, 11], "conv2d": [10, 11], "conveni": [3, 9, 15], "converg": [0, 14, 15, 16, 17, 18, 19, 21, 23], "convers": [0, 3, 4, 10, 11, 26], "convert": [11, 26, 29], "convert_to_tensor": [4, 9], "convex": 21, "convolut": [10, 11, 17], "copi": 26, "core": 11, "correct": [4, 6], "correspond": [0, 4, 6, 7, 9, 10, 11, 12, 13, 19, 20, 23, 24, 26, 28, 29, 31], "correspondng": 28, "cost": [4, 7, 10, 12], "could": [0, 2, 4, 6, 11, 12, 13, 14, 19], "couldn": 12, "count": [3, 10, 14, 15], "countabl": [0, 13, 14, 17], "counterexampl": 0, "counterpart": [3, 17, 18], "coupl": [11, 29], "cours": [4, 27, 28, 30, 31], "cov": [7, 19], "covari": [7, 11], "cover": [0, 11, 27], "coverg": 20, "cpu": 7, "creat": [4, 10, 11, 28], "criterion": [20, 28], "cross": [0, 11], "crucial": [3, 4, 29], "cubic": [7, 26], "cumsum": 6, "cumul": [4, 6], "cup": [0, 2, 3, 13, 24, 26, 28], "cup_": [0, 3, 31], "current": [1, 4, 11], "curv": 2, "custom": 23, "cut": [10, 12], "cvf": 10, "cyan": 20, "cyclic": [10, 26], "d": [3, 4, 6, 7, 9, 10, 11, 12, 14, 16, 17, 18, 20, 21, 23, 24, 25, 26, 31, 32], "d_": [0, 4, 11], "d_1": [0, 3], "d_2": [0, 3], "d_i": 24, "d_k": 3, "d_u": 0, "d_v": [0, 31], "d_w": 0, "d_x": 31, "d_y": 31, "darker": 20, "data": [7, 8, 10, 11, 14], "data_dir": [10, 11], "data_format": [10, 11], "datapoint": 7, "dataset": 7, "david": 7, "dayan": 8, "dbk": 11, "ddot": 0, "deal": [0, 9, 16, 17, 21, 24], "decai": 7, "decent": 9, "decim": 0, "decis": 28, "declar": 14, "decreas": [0, 9, 17, 21, 23, 29], "deep": [10, 11], "deeper": 11, "def": [2, 4, 6, 7, 9, 10, 11], "defeat": 7, "defer": 11, "defin": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 23, 24, 26, 28, 29, 31, 32], "definit": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 21, 22, 23, 24, 25, 28, 29, 31, 32], "defint": 12, "defnit": 31, "deform": 32, "degre": [7, 17], "dehghani": 11, "deisenroth": 7, "delet": 29, "delta": [0, 2, 3, 6, 7, 17, 19, 20, 26, 28, 29, 31], "delta_": [0, 24, 26], "delta_1": [26, 28], "delta_2": [26, 28], "delta_a": 26, "delta_b": 26, "delta_t": 0, "delta_x": 3, "demo": [5, 10, 11], "denot": [0, 4, 7, 8, 9, 10, 11, 13, 14, 16, 19, 26, 28], "dens": [10, 11], "densiti": [4, 7, 8, 9, 19], "depend": [6, 8, 9, 10, 11, 12, 15, 24], "deriv": [0, 2, 4, 7, 8, 10], "descend": 21, "describ": [3, 6, 11, 17, 18, 21, 23, 26, 28], "descript": [0, 11, 26], "desir": [3, 10], "despit": 3, "det": 9, "detail": [3, 4, 7, 10, 11, 17], "determin": [0, 9, 11, 14, 16, 17, 19, 22, 23, 26, 31, 32], "determinist": [6, 28], "develop": 2, "deviat": [6, 11], "devis": 9, "dexter": 30, "dfa": [26, 28, 29], "di": 21, "diag": 7, "diagon": 3, "diagram": [20, 28], "did": [0, 6], "die": 13, "diff": [7, 9], "differ": [0, 4, 6, 7, 10, 11, 15, 20, 22, 28, 29, 31, 32], "differenti": [8, 10, 11, 19, 23], "difficult": [9, 10, 32], "difficulti": [7, 19], "diffus": 6, "digit": 0, "digress": 21, "dilat": [0, 10], "dilin": 9, "dimens": [7, 10, 11], "dimension": [9, 10, 11, 12], "diminish": 6, "direct": [0, 3, 4, 8, 12, 16, 20], "directli": [2, 4, 6, 8, 11, 14, 18, 31, 32], "dirk": 11, "discontinu": 2, "discrep": 9, "discret": [11, 16, 17, 18, 21, 31], "discretis": 6, "discrimin": 31, "disctinct": 6, "discuss": [2, 3, 6, 11, 12, 15], "disguis": 32, "disjoint": [0, 13, 22, 26, 29, 32], "dissimilar": 4, "distanc": [0, 31], "distinct": [3, 26, 28, 32], "distinguish": [10, 11, 26], "distrbut": 23, "distribut": [4, 6, 7, 9, 12, 19, 21, 23], "diver": 4, "diverg": [4, 9, 19, 24], "divers": 11, "divid": [0, 10, 11, 23, 24], "divis": [10, 11, 13, 23, 26], "dk": 9, "dlogp": 9, "dmitri": 11, "do": [0, 2, 3, 4, 6, 8, 9, 10, 11, 12, 19, 26, 28, 29], "doe": [0, 3, 4, 6, 7, 8, 9, 11, 12, 14, 19, 20, 21, 22, 24, 26, 28, 29], "domain": [0, 2, 3, 6, 19, 31], "domin": 4, "don": 12, "done": [0, 28], "dosovitskii": 11, "dot": [0, 2, 3, 4, 11, 12, 19, 20, 26, 28, 29, 32], "dot_product": [10, 11], "doubl": [10, 28], "doubli": 3, "down": [2, 4, 7, 11, 12, 26, 31], "downarrow": 2, "downstream": [4, 15], "downward": 10, "draw": [4, 6, 7, 28], "drawn": [4, 7], "drift": 6, "dropdown": 10, "dropout": [10, 11, 26], "dt": [6, 14, 19], "dtype": [4, 9, 11], "du": 20, "due": [3, 8, 10, 11], "dure": [11, 23, 26], "dv": 6, "dw": 6, "dw_": 6, "dw_t": 6, "dx": [4, 6, 8, 9, 17, 18, 19, 31], "dx_": 4, "dx_0": 4, "dx_k": 4, "dx_t": 6, "dy": [17, 18, 23], "dy_": 12, "dy_t": 6, "dynam": 6, "dz": [9, 17], "e": [0, 2, 3, 4, 5, 6, 7, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29], "e_": [0, 3, 21], "e_0": [3, 21], "e_1": [0, 3, 21], "e_2": [0, 3, 21], "e_j": 3, "e_k": 3, "e_n": [0, 3], "e_t": 21, "each": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 16, 20, 21, 22, 23, 26, 28, 29, 31], "earlier": [0, 3, 4, 13, 28, 32], "earliest": [24, 28], "eas": 11, "easi": [7, 9], "easier": [3, 13, 28], "easili": [4, 8, 9, 10, 16, 17, 28], "ed": 6, "edg": [11, 12, 29], "edward": 7, "effect": 28, "effici": [7, 9, 10], "effort": 0, "einsum": 7, "either": [3, 4, 9, 11, 20, 22, 24, 26, 28], "element": [0, 3, 10, 11, 12, 13, 23, 26, 28, 31, 32], "elementari": [13, 14], "elif": 7, "elimin": [8, 29], "ell": [0, 3, 9], "els": [3, 6], "em": 6, "email": [0, 5, 33], "emb": 11, "embed": 10, "empir": [4, 6, 8, 11], "empti": [3, 13, 26, 28, 31], "emptyset": [0, 3, 13, 17, 28, 31, 32], "en": 0, "enabl": [4, 9, 10, 13], "end": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "endpoint": [0, 2, 3], "enough": [2, 32], "ensur": [0, 4, 9, 11, 13, 29], "enter": [26, 28], "entir": [0, 3, 11, 12, 26, 27, 28], "entireti": [6, 11], "entri": [8, 10, 11, 12, 26], "enumer": 10, "epoch": [10, 11], "eppstein": 26, "epsilon": [0, 2, 3, 7, 9, 20, 21, 26, 28, 29, 31], "eq": [7, 9], "equal": [0, 2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 24, 26, 28, 29], "equat": [0, 3, 4, 10, 11, 13, 19, 20, 21, 22, 23, 24, 26], "equip": [31, 32], "equiv": [0, 29, 32], "equiv_l": 26, "equival": [0, 10, 11, 13, 18, 20, 21, 24, 26, 28, 29, 31], "equivari": 11, "ergod": [4, 24], "error": [0, 1, 4, 5, 7, 33], "especi": 16, "essenti": [26, 32], "establish": [3, 28], "estim": [2, 4, 6, 7], "et": [7, 9, 10, 11, 20, 25], "eta": 21, "euclidean": 31, "euler_maruyama": 6, "evalu": [4, 8, 9, 10, 12, 18], "even": [0, 2, 4, 6, 7, 9, 10, 11, 13, 20, 22, 28, 29, 31], "event": [14, 15, 18, 21, 22, 23, 24], "eventu": 22, "ever": [21, 22], "everi": [0, 2, 6, 8, 10, 11, 14, 17, 19, 21, 24, 26, 28, 29, 31, 32], "everywher": [0, 3], "evolut": [6, 23], "evolv": [9, 21], "exact": [4, 6, 7], "exact_black_schol": 6, "exact_gp_posterior": 7, "exactli": [0, 2, 3, 4, 10, 11, 22, 26, 28, 29], "exampl": [0, 2, 3, 4, 6, 8, 9, 11, 13, 14, 15, 17, 20, 26, 28, 32], "exapl": 10, "excel": 11, "except": [4, 10, 11, 12, 19, 24, 26, 28, 29], "exchang": 2, "exclud": 6, "exclus": [6, 15], "execut": 7, "exercis": [1, 25], "exist": [0, 2, 3, 6, 10, 11, 14, 15, 17, 18, 19, 20, 23, 24, 26, 28, 29, 31, 32], "exp": [4, 6, 7, 9, 11, 17, 19, 20], "expand": [8, 29, 31], "expans": [0, 6, 16, 20], "expect": [2, 3, 4, 6, 7, 8, 9, 19, 21, 22, 24], "expens": [10, 12], "experi": [6, 13], "experiment": [13, 14], "experss": 13, "explain": 0, "explan": 20, "explicitli": [7, 8], "exponenti": [0, 4, 7, 11, 20, 23], "exponeti": 11, "exposit": 11, "express": [0, 2, 6, 7, 8, 9, 10, 11, 13, 14, 15, 17, 18, 21, 22, 26, 31], "extend": [2, 3, 11, 15, 17, 18, 24, 26, 28, 29], "extens": [4, 11, 15, 16], "extent": [11, 19], "extinct": 23, "extract_patch": 10, "extract_window": 10, "extrem": [10, 11], "ey": 7, "f": [0, 2, 3, 4, 6, 7, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 21, 22, 24, 26, 28, 29, 31, 32], "f_": [17, 18, 24, 26], "f_0": 0, "f_1": [0, 2, 3, 9, 26, 28], "f_2": [0, 2, 3, 26, 28], "f_a": 26, "f_b": 26, "f_d": 9, "f_g_black_schol": 6, "f_g_sine": 6, "f_k": [0, 3, 22, 26], "f_n": [0, 2, 22], "f_plot": 4, "f_x": [17, 18], "f_y": [17, 18], "f_z": 18, "facilit": 28, "fact": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 19, 20, 21, 23, 24, 28, 31], "factor": [0, 4, 10, 11, 15, 19], "factoris": 24, "fail": [2, 9], "failur": 14, "faint": 10, "fairli": 11, "fall": [2, 9, 19], "fals": [6, 10, 11], "famili": [9, 21], "familiar": 31, "far": [3, 4, 11, 28], "fast": 11, "faster": [7, 9], "fault": 27, "featur": [10, 11], "feature_dim": [10, 11], "features_data": 7, "features_pr": 7, "fed": 26, "feed": 28, "feedback": 5, "feel": [5, 33], "few": [2, 3, 4, 10, 11, 28], "fewer": [0, 26, 28], "fg": 3, "field": [8, 9], "fig": 10, "figsiz": 4, "figur": 4, "filter": [10, 11], "final": [0, 3, 11, 14, 17, 23, 24, 26, 28], "final_mlp": [10, 11], "find": [0, 8, 9, 16, 19, 26, 32], "fine": 6, "finish": 26, "finit": [0, 2, 4, 6, 7, 9, 13, 20, 24, 26, 29], "fint": 7, "first": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 19, 20, 21, 22, 24, 26, 28, 29, 31, 32], "first_block": 10, "fit": 9, "fix": [0, 3, 6, 10, 11, 26], "flat": 10, "flatten": 10, "flaw": 3, "flexibl": [9, 10, 11], "flip": 19, "flipsid": 4, "float": 4, "float32": [9, 10, 11], "float64": 4, "fn": 7, "focu": [9, 10], "fold": 10, "follow": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 30, 31, 32, 33], "fontsiz": 4, "forget": 14, "form": [0, 2, 4, 6, 7, 9, 11, 17, 18, 19, 20, 26, 28], "formal": [6, 26, 28], "formalis": [8, 19, 29], "format": [25, 33], "former": [9, 20, 29], "formul": 10, "formula": [0, 6, 9, 22], "forward": [4, 10, 11, 22], "found": [6, 10, 11, 20, 25], "four": [2, 3, 10, 31], "fourier": 19, "fourth": [6, 31], "foward": 22, "frac": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 31], "framework": 10, "free": [5, 33], "freedom": [7, 17], "freeli": [1, 27], "frequenc": [7, 11], "frequent": 2, "from": [0, 2, 4, 5, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32, 33], "from_logit": [10, 11], "front": 17, "fsa": 26, "full": 11, "fun": 6, "func": 24, "funciton": 29, "function": [0, 4, 7, 8, 9, 10, 11, 22, 23, 24, 26, 28, 29], "fundament": [3, 13], "funtion": 0, "further": [0, 2, 3, 4, 7, 9, 10, 11, 17, 18, 19, 21, 22, 24, 28, 32], "furthermor": [0, 24], "futur": [6, 21, 23], "g": [0, 3, 4, 6, 9, 10, 11, 14, 15, 17, 18, 19, 20, 21, 23, 24, 25, 26, 28, 29, 31, 32], "g_": [16, 21, 28], "g_0": 21, "g_1": [0, 3, 16, 21, 26, 29], "g_2": [0, 16, 26, 29], "g_d": 9, "g_j": 3, "g_k": [0, 28], "g_n": [0, 3, 16], "g_t": 21, "g_x": 16, "g_y": 16, "gambl": 14, "gambler": 22, "game": 14, "gamma": [6, 29], "gamma_": 29, "gamma_d": 11, "gamma_k": 4, "gap": [0, 7], "gaussian": [4, 6, 7, 8, 12], "gaussiantransitionkernel": 4, "gave": 26, "gca": 4, "gcd": 24, "gear": 4, "gelli": 11, "gelu": [10, 11], "gen": 24, "gener": [0, 2, 3, 4, 7, 8, 9, 10, 11, 12, 13, 15, 17, 18, 20, 22, 23, 24, 28, 31, 32], "generalis": [3, 11, 31], "geoffrei": [7, 11], "geometr": [21, 24], "georg": 11, "geq": [0, 3, 4, 7, 13, 14, 16, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "get": [0, 4, 6, 7, 9, 10, 11, 19, 26, 28, 33], "get_batch": [10, 11], "give": [0, 2, 3, 4, 7, 9, 10, 11, 12, 13, 14, 17, 21, 22, 26, 28, 31, 32], "given": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "global": 12, "glorotnorm": [10, 11], "gloss": 11, "gnfa": 28, "gnn": 11, "go": [0, 1, 4, 7, 8, 11, 23, 24, 25, 26, 31], "goe": [3, 6, 20, 26], "goldwas": [0, 1], "gomez": 11, "good": [2, 9], "got": 4, "govern": 6, "gp": [7, 12], "grad": 6, "grad_i": 10, "gradient": [8, 10, 11], "gradienttap": [9, 10, 11], "gradual": 4, "grai": 10, "grain": 6, "grant": 14, "graph": [10, 11, 12, 21], "great": [9, 25], "greater": [26, 28, 31], "greatli": 4, "green": [4, 20], "gri20": 7, "grid": 10, "grimmett": [7, 20, 23, 24, 25], "group": [1, 10, 11], "groupnorm": 11, "gs01": 20, "guarante": [9, 12, 28, 29], "guo": 10, "gww86": [20, 25], "h": [0, 3, 4, 6, 9, 10, 11, 12, 18, 23, 24, 26], "h_": [0, 6, 11, 12], "h_1": 0, "h_2": 0, "h_i": 24, "h_j": 24, "h_k": 0, "h_t": 6, "h_x": 0, "ha": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 14, 15, 16, 17, 18, 19, 20, 21, 23, 24, 25, 26, 28, 29], "had": [4, 18], "half": [3, 10], "han": 10, "hand": [0, 2, 3, 12, 17, 19, 23, 29, 31], "handi": [11, 31], "handl": [2, 4, 7, 9, 10, 29], "happen": 10, "hast": 4, "hat": [0, 24], "have": [0, 1, 2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32, 33], "he": 11, "head": [10, 12, 14], "heavi": 9, "heidelberg": 6, "height": [0, 10, 11], "heigold": 11, "hein": 0, "help": [4, 10, 28], "helper": [7, 10, 11], "henc": 22, "here": [2, 4, 5, 6, 9, 10, 11, 12, 14, 19, 26, 31, 32], "hereaft": 4, "hierarch": 10, "hig01": 6, "high": [0, 4, 7], "higham": 6, "higher": 16, "highli": [3, 10], "highlight": [3, 4, 6, 11], "hilbert": 9, "hinton": 11, "hist": 4, "histogram": 4, "histori": 4, "hit": 24, "hold": [0, 2, 3, 4, 6, 8, 10, 14, 17, 19, 20, 21, 23, 24, 28, 29, 31], "holmogorov": 24, "home": 26, "homogen": 24, "homogeneu": 24, "hope": 3, "hopefulli": [9, 20], "horizont": 10, "hot": [10, 11], "houlsbi": 11, "how": [4, 6, 7, 9, 10, 11, 18, 21], "howev": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 22, 24, 26, 29, 31, 32], "hspace": 24, "hu": 10, "hundr": 0, "hwc": 10, "hypothesi": 3, "hypothesis": 8, "hyv": 8, "hyvarinen": 8, "hyvarinend05": 8, "hzrs15": 11, "i": [0, 1, 4, 5, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 29, 31, 33], "i_": [0, 3], "i_1": [0, 3, 24], "i_2": [0, 3, 24], "i_j": 3, "i_k": 3, "i_n": [0, 3, 24], "i_x": 32, "ib": 30, "idea": [2, 3, 4, 9, 10, 14, 20, 28, 29, 31, 33], "ideal": 2, "idealis": 8, "ident": [7, 9, 11, 14, 16, 20, 31, 32], "ieee": 10, "iff": [3, 13, 16, 20, 23, 24, 26], "ij": [12, 26], "ijk": 10, "illia": 11, "illustr": [2, 4, 7, 9, 10, 20], "im": [0, 14, 15], "imag": [0, 10, 11, 14, 26, 32], "image_tokenis": [10, 11], "imagetokenis": [10, 11], "imagin": [6, 11], "immedi": 3, "impact": [4, 10, 11], "implement": 6, "impli": [0, 2, 3, 4, 7, 8, 10, 18, 21, 22, 23, 24, 26, 29, 32], "implic": [6, 20, 24, 31], "impoprt": 4, "import": [2, 3, 6, 9, 10, 11, 12, 14, 19, 20, 28, 31], "improv": [4, 11, 26, 33], "inaccess": 8, "incept": 11, "includ": [0, 1, 2, 3, 6, 10, 11, 14, 15, 20, 26, 28], "inclus": [3, 15], "inconsist": 12, "incorpor": 11, "increas": [0, 2, 4, 7, 9, 10, 11, 17, 20, 28], "increment": [0, 28], "inde": [0, 31], "independ": [3, 6, 7, 8, 9, 10, 14, 20, 22, 23, 24], "index": [3, 8, 10, 11, 21, 22, 23, 26, 28], "indic": [10, 11, 20, 28], "indirectli": 32, "indiscern": 31, "indistiguish": 26, "indistinguish": 26, "individu": [0, 11, 26], "induc": [4, 7], "induct": [0, 3, 11, 21, 28], "inequ": [3, 4, 7, 21, 24], "inf": [0, 2, 3, 17, 19, 20, 23, 24], "inf_": [0, 2], "inf_p": [0, 2], "infer": 9, "infimum": 0, "infimuma": 3, "infin": 3, "infinit": [0, 2, 6, 12, 19, 20, 24, 26, 28, 29], "infinitesim": [6, 14], "inform": [0, 6, 10, 11, 12, 13, 14, 19], "infrequ": 4, "infti": [0, 2, 6, 8, 14, 17, 18, 19, 20, 21, 22, 23, 24, 31], "infty_": [0, 3, 13, 16, 17, 18, 19, 21, 22, 23, 24], "ingredi": [10, 11], "inher": 29, "inhomogen": 24, "init": 28, "initi": [4, 6, 9, 10, 11, 21, 22, 23, 24, 26, 28], "initial_distribut": 4, "initialis": [4, 9, 11], "inner": [9, 11], "innov": [10, 11], "input": [7, 10, 11, 12, 26, 28, 29], "insid": 0, "inspect": 16, "inst": 4, "instanc": [6, 11], "instancenorm": 11, "instead": [0, 3, 4, 6, 9, 10, 11, 12, 14, 18, 20, 28, 31, 32], "int": [0, 4, 6, 7, 8, 9, 10, 11, 12, 17, 18, 19, 20], "int32": 4, "int_": [0, 31], "int_0": [0, 2, 17, 31], "int_a": [0, 2], "int_b": 0, "int_c": 0, "integ": [0, 2, 11, 16, 22, 23, 26, 28, 29], "integr": [0, 3, 4, 7, 8, 9, 12, 17, 18, 19, 23], "integrand": [4, 6, 7], "intellig": 11, "inter": 11, "interact": 11, "interest": [3, 4, 6, 7, 9, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 32], "interestingli": 3, "intergr": 4, "interleav": [10, 11], "intermedi": [2, 3, 31], "intern": [9, 10, 11], "interpol": [4, 19], "interpret": 7, "intersect": [0, 13, 21, 26, 28], "interv": [0, 2, 6, 14, 17, 21, 23], "intesect": 21, "intial": 4, "intialis": 4, "intra": 11, "intract": [4, 8, 9], "introduc": [2, 3, 4, 7, 11, 15, 20, 22, 28, 29, 31], "introduct": [6, 10, 20, 26], "intuit": [3, 4, 8, 15, 19, 20, 28, 31], "invari": [0, 4, 7, 19], "invers": [0, 4, 10, 12, 32], "invert": [10, 12, 18], "inverv": 3, "involv": [4, 6, 9, 11, 15, 19, 20, 29, 31], "ioff": 11, "irrat": 0, "irreduc": 24, "irrelev": 8, "irrespect": [0, 11, 24], "is15": 11, "isbn": 6, "isol": 11, "issu": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 19, 26, 27, 28, 29, 30, 31, 32, 33], "itax": 19, "itb": 19, "iter": [4, 9, 21], "iti": 19, "its": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 26, 28, 29, 32], "itself": [0, 4, 6, 9, 10, 11, 16, 20, 26, 28, 29, 31, 32], "itx": 19, "itz": 19, "iu": 23, "iub_": 23, "ixi": 29, "iz": 29, "j": [0, 2, 3, 6, 10, 18, 20, 24, 25, 26, 28], "j_1": [0, 3], "j_2": [0, 3], "j_n": [0, 3], "jacobian": 9, "jakob": 11, "jame": 7, "jami": 11, "jason": 9, "jensen": 4, "jian": 11, "jimmi": 11, "jit_compil": 4, "job": 11, "join": [26, 28], "joint": [0, 4, 11, 12], "jointli": 18, "jone": 11, "jordan": 9, "journal": 8, "just": [3, 4, 6, 10, 11, 12, 17, 20, 28], "justif": 0, "justifi": 31, "k": [0, 2, 3, 4, 7, 9, 10, 11, 12, 14, 16, 19, 21, 22, 23, 24, 26, 28, 29], "k1": 29, "k2": 29, "k_": [0, 11], "k_1": [0, 3], "k_2": [0, 3], "k_i": 24, "k_j": 24, "k_n": [0, 3], "kaim": 11, "kaiser": 11, "kdd": 7, "keep": [10, 11, 26, 33], "keepdim": [10, 11], "kei": [3, 10, 11, 19, 28, 31], "kera": 10, "kernel": [4, 7, 9, 11], "kernel_initi": [10, 11], "kernel_s": [10, 11], "key1": [10, 11], "key2": [10, 11], "kick": 6, "kind": [3, 4, 10, 11, 28, 29], "kiro": 11, "kl": 4, "know": [0, 26, 29], "known": [6, 10, 11, 14, 16, 26], "kolesnikov": 11, "kolmogorov": 24, "kpd": 7, "kpp": 7, "kullback": 4, "kwarg": [10, 11], "l": [0, 2, 3, 24, 26, 28, 29], "l1": 7, "l2": 7, "l_0": 23, "l_1": 3, "l_2": 3, "l_n": 3, "l_t": 23, "l_w": 19, "label": [4, 10, 11, 28], "lack": 3, "lambda": [6, 14, 16, 17, 19, 20, 22, 23, 24, 31], "lambda_": 24, "lambda_i": 24, "lamda": 6, "langl": [9, 31], "languag": 26, "laplac": 7, "laptop": [10, 11], "larg": [0, 4, 7, 9, 10, 11, 12, 26], "larger": [0, 4, 10, 19, 26, 28], "last": [0, 2, 3, 4, 6, 8, 10, 14, 15, 17, 28, 29], "lastli": [0, 2, 3, 7, 14, 24, 29], "later": [3, 11, 28, 29, 31], "latter": [8, 20, 29], "law": [17, 21], "layer": 10, "layernorm": [10, 11], "lceil": 0, "ldot": [0, 2, 3, 31], "lead": [0, 2, 3, 6, 14, 24, 26], "learn": [4, 7, 8, 9, 10, 11], "learnabl": 11, "learning_r": [10, 11], "learnt": 11, "least": [0, 2, 3, 4, 5, 8, 21, 24, 26, 28], "leav": [0, 4, 14], "lebesgu": [0, 2], "lectur": 27, "lee": 9, "left": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 31], "leftarrow": 31, "leftrightarrow": 24, "legend": 4, "lei": 11, "leibler": 4, "lemma": [0, 4, 24, 26, 31], "lempitski": 11, "len": 10, "lenght": 3, "length": [0, 2, 11, 26, 28, 29, 31], "lengthscal": [7, 9], "leq": [0, 3, 4, 6, 7, 9, 13, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "less": [0, 2, 3, 6, 11, 19], "let": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "level": [0, 6, 19], "leverag": 10, "like": [1, 2, 3, 7, 9, 11, 14, 17, 18, 27], "likelihood": 8, "lim_": [0, 2, 3, 6, 18, 20, 21, 22, 23, 24], "limit": [0, 6, 7, 10, 11, 12, 21, 23, 24, 28], "lin": 10, "linalg": 7, "line": [0, 3, 4, 10, 19, 21, 23, 24, 31], "linear": [6, 7, 10, 11, 19], "linearli": [0, 10, 11], "linspac": [4, 6, 7], "list": [0, 2, 3, 4, 10], "liter": 28, "literatur": 5, "littl": [4, 11, 31], "liu": [9, 10], "ll": [4, 6, 10, 11], "llc": 10, "llion": 11, "llj16": 9, "ln1": [10, 11], "ln2": [10, 11], "load": [4, 10, 11], "loc": [4, 7], "local": [9, 10, 11, 14], "locat": [7, 9, 11], "log": [4, 8, 9, 20, 23], "log_geometric_mixtur": 4, "log_prob": 4, "log_prob_1": 4, "log_prob_2": 4, "log_prob_ratio": 4, "log_w": 4, "logarithm": 20, "logit": [10, 11], "logp": 9, "logprob": 9, "long": [10, 11], "longer": [7, 10, 11, 26], "look": [2, 4, 6, 9, 10, 11, 28, 29, 31], "loop": 26, "loss": [0, 10, 11], "loss_fn": [10, 11], "lot": [3, 4, 25], "low": 7, "lower": [0, 7, 10, 11, 19], "luca": 11, "lukasz": 11, "lw19": 9, "m": [0, 2, 3, 4, 7, 9, 10, 11, 12, 16, 20, 22, 24, 26, 28, 29], "m_": [12, 19, 20], "m_0": 23, "m_1": [20, 26, 28], "m_2": [20, 26, 28], "m_a": 26, "m_b": 26, "m_c": 26, "m_n": 20, "m_t": 23, "m_u": 20, "m_x": 19, "m_y": 19, "machin": [4, 7, 8, 9, 10, 11, 26, 28], "made": [0, 3, 13, 26, 28], "magnitud": 4, "mahtbb": 24, "mai": [3, 4, 6, 7, 9, 10, 13, 14, 16, 17, 18, 19, 20, 26, 28, 29], "main": [3, 10, 11, 28], "maintain": [11, 33], "make": [0, 4, 6, 7, 9, 10, 11, 13, 14, 20, 21, 22, 26, 28, 29, 33], "mani": [0, 2, 4, 6, 9, 10, 11, 14, 17, 19, 26, 31], "manipul": [7, 18], "manner": 23, "map": [0, 9, 10, 11, 12, 13, 14, 18, 19, 26, 32], "map_fn": 4, "marc": 7, "margin": [4, 12, 15, 18, 23], "marginalis": [12, 15], "mark": [10, 28], "marker": 26, "markov": [0, 4, 8, 9, 20], "mass": 16, "master": 12, "match": [7, 16], "math": [4, 10, 11], "mathbb": [0, 2, 4, 6, 7, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 31, 32], "mathbf": [0, 12, 24], "mathcal": [0, 3, 4, 6, 7, 9, 10, 12, 13, 14, 15, 16, 17, 18, 21, 22, 23, 28, 29, 32], "mathemat": [11, 30], "mathtild": 3, "matmul": [10, 11], "matric": [10, 11, 12, 24], "matrix": [0, 10, 11, 12, 24], "matter": [0, 6], "matthia": 11, "max": [0, 4, 31, 32], "max_": 31, "maxima": 31, "maximis": [9, 19], "maximum": [0, 8, 26, 31], "mbc": 12, "mcmc": 9, "me": [0, 5, 27, 33], "mean": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 16, 24, 26, 28, 29], "mean_pr": 7, "meaning": 20, "meaningless": 13, "meant": [10, 11, 25], "measur": [0, 1, 2, 4, 7, 14, 15, 18, 20, 21], "meeasur": 0, "member": [13, 26], "memori": [10, 28], "merg": 10, "method": [4, 7, 9, 10, 11, 15], "metric": [0, 9, 10, 11, 19, 32], "metropoli": 4, "mhsa": [10, 11], "michael": [9, 27], "middl": [2, 4, 8], "midpoint": 6, "might": [0, 8, 10, 11, 13, 14, 19], "milisecond": 7, "min": [0, 7, 24, 31], "mind": [10, 19], "minder": 11, "mine": 25, "minim": [10, 24], "minimis": [9, 20], "minimum": 0, "minu": [0, 3, 20, 28], "miss": [0, 11, 13], "mistak": [25, 26, 27], "mitig": 10, "mixtur": 4, "mlodozeniec": 0, "mlp": [10, 11], "mlp_num_hidden": [10, 11], "mlp_num_lay": [10, 11], "mnist": [10, 11], "mod": 26, "modal": 10, "mode": [4, 10], "model": [4, 6, 7, 8, 9, 10, 12, 14, 28, 29], "modest": 9, "modestli": 7, "modifi": [3, 4, 10, 26, 28], "modul": 11, "modulo": 17, "moment": [6, 11, 17, 20, 23], "monoton": [0, 23], "mont": [4, 7, 9], "moost": 4, "more": [0, 2, 3, 4, 6, 7, 9, 10, 11, 13, 15, 17, 18, 20, 23, 26, 28, 29, 31, 32], "moreov": 26, "most": [0, 1, 3, 4, 5, 9, 10, 17, 19, 26, 31], "mostafa": 11, "mostowski": 7, "motiv": [3, 4, 11, 32], "mould": 4, "move": [0, 3, 26, 29], "mu": [0, 3, 6, 10, 11, 12, 16, 17, 19, 20, 21, 23], "mu_1": 4, "mu_2": 4, "mu_i": 24, "mu_q": 4, "much": [4, 6, 7, 9, 10, 11, 18, 19, 28], "multi": 10, "multi_head_self_attent": [10, 11], "multiheadselfattent": [10, 11], "multipl": [0, 9, 10, 11, 15, 26, 29], "multipli": [10, 19, 24], "multivari": 12, "must": [0, 3, 4, 6, 8, 10, 11, 14, 20, 23, 24, 26, 28, 31], "my": [1, 12, 26, 27], "myhil": 28, "myself": 33, "n": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 19, 20, 21, 22, 23, 24, 26, 28, 31, 32], "n_": [0, 8, 14, 15, 20, 22, 23], "n_0": 23, "n_1": [0, 12, 15], "n_2": 15, "n_i": [0, 12], "n_k": 0, "n_l": 12, "n_t": 23, "nabla": [8, 9], "nabla_": 9, "nabla_x": 9, "naiv": 3, "name": [0, 3, 6, 7, 10, 11, 13, 28, 29], "nat": 4, "natur": [3, 13, 14, 21], "nd": 7, "nea01": 4, "neal": 4, "neat": [10, 21], "necessari": 21, "necessarili": [3, 11], "need": [0, 2, 3, 4, 6, 10, 11, 12, 14, 17, 18, 24, 26, 28, 31, 32], "neg": [3, 4, 7, 9, 17, 19, 20, 21, 24, 31], "neglect": 6, "neighbour": 11, "neighbourhood": [10, 19, 20], "neil": 11, "neither": [0, 26], "neq": [0, 3, 6, 8, 10, 19, 21, 22, 23, 24, 26, 28, 29, 31, 32], "nerod": 28, "nest": 4, "network": [9, 10, 11], "neural": [9, 10, 11], "never": [12, 13], "nevertheless": 12, "new": [6, 21, 26, 28, 29], "next": [2, 3, 4, 6, 7, 10, 11, 15, 20, 29, 31], "next_annealed_log_prob": 4, "next_x": 4, "nfa": [26, 29], "nh": 23, "nice": [3, 6, 33], "niki": 11, "nip": 7, "nk": 26, "nm": 7, "nn": [4, 10, 11], "noam": 11, "node": [11, 12, 29], "nois": [6, 7], "noisi": 7, "nomad": 21, "non": [0, 3, 4, 7, 8, 10, 13, 16, 17, 19, 20, 21, 23, 24], "nonaddit": 0, "nondecreas": 23, "nondetermin": 29, "nondeterminist": 26, "none": [6, 7, 9, 10, 11, 28], "nonempti": [0, 2, 3, 28], "nonlinear": 11, "nontrivi": 0, "nonzero": [0, 3], "norm": 8, "normal": [4, 6, 7, 8, 10, 11, 20], "normalis": [8, 14, 17, 23], "notat": [2, 6, 7, 9, 11, 12, 16, 22, 23, 26, 28], "note": [0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 11, 12, 13, 14, 17, 18, 19, 20, 23, 24, 25, 26, 27, 28, 30, 31], "notic": 8, "notin": [3, 31], "notion": [3, 6, 11, 28, 29, 31, 32], "now": [0, 3, 4, 6, 7, 8, 9, 10, 11, 12, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "np": [4, 6, 7], "nq": 14, "nu": 0, "null": 24, "num_block": 11, "num_block_pair": 10, "num_blocks_per_stag": 10, "num_class": [10, 11], "num_data": 7, "num_epoch": [10, 11], "num_featur": 7, "num_funct": 7, "num_head": [10, 11], "num_hidden": [10, 11], "num_lay": [10, 11], "num_mlp_hidden": [10, 11], "num_mlp_lay": [10, 11], "num_out_featur": 10, "num_output": [10, 11], "num_sampl": 4, "num_stag": 10, "num_step": 4, "number": [0, 2, 3, 4, 6, 7, 9, 10, 12, 14, 18, 19, 22, 23, 24, 26, 28, 29, 31], "nz": [26, 28], "o": [6, 7, 10, 14, 20, 23, 26], "obei": 7, "oberv": 12, "object": [13, 31], "observ": [3, 4, 6, 7, 8, 9, 12, 13, 14, 16], "obtain": [0, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "occcur": 29, "occur": [4, 9, 13, 14, 15, 22, 23, 26, 29], "occurr": 29, "od": 6, "odd": 22, "odot": 11, "off": [4, 11, 30], "often": [3, 4, 6, 13, 14, 15, 16, 17, 18, 19, 20, 32], "oks92": 6, "oksend": 6, "old": 28, "omega": [0, 6, 7, 13, 14, 15, 16, 17, 18, 19, 21, 24], "omega_1": [7, 11], "omega_d": 11, "omega_m": 7, "omega_shap": 7, "ommit": [6, 20], "onc": [0, 2, 3, 4, 11, 13, 14, 19, 23, 26, 29], "one": [0, 2, 4, 7, 8, 9, 10, 11, 12, 13, 14, 18, 19, 20, 21, 22, 24, 26, 28, 29], "one_hot": [10, 11], "ones": [5, 11, 26, 28, 29], "onli": [0, 2, 3, 4, 7, 8, 9, 10, 11, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "onlin": [1, 27, 33], "onto": 29, "onward": 6, "opeart": 10, "open": [0, 2], "oper": [10, 11, 26], "opos": 6, "oppos": 7, "opposit": [0, 3, 11], "optim": [10, 11], "optimis": [8, 9], "optimum": 9, "option": [3, 7, 10, 11], "orang": 20, "order": [0, 2, 10, 11, 16, 20, 26, 28, 29], "ordinari": 6, "organ": 23, "origin": [3, 4, 10, 11, 20, 26, 29], "original_shap": 10, "other": [0, 2, 4, 5, 6, 7, 8, 10, 11, 12, 14, 16, 18, 19, 21, 22, 23, 24, 26, 28, 29], "otherwis": [0, 2, 3, 4, 15, 17, 18, 19, 20, 22, 26, 28, 31], "otim": 0, "our": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 23, 28], "out": [3, 4, 6, 8, 9, 10, 11, 12, 20, 27, 28, 29, 31], "out_feat": [10, 11], "outcom": [13, 14, 17, 18], "outer": [0, 2], "outgo": 28, "outlin": 2, "output": [10, 11, 28], "outsid": [10, 26], "over": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 15, 18, 19, 20, 21, 23, 24, 26, 28, 33], "overal": [4, 10, 11], "overfit": 7, "overlap": 10, "overlin": [0, 31], "overset": 26, "overview": 11, "own": [1, 11, 17, 26], "oxford": [7, 20, 25], "p": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 32], "p_": [0, 8, 9, 15, 23, 24, 26], "p_0": [16, 23], "p_1": [0, 16, 23], "p_2": [0, 16], "p_d": 8, "p_f": 0, "p_g": 0, "p_i": 23, "p_k": [21, 23], "p_n": [0, 16, 23, 26], "p_plot": 4, "p_x": [14, 15, 17], "p_y": 15, "pad": [10, 11], "padhi": [0, 1], "page": [0, 6, 7, 33], "pair": [3, 7, 11, 12, 18, 26, 29, 31, 32], "pairwis": 26, "palindrom": 26, "paper": [4, 6, 7, 11, 28], "parallel": [11, 28], "parallelis": 10, "paramet": [4, 6, 7, 8, 9, 10, 11, 14, 16, 17, 19, 23], "paramt": 14, "parmar": 11, "pars": 29, "parser": 29, "part": [0, 2, 3, 8, 10, 11, 23, 24, 26, 28, 29, 30, 31], "partial": [6, 8, 13, 18, 20, 23, 24], "partial_i": 8, "particl": [6, 9], "particular": [0, 2, 4, 6, 8, 10, 11, 14, 21, 24, 28, 29, 31, 32], "particularli": 14, "partit": [0, 12], "partli": 6, "partricular": 4, "pass": [4, 9, 10, 11, 19, 26], "passag": 24, "past": [6, 23], "patch": [10, 11], "patch_merg": 10, "patch_merging_lay": 10, "patch_siz": [10, 11], "patchmerginglay": 10, "path": [6, 28], "pattern": 7, "pdf": [19, 24], "per": [10, 11], "perfect": 26, "perfectli": 14, "perform": [4, 7, 10, 11, 26], "perhap": [3, 12, 28], "period": 24, "permit": 29, "permut": [10, 11, 26], "perp": [6, 23], "persist": 9, "perspect": 32, "perturb": 6, "peter": [7, 8], "pgf": [21, 23], "phantom": 12, "phi": [7, 9, 19, 20], "phi_": [19, 23], "phi_1": [7, 20], "phi_2": 20, "phi_i": 19, "phi_m": 7, "phi_n": 20, "phi_x": 19, "phi_z": 19, "phrase": [0, 11], "pi": [4, 7, 17, 19, 20, 22, 24, 26], "pi_": [4, 22, 23], "pi_0": [4, 23], "pi_1": [4, 23], "pi_2": 23, "pi_i": [14, 24], "pi_j": 24, "pi_k": [4, 23], "pi_v": 24, "pick": [0, 4, 9, 26], "piec": 10, "pigeonhol": [26, 28], "pin": 7, "pink": 20, "pixel": 10, "place": [2, 3, 5, 7, 13, 26, 31], "pleas": [0, 5, 27, 33], "plot": [4, 7], "plt": 4, "plu": 11, "plug": 6, "pmatrix": 12, "pmf": [16, 17, 18, 21], "pmlr": 9, "point": [0, 2, 3, 4, 6, 7, 11, 12, 14, 19, 20, 21, 22, 26, 28, 29, 32], "polosukhin": 11, "polya": 24, "polynomi": 26, "pool": [10, 11], "pop": 29, "popul": 23, "posit": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 19, 22, 23, 24, 28], "position_embed": [10, 11], "positionembed": [10, 11], "possibl": [0, 2, 3, 4, 7, 9, 11, 12, 13, 14, 22, 28], "possibli": 19, "posterior": [7, 9], "postiv": 9, "power": [0, 3, 11, 21, 28, 29], "powerset": [3, 14], "pq": [14, 22], "practic": [4, 7], "pre": [0, 10, 32], "preceed": 29, "precis": [0, 6, 10, 11, 17, 28], "prefetch": [10, 11], "prefix": 26, "preimag": 31, "preliminari": 0, "preprint": 11, "preprocess": [10, 11], "preprocess_imag": [10, 11], "presenc": 20, "present": [6, 7, 8, 9, 13, 14, 17, 19, 20, 23], "preserv": [0, 4, 28], "press": [7, 20, 25], "pretend": 4, "prevent": 11, "previou": [0, 2, 3, 12, 24, 26, 28, 32, 33], "previous": [15, 18, 29], "primarili": 33, "principl": [4, 26, 28], "prml": 7, "proabil": 21, "probabilist": 8, "probabilit": 24, "probabl": [4, 6, 7, 8, 11, 18, 19, 22, 23, 24], "problem": [0, 3, 4, 9, 10, 12, 15, 16, 26, 28], "problemat": 11, "proce": [3, 4, 14, 22], "procedur": [4, 10, 26, 28, 29], "proceed": [10, 11, 21, 23, 24], "process": [7, 11, 16, 20, 22], "prod_": [15, 23], "produc": [4, 7, 9, 11, 12, 26], "product": [3, 6, 9, 11, 13, 16, 17, 20, 21], "profit": 14, "program": 29, "progress": [1, 9, 11], "prohibit": 7, "project": [10, 11], "projection_dim": [10, 11], "proof": [0, 2, 3, 6, 8, 14, 15, 19, 20, 21, 22, 23, 24, 28, 29, 31, 32], "propag": 10, "proper": 29, "properti": [0, 2, 11, 12, 13, 15, 26, 32], "proport": [9, 20], "propos": 4, "propto": 8, "protein": 11, "prove": [0, 3, 7, 13, 15, 17, 19, 20, 21, 24, 26, 28, 29, 31, 32], "provid": [2, 3, 6, 11, 13, 20, 29], "psi_": 8, "psi_d": 8, "pump": 26, "purpl": 4, "purpos": [6, 7, 9, 11], "purposefulli": 7, "push": 29, "put": [0, 2, 3, 4, 28], "q": [0, 2, 3, 4, 9, 10, 11, 13, 14, 16, 22, 26, 28, 29], "q_": [8, 9, 11, 26, 28], "q_0": [23, 26, 28, 29], "q_0q_1": 28, "q_1": [26, 28], "q_2": [26, 28], "q_3": 26, "q_4": 26, "q_a": [26, 28], "q_b": 26, "q_e": 26, "q_i": [26, 28], "q_j": [26, 28], "q_k": [4, 28], "q_l": 26, "q_m": 26, "q_n": 26, "q_plot": 4, "q_t": 23, "qiang": 9, "quad": 9, "quadrat": [10, 19, 31], "qualiti": 7, "quantifi": 19, "quantiti": [0, 4, 6, 12, 13, 14, 16, 20, 21], "queri": [11, 12], "question": [3, 7, 9, 10, 12, 14], "queu": 23, "quicker": 7, "quickli": [0, 6, 7, 10, 19, 21], "quit": [7, 9], "r": [0, 2, 6, 7, 9, 11, 12, 13, 14, 15, 17, 18, 19, 20, 24, 26, 28, 29, 31], "r_": [0, 6, 7, 12, 28, 29], "r_0": [28, 29], "r_1": [0, 2, 3, 28, 29, 31], "r_1r_2": 28, "r_2": [0, 2, 3, 28], "r_3": 28, "r_4": 28, "r_i": [0, 28, 29], "r_j": 0, "r_k": [0, 2, 3], "r_m": 29, "r_n": [0, 2, 28, 31], "r_w": 19, "rabin": 28, "radford": 4, "radii": 32, "radiu": [0, 21], "radoom": 19, "rahimi": 7, "random": [4, 6, 8, 10, 11, 12, 18, 19, 20, 21, 23, 33], "randomis": 4, "randomli": 7, "rang": [0, 2, 3, 4, 6, 10, 11, 19, 23], "rangl": [9, 31], "rangle_": 9, "rapidli": 9, "ras03": 7, "rasmussen": 7, "rate": [6, 9, 10, 11, 23], "rather": [3, 4, 7, 10, 24, 31, 32], "ratio": [4, 6], "ration": [0, 2, 23], "raw": [4, 11], "rc": 26, "rceil": 0, "re": [7, 10, 11, 29, 31, 32], "reach": [3, 21, 24, 26, 27, 28, 29], "reachabl": 26, "read": [1, 4, 5, 26, 28, 29], "readi": [2, 10, 11, 31], "real": [0, 2, 3, 7, 12, 14, 19, 20, 23, 31], "rearrang": [0, 3, 14, 15, 19, 24, 31], "reason": [0, 4, 11, 12, 13, 14, 19], "recal": [0, 3], "recap": 11, "receiv": [4, 11, 26], "recent": [5, 7, 11], "recht": 7, "recip": 32, "recognis": [26, 29], "recognit": [7, 11], "recogns": 28, "record": 27, "recov": 8, "rectangl": [0, 2], "recur": 28, "recurr": 24, "recurs": [0, 21, 22, 23, 24, 28], "red": [4, 10, 20], "reduc": [2, 4, 7, 9, 10, 11], "reduce_logsumexp": [10, 11], "reduce_mean": [10, 11], "reduce_min": 4, "reduce_sum": 9, "refer": [3, 13, 28, 32], "refin": [0, 2, 6, 11], "reflect": 11, "reflex": [3, 24, 26, 32], "reflexivti": 26, "regard": 9, "regardless": [12, 26], "region": 20, "regressor": 7, "regular": [8, 10, 26, 29], "reject": [4, 26, 28], "rel": [4, 7, 10, 11], "relabel": [0, 3], "relat": [0, 7, 11, 13, 15, 17, 18, 19, 20, 22, 23, 24, 26], "relationship": 11, "relax": 3, "relev": 3, "reli": [3, 7], "remain": [0, 3, 4, 7, 10, 14, 29], "remaind": 32, "remark": 11, "rememb": 6, "remind": 11, "remov": [26, 28, 29], "ren": 11, "repeat": [0, 3, 4, 11, 26, 28, 29], "repeatedli": 28, "repetit": 28, "replac": [0, 3, 8, 26, 29], "repres": [0, 3, 6, 7, 11, 13, 21, 26, 28, 29], "represent": [3, 11], "reproduc": 9, "reproduct": 6, "repsect": 12, "requir": [0, 2, 3, 4, 10, 11, 14, 20, 24, 26, 28, 29, 32], "research": 8, "resembl": 4, "reshap": [10, 11], "resiz": 10, "resnet": 11, "resolv": [7, 9], "resort": [4, 8, 32], "respect": [0, 2, 6, 8, 9, 10, 11, 12, 13, 16, 17, 19, 23, 26, 28, 32], "rest": 26, "restrict": [3, 9, 24], "result": [0, 2, 3, 4, 8, 9, 10, 11, 13, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29, 31, 32], "retain": [10, 11], "return": [4, 6, 7, 9, 10, 11, 22, 24, 28], "reus": 10, "revers": [4, 10, 26], "review": [2, 6, 14], "revisit": 22, "revrs": 10, "rewrit": [0, 4], "rewritten": [8, 19], "rff_posterior": 7, "rho": [19, 23], "rich": 11, "richard": 11, "rieman": [0, 2], "riemann": [0, 3, 6], "right": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 29], "rightarrow": 31, "rightward": 10, "rigor": 14, "rinen": 8, "rise": 31, "rkh": 9, "role": [11, 20], "roll": 10, "root": [19, 21], "rotat": 26, "rough": 17, "roughli": [7, 11], "row": [10, 12, 26], "rr": 7, "ruin": 22, "rule": [4, 12, 17, 28, 29], "rull": 29, "run": [4, 6, 9, 10, 11, 26, 28], "run_chain": 4, "ryan": 11, "s_": [22, 26, 29], "s_0": [22, 29], "s_1": [3, 26, 29], "s_2": [3, 26], "s_i": [14, 26, 29], "s_j": 26, "s_k": [3, 22, 26], "s_m": 29, "s_n": [3, 11, 20, 22, 26, 28], "sa": [10, 11], "sai": [0, 3, 6, 10, 11, 12, 13, 18, 19, 20, 24, 26, 28, 29, 31, 32], "said": [3, 6, 14, 20, 23], "same": [0, 3, 4, 6, 7, 10, 11, 12, 16, 18, 19, 20, 24, 26, 28, 29, 31, 32], "sampl": [8, 9, 12, 17], "sample_rff": 7, "sampler": 4, "samples_and_log_weight": 4, "sandwich": 10, "satisfi": [0, 3, 6, 8, 13, 14, 15, 17, 18, 20, 21, 23, 24, 26, 28, 29, 31, 32], "satisifi": 0, "save": 10, "saw": [2, 32], "scalar": [6, 9, 10], "scale": [0, 4, 7, 9, 10, 11, 17, 19], "schedul": [4, 10, 11], "schole": 6, "school": 7, "scienc": 28, "scott": 28, "script": 6, "second": [0, 2, 3, 6, 8, 10, 11, 19, 20, 21, 23, 24, 26, 28, 29, 31], "second_block": 10, "section": [0, 2, 3, 9, 20, 31], "see": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 14, 16, 19, 20, 21, 22, 23, 24, 26, 28, 31], "seed": [6, 10, 11], "seed1": [10, 11], "seed2": [10, 11], "seek": 9, "seem": 4, "seen": 19, "select": [0, 4, 12, 28], "self": [4, 12, 26], "self_attent": [10, 11], "self_attention_weight": [10, 11], "selfattent": [10, 11], "semidefinit": 9, "senc": 6, "sens": [0, 3, 4, 6, 10, 12, 13, 19, 20, 28, 29], "sensibl": [7, 13, 17], "sentenc": [10, 11], "separ": [3, 7, 9, 11, 14, 26], "sequenc": [0, 2, 4, 9, 10, 11, 14, 16, 20, 22, 23, 24, 26, 28, 29], "sequence_length": 11, "sequnc": 0, "sequnec": [3, 4], "sergei": 11, "seri": [0, 19, 21, 31], "servic": 23, "set": [0, 2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 32], "setminu": [0, 2, 3, 13, 31], "seven": 4, "sever": [3, 9, 10, 13, 15], "sf": 7, "sfd": 7, "sfn": 7, "shaoq": 11, "shape": [4, 6, 7, 9, 10, 11], "share": [6, 10, 11], "shazeer": 11, "sheldon": [0, 1], "shift": 11, "shift_horizontally_and_vert": 10, "shitov": 26, "short": [2, 3, 6, 10], "shorten": 26, "shorthand": [14, 28, 31, 32], "shortli": [3, 6, 7, 31], "should": [0, 3, 4, 6, 10, 11], "show": [0, 2, 3, 4, 6, 7, 8, 12, 13, 14, 15, 16, 19, 20, 21, 23, 24, 26, 28, 29, 31, 32], "shown": [0, 2, 6, 10, 13, 14, 15, 16, 17, 18, 19, 20, 22, 28], "shreya": [0, 1], "shuffl": 26, "shuffle_fil": [10, 11], "siam": 6, "side": [0, 3, 6, 8, 17, 19, 20, 23, 24, 29, 31], "sigma": [0, 7, 11, 17, 19, 20, 21, 26, 28, 29], "sigma_": [12, 21, 28, 29], "sigma_1": [4, 21], "sigma_2": [4, 21, 26], "sigma_3": 26, "sigma_q": 4, "sigma_t": 21, "sigmoid": 4, "sign": [17, 19], "signific": [4, 10, 11], "significantli": [4, 7, 10], "signiticantli": 7, "sim": [0, 3, 4, 6, 7, 9], "similar": [0, 2, 3, 4, 9, 10, 11, 17, 18, 19, 29, 31], "similarli": [0, 2, 3, 4, 20, 23, 24, 26, 28, 29, 31, 32], "simpl": [4, 10, 11, 32], "simpler": 11, "simplest": 14, "simpli": [3, 4, 7, 10, 11, 12, 16], "simplic": 11, "simplifi": [11, 15, 18, 28], "simul": [4, 9], "simultan": 18, "sin": [6, 11], "sinc": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 15, 17, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "singl": [3, 4, 6, 7, 10, 11, 14, 18, 22, 24, 26, 28], "singla": 26, "singleton": [0, 24, 28], "sinusoid": [7, 11], "sipser": [26, 27], "situat": 7, "siva17": 11, "size": [0, 3, 6, 7, 10, 11, 21, 26], "sketch": 6, "skip": 10, "slightli": [7, 11, 12, 13, 26, 31], "slow": 7, "slowli": 28, "small": [0, 4, 6, 7, 9, 10, 11, 32], "smaller": [0, 4, 7, 10, 11, 21], "smallest": [0, 21, 26, 28], "smooth": 9, "sn": 7, "so": [0, 2, 3, 4, 6, 7, 8, 10, 11, 13, 14, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31], "softmax": [10, 11], "solut": [0, 1, 3, 6, 22, 24, 26], "solv": [1, 4, 6, 7, 22], "some": [0, 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 29, 31, 33], "someth": [9, 12], "sometim": [3, 4, 10, 11, 20, 29, 31], "somewhat": [3, 10, 29], "soon": 33, "space": [0, 2, 4, 7, 8, 9, 11, 12, 17, 18, 21, 24, 26, 28], "spars": 7, "speak": 17, "special": [3, 11, 17, 18, 20, 21, 26, 28], "specif": [0, 3, 4, 10, 11, 16, 32], "specifi": [4, 6, 11, 14], "speedup": 7, "spirit": 11, "split": [10, 11, 26], "split_se": [10, 11], "spot": [5, 27, 33], "spread": 14, "springer": [6, 7], "sqrt": [2, 7, 9, 10, 11, 17, 19, 20, 31], "squar": [6, 10, 19, 21], "stabil": [6, 11], "stabilis": 11, "stack": [10, 11, 29], "stackrel": [2, 29], "stage": [10, 11, 26], "stai": 26, "standard": [2, 4, 6, 9, 10, 11, 17, 20], "standard_cauchi": 7, "star": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 26, 27, 29, 30, 31, 32, 33], "start": [4, 7, 13, 22, 24, 26, 28, 29, 32], "state": [3, 6, 16, 17, 19, 20, 23, 26, 29, 31], "statement": [0, 3, 6, 7, 13, 14, 26], "statemet": 14, "stationari": 7, "statisfi": 24, "statist": [4, 8, 9, 14], "statistician": 17, "statur": 11, "steadi": 23, "step": [3, 4, 6, 8, 9, 11, 21, 22, 24, 26, 28, 29], "stephen": 10, "stil": 10, "still": [4, 10, 11, 13, 26], "stirzak": [7, 20, 23, 24], "stochast": [22, 23, 24], "stop": [14, 24], "store": 6, "str": [10, 11], "straightforward": [11, 17], "straightforwardli": [10, 16], "stratonovich": 6, "strength": 11, "strict": [21, 29], "strictli": [17, 19, 20], "stride": [10, 11], "string": [26, 29], "stronger": 7, "structur": [12, 31, 32], "stuck": 9, "studi": [3, 16, 19, 26, 32, 33], "style": [5, 11], "styliz": 11, "su": 19, "sub": [10, 11], "subaddit": 0, "subconsci": 17, "subcov": 0, "subinterv": [0, 2], "submatrix": 12, "subscript": [8, 9, 11, 12, 21], "subsequ": [0, 3], "subsert": 0, "subset": [0, 2, 6, 7, 12, 13, 14, 24, 26, 32], "subseteq": [0, 3, 13, 17, 18, 21, 24, 26, 28, 29, 31, 32], "substitut": [0, 8, 19, 21, 22, 29], "substr": [26, 28, 29], "subtler": 12, "subtract": [3, 23], "success": 14, "succinctli": 29, "sucov": 0, "suffer": 2, "suffici": [2, 6, 9, 21], "suffix": 26, "suggest": [3, 4, 13, 28], "sum": [0, 3, 4, 6, 7, 8, 10, 11, 13, 14, 17, 20, 21, 22, 23, 24, 26, 29, 31], "sum_": [0, 2, 3, 4, 6, 7, 9, 11, 14, 15, 19, 20, 22, 23, 24, 31], "sum_k": 14, "sum_n": [13, 15], "summand": [6, 18], "summari": [11, 14], "summaris": 16, "summat": [0, 11, 22], "summer": 7, "sun": 11, "sup": [0, 2, 3, 19, 20], "sup_": [0, 2, 7], "sup_p": [0, 2], "super": [10, 11], "superscript": 11, "superset": [3, 29], "supos": 26, "supplementari": 6, "suppoos": 0, "support": 7, "suppos": [0, 2, 3, 4, 6, 8, 9, 10, 12, 19, 20, 21, 24, 26, 28, 29, 31, 32], "supremum": 0, "supremuma": 3, "supseteq": [0, 3], "surpris": [3, 21], "surprisingli": [9, 28], "surviv": 21, "svg": 9, "svgd_grad": 9, "svm": 7, "swap": 28, "swin_transformer_block": 10, "swintransformerblock": 10, "switch": 20, "sy": 7, "sylvain": 11, "symbol": [7, 11, 26, 28, 29], "symmetr": [0, 3, 4, 11, 22, 24, 26], "symmetri": [11, 26, 31, 32], "synchronis": 26, "system": [6, 23], "szegedi": 11, "t": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 14, 17, 18, 19, 20, 21, 23, 24, 26, 29], "t_": [0, 6, 23], "t_0": [0, 6, 23], "t_1": [4, 6, 23], "t_2": 6, "t_3": 6, "t_4": 6, "t_i": 24, "t_j": 24, "t_k": [4, 23], "t_n": 6, "ta": [0, 19, 20], "tab": 4, "tabl": 11, "tackl": 10, "taht": 0, "take": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 19, 20, 21, 22, 23, 24, 26, 28, 31], "taken": [2, 11, 14, 18, 21, 23], "talk": 31, "tan": 20, "tape": [9, 10, 11], "target": [4, 9], "target_distribut": 4, "task": [3, 11], "tau_j": 6, "tau_n": 6, "taylor": [6, 16, 20], "tb": [0, 19], "technic": [4, 19], "techniqu": [10, 11], "tell": 4, "ten": [0, 6], "tend": [0, 6, 11, 23], "tensor": [4, 10, 11], "tensorflow": [9, 10, 11], "tensorflow_dataset": [10, 11], "tensorflow_prob": 10, "terenin": 7, "term": [0, 3, 4, 6, 7, 8, 9, 11, 13, 15, 16, 17, 19, 21, 24, 28, 31, 32], "termin": [26, 28, 29], "terminologi": [3, 29], "test": [9, 10, 11], "text": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 31], "textbook": [20, 23, 27], "texttt": [10, 11, 26, 29], "tf": [4, 9, 10, 11], "tfd": [4, 10, 11], "tfk": [10, 11], "tfp": [10, 11], "th": [0, 2, 8, 9, 12, 14, 16, 19, 21, 24, 26], "than": [0, 2, 3, 4, 7, 9, 10, 13, 15, 19, 20, 23, 24, 26, 28, 29, 31, 32], "thatt": 0, "thefor": 0, "thei": [0, 3, 4, 5, 7, 11, 14, 16, 18, 19, 24, 25, 26, 28, 29, 31, 32, 33], "them": [0, 4, 6, 10, 11, 16, 18, 25, 26, 28, 29, 31, 32, 33], "theme": 28, "themselv": [0, 10, 11], "theorem": [0, 2, 6, 8, 21, 22, 23, 24, 28], "theoret": 2, "theori": [3, 16, 20, 26, 28], "therefor": [0, 2, 3, 4, 6, 7, 8, 9, 10, 12, 13, 14, 19, 20, 21, 23, 24, 26, 28, 29, 31, 32], "theta": 8, "thi": [0, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32, 33], "thing": [6, 10, 29, 31, 33], "third": [0, 2, 3, 10, 12, 17, 19, 21, 24, 26, 29, 31], "thoma": 11, "those": [0, 10, 26, 28, 31], "though": [4, 6, 7, 9, 10, 28], "thought": [2, 11], "three": [2, 6, 7, 13, 20, 26, 28, 31], "threfor": 26, "threshold": 19, "through": [0, 1, 9, 10, 11, 12, 19, 25, 26], "thu": [0, 3, 4, 7, 11, 12, 21, 24, 26, 29], "ti_n": 0, "tild": [0, 3, 4], "tile": 11, "time": [0, 3, 4, 6, 7, 9, 10, 11, 12, 14, 21, 22, 26, 28, 29, 31], "tini": 10, "tiny_swin_transform": 10, "tiny_vision_transform": 11, "tinyswintransform": 10, "tinyvisiontransform": 11, "titl": 4, "tmp": [10, 11], "togeth": [0, 2, 3, 4, 22, 23, 24, 26, 28, 32], "toi": 7, "token": 10, "token_dim": [10, 11], "token_dimens": [10, 11], "tokenis": [10, 11], "too": [2, 6, 13, 20, 21], "tool": [15, 16, 19, 20], "top": [5, 7, 8, 9, 10, 11, 12, 26, 29], "toplogi": 31, "topologi": 31, "toss": [13, 14], "total": [7, 10, 14, 21], "toward": [3, 4, 11], "trace": 9, "track": [26, 33], "tractabl": [4, 9], "trahtman": 26, "trail": 26, "train": [7, 8], "train_step": [10, 11], "trainable_vari": [10, 11], "transfer": 33, "transform": [4, 7, 19], "transformer_block": 11, "transformerblock": [10, 11], "transienc": 24, "transient": [22, 24], "transit": [3, 4, 6, 22, 24, 26, 28, 29, 32], "transition_kernel": 4, "transition_scal": 4, "transitionkernel": 4, "translat": [0, 7], "transpos": 10, "transpose_a": [10, 11], "transpose_b": [10, 11], "treat": [11, 17, 21], "tree": 29, "trial": [4, 14, 22], "triangl": [0, 31], "trick": [10, 11], "tricki": 3, "tripl": 3, "triplet": 13, "tripo": 30, "trivial": [12, 19, 21], "trou": 10, "true": [3, 4, 6, 8, 9, 10, 11, 17], "try": [3, 6], "ts_n": 20, "tu": 19, "tupl": [23, 28, 29], "tur23": 11, "turn": [0, 3, 4, 7, 9, 10, 11, 15, 28, 29, 31, 32], "turner": 11, "twice": [3, 6, 26], "twin_axi": 4, "twinx": 4, "two": [0, 2, 4, 6, 7, 9, 10, 11, 12, 15, 16, 17, 18, 20, 21, 22, 23, 26, 28, 29, 31, 32], "tx": [19, 20], "type": [4, 10, 21, 29], "typic": [4, 7, 10, 11, 15], "tz_n": 20, "u": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 13, 14, 17, 18, 19, 20, 22, 23, 24, 26, 28, 29, 31, 32], "u_": [0, 11, 22, 29], "u_0": [16, 22], "u_1": [0, 10, 16, 29, 31, 32], "u_2": [0, 10, 16, 29, 32], "u_i": [29, 31, 32], "u_k": [0, 11, 29], "u_m": 22, "u_n": [0, 16, 20, 22, 31, 32], "u_q": 11, "u_r": 0, "u_t": 6, "u_th_t": 6, "uav": 29, "uavaw": 29, "uavw": 29, "ubiquit": 11, "uk": [10, 11], "ultim": 4, "ulyanov": 11, "unari": 28, "unbias": [4, 7], "unbound": [0, 9], "uncertainti": 7, "unchang": 0, "unconstrain": 22, "uncount": 17, "under": [0, 2, 3, 4, 6, 8, 10, 12, 13, 19, 24, 26, 31, 32], "underbrac": [4, 28], "underli": 31, "undesir": 11, "unequ": [0, 26], "unfortun": 4, "uniform": [0, 2], "uniformli": [0, 2, 7, 17, 19], "union": [0, 13, 26], "uniqu": [3, 6, 8, 11, 23, 24, 26], "unit": 29, "univari": 15, "univers": [7, 20, 25], "unless": 29, "unlik": [6, 9, 17, 19, 20, 32], "unrel": 18, "unterthin": 11, "until": [14, 26, 28, 29], "up": [0, 3, 4, 6, 7, 10, 11, 14, 17, 26, 28, 32], "updat": [10, 13, 14], "upgrad": 33, "upper": [0, 4, 26], "uq": [10, 11], "us": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29, 31, 32, 33], "use_bia": [10, 11], "user": [4, 7], "usual": [3, 4, 15, 31], "uszkoreit": 11, "uv": [19, 29], "uvaw": 29, "uvl16": 11, "uvw": 29, "uvxyz": 29, "uwv": 29, "v": [0, 3, 4, 6, 10, 11, 18, 19, 22, 23, 24, 29, 31], "v4": 11, "v_": 0, "v_1": [0, 31], "v_2": [0, 31], "v_h": 11, "v_i": [0, 24], "v_j": 0, "v_k": 0, "v_l": 0, "v_t": 6, "vacuous": [28, 31], "valid": [4, 10, 11, 13, 14, 18, 28, 29], "valu": [0, 2, 3, 4, 6, 7, 9, 10, 11, 12, 14, 16, 17, 18, 19, 21, 22, 23, 24, 31], "vanhouck": 11, "vanish": 6, "var": [6, 7, 14, 19, 20, 21], "var_pr": 7, "vari": [4, 19], "variabl": [4, 6, 7, 9, 10, 11, 12, 19, 20, 21, 23, 24, 29], "varianc": [6, 19, 20, 21], "variant": 11, "variou": [6, 11], "vaswani": 11, "vcl": 0, "vdot": [0, 21], "ve": [4, 11], "vector": [0, 7, 8, 9, 10, 11, 12, 24, 31], "vedaldi": 11, "veri": [2, 3, 4, 9, 10, 11, 12, 16, 25, 28, 29, 31], "verifi": [0, 4, 6, 14, 15], "verlag": 6, "versa": 16, "versatil": 11, "version": [9, 15, 17, 20, 23, 24, 26], "vert_": [6, 21], "vertic": 10, "vi": 9, "via": [0, 3, 4, 7, 9, 28], "viacheslav": 7, "vice": 16, "victor": 11, "video": [11, 27], "view": [10, 11, 28], "vincent": 11, "violat": 0, "vision": [10, 11], "visit": 24, "visualis": [4, 7, 29], "vit": [10, 11], "vitali": 0, "vmatrix": 18, "volum": [3, 11, 17], "vorel": 3, "vsp": 11, "vxy": 29, "vy": 29, "w": [0, 4, 6, 7, 9, 10, 11, 17, 19, 26, 28, 29, 31], "w_": [6, 29], "w_1": [0, 26, 28], "w_1w_2": 29, "w_2": [0, 26, 28], "w_2w_1": 26, "w_2z": 26, "w_i": [0, 29], "w_j": 6, "w_k": [0, 28], "w_m": 29, "w_n": [0, 4, 6, 26, 28], "w_t": 6, "wa": [0, 2, 3, 4, 10, 11, 12, 14, 20, 26, 28, 29, 32], "wai": [0, 2, 3, 4, 6, 7, 8, 10, 11, 12, 13, 14, 15, 17, 20, 22, 24, 26, 28, 29], "walk": 33, "wall": 7, "wang": 9, "want": [3, 4, 6, 7, 9, 11, 12, 13, 24, 31], "watch": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 26, 27, 28, 29, 30, 31, 32, 33], "wbt": 7, "we": [0, 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "weak": 8, "weaker": [6, 20], "weakli": 20, "wealth": 7, "weather": 11, "websit": 33, "wei": 10, "weigh": [4, 11], "weight": [7, 10, 11, 14], "weight_decai": [10, 11], "weissenborn": 11, "welcom": 33, "well": [0, 2, 3, 4, 6, 7, 8, 9, 11, 17, 18, 20, 23, 26, 28, 29], "welsh": [20, 25], "were": [3, 6, 7, 10, 28], "wh18": 11, "what": [0, 3, 6, 10, 11, 12, 14, 26, 28, 29], "when": [0, 3, 4, 6, 9, 10, 11, 12, 14, 16, 17, 19, 20, 22, 26, 28, 29, 31, 32], "whenev": [0, 4, 11, 12, 14, 15, 17, 18, 19, 26, 31, 32], "where": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 29, 31, 32], "wherea": [6, 10, 11, 12, 16, 17, 23], "wherebi": 26, "whether": [3, 11, 12, 28, 31], "which": [0, 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 31, 32], "while": [0, 2, 4, 6, 7, 9, 10, 12, 13, 23, 28], "white": 20, "who": 26, "whole": [7, 12, 13, 31], "whose": [0, 3, 4, 6, 8, 14, 17, 21, 28], "why": [0, 4], "wide": 11, "widespread": 6, "width": [10, 11], "wihtout": 9, "wilson": 7, "window_s": 10, "wise": [10, 11, 12], "wish": [4, 6, 20], "within": [4, 6, 8, 10, 11, 14, 19], "without": [0, 6, 7, 8, 11, 26], "won": [10, 11], "wonder": 10, "word": [0, 3, 7, 8, 10, 11, 12], "work": [0, 1, 3, 4, 6, 7, 9, 12, 19, 32], "worri": 18, "worth": [4, 11, 12, 19], "would": [0, 2, 3, 4, 7, 10, 11, 12, 13, 14, 19, 20, 26], "wrap": 12, "write": [0, 2, 3, 6, 7, 8, 12, 17, 20, 22, 23, 24, 26, 28, 29], "written": [0, 4, 11, 14, 15, 17, 18, 19, 24, 26, 29, 31], "writtten": 8, "wrong": 12, "wt": 26, "wtw": 26, "wu": 11, "wx": 26, "wz": 26, "x": [0, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28, 31, 32], "x0": [4, 6], "x1": 7, "x2": 7, "x_": [0, 2, 4, 6, 9, 10, 11, 12, 17, 21, 24], "x_0": [0, 2, 4, 6, 11, 21, 24, 31], "x_1": [0, 2, 4, 11, 12, 14, 16, 17, 18, 20, 21, 23, 24, 26, 28, 32], "x_1x_2": 0, "x_2": [0, 14, 16, 17, 18, 20, 21, 23, 24, 26, 32], "x_accept": 4, "x_d": 9, "x_data": 7, "x_dim": 7, "x_full": 7, "x_i": [0, 2, 8, 11, 12, 24, 26], "x_j": [0, 2, 6, 11, 12, 24, 26], "x_k": [0, 2, 4, 23, 24, 26], "x_m": [9, 24], "x_mean": 6, "x_n": [0, 2, 4, 6, 9, 11, 12, 14, 16, 20, 24, 28, 31, 32], "x_p": 26, "x_plot": 4, "x_pred": 7, "x_shape": 10, "x_stdev": 6, "x_t": [6, 21, 24], "x_w": 17, "xa": 26, "xf_x": 17, "xiangyu": 11, "xiaohua": 11, "xlabel": 4, "xlim": 4, "xtick": 4, "xw_1": 26, "xy": [15, 26, 28], "xyz": [26, 28], "xz": 26, "y": [0, 3, 6, 7, 10, 11, 14, 15, 16, 17, 18, 19, 21, 22, 24, 26, 28, 31, 32], "y_": [0, 10, 12], "y_0": [0, 24], "y_1": [0, 11, 12, 18, 20, 24, 26], "y_2": [0, 18, 20], "y_data": 7, "y_k": 24, "y_n": [0, 11, 12, 24], "y_q": 26, "y_t": 6, "ya": 26, "yellow": 20, "yet": [31, 33], "yield": [2, 3, 4, 6, 9, 11, 22, 23], "yixuan": 10, "ylabel": 4, "ylim": 4, "you": [0, 5, 6, 11, 26, 27, 33], "ytick": 4, "yue": 10, "yutong": 10, "yuxin": 11, "yx": 26, "yz": 26, "z": [0, 3, 7, 9, 15, 17, 18, 19, 20, 23, 24, 26, 28, 31, 32], "z_": [7, 8, 11, 20], "z_0": [11, 20], "z_1": [0, 23], "z_2": 23, "z_k": 11, "z_n": [0, 6, 20, 23], "ze": 10, "zero": [0, 4, 6, 11, 19, 20, 23, 26, 28, 29], "zeta_": 7, "zhai": 11, "zhang": [10, 11], "zheng": 10, "zip": [10, 11], "zorder": 4, "zx": 26, "zy": 26, "\u00e0": 10, "\u00e4": 8, "\u010dern\u00fd": 26}, "titles": ["Exercises", "Masure, integration and real analysis", "Riemann integration", "Measures", "Annealed importance sampling", "Stream of papers", "Numerical simulation of SDEs", "Random Fourier features", "Estimation by score matching", "Stein variational gradient descent", "Shifted window transformers", "Introduction to transformers", "Why covariance functions?", "Events and Probabilities", "Discrete random variables", "Multivariate discrete distributions", "Probability generating functions", "Distribution and density functions", "Multivariate distributions", "Moment generating functions", "Main limit theorems", "Branching processes", "Random walks", "Processes in continuous time", "Markov chains", "Probability: An introduction", "Excercises", "Theory of Computation", "Finite Automata and Regular Expressions", "PDAs and context-free grammars", "Metric and topological spaces", "Metric spaces", "Topological Spaces", "Home"], "titleterms": {"": [0, 3, 6, 7, 19, 20], "1": [0, 19, 26], "10": 0, "11": [0, 26], "12": 0, "13": 0, "14": 0, "16": 0, "2": [0, 26], "20": 26, "3": 0, "31": 26, "32": 26, "33": 26, "34": 26, "4": 0, "41": 26, "43": 26, "44": 26, "45": 26, "46": 26, "47": 26, "48": 26, "5": 0, "51": 26, "52": 26, "59": 26, "6": 0, "63": 26, "67": 26, "7": 0, "8": 0, "9": 0, "A": [0, 2, 28, 29], "The": [6, 7, 8, 23], "absorpt": 22, "accept": [28, 29], "addit": 3, "algebra": 3, "algorithm": 4, "all": [3, 31], "ambigu": 29, "an": [3, 25, 32], "analysi": 1, "anneal": 4, "applic": 10, "approxim": [3, 7, 9], "ar": [2, 3, 31, 32], "arriv": 23, "attent": [10, 11], "automata": [28, 29], "automaton": [28, 29], "b": 0, "ball": 31, "bay": 13, "bayesian": 7, "below": 3, "bernoulli": [14, 16], "beta": 17, "between": 19, "binomi": [14, 16], "birth": 23, "block": [10, 11], "bochner": 7, "borel": 3, "bound": [2, 4], "branc": 21, "branch": 21, "c": [0, 20], "can": 28, "cauchi": [17, 19, 31], "central": 20, "cfg": 29, "cfl": 29, "chain": [6, 24], "chang": 18, "chapter": [0, 26], "characteris": 31, "characterist": [19, 20], "chebyshev": [0, 20], "chi": 17, "chomski": 29, "classif": 24, "close": [3, 29, 31], "closur": 28, "coeffici": 19, "collect": 3, "come": 23, "composit": [3, 32], "comput": 27, "concaten": 28, "conclus": [4, 7, 9, 10, 11], "condit": [3, 13, 14, 18], "consist": 12, "constant": 31, "contain": [3, 31], "context": 29, "continu": [2, 3, 17, 18, 20, 23, 31, 32], "converg": [6, 7, 20, 24, 31, 32], "convert": 28, "convex": 19, "convolut": [15, 18], "correl": 19, "countabl": 3, "covari": [12, 19], "cover": 3, "d": 0, "dataset": [10, 11], "death": 23, "decreas": 3, "defici": 2, "definit": [3, 15, 16], "demo": 9, "densiti": [17, 18], "deriv": [9, 16, 19, 29], "descent": 9, "deviat": 20, "differ": 3, "differenti": 6, "dimens": 6, "direct": 9, "discret": [14, 15], "disjoint": 3, "distribut": [8, 14, 15, 16, 17, 18, 20, 24], "doe": 2, "embed": 11, "empir": 9, "equal": 19, "equat": 6, "equilibrium": 24, "equiv": 28, "equival": [3, 8, 32], "estim": 8, "euler": 6, "evalu": 6, "event": 13, "eventu": 31, "everi": 3, "exampl": [10, 16, 19, 29, 31], "excercis": [0, 26], "exercis": [0, 26], "expect": [14, 15, 17, 18], "experi": 4, "exponenti": [17, 19], "express": [28, 29], "extens": 3, "extinct": 21, "extract": 10, "factoris": [15, 16, 18, 19], "failur": 9, "featur": 7, "fenchel": 20, "finit": [3, 28], "first": 23, "form": [8, 29], "formula": [15, 16, 18], "fourier": 7, "free": 29, "from": [3, 6, 7, 16], "fsa": 28, "function": [2, 3, 6, 12, 14, 15, 16, 17, 18, 19, 20, 21, 31, 32], "fundament": 14, "g": 16, "gamma": [17, 19], "gaussian": 9, "gener": [16, 19, 21, 29], "generalis": 28, "geometr": [14, 16], "good": 3, "gradient": 9, "grammar": 29, "graph": 24, "ha": 31, "hausdorff": 32, "have": 3, "head": 11, "hein": 3, "higher": 6, "hoeffd": 7, "home": 33, "homeomorph": 32, "i": [2, 3, 9, 28, 32], "iff": [8, 15, 18, 31], "imag": 3, "implement": [4, 7, 9, 10, 11], "impli": [14, 16, 19, 20, 28, 31], "import": 4, "increas": 3, "independ": [13, 15, 16, 18, 19], "indic": 15, "induc": [31, 32], "inequ": [0, 2, 19, 20, 31], "infimum": [2, 3], "infti": 3, "inner": 31, "integr": [1, 2, 6, 31], "inter": 23, "interchang": 2, "intersect": [3, 29], "interv": 3, "introduct": [11, 25], "invari": [3, 24], "invers": [3, 19], "invert": 9, "iter": 18, "ito": 6, "its": 31, "j": 8, "jacobian": 18, "jensen": 19, "joint": [15, 18], "kl": 9, "ksd": 9, "lack": 23, "laguag": 28, "languag": [28, 29], "larg": 20, "law": [14, 15, 18, 20], "layer": 11, "lebesgu": 3, "leftmost": 29, "legendr": 20, "lemma": [28, 29], "length": 3, "leq": 2, "limit": [2, 3, 20, 31, 32], "lower": [2, 4], "main": 20, "markov": [19, 24], "maruyama": 6, "mass": [14, 15], "masur": 1, "match": 8, "mathbb": 3, "mathemat": 29, "mcmc": 4, "mean": [20, 21], "measur": [3, 13], "memori": 23, "method": 6, "metric": [30, 31], "mgf": [19, 20], "milstein": 6, "mixtur": 9, "mode": 9, "model": 11, "moment": [16, 19], "multi": 11, "multivari": [15, 18], "myhil": 26, "neg": [14, 16], "neighbourhood": [31, 32], "nerod": 26, "nfa": 28, "non": [29, 31], "nonaddit": 3, "nondetermin": 28, "nondeterminist": 28, "nonexist": 3, "nontrivi": 3, "norm": 31, "normal": [17, 19, 29], "normalis": 11, "number": 20, "numer": 6, "object": 8, "one": [3, 6], "open": [3, 31, 32], "oper": [3, 28], "order": [3, 6], "other": 3, "outer": 3, "paper": 5, "partit": [2, 13, 14], "pda": 29, "pdf": [17, 18], "perceptron": 11, "pgf": 16, "pmf": [14, 15], "point": 31, "pointwis": [2, 3], "poisson": [14, 16, 23], "popul": 21, "posit": 31, "precis": 12, "preserv": [3, 32], "prior": 7, "probabiil": 13, "probabl": [13, 14, 15, 16, 17, 20, 21, 25], "process": [6, 21, 23], "product": [15, 18, 31], "proof": 9, "properti": [3, 17, 19, 23, 24, 28, 31], "pump": [28, 29], "pushdown": 29, "put": [10, 11], "queue": 23, "r": 3, "random": [7, 14, 15, 16, 17, 22, 24], "rate": 7, "ration": 3, "real": 1, "recognis": 28, "recurr": 22, "refer": [4, 6, 7, 8, 9, 10, 11, 20, 21, 23, 24, 25], "regress": 7, "regular": 28, "relat": [3, 32], "residu": 11, "result": [6, 14], "revers": 24, "rff": 7, "riemann": 2, "rule": [6, 13], "sampl": [4, 6, 7, 13], "schwartz": 19, "schwarz": 31, "score": 8, "sde": 6, "self": [10, 11], "sequenc": [3, 31, 32], "serv": 23, "set": [3, 31], "shift": 10, "sigma": 3, "simpl": [22, 23], "simul": 6, "smallest": 3, "some": 28, "space": [3, 13, 14, 30, 31, 32], "squar": [17, 20], "star": 28, "starvat": 7, "state": [24, 28], "statistician": [14, 15, 18], "steepest": 9, "stein": 9, "stochast": [6, 21], "stream": 5, "string": 28, "strong": [6, 24], "subaddit": 3, "subconsci": [14, 15, 18], "subcov": 3, "subset": [3, 31], "subspac": 31, "suffici": 3, "sum": [2, 15, 16, 18, 19], "support": 19, "supremum": [2, 3], "svgd": 9, "swin": 10, "tangent": 19, "texttt": 28, "theorem": [3, 7, 13, 14, 19, 20, 26], "theori": 27, "time": [23, 24], "togeth": [10, 11], "toi": 4, "token": 11, "topolog": [30, 32], "topologi": 32, "train": [10, 11], "transform": [9, 10, 11, 20], "transienc": 22, "translat": 3, "trick": 8, "two": [3, 14, 19], "ultim": 21, "unbound": 2, "uncount": 3, "under": [28, 29], "uniform": [7, 17, 19], "union": [3, 28], "uniqu": [16, 19, 31, 32], "upper": 2, "variabl": [14, 15, 16, 17, 18], "varianc": [4, 7, 14], "variat": 9, "walk": [22, 24], "weak": [6, 20], "weight": 4, "why": [6, 12], "wiener": 6, "window": 10, "work": 2, "written": 28, "yield": [28, 29], "zero": 3}})
\ No newline at end of file