- **Turing Machine (1936)**: Alan Turing's abstract computational model established theoretical limits of computation and introduced the concept of a universal machine capable of simulating any other computation
- **Linear Programming (1947)**: George Dantzig's simplex algorithm enabled systematic optimization of linear objective functions under constraints, becoming foundational for operations research and constrained optimization in machine learning
- **Information Theory (1948)**: Claude Shannon's mathematical framework quantified information and uncertainty through entropy ($H(X) = -\sum p(x) \log p(x)$), establishing fundamental limits for data compression and transmission that underpin modern loss functions and information measures in deep learning
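As a quick illustration of the entropy formula above, here is a minimal Python sketch (standard library only) that computes the Shannon entropy of a discrete distribution given as a list of probabilities; the example values are purely illustrative:

```python
import math

def entropy(probs, base=2):
    """Shannon entropy H(X) = -sum p(x) * log p(x) of a discrete distribution."""
    return -sum(p * math.log(p, base) for p in probs if p > 0)

# A fair coin carries exactly 1 bit of information per flip ...
print(entropy([0.5, 0.5]))   # 1.0
# ... while a heavily biased coin carries far less.
print(entropy([0.9, 0.1]))   # ~0.47
```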
---
## Early History of Neural Networks
- In 1943, McCulloch and Pitts created the first mathematical model of an artificial neuron
- Demonstrated neurons could be modeled as binary threshold units performing logical operations (AND, OR, NOT)
- Proved networks of artificial neurons could compute any logical or arithmetic function
- Provided the first formal argument that the brain could be understood as a computing device
- In 1957, the perceptron was introduced by Frank Rosenblatt
- It was a simple model that could learn to classify inputs into different categories by adjusting its weights based on errors
- These errors were computed against prelabeled data - a setting known as supervised learning
- Later, the multi-layer perceptron was developed, allowing for more complex representations of data
- In 1979, convolutional neural networks were introduced - replacing dense matrix multiplications with convolution operations
- Three years later, Hopfield networks were proposed, introducing recurrent connections and temporal dynamics
- Then the backpropagation algorithm enabled training of multi-layer networks - efficiently computing gradients
- Before the deep learning era, Deep Belief Networks were proposed as a way to pre-train deep networks layer by layer
- Finally, in 2012, AlexNet demonstrated the power of large deep convolutional networks on image classification tasks - marking the beginning of the deep learning revolution
---
## Early History of Neural Networks
```python
import random

# Stochastic gradient descent (SGD): one parameter update per training example.
# `initialize_parameters`, `loss`, `compute_gradient`, and `training_data` are
# placeholders for a concrete model and dataset.

# Initialize parameters
θ = initialize_parameters()
learning_rate = 0.01
num_epochs = 100

# Training loop
for epoch in range(num_epochs):
    # Shuffle training data so updates are not order-dependent
    random.shuffle(training_data)
    # Iterate through each training example
    for x_i, y_i in training_data:
        # Gradient of the loss w.r.t. θ for a single example
        gradient = compute_gradient(loss, θ, x_i, y_i)
        # Update parameters by stepping against the gradient
        θ = θ - learning_rate * gradient
```
- Now let's look at some key milestones in neural audio systems during this early history
- Already in 1960, Widrow and Hoff introduced the Least Mean Square filtering algorithm
- Then 27 years later, neural networks were applied to phoneme recognition
- In 1989, Peter Todd used RNNs for symbolic music generation
- In the same year, there were the first attempts to use gradient descent for musical DSP
- In 1997, neural networks were used for the first time to model analog effects
- Music transcription with neural networks dates back to 1999, with Matija Marolt's work on piano transcription
- Finally in 2009, Lee et al. demonstrated the effectiveness of deep belief networks for learning audio features with unsupervised learning - unlabeled data
- These features outperformed traditional hand-crafted features in many classification tasks
---
## Early History of Neural Audio Systems
Key Milestones
Significant developments in neural audio systems

- 1960: LMS Filtering (Widrow & Hoff)
- 1987: NN for Phoneme Recognition (Waibel et al.)
- 1989: RNN for Symbolic Music Generation (Todd)
- 1989: Gradient Descent for Musical DSP (Shynk & Moorer)
- 1997: NN for Analog Effects Modeling (Zhang & Duhamel)
- 1999: NN for Piano Transcription (Matija Marolt)
- 2006: Deep Belief Networks
- 2009: Audio Features with DBN (Lee et al.)

- Gradient Descent Based Digital Signal Processing: use gradient descent to optimize parameters of digital signal processing algorithms for tasks like audio effects modeling and synthesis.
- Feature Extraction with Neural Networks: use neural networks to automatically learn and extract relevant features from audio data for tasks like classification, transcription, and analysis.
- Symbolic Music Generation with Neural Networks: use neural networks to generate symbolic music representations, such as music notation or MIDI sequences, for composition and arrangement tasks.
What about neural audio synthesis?
Notes:
- I would like to highlight that these early works can be categorised into three main areas.
- First, gradient descent based digital signal processing - using gradient descent to optimize parameters of DSP algorithms (a minimal sketch follows these notes)
- Second, feature extraction with neural networks - using neural networks to automatically learn and extract relevant features
- And the third category is symbolic music generation with neural networks
- But what about neural audio synthesis?
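To make the first of these categories concrete, here is a minimal, hypothetical sketch of gradient-descent-based DSP: the coefficient of a one-pole lowpass filter is fitted so that its output matches a reference filter, using a finite-difference gradient of the mean squared error. The filter, signal, and learning-rate values are illustrative assumptions, not taken from the cited works.

```python
import numpy as np

# One-pole lowpass y[n] = (1 - a) * x[n] + a * y[n-1]; we fit its coefficient `a`
# by gradient descent so that the filter matches a reference filter's output.
def one_pole(x, a):
    y = np.zeros_like(x)
    prev = 0.0
    for n in range(len(x)):
        prev = (1.0 - a) * x[n] + a * prev
        y[n] = prev
    return y

rng = np.random.default_rng(0)
x = rng.standard_normal(512)          # white-noise test signal
target = one_pole(x, 0.8)             # output of the (unknown) reference filter

a, lr, eps = 0.1, 0.2, 1e-4
for _ in range(200):
    loss = np.mean((one_pole(x, a) - target) ** 2)
    # Finite-difference gradient of the loss w.r.t. the coefficient
    grad = (np.mean((one_pole(x, a + eps) - target) ** 2) - loss) / eps
    a -= lr * grad                     # gradient descent update

print(f"estimated coefficient a = {a:.3f} (reference: 0.8)")
```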
---
## The Deep Learning Era
Deep architectures
Deep architectures and generative models transforming AI capabilities

- 2013: Variational Autoencoders (Kingma et al.)
- 2014: Generative Adversarial Nets (Goodfellow et al.)
- 2015: ResNet & Diffusion (He et al. & Sohl-Dickstein et al.)
- 2016: Style Transfer & WaveNet (Gatys & van den Oord)
- 2017: Transformers (Vaswani et al.)
- 2021: ViT & CLIP (Dosovitskiy & Radford)
- 2022: Diffusion Transformer (Peebles & Xie)
https://theaisummer.com/Autoencoder/
https://www.linkedin.com/pulse/what-generative-adversarial-networks-gans-sushant-babbar-qpc9c
Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion probabilistic models. Advances in neural information processing systems, 33, 6840-6851.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In International conference on machine learning (pp. 8748-8763). PmLR.
https://digialps.com/stability-ais-new-open-source-ai-creation-stable-audio-2-0-takes-on-suno-ai/
Notes:
- Well, for neural audio synthesis we need the inventions of the deep learning era - first an overview of key milestones in deep learning in general
- In 2013, Variational Autoencoders were introduced - ability to generate new data points by sampling from a learned distribution - the latent distribution
- Learn in an unsupervised manner to encode input data into a compressed representation and then decode it back to the original input
- In 2014, Generative Adversarial Networks were proposed - two neural networks competing against each other
- In 2015, Diffusion models were introduced - iterative denoising process to generate high-quality samples
- In 2017, Transformers revolutionized sequence modeling with self-attention mechanisms
- In 2021, CLIP demonstrated the power of multi-modal learning by connecting images and text
- Two encoders map images and text into a shared latent space - through contrastive learning, matching images and texts are mapped close to each other in that space (see the contrastive-loss sketch after these notes)
- It could for example classify images, without ever being trained on that specific task
- In 2022, Diffusion Transformers combined the strengths of diffusion models and transformers
- And finally in 2023, Mamba was introduced - a new architecture for sequence modeling
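As a rough illustration of the contrastive idea behind CLIP (a sketch of the principle, not the original implementation), the snippet below assumes two placeholder encoder networks that return embeddings of the same dimensionality and computes the symmetric cross-entropy loss over the image-text similarity matrix:

```python
import torch
import torch.nn.functional as F

def clip_style_loss(image_encoder, text_encoder, images, texts, temperature=0.07):
    """Symmetric contrastive loss over a batch of matching image-text pairs.
    `image_encoder` and `text_encoder` are placeholder modules returning
    embeddings of the same dimensionality."""
    img_emb = F.normalize(image_encoder(images), dim=-1)   # (B, D), unit length
    txt_emb = F.normalize(text_encoder(texts), dim=-1)     # (B, D), unit length
    logits = img_emb @ txt_emb.t() / temperature           # (B, B) similarity matrix
    targets = torch.arange(logits.size(0), device=logits.device)  # i-th image matches i-th text
    # Cross-entropy in both directions: image -> text and text -> image
    return (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets)) / 2
```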
---
## The Deep Learning Era
Deep architectures
Deep architectures and generative models transforming AI capabilities

- 2013: Variational Autoencoders (Kingma et al.)
- 2014: Generative Adversarial Nets (Goodfellow et al.)
- 2015: ResNet & Diffusion (He et al. & Sohl-Dickstein et al.)
- 2016: Style Transfer & WaveNet (Gatys & van den Oord)
- 2017: Transformers (Vaswani et al.)
- 2021: ViT & CLIP (Dosovitskiy & Radford)
- 2022: Diffusion Transformer (Peebles & Xie)
Training & Optimization
Advanced learning techniques and representation learning breakthroughs

- 2013: Word2Vec (Mikolov et al.)
- 2014: Attention Mechanism (Bahdanau et al.)
- 2015: BatchNorm & Adam (Ioffe & Kingma)
- 2016: Layer Normalization (Ba et al.)

Software & Applications
Practical deployment and mainstream adoption of deep learning systems

- 2016: AlphaGo (Silver et al.)
- 2017: PyTorch (Paszke et al.)
- 2018: GPT-1 & BERT (Radford & Devlin)
- 2020: GPT-3 (Brown et al.)
- 2022: ChatGPT & Stable Diffusion (OpenAI & Stability AI)
- 2023: LLaMA (Touvron et al.)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems, 26.
Bahdanau, D., Cho, K., & Bengio, Y. (2016). Neural Machine Translation by Jointly Learning to Align and Translate (No. arXiv:1409.0473). arXiv. https://doi.org/10.48550/arXiv.1409.0473
Adaptive Moment Estimation - combines momentum and RMSprop:
First moment (momentum):
$\mathbf{m}_t = \beta_1 \mathbf{m}_{t-1} + (1-\beta_1)\nabla_{\boldsymbol{\theta}}\mathcal{L}(\boldsymbol{\theta}_t)$
Second moment (RMSprop):
$\mathbf{v}_t = \beta_2 \mathbf{v}_{t-1} + (1-\beta_2)(\nabla_{\boldsymbol{\theta}}\mathcal{L}(\boldsymbol{\theta}_t))^2$
Bias correction:
$\hat{\mathbf{m}}_t = \frac{\mathbf{m}_t}{1-\beta_1^t}, \quad \hat{\mathbf{v}}_t = \frac{\mathbf{v}_t}{1-\beta_2^t}$
Parameter update:
$\boldsymbol{\theta}_{t+1} = \boldsymbol{\theta}_t - \alpha \frac{\hat{\mathbf{m}}_t}{\sqrt{\hat{\mathbf{v}}_t} + \epsilon}$
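The update above maps almost line for line onto code; here is a minimal NumPy sketch of a single Adam step (a simplified illustration, not a reference implementation):

```python
import numpy as np

def adam_step(theta, grad, m, v, t, alpha=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for parameters `theta` given gradient `grad` at step t (t >= 1)."""
    m = beta1 * m + (1 - beta1) * grad            # first moment (momentum)
    v = beta2 * v + (1 - beta2) * grad ** 2       # second moment (RMSprop-style)
    m_hat = m / (1 - beta1 ** t)                  # bias correction
    v_hat = v / (1 - beta2 ** t)
    theta = theta - alpha * m_hat / (np.sqrt(v_hat) + eps)   # parameter update
    return theta, m, v
```

Here `m` and `v` start as zero arrays of the same shape as `theta`, and `t` counts update steps from 1 so the bias correction is well defined.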
Notes:
- Well, for neural audio synthesis we need the inventions of the deep learning era - first an overview of key milestones in deep learning in general
- In 2013, Variational Autoencoders were introduced - ability to generate new data points by sampling from a learned distribution - the latent distribution
- Learn in an unsupervised manner to encode input data into a compressed representation and then decode it back to the original input
- In 2014, Generative Adversarial Networks were proposed - two neural networks competing against each other
- In 2015, Diffusion models were introduced - iterative denoising process to generate high-quality samples
- In 2017, Transformers revolutionized sequence modeling with self-attention mechanisms
- In 2021, CLIP demonstrated the power of multi-modal learning by connecting images and text
- Two encoders that map images and text into a shared latent space - by using contrastive learning the images and text are mapped close to each other in the latent space
- It could for example classify images, without ever being trained on that specific task
- In 2022, Diffusion Transformers combined the strengths of diffusion models and transformers
- And finally in 2023, Mamba was introduced - a new architecture for sequence modeling
---
## Deep Neural Audio Systems
Key Milestones
Significant developments in deep neural audio systems

Deep learning foundations:
- 2013: VAE (Kingma & Welling)
- 2014: GAN (Goodfellow et al.)
- 2015: Diffusion (Sohl-Dickstein et al.)
- 2017: Transformers (Vaswani et al.)
- 2021: CLIP (Radford et al.)
- 2022: Diffusion Transformer (Peebles & Xie)

Neural audio systems:
- 2017: Neural Synthesis (Engel et al.)
- 2019: Real-time Amp Emulation (Damskägg et al.)
- 2020: Automatic Mixing (Steinmetz et al.)
- 2021: RAVE (Caillon & Esling)
- 2022: CLAP (Elizalde et al.)
- 2024: Stable Audio (Evans et al.)
- 2025: Lyria 2 (The Lyria Team)
Oord, A. van den, Dieleman, S., Zen, H., Simonyan, K., Vinyals, O., Graves, A., Kalchbrenner, N., Senior, A., & Kavukcuoglu, K. (2016). WaveNet: A Generative Model for Raw Audio (No. arXiv:1609.03499). https://doi.org/10.48550/arXiv.1609.03499
Engel, J., Resnick, C., Roberts, A., Dieleman, S., Norouzi, M., Eck, D., & Simonyan, K. (2017, July). Neural audio synthesis of musical notes with wavenet autoencoders. In International conference on machine learning (pp. 1068-1077). PMLR.
Wright, A., Damskägg, E.-P., Juvela, L., & Välimäki, V. (2020). Real-Time Guitar Amplifier Emulation with Deep Learning. Applied Sciences, 10(3), 766. https://doi.org/10.3390/app10030766
Engel, J., Hantrakul, L. (Hanoi), Gu, C., & Roberts, A. (2019, September 25). DDSP: Differentiable Digital Signal Processing. International Conference on Learning Representations.
Caillon, A., & Esling, P. (2021). RAVE: A variational autoencoder for fast and high-quality neural audio synthesis (No. arXiv:2111.05011). arXiv. http://arxiv.org/abs/2111.05011
Elizalde, B., Deshmukh, S., Al Ismail, M., & Wang, H. (2023, June). Clap learning audio concepts from natural language supervision. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE.
https://digialps.com/stability-ais-new-open-source-ai-creation-stable-audio-2-0-takes-on-suno-ai/
Notes:
- We left the neural audio systems before the deep learning era, saying that there was no neural audio generation yet
- But that changed with the WaveNet model in 2016
- WaveNet used a clever trick in convolutional neural networks to model raw audio waveforms - so-called dilated convolutions, which increase the receptive field of the network (see the short sketch after these notes)
- This allowed the model to capture long-range dependencies in audio signals, resulting in high-quality and realistic audio generation
- In 2017, Engel et al. introduced Neural Synthesis with WaveNet Autoencoders - a model that could generate musical notes by learning a latent representation of audio
- In 2019, the same team (Google Magenta) further advanced the field with Differentiable Digital Signal Processing (DDSP) - combining neural networks with traditional signal processing techniques
- Basically, they were predicting the parameters of an additive synthesizer with deep learning
- The key to this approach is that the synthesis process is differentiable, allowing for end-to-end training of the model
- In 2020, Steinmetz et al. proposed an approach for automatic mixing based on differentiable effects
- In 2021, Caillon and Esling introduced RAVE - a real-time audio synthesis model using variational autoencoders
- What works for images and text should also work for audio - in 2022, CLAP was introduced - a model that learns audio concepts from natural language supervision
- And finally in 2024, Stable Audio Open was released - a model based on diffusion transformers for high-quality text-to-audio generation
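To give a feel for why dilation matters, this small Python sketch (illustrative layer counts, not WaveNet's exact configuration) computes how the receptive field of stacked 1-D convolutions grows when the dilation doubles per layer, compared to undilated layers:

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (in samples) of stacked 1-D convolutions."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf

layers = 10
dilated = [2 ** i for i in range(layers)]       # 1, 2, 4, ..., 512
undilated = [1] * layers

print(receptive_field(2, dilated))    # 1024 samples with dilation doubling per layer
print(receptive_field(2, undilated))  # 11 samples without dilation
```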
---
OUR RECENT RESEARCH CONTRIBUTIONS
Selected work from the Computer Music and Neural Audio Systems Research Team
Audio Communication Group
Technische Universität Berlin
Notes:
- Ok, this was my overview of the academic field from its origins to the present day
- This area is receiving growing interest from research groups worldwide
- From us, as well
- So I'd like to show you three of our recent contributions in the years 2024 and 2025
- By "us" I refer to the Computer Music and Neural Audio Systems Research Team at the Audio Communication Group
---
Anira (Ackva, V.* & Schulz, F.*)
ANIRA: An Architecture for Neural network Inference in Real-time Audio applications
→ C++ Library that bridges the gap between neural audio research and real-time applications
Key Contributions
- Enables real-time safe neural network integration in DAWs and audio plugins
- Provides a framework for benchmarking neural networks in real-time scenarios
- Paper: First benchmark of neural audio effects models with different backends in real-time audio contexts
Open-source • Extensive documentation • Permissive licensing
Ackva, V., & Schulz, F. (2024). ANIRA: An Architecture for Neural Network Inference in Real-Time Audio Applications. 2024 IEEE 5th International Symposium on the Internet of Sounds (IS2), 1–10. https://doi.org/10.1109/IS262782.2024.10704099
Notes:
- The first contribution is ANIRA - an architecture for neural network inference in real-time audio applications - a project mainly by my colleague Valentin Ackva and me
- Inference is the process of using a trained neural network to make predictions on new data
- ANIRA is a C++ library that tries to bridge the gap between neural audio research and real-time applications
- It has two major focus areas - first the real-time safe integration of neural networks into DAWs, audio plugins and audio applications in general
- The second focus area is the performance evaluation of neural networks in audio applications
- For this, ANIRA provides a framework for benchmarking neural networks in real-time scenarios (a generic sketch of the underlying real-time budget check follows these notes)
- And our paper was the first benchmark of neural audio effects models with different backends in real-time audio contexts
- Finally, ANIRA is open-source, has extensive documentation and permissive licensing
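As a generic illustration of the real-time constraint such benchmarks measure against (this is not ANIRA's API, just the underlying idea in plain Python), a model's worst-case per-buffer inference time has to stay below the duration of one audio buffer:

```python
import time
import numpy as np

def benchmark_inference(run_inference, buffer_size=512, sample_rate=48000, n_runs=100):
    """Measure per-buffer inference time of `run_inference` (a placeholder callable
    taking one audio buffer) against the real-time deadline for that buffer size."""
    deadline_ms = 1000.0 * buffer_size / sample_rate     # ~10.7 ms at 512 samples / 48 kHz
    buffer = np.zeros(buffer_size, dtype=np.float32)
    timings = []
    for _ in range(n_runs):
        start = time.perf_counter()
        run_inference(buffer)
        timings.append((time.perf_counter() - start) * 1000.0)
    worst = max(timings)
    print(f"worst case: {worst:.2f} ms, deadline: {deadline_ms:.2f} ms, "
          f"real-time safe: {worst < deadline_ms}")
```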
---
Neural Proxies for Sound Synthesizers
(Combes, P., Weinzierl, S., Obermayer, K.)
→ How can we integrate non-differentiable synthesizers into deep learning pipelines for automatic synthesizer programming?
Key Contributions
- Method for training neural proxies for arbitrary synthesizers
- Evaluation of pretrained audio feature extraction models as proxy training representations
- Evaluation of method on synthesizer sound matching task
Open-source
Combes, P., Weinzierl, S., & Obermayer, K. (2025). Neural Proxies for Sound Synthesizers: Learning Perceptually Informed Preset Representations. Journal of the Audio Engineering Society, 73(9), 561–577. https://doi.org/10.17743/jaes.2022.0219
Training of a neural proxy to mimic the behavior of a non-differentiable synthesizer
Training of a synthesizer sound matching system using the neural proxy
Notes:
- The next contribution is Neural Proxies for Sound Synthesizers, primarily led by my colleague Paulo Combes
- The central question: how can we integrate non-differentiable synthesizers into deep learning pipelines for automatic synthesizer programming?
- In deep learning everything needs to be differentiable for our backpropagation algorithm to work
- This is why neural audio synthesis models like DDSP rely on differentiable synthesizers
- However, many high-quality synthesizers are non-differentiable, which limits their use in deep learning workflows
- Paulo's solution: neural proxies - differentiable neural networks that mimic non-differentiable synthesizer behavior
- The training process uses an audio feature extraction model (g()) to extract features from synthesizer output
- Then a neural network (f()) is trained to map synthesizer parameters to these extracted features (a rough sketch of this training step follows these notes)
- The paper also provides extensive evaluation of different audio feature extraction models as proxy training representations
- Finally, the method was evaluated on synthesizer sound matching tasks
- Using the neural proxy (f()) to train a network (e()) that predicts synthesizer parameters for a given target sound
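Below is a rough, hypothetical sketch of the proxy training step described above; `synth`, `g`, and the network sizes are placeholders rather than the paper's actual code, and the proxy `f` is trained to predict the features that `g` extracts from the rendered audio:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical proxy f: maps a 16-parameter preset to the 128-dim feature space of g.
proxy = nn.Sequential(nn.Linear(16, 256), nn.ReLU(), nn.Linear(256, 128))
optimizer = torch.optim.Adam(proxy.parameters(), lr=1e-4)

def proxy_training_step(params_batch, synth, g):
    """params_batch: (B, 16) tensor of synthesizer presets.
    synth: non-differentiable synthesizer, called outside the autograd graph.
    g: pretrained audio feature extractor returning (B, 128) embeddings."""
    with torch.no_grad():
        audio = synth(params_batch)   # render audio; no gradients flow through here
        target = g(audio)             # reference features of the rendered audio
    pred = proxy(params_batch)        # proxy predicts the same features directly
    loss = F.mse_loss(pred, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Once trained, the differentiable proxy can stand in for the synthesizer when training the parameter estimator (e()) end to end on the sound matching task.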
---
pGESAM (Limberg, C.*, Schulz, F.*, Zhang, Z., Weinzierl, S.)
pGESAM: pitch-conditioned GEnerative SAmple Map
→ How can musicians find the perfect samples in an effective and creative way?
→ How can we generate samples that can be played expressively throughout different pitches?
Key Contributions
- Framework for the successful generation of 4-second one-shot samples from 3 data points
- Effective pitch-timbre disentanglement via semi-supervised learning (2D timbre, 1D pitch)
- Extensive evaluation on NSynth dataset
Open-source • Web Demonstration
Limberg, C., Schulz, F., Zhang, Z., & Weinzierl, S. (2025). Pitch-Conditioned Instrument Sound Synthesis from an Interactive Timbre Latent Space. 28th International Conference on Digital Audio Effects (DAFx25), 1–8. https://dafx.de/paper-archive/2025/DAFx25_paper_58.pdf
Notes:
- The last contribution is pGESAM - pitch-conditioned Generative Sample Map - a collaboration primarily between Christian Limberg and me
- Two central questions:
- How can musicians find the perfect samples in an effective and creative way?
- How can we generate samples that can be played expressively throughout different pitches?
- Key contributions: a framework generating 4-second one-shot samples from just 3 data points
- Three floats input, 4-second audio output
- These dimensions are disentangled - independent control over timbre (2D) and pitch (1D)
- Architecture overview: neural audio codec extracts embeddings (e), VAE learns low-dimensional timbre representation with disentangled pitch, pitch/timbre-conditioned transformer generates audio embeddings autoregressively
- Extensive evaluation on NSynth dataset demonstrates effectiveness
- Now I want to show you a quick demo of the pGESAM framework with our interactive web application
---
OUTLOOK
---
## Future Directions in Neural Audio Systems
Deep Learning & Model Architectures
- Advanced sequence modeling for extended, coherent audio generation
- Methods for explainability and interpretability of neural audio models
- Synthetic data generation with generative models
Deployment & Real-time Performance
- Real-time inference optimization for low-latency audio processing
- Efficient model compression for resource-constrained devices
- Sample-rate agnostic architectures for flexible synthesis
Creative & Artistic Applications
- Improved control mechanisms for user-guided generation and processing
- Multi-modal conditioning for richer, more expressive outputs
- Enhanced embodiment in neural musical instruments
Notes:
- In the deep learning research area there is active work on long-term coherent generation, model explainability, and synthetic data creation
- For real-time contexts, inference optimization, model compression, and sample-rate agnostic architectures are important topics
- Finally, for creative applications, there is research in enhanced user control and better multi-modal conditioning, which would hopefully lead to more embodiment of neural musical instruments
---
# Setup Python Environment