.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_tutorials/multi_fidelity/plot_monte_carlo.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_tutorials_multi_fidelity_plot_monte_carlo.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_tutorials_multi_fidelity_plot_monte_carlo.py:


Monte Carlo Quadrature
======================
This tutorial describes how to use Monte Carlo sampling to compute the
expectation of the output of a model :math:`f(\rv):\reals^{D}\to\reals`
parameterized by a set of variables :math:`\rv=[\rv_1,\ldots,\rv_D]^\top`
with joint density :math:`\pdf(\rv):\reals^{D}\to\reals`. Specifically, our
goal is to approximate the integral

.. math:: Q=\int_\rvdom f(\rv)\pdf(\rv)\dx{\rv}

using Monte Carlo quadrature applied to an approximation :math:`f_\alpha` of
the function :math:`f`, e.g. a finite element approximation of the solution
of a set of governing equations, where :math:`\alpha` controls the accuracy
of the approximation. Monte Carlo quadrature approximates the integral

.. math:: Q_\alpha=\int_\rvdom f_\alpha(\rv)\pdf(\rv)\dx{\rv}\approx Q

by drawing :math:`N` random samples :math:`\rvset_N` of :math:`\rv` from
:math:`\pdf`, evaluating the function at each of these samples to obtain the
data pairs :math:`\{(\rv^{(n)},f^{(n)}_\alpha)\}_{n=1}^N`, where
:math:`f^{(n)}_\alpha=f_\alpha(\rv^{(n)})`, and computing

.. math:: Q_{\alpha}(\rvset_N)=N^{-1}\sum_{n=1}^N f^{(n)}_\alpha

This estimate of the mean is itself a random quantity, which we call an
estimator, because its value depends on the realizations of the inputs
:math:`\rvset_N` used to compute :math:`Q_{\alpha}(\rvset_N)`. Specifically,
two different sample sets :math:`\rvset_N` will produce two different values.

To demonstrate this phenomenon, we will estimate the mean of a simple
algebraic function :math:`f_0` which belongs to an ensemble of models

.. math::

   f_0(\rv) &= A_0 \left(\rv_1^5\cos\theta_0 +\rv_2^5\sin\theta_0\right), \\
   f_1(\rv) &= A_1 \left(\rv_1^3\cos\theta_1 +\rv_2^3\sin\theta_1\right)+s_1,\\
   f_2(\rv) &= A_2 \left(\rv_1 \cos\theta_2 +\rv_2 \sin\theta_2\right)+s_2

where :math:`\rv_1,\rv_2\sim\mathcal{U}(-1,1)` and all :math:`A` and
:math:`\theta` coefficients are real. We set :math:`A_0=\sqrt{11}`,
:math:`A_1=\sqrt{7}` and :math:`A_2=\sqrt{3}` so that each model has unit
variance. The parameters :math:`s_1,s_2` control the bias between the models;
here we set :math:`s_1=1/10,s_2=1/5`. Similarly, we can change the
correlation between the models in a systematic way (by varying
:math:`\theta_1`). We will leverage this later in the tutorial.

.. GENERATED FROM PYTHON SOURCE LINES 35-36

First, set up the example

.. GENERATED FROM PYTHON SOURCE LINES 36-46

.. code-block:: default


    import numpy as np
    import matplotlib.pyplot as plt
    from pyapprox.benchmarks import setup_benchmark

    np.random.seed(1)
    shifts = [.1, .2]
    benchmark = setup_benchmark(
        "tunable_model_ensemble", theta1=np.pi/2*.95, shifts=shifts)
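As a quick aside (not part of the original example), the estimator
:math:`Q_{\alpha}(\rvset_N)` defined above can be computed directly in a few
lines using the benchmark objects just created; the calls to
``benchmark.variable.rvs`` and ``benchmark.funs`` are the same ones used by
the plotting code below.

.. code-block:: default


    # Minimal sketch of the MC estimator Q_alpha(Z_N): draw N samples of the
    # inputs, evaluate the model at each sample, and average the values.
    nsamples_demo = 10
    samples_demo = benchmark.variable.rvs(nsamples_demo)
    values_demo = benchmark.funs[0](samples_demo)
    print("Crude MC estimate of the mean of f_0:", values_demo.mean())

Running this snippet again with a fresh sample set produces a different
value, which is precisely the estimator variability explored next.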
.. GENERATED FROM PYTHON SOURCE LINES 47-48

Now define a function that computes MC estimates of the mean using different
sample sets :math:`\rvset_N` and plots the distribution of the MC estimator
:math:`Q_{\alpha}(\rvset_N)`, computed from 1000 different sets, together
with the exact value of the mean :math:`Q_{\alpha}`

.. GENERATED FROM PYTHON SOURCE LINES 48-64

.. code-block:: default


    def plot_estimator_histrogram(nsamples, model_id, ax):
        ntrials = 1000
        np.random.seed(1)
        means = np.empty((ntrials))
        model = benchmark.funs[model_id]
        for ii in range(ntrials):
            samples = benchmark.variable.rvs(nsamples)
            values = model(samples)
            means[ii] = values.mean()
        im = ax.hist(
            means, bins=ntrials//100, density=True, alpha=0.3,
            label=r'$Q_{%d}(\mathcal{Z}_{%d})$' % (model_id, nsamples))[2]
        ax.axvline(x=benchmark.fun.get_means()[model_id], alpha=1,
                   label=r'$Q_{%d}$' % model_id, c=im[0].get_facecolor())

.. GENERATED FROM PYTHON SOURCE LINES 65-69

Now let's plot the histogram of the MC estimator :math:`Q_{0}(\rvset_N)`
using :math:`N=100` samples.

.. _estimator-histogram:

.. GENERATED FROM PYTHON SOURCE LINES 69-75

.. code-block:: default


    nsamples = int(1e2)
    model_id = 0
    ax = plt.subplots(1, 1, figsize=(8, 6))[1]
    plot_estimator_histrogram(nsamples, model_id, ax)
    _ = ax.legend()

.. image-sg:: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_monte_carlo_001.png
   :alt: plot monte carlo
   :srcset: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_monte_carlo_001.png
   :class: sphx-glr-single-img

.. GENERATED FROM PYTHON SOURCE LINES 76-77

The variability of the MC estimator as we change :math:`\rvset_N` decreases
as we increase :math:`N`. To see this, plot the estimator histogram using
:math:`N=1000` samples.

.. GENERATED FROM PYTHON SOURCE LINES 77-86

.. code-block:: default


    model_id = 0
    ax = plt.subplots(1, 1, figsize=(8, 6))[1]
    nsamples = int(1e2)
    plot_estimator_histrogram(nsamples, model_id, ax)
    nsamples = int(1e3)
    plot_estimator_histrogram(nsamples, model_id, ax)
    _ = ax.legend()

.. image-sg:: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_monte_carlo_002.png
   :alt: plot monte carlo
   :srcset: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_monte_carlo_002.png
   :class: sphx-glr-single-img

.. GENERATED FROM PYTHON SOURCE LINES 87-92

Regardless of the value of :math:`N`, the estimator :math:`Q_{0}(\rvset_N)`
is an unbiased estimate of :math:`Q_{0}`, that is

.. math:: \mean{Q_{0}(\rvset_N)}-Q_0 = 0

Unfortunately, if the computational cost of evaluating a model is high, then
one may not be able to make :math:`N` large using that model. Consequently,
one will not be able to trust the MC estimate of the mean much, because any
one realization of the estimator, computed using a single sample set, may
obtain a value that is very far from the truth. So often a cheaper, less
accurate model is used so that :math:`N` can be increased to reduce the
variability of the estimator. The following compares the histograms of
:math:`Q_0(\rvset_{100})` and :math:`Q_1(\rvset_{1000})`, where the latter
uses the model :math:`f_1`, which we assume is a cheap approximation of
:math:`f_0`.

.. GENERATED FROM PYTHON SOURCE LINES 92-102

.. code-block:: default


    model_id = 0
    ax = plt.subplots(1, 1, figsize=(8, 6))[1]
    nsamples = int(1e2)
    plot_estimator_histrogram(nsamples, model_id, ax)
    model_id = 1
    nsamples = int(1e3)
    plot_estimator_histrogram(nsamples, model_id, ax)
    _ = ax.legend()

.. image-sg:: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_monte_carlo_003.png
   :alt: plot monte carlo
   :srcset: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_monte_carlo_003.png
   :class: sphx-glr-single-img

.. GENERATED FROM PYTHON SOURCE LINES 103-141

However, using an approximate model means that the MC estimator is no longer
unbiased: the mean of the histogram of :math:`Q_1(\rvset_{1000})` is no
longer equal to the mean :math:`Q_0`.
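To put a number on this bias (a small check that is not part of the original
example), we can compare the exact means returned by
``benchmark.fun.get_means()``, the same helper used above to draw the
vertical reference lines.

.. code-block:: default


    # Compare the exact means of f_0 and f_1 to quantify the bias incurred by
    # estimating the mean of f_0 with the cheaper model f_1.
    exact_means = benchmark.fun.get_means()
    print("Exact mean of f_0:", exact_means[0])
    print("Exact mean of f_1:", exact_means[1])
    print("Bias Q_1 - Q_0:", exact_means[1] - exact_means[0])

This offset is the deterministic bias term in the error decomposition derived
next; increasing :math:`N` cannot reduce it.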
Letting :math:`Q` denote the true mean we want to estimate, e.g.
:math:`Q=Q_0` in the example we have used so far, the mean squared error
(MSE) is typically used to quantify the quality of an MC estimator. The MSE
can be expressed as

.. math::

   \mean{\left(Q_{\alpha}(\rvset_N)-Q\right)^2}&=\mean{\left(Q_{\alpha}(\rvset_N)-\mean{Q_{\alpha}(\rvset_N)}+\mean{Q_{\alpha}(\rvset_N)}-Q\right)^2}\\
   &=\mean{\left(Q_{\alpha}(\rvset_N)-\mean{Q_{\alpha}(\rvset_N)}\right)^2}+\mean{\left(\mean{Q_{\alpha}(\rvset_N)}-Q\right)^2}\\
   &\qquad\qquad+\mean{2\left(Q_{\alpha}(\rvset_N)-\mean{Q_{\alpha}(\rvset_N)}\right)\left(\mean{Q_{\alpha}(\rvset_N)}-Q\right)}\\
   &=\var{Q_{\alpha}(\rvset_N)}+\left(\mean{Q_{\alpha}(\rvset_N)}-Q\right)^2\\
   &=\var{Q_{\alpha}(\rvset_N)}+\left(Q_{\alpha}-Q\right)^2

where the expectation :math:`\mathbb{E}` and variance :math:`\mathbb{V}` are
taken over different realizations of the sample set :math:`\rvset_N`, we used
that :math:`\mean{\left(Q_{\alpha}(\rvset_N)-\mean{Q_{\alpha}(\rvset_N)}\right)}=0`
so the third term on the second line is zero, and we used
:math:`\mean{Q_{\alpha}(\rvset_N)}=Q_{\alpha}` to obtain the final equality.

Now using the well-known result that for random variables :math:`X_n`

.. math:: \var{\sum_{n=1}^N X_n} = \sum_{n=1}^N \var{X_n} + \sum_{n\neq p}\covar{X_n}{X_p}

and the result for a scalar :math:`a`

.. math:: \var{aX_n} = a^2\var{X_n}

yields

.. math:: \var{Q_{\alpha}(\rvset_N)}=\var{N^{-1}\sum_{n=1}^N f^{(n)}_\alpha}=N^{-2}\sum_{n=1}^N \var{f^{(n)}_\alpha}=N^{-1}\var{f_\alpha}

where :math:`\covar{f^{(n)}_\alpha}{f^{(p)}_\alpha}=0, n\neq p`, because the
samples are drawn independently. Finally, substituting
:math:`\var{Q_{\alpha}(\rvset_N)}` into the expression for the MSE yields

.. math:: \mean{\left(Q_{\alpha}(\rvset_N)-Q\right)^2}=\underbrace{N^{-1}\var{f_\alpha}}_{I}+\underbrace{\left(Q_{\alpha}-Q\right)^2}_{II}

From this expression we can see that the MSE can be decomposed into two
terms: a so-called stochastic error (I) and a deterministic bias (II). The
first term is the variance of the Monte Carlo estimator, which comes from
using a finite number of samples. The second term is the bias due to using an
approximation of :math:`f_0`. These two errors should be balanced; however,
in the vast majority of MC analyses a single model :math:`f_\alpha` is used
and the choice of :math:`\alpha`, e.g. mesh resolution, is made a priori
without much concern for balancing bias and variance.

.. GENERATED FROM PYTHON SOURCE LINES 143-149

Video
-----
Click on the image below to view a video tutorial on Monte Carlo quadrature.

.. image:: ../../figures/mc.png
   :target: https://youtu.be/OrLsYo11kvY?si=chAXN6UqssXY8pxh


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** ( 0 minutes 0.332 seconds)


.. _sphx_glr_download_auto_tutorials_multi_fidelity_plot_monte_carlo.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_monte_carlo.py <plot_monte_carlo.py>`

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_monte_carlo.ipynb <plot_monte_carlo.ipynb>`

.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_