.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_tutorials/multi_fidelity/plot_approximate_control_variate_monte_carlo.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_tutorials_multi_fidelity_plot_approximate_control_variate_monte_carlo.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_tutorials_multi_fidelity_plot_approximate_control_variate_monte_carlo.py:

Approximate Control Variate Monte Carlo
=======================================

This tutorial builds upon :ref:`sphx_glr_auto_tutorials_multi_fidelity_plot_control_variate_monte_carlo.py` and describes how to implement and deploy *approximate* control variate Monte Carlo (ACVMC) sampling to compute the expectation of a high-fidelity model output using lower-fidelity models whose means are unknown.

CVMC is often impractical for the analysis of numerical models because the mean of the lower-fidelity model, i.e. :math:`\mu_\V{\kappa}`, is typically unknown and the cost of the lower-fidelity model is non-trivial. These two issues can be overcome by using approximate control variate Monte Carlo.

Let the cost of the high-fidelity model per sample be :math:`C_\alpha` and let the cost of the low-fidelity model be :math:`C_\kappa`. Now let's use :math:`N` samples to estimate :math:`Q_{\V{\alpha},N}` and :math:`Q_{\V{\kappa},N}`, and these :math:`N` samples plus another :math:`(r-1)N` samples to estimate :math:`\mu_{\V{\kappa}}`, so that

.. math::

   Q_{\V{\alpha},N,r}^{\text{ACV}}=Q_{\V{\alpha},N} + \eta \left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r} \right)

and

.. math::

   \mu_{\V{\kappa},N,r}=\frac{1}{rN}\sum_{i=1}^{rN}f_\V{\kappa}^{(i)}

where for ease of notation we write :math:`rN` and :math:`\lfloor rN\rfloor` interchangeably. With this sampling scheme we have

.. math::

   Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}&=\frac{1}{N}\sum_{i=1}^N f_\V{\kappa}^{(i)}-\frac{1}{rN}\sum_{i=1}^{rN}f_\V{\kappa}^{(i)}\\
   &=\frac{1}{N}\sum_{i=1}^N f_\V{\kappa}^{(i)}-\frac{1}{rN}\sum_{i=1}^{N}f_\V{\kappa}^{(i)}-\frac{1}{rN}\sum_{i=N+1}^{rN}f_\V{\kappa}^{(i)}\\
   &=\frac{r-1}{rN}\sum_{i=1}^N f_\V{\kappa}^{(i)}-\frac{1}{rN}\sum_{i=N+1}^{rN}f_\V{\kappa}^{(i)}

Because this difference has zero mean, using the above expression yields

.. math::

   \var{\left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}\right)}&=\mean{\left(\frac{r-1}{rN}\sum_{i=1}^N f_\V{\kappa}^{(i)}-\frac{1}{rN}\sum_{i=N+1}^{rN}f_\V{\kappa}^{(i)}\right)^2}\\
   &=\frac{(r-1)^2}{r^2N^2}\sum_{i=1}^N \var{f_\V{\kappa}^{(i)}}+\frac{1}{r^2N^2}\sum_{i=N+1}^{rN}\var{f_\V{\kappa}^{(i)}}\\
   &=\frac{(r-1)^2}{r^2N^2}N\var{f_\V{\kappa}}+\frac{1}{r^2N^2}(r-1)N\var{f_\V{\kappa}}\\
   &=\frac{r-1}{r}\frac{\var{f_\V{\kappa}}}{N}

where we have used the fact that, since the samples in the first and second terms on the first line are not shared, the covariance between these terms is zero. Also we have

.. math::

   \covar{Q_{\V{\alpha},N}}{\left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}\right)}=\covar{\frac{1}{N}\sum_{i=1}^N f_\V{\alpha}^{(i)}}{\frac{r-1}{rN}\sum_{i=1}^N f_\V{\kappa}^{(i)}-\frac{1}{rN}\sum_{i=N+1}^{rN}f_\V{\kappa}^{(i)}}

The covariance between the estimators :math:`\frac{1}{N}\sum_{i=1}^{N}f_\V{\alpha}^{(i)}` and :math:`\frac{1}{rN}\sum_{i=N+1}^{rN}f_\V{\kappa}^{(i)}` is zero because the samples used in these estimators are different. Thus

.. math::

   \covar{Q_{\V{\alpha},N}}{\left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}\right)}&=\covar{\frac{1}{N}\sum_{i=1}^N f_\V{\alpha}^{(i)}}{\frac{r-1}{rN}\sum_{i=1}^N f_\V{\kappa}^{(i)}}\\
   &=\frac{r-1}{r}\frac{\covar{f_\V{\alpha}}{f_\V{\kappa}}}{N}

Recalling that the variance reduction of the CV estimator using the optimal :math:`\eta` is

.. math::

   \gamma &= 1-\frac{\covar{Q_{\V{\alpha},N}}{\left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}\right)}^2}{\var{\left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}\right)}\var{Q_{\V{\alpha},N}}}\\
   &=1-\frac{N^{-2}\frac{(r-1)^2}{r^2}\covar{f_\V{\alpha}}{f_\V{\kappa}}^2}{N^{-1}\frac{r-1}{r}\var{f_\V{\kappa}}N^{-1}\var{f_\V{\alpha}}}\\
   &=1-\frac{r-1}{r}\corr{f_\V{\alpha}}{f_\V{\kappa}}^2

which is attained when

.. math::

   \eta&=-\frac{\covar{Q_{\V{\alpha},N}}{\left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}\right)}}{\var{\left( Q_{\V{\kappa},N} - \mu_{\V{\kappa},N,r}\right)}}\\
   &=-\frac{N^{-1}\frac{r-1}{r}\covar{f_\V{\alpha}}{f_\V{\kappa}}}{N^{-1}\frac{r-1}{r}\var{f_\V{\kappa}}}\\
   &=-\frac{\covar{f_\V{\alpha}}{f_\V{\kappa}}}{\var{f_\V{\kappa}}}
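To see what the formula for :math:`\gamma` implies in practice, the short sketch below (an illustrative addition, not taken from the tutorial source; the helper name ``acv_variance_reduction`` is ours) evaluates the ACVMC variance reduction :math:`1-\frac{r-1}{r}\corr{f_\V{\alpha}}{f_\V{\kappa}}^2` for a few correlations and sample ratios. As :math:`r` grows the reduction approaches the CVMC value :math:`1-\corr{f_\V{\alpha}}{f_\V{\kappa}}^2`, so most of the benefit of a cheap low-fidelity model is already obtained for moderate :math:`r`.

.. code-block:: default


    def acv_variance_reduction(corr, r):
        # variance of the two-model ACV estimator relative to single-model MC
        return 1 - (r - 1)/r*corr**2

    for corr in [0.8, 0.95]:
        for r in [2, 10, 100]:
            print(corr, r, acv_variance_reduction(corr, r))
        # the r -> infinity limit recovers the CVMC variance reduction
        print(corr, "CVMC limit", 1 - corr**2)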
.. GENERATED FROM PYTHON SOURCE LINES 69-70

Let's set up the problem and compute an ACV estimate of :math:`\mean{f_0}`.

.. GENERATED FROM PYTHON SOURCE LINES 70-82

.. code-block:: default


    import numpy as np
    import matplotlib.pyplot as plt

    from pyapprox.benchmarks import setup_benchmark

    np.random.seed(1)
    shifts = [.1, .2]
    benchmark = setup_benchmark(
        "tunable_model_ensemble", theta1=np.pi/2*.95, shifts=shifts)
    model = benchmark.fun
    exact_integral_f0 = benchmark.means[0]
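The variance reduction attainable with ACVMC is governed by the correlation between the models, so it is often worth inspecting the correlations implied by the benchmark covariance before building an estimator. The following is a minimal sketch (an illustrative addition, not part of the original tutorial) that converts ``benchmark.model_covariance`` into a correlation matrix with plain NumPy.

.. code-block:: default


    # convert the model covariance matrix into a correlation matrix
    cov = benchmark.model_covariance
    stdev = np.sqrt(np.diag(cov))
    corr_matrix = cov/np.outer(stdev, stdev)
    print(corr_matrix)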
.. GENERATED FROM PYTHON SOURCE LINES 83-102

Before proceeding to estimate the mean using ACVMC we must first define how to generate samples to estimate :math:`Q_{\V{\alpha},N}` and :math:`\mu_{\V{\kappa},N,r}`. To do so we must introduce some additional notation. Let :math:`\mathcal{Z}_0` be the set of samples used to evaluate the high-fidelity model and let :math:`\mathcal{Z}_\alpha=\mathcal{Z}_{\alpha,1}\cup\mathcal{Z}_{\alpha,2}` be the samples used to evaluate the low-fidelity models. Using this notation we can rewrite the ACV estimator as

.. math::

   Q_{\V{\alpha},\mathcal{Z}}^{\text{ACV}}=Q_{\V{\alpha},\mathcal{Z}_0} + \eta \left( Q_{\V{\kappa},\mathcal{Z}_{\alpha,1}} - \mu_{\V{\kappa},\mathcal{Z}_{\alpha,2}} \right)

where :math:`\mathcal{Z}=\bigcup_{\alpha=0}^M \mathcal{Z}_\alpha`. The nature of these samples can be changed to produce different ACV estimators. Here we choose :math:`\mathcal{Z}_{\alpha,1}\cap\mathcal{Z}_{\alpha,2}=\emptyset` and :math:`\mathcal{Z}_{\alpha,1}=\mathcal{Z}_0`. That is, we use a common set of samples to compute the covariance between all the models and a second, independent set to estimate the lower-fidelity mean. The sample partitioning for :math:`M` models is shown in the following figure. We call this scheme the ACV IS sampling strategy, where IS indicates that the second sample set :math:`\mathcal{Z}_{\alpha,2}` assigned to each model is not shared with the other models.

.. list-table::

   * - .. _acv-is-sample-allocation:

       .. figure:: ../../figures/acv_is.png
          :width: 50%
          :align: center

          ACV IS sampling strategy

The following code generates samples according to this strategy

.. GENERATED FROM PYTHON SOURCE LINES 102-115

.. code-block:: default


    from pyapprox import multifidelity

    model_costs = 10.**(-np.arange(3))
    est = multifidelity.get_estimator(
        "acvis", benchmark.model_covariance[:2, :2], model_costs[:2],
        benchmark.variable)
    est.nsamples_per_model = np.array([10, 100])
    samples_per_model, partition_indices_per_model = \
        est.generate_sample_allocations()
    print(partition_indices_per_model)
    samples_shared = samples_per_model[0]
    samples_lf_only = samples_per_model[1][
        :, partition_indices_per_model[1] == 1]

.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    [array([0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]),
     array([0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
            1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
            1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
            1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
            1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.])]

.. GENERATED FROM PYTHON SOURCE LINES 116-117

Now let's plot the samples assigned to each model.

.. GENERATED FROM PYTHON SOURCE LINES 117-127

.. code-block:: default


    fig, ax = plt.subplots()
    ax.plot(samples_shared[0, :], samples_shared[1, :], 'ro', ms=12,
            label=r'$\mathrm{Low\ and\ high\ fidelity\ models}$')
    ax.plot(samples_lf_only[0, :], samples_lf_only[1, :], 'ks',
            label=r'$\mathrm{Low\ fidelity\ model\ only}$')
    ax.set_xlabel(r'$z_1$')
    ax.set_ylabel(r'$z_2$', rotation=0)
    _ = ax.legend(loc='upper left')

.. image-sg:: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_approximate_control_variate_monte_carlo_001.png
   :alt: plot approximate control variate monte carlo
   :srcset: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_approximate_control_variate_monte_carlo_001.png
   :class: sphx-glr-single-img

.. GENERATED FROM PYTHON SOURCE LINES 128-129

The high-fidelity model is evaluated only at the red dots. Now let's use these samples to estimate the mean of :math:`f_0`.

.. GENERATED FROM PYTHON SOURCE LINES 129-141

.. code-block:: default


    values_per_model = []
    for ii in range(len(samples_per_model)):
        values_per_model.append(model.models[ii](samples_per_model[ii]))
    acv_mean = est.estimate_from_values_per_model(
        values_per_model, partition_indices_per_model)

    print('MC difference squared =', (
        values_per_model[0].mean()-exact_integral_f0)**2)
    print('ACVMC difference squared =', (acv_mean-exact_integral_f0)**2)

.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    MC difference squared = 0.1663308327274871
    ACVMC difference squared = 0.010697826122695035

.. GENERATED FROM PYTHON SOURCE LINES 142-148

Note that here we have set the number of high-fidelity samples :math:`N` and the ratio :math:`r` arbitrarily. In practice one should choose these free parameters in one of two ways: (i) for a fixed budget, choose them to minimize the variance of the estimator; or (ii) choose them to achieve a desired MSE (variance) with the smallest computational cost. Note that the cost of computing the two-model ACV estimator is

.. math::

   C_\mathrm{cv} = NC_\alpha + rNC_\kappa
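As an illustration of option (i), the sketch below (an illustrative addition, not part of the tutorial source; the budget value and the variable names ``budget`` and ``ratios`` are assumptions made here) performs a brute-force search over the sample ratio :math:`r`. For a fixed budget :math:`C`, the number of high-fidelity samples is :math:`N=C/(C_\alpha+rC_\kappa)`, and the theoretical variance of the two-model ACV estimator is :math:`\gamma\var{f_\V{\alpha}}/N` with :math:`\gamma` as derived above.

.. code-block:: default


    cov = benchmark.model_covariance[:2, :2]
    rho = cov[0, 1]/np.sqrt(cov[0, 0]*cov[1, 1])
    budget = 20.  # total cost in units of one high-fidelity evaluation (assumed value)
    costs = model_costs[:2]
    ratios = np.arange(2, 201)
    # number of high-fidelity samples affordable for each candidate ratio
    # (left fractional here for simplicity)
    nhf = budget/(costs[0] + ratios*costs[1])
    # theoretical ACV estimator variance: gamma * Var[f_0] / N
    variances = (1 - (ratios - 1)/ratios*rho**2)*cov[0, 0]/nhf
    best = np.argmin(variances)
    print("best sample ratio r =", ratios[best])
    print("affordable high-fidelity samples N =", nhf[best])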
.. GENERATED FROM PYTHON SOURCE LINES 150-151

Now let's compute the variance reduction for different sample sizes.

.. GENERATED FROM PYTHON SOURCE LINES 151-165

.. code-block:: default


    from pyapprox import interface

    model_ensemble = interface.ModelEnsemble(model.models[:2])
    nhf_samples = 10
    ntrials = 1000
    nsample_ratios = np.array([10])
    nsamples_per_model = np.hstack((1, nsample_ratios))*nhf_samples
    target_cost = np.dot(model_costs[:2], nsamples_per_model)
    means, numerical_var, true_var = \
        multifidelity.estimate_variance(
            model_ensemble, est, target_cost, ntrials, nsample_ratios)

    print("Theoretical ACV variance", true_var)
    print("Achieved ACV variance", numerical_var)

.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Theoretical ACV variance 0.014971109874541333
    Achieved ACV variance 0.01617124648499629

.. GENERATED FROM PYTHON SOURCE LINES 166-167

Let us also plot the distribution of these estimators.

.. GENERATED FROM PYTHON SOURCE LINES 167-185

.. code-block:: default


    fig, ax = plt.subplots()
    ax.hist(means[:, 0], bins=ntrials//100, density=True, alpha=0.5,
            label=r'$Q_{0, N}$')
    ax.hist(means[:, 1], bins=ntrials//100, density=True, alpha=0.5,
            label=r'$Q_{0, N, %d}^\mathrm{CV}$' % nsample_ratios[0])

    nsample_ratios = np.array([100])
    nsamples_per_model = np.hstack((1, nsample_ratios))*nhf_samples
    target_cost = np.dot(model_costs[:2], nsamples_per_model)
    means, numerical_var, true_var = \
        multifidelity.estimate_variance(
            model_ensemble, est, target_cost, ntrials, nsample_ratios)
    ax.hist(means[:, 1], bins=ntrials//100, density=True, alpha=0.5,
            label=r'$Q_{0, N, %d}^\mathrm{CV}$' % nsample_ratios[0])
    ax.axvline(x=0, c='k', label=r'$E[Q_0]$')
    _ = ax.legend(loc='upper left')

.. image-sg:: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_approximate_control_variate_monte_carlo_002.png
   :alt: plot approximate control variate monte carlo
   :srcset: /auto_tutorials/multi_fidelity/images/sphx_glr_plot_approximate_control_variate_monte_carlo_002.png
   :class: sphx-glr-single-img

.. GENERATED FROM PYTHON SOURCE LINES 186-187

For a fixed number of high-fidelity evaluations :math:`N`, the ACVMC variance reduction converges to the CVMC variance reduction as the sample ratio :math:`r` increases. Try changing :math:`N`.

.. GENERATED FROM PYTHON SOURCE LINES 189-192

References
^^^^^^^^^^
.. [GGEJJCP2020] `A generalized approximate control variate framework for multifidelity uncertainty quantification, Journal of Computational Physics, 408:109257, 2020. <https://doi.org/10.1016/j.jcp.2020.109257>`_


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** ( 0 minutes 0.664 seconds)


.. _sphx_glr_download_auto_tutorials_multi_fidelity_plot_approximate_control_variate_monte_carlo.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_approximate_control_variate_monte_carlo.py <plot_approximate_control_variate_monte_carlo.py>`

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_approximate_control_variate_monte_carlo.ipynb <plot_approximate_control_variate_monte_carlo.ipynb>`

.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_