.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_tutorials/surrogates/plot_tensor_product_interpolation.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_tutorials_surrogates_plot_tensor_product_interpolation.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_tutorials_surrogates_plot_tensor_product_interpolation.py:


Tensor-product Interpolation
============================
Many simulation models are so computationally expensive that adequately
understanding their behaviour and quantifying uncertainty can be intractable
with the techniques discussed previously. Various methods have been developed
to produce surrogates of the model response to uncertain parameters; the most
efficient are goal-oriented in nature and target very specific uncertainty
measures. Generally speaking, surrogates are built using a "small" number of
model simulations and are then substituted for the expensive simulation model
in subsequent analyses.

Some of the most popular surrogate types include polynomial chaos expansions
(PCE) [XKSISC2002]_, Gaussian processes (GP) [RWMIT2006]_, and sparse grids
(SG) [BGAN2004]_. Reduced order models (e.g. [SFIJNME2017]_) can also be used
to construct surrogates and have been applied successfully for UQ in many
applications. These methods do not construct response surface approximations,
but rather solve the governing equations on a reduced basis. PyApprox does
not currently implement reduced order modeling; however, the modeling and
analysis tools found in PyApprox can easily be applied to assess or design
systems based on reduced order models.

The use of surrogates for model analysis consists of two phases:
(1) construction; and (2) post-processing.

Construction
------------
In this section we show how to construct a surrogate using tensor-product
Lagrange interpolation.

Tensor-product Lagrange interpolation
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Let :math:`\hat{f}_{\boldsymbol{\alpha},\boldsymbol{\beta}}(\mathbf{z})` be an
M-point tensor-product interpolant of the function
:math:`\hat{f}_{\boldsymbol{\alpha}}`. This interpolant is a weighted linear
combination of tensor products of univariate Lagrange polynomials

.. math::

   \phi_{i,j}(z_i) = \prod_{k=1,k\neq j}^{m_{\beta_i}}\frac{z_i-z_i^{(k)}}{z_i^{(j)}-z_i^{(k)}}, \quad i\in[d],

defined on a set of univariate points
:math:`z_{i}^{(j)}, j\in[m_{\beta_i}]`. Specifically, the multivariate
interpolant is given by

.. math::

   \hat{f}_{\boldsymbol{\alpha},\boldsymbol{\beta}}(\mathbf{z}) = \sum_{\boldsymbol{j}\le\boldsymbol{\beta}} \hat{f}_{\boldsymbol{\alpha}}(\mathbf{z}^{(\boldsymbol{j})})\prod_{i\in[d]}\phi_{i,j_i}(z_i).

The partial ordering :math:`\boldsymbol{j}\le\boldsymbol{\beta}` holds if all
of the component-wise conditions hold.
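To make the definition of :math:`\phi_{i,j}` concrete, the short sketch below
evaluates a univariate Lagrange basis polynomial directly from the product
formula above and verifies the cardinality property
:math:`\phi_{i,j}(z_i^{(k)})=\delta_{jk}`, which is what makes the expansion
interpolatory. It uses plain NumPy only; the helper name
``lagrange_basis_value`` is purely illustrative and is not part of PyApprox.

.. code-block:: default

    import numpy as np


    def lagrange_basis_value(pts_1d, jj, zz):
        # phi_j(z) = prod_{k != j} (z - z_k) / (z_j - z_k)
        vals = np.ones_like(zz, dtype=float)
        for kk in range(pts_1d.shape[0]):
            if kk != jj:
                vals *= (zz - pts_1d[kk])/(pts_1d[jj] - pts_1d[kk])
        return vals


    pts_1d = np.linspace(-1, 1, 5)  # any 5 distinct nodes on [-1, 1]
    # the j-th basis polynomial is one at the j-th node and zero at the others
    basis_at_nodes = np.array(
        [lagrange_basis_value(pts_1d, jj, pts_1d) for jj in range(5)])
    assert np.allclose(basis_at_nodes, np.eye(5))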
Constructing the interpolant requires evaluating the function
:math:`\hat{f}_{\boldsymbol{\alpha}}` on the grid of points

.. math::

   \mathcal{Z}_{\boldsymbol{\beta}} = \bigotimes_{i=1}^d \mathcal{Z}_{\beta_i}^i=\begin{bmatrix}\mathbf{z}^{(1)} & \cdots&\mathbf{z}^{(M_{\boldsymbol{\beta}})}\end{bmatrix}\in\mathbb{R}^{d\times M_{\boldsymbol{\beta}}}.

We denote the resulting function evaluations by

.. math::

   \mathcal{F}_{\boldsymbol{\alpha},\boldsymbol{\beta}}=\hat{f}_{\boldsymbol{\alpha}}(\mathcal{Z}_{\boldsymbol{\beta}})=\begin{bmatrix}\hat{f}_{\boldsymbol{\alpha}}(\mathbf{z}^{(1)}) \quad \cdots\quad \hat{f}_{\boldsymbol{\alpha}}(\mathbf{z}^{(M_{\boldsymbol{\beta}})})\end{bmatrix}^T\in\mathbb{R}^{M_{\boldsymbol{\beta}}\times q},

where the number of points in the grid is
:math:`M_{\boldsymbol{\beta}}=\prod_{i\in[d]} m_{\beta_i}`.

It is often reasonable to assume that, for any :math:`\mathbf{z}`, the cost of
each simulation is constant for a given :math:`\boldsymbol{\alpha}`. So,
letting :math:`W_{\boldsymbol{\alpha}}` denote the cost of a single
simulation, we can write the total cost of evaluating the interpolant as
:math:`W_{\boldsymbol{\alpha},\boldsymbol{\beta}}=W_{\boldsymbol{\alpha}} M_{\boldsymbol{\beta}}`.
Here we have assumed that the computational effort to compute the interpolant
once the data has been obtained is negligible, which is true for sufficiently
expensive models :math:`\hat{f}_{\boldsymbol{\alpha}}`.

Here we will use the nested Clenshaw-Curtis points

.. math::

   z_{i}^{(j)}=\cos\left(\frac{(j-1)\pi}{m_{\beta_i}-1}\right),\qquad j=1,\ldots,m_{\beta_i}

to define the univariate Lagrange polynomials. The number of points
:math:`m(l)` of this rule grows exponentially with the level :math:`l`;
specifically :math:`m(0)=1` and :math:`m(l)=2^{l}+1` for :math:`l\geq1`.
The univariate Clenshaw-Curtis points, the tensor-product grid
:math:`\mathcal{Z}_{\boldsymbol{\beta}}`, and two multivariate Lagrange
polynomials with their corresponding univariate Lagrange polynomials are
shown below for :math:`\boldsymbol{\beta}=(2,3)`, i.e. 5 and 9 points in the
first and second dimensions, respectively.

.. GENERATED FROM PYTHON SOURCE LINES 53-88

.. code-block:: default

    import numpy as np
    from functools import partial

    from pyapprox.util.utilities import cartesian_product
    from pyapprox.util.visualization import get_meshgrid_function_data, plt
    from pyapprox.util.utilities import get_tensor_product_quadrature_rule
    from pyapprox.surrogates.orthopoly.quadrature import clenshaw_curtis_pts_wts_1D
    from pyapprox.surrogates.approximate import adaptive_approximate
    from pyapprox.surrogates.interp.adaptive_sparse_grid import (
        tensor_product_refinement_indicator)
    from pyapprox.surrogates.orthopoly.quadrature import (
        clenshaw_curtis_in_polynomial_order, clenshaw_curtis_rule_growth)
    from pyapprox.surrogates.interp.tensorprod import (
        canonical_univariate_piecewise_polynomial_quad_rule)
    from pyapprox.benchmarks.benchmarks import setup_benchmark
    from pyapprox.surrogates.interp.tensorprod import (
        UnivariatePiecewiseQuadraticBasis, UnivariateLagrangeBasis,
        TensorProductInterpolant, TensorProductBasis)

    nnodes_1d = [5, 9]
    nodes_1d = [-np.cos(np.arange(nnodes)*np.pi/(nnodes-1))
                for nnodes in nnodes_1d]
    nodes = cartesian_product(nodes_1d)

    lagrange_basis_1d = UnivariateLagrangeBasis()
    tp_lagrange_basis = TensorProductBasis([lagrange_basis_1d]*2)

    fig = plt.figure(figsize=(2*8, 6))
    ax = fig.add_subplot(1, 2, 1, projection='3d')
    ii, jj = 1, 3
    tp_lagrange_basis.plot_single_basis(ax, nodes_1d, ii, jj, nodes)

    ax = fig.add_subplot(1, 2, 2, projection='3d')
    level = 2  # this level is also used later to build a quadrature rule
    ii, jj = 2, 4
    tp_lagrange_basis.plot_single_basis(ax, nodes_1d, ii, jj, nodes)

.. image-sg:: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_001.png
   :alt: plot tensor product interpolation
   :srcset: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_001.png
   :class: sphx-glr-single-img
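As an aside, the nestedness of the Clenshaw-Curtis rule mentioned above can be
checked numerically: the level-:math:`l` points are a subset of the
level-:math:`(l+1)` points, which is what allows function evaluations to be
reused when a grid is refined. The sketch below uses plain NumPy and writes
out the cosine formula directly; the helper ``clenshaw_curtis_points`` is
illustrative and is not the PyApprox quadrature routine used later.

.. code-block:: default

    def clenshaw_curtis_points(level):
        # m(0) = 1 and m(l) = 2**l + 1 for l >= 1 (the growth rule above)
        nnodes = 1 if level == 0 else 2**level + 1
        if nnodes == 1:
            return np.zeros(1)
        return -np.cos(np.arange(nnodes)*np.pi/(nnodes-1))


    for ll in range(4):
        coarse, fine = clenshaw_curtis_points(ll), clenshaw_curtis_points(ll+1)
        # every coarse point appears (to machine precision) in the finer rule
        assert np.all(np.isclose(coarse[:, None], fine[None, :]).any(axis=1))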
.. GENERATED FROM PYTHON SOURCE LINES 89-92

To construct a surrogate using tensor-product interpolation we simply weight
each basis function by the value of the function :math:`f_\ai` at the
corresponding interpolation point. The following uses tensor-product
interpolation to approximate the simple function

.. math:: f_\ai(\rv) = \cos(2\pi\rv_1)\cos(2\pi\rv_2), \qquad \rv\in\rvdom=[-1,1]^2

.. GENERATED FROM PYTHON SOURCE LINES 92-119

.. code-block:: default

    def fun(z):
        return (np.cos(2*np.pi*z[0, :]) *
                np.cos(2*np.pi*z[1, :]))[:, np.newaxis]


    lagrange_interpolant = TensorProductInterpolant([lagrange_basis_1d]*2)
    values = fun(nodes)
    lagrange_interpolant.fit(nodes_1d, values)

    marker_color = 'k'
    alpha = 1.0
    fig, axs = plt.subplots(1, 1, figsize=(8, 6))
    axs.plot(nodes[0, :], nodes[1, :], 'o', color=marker_color, ms=10,
             alpha=alpha)
    plot_limits = [-1, 1, -1, 1]
    num_pts_1d = 101
    X, Y, Z = get_meshgrid_function_data(
        lagrange_interpolant, plot_limits, num_pts_1d)
    num_contour_levels = 10
    levels = np.linspace(Z.min(), Z.max(), num_contour_levels)
    cset = axs.contourf(
        X, Y, Z, levels=levels, cmap="coolwarm", alpha=alpha)

.. image-sg:: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_002.png
   :alt: plot tensor product interpolation
   :srcset: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_002.png
   :class: sphx-glr-single-img

.. GENERATED FROM PYTHON SOURCE LINES 120-125

The error in the tensor-product interpolant satisfies

.. math:: \lVert f_\ai-f_{\ai,\bi}\rVert_{L^\infty(\rvdom)} \le C_{d,s} N_{\bi}^{-s/d},

where :math:`f_\ai` has continuous mixed derivatives of order :math:`s`.

.. GENERATED FROM PYTHON SOURCE LINES 128-153

Post-processing
---------------
Once a surrogate has been constructed it can be used for many different
purposes. For example, one can use it to estimate moments, perform
sensitivity analysis, or simply evaluate an approximation of the expensive
model at new samples for which simulation data is not available.

To use the surrogate for computing moments we simply draw realizations of the
input random variables :math:`\rv` and evaluate the surrogate at those
samples. We can approximate the mean of the expensive simulation model as the
average of the surrogate values at the random samples. We know from
:ref:`sphx_glr_auto_tutorials_multi_fidelity_plot_monte_carlo.py` that the
error in the Monte Carlo estimate of the mean using the surrogate is

.. math::

   \mean{\left(Q_{\alpha}-\mean{Q}\right)^2}&=N^{-1}\var{Q_\alpha}+\left(\mean{Q_{\alpha}}-\mean{Q}\right)^2\\
   &\le N^{-1}\var{Q_\alpha}+C_{d,s} N_{\bi}^{-s/d}

Because a surrogate is inexpensive to evaluate, the first term can be driven
towards zero so that only the bias remains. Thus the error in the Monte Carlo
estimate of the mean using the surrogate is dominated by the error in the
surrogate itself. If this error can be reduced more quickly than the
:math:`N^{-1}` Monte Carlo rate (as is the case for low-dimensional
tensor-product interpolation), then using surrogates for computing moments is
very effective.

Note that moments can be estimated without Monte Carlo sampling by leveraging
properties of the univariate interpolation rules used to build the
multivariate interpolant. Specifically, the expectation of a tensor-product
interpolant can be computed without explicitly forming the interpolant and is
given by

.. math:: \mu_{\bi}=\int_{\rvdom} \sum_{\V{j}\le\bi}f_\ai(\rv^{(\V{j})})\prod_{i=1}^d\phi_{i,j_i}(\rv_i) w(\rv)\,d\rv=\sum_{\V{j}\le\bi} f_\ai(\rv^{(\V{j})}) v_{\V{j}}.

The expectation is simply a weighted sum of the function values on the
tensor-product grid, where each weight is a product of univariate quadrature
weights

.. math:: v_{\V{j}}=\prod_{i=1}^d\int_{\rvdom_i}{\phi_{i,j_i}(\rv_i)}\,dw(\rv_i),

which can be computed analytically.
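Before using PyApprox's quadrature helper, here is a minimal sketch of where
the weights :math:`v_{\V{j}}` come from. For an interpolatory rule such as
Clenshaw-Curtis, the integrals of the univariate Lagrange basis functions are
exactly the univariate quadrature weights, so each tensor-product weight is a
product of univariate weights. The sketch reuses
``clenshaw_curtis_pts_wts_1D`` and ``cartesian_product`` imported earlier and
assumes PyApprox's convention that the univariate weights include the uniform
probability density (so they sum to one); the variable names are illustrative
only.

.. code-block:: default

    x1, w1 = clenshaw_curtis_pts_wts_1D(level)  # univariate points and weights
    xx = cartesian_product([x1, x1])            # tensor-product grid
    vv = np.outer(w1, w1).flatten()             # v_j = w_{j_1} * w_{j_2}
    # both dimensions use the same rule here, so the ordering convention of
    # cartesian_product does not affect the weighted sum below
    print('Tensor-product weights mean', fun(xx)[:, 0].dot(vv))

Under these assumptions this should reproduce the value computed with
``get_tensor_product_quadrature_rule`` below.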
.. GENERATED FROM PYTHON SOURCE LINES 153-156

.. code-block:: default

    x, w = get_tensor_product_quadrature_rule(
        level, 2, clenshaw_curtis_pts_wts_1D)
    surrogate_mean = fun(x)[:, 0].dot(w)
    print('Quadrature mean', surrogate_mean)

.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Quadrature mean 0.10540659444426355

.. GENERATED FROM PYTHON SOURCE LINES 157-160

Here we have recomputed the values of :math:`f` at the interpolation samples,
but in practice we should just reuse the values collected when building the
interpolant. Now let us compare the quadrature mean with the Monte Carlo mean
computed using the surrogate.

.. GENERATED FROM PYTHON SOURCE LINES 160-166

.. code-block:: default

    num_samples = int(1e6)
    samples = np.random.uniform(-1, 1, (2, num_samples))
    values = lagrange_interpolant(samples)
    mc_mean = values.mean()
    print('Monte Carlo surrogate mean', mc_mean)

.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Monte Carlo surrogate mean -0.00010377793672674781

.. GENERATED FROM PYTHON SOURCE LINES 167-172

Piecewise-polynomial approximation
----------------------------------
Polynomial interpolation accurately approximates smooth functions; however,
its accuracy degrades as the regularity of the target function decreases. For
piecewise-continuous functions, or functions with only a limited number of
continuous derivatives, piecewise-polynomial approximation may be more
appropriate. The following plots two piecewise-quadratic basis functions in
2D.

.. GENERATED FROM PYTHON SOURCE LINES 172-183

.. code-block:: default

    fig = plt.figure(figsize=(2*8, 6))
    nnodes_1d = [5, 5]
    nodes_1d = [np.linspace(-1, 1, nnodes) for nnodes in nnodes_1d]
    nodes = cartesian_product(nodes_1d)
    tp_quadratic_basis = TensorProductBasis(
        [UnivariatePiecewiseQuadraticBasis()]*2)

    ax = fig.add_subplot(1, 2, 1, projection='3d')
    tp_quadratic_basis.plot_single_basis(ax, nodes_1d, 2, 2, nodes)
    ax = fig.add_subplot(1, 2, 2, projection='3d')
    tp_quadratic_basis.plot_single_basis(ax, nodes_1d, 0, 1, nodes)

.. image-sg:: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_003.png
   :alt: plot tensor product interpolation
   :srcset: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_003.png
   :class: sphx-glr-single-img
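The ``TensorProductInterpolant`` workflow used earlier for the Lagrange basis
applies unchanged to the piecewise basis. As a quick sketch (reusing ``fun``
and the equidistant ``nodes_1d`` and ``nodes`` defined in the block above;
this is illustrative rather than part of the convergence study that follows),
a piecewise-quadratic interpolant can be built and evaluated in the same way:

.. code-block:: default

    quadratic_interpolant = TensorProductInterpolant(
        [UnivariatePiecewiseQuadraticBasis()]*2)
    quadratic_interpolant.fit(nodes_1d, fun(nodes))
    test_samples = np.random.uniform(-1, 1, (2, 4))
    # compare the interpolant with the true function at a few random samples
    print(np.abs(quadratic_interpolant(test_samples) - fun(test_samples)).max())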
.. GENERATED FROM PYTHON SOURCE LINES 184-187

The following compares the convergence of Lagrange and piecewise-polynomial
tensor-product interpolants. Change the benchmark to see the effect of
smoothness on the approximation accuracy. First define wrappers to build the
tensor-product interpolants.

.. GENERATED FROM PYTHON SOURCE LINES 187-215

.. code-block:: default

    def build_lagrange_tp(max_level_1d):
        univariate_quad_rule_info = [
            clenshaw_curtis_in_polynomial_order, clenshaw_curtis_rule_growth,
            None, None]
        return adaptive_approximate(
            benchmark.fun, benchmark.variable, "sparse_grid",
            {"refinement_indicator": tensor_product_refinement_indicator,
             "max_level_1d": max_level_1d,
             "univariate_quad_rule_info": univariate_quad_rule_info,
             "max_nsamples": np.inf}).approx


    def build_piecewise_tp(max_level_1d):
        basis_type = "quadratic"
        # basis_type = "linear"
        univariate_quad_rule_info = [
            partial(canonical_univariate_piecewise_polynomial_quad_rule,
                    basis_type),
            clenshaw_curtis_rule_growth, None, None]
        return adaptive_approximate(
            benchmark.fun, benchmark.variable, "sparse_grid",
            {"refinement_indicator": tensor_product_refinement_indicator,
             "max_level_1d": max_level_1d,
             "univariate_quad_rule_info": univariate_quad_rule_info,
             "basis_type": basis_type,
             "max_nsamples": np.inf}).approx

.. GENERATED FROM PYTHON SOURCE LINES 216-217

Load a benchmark.

.. GENERATED FROM PYTHON SOURCE LINES 217-222

.. code-block:: default

    nvars = 2
    benchmark = setup_benchmark("genz", nvars=nvars, test_name="oscillatory")
    # benchmark = setup_benchmark("genz", nvars=nvars, test_name="c0continuous",
    #                             c_factor=0.5, w=0.5)

.. GENERATED FROM PYTHON SOURCE LINES 223-224

Run a convergence study.

.. GENERATED FROM PYTHON SOURCE LINES 224-252

.. code-block:: default

    validation_samples = benchmark.variable.rvs(1000)
    validation_values = benchmark.fun(validation_samples)

    piecewise_data = []
    lagrange_data = []
    for level in range(1, 7):  # nvars = 2
        # for level in range(1, 4):  # nvars = 3
        ltp = build_lagrange_tp(level)
        lvalues = ltp(validation_samples)
        lerror = np.linalg.norm(validation_values-lvalues)/np.linalg.norm(
            validation_values)
        lagrange_data.append([ltp.samples.shape[1], lerror])

        ptp = build_piecewise_tp(level)
        pvalues = ptp(validation_samples)
        perror = np.linalg.norm(validation_values-pvalues)/np.linalg.norm(
            validation_values)
        piecewise_data.append([ptp.samples.shape[1], perror])

    lagrange_data = np.array(lagrange_data).T
    piecewise_data = np.array(piecewise_data).T

    ax = plt.subplots()[1]
    ax.loglog(*lagrange_data, '-o', label='Lagrange')
    ax.loglog(*piecewise_data, '--o', label='Piecewise')
    work = piecewise_data[0][1:3]
    ax.loglog(work, work**(-1.0), ':', label='linear rate')
    ax.loglog(work, work**(-2.0), ':', label='quadratic rate')
    _ = ax.legend()

.. image-sg:: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_004.png
   :alt: plot tensor product interpolation
   :srcset: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_004.png
   :class: sphx-glr-single-img
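The observed convergence rates can also be read off the data just plotted.
The short sketch below (plain NumPy, reusing ``lagrange_data`` and
``piecewise_data`` from above) fits a line to each error curve in log-log
space; the slope is the empirical algebraic rate. For the smooth oscillatory
benchmark the Lagrange interpolant converges faster than any fixed algebraic
rate, so its fitted slope should only be read as indicative.

.. code-block:: default

    for name, data in [("Lagrange", lagrange_data),
                       ("Piecewise", piecewise_data)]:
        # slope of log(error) versus log(number of samples)
        slope = np.polyfit(np.log(data[0]), np.log(data[1]), 1)[0]
        print(name, "observed rate", slope)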
.. GENERATED FROM PYTHON SOURCE LINES 253-256

Similar behavior occurs when using quadrature. Load in the benchmark.

.. GENERATED FROM PYTHON SOURCE LINES 256-259

.. code-block:: default

    nvars = 2
    benchmark = setup_benchmark("genz", nvars=nvars, test_name="oscillatory")

.. GENERATED FROM PYTHON SOURCE LINES 260-261

Run a convergence study.

.. GENERATED FROM PYTHON SOURCE LINES 261-285

.. code-block:: default

    piecewise_data = []
    lagrange_data = []
    for level in range(1, 7):  # nvars = 2
        ltp = build_lagrange_tp(level)
        lvalues = ltp.moments()[0, 0]
        lerror = np.linalg.norm(benchmark.mean-lvalues)/np.linalg.norm(
            benchmark.mean)
        lagrange_data.append([ltp.samples.shape[1], lerror])

        ptp = build_piecewise_tp(level)
        pvalues = ptp.moments()[0, 0]
        perror = np.linalg.norm(benchmark.mean-pvalues)/np.linalg.norm(
            benchmark.mean)
        piecewise_data.append([ptp.samples.shape[1], perror])

    lagrange_data = np.array(lagrange_data).T
    piecewise_data = np.array(piecewise_data).T

    ax = plt.subplots()[1]
    ax.loglog(*lagrange_data, '-o', label='Lagrange')
    ax.loglog(*piecewise_data, '--o', label='Piecewise')
    work = piecewise_data[0][1:3]
    ax.loglog(work, work**(-1.0), ':', label='linear rate')
    ax.loglog(work, work**(-2.0), ':', label='quadratic rate')
    _ = ax.legend()

.. image-sg:: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_005.png
   :alt: plot tensor product interpolation
   :srcset: /auto_tutorials/surrogates/images/sphx_glr_plot_tensor_product_interpolation_005.png
   :class: sphx-glr-single-img

.. GENERATED FROM PYTHON SOURCE LINES 286-296

References
^^^^^^^^^^
.. [XKSISC2002] D. Xiu and G.E. Karniadakis. The Wiener-Askey Polynomial Chaos
   for stochastic differential equations. SIAM J. Sci. Comput., 24(2),
   619-644, 2002.

.. [RWMIT2006] C.E. Rasmussen and C. Williams. Gaussian Processes for Machine
   Learning. MIT Press, 2006.

.. [BGAN2004] H. Bungartz and M. Griebel. Sparse Grids. Acta Numerica, 13,
   147-269, 2004.

.. [SFIJNME2017] C. Soize and C. Farhat. A nonparametric probabilistic
   approach for quantifying uncertainties in low-dimensional and
   high-dimensional nonlinear models. International Journal for Numerical
   Methods in Engineering, 109(6), 837-888, 2017.


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** ( 0 minutes 2.034 seconds)


.. _sphx_glr_download_auto_tutorials_surrogates_plot_tensor_product_interpolation.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_tensor_product_interpolation.py <plot_tensor_product_interpolation.py>`

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_tensor_product_interpolation.ipynb <plot_tensor_product_interpolation.ipynb>`

.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_