Probability Theory Copula

In probability theory and statistics, a copula is a multivariate cumulative distribution function for which the marginal probability distribution of each variable is uniform on the interval .

Copulas are used to describe/model the dependence (inter-correlation) between random variables. Their name, introduced by applied mathematician Abe Sklar in 1959, comes from the Latin for "link" or "tie", similar but unrelated to grammatical copulas in linguistics. Copulas have been used widely in quantitative finance to model and minimize tail risk and portfolio-optimization applications.

Sklar's theorem states that any multivariate joint distribution can be written in terms of univariate marginal distribution functions and a copula which describes the dependence structure between the variables.

Copulas are popular in high-dimensional statistical applications as they allow one to easily model and estimate the distribution of random vectors by estimating marginals and copulae separately. There are many parametric copula families available, which usually have parameters that control the strength of dependence. Some popular parametric copula models are outlined below.

Two-dimensional copulas are known in some other areas of mathematics under the name permutons and doubly-stochastic measures.

Mathematical definition

Consider a random vector Probability Theory Copula . Suppose its marginals are continuous, i.e. the marginal CDFs Probability Theory Copula  are continuous functions. By applying the probability integral transform to each component, the random vector

    Probability Theory Copula 

has marginals that are uniformly distributed on the interval [0, 1].

The copula of Probability Theory Copula  is defined as the joint cumulative distribution function of Probability Theory Copula :

    Probability Theory Copula 

The copula C contains all information on the dependence structure between the components of Probability Theory Copula  whereas the marginal cumulative distribution functions Probability Theory Copula  contain all information on the marginal distributions of Probability Theory Copula .

The reverse of these steps can be used to generate pseudo-random samples from general classes of multivariate probability distributions. That is, given a procedure to generate a sample Probability Theory Copula  from the copula function, the required sample can be constructed as

    Probability Theory Copula 

The generalized inverses Probability Theory Copula  are unproblematic almost surely, since the Probability Theory Copula  were assumed to be continuous. Furthermore, the above formula for the copula function can be rewritten as:

    Probability Theory Copula 

Definition

In probabilistic terms, Probability Theory Copula  is a d-dimensional copula if C is a joint cumulative distribution function of a d-dimensional random vector on the unit cube Probability Theory Copula  with uniform marginals.

In analytic terms, Probability Theory Copula  is a d-dimensional copula if

    • Probability Theory Copula , the copula is zero if any one of the arguments is zero,
    • Probability Theory Copula , the copula is equal to u if one argument is u and all others 1,
    • C is d-non-decreasing, i.e., for each hyperrectangle Probability Theory Copula  the C-volume of B is non-negative:
        Probability Theory Copula 
      where the Probability Theory Copula .

For instance, in the bivariate case, Probability Theory Copula  is a bivariate copula if Probability Theory Copula , Probability Theory Copula  and Probability Theory Copula  for all Probability Theory Copula  and Probability Theory Copula .

Sklar's theorem

Probability Theory Copula 
Density and contour plot of a Bivariate Gaussian Distribution
Probability Theory Copula 
Density and contour plot of two Normal marginals joint with a Gumbel copula

Sklar's theorem, named after Abe Sklar, provides the theoretical foundation for the application of copulas. Sklar's theorem states that every multivariate cumulative distribution function

    Probability Theory Copula 

of a random vector Probability Theory Copula  can be expressed in terms of its marginals Probability Theory Copula  and a copula Probability Theory Copula . Indeed:

    Probability Theory Copula 

If the multivariate distribution has a density Probability Theory Copula , and if this density is available, it also holds that

    Probability Theory Copula 

where Probability Theory Copula  is the density of the copula.

The theorem also states that, given Probability Theory Copula , the copula is unique on Probability Theory Copula  which is the cartesian product of the ranges of the marginal cdf's. This implies that the copula is unique if the marginals Probability Theory Copula  are continuous.

The converse is also true: given a copula Probability Theory Copula  and marginals Probability Theory Copula  then Probability Theory Copula  defines a d-dimensional cumulative distribution function with marginal distributions Probability Theory Copula .

Stationarity condition

Copulas mainly work when time series are stationary and continuous. Thus, a very important pre-processing step is to check for the auto-correlation, trend and seasonality within time series.

When time series are auto-correlated, they may generate a non existing dependence between sets of variables and result in incorrect copula dependence structure.

Fréchet–Hoeffding copula bounds

Probability Theory Copula 
Graphs of the bivariate Fréchet–Hoeffding copula limits and of the independence copula (in the middle).

The Fréchet–Hoeffding theorem (after Maurice René Fréchet and Wassily Hoeffding) states that for any copula Probability Theory Copula  and any Probability Theory Copula  the following bounds hold:

    Probability Theory Copula 

The function W is called lower Fréchet–Hoeffding bound and is defined as

    Probability Theory Copula 

The function M is called upper Fréchet–Hoeffding bound and is defined as

    Probability Theory Copula 

The upper bound is sharp: M is always a copula, it corresponds to comonotone random variables.

The lower bound is point-wise sharp, in the sense that for fixed u, there is a copula Probability Theory Copula  such that Probability Theory Copula . However, W is a copula only in two dimensions, in which case it corresponds to countermonotonic random variables.

In two dimensions, i.e. the bivariate case, the Fréchet–Hoeffding theorem states

    Probability Theory Copula 

Families of copulas

Several families of copulas have been described.

Gaussian copula

Probability Theory Copula 
Cumulative and density distribution of Gaussian copula with ρ = 0.4

The Gaussian copula is a distribution over the unit hypercube Probability Theory Copula . It is constructed from a multivariate normal distribution over Probability Theory Copula  by using the probability integral transform.

For a given correlation matrix Probability Theory Copula , the Gaussian copula with parameter matrix Probability Theory Copula  can be written as

    Probability Theory Copula 

where Probability Theory Copula  is the inverse cumulative distribution function of a standard normal and Probability Theory Copula  is the joint cumulative distribution function of a multivariate normal distribution with mean vector zero and covariance matrix equal to the correlation matrix Probability Theory Copula . While there is no simple analytical formula for the copula function, Probability Theory Copula , it can be upper or lower bounded, and approximated using numerical integration. The density can be written as

    Probability Theory Copula 

where Probability Theory Copula  is the identity matrix.

Archimedean copulas

Archimedean copulas are an associative class of copulas. Most common Archimedean copulas admit an explicit formula, something not possible for instance for the Gaussian copula. In practice, Archimedean copulas are popular because they allow modeling dependence in arbitrarily high dimensions with only one parameter, governing the strength of dependence.

A copula C is called Archimedean if it admits the representation

    Probability Theory Copula 

where Probability Theory Copula  is a continuous, strictly decreasing and convex function such that Probability Theory Copula , Probability Theory Copula  is a parameter within some parameter space Probability Theory Copula , and Probability Theory Copula  is the so-called generator function and Probability Theory Copula  is its pseudo-inverse defined by

    Probability Theory Copula 

Moreover, the above formula for C yields a copula for Probability Theory Copula  if and only if Probability Theory Copula  is d-monotone on Probability Theory Copula . That is, if it is Probability Theory Copula  times differentiable and the derivatives satisfy

    Probability Theory Copula 

for all Probability Theory Copula  and Probability Theory Copula  and Probability Theory Copula  is nonincreasing and convex.

Most important Archimedean copulas

The following tables highlight the most prominent bivariate Archimedean copulas, with their corresponding generator. Not all of them are completely monotone, i.e. d-monotone for all Probability Theory Copula  or d-monotone for certain Probability Theory Copula  only.

Table with the most important Archimedean copulas
Name of copula Bivariate copula Probability Theory Copula  parameter Probability Theory Copula  generator Probability Theory Copula  generator inverse Probability Theory Copula 
Ali–Mikhail–Haq   Probability Theory Copula    Probability Theory Copula     Probability Theory Copula      Probability Theory Copula 
Clayton   Probability Theory Copula    Probability Theory Copula      Probability Theory Copula      Probability Theory Copula     
Frank   Probability Theory Copula      Probability Theory Copula       Probability Theory Copula      Probability Theory Copula     
Gumbel   Probability Theory Copula    Probability Theory Copula     Probability Theory Copula         Probability Theory Copula 
Independence   Probability Theory Copula        Probability Theory Copula         Probability Theory Copula 
Joe   Probability Theory Copula      Probability Theory Copula      Probability Theory Copula         Probability Theory Copula 

Expectation for copula models and Monte Carlo integration

In statistical applications, many problems can be formulated in the following way. One is interested in the expectation of a response function Probability Theory Copula  applied to some random vector Probability Theory Copula . If we denote the CDF of this random vector with Probability Theory Copula , the quantity of interest can thus be written as

    Probability Theory Copula 

If Probability Theory Copula  is given by a copula model, i.e.,

    Probability Theory Copula 

this expectation can be rewritten as

    Probability Theory Copula 

In case the copula C is absolutely continuous, i.e. C has a density c, this equation can be written as

    Probability Theory Copula 

and if each marginal distribution has the density Probability Theory Copula  it holds further that

    Probability Theory Copula 

If copula and marginals are known (or if they have been estimated), this expectation can be approximated through the following Monte Carlo algorithm:

  1. Draw a sample Probability Theory Copula  of size n from the copula C
  2. By applying the inverse marginal cdf's, produce a sample of Probability Theory Copula  by setting Probability Theory Copula 
  3. Approximate Probability Theory Copula  by its empirical value:
        Probability Theory Copula 

Empirical copulas

When studying multivariate data, one might want to investigate the underlying copula. Suppose we have observations

    Probability Theory Copula 

from a random vector Probability Theory Copula  with continuous marginals. The corresponding “true” copula observations would be

    Probability Theory Copula 

However, the marginal distribution functions Probability Theory Copula  are usually not known. Therefore, one can construct pseudo copula observations by using the empirical distribution functions

    Probability Theory Copula 

instead. Then, the pseudo copula observations are defined as

    Probability Theory Copula 

The corresponding empirical copula is then defined as

    Probability Theory Copula 

The components of the pseudo copula samples can also be written as Probability Theory Copula , where Probability Theory Copula  is the rank of the observation Probability Theory Copula :

    Probability Theory Copula 

Therefore, the empirical copula can be seen as the empirical distribution of the rank transformed data.

The sample version of Spearman's rho:

    Probability Theory Copula 

Applications

Quantitative finance

Probability Theory Copula 
Examples of bivariate copulæ used in finance.
Typical finance applications:

In quantitative finance copulas are applied to risk management, to portfolio management and optimization, and to derivatives pricing.

For the former, copulas are used to perform stress-tests and robustness checks that are especially important during "downside/crisis/panic regimes" where extreme downside events may occur (e.g., the global financial crisis of 2007–2008). The formula was also adapted for financial markets and was used to estimate the probability distribution of losses on pools of loans or bonds.

During a downside regime, a large number of investors who have held positions in riskier assets such as equities or real estate may seek refuge in 'safer' investments such as cash or bonds. This is also known as a flight-to-quality effect and investors tend to exit their positions in riskier assets in large numbers in a short period of time. As a result, during downside regimes, correlations across equities are greater on the downside as opposed to the upside and this may have disastrous effects on the economy. For example, anecdotally, we often read financial news headlines reporting the loss of hundreds of millions of dollars on the stock exchange in a single day; however, we rarely read reports of positive stock market gains of the same magnitude and in the same short time frame.

Copulas aid in analyzing the effects of downside regimes by allowing the modelling of the marginals and dependence structure of a multivariate probability model separately. For example, consider the stock exchange as a market consisting of a large number of traders each operating with his/her own strategies to maximize profits. The individualistic behaviour of each trader can be described by modelling the marginals. However, as all traders operate on the same exchange, each trader's actions have an interaction effect with other traders'. This interaction effect can be described by modelling the dependence structure. Therefore, copulas allow us to analyse the interaction effects which are of particular interest during downside regimes as investors tend to herd their trading behaviour and decisions. (See also agent-based computational economics, where price is treated as an emergent phenomenon, resulting from the interaction of the various market participants, or agents.)

The users of the formula have been criticized for creating "evaluation cultures" that continued to use simple copulæ despite the simple versions being acknowledged as inadequate for that purpose. Thus, previously, scalable copula models for large dimensions only allowed the modelling of elliptical dependence structures (i.e., Gaussian and Student-t copulas) that do not allow for correlation asymmetries where correlations differ on the upside or downside regimes. However, the development of vine copulas (also known as pair copulas) enables the flexible modelling of the dependence structure for portfolios of large dimensions. The Clayton canonical vine copula allows for the occurrence of extreme downside events and has been successfully applied in portfolio optimization and risk management applications. The model is able to reduce the effects of extreme downside correlations and produces improved statistical and economic performance compared to scalable elliptical dependence copulas such as the Gaussian and Student-t copula.

Other models developed for risk management applications are panic copulas that are glued with market estimates of the marginal distributions to analyze the effects of panic regimes on the portfolio profit and loss distribution. Panic copulas are created by Monte Carlo simulation, mixed with a re-weighting of the probability of each scenario.

As regards derivatives pricing, dependence modelling with copula functions is widely used in applications of financial risk assessment and actuarial analysis – for example in the pricing of collateralized debt obligations (CDOs). Some believe the methodology of applying the Gaussian copula to credit derivatives to be one of the reasons behind the global financial crisis of 2008–2009; see David X. Li § CDOs and Gaussian copula.

Despite this perception, there are documented attempts within the financial industry, occurring before the crisis, to address the limitations of the Gaussian copula and of copula functions more generally, specifically the lack of dependence dynamics. The Gaussian copula is lacking as it only allows for an elliptical dependence structure, as dependence is only modeled using the variance-covariance matrix. This methodology is limited such that it does not allow for dependence to evolve as the financial markets exhibit asymmetric dependence, whereby correlations across assets significantly increase during downturns compared to upturns. Therefore, modeling approaches using the Gaussian copula exhibit a poor representation of extreme events. There have been attempts to propose models rectifying some of the copula limitations.

Additional to CDOs, copulas have been applied to other asset classes as a flexible tool in analyzing multi-asset derivative products. The first such application outside credit was to use a copula to construct a basket implied volatility surface, taking into account the volatility smile of basket components. Copulas have since gained popularity in pricing and risk management of options on multi-assets in the presence of a volatility smile, in equity-, foreign exchange- and fixed income derivatives.

Civil engineering

Recently, copula functions have been successfully applied to the database formulation for the reliability analysis of highway bridges, and to various multivariate simulation studies in civil engineering, reliability of wind and earthquake engineering, and mechanical & offshore engineering. Researchers are also trying these functions in the field of transportation to understand the interaction between behaviors of individual drivers which, in totality, shapes traffic flow.

Reliability engineering

Copulas are being used for reliability analysis of complex systems of machine components with competing failure modes.

Warranty data analysis

Copulas are being used for warranty data analysis in which the tail dependence is analysed.

Turbulent combustion

Copulas are used in modelling turbulent partially premixed combustion, which is common in practical combustors.

Medicine

Copulæ have many applications in the area of medicine, for example,

  1. Copulæ have been used in the field of magnetic resonance imaging (MRI), for example, to segment images, to fill a vacancy of graphical models in imaging genetics in a study on schizophrenia, and to distinguish between normal and Alzheimer patients.
  2. Copulæ have been in the area of brain research based on EEG signals, for example, to detect drowsiness during daytime nap, to track changes in instantaneous equivalent bandwidths (IEBWs), to derive synchrony for early diagnosis of Alzheimer's disease, to characterize dependence in oscillatory activity between EEG channels, and to assess the reliability of using methods to capture dependence between pairs of EEG channels using their time-varying envelopes. Copula functions have been successfully applied to the analysis of neuronal dependencies and spike counts in neuroscience .
  3. A copula model has been developed in the field of oncology, for example, to jointly model genotypes, phenotypes, and pathways to reconstruct a cellular network to identify interactions between specific phenotype and multiple molecular features (e.g. mutations and gene expression change). Bao et al. used NCI60 cancer cell line data to identify several subsets of molecular features that jointly perform as the predictors of clinical phenotypes. The proposed copula may have an impact on biomedical research, ranging from cancer treatment to disease prevention. Copula has also been used to predict the histological diagnosis of colorectal lesions from colonoscopy images, and to classify cancer subtypes.
  4. A copula-based analysis model has been developed in the field of heart and cardiovascular disease, for example, to predict heart rate (HR) variation. Heart rate (HR) is one of the most critical health indicators for monitoring exercise intensity and load degree because it is closely related to heart rate. Therefore, an accurate short-term HR prediction technique can deliver efficient early warning for human health and decrease harmful events. Namazi (2022) used a novel hybrid algorithm to predict HR.

Geodesy

The combination of SSA and copula-based methods have been applied for the first time as a novel stochastic tool for Earth Orientation Parameters prediction.

Hydrology research

Copulas have been used in both theoretical and applied analyses of hydroclimatic data. Theoretical studies adopted the copula-based methodology for instance to gain a better understanding of the dependence structures of temperature and precipitation, in different parts of the world. Applied studies adopted the copula-based methodology to examine e.g., agricultural droughts or joint effects of temperature and precipitation extremes on vegetation growth.

Climate and weather research

Copulas have been extensively used in climate- and weather-related research.

Solar irradiance variability

Copulas have been used to estimate the solar irradiance variability in spatial networks and temporally for single locations.

Random vector generation

Large synthetic traces of vectors and stationary time series can be generated using empirical copula while preserving the entire dependence structure of small datasets. Such empirical traces are useful in various simulation-based performance studies.

Ranking of electrical motors

Copulas have been used for quality ranking in the manufacturing of electronically commutated motors.

Signal processing

Copulas are important because they represent a dependence structure without using marginal distributions. Copulas have been widely used in the field of finance, but their use in signal processing is relatively new. Copulas have been employed in the field of wireless communication for classifying radar signals, change detection in remote sensing applications, and EEG signal processing in medicine. In this section, a short mathematical derivation to obtain copula density function followed by a table providing a list of copula density functions with the relevant signal processing applications are presented.

Astronomy

Copulas have been used for determining the core radio luminosity function of Active galactic Nuclei (AGNs), while this cannot be realized using traditional methods due to the difficulties in sample completeness.

Mathematical derivation of copula density function

For any two random variables X and Y, the continuous joint probability distribution function can be written as

    Probability Theory Copula 

where Probability Theory Copula  and Probability Theory Copula  are the marginal cumulative distribution functions of the random variables X and Y, respectively.

then the copula distribution function Probability Theory Copula  can be defined using Sklar's theorem as:

    Probability Theory Copula 

where Probability Theory Copula  and Probability Theory Copula  are marginal distribution functions, Probability Theory Copula  joint and Probability Theory Copula .

Assuming Probability Theory Copula  is a.e. twice differentiable, we start by using the relationship between joint probability density function (PDF) and joint cumulative distribution function (CDF) and its partial derivatives.

    Probability Theory Copula 

where Probability Theory Copula  is the copula density function, Probability Theory Copula  and Probability Theory Copula  are the marginal probability density functions of X and Y, respectively. It is important to understand that there are four elements in this equation, and if any three elements are known, the fourth element can be calculated. For example, it may be used,

  • when joint probability density function between two random variables is known, the copula density function is known, and one of the two marginal functions are known, then, the other marginal function can be calculated, or
  • when the two marginal functions and the copula density function are known, then the joint probability density function between the two random variables can be calculated, or
  • when the two marginal functions and the joint probability density function between the two random variables are known, then the copula density function can be calculated.

List of copula density functions and applications

Various bivariate copula density functions are important in the area of signal processing. Probability Theory Copula  and Probability Theory Copula  are marginal distributions functions and Probability Theory Copula  and Probability Theory Copula  are marginal density functions. Extension and generalization of copulas for statistical signal processing have been shown to construct new bivariate copulas for exponential, Weibull, and Rician distributions. Zeng et al. presented algorithms, simulation, optimal selection, and practical applications of these copulas in signal processing.

Copula density: c(u, v) Use
Gaussian Probability Theory Copula  supervised classification of synthetic aperture radar (SAR) images,

validating biometric authentication, modeling stochastic dependence in large-scale integration of wind power, unsupervised classification of radar signals

Exponential Probability Theory Copula  queuing system with infinitely many servers
Rayleigh bivariate exponential, Rayleigh, and Weibull copulas have been proved to be equivalent change detection from SAR images
Weibull bivariate exponential, Rayleigh, and Weibull copulas have been proved to be equivalent digital communication over fading channels
Log-normal bivariate log-normal copula and Gaussian copula are equivalent shadow fading along with multipath effect in wireless channel
Farlie–Gumbel–Morgenstern (FGM) Probability Theory Copula  information processing of uncertainty in knowledge-based systems
Clayton Probability Theory Copula  location estimation of random signal source and hypothesis testing using heterogeneous data
Frank Probability Theory Copula  quantitative risk assessment of geo-hazards
Student's t Probability Theory Copula  supervised SAR image classification,

fusion of correlated sensor decisions

Nakagami-m
Rician

See also

References

Tags:

Probability Theory Copula Mathematical definitionProbability Theory Copula DefinitionProbability Theory Copula Sklars theoremProbability Theory Copula Stationarity conditionProbability Theory Copula Fréchet–Hoeffding copula boundsProbability Theory Copula Families of copulasProbability Theory Copula Expectation for copula models and Monte Carlo integrationProbability Theory Copula Empirical copulasProbability Theory Copula ApplicationsProbability Theory Copula Mathematical derivation of copula density functionProbability Theory Copula Further readingProbability Theory CopulaAbe SklarCopula (linguistics)Cumulative distribution functionDependent and independent variablesLinguisticsMarginal probabilityPortfolio optimizationProbability theoryQuantitative financeRandom variableStatisticsUniform distribution (continuous)

🔥 Trending searches on Wiki English:

North AmericaSeychellesMarlon WayansJava (programming language)Raj BabbarSarah DesjardinsKabzaa (2023 film)Brad PittMark Twain Prize for American HumorMcDonald'sCzech RepublicSarah, Duchess of YorkYou (season 4)Kamala HarrisDick Van DykeTornado outbreak of March 24–26, 2023Babylon (2022 film)NetherlandsN. T. Rama Rao Jr.MalaysiaBrie LarsonEd SheeranFlorida Atlantic UniversityDaniel RadcliffeLouis XVITottenham Hotspur F.C.AlbaniaElizabeth I2023 Women's Premier League (cricket)Cocaine Bear (bear)Florence PughFinland–NATO relationsList of prime ministers of IndiaBlac ChynaKnock at the CabinPakistanDepeche ModeJonah HillNetflixVal Kilmer2023 Cricket World CupTom HollandRyan PhillippeFC Bayern MunichMuhammadBrownie (folklore)Shazam! (film)CaliforniaBrooke ShieldsRyan GoslingThe Rookie (TV series)Jackie ChanLux PascalThe Guy GameBill GatesRoman EmpireMeryl StreepJared FogleXXXX (beer)Zlatan IbrahimovićBruce LeeFacebookBrock LesnarBruce WillisBaphometRon DeSantisHeath LedgerRonald ReaganShohei OhtaniAmy Jo JohnsonSteve BuscemiPuyiSwitzerland🡆 More