• Sonuç bulunamadı

Selecting the suitable Copula function when only values of distribution functions are avaliable

N/A
N/A
Protected

Academic year: 2021

Share "Selecting the suitable Copula function when only values of distribution functions are avaliable"

Copied!
7
0
0

Yükleniyor.... (view fulltext now)

Tam metin

(1)

Selçuk J. Appl. Math. Selçuk Journal of Vol. 13. No. 2. pp. 35-41, 2012 Applied Mathematics

Selecting the Suitable Copula Function When Only Values of Distribution Functions are Avaliable

Çi¼gdem Topçu, Fahrettin Arslan

Ankara University, Faculty of Science, Statistics Department, 06100, Tando¼gan, Ankara,Turkiye

e-mail:ctopcu@science.ankara.edu.tr, arslan@science.ankara.edu.tr Received Date: April 27, 2012

Accepted Date: December 1, 2012

Abstract. Copula functions is widely known tool to model dependence struc-ture amaong multivariate random variables. In this paper, a special class of copula is called Archimedean is considered. Using bivariate Archimedean copula functions, specifying the dependence structure for modelling bivariate distribu-tion is investigated when only the values of distribudistribu-tion funcdistribu-tions are avaliable. Key words: Copula; Archimedean Copula; Kendall’s Tau; Dependency. 2000 Mathematics Subject Classi…cation: 62G30

1. Introduction

Copulas may said to be …rst appeared in the probability metric literature. How-ever, using copula method has a short history in the statistics literature. Most of the statistical applications that has been used copula method have arisen in the last ten years. In the light of recent developments of copula literaure, several appealing properties of copulas are now widely known.

Copulas have been considered as a tool for modelling dependence in several applied …elds such as …nance, insurance, (bio)statistical, econometric, demeo-graphic and actuarial studies. For further details on copulas, we refer the inter-ested reader to [3,4,6,9,12,14,15].

If we assume that ,u = (u1; :::; um)0; uj 2 [0; 1] an m copula C(u) is de…ned as a

multivariate cumulative distribution function witih uniform margins. A proba-bilistic way to de…ne the copula is provided by the theorem of Sklar [11,13]. He obtained the results and introducing the notion, and the name, of a copula, and proving the theorem that is known Sklar’s theorem.This cumulative distribution function is termed di¤erent names in various studies, including ‘copulas’ [11], ‘dependence funciton’[2], and ‘uniform representation’[5]. In this study, these names will be used, interchangeably.

(2)

Since the Archimedean representation allows us to reduce the study of a mul-tivariate copula to a single univariate function, we focus on some bivariate Archimedean copulas: Gumbel, Clayton and Frank.

This paper is organized as follows: In section 2, copula methodology is intro-duced in terms of probability distribution functions. Section 3 consists of, a special class of copulas that is called Archimedean and their properties. Then we consider the nonparametric method that has been suggested in Genest and Rivest’s paper to choose the appropriate Archimedean copula function for data, under the assumption of the data can be suitably modelled by one of Arhimedean copulas.

In section 4, we illustrated the nonparametric method when only values of mar-ginal distribution functions are avaliable.

2. Copula Methodology

For m uniform random U1; U2; :::; Umvariables the joint distribution funciton C

is de…ned as

(2.1) C(u1; u2; :::; um; ) = P r(U1 u1; U2 u2; :::; Um um)

Here, is the dependence parameter.

A probabilistic way to de…ne the copula is provided by the theorem of Sklar [11,13]. Let X1; X2; :::; Xm be random variables with the continuous marginal

distributions F1(x1); F2(x2); :::; Fm(xm) and the corresponding joint

distribu-tion funcdistribu-tion is

H(x1; x2; :::; xm). Sklar theorem states that if H is a m -dimensional

distrib-ution function then there exists a unique m-copula C such that (2.2) H(x1; x2; :::; xm) = C(F1(x1); F2(x2); :::; Fm(xm))

The converse is also ture [11,13]. If C is an m -copula and F1(x1); F2(x2); :::;

Fm(xm) are continuous then H(x1; x2; :::; xm) is an -dimensional distribution

function with continuous marginals F1(x1); F2(x2); :::; Fm(xm) such that

(2.3) C(F1(x1); F2(x2); :::; Fm(xm)) = H(x1; x2; :::; xm)

Sklar [13] also showed that if the marginal distributions are continuous, then there is a unique copula representation. An important property is that the copula allows the dependence structure independently from the marginal dis-tributions. Hence, emprical marginals can be used for estimating their copula functions.

2.1. The Bivariate Archimedean Copula

Let H(x; y) be a bivariate distribution function with marginals F (x) and G(y) . If H(x; y) can be expressed in the form of

(3)

H(x; y) is said to be generated by an Archimedean copula, for some convex, decreasing function that has the properties below.

denote a funciton : [0; 1] ! [0; 1]which is continuous and satis…es: i. (1) = 0

ii. (0) = 1

iii. is decreasing, i.e 8t 2 (0; 1) ; 0(t) < 0 iv. is convex, i.e 8t 2 (0; 1) ; 00(t) 0

The function has an inverse 1: [0; 1] ! [0; 1] which has the same properties except that 1(0) = 1 ve 1(1) = 0

Table 1 gives some examples of bivariate Archimedean copulas and correspond-ing generator functions

Family Generator (t) Dependence Parameter Space Bivariate Copula C(u1; u2) Gumbel ( ln t) 1 exp [ [( ln u1) +( ln u2) ]1= ]

Clayton t 1 > 1 ((u1) +(u2) 1) 1=

Frank lnee t 11 1 < < 1 ( 1) ln f(1 e ) (1(1ee u1))(1 e u2)g Ta b le 1 . B iva ria te A rch im e d e a n c o p u la s a n d c o rre s p o n d in g g e n e ra to r fu n c t io n s .

There is a relationship between parameters of some Archimedean copulas and Kendall tau or Spearman rho. In this study only the relationship including Kendall’s Tau is given by Table 2. However, there is no unique multivariate dependence measure and this makes multivariate copulas less appealing.

Family Range of

Gumbel 2 [1; 1) 1

Clayton 2 [0; 1) +2

Frank 2 ( 1; 1) 1 4[1 D1( )]

Table 2.T he R elationship Between C opula Param eters and K endall0sTau

D1( ) is the Debye function and de…ned as Dn(x) = xnn

x

Z

0 tn

et 1dt; n > 0:

3. The Nonparametric Method

Genest and Rivest [1] suggested a nonparametric method for estimating the Archimedean copulas of a pair of random variables. The problem of specifying a probability model for independent observations (x1; y1); :::; (xn; yn) from

bi-variate non-Gaussian distribution function H(X; Y ) with continuous marginal distributions F (x)and G(y) is considered. Archimedean copulas can be charac-terized by behaviour of the random variable H(X; Y ) = T , [1]. Because of this fact, obtaning the estimation of (3.1) ) will be usefull. As seen from (3.1),K(t) is the distribution of H(X; Y ) = T .

(4)

Genest and Rivest [1] also proved that if the dependence function C is Archimedean then the estimation of an Archimedean copula is uniquely determined by a func-tion (3.2) de…ned on the interval (0; 1):

(3.2) K(t) = t 0(t)

(t)

Since we do not have T0s pseudo observations are used to obtain nonparametric

estimate of K(t) say, Kn(t) (3.3) Kn(t) = n X j=1 I[fTj tg]=(n + 1)

As seen the equations, Kn(t) is the emprical estimation of K(t).

The estimation of K can be done in two steps. The …rst step consists of con-structing the emprical bivariate distribution Hn(X; Y ). The second step consists

of computing Hn(xi; yi) for i = 1; 2; :::; n and using those pseudo-observations

to construct a one-dimensional emprical distribution function for K.

Family (t) 0(t) K(t) = t 0(t)(t) Gumbel ( ln t) ( ln t) 1 1 t t (t ln t) Clayton t 1 t 1 t (t +1 t) Frank lne t 1 e 1 1 et t lne t 1 e 1(e t 1)

Table 3. D istribution Functions of A rchim edean C opulas

1. Estimate Kendall’s correlation coe¢ cient using the nonparametric esti-mate of copula parameters.

2. Follow the three steps below to obtain nonparametric estimate of K(t) , say Kn(t) i. Ti = Fn(Xi; Yi) = n X j=1 I[ffXj Xi&Yj Yig]=(n + 1); i = 1; 2; : : : ; n ii. Kn(t) = #(Ti t) n + 1 = number of Ti t n + 1

iii. Construct a parametric of K(t) that is Kn(t) using therelationship (3.2)

K(t) = t 0(t) (t)

3. The selection of an Archimedean copula that …ts the data beter can be done by minimizing a distance such as

(3.4)

Z

K n(t) Kn(t) 2

(5)

4. Application

On the previous sections, the method that has been suggested by Genest and Rivest [1] is introduced. They have suggested the method to select the suitable copula function. In this study we use this method for same purpose when only the values of any continuous distribution function are avaliable instead of data itself. In other words, we assumued that data are collected as probabilities of any case such as a failure/ survival probabilities.

Under the assumption of the data may be observed for any study, we generated dependent bivariate data (n = 200 ) using Gumbel copula wtih positive and high correlation ( = 0:75 ). Ignoring the information of correlation and the data generated by using Gumbel copula, aim of this study is to choose the suitable copula function that provide best …t. An example of the generated data is given by Table(4). X Y 1 0.2280 0.3498 2 0.1615 0.2749 3 0.9508 0.9598 4 0.7921 0.8273 .. . ... ... 198 0.0269 0.0953 199 0.6716 0.6129 200 0.9315 0.9281

Table 4.Exam ple of dataset

By following …rst step of the procedure that introduced in section 3.1, we calcu-lated Kendall’s Tau coe…cient as 0.7472 for (X; Y ). Using the relationship that is shown in Table2, copula parametres are estimated and results are given by Table 5

Family Gumbel Clayton Frank

b 3.9563 5.9125 14.1385

Table 5. N onparam etric Estim ation of A rchim edean C opulaParam eters

By using pseudo-observations, Kn(t), the nonparametric estimatimates of K(t)is

calculated. The equation (3.2) and nonparametric estimates of parameters that are shown in Table 5 are used for the parametric estimates of copula functons. Finally, for selecting the suitable copula Kn(t), is compared to KGumbel(t),

KClayton(t)and KF rank(t) using the square distance measure (3.4). This

dis-tance measure (3.4) is calculated 0:00144 for Gumbel copula and 0:03686 , 0:00997 for Clayton and Frank copula, respectively. According to this meausure (3.4) Gumbel copula is selected. Calculations are summarized in Table 6.

(6)

t Cum. Kn(t) KGbl(t) KCl(t) KF rk(t) Gumbel Clayton F rank 0:00001 1 0:00498 0:00004 0:00001 0:00010 0:00002 0:00002 0:00002 0:05001 20 0:09950 0:08788 0:05847 0:09941 0:00014 0:00168 0:00000 0:10001 34 0:16915 0:15821 0:11692 0:16134 0:00012 0:00273 0:00006 0:15001 47 0:23383 0:22194 0:17538 0:21632 0:00014 0:00342 0:00031 0:20001 58 0:28856 0:28137 0:23384 0:26860 0:00005 0:00299 0:00040 0:25001 71 0:35323 0:33761 0:29228 0:31970 0:00024 0:00371 0:00112 0:30001 80 0:39801 0:39131 0:35071 0:37022 0:00004 0:00224 0:00077 0:35001 90 0:44776 0:44289 0:40909 0:42048 0:00002 0:00150 0:00074 0:40001 98 0:48756 0:49265 0:46736 0:47060 0:00003 0:00041 0:00029 0:45001 111 0:55224 0:54083 0:52544 0:52065 0:00013 0:00072 0:00100 0:50001 120 0:59701 0:58761 0:58317 0:57065 0:00009 0:00019 0:00070 0:55001 127 0:63184 0:63312 0:64032 0:62060 0:00000 0:00007 0:00013 0:60001 136 0:67662 0:67748 0:69654 0:67048 0:00000 0:00040 0:00004 0:65001 142 0:70647 0:72078 0:75134 0:72023 0:00020 0:00201 0:00019 0:70001 154 0:76617 0:76312 0:80403 0:76972 0:00001 0:00143 0:00001 0:75001 162 0:80597 0:80454 0:85371 0:81867 0:00000 0:00228 0:00016 0:80001 169 0:84080 0:84513 0:89915 0:86655 0:00002 0:00340 0:00066 0:85001 177 0:88060 0:88492 0:93877 0:91225 0:00002 0:00338 0:00100 0:90001 185 0:92040 0:92398 0:97058 0:95353 0:00001 0:00252 0:00110 0:95001 191 0:95025 0:96232 0:99204 0:98585 0:00015 0:00175 0:00127 1:00000 200 1:00000 1:00000 1:00000 1:00000 0:00144 0:03686 0:00997

Table 6.T he N onparam etric Param etric Estim atesof Kn(t) and SquareD istance

5. Conclusion

In this sudy, the copula term in statistics literature and a special class of copula that is called Archimeadan copula family are introduced. Because of appealing mathematical properties, Archimedean copulas are preferred in many studies such as biostatistical, econometrcial or studies with concerning mortatlity. In such studies, the data can be obtained as failure/ survival probabillites and researchers may be interested in modelling the dependency between two vari-ables. When modelling the dependecy, using copula is a common way. Genest and Rivest has been suggested the nonparametric method to select the suitable Archimedean copula.

In this study, under the assumption of the data can be modelled by Archimedean copula function, this method is applied to generated data by using these prob-abilities to select the appropriate copula function. We restrict this study in bivarite case, multivariate case is still an open problem. For proceeding studies, we are planning to apply the method to real data sets with di¤erent copula familes.

References

1.Genest C. ,Rivest L.P. (1993): Statistical inference procedures for bivariate archimedean copulas. Journal of The American Statistical Association Theory and Methods (88), No: 423

2.Deheuvels, P. (1978): Caracterisation complete des lois extremes multivariees et de la convergence des types extremes. Publ. Inst. Statist. Univ. Paris 23, 1-37.

3. Frees W.E., Valdez A.E. (1997): Understanding relationships using copulas. 32nd. Actuarial Research Conference, 6-8 August at University of Calgary, Albert ,Canada. 4. Joe H (1997). Multivariate Models and Dependence Concepts. (Chapman & Hall Ltd.).

(7)

5. Kimeldorf, G., Sampson, A. (1975b): Uniform representations of bivariate distrib-utions. Communications in Statistics, 4, 617–627 (1975b)

6.Kolev, N., dos Anjos, U., Mendes, B. (2006): Copulas: a review and recent develop-ments. Stoch.Models 22 (4), 617–660

7. Melchiori R. M. (2003):Which archimedean aopula is the right one? YieldCurve.com (e-Journal)

8. Naifar N. (2011). Modelling dependence structure with archimedean copulas and applications iTraxx CDS index. Journal of Computational and Applied Mathematics, (235); 2459-2466.

9. Nelsen, R., (1999): An Introduction to Copulas.( Springer, NewYork).

10.Oakes, D. (1982). A model for association in bivariate survival data. Journal of The Royal Statistical Society. Ser B. (44), 414-422.

11. Schweizer, B., Sklar, A. (1983): Probabilistic Metric Spaces. (North-Holland, New York.)

12. Schweizer, B. (1991): Thirty years of copulas, pp.13-50 in: G. Dall’Aglio, S.Katz and G. Salinetti, eds. Advances in probability distributions with given marginals Math. Appl., vol. 67,pp. 13–50. Kluwer Acad. Publ., Dordrecht

13. Sklar, A. (1959). Fonctions de repartition a n dimensions et leurs marges., Publi-cationsdel’Institut de Statistique de lUniversite de Paris,8, 229-231.

14. Sklar, A. (1996): Random variables, distribution functions and copulas-A personal look backward and foeward- in L.Rüschendorf, B. Schweizer and M.D.Taylor (Eds.), Distributions with Fixed Marginals and Related Topics, Institute of Mathematical Statistics, Hayward, CA.

15. Trivedi, P.K., Zimmer, D.M. (2005): Copula Modeling: An Introduction for Practitioners, vol.1.

Şekil

Table 1 gives some examples of bivariate Archimedean copulas and correspond- correspond-ing generator functions
Table 3. D istribution Functions of A rchim edean C opulas
Table 4. Exam ple of dataset
Table 6. T he N onparam etric Param etric Estim atesof K n (t) and SquareD istance

Referanslar

Benzer Belgeler

Thermocouples are a widely used type of temperature sensor for measurement and control and can also be used to convert a temperature gradient into electricity.. Commercial

Quantitative results are obtained using devices or instruments that allow us to determine the concentration of a chemical in a sample from an observable signal.. There

Linear dependence causes rank- deficiency, in which case we should identify the linear dependence relation, use it to express some regressors (or columns of the design matrix) in

Svetosavlje views the Serbian church not only as a link with medieval statehood, as does secular nationalism, but as a spiritual force that rises above history and society --

Like many other instances of nation building, Turkish nation building was a violent process. However, accounts of it usually focus on its constructive side or

Overall, the results on political factors support the hypothesis that political constraints (parliamentary democracies and systems with a large number of veto players) in

The aim of this study is to provide developing students’ awareness of mathematics in our lives, helping to connect with science and daily life, realizing

The ai m of this study was to investigate the atrial fibrillation occ ur- rence in patients unde rgoing coronary artery bypass gr afting (CABG) surgery and