Clustering financial data using Copula-GARCH model in an application for main market stock returns

Beata Basiura; Anna Czapkiewicz

doi:https://doi.org/10.59170/stattrans-2010-002

Clustering financial data using Copula-GARCH model in an application for main market stock returns

Beata Basiura Faculty of Management, AGH University of Science and Technology, Krakow, Poland , Anna Czapkiewicz Faculty of Management, AGH University of Science and Technology, Krakow, Poland Statistics in Transition new series, vol. 11, 2010, 1, pages: 25-45 Published online: 1 July 2010 https://doi.org/10.59170/stattrans-2010-002

308 Views 11 Downloads

ARTICLE

(English) PDF

ABSTRACT

There are many statistical techniques that allow us to find similarities among variables. Cluster analysis discovers structure within sets of data. The choice of a relevant metric is a fundamental problem in the case of clustering financial data. In this paper, the Copula–GARCH model is used to obtain the dependency parameter between time series. The dissimilarity measure based on the maximum likelihood parameter obtained from the Normal or t-Student copula is proposed and applied to classify forty two indices from American, European, and Asian stock markets.

KEYWORDS

Clustering stock indices, dependence parameter, Copula–GARCH model, Copula function, Skewed distribution

REFERENCES

ARELLANO-VALLE, R., and GÓMEZ, H., and QUINTANA, F., 2004, A New Class of Skew-Normal Distributions, Communications in Statistics, Series A, 33(7), 1465–1480.

BOLLERSLEV T., 1986, Generalized Autoregressive Conditional Heteroskedasticity, Journal of Econometrics, 31, 307–327.

BOLLERSLEV T., WOOLDRIDGE J.M., 1992, Quasi-Maximum Likelihood Estimation and Inference in Dynamic Models with Time-Varying Covariance’s, Econometric Reviews, 11, 143–172.

BREYMANN W., DIAS A., EMBRECHTS P., 2003, Dependence Structures for Multivariate High-Frequency Data in Finance, Quantitative Finance, 3, 1–14.

CAIADO J., CRANTO N., PENA D., 2006, A periodogram-based metric for time series classification, Computation Statistic & Data Analysis v50 i10, 2668– 2684.

CANELA M.A., COLLAZO E. P., 2005, Modeling Dependence in Latin American Markets Using Copula Function, Working Paper.

CHEN X., FAN Y., PATTON, A.J., 2004, Simple Tests for Models of Dependence Between Multiple Financial Time Series, with Applications to U.S. Equity Returns and Exchange Rates, FMG Discussion Papers, Financial Markets Group.

DIEBOLD F.X., GUNTHER T.A., TAY A.S., 1998, Evaluating Density Forecasts with Applications to Financial Risk Management, International Economic Review, 39(4), 863–883.

EMBRECHTS P., LINDSKOG F., MCNEIL A., 2001a, Modeling Dependence with Copulas and Applications to Risk Management, ETHZ, Working Paper.

EMBREECHT P., MCNEIL A.J., STRAUMANN D., 2001b, Correlation and dependency in risk management: properties and pitfalls. [In:] M. Dempster, H. Moffant, Risk Management, Cambridge University Press , New York, 176–223.

EMBRECHTS P., LINDSKOG F., MCNEIL A., 2003, Modeling Dependence with Copulas and Applications to Risk Management, In: Rachev, S.T. (Ed.), Handbook of Heavy Tailed Distributions in Finance, Elsevier/Noth-Holland, Amsterdam.

FERNANDEZ C., STEEL M., 1998, On Beyesian Modeling of Fat Tails and Skewness, Journal of the American Statistical Association, 93, 359–371.

EMBRECHTS P., LINDSKOG F., MCNEIL A., 2001a, Modeling Dependence with Copulas and Applications to Risk Management, ETHZ, Working Paper.

EMBRECHTS P., LINDSKOG F., MCNEIL A., 2003, Modeling Dependence with Copulas and Applications to Risk Management, [In:] Rachev, S.T. (Ed.), Handbook of Heavy Tailed Distributions in Finance, Elsevier/Noth-Holland, Amsterdam.

FERNANDEZ C., STEEL M., 1998, On Beyesian Modeling of Fat Tails and Skewness, Journal of the American Statistical Association, 93, 359–371.

GENEST R., REMILLARD B., 2008, Validity of the parametric bootstrap for goodness-of-fit testing in semiparametric models, Annales de l'Institut Henri Poincare: Probabilites et Statistiques, 44, 1096–1127.

GENEST R., REMILLARD B., BEAUDOIN D., 2009, Goodness-of-fit tests for copulas: A review and a power study, Insurance: Mathematics and Economics, 44, 199–214.

HANSEN B., 1994, Autoregressive Conditional Density Estimation, International Economic Review, v.35, no. 3,705–730.

JOE H., XU J.J., 1996, The estimation method of inference function for margins for multivariate models, Technical Report, Departments of Statistics, University of British Columbia.

JUNKER M., MAY A., 2005, Measurement of aggregate risk with copulas, Econometrics Journal, Royal Economic Society, 8(3): 428–454, December.

KAUFMAN L., ROUSSEEUW P., 1990, Finding Groups in Data: An Introduction to Cluster Analysis, Wiley, New York.

KOJADINOVIC I., YAN J., 2009a, Fast large-sample goodness-of-fit tests for copulas, Submitted.

KOJADINOVIC I., YAN J., 2009b, A goodness-of-fit test for multivariate multiparameter copulas based on multiplier central limit theorems, Statistics and Computing,. In press.

KRZANOWSKI W., LAI Y., 1988, A Criterion For Determining the Number of Groups In Data Set Using Sum of Squares Clustering, Biometrics 44(1), 23–34.

LUI Y., LUGER R., 2009, Efficient estimation of copula-GARCH models 1, Computational Statistics & Data Analysis, 53, 2284–2297

MASHAL R., ZEEVI A., 2002, Beyond Correlation: Extreme co-movements Between Financial Assets, Mimeo, Columbia Graduate School of Business.

MIRKIN B., 2005, Clustering for Data Mining: A Data Recovery Approach, Boca Raton Fl., Chapman and Hall/CRC.

NELSEN B. R., 1999, An Introduction to Copulas, Springer Verlag, New York.

NELSON D.B., 1991, Conditional Heteroskedasticity in Asset Returns: A New Approach, Econometrica, 59(2), 397–370.

OTRANTO E., 2004, Classifying the Markets Volatility with ARMA Distance Measures, Quaderni di Statistica, 6,1–19.

PATTON A.J., 2003, Modeling Asymmetric Exchange Rate Dependence, Working paper, University of California, San Diego.

PATTON A.J., 2004. On the Out-of-Sample Importance of Skewness and Asymmetric Dependence for Asset Allocation, Journal of Financial Econometrics, Oxford University Press, 2(1), 130–168.

PATTON A.J., 2006, Estimation of multivariate models for time series of possibly different lengths, Journal of Applied Econometrics, John Wiley & Sons, Ltd., 21(2), 147–173.

Piccolo, 1990, A distance measure for classifying ARIMA models, Journal of Time Series Analysis v11, 153–164.

R Development Core Team, 2004, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, ISBN is 3-900051-07-0 URL .

ROCH O., ALEGRE A., 2006, Testing the bivariate distribution of daily equity returns using copulas. An application to the Spanish stock market, Computational Statistics& Data Analysis, 51, 1312–1329.

ROSENBLATT M., 1952, Remarks on a Multivariate Transformation, The Annals of Mathematical statistics, 23, 470–472.

SHIH J., LOUIS T.A., 1995, Inference on the Association Parameter in Copula Models for Bivariate Survival Data, Biometrics, 51: 1384–1399.

SKLAR A., 1959, Fonction de Repartition a n Dimension et Leur Marges, Publications de L’Institut de Statistiques de L’Universite de Paris, 8, 229– 231.

WARD J. H., 1963, Hierarchical Grouping to Optimize an Objective Function, Journal of the American Statistical Association, 58, 236–244.