원문정보
초록
영어
In recent decades, machine learning (ML) algorithms has gained wide popularity in the finance literature. The goal of this research is to exploit machine learning techniques in order to analyze the effect of exchange-traded fund (ETF) illiquidity on tracking errors. We demonstrate the superior performance of the machine learning models – Random Forest and Gradient Boosting Decision Tree, in particular - over traditional linear models in predicting U.S. ETF’s tracking errors. Moreover, our variable importance analysis suggests that the features such as underlying assets based on U.S. assets (Invested in US Asset) and expense ratio (Expense Ratio), are two key factors in the determination of predicting the tracking errors on the ETF illiquidity. Finally, we further conduct SHAP (Shapley Additive exPlanations) technique in order to observe the impact of a particular variable(feature) on the difference between the considered- and average predictions of our machine learning models. Our results indicate that the most relevant variable is Invested in US Asset, which is in align with the previous importance analysis.
목차
I. Introduction
II. Literature review
2-1. The Effect of Illiquidity of ETFs on Tracking Error
2-2. The Machine Learning approach to financial market (ETF)
III. Data and Methodology
IV. Empirical analysis
4-1. The main analysis
4-2. Shapley value
V. Discussion
VI. Conclusion
References
Appendix
