DSpace at EWHA: Permutation Dimension Test of Fused Sliced Inverse Regression

Browse

My Repository

DSpace at EWHA일반대학원 통계학과 Theses_Master

View : 598 Download: 0

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	유재근	-
dc.contributor.author	조유나	-
dc.creator	조유나	-
dc.date.accessioned	2018-03-06T16:30:40Z	-
dc.date.available	2018-03-06T16:30:40Z	-
dc.date.issued	2018	-
dc.identifier.other	OAK-000000148249	-
dc.identifier.uri	http://dcollection.ewha.ac.kr/common/orgView/000000148249	en_US
dc.identifier.uri	https://dspace.ewha.ac.kr/handle/2015.oak/240359	-
dc.description.abstract	Recently, as the importance of data becomes more important, the amount of data accumulated in real time is increasing exponentially. Therefore, there are high-dimensional data that contain many explanatory variables. Of the many explanatory variables included in the data, relatively few explanatory variables have significant meaning in explaining actual dependent variables. Accordingly, it is important these days to reduce the dimension of explanatory variables in terms of efficiency. As a result, methods of reducing the dimension of the explanatory variable are emerging while minimizing the loss of information. In this paper, we deal with Sufficient Dimension Reduction (SDR) which is a method to reduce the dimension without loss of information. Among the various SDR methodologies, we investigate SIR which aims to find the central subspace of the target space. Furthermore, Fused SIR, which complements the problem of the SIR method, is also dealt with. Finally, we compare the result of the reduction of the dimension by two methods through the simulation and explain the result.;최근 데이터의 중요성이 부각되면서, 실시간으로 축적되는 데이터의 양이 기하급수적으로 늘어나고 있다. 이에 따라, 기존에 비해 보다 방대해진 데이터를 다루는 방법들을 필요로 하며, 그 예로 많은 설명변수를 포함하고 있는 고차원 데이터가 존재한다. 데이터에 포함된 많은 설명변수 중 실제 중요한 의미를 가지는 설명변수는 상대적으로 적을 뿐더러, 설명변수가 많은 경우 설명변수들 간의 다중공선성이 발생하는 등의 문제가 있으므로, 설명변수의 차원을 축소하는 것이 오늘날 중요한 화두가 되고 있다. 차원축소를 위한 방법론에는 Regularized, Principal Component Analysis, Partial Least Squares, Canonical Correlation Analysis 등이 이전부터 사용되어 왔다. 본 논문에서는 정보의 손실 없이 차원을 축소하는 Sufficient Dimension Reduction (SDR)을 다루었다. 그 중 target space에 대한 central subspace 를 찾는 Sliced Inverse Regression (SIR)가 많이 사용된다. 반면, SIR 방법의 경우 slice의 개수가 달라짐에 따른 결과의 변동이 작지 않고, quadratic data에서 차원을 잘 추정하지 못한다는 단점이 있다. 따라서, 본 논문에서는 이러한 SIR의 문제를 보완한 Fused SIR 방법에 대해 알아보고, 모의실험을 통해 SIR와의 차이를 알아보았다. 설명변수와 오차항이 각각 정규분포와 t분포를 따르는 여러 모형을 대상으로 하였으며, 대부분의 경우에서 Fused SIR 방법론이 SIR 방법론보다 차원을 더 안정적으로 추정하는 것을 확인하였다.	-
dc.description.tableofcontents	Ⅰ. Introduction 1 Ⅱ. Algorithm 3 A. Review of Sliced Inverse Regression(SIR) 3 B. Fused Sliced Inverse Regression(Fused SIR) 5 Ⅲ. Permutation Dimension Test 6 Ⅳ. Simulation Study 8 A. Model construction 8 B. Simulation results 10 Ⅴ. Discussion 20 1.References 22 2.Appendix 23 ABSTRACT(in Korean) 27	-
dc.format	application/pdf	-
dc.format.extent	1798748 bytes	-
dc.language	eng	-
dc.publisher	이화여자대학교 대학원	-
dc.subject.ddc	500	-
dc.title	Permutation Dimension Test of Fused Sliced Inverse Regression	-
dc.type	Master's Thesis	-
dc.format.page	iv, 28 p.	-
dc.identifier.thesisdegree	Master	-
dc.identifier.major	대학원 통계학과	-
dc.date.awarded	2018.2	-