DSpace at EWHA: 로지스틱과 생존분석을 이용한 생명보험 해약 예측모형연구

Browse

My Repository

DSpace at EWHA일반대학원 통계학과 Theses_Master

View : 1069 Download: 0

Full metadata record

DC Field	Value	Language
dc.contributor.author	김수나	-
dc.creator	김수나	-
dc.date.accessioned	2016-08-25T04:08:49Z	-
dc.date.available	2016-08-25T04:08:49Z	-
dc.date.issued	2008	-
dc.identifier.other	OAK-000000050031	-
dc.identifier.uri	https://dspace.ewha.ac.kr/handle/2015.oak/177294	-
dc.identifier.uri	http://dcollection.ewha.ac.kr/jsp/common/DcLoOrgPer.jsp?sItemId=000000050031	-
dc.description.abstract	This paper studies the problem of predicting cancellation of an insurance contract based upon various predictor variables including the information of policyholder, the information about the insurance contract and a new variable called “period of upkeep (t) after the insurance contract." Recently, there have been many researches on cancellation prediction problem based on the methods of survival analysis such as Cox PHM (Proportional Hazard Model) as an alternative to the more conventional tool based on Logistic Regression. These studies are motivated by the limitations of the Logistic Regression on the lack of time concept suggesting dichotomized solutions. In this paper, therefore, a simple method to obtain results similar to PHM is described and ways to fix the shortcomings of the Logistic Regression are explored by adding the new variable called "period of upkeep after the insurance contract" into our study. By analyzing the insurance data of the ten thousand policyholder, we show that "period of upkeep of the insurance contract " is adopted as a significant variable, and both Logistic Regression with time covariates and PHM give similar prediction results.;우리나라의 생명보험 시장은 경제성장과 국민소득의 증가, 타 금융권과는 차별화되는 공격적인 마케팅을 통하여 비약적인 성장을 해왔다. 이러한 외형적인 성장은 과거 생명보험회사들의 양적 경영형태를 지향하는 과정에서 모집인의 대량도입, 대량 탈락, 생명보험 가입자들의 높은 실효·해약률 문제를 발생시켰다. 그 중에서도 높은 실효·해약률은 최근 금리의 하양안정화, 금융 자율화와 개방에 따른 경쟁의 격화로 수익성이 불투명해진 생명보험 산업에서 수익성 제고를 통한 안정적인 경영기반 확보를 위해 해결해야만 하는 과제로 떠올랐다. 때문에 본 연구에서는 생명보험 가입자와 해지자의 신상데이터 및 보험 상품관련 데이터를 이용하여 보험의 해지에 영향을 미치는 주요 변수를 찾아내고 그 변수들을 이용하여 생명보험 해약 시기를 예측함으로써 가입자의 생명보험해약을 사전에 방지할 수 있는 기회를 만들고자 생명보험 가입자의 해약 예측 모형을 연구하게 되었다. 분석 자료는 보험사의 데이터로 2001년 6월의 데이터를 모형개발용 데이터로 사용하였고, 2001년 9월의 데이터를 모형평가용 데이터로 사용한다. 모형개발용 데이터 셋과 모델평가용 데이터 셋은 각각 동일 구조로 구성되어 있으며, 변수명 중 Target_yn은 종속변수이고 C1_01~C1_35까지의 변수는 독립변수이다. 또한 각각 10,000개의 데이터로 이루어져 있다. 본 연구에서는 실증분석으로 우선 시간에 대한 변수를 생명보험 해약 예측모형에 넣었을 경우와 넣지 않았을 경우의 로지스틱 결과를 비교하여 시간 변수가 해약 예측을 하는데 있어서 미치는 영향을 알아보고, 둘째로는 보험 계약이 해약되기까지의 기간이 시간과 관련된 변수이며 데이터의 형태가 중도 절단형태이기 때문에 생존분석을 이용한 예측 모형을 만들었다. 여기서 사용된 방법은 독립변수가 시간에 의존하는 경우의 Cox-비례위험모형이다. 마지막으로 위의 첫째 결과와 둘째 결과. 즉, 로지스틱 모형과 Cox의 비례모형의 예측력 결과를 비교하여 보았다. 리프트 차트 (Lift Chart)를 이용하여 본 분석 결과 실증분석을 통해서는 시간변수 t를 포함한 로지스틱 분석과 시간변수 t를 제외한 로지스틱 분석, Cox의 비례위험모형을 모델링 해본 결과 그 추정계수가 비슷하고 그 예측력에 있어서도 시간변수 t를 포함한 로지스틱 모형과 Cox의 비례위험 모형이 거의 비슷함을 알 수 있었다. 따라서 ‘생명보험 가입 후 유지기간(t)'이라는 새로운 변수를 추가하여 로지스틱 분석을 실행한다면 Cox의 비례위험모형과 동일한 효과를 낼 수 있다는 결과를 얻게 되었다.	-
dc.description.tableofcontents	Ⅰ. 서론 = 1 A. 연구의 배경 및 목적 = 1 B. 연구의 방법과 범위 = 2 Ⅱ. 생명 보험 해약 예측 모형의 이론적 배경 = 3 A. 선행 연구 = 3 B. 로지스틱 회귀분석(Logistic Regression) = 3 C. 생존분석-Cox의 비례위험모형(Survival Analysis) = 8 D. 로지스틱 회귀분석의 시간변수 추가 = 11 Ⅲ. 실증분석 = 15 A. 자료의 구성 = 15 B. 종속변수와 독립(설명)변수 = 16 C. 분석 결과 = 19 1. 로지스틱 회귀분석 = 19 가. 시간에 관한 변수를 넣은 경우의 로지스틱 = 19 나. 시간변수를 제외한 경우의 로지스틱 = 21 2. Cox의 비례위험 모형 = 24 D. 모형평가-리프트차트(Lift Chart) = 25 Ⅳ. 요약 및 결론 = 31 참고문헌 = 33 Abstract = 35	-
dc.format	application/pdf	-
dc.format.extent	727312 bytes	-
dc.language	kor	-
dc.publisher	이화여자대학교 대학원	-
dc.title	로지스틱과 생존분석을 이용한 생명보험 해약 예측모형연구	-
dc.type	Master's Thesis	-
dc.format.page	vi, 35 p.	-
dc.identifier.thesisdegree	Master	-
dc.identifier.major	대학원 통계학과	-
dc.date.awarded	2008. 8	-