DSpace at EWHA: ANOVA를 이용한 cDNA 마이크로어레이 실험 데이터의 분석과 그에 따른 등분산성 가정에 관한 연구

Browse

My Repository

DSpace at EWHA일반대학원 통계학과 Theses_Master

View : 588 Download: 0

ANOVA를 이용한 cDNA 마이크로어레이 실험 데이터의 분석과 그에 따른 등분산성 가정에 관한 연구

Title: ANOVA를 이용한 cDNA 마이크로어레이 실험 데이터의 분석과 그에 따른 등분산성 가정에 관한 연구

Authors: 황수민

Issue Date: 2003

Department/Major: 대학원 통계학과

Publisher: 이화여자대학교 대학원

Degree: Master

Abstract: cDNA microarray 실험 데이터는 계통적인 분산을 보정하기 위해서 정규화 과정을 거치게 된다. 그러나 정규화 과정에서 발현된 값이 민감히 반응하여 손상될 수 있으므로 구하고자 하는 두 환경에서의 유전자 발현 차이에 영향을 미칠 수 있는 문제점을 가진다. 이러한 문제점의 대안으로 외부 효과를 자동적으로 보정해줄 수 있는 ANOVA 방법이 제안되었다. 본 논문은 데이터의 계통적인 분산을 보정하고 모수를 추정하는 방법으로 ANOVA를 사용하였고, 그 결과를 바탕으로 ANOVA 접근 시 필요한 기본 가정에 대한 검토와 ANOVA 모형의 유의성을 논의할 수 있었다. 본 논문은 유전자 발현 차이에 대한 ANOVA 모형의 실증 분석의 데이터로 HDL( high density lipoprotein ) 콜레스테롤 수치가 낮은 쥐에게서 얻은 microarray를 적용시켰다. ANOVA의 결과는 본 연구에 관심 대상인 variety gene 효과 뿐 아니라 그 외 다른 효과에 대해서도 유의한 결과를 보여주었다. 그러나 ANOVA를 할 때 전제가 되는 가정을 몇 가지 검토해본 결과 오차와 특정 요인들에 대한 등분산성이 위배되어 그 가정이 적합하지 않는 것을 볼 수 있었다. 따라서 기존의 분석에서 제안된 ANOVA 모형으로 유전자 발현을 설명하는 것은 어려움이 있다고 판단하였고, 이것을 설명할 수 있는 모형에 대한 연구가 필요함을 확인할 수 있었다. ;cDNA microarray experimental data should be normalized to correct systematic variations. However normalized values are so sensitive that they can be easily harmed and this causes a problem which affects gene expression in two different circumstances. In order to remedy this problem, an ANOVA method which can correct external effect automatically is proposed as an alternative method for the usual normalizations. In this paper, an ANOVA method is adopted to correct systematic variations and estimate parameters of gene expression data. Based on these results obtained from method, it is possible to examine the basic assumptions for valid ANOVA method and to discuss the advantages of this method. It is applied to microarray data which are produced from a HDL cholesterol lowered rat to investigate difference of gene expression. The result of the ANOVA method is not only significant in studying variety gene effect which is concerned mainly here but also in investigating other effects. Through the examination of a few hypotheses for ANOVA method, several nonstandard properties are encountered such as heteroscedasticity in an average error and in certain factors. From these results, that seems to be difficult to explain gene expression with ANOVA method. It is clear that further a research is required for a more appropriate model for the gene expression data which is able to explain this nonstandard properties.