DSpace at EWHA: Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies

Browse

My Repository

DSpace at EWHA자연과학대학 통계학전공 Journal papers

View : 590 Download: 0

Full metadata record

DC Field	Value	Language
dc.contributor.author	이동환	*
dc.date.accessioned	2016-08-27T04:08:58Z	-
dc.date.available	2016-08-27T04:08:58Z	-
dc.date.issued	2015	*
dc.identifier.issn	1467-5463	*
dc.identifier.issn	1477-4054	*
dc.identifier.other	OAK-15468	*
dc.identifier.uri	https://dspace.ewha.ac.kr/handle/2015.oak/217503	-
dc.description.abstract	It is common and advised practice in biomedical research to validate experimental or observational findings in a population different from the one where the findings were initially assessed. This practice increases the generalizability of the results and decreases the likelihood of reporting false-positive findings. Validation becomes critical when dealing with high-throughput experiments, where the large number of tests increases the chance to observe false-positive results. In this article, we review common approaches to determine statistical thresholds for validation and describe the factors influencing the proportion of significant findings from a 'training' sample that are replicated in a 'validation' sample. We refer to this proportion as rediscovery rate (RDR). In high-throughput studies, the RDR is a function of false-positive rate and power in both the training and validation samples. We illustrate the application of the RDR using simulated data and real data examples from metabolomics experiments. We further describe an online tool to calculate the RDR using t-statistics. We foresee two main applications. First, if the validation study has not yet been collected, the RDR can be used to decide the optimal combination between the proportion of findings taken to validation and the size of the validation study. Secondly, if a validation study has already been done, the RDR estimated using the training data can be compared with the observed RDR from the validation data; hence, the success of the validation study can be assessed.	*
dc.language	English	*
dc.publisher	OXFORD UNIV PRESS	*
dc.subject	statistical validation	*
dc.subject	rediscovery rate	*
dc.subject	false discovery rate	*
dc.subject	multiple testing	*
dc.subject	metabolomics	*
dc.title	Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies	*
dc.type	Article	*
dc.relation.issue	4	*
dc.relation.volume	16	*
dc.relation.index	SCIE	*
dc.relation.index	SCOPUS	*
dc.relation.startpage	563	*
dc.relation.lastpage	575	*
dc.relation.journaltitle	BRIEFINGS IN BIOINFORMATICS	*
dc.identifier.doi	10.1093/bib/bbu033	*
dc.identifier.wosid	WOS:000359083200002	*
dc.author.google	Ganna, Andrea	*
dc.author.google	Lee, Donghwan	*
dc.author.google	Ingelsson, Erik	*
dc.author.google	Pawitan, Yudi	*
dc.contributor.scopusid	이동환(56434427300;58539708000)	*
dc.date.modifydate	20231123114357	*