View : 588 Download: 0

Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies

Title
Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies
Authors
Ganna, AndreaLee, DonghwanIngelsson, ErikPawitan, Yudi
Ewha Authors
이동환
SCOPUS Author ID
이동환scopusscopus
Issue Date
2015
Journal Title
BRIEFINGS IN BIOINFORMATICS
ISSN
1467-5463JCR Link

1477-4054JCR Link
Citation
BRIEFINGS IN BIOINFORMATICS vol. 16, no. 4, pp. 563 - 575
Keywords
statistical validationrediscovery ratefalse discovery ratemultiple testingmetabolomics
Publisher
OXFORD UNIV PRESS
Indexed
SCIE; SCOPUS WOS
Document Type
Article
Abstract
It is common and advised practice in biomedical research to validate experimental or observational findings in a population different from the one where the findings were initially assessed. This practice increases the generalizability of the results and decreases the likelihood of reporting false-positive findings. Validation becomes critical when dealing with high-throughput experiments, where the large number of tests increases the chance to observe false-positive results. In this article, we review common approaches to determine statistical thresholds for validation and describe the factors influencing the proportion of significant findings from a 'training' sample that are replicated in a 'validation' sample. We refer to this proportion as rediscovery rate (RDR). In high-throughput studies, the RDR is a function of false-positive rate and power in both the training and validation samples. We illustrate the application of the RDR using simulated data and real data examples from metabolomics experiments. We further describe an online tool to calculate the RDR using t-statistics. We foresee two main applications. First, if the validation study has not yet been collected, the RDR can be used to decide the optimal combination between the proportion of findings taken to validation and the size of the validation study. Secondly, if a validation study has already been done, the RDR estimated using the training data can be compared with the observed RDR from the validation data; hence, the success of the validation study can be assessed.
DOI
10.1093/bib/bbu033
Appears in Collections:
자연과학대학 > 통계학전공 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE