DSpace at EWHA: Integrated genome sizing (IGS) approach for the parallelization of whole genome analysis

Browse

My Repository

View : 724 Download: 391

Full metadata record

DC Field	Value	Language
dc.contributor.author	김형래	*
dc.contributor.author	김한나	*
dc.date.accessioned	2019-01-02T16:30:09Z	-
dc.date.available	2019-01-02T16:30:09Z	-
dc.date.issued	2018	*
dc.identifier.issn	1471-2105	*
dc.identifier.other	OAK-24061	*
dc.identifier.uri	https://dspace.ewha.ac.kr/handle/2015.oak/248042	-
dc.description.abstract	Background: The use of whole genome sequence has increased recently with rapid progression of next-generation sequencing (NGS) technologies. However, storing raw sequence reads to perform large-scale genome analysis pose hardware challenges. Despite advancement in genome analytic platforms, efficient approaches remain relevant especially as applied to the human genome. In this study, an Integrated Genome Sizing (IGS) approach is adopted to speed up multiple whole genome analysis in high-performance computing (HPC) environment. The approach splits a genome (GRCh37) into 630 chunks (fragments) wherein multiple chunks can simultaneously be parallelized for sequence analyses across cohorts. Results: IGS was integrated on Maha-Fs (HPC) system, to provide the parallelization required to analyze 2504 whole genomes. Using a single reference pilot genome, NA12878, we compared the NGS process time between Maha-Fs (NFS SATA hard disk drive) and SGI-UV300 (solid state drive memory). It was observed that SGI-UV300 was faster, having 32.5 mins of process time, while that of the Maha-Fs was 55.2 mins. Conclusions: The implementation of IGS can leverage the ability of HPC systems to analyze multiple genomes simultaneously. We believe this approach will accelerate research advancement in personalized genomic medicine. Our method is comparable to the fastest methods for sequence alignment. © 2018 The Author(s).	*
dc.language	English	*
dc.publisher	BioMed Central Ltd.	*
dc.subject	Genome analysis	*
dc.subject	Genome sizing	*
dc.subject	Infrastructure	*
dc.subject	Sequencing	*
dc.subject	Statistics	*
dc.subject	Storage	*
dc.subject	Whole genome	*
dc.title	Integrated genome sizing (IGS) approach for the parallelization of whole genome analysis	*
dc.type	Article	*
dc.relation.issue	1	*
dc.relation.volume	19	*
dc.relation.index	SCIE	*
dc.relation.index	SCOPUS	*
dc.relation.journaltitle	BMC Bioinformatics	*
dc.identifier.doi	10.1186/s12859-018-2499-1	*
dc.identifier.wosid	WOS:000451968900001	*
dc.identifier.scopusid	2-s2.0-85057853109	*
dc.author.google	Sona P.	*
dc.author.google	Hong J.H.	*
dc.author.google	Lee S.	*
dc.author.google	Kim B.J.	*
dc.author.google	Hong W.-Y.	*
dc.author.google	Jung J.	*
dc.author.google	Kim H.-N.	*
dc.author.google	Kim H.-L.	*
dc.author.google	Christopher D.	*
dc.author.google	Herviou L.	*
dc.author.google	Im Y.H.	*
dc.author.google	Lee K.-Y.	*
dc.author.google	Kim T.S.	*
dc.contributor.scopusid	김형래(57202558385;57219111690;57567109600)	*
dc.contributor.scopusid	김한나(55950033500;57224993635)	*
dc.date.modifydate	20240118123830	*