DSpace at EWHA: Data-driven drug discovery by chemical genomics approaches

Browse

My Repository

DSpace at EWHA일반대학원 생명과학과 Theses_Ph.D

View : 1358 Download: 0

Data-driven drug discovery by chemical genomics approaches

Title: Data-driven drug discovery by chemical genomics approaches

Authors: 이해승

Issue Date: 2015

Department/Major: 대학원 생명과학과

Publisher: 이화여자대학교 대학원

Degree: Doctor

Advisors: 김완규

이상혁

Abstract: With the accumulation of chemical genomic data, novel approaches have been actively developed in order to interrogate and translate those data to facilitate drug discovery and development. Among the approaches, in-silico drug repositioning (DR) has a great potential to discover new indications of existing drugs, particularly in the chemical genomic era. New and novel types of datasets are being generated such as drug targets, structure, drug-induced omics profiles, and large-scale drug screening dataset. In this thesis, I focus on the two types of dataset that were relatively less explored for their utility in in-silico DR: 1) LINCS (Library of Integrated Network-based Cellular Signatures) consisting of drug-induced transcriptional profiles for > 20,000 compounds in multiple cancer cell lines, 2) PubChem Bioassay database currently storing more than a million screen records involving about 2,000,000 compounds across thousands of molecular targets. They provide a global view of transcriptional or genomic perturbation even by unknown off-targets, and rich information on the drug’s mode-of-action. This thesis focuses on data-driven computational drug discovery by chemical genomics approach. This thesis is composed of three chapters. Chapter I describes background information on the field of computational drug repositioning and review different strategies to predict new indication of existing drugs. Chapter II describes a chemical genomics approach to identify DR candidates for three types of cancers: glioblastoma, lung cancer, and breast cancer. First, I showed that the drug-induced expression profile is more effective than other features, such as the drug structure or target, particularly in identifying novel DR candidates. The utility of DR was robustly demonstrated using multiple HTS datasets as unbiased benchmarks. Second, I predicted 14 high-scoring DR candidates solely based on expression signatures. Eight of the fourteen drugs showed significant anti-proliferative activity against glioblastoma; i.e., ivermectin, trifluridine, astemizole, amlodipine, maprotiline, apomorphine, mometasone, and nortriptyline. The DR score strongly correlated with that of cell-based experimental results; the top seven DR candidates were positive, corresponding to an approximately 20-fold enrichment compared with conventional HTS. Despite diverse original indications, the pattern of pathway enrichment of all DR candidates was highly similar to that of other cancer drugs, which may indicate the modes of action of these drugs. In chapter III, I used bioassay data from PubChem BioAssay to construct a large-scale compound-compound network. The network was built by compound comparisons through bioactivity profiles. I assess the utility of this network for three applications: virtual screening, target identification, and drug repositioning. When applied to virtual screening, the network achieved good performance for predicting structurally diverse hit compounds in target-based assays, in-house cell based assays and known active compound sets. Also, I showed that the bioactivity-based similarity could be used as a means for target identification. Furthermore, drug association network extracted from the compound association network elicited drug clusters of similar physiological effects. It supports feasibility of drug repositioning.;방대한 화학유전체 데이터의 축적과 더불어, 이를 이용한 다양하고 새로운 방법들이 효율적인 약물 개발을 위해 개발되어 왔다. 그 중 컴퓨터를 사용한 신약 재창출 (computational drug repositioning) 은 빅데이터 시대에 맞춰 기존 의약품의 새로운 약효를 발굴할 수 있는 기회를 제공한다. 그 동안 약물의 타겟, 구조, 오믹스 데이터, 대용량 스크리닝 데이터 등과 같은 새로운 데이터 타입들이 생산되어왔으며, 이를 이용한 다양한 예측 알고리즘들이 신약 재장출을 성공적으로 이끌어 왔다. 최근, 대량의 화학 유전체 데이터들이 공개적으로 제공되면서, 다음과 같은 데이터베이스가 관심을 받고 있다. 1) LINCS (Library of Integrated Network-based Cellular Signature) 는 20,000 여 개의 화합물을 여러 암 세포주에 처리한 약물 유도 전사체 프로파일을 제공한다. 2) PubChem BioAssay 데이터베이스는 현재까지 다양한 분자 타겟에 대해 2,000,000여 개의 화합물을 처리한 생물학적 검정 실험(bioassay) 결과를 제공한다. 이 데이터베이스들은 약물의 작용 양식(mechanism of action)을 도출할 수 있는 정보를 전사체와 유전체 수준에서 제공한다. 이 논문은 이러한 화학 유전체 데이터 기반으로 약물을 개발하는 것에 초점을 맞추고 있다. 이 논문은 크게 세 가지 단원으로 구성된다. 첫 번째 단원에서는 컴퓨터를 사용한 신약 재창출의 배경을 설명하고, 그 것을 위한 다양한 연구 전략들을 소개한다. 두 번째 단원에서는 뇌종양, 폐암, 유방암에서 효과를 내는 신약재창출 후보를 예측하는 연구를 다루었다. 먼저, 새로운 후보를 예측하는 것에 약물 전사체 데이터가 약물 타겟, 약물 구조를 이용한 예측보다 우수함을 보였다. 약물 전사체 데이터에 기반하여 뇌종양에 항암효과를 나타낼14개의 약물 후보를 예측하여 뇌종양 환자 유래 세포주에 세포 독성(cytotoxicity)실험을 수행하였다. 전사체 데이터기반 점수는 세포독성 수치의 상관관계를 보였으며, 14개 중 8개의 약물 (ivermectin, trifluridine, astemizole, amlodipine, maprotiline, apomorphine, mometasone, and nortriptyline)에서 유의한 항암효과를 확인하였다. 이 실험을 통해 약물 전사체 기반 예측력이 기존의 HTS와 비교하였을 때 약 20배가량 Hit이 농축되어 있는 것을 확인하였다. 각 후보약물의 본래 작용질병이 다름에도 불구하고, 모든 후보들의 생체경로(pathway) 활성 패턴이 다른 항암 약물과 유사하였고, 이로부터 이들 약물의 작용 양식을 해석하였다. 세 번째 단원에서, PubChem BioAssay의 생물학적 검정 데이터 (Bioassay)로부터 대용량 화합물-화합물 네트워크를 구축한 방법을 소개한다. 이 네트워크는 대용량으로 화합물 간에 생활성 프로파일의 유사도를 계산하여 생성되었다. 이 네트워크를 기반으로 가상 탐색 실험, 타겟 인식, 그리고 신약 재창출에 대한 적용가능성을 검증하였다. 가상 탐색 실험에 적용할 때, 이 네트워크는 구조적으로 다양한 활성 화합물을 예측해내었다. 특히, 타겟에 대한 활성이 알려진 화합물 세트를 예측하는데 우수한 결과를 나타내었다. 또한, 이 화합물 네트워크를 약물 타겟 규명에 적용시켰을 때, 약 50%의 알려진 타겟을 정확하게 예측해내는 것을 확인하였다. 더 나아가, FDA승인된 약물로 구성된 서브-네트워크를 분석하여 유사한 생리활성 효과를 가지는 약물 클러스터를 이끌어 냈으며, 이 네트워크 모듈 분석을 통해 신약 재창출의 실현 가능성을 제시하였다.