Search CORE

198 research outputs found

Hierarchical maximum likelihood clustering approach

Author: Boroevich Keith
Kamatani Yoichiro
Kubo Michiaki
Sharma Alokanand
Shigemizu Daichi
Tsunoda T.
Publication venue: IEEE
Publication date: 01/01/2016
Field of study

Objective: In this work, we focused on developing a clustering approach for biological data. In many biological analyses, such as multi-omics data analysis and genome-wide association studies (GWAS) analysis, it is crucial to find groups of data belonging to subtypes of diseases or tumors. Methods: Conventionally, the k-means clustering algorithm is overwhelmingly applied in many areas including biological sciences. There are, however, several alternative clustering algorithms that can be applied, including support vector clustering. In this paper, taking into consideration the nature of biological data, we propose a maximum likelihood clustering scheme based on a hierarchical framework. Results: This method can perform clustering even when the data belonging to different groups overlap. It can also perform clustering when the number of samples is lower than the data dimensionality. Conclusion: The proposed scheme is free from selecting initial settings to begin the search process. In addition, it does not require the computation of the first and second derivative of likelihood functions, as is required by many other maximum likelihood based methods. Significance: This algorithm uses distribution and centroid information to cluster a sample and was applied to biological data. A Matlab implementation of this method can be downloaded from the web-link http://www.riken.jp/en/research/labs/ims/med_sci_math/

Crossref

ZENODO

University of the South Pacific Electronic Research Repository

Griffith Research Online

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Recommended from our members

Combined burden and functional impact tests for cancer driver discovery using DriverPower

Author: Abascal Federico
Amin Samirkumar B.
Bader Gary D.
Bandopadhayay Pratiti
Barenboim Jonathan
Beroukhim Rameen
Bertl Johanna
Boroevich Keith A.
Brunak Søren
Campbell Peter J.
Carlevaro-Fita Joana
Chakravarty Dimple
Chan Calvin Wing Yiu
Chen Ken
Choi Jung Kyoon
Deu-Pons Jordi
Dhingra Priyanka
Diamanti Klev
Feuerbach Lars
Fink J. Lynn
Fonseca Nuno A.
Frigola Joan
Gambacorti-Passerini Carlo
Garsed Dale W.
Gerstein Mark
Getz Gad
Guo Qianyun
Gut Ivo G.
Haan David
Hamilton Mark P.
Haradhvala Nicholas J.
Harmanci Arif O.
Helmy Mohamed
Herrmann Carl
Hess Julian M.
Hobolth Asger
Hodzic Ermin
Hong Chen
Hornshøj Henrik
Isaev Keren
Izarzugaza Jose M.G.
Johnson Rory
Johnson Todd A.
Juul Malene
Juul Randi Istrup
Kahles Andre
Kahraman Abdullah
Kellis Manolis
Khurana Ekta
Shuai Shimin
Stein Lincoln D.
Townend David
Publication venue
Publication date: 01/01/2020
Field of study

The discovery of driver mutations is one of the key motivations for cancer genome sequencing. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumour types, we describe DriverPower, a software package that uses mutational burden and functional impact evidence to identify driver mutations in coding and non-coding sites within cancer whole genomes. Using a total of 1373 genomic features derived from public sources, DriverPower's background mutation model explains up to 93% of the regional variance in the mutation rate across multiple tumour types. By incorporating functional impact scores, we are able to further increase the accuracy of driver discovery. Testing across a collection of 2583 cancer genomes from the PCAWG project, DriverPower identifies 217 coding and 95 non-coding driver candidates. Comparing to six published methods used by the PCAWG Drivers and Functional Interpretation Working Group, DriverPower has the highest F1 score for both coding and non-coding driver discovery. This demonstrates that DriverPower is an effective framework for computational driver discovery

Princeton University Open Access Repository

Digitala Vetenskapliga Arkivet - Academic Archive On-line

UPF Digital Repository

Maastricht University Research Portal

Repository for Publications and Research Data

Lund University Publications

DSpace@MIT

Ghent University Academic Bibliography

Publikationer från Uppsala Universitet

Copenhagen University Research Information System

eScholarship - University of California

Apollo (Cambridge)

Bern Open Repository and Information System (BORIS)

University of Melbourne Institutional Repository

Comparative Genomics Identifies Candidate Genes for Infectious Salmon Anemia (ISA) Resistance in Atlantic Salmon (Salmo salar)

Author: Boroevich Keith A.
Davidson William S.
Koop Ben F.
Li Jieying
Publication venue: Springer-Verlag
Publication date: 01/01/2010
Field of study

Infectious salmon anemia (ISA) has been described as the hoof and mouth disease of salmon farming. ISA is caused by a lethal and highly communicable virus, which can have a major impact on salmon aquaculture, as demonstrated by an outbreak in Chile in 2007. A quantitative trait locus (QTL) for ISA resistance has been mapped to three microsatellite markers on linkage group (LG) 8 (Chr 15) on the Atlantic salmon genetic map. We identified bacterial artificial chromosome (BAC) clones and three fingerprint contigs from the Atlantic salmon physical map that contains these markers. We made use of the extensive BAC end sequence database to extend these contigs by chromosome walking and identified additional two markers in this region. The BAC end sequences were used to search for conserved synteny between this segment of LG8 and the fish genomes that have been sequenced. An examination of the genes in the syntenic segments of the tetraodon and medaka genomes identified candidates for association with ISA resistance in Atlantic salmon based on differential expression profiles from ISA challenges or on the putative biological functions of the proteins they encode. One gene in particular, HIV-EP2/MBP-2, caught our attention as it may influence the expression of several genes that have been implicated in the response to infection by infectious salmon anemia virus (ISAV). Therefore, we suggest that HIV-EP2/MBP-2 is a very strong candidate for the gene associated with the ISAV resistance QTL in Atlantic salmon and is worthy of further study

Springer - Publisher Connector

PubMed Central

Genomic Organization and Evolution of the Atlantic salmon Hemoglobin Repertoire

Author: Boroevich Keith
Boroevich Keith
Chow William
Chow William
Davidson Evelyn
Davidson Evelyn
Davidson William
Davidson William
Koop Ben
Koop Ben
Lubieniecki Krzysztof
Lubieniecki Krzysztof
Phillips Ruth
Phillips Ruth
Quinn Nicole
Quinn Nicole
Publication venue: 'Simon Fraser University Library'
Publication date: 01/01/2010
Field of study

Summit Research Repository (Simon Fraser University)

Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen

Author: Abante J
Abecassis BS
Aben N
Aghamirzaie D
Ahsen ME
Aittokallio T
Akhtari FS
Al-lazikani B
Alam T
Allam A
Allen C
Altarawy D
Alves V
Amadoz A
Anchang B
Angel Pujana M
Antolin AA
Ash JR
Ba-alawi W
Bagheri M
Bajic V
Ball G
Ballester PJ
Baptista D
Bare C
Bateson M
Bender A
Bertrand D
Boroevich KA
Bosdriesz E
Bougouffa S
Bounova G
Brouwer T
Bryant B
Bulusu KC
Calaza M
Calderone A
Calza S
Capuzzi S
Carbonell-Caballero J
Carlin D
Carter H
Castagnoli L
Celebi R
Cesareni G
Chang H
Chen G
Chen H
Chen H
Cheng L
Chernomoretz A
Chicco D
Cho K-H
Cho S
Choi D
Choi J
Choi K
Choi M
Coker E
Combinatio A-SD
Cortes-Ciriano I
Cserzo M
Cubuk C
Curtis C
Dang CC
de Almeida MP
De Cock M
de Esch I
de Graaf C
De Maeyer D
De Niz C
de Ruiter JR
De Troyer E
Di Veroli GY
Dijkstra T
Dopazo J
Draghici S
Drosou A
Dry JR
Dumontier M
Ehrhart F
Eid F-E
ElHefnawi M
Elmarakeby H
Engin HB
Evelo C
Falcao AO
Farag S
Fawell S
Fernandez-Lozano C
Fisch K
Flobak A
Fornari C
Foroushani ABK
Fotso DC
Fourches D
Friend S
Frigessi A
Gao F
Gao X
Garnett MJ
Gerold JM
Gestraud P
Ghazoui Z
Ghosh S
Gillberg J
Godoy-Lorite A
Godynyuk L
Godzik A
Goldenberg A
Gomez-Cabrero D
Gonen M
Gray H
Grechkin M
Guan Y
Guimera R
Guinney J
Guney E
Haibe-Kains B
Han Y
Hase T
He D
He L
Heath LS
Hellton KH
Helmer-Citterich M
Hidalgo MR
Hidru D
Hill SM
Hochreiter S
Hong S
Hovig E
Hsueh Y-C
Hu Z
Huang JK
Huang RS
Hunyady L
Hwang J
Hwang TH
Hwang W
Hwang Y
Isayev O
Jack J
Jahandideh S
Jang IS
Jeon M
Ji J
Jo Y
Kamola PJ
Kanev GK
Kang J
Karacosta L
Karimi M
Kaski S
Kazanov M
Khamis AM
Khan SA
Kiani NA
Kim A
Kim J
Kim J
Kim K
Kim K
Kim S
Kim Y
Kim Y
Kirk PDW
Kitano H
Klambauer G
Knowles D
Ko M
Kohn-Luque A
Kooistra AJ
Kuenemann MA
Kuiper M
Kurz C
Kwon M
Laegreid A
Lederer S
Lee H
Lee J
Lee YW
Leppaho E
Lewis R
Li J
Li L
Liley J
Lim WK
Lin C
Liu Y
Lopez Y
Low J
Lysenko A
Machado D
Madhukar N
Malpartida AB
Mamitsuka H
Marabita F
Marchal K
Marttinen P
Mason D
Mason MJ
Mazaheri A
Mehmood A
Mehreen A
Menden MP
Michaut M
Miller RA
Mitsopoulos C
Modos D
Moo K
Motsinger-Reif A
Movva R
Muraru S
Muratov E
Mushthofa M
Nagarajan N
Nakken S
Nath A
Neto EC
Neuvial P
Newton R
Nguyen T
Ning Z
Norman T
Oliva B
Olsen C
Palmeri A
Panesar B
Papadopoulos S
Park J
Park S
Park S
Pawitan Y
Peluso D
Pendyala S
Peng J
Perfetto L
Pirro S
Plevritis S
Politi R
Poon H
Porta E
Prellner I
Preuer K
Ramnarine R
Reid JE
Reyal F
Richardson S
Ricketts C
Rieswijk L
Rocha M
Rodriguez-Gonzalvez C
Roell K
Romeo Aznar V
Rotroff D
Rukawa P
Sadacca B
Saez-Rodriguez J
Safikhani Z
Safitri F
Sales-Pardo M
Sauer S
Schlichting M
Seoane JA
Serra J
Shang M-M
Sharma A
Sharma H
Shen Y
Shiga M
Shin M
Shkedy Z
Shopsowitz K
Sinai S
Skola D
Smirnov P
Soerensen IF
Soerensen P
Song J-H
Song SO
Soufan O
Spitzmueller A
Steipe B
Stolovitzky G
Suphavilai C
Szalai B
Tamayo SP
Tamborero D
Tang EKY
Tang J
Tanoli Z-U-R
Tarres-Deulofeu M
Tegner J
Thommesen L
Tonekaboni SAM
Tran H
Truong A
Tsunoda T
Turu G
Tzeng G-Y
Van Daele D
van Engelen B
van Laarhoven T
Van Moerbeke M
van Westen GJP
Verbeke L
Videla S
Vis D
Vogel R
Voronkov A
Votis K
Walk OBD
Wang A
Wang D
Wang H-QH
Wang P-W
Wang S
Wang W
Wang X
Wang X
Wennerberg K
Wernisch L
Wessels L
Westerman BA
White SR
Wijayawardena B
Willighagen E
Wolfinger R
Wurdinger T
Xie L
Xie S
Xu H
Yadav B
Yau C
Yeerna H
Yin JW
Yu M
Yu M
Yu T
Yun SJ
Zakharov A
Zamichos A
Zanin M
Zaslavskiy M
Zeng L
Zenil H
Zhang F
Zhang P
Zhang W
Zhao H
Zhao L
Zheng W
Zoufir A
Zucknick M
Publication venue: Nature Publishing Group
Publication date: 01/01/2019
Field of study

The effectiveness of most cancer targeted therapies is short-lived. Tumors often develop resistance that might be overcome with drug combinations. However, the number of possible combinations is vast, necessitating data-driven approaches to find optimal patient-specific treatments. Here we report AstraZeneca's large drug combination dataset, consisting of 11,576 experiments from 910 combinations across 85 molecularly characterized cancer cell lines, and results of a DREAM Challenge to evaluate computational strategies for predicting synergistic drug pairs and biomarkers. 160 teams participated to provide a comprehensive methodological development and benchmarking. Winning methods incorporate prior knowledge of drug-target interactions. Synergy is predicted with an accuracy matching biological replicates for >60% of combinations. However, 20% of drug combinations are poorly predicted by all methods. Genomic rationale for synergy predictions are identified, including ADAM17 inhibitor antagonism when combined with PIK3CB/D inhibition contrasting to synergy when combined with other PI3K-pathway inhibitors in PIK3CA mutant cells

VU Research Portal

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Publikationsserver der Universität Tübingen

Archivio istituzionale della ricerca - Università di Brescia

Leiden University Scholary Publications

UPF Digital Repository

NORA - Norwegian Open Research Archives

White Rose Research Online

VTech Works (Virginia Tech)

CONICET Digital

Ghent University Academic Bibliography

Aaltodoc Publication Archive

Griffith Research Online

Oxford University Research Archive

Repository of the Academy's Library

Apollo (Cambridge)

ScholarWolf (University of Nevada, Reno)

Diposit Digital de la Universitat de Barcelona

ScholarBank@NUS

KAIST Institutional Repository

Universidade do Minho: RepositoriUM

Ege University Institutional Repository

Spiral - Imperial College Digital Repository

Document Server@UHasselt (Universiteit Hasselt)

ART

Helsingin yliopiston digitaalinen arkisto

Queen Mary Research Online

Lirias

Maastricht University Research Portal

University of the South Pacific Electronic Research Repository

eScholarship - University of California

Semmelweis Repository

Document Server@UHasselt

Archivio della ricerca- Università di Roma La Sapienza

RISalud-ANDALUCÍA

An integrative machine learning approach for prediction of toxicity - related drug safety

Author: Boroevich Keith
Lysenko Artem
Sharma Alokanand
Tsunoda Tatsuhiko
Publication venue: 'Life Science Alliance, LLC'
Publication date: 01/01/2018
Field of study

Recent trends in drug development have been marked by diminishing returns caused by the escalating costs and falling rates of new drug approval. Unacceptable drug toxicity is a substantial cause of drug failure during clinical trials and the leading cause of drug withdraws after release to the market. Computational methods capable of predicting these failures can reduce the waste of resources and time devoted to the investigation of compounds that ultimately fail. We propose an original machine learning method that leverages identity of drug targets and off-targets, functional impact score computed from Gene Ontology annotations, and biological network data to predict drug toxicity. We demonstrate that our method (TargeTox) can distinguish potentially idiosyncratically toxic drugs from safe drugs and is also suitable for speculative evaluation of different target sets to support the design of optimal low-toxicity combinations

University of the South Pacific Electronic Research Repository

Griffith Research Online

Prediction Models of Breast Cancer Outcome

Author: Boroevich Keith A
Iwase Takuji
Katagiri Toyomasa
Miya Fuyuki
Shigemizu Daichi
Suzuki Yasuyo
Tsunoda Tatsuhiko
Yoshimoto Masataka
Zembutsu Hitoshi
Publication venue: 'Wiley'
Publication date: 27/10/2020
Field of study

The goal of this study is to establish a method for predicting overall survival (OS ) and disease‐free survival (DFS ) in breast cancer patients after surgical operation. The gene expression profiles of cancer tissues from the patients, who underwent complete surgical resection of breast cancer and were subsequently monitored for postoperative survival, were analyzed using cDNA microarrays. We detected seven and three probes/genes associated with the postoperative OS and DFS , respectively, from our discovery cohort data. By incorporating these genes associated with the postoperative survival into MammaPrint genes, often used to predict prognosis of patients with early‐stage breast cancer, we constructed postoperative OS and DFS prediction models from the discovery cohort data using a Cox proportional hazard model. The predictive ability of the models was evaluated in another independent cohort using Kaplan–Meier (KM ) curves and the area under the receiver operating characteristic curve (AUC ). The KM curves showed a statistically significant difference between the predicted high‐ and low‐risk groups in both OS (log‐rank trend test P = 0.0033) and DFS (log‐rank trend test P = 0.00030). The models also achieved high AUC scores of 0.71 in OS and of 0.60 in DFS . Furthermore, our models had improved KM curves when compared to the models using MammaPrint genes (OS : P = 0.0058, DFS : P = 0.00054). Similar results were observed when our model was tested in publicly available datasets. These observations indicate that there is still room for improvement in the current methods of predicting postoperative OS and DFS in breast cancer

Tokushima University Institutional Repository

Assessing the Feasibility of GS FLX Pyrosequencing for Sequencing the Atlantic salmon Genome

Author: Boroevich Keith
Boroevich Keith
Bouffars Pascal
Bouffars Pascal
Chow William
Chow William
Davidson William
Davidson William
Desany Brian
Desany Brian
Harkins Timothy
Harkins Timothy
Jarvie Thomas
Jarvie Thomas
Knight James
Knight James
Koop Ben
Koop Ben
Levenkova Natasha
Levenkova Natasha
Lubieniecki Krzysztof
Lubieniecki Krzysztof
Quinn Nicole
Quinn Nicole
Publication venue: 'Simon Fraser University Library'
Publication date: 01/01/2008
Field of study

Summit Research Repository (Simon Fraser University)

DeepInsight: a methodology to transform a non - image data to an image for convolution neural network architecture

Author: Boroevich Keith A.
Sharma Alokanand
Shigemizu Daichi
Tsunoda Tatsuhiko
Vans Edwin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

It is critical, but difficult, to catch the small variation in genomic or other kinds of data that differentiates phenotypes or categories. A plethora of data is available, but the information from its genes or elements is spread over arbitrarily, making it challenging to extract relevant details for identification. However, an arrangement of similar genes into clusters makes these differences more accessible and allows for robust identification of hidden mechanisms (e.g. pathways) than dealing with elements individually. Here we propose, DeepInsight, which converts non-image samples into a well-organized image-form. Thereby, the power of convolution neural network (CNN), including GPU utilization, can be realized for non-image samples. Furthermore, DeepInsight enables feature extraction through the application of CNN for non-image samples to seize imperative information and shown promising results. To our knowledge, this is the first work to apply CNN simultaneously on different kinds of non-image datasets: RNA-seq, vowels, text, and artificial

University of the South Pacific Electronic Research Repository

Griffith Research Online