34 research outputs found

Use of Text Data in Identifying and Prioritizing Potential Drug Repositioning Candidates

Author: Rastegar-Mojarad Majid
Publication venue: UWM Digital Commons
Publication date: 01/05/2019

New drug development costs between 500 million and 2 billion dollars and takes 10-15 years, with a success rate of less than 10%. Drug repurposing (defined as discovering new indications for existing drugs) could play a significant role in drug development, especially considering the declining success rates of developing novel drugs. In the period 2007-2009, drug repurposing led to the launch of 30-40% of new drugs. Typically, new indications for existing medications are identified by accident. However, new technologies and a large number of available resources enable the development of systematic approaches to identify and validate drug-repurposing candidates at significantly lower cost. A variety of resources, such as biomedical literature, clinical notes, and genetic data, have been utilized to identify novel drug repurposing candidates. In this dissertation, we focused on using text data to identify and prioritize drug repositioning candidates and conducted five studies.
In the first study, we aimed to assess the feasibility of using patient reviews from social media to identify potential candidates for drug repurposing. We retrieved patient reviews of 180 medications from an online forum, WebMD. Using dictionary-based and machine learning approaches, we identified disease names in the reviews. Several publicly available resources were used to exclude comments containing known indications and adverse drug effects. After manually reviewing some of the remaining comments, we implemented a rule-based system to identify beneficial effects. The dictionary-based system and the machine learning system identified 2,178 and 6,171 disease names, respectively, in 64,616 patient comments. We provided a list of 10 common patterns that patients used to report beneficial effects or uses of medication. After manually reviewing the comments tagged by our rule-based system, we identified five potential drug repurposing candidates. To our knowledge, this was the first study to consider using social media data to identify drug-repurposing candidates. We found that even a rule-based system with a limited number of rules could identify beneficial effect mentions in patients' comments. Our preliminary study shows that social media has the potential to be used in drug repurposing.
In the second study, we investigated the significance of extracting information from multiple sentences, specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature-based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in the Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the need for discourse-level analysis in extracting relations from biomedical literature.
In the third and fourth studies, we focused on prioritizing drug repositioning candidates extracted from biomedical literature, which we refer to as Literature-Based Discovery (LBD). In the third study, we used drug-gene and gene-disease semantic predications extracted from Medline abstracts to generate a list of potential drug-disease pairs. We further ranked the generated pairs by assigning scores based on the predicates that qualify drug-gene and gene-disease relationships. On comparing the top-ranked drug-disease pairs against the Comparative Toxicogenomics Database, we found that a significant percentage of top-ranked pairs appeared in CTD. Co-occurrence of these high-ranked pairs in Medline abstracts was then used to improve the rankings of the inferred drug-disease relations. Finally, manual evaluation of the top ten pairs ranked by our approach revealed that nine of them have good potential for biological significance based on expert judgment.
In the fourth study, we proposed a method, utilizing information surrounding causal findings, to prioritize discoveries generated by LBD systems. We focused on discovering drug-disease relations, which have the potential to identify drug repositioning candidates or adverse drug reactions. Our LBD system used drug-gene and gene-disease semantic predications in SemMedDB as causal findings and Swanson’s ABC model to generate potential drug-disease relations. Using sentences as a source of causal findings, our ranking method trained a binary classifier to classify generated drug-disease relations into desired classes. We trained and tested our classifier for three different purposes: a) drug repositioning, b) adverse drug-event detection, and c) drug-disease relation detection. The classifier obtained F-measures of 0.78, 0.86, and 0.83, respectively, for these tasks. The number of causal findings of each hypothesis that were classified as positive by the classifier is the main metric for ranking hypotheses in the proposed method. To evaluate the ranking method, we counted and compared the number of true relations in the top 100 pairs ranked by our method and by one of the previous methods. Out of 181 true relations in the test dataset, the proposed method ranked 20 in the top 100 relations, while the other method ranked 13.
In the last study, we used biomedical literature and clinical trials to rank potential drug repositioning candidates identified by Phenome-Wide Association Studies (PheWAS). Unlike previous approaches, in this study we did not limit our method to LBD. First, we generated a list of potential drug repositioning candidates using PheWAS. We retrieved 212,851 gene-disease associations from the PheWAS catalog and 14,169 gene-drug relationships from DrugBank. Following Swanson’s model, we generated 52,966 potential drug repositioning candidates. Then, we developed an information retrieval system to retrieve any evidence of those candidates co-occurring in the biomedical literature and clinical trials. We identified nearly 14,800 drug-disease pairs with some evidence of support. In addition, we identified more than 38,000 novel candidates for repurposing, encompassing hundreds of different disease states and over 1,000 individual medications. We anticipate that these results will be highly useful for hypothesis generation in the field of drug repurposing.
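The candidate-generation step that follows Swanson’s ABC model can be sketched as a join on the shared gene: a drug (A) linked to a gene (B) and that gene linked to a disease (C) yields a drug-disease hypothesis (A-C). The Python below is a minimal illustration under stated assumptions, not the dissertation's implementation. The function names, the tuple layout, and the toy identifiers (`G1`, `drugA`, `diseaseX`) are all hypothetical, and ranking by the number of supporting genes is a simple stand-in for the scoring methods the abstract describes.

```python
from collections import defaultdict


def abc_candidates(gene_disease, gene_drug, known=frozenset()):
    """Join gene-drug and gene-disease links on the shared gene (ABC model).

    gene_disease: iterable of (gene, disease) pairs (hypothetical layout)
    gene_drug:    iterable of (gene, drug) pairs (hypothetical layout)
    known:        (drug, disease) pairs to exclude, e.g. known indications
    Returns a dict mapping (drug, disease) to the set of linking genes.
    """
    drugs_by_gene = defaultdict(set)
    for gene, drug in gene_drug:
        drugs_by_gene[gene].add(drug)

    support = defaultdict(set)
    for gene, disease in gene_disease:
        for drug in drugs_by_gene.get(gene, ()):
            if (drug, disease) not in known:
                support[(drug, disease)].add(gene)
    return dict(support)


def rank_by_support(support):
    """Order candidates by number of linking genes (an assumed, simple score)."""
    return sorted(support, key=lambda pair: (-len(support[pair]), pair))


# Toy data, invented purely for illustration.
toy_gene_drug = [("G1", "drugA"), ("G2", "drugA"), ("G2", "drugB")]
toy_gene_disease = [("G1", "diseaseX"), ("G2", "diseaseX"), ("G2", "diseaseY")]
toy_known = {("drugB", "diseaseY")}

candidates = abc_candidates(toy_gene_disease, toy_gene_drug, toy_known)
ranking = rank_by_support(candidates)
```

Indexing drugs by gene first keeps the join linear in the number of input pairs plus the number of generated candidates, which matters at the scale the abstract reports (hundreds of thousands of associations).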

University of Wisconsin-Milwaukee

Extraction and Classification of Drug-Drug Interaction from Biomedical Text Using a Two-Stage Classifier

Author: Rastegar-Mojarad Majid
Publication venue: UWM Digital Commons
Publication date: 01/12/2013

One of the critical causes of medical errors is drug-drug interaction (DDI), which occurs when one drug increases or decreases the effect of another drug. We propose a machine learning system to extract and classify drug-drug interactions from the biomedical literature, using the annotated corpus from the DDIExtraction-2013 shared task challenge. Our approach applies a two-stage classifier to handle the highly unbalanced class distribution in the corpus. The first stage is designed for binary classification of drug pairs as interacting or non-interacting, and the second stage for further classification of interacting pairs into one of four interaction types: advise, effect, mechanism, and int. To find the best set of features for classification, we explored many features, including stemmed words, bigrams, part-of-speech tags, verb lists, parse tree information, mutual information, and similarity measures, among others. As the system faced two different classification tasks, binary and multi-class, we also explored various classifiers in each stage. Our results show that the best-performing classifier in both stages was Support Vector Machines, and the best-performing features were the top 1,000 informative words and the part-of-speech tags between the two main drugs. We obtained an F-measure of 0.64, a 12% improvement over the system we submitted to the DDIExtraction 2013 competition.
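The control flow of such a two-stage cascade can be sketched as follows. This is only an illustration of the structure, assuming each stage is exposed as a plain callable. The abstract reports Support Vector Machines in both stages, whereas the keyword-based `toy_detect` and `toy_type` functions here are hypothetical stand-ins so the example runs with no external libraries. The `"none"` label and the example sentences are likewise invented.

```python
from typing import Callable, List, Sequence, TypeVar

X = TypeVar("X")

NEGATIVE = "none"  # hypothetical label for non-interacting pairs
DDI_TYPES = ("advise", "effect", "mechanism", "int")  # types named in the abstract


def two_stage_predict(
    instances: Sequence[X],
    detect: Callable[[X], bool],
    classify_type: Callable[[X], str],
) -> List[str]:
    """Stage 1 filters out non-interacting pairs; stage 2 types the rest."""
    labels = []
    for instance in instances:
        if not detect(instance):
            labels.append(NEGATIVE)
            continue
        label = classify_type(instance)
        if label not in DDI_TYPES:
            raise ValueError(f"unexpected interaction type: {label!r}")
        labels.append(label)
    return labels


# Toy stand-ins for the trained models (assumptions, not the authors' features).
def toy_detect(sentence: str) -> bool:
    return any(cue in sentence for cue in ("increase", "avoid", "inhibit"))


def toy_type(sentence: str) -> str:
    if "avoid" in sentence:
        return "advise"
    if "inhibit" in sentence:
        return "mechanism"
    return "effect"


examples = [
    "DRUG_A may increase the effect of DRUG_B",
    "avoid combining DRUG_A with DRUG_B",
    "DRUG_A and DRUG_B were both measured",
]
predictions = two_stage_predict(examples, toy_detect, toy_type)
```

Because the second stage only ever sees pairs the first stage accepted, it can be trained on the interacting pairs alone, which is one way a cascade sidesteps the dominance of the negative class.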

University of Wisconsin-Milwaukee

Dynamic enhancement of drug product labels to support drug safety, efficacy, and effectiveness

Author: Boyce Richard D
Hassanzadeh Oktie
Horn John R
Liakata Maria
Luciano Joanne S
Rastegar-Mojarad Majid
Schneider Jodi
Waard Anita De
Publication date: 01/01/2013

Out-of-date or incomplete drug product labeling information may increase the risk of otherwise preventable adverse drug events. In recognition of these concerns, the United States Food and Drug Administration (FDA) requires drug product labels to include specific information. Unfortunately, several studies have found that drug product labeling fails to keep current with the scientific literature. We present a novel approach to addressing this issue. The primary goal of this novel approach is to better meet the information needs of persons who consult the drug product label for information on a drug’s efficacy, effectiveness, and safety. Using FDA product label regulations as a guide, the approach links drug claims present in drug information sources available on the Semantic Web with specific product label sections. Here we report on pilot work that establishes the baseline performance characteristics of a proof-of-concept system implementing the novel approach. Claims from three drug information sources were linked to the Clinical Studies, Drug Interactions, and Clinical Pharmacology sections of the labels for drug products that contain one of 29 psychotropic drugs. The resulting Linked Data set maps 409 efficacy/effectiveness study results, 784 drug-drug interactions, and 112 metabolic pathway assertions derived from three clinically-oriented drug information sources (ClinicalTrials.gov, the National Drug File – Reference Terminology, and the Drug Interaction Knowledge Base) to the sections of 1,102 product labels. Proof-of-concept web pages were created for all 1,102 drug product labels that demonstrate one possible approach to presenting information that dynamically enhances drug product labeling. We found that approximately one in five efficacy/effectiveness claims were relevant to the Clinical Studies section of a psychotropic drug product, with most relevant claims providing new information.
We also identified several cases where all of the drug-drug interaction claims linked to the Drug Interactions section for a drug were potentially novel. The baseline performance characteristics of the proof-of-concept will enable further technical and user-centered research on robust methods for scaling the approach to the many thousands of product labels currently on the market.

Crossref

Aberystwyth Research Portal

Springer - Publisher Connector

PubMed Central

Warwick Research Archives Portal Repository

Access to Research at National University of Ireland, Galway

Global, regional, and national burden of colorectal cancer and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019

Author: Abbasi-kangevari Mohsen
Abd-rabu Rami
Abidi Hassan
Abu-gharbieh Eman
Acuna Juan Manuel
Adhikari Sangeet
Advani Shailesh M
Afzal Muhammad Sohail
Aghaie Meybodi Mohamad
Ahinkorah Bright Opoku
Ahmad Sajjad
Ahmadi Ali
Ahmadi Sepideh
Ahmed Haroon
Ahmed Luai A
Ahmed Muktar Beshir
Al Hamad Hanadi
Al-raddadi Rajaa M
Alahdab Fares
Alanezi Fahad Mashhour
Alanzi Turki M
Alhalaiqa Fadwa Alhalaiqa Naji
Alimohamadi Yousef
Alipour Vahid
Aljunid Syed Mohamed
Alkhayyat Motasem
Almustanyir Sami
Alvand Saba
Alvis-guzman Nelson
Amini Saeed
Ancuceanu Robert
Anoushiravani Amir
Anoushirvani Ali Arash
Ansari-moghaddam Alireza
Arabloo Jalal
Aryannejad Armin
Asghari Jafarabadi Mohammad
Athari Seyyed Shamsadin
Ausloos Floriane
Ausloos Marcel
Awedew Atalel Fentahun
Awoke Mamaru Ayenew
Ayana Tegegn Mulatu
Azadnajafabad Sina
Azami Hiva
Azangou-khyavy Mohammadreza
Azari Jafari Amirhossein
Badiye Ashish D
Bagherieh Sara
Bahadory Saeed
Baig Atif Amin
Baker Jennifer L
Banach Maciej
Barrow Amadou
Berhie Alemshet Yirga
Besharat Sima
Bhagat Devidas S
Bhagavathula Akshaya Srikanth
Bhala Neeraj
Bhattacharyya Krittika
Bhojaraja Vijayalakshmi S
Bibi Sadia
Bijani Ali
Biondi Antonio
Bjørge Tone
Bodicha Belay Boda Abule
Braithwaite Dejana
Brenner Hermann
Calina Daniela
Cao Chao
Cao Yin
Carreras Giulia
Carvalho Felix
Cerin Ester
Chakinala Raja Chandra
Cho William C S
Chu Dinh-toi
Conde Joao
Costa Vera Marisa
Cruz-martins Natália
Dadras Omid
Dai Xiaochen
Dandona Lalit
Dandona Rakhi
Danielewicz Anna
Demeke Feleke Mekonnen
Demissie Getu Debalkie
Desai Rupak
Dhamnetiya Deepak
Dianatinasab Mostafa
Diaz Daniel
Didehdar Mojtaba
Doaei Saeid
Doan Linh Phuong
Dodangeh Milad
Eghbalian Fatemeh
Ejeta Debela Debela
Ekholuenetale Michael
Ekundayo Temitope Cyrus
El Sayed Iman
Elhadi Muhammed
Enyew Daniel Berhanie
Eyayu Tahir
Ezzeddini Rana
Fakhradiyev Ildar Ravisovich
Farooque Umar
Farrokhpour Hossein
Farzadfar Farshad
Fatehizadeh Ali
Fattahi Hamed
Fattahi Nima
Fereidoonnezhad Masood
Fernandes Eduarda
Fetensa Getahun
Filip Irina
Fischer Florian
Foroutan Masoud
Gaal Peter Andras
Gad Mohamed M
Gallus Silvano
Garg Tushar
Getachew Tamiru
Ghamari Seyyed-hadi
Ghashghaee Ahmad
Ghith Nermin
Gholamalizadeh Maryam
Gholizadeh Navashenaq Jamshid
Gizaw Abraham Tamirat
Glasbey James C
Golechha Mahaveer
Goleij Pouya
Gonfa Kebebe Bekele
Gorini Giuseppe
Guha Avirup
Gupta Sapna
Gupta Veer Bala
Gupta Vivek Kumar
Haddadi Rasool
Hafezi-nejad Nima
Haj-mirzaian Arvin
Halwani Rabih
Haque Shafiul
Hariri Sanam
Hasaballah Ahmed I
Hassanipour Soheil
Hay Simon I
Herteliu Claudiu
Holla Ramesh
Hosseini Mohammad-salar
Hosseinzadeh Mehdi
Hostiuc Mihaela
Househ Mowafa
Huang Junjie
Humayun Ayesha
Iavicoli Ivo
Ilesanmi Olayinka Stephen
Ilic Irena M
Ilic Milena D
Islami Farhad
Iwagami Masao
Jahani Mohammad Ali
Jakovljevic Mihajlo
Javaheri Tahereh
Jayawardena Ranil
Jebai Rime
Jha Ravi Prakash
Joo Tamas
Joseph Nitin
Joukar Farahnaz
Jozwiak Jacek Jerzy
Kabir Ali
Kalhor Rohollah
Kamath Ashwin
Kapoor Neeti
Karaye Ibraheem M
Karimi Amirali
Kauppila Joonas H
Kazemi Asma
Keykhaei Mohammad
Khader Yousef Saleh
Khajuria Himanshu
Khalilov Rovshan
Khanali Javad
Khayamzadeh Maryam
Khodadost Mahmoud
Kim Hanna
Kim Min Seo
Kisa Adnan
Kisa Sezer
Kolahi Ali-asghar
Koohestani Hamid Reza
Kopec Jacek A
Koteeswaran Rajasekaran
Koyanagi Ai
Krishnamoorthy Yuvaraj
Kumar G Anil
Kumar Manoj
Kumar Vivek
La Vecchia Carlo
Lami Faris Hasan
Landires Iván
Ledda Caterina
Lee Sang-woong
Lee Wei-chen
Lee Yeong Yeh
Leong Elvynna
Li Bingyu
Lim Stephen S
Lobo Stany W
Loureiro Joana A
Lunevicius Raimundas
Madadizadeh Farzan
Mahmoodpoor Ata
Majeed Azeem
Malekpour Mohammad-reza
Malekzadeh Reza
Malik Ahmad Azam
Mansour-ghanaei Fariborz
Mantovani Lorenzo Giovanni
Martorell Miquel
Masoudi Sahar
Mathur Prashant
Meena Jitendra Kumar
Mehrabi Nasab Entezar
Mendoza Walter
Mentis Alexios-fotios A
Mestrovic Tomislav
Miao Jonasson Junmei
Miazgowski Bartosz
Miazgowski Tomasz
Mijena Gelana Fekadu Worku
Mirmoeeni Seyyedmohammadsadeq
Mirza-aghazadeh-attari Mohammad
Mirzaei Hamed
Misra Sanjeev
Mohammad Karzan Abdulmuhsin
Mohammadi Esmaeil
Mohammadi Saeed
Mohammadi Seyyede Momeneh
Mohammadian-hafshejani Abdollah
Mohammed Shafiu
Mohammed Teroj Abdulrahman
Moka Nagabhishek
Mokdad Ali H
Mokhtari Zeinab
Molokhia Mariam
Momtazmanesh Sara
Monasta Lorenzo
Moradi Ghobad
Moradzadeh Rahmatollah
Moraga Paula
Morgado-da-costa Joana
Mubarik Sumaira
Mulita Francesk
Naghavi Mohsen
Naimzada Mukhammad David
Nam Hae Sung
Natto Zuhair S
Nayak Biswa Prakash
Nazari Javad
Nazemalhosseini-mojarad Ehsan
Negoi Ionut
Nguyen Cuong Tat
Nguyen Son Hoang
Noor Nurulamin M
Noori Maryam
Noori Seyyed Mohammad Ali
Nuñez-samudio Virginia
Nzoputam Chimezie Igwegbe
Oancea Bogdan
Odukoya Oluwakemi Ololade
Oguntade Ayodipupo Sikiru
Okati-aliabad Hassan
Olagunju Andrew T
Olagunju Tinuke O
Ong Sokking
Ostroff Samuel M
Padron-monedero Alicia
Pakzad Reza
Pana Adrian
Pandey Anamika
Pashazadeh Kan Fatemeh
Patel Urvish K
Paudel Uttam
Pereira Renato B
Perumalsamy Navaraj
Pestell Richard G
Piracha Zahra Zahid
Pollok Richard Charles G
Pourshams Akram
Pourtaheri Naeimeh
Prashant Akila
Rabiee Mohammad
Rabiee Navid
Radfar Amir
Rafiei Sima
Rahman Mosiur
Rahmani Amir Masoud
Rahmanian Vahid
Rajai Nazanin
Rajesh Aashish
Ramezani-doroh Vajiheh
Ramezanzadeh Kiana
Ranabhat Kamal
Rashedi Sina
Rashidi Amirfarzan
Rashidi Mahsa
Rashidi Mohammad-mahdi
Rastegar Mandana
Rawaf David Laith
Rawaf Salman
Rawassizadeh Reza
Razeghinia Mohammad Sadegh
Renzaho Andre M N
Rezaei Negar
Rezaei Nima
Rezaei Saeid
Rezaeian Mohsen
Rezazadeh-khadem Sahba
Roshandel Gholamreza
Saber-ayad Maha Mohamed
Saberzadeh-ardestani Bahar
Saddik Basema
Sadeghi Hossein
Saeed Umar
Sahebazzamani Maryam
Sahebkar Amirhossein
Salek Farrokhi Amir
Salimi Amir
Salimzadeh Hamideh
Samadi Pouria
Samaei Mehrnoosh
Samy Abdallah M
Sanabria Juan
Santric-milicevic Milena M
Saqib Muhammad Arif Nadeem
Sarveazad Arash
Sathian Brijesh
Satpathy Maheswar
Schneider Ione Jayce Ceola
Sepanlou Sadaf G
Seylani Allen
Sha Feng
Shafiee Sayed Mohammad
Shaghaghi Zahra
Shahabi Saeed
Shaker Elaheh
Sharifi-rad Javad
Sharifian Maedeh
Sharma Rajesh
Sheikhbahaei Sara
Shetty Jeevan K
Shirkoohi Reza
Shobeiri Parnian
Siddappa Malleshappa Sudeep K
Silva Julian Guilherme
Silva Diego Augusto Santos
Singh Achintya Dinesh
Singh Jasvinder A
Siraj Md Shahjahan
Sivandzadeh Gholam Reza
Skryabin Valentin Yurievich
Skryabina Anna Aleksandrovna
Socea Bogdan
Solmi Marco
Soltani-zangbar Mohammad Sadegh
Song Suhang
Szerencsés Viktória
Szócska Miklós
Tabarés-seisdedos Rafael
Tabibian Elnaz
Taheri Majid
Taheriabkenar Yasaman
Taherkhani Amir
Talaat Iman M
Tan Ker-kan
Tbakhi Abdelghani
Tesfaye Bekele
Tiyuri Amir
Tollosa Daniel Nigusse
Touvier Mathilde
Tran Bach Xuan
Tusa Biruk Shalmeno
Ullah Irfan
Ullah Saif
Vacante Marco
Valadan Tahbaz Sahel
Veroux Massimiliano
Vo Bay
Vos Theo
Wang Cong
Westerman Ronny
Woldemariam Melat
Yahyazadeh Jabbari Seyed Hossein
Yang Lin
Yazdanpanah Fereshteh
Yu Chuanhua
Yuce Deniz
Yunusa Ismaeel
Zadnik Vesna
Zahir Mazyar
Zare Iman
Zhang Zhi-jiang
Zoladl Mohammad
Šekerija Mario
Publication venue: 'Elsevier BV'
Publication date: 01/01/2022

Funding: F Carvalho and E Fernandes acknowledge support from Fundação para a Ciência e a Tecnologia, I.P. (FCT), in the scope of the project UIDP/04378/2020 and UIDB/04378/2020 of the Research Unit on Applied Molecular Biosciences UCIBIO and the project LA/P/0140/2020 of the Associate Laboratory Institute for Health and Bioeconomy i4HB; FCT/MCTES through the project UIDB/50006/2020. J Conde acknowledges the European Research Council Starting Grant (ERC-StG-2019-848325). V M Costa acknowledges the grant SFRH/BHD/110001/2015, received by Portuguese national funds through Fundação para a Ciência e Tecnologia (FCT), IP, under the Norma Transitória DL57/2016/CP1334/CT0006.

AIR Universita degli studi di Milano

Repositório da Universidade Nova de Lisboa

Shahrekord University of Medical Sciences

BELTracker: evidence sentence retrieval for BEL statements

Author: Gebel
Hongfang Liu
Lupu
Majid Rastegar-Mojarad
Ravikumar Komandur Elayavilli
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2016

Crossref

Discovering associations between problem list and practice setting

Author: Feichen Shen
Hongfang Liu
Liwei Wang
Majid Rastegar-Mojarad
Yanshan Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2019

Background: The Health Information Technology for Economic and Clinical Health Act (HITECH) has greatly accelerated the adoption of electronic health records (EHRs) with the promise of better clinical decisions and patients’ outcomes. One of the core criteria for “Meaningful Use” of EHRs is to have a problem list that shows the most important health problems faced by a patient. The implementation of problem lists in EHRs has the potential to help practitioners provide customized care to patients. However, how to leverage problem lists in different practice settings to provide tailored care remains an open question, and the bottleneck lies in the associations between problem list and practice setting.
Methods: In this study, using sampled clinical documents associated with a cohort of patients who received their primary care at Mayo Clinic, we investigated the associations between problem list and practice setting through natural language processing (NLP) and topic modeling techniques. Specifically, after practice settings and problem lists were normalized, the χ2 test, term frequency-inverse document frequency (TF-IDF), and enrichment analysis were used to choose representative concepts for each setting. Then Latent Dirichlet Allocation (LDA) was used to train topic models and predict potential practice settings using similarity metrics based on the problem concepts representative of practice settings. Evaluation was conducted through 5-fold cross validation, and Recall@k, Precision@k, and F1@k were calculated.
Results: Our method can generate prioritized and meaningful problem lists corresponding to specific practice settings. For practice setting prediction, recall increases from 0.719 (k = 2) to 0.931 (k = 10), precision increases from 0.882 (k = 2) to 0.931 (k = 10), and F1 increases from 0.790 (k = 2) to 0.931 (k = 10).
Conclusion: To the best of our knowledge, our study is the first to attempt to discover the association between problem lists and hospital practice settings. In the future, we plan to investigate how to provide more tailored care by utilizing the association between problem list and practice setting revealed in this study.
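For readers unfamiliar with the evaluation measures, the sketch below shows one common way to compute Precision@k, Recall@k, and F1@k for a single ranked prediction list. It uses the standard information-retrieval definitions, which may differ from the exact definitions used in the study. The function name and the toy practice-setting labels are hypothetical, and the ranked list is assumed to contain no duplicates.

```python
def metrics_at_k(ranked, relevant, k):
    """Return (precision@k, recall@k, F1@k) for one ranked list.

    ranked:   predicted labels, best first (assumed duplicate-free)
    relevant: set of correct labels for this item
    k:        cutoff; must be positive
    """
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for label in ranked[:k] if label in relevant)
    precision = hits / k
    recall = hits / len(relevant) if relevant else 0.0
    if hits == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy example with invented labels.
toy_ranked = ["cardiology", "neurology", "oncology", "dermatology"]
toy_relevant = {"neurology", "dermatology"}

p2, r2, f2 = metrics_at_k(toy_ranked, toy_relevant, 2)
p4, r4, f4 = metrics_at_k(toy_ranked, toy_relevant, 4)
```

In a cross-validated setup, these per-item values would typically be averaged over all test items in each fold and then across folds.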

Directory of Open Access Journals