Search CORE

4,048 research outputs found

Motif Discovery through Predictive Modeling of Gene Regulation

Author: A. Battle
A.P. Gasch
C.E. Lawrence
E. Segal
E. Segal
E. Wingender
E.M. Conlon
G.Z. Hertz
H.J. Bussemaker
J.D. Hughes
N. Slonim
R.E. Schapire
T. Cover
T.I. Lee
T.L. Bailey
Y. Pilpel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

We present MEDUSA, an integrative method for learning motif models of transcription factor binding sites by incorporating promoter sequence and gene expression data. We use a modern large-margin machine learning approach, based on boosting, to enable feature selection from the high-dimensional search space of candidate binding sequences while avoiding overfitting. At each iteration of the algorithm, MEDUSA builds a motif model whose presence in the promoter region of a gene, coupled with activity of a regulator in an experiment, is predictive of differential expression. In this way, we learn motifs that are functional and predictive of regulatory response rather than motifs that are simply overrepresented in promoter sequences. Moreover, MEDUSA produces a model of the transcriptional control logic that can predict the expression of any gene in the organism, given the sequence of the promoter region of the target gene and the expression state of a set of known or putative transcription factors and signaling molecules. Each motif model is either a

k

-length sequence, a dimer, or a PSSM that is built by agglomerative probabilistic clustering of sequences with similar boosting loss. By applying MEDUSA to a set of environmental stress response expression data in yeast, we learn motifs whose ability to predict differential expression of target genes outperforms motifs from the TRANSFAC dataset and from a previously published candidate set of PSSMs. We also show that MEDUSA retrieves many experimentally confirmed binding sites associated with environmental stress response from the literature.Comment: RECOMB 200

arXiv.org e-Print Archive

CiteSeerX

Crossref

A Robust Deep Model for Improved Categorization of Legal Documents for Predictive Analytics

Author: Mohan Divya
Nair Latha Ravindran
Publication venue: Auricle Global Society of Education and Research
Publication date: 11/03/2023
Field of study

Predictive legal analytics is a technology used to predict the chances of successful and unsuccessful outcomes in a particular case. Predictive legal analytics is performed through automated document classification for facilitating legal experts in their classification of court documents to retrieve and understand the details of specific legal factors from legal judgments for accurate document analysis. However, extracting these factors from legal texts document is a time-consuming process. In order to facilitate the task of classifying documents, a robust method namely Distributed Stochastic Keyword Extraction based Ensemble Theil-Sen Regressive Deep Belief Reweight Boost Classification (DSKE-TRDBRBC) is proposed. The DSKE-TRDBRBC technique consists of two major processes namely Keyword Extraction and Classification. At first, the t-distributed stochastic neighbor embedding technique is applied to DSKE-TRDBRBC for keyword extraction. This in turn minimizes the time consumption for document classification. After that, the Ensemble Theil-Sen Regressive Deep Belief Reweight Boosting technique is applied for document classification. The Ensemble boosting algorithm initially constructs’ set of Theil-Sen Regressive Deep Belief neural networks to classify the input legal documents. Then the results of the Deep Belief neural network are combined to built a strong classifier by reducing the error. This aids in improving the classification accuracy. The proposed method is experimentally evaluated with various metrics such as F-measure , recall, accuracy, precision, , and computational time. The experimental results quantitatively confirm that the proposed DSKE-TRDBRBC technique achieves better accuracy with lowest computation time as compared to the conventional approaches

International Journal on Recent and Innovation Trends in Computing and Communication

Ensemble deep learning: A review

Author: Ganaie M. A.
Hu Minghui
Malik A. K.
Suganthan P. N.
Tanveer M.
Publication venue
Publication date: 06/04/2021
Field of study

Ensemble learning combines several individual models to obtain better generalization performance. Currently, deep learning models with multilayer processing architecture is showing better performance as compared to the shallow or traditional classification models. Deep ensemble learning models combine the advantages of both the deep learning models as well as the ensemble learning such that the final model has better generalization performance. This paper reviews the state-of-art deep ensemble models and hence serves as an extensive summary for the researchers. The ensemble models are broadly categorised into ensemble models like bagging, boosting and stacking, negative correlation based deep ensemble models, explicit/implicit ensembles, homogeneous /heterogeneous ensemble, decision fusion strategies, unsupervised, semi-supervised, reinforcement learning and online/incremental, multilabel based deep ensemble models. Application of deep ensemble models in different domains is also briefly discussed. Finally, we conclude this paper with some future recommendations and research directions

arXiv.org e-Print Archive

Qatar University Institutional Repository

Histopathological image analysis : a review

Author: Boucheron Laura E.
Can Ali
Gurcan Metin N.
Madabhushi Anant
Rajpoot Nasir M.
Yener Bülent
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

Over the past decade, dramatic increases in computational power and improvement in image analysis algorithms have allowed the development of powerful computer-assisted analytical approaches to radiological data. With the recent advent of whole slide digital scanners, tissue histopathology slides can now be digitized and stored in digital image form. Consequently, digitized tissue histopathology has now become amenable to the application of computerized image analysis and machine learning techniques. Analogous to the role of computer-assisted diagnosis (CAD) algorithms in medical imaging to complement the opinion of a radiologist, CAD algorithms have begun to be developed for disease detection, diagnosis, and prognosis prediction to complement the opinion of the pathologist. In this paper, we review the recent state of the art CAD technology for digitized histopathology. This paper also briefly describes the development and application of novel image analysis technology for a few specific histopathology related problems being pursued in the United States and Europe

Crossref

PubMed Central

Warwick Research Archives Portal Repository

Modeling Financial Time Series with Artificial Neural Networks

Author: Versace Massimiliano
Wong Charles
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 15/12/2009
Field of study

Financial time series convey the decisions and actions of a population of human actors over time. Econometric and regressive models have been developed in the past decades for analyzing these time series. More recently, biologically inspired artificial neural network models have been shown to overcome some of the main challenges of traditional techniques by better exploiting the non-linear, non-stationary, and oscillatory nature of noisy, chaotic human interactions. This review paper explores the options, benefits, and weaknesses of the various forms of artificial neural networks as compared with regression techniques in the field of financial time series analysis.CELEST, a National Science Foundation Science of Learning Center (SBE-0354378); SyNAPSE program of the Defense Advanced Research Project Agency (HR001109-03-0001

Boston University Institutional Repository (OpenBU)

Distance-based and ranking methods for preference rankings, preference-approvals and textual analysis

Author: ALBANO ALESSANDRO
Publication venue: place:Palermo
Publication date: 06/12/2022
Field of study

Archivio istituzionale della ricerca - Università di Palermo

Improving Image Contrastive Clustering Through Self-Learning Pairwise Constraints

Author: Bai Liang
Guo Yecheng
Liang Jiye
Yang Xian
Publication venue
Publication date: 15/11/2023
Field of study

The University of Manchester - Institutional Repository

Radiomics-based machine learning approach for the prediction of grade and stage in upper urinary tract urothelial carcinoma:a step towards virtual biopsy

Author: Almopti Abdulrahman
Alqahtani Abdulsalam
Bhattacharjee Sourav
Li Chunhui
Nabi Ghulam
Publication venue
Publication date: 01/06/2024
Field of study

Objectives: Upper tract urothelial carcinoma (UTUC) is a rare, aggressive lesion, with early detection a key to its management. This study aimed to utilise computed tomographic urogram data to develop machine learning models for predicting tumour grading and staging in upper urothelial tract carcinoma patients and to compare these predictions with histopathological diagnosis used as reference standards.Methods: Protocol-based computed tomographic urogram data from 106 patients were obtained and visualised in 3D. Digital segmentation of the tumours was conducted by extracting textural radiomics features. They were further classified using 11 predictive models. The predicted grades and stages were compared to the histopathology of radical nephroureterectomy specimens.Results: Classifier models worked well in mining the radiomics data and delivered satisfactory predictive machine learning models. The multilayer panel showed 84% sensitivity and 93% specificity while predicting UTUC grades. The Logistic Regression model showed a sensitivity of 83% and a specificity of 76% while staging. Similarly, other classifier algorithms [e.g. Support Vector classifier (SVC)] provided a highly accurate prediction while grading UTUC compared to clinical features alone or ureteroscopic biopsy histopathology.Conclusion: Data mining tools could handle medical imaging datasets from small (<2 cm) tumours for UTUC. The radiomics-based machine learning algorithms provide a potential tool to model tumour grading and staging with implications for clinical practice and the upgradation of current paradigms in cancer diagnostics.Clinical Relevance: Machine learning based on radiomics features can predict upper tract urothelial cancer grading and staging with significant improvement over ureteroscopic histopathology. The study showcased the prowess of such emerging tools in the set objectives with implications towards virtual biopsy

University of Dundee Online Publications