
31,584 research outputs found

An Overview of the Use of Neural Networks for Data Mining Tasks

Author: Alberts B
Alpaydin E
Ando T
Blake CL
Bramer MA
Castanheira LG
Han J
Lu H
Mitchell M
Ni X
Quinlan RJ
Rumelhart DE
Shafer JC
Shendure J
Simić D
Stahl F
Steinwart I
Surjandari I
Wei JS
Widrow B
Witten IH
Zaslavsky B
Zhang D
Publication venue: 'Wiley'
Publication date: 01/01/2012

In recent years, the area of data mining has experienced considerable demand for technologies that extract knowledge from large and complex data sources. There is substantial commercial interest, as well as active research, in developing new and improved approaches for extracting information, relationships, and patterns from datasets. Artificial Neural Networks (NN) are popular, biologically inspired intelligent methodologies whose classification, prediction, and pattern recognition capabilities have been utilised successfully in many areas, including science, engineering, medicine, business, banking, and telecommunication. This paper highlights, from a data mining perspective, the implementation of NN using supervised and unsupervised learning for pattern recognition, classification, prediction, and cluster analysis, and focuses the discussion on their usage in bioinformatics and financial data analysis tasks.

Central Archive at the University of Reading

Crossref

Portsmouth University Research Portal (Pure)

Bournemouth University Research Online

PRED-CLASS: cascading neural networks for generalized protein classification and genome-wide applications

Author: Adams
Aloy
Apweiler
Bairoch
Brenner
Fariselli
Finkelstein
Fischer
Hamodrakas
Haykin
Hobohm
Jones
Koehl
Levitt
Minsky
Murzin
Möeller
Nielsen
Pasquier
Pedersen
Petersen
Rice
Rost
Rumelhart
Sanchez
Schulz
Stevens
Vlahou
Wallin
Publication venue: 'Wiley'
Publication date: 01/01/2001

A cascading system of hierarchical artificial neural networks (named PRED-CLASS) is presented for the generalized classification of proteins into four distinct classes (transmembrane, fibrous, globular, and mixed) from information encoded solely in their amino acid sequences. The architecture of the individual component networks is kept very simple, reducing the number of free parameters (network synaptic weights) for faster training, improved generalization, and the avoidance of data overfitting. Capturing information from as few as 50 protein sequences spread among the four target classes (6 transmembrane, 10 fibrous, 13 globular, and 17 mixed), PRED-CLASS was able to obtain 371 correct predictions out of a set of 387 proteins (success rate approximately 96%) unambiguously assigned to one of the target classes. The application of PRED-CLASS to several test sets and complete proteomes of several organisms demonstrates that such a method could serve as a valuable tool in the annotation of genomic open reading frames with no functional assignment, or as a preliminary step in fold recognition and ab initio structure prediction methods. Detailed results obtained for various data sets and completed genomes, along with a web server running the PRED-CLASS algorithm, can be accessed over the World Wide Web at http://o2.biol.uoa.gr/PRED-CLAS
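The cascading idea in the abstract above can be sketched roughly as follows. This is only an illustration, not the PRED-CLASS implementation: the stage order, the function names, and the toy acceptance rules (simple residue-composition thresholds) are all assumptions standing in for the trained component networks, which the abstract does not specify. The last line merely re-derives the reported success rate from the counts given in the abstract.

```python
from typing import Callable, List, Tuple

# A stage pairs a class label with a yes/no decision function.
# In a real cascade each decision would come from a trained network;
# the rules below are hypothetical placeholders.
Stage = Tuple[str, Callable[[str], bool]]


def fraction(seq: str, residues: str) -> float:
    """Fraction of positions in `seq` occupied by any residue in `residues`."""
    if not seq:
        return 0.0
    wanted = set(residues)
    return sum(r in wanted for r in seq) / len(seq)


def cascade_classify(seq: str, stages: List[Stage], default: str) -> str:
    """Pass the sequence through the stages in order; the first stage
    that accepts it assigns the label, otherwise `default` is returned."""
    for label, accepts in stages:
        if accepts(seq):
            return label
    return default


# Hypothetical stage order and thresholds (assumptions, not from the paper).
STAGES: List[Stage] = [
    ("transmembrane", lambda s: fraction(s, "AILMFVW") > 0.6),
    ("fibrous", lambda s: fraction(s, "G") > 0.3),
    ("globular", lambda s: fraction(s, "DEKR") > 0.2),
]

# Success rate from the counts reported in the abstract.
success_rate_percent = round(371 / 387 * 100, 1)
```

A call such as `cascade_classify(sequence, STAGES, default="mixed")` returns the label of the first accepting stage. Keeping each stage a small binary decision mirrors the abstract's point about limiting the number of free parameters per network.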

arXiv.org e-Print Archive

Crossref

Prediction of peptide and protein propensity for amyloid formation

Author: A Quintas
A Trovato
AC Davison
AC Tsolis
Alexandre Quintas
AM Fernandez-Escamilla
AP Pawar
AV Finkelstein
B Rost
C Nerelius
Carlos Família
CM Dobson
D Eisenberg
David A. Phoenix
DJ Selkoe
DM Fowler
Eugene A. Permyakov
F Chiti
F Sasagawa
GG Tartaglia
H Hu
I Cherny
I Walsh
IV Baskakov
J Palau
J Tian
JC Rochet
JD Sipe
JM Zimmerman
JW Kelly
K Rajagopal
KF DuBay
KK Frousios
KT O’Neil
L Goldschmidt
LO Jimenez
M Belli
M Emily
M Hollander
M Kuhn
M López de la Paz
M Oliveberg
M Stefani
M Sunde
M Zamani
MB Kursa
MJ Thompson
MT Pastor
N Becker
N Qian
O Conchillo-Solé
PK Teng
PY Chou
RS Harrison
S Idicula-thomas
S Kawashima
S Maurer-Stroh
S Ventura
S Yoon
Sarah R. Dennison
SJ Hamodrakas
SK Maji
SO Garbuzynskiy
T Hothorn
T Scheibel
TPJ Knowles
VS Mathura
WH DePas
WT Astbury
Y Kallberg
Ž Eva
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 09/07/2014

Our understanding of which peptides and proteins have the potential to undergo amyloid formation, and of what driving forces are responsible for amyloid-like fiber formation and stabilization, remains limited. This is mainly because proteins that can undergo the structural changes leading to amyloid formation are quite diverse and share no obvious sequence or structural homology, despite the structural similarity found in the fibrils. To address these issues, a novel approach based on recursive feature selection and feed-forward neural networks was undertaken to identify key features highly correlated with the self-assembly problem. This approach allowed the identification of seven physicochemical and biochemical properties of the amino acids highly associated with the self-assembly of peptides and proteins into amyloid-like fibrils (normalized frequency of β-sheet, normalized frequency of β-sheet from LG, weights for β-sheet at the window position of 1, isoelectric point, atom-based hydrophobic moment, helix termination parameter at position j+1, and ΔG° values for peptides extrapolated in 0 M urea). Moreover, these features enabled the development of a new predictor (available at http://cran.r-project.org/web/packages/appnn/index.html) capable of accurately and reliably predicting the amyloidogenic propensity from the polypeptide sequence alone, with a prediction accuracy of 84.9% against an external validation dataset of sequences with experimental in vitro evidence of amyloid formation.
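As a rough illustration of what recursive feature selection looks like in practice, the sketch below repeatedly drops the feature whose removal hurts a scoring function least. It is not the authors' method or the appnn package's code: backward elimination is only one way to do recursive selection, a nearest-centroid classifier is swapped in for the feed-forward neural network to keep the example dependency-free, scoring on the training data is a simplification, and all names and the toy data are hypothetical.

```python
from typing import Callable, List, Sequence

Row = Sequence[float]


def centroid_accuracy(X: List[Row], y: List[int], features: List[int]) -> float:
    """Training accuracy of a nearest-centroid classifier restricted to
    `features`. Stand-in scorer; the abstract's approach uses neural networks."""
    labels = sorted(set(y))
    centroids = {}
    for lab in labels:
        rows = [x for x, t in zip(X, y) if t == lab]
        centroids[lab] = [sum(r[f] for r in rows) / len(rows) for f in features]
    correct = 0
    for x, t in zip(X, y):
        def sq_dist(lab: int) -> float:
            return sum((x[f] - c) ** 2 for f, c in zip(features, centroids[lab]))
        # On ties, min() keeps the first (smallest) label.
        correct += min(labels, key=sq_dist) == t
    return correct / len(X)


def recursive_elimination(
    X: List[Row],
    y: List[int],
    n_keep: int,
    score: Callable[[List[Row], List[int], List[int]], float] = centroid_accuracy,
) -> List[int]:
    """Backward elimination: drop one feature per round, choosing the one
    whose removal leaves the highest score, until `n_keep` features remain."""
    features = list(range(len(X[0])))
    while len(features) > n_keep:
        candidates = [
            (score(X, y, [f for f in features if f != drop]), -drop, drop)
            for drop in features
        ]
        # Highest remaining score wins; ties drop the lowest-indexed feature.
        features.remove(max(candidates)[2])
    return features


# Toy data (hypothetical): column 0 separates the classes, columns 1-2 do not.
X_toy = [[0.0, 5.0, 1.0], [0.1, 1.0, 5.0], [1.0, 5.0, 1.0], [0.9, 1.0, 5.0]]
y_toy = [0, 0, 1, 1]
selected = recursive_elimination(X_toy, y_toy, n_keep=1)
```

On the toy data, only the informative column survives. In a realistic setting the scorer would be a cross-validated model evaluated on held-out sequences rather than training accuracy.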

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

FigShare

A swarm intelligence framework for reconstructing gene networks: searching for biologically plausible architectures

Author: Kentzoglanakis Kyriakos
Poole Matthew
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/05/2011

Portsmouth University Research Portal (Pure)

PaperRobot: Incremental Draft Generation of Scientific Ideas

Author: Bansal Mohit
Huang Lifu
Ji Heng
Jiang Zhiying
Knight Kevin
Luan Yi
Wang Qingyun
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2019

We present PaperRobot, which performs as an automatic research assistant by (1) conducting deep understanding of a large collection of human-written papers in a target domain and constructing comprehensive background knowledge graphs (KGs); (2) creating new ideas by predicting links from the background KGs, combining graph attention and contextual text attention; and (3) incrementally writing some key elements of a new paper based on memory-attention networks: from the input title along with predicted related entities to generate a paper abstract, from the abstract to generate conclusion and future work, and finally from future work to generate a title for a follow-on paper. Turing Tests, where a biomedical domain expert is asked to compare a system output and a human-authored string, show that PaperRobot-generated abstracts, conclusion and future work sections, and new titles are chosen over human-written ones up to 30%, 24% and 12% of the time, respectively. Comment: 12 pages. Accepted by ACL 2019. Code and resources are available at https://github.com/EagleW/PaperRobo

arXiv.org e-Print Archive

Crossref