Search CORE

77 research outputs found

Handwritten digit recognition by bio-inspired hierarchical networks

Author: D. Hubel
D.O. Hebb
G. Dileep
H. Makino
J. Cleary
J. DiCarlo
J. Rissanen
L.E. Baum
L.E. Baum
N. Takahashi
P. Bühlmann
R. Begleiter
R.W. Hamming
S. Kotsiantis
T. Branco
T. Kleindienst
W. Teahan
W.B. Powell
Y. LeCun
Publication venue
Publication date: 06/11/2012
Field of study

The human brain processes information showing learning and prediction abilities but the underlying neuronal mechanisms still remain unknown. Recently, many studies prove that neuronal networks are able of both generalizations and associations of sensory inputs. In this paper, following a set of neurophysiological evidences, we propose a learning framework with a strong biological plausibility that mimics prominent functions of cortical circuitries. We developed the Inductive Conceptual Network (ICN), that is a hierarchical bio-inspired network, able to learn invariant patterns by Variable-order Markov Models implemented in its nodes. The outputs of the top-most node of ICN hierarchy, representing the highest input generalization, allow for automatic classification of inputs. We found that the ICN clusterized MNIST images with an error of 5.73% and USPS images with an error of 12.56%

arXiv.org e-Print Archive

Crossref

Artificial Sequences and Complexity Measures

In this paper we exploit concepts of information theory to address the fundamental problem of identifying and defining the most suitable tools to extract, in a automatic and agnostic way, information from a generic string of characters. We introduce in particular a class of methods which use in a crucial way data compression techniques in order to define a measure of remoteness and distance between pairs of sequences of characters (e.g. texts) based on their relative information content. We also discuss in detail how specific features of data compression techniques could be used to introduce the notion of dictionary of a given sequence and of Artificial Text and we show how these new tools can be used for information extraction purposes. We point out the versatility and generality of our method that applies to any kind of corpora of character strings independently of the type of coding behind them. We consider as a case study linguistic motivated problems and we present results for automatic language recognition, authorship attribution and self consistent-classification.Comment: Revised version, with major changes, of previous "Data Compression approach to Information Extraction and Classification" by A. Baronchelli and V. Loreto. 15 pages; 5 figure

arXiv.org e-Print Archive

City Research Online

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Differences between Human Plasma and Serum Metabolite Profiles

Author: A Doring
Alexey Polonikov
Annette Peters
AS Jaffe
C Gieger
Cornelia Prehn
DB Sacks
F Kronenberg
F Mannello
Florian Kronenberg
G Zhai
Gabi Kastenmüller
Gabriele Möller
H. -Erich Wichmann
Holger Prokisch
J Aoki
Jerzy Adamski
JH Ladenson
Joaquim Mendes
JR Denery
K Bando
K Dettmer
K Suhre
Karsten Suhre
L Liu
Lu Xie
M Kolz
Matej Oresic
N Psychogios
Norbert Dahmen
O Teahan
Petra Belcredi
R Holle
R Kaddurah-Daouk
R Wang-Sattler
RB Schnabel
Rui Wang-Sattler
S Barelli
Simone Wahl
T Illig
T Teerlink
Thomas Illig
Uta Ceglarek
W Römisch-Margl
Werner Roemisch-Margl
Y Yatomi
Ying He
Yixue Li
Zhonghao Yu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

BACKGROUND: Human plasma and serum are widely used matrices in clinical and biological studies. However, different collecting procedures and the coagulation cascade influence concentrations of both proteins and metabolites in these matrices. The effects on metabolite concentration profiles have not been fully characterized. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed the concentrations of 163 metabolites in plasma and serum samples collected simultaneously from 377 fasting individuals. To ensure data quality, 41 metabolites with low measurement stability were excluded from further analysis. In addition, plasma and corresponding serum samples from 83 individuals were re-measured in the same plates and mean correlation coefficients (r) of all metabolites between the duplicates were 0.83 and 0.80 in plasma and serum, respectively, indicating significantly better stability of plasma compared to serum (p = 0.01). Metabolite profiles from plasma and serum were clearly distinct with 104 metabolites showing significantly higher concentrations in serum. In particular, 9 metabolites showed relative concentration differences larger than 20%. Despite differences in absolute concentration between the two matrices, for most metabolites the overall correlation was high (mean r = 0.81±0.10), which reflects a proportional change in concentration. Furthermore, when two groups of individuals with different phenotypes were compared with each other using both matrices, more metabolites with significantly different concentrations could be identified in serum than in plasma. For example, when 51 type 2 diabetes (T2D) patients were compared with 326 non-T2D individuals, 15 more significantly different metabolites were found in serum, in addition to the 25 common to both matrices. CONCLUSIONS/SIGNIFICANCE: Our study shows that reproducibility was good in both plasma and serum, and better in plasma. Furthermore, as long as the same blood preparation procedure is used, either matrix should generate similar results in clinical and biological studies. The higher metabolite concentrations in serum, however, make it possible to provide more sensitive results in biomarker detection

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

PuSH

Evaluation and Characterization of Bacterial Metabolic Dynamics with a Novel Profiling Technique, Real-Time Metabolotyping

Author: A Craig
A Wahl
AD Maher
AJ Ragauskas
AL Tang
AM Weljie
AR Neves
B Teusink
B Watzl
BJ Blaise
BS Samuel
BS Shane
C O'Mahony
C Tian
CN Jacobsen
CR Kepler
CR Kepler
CR Kepler
DB Kell
DC Baumgart
E Chikayama
E Holmes
Eisuke Chikayama
F Backhed
F Delaglio
F Dieterle
F Diez-Gonzalez
F Pizarro
G Vinderola
H Morita
H Putaala
Hiroshi Ohno
IA Lewis
J Kikuchi
J Kikuchi
J Kraft
J Lehar
JB Ewaschuk
JC Portais
JK Nicholson
JP Grivet
Jun Kikuchi
K Akiyama
K Koba
K Kurokawa
L Brecker
L Wen
LM Raamsdonk
LV Hooper
M Assfalg
M Igarashi
M Lecuit
M Li
M Tarnopolsky
Mariko Hatakeyama
MW Pariza
N Asanuma
N Ishii
NS Kelley
O Cloarec
O Cloarec
O Teahan
PD Majors
PJ Turnbaugh
PJ Turnbaugh
R Bonneau
R Suzuki
RG Shulman
RM Dawson
RT Eakin
S Fukuda
S Fukuda
S Fukuda
S Fukuda
S Fukuda
S Ohkawara
S Rakoff-Nahoum
S Tiziani
SG Villas-Boas
Shinji Fukuda
SR Gill
SY Lee
T Ogino
TA Clayton
TM Barbosa
Tsuneo Hino
TT Macdonald
V Ladero
VI Chalova
W Jia
W Li
WA Walker
Y Sekiyama
Yumiko Nakanishi
Z Serber
Publication venue: Public Library of Science
Publication date: 16/03/2009
Field of study

BACKGROUND: Environmental processes in ecosystems are dynamically altered by several metabolic responses in microorganisms, including intracellular sensing and pumping, battle for survival, and supply of or competition for nutrients. Notably, intestinal bacteria maintain homeostatic balance in mammals via multiple dynamic biochemical reactions to produce several metabolites from undigested food, and those metabolites exert various effects on mammalian cells in a time-dependent manner. We have established a method for the analysis of bacterial metabolic dynamics in real time and used it in combination with statistical NMR procedures. METHODOLOGY/PRINCIPAL FINDINGS: We developed a novel method called real-time metabolotyping (RT-MT), which performs sequential (1)H-NMR profiling and two-dimensional (2D) (1)H, (13)C-HSQC (heteronuclear single quantum coherence) profiling during bacterial growth in an NMR tube. The profiles were evaluated with such statistical methods as Z-score analysis, principal components analysis, and time series of statistical TOtal Correlation SpectroScopY (TOCSY). In addition, using 2D (1)H, (13)C-HSQC with the stable isotope labeling technique, we observed the metabolic kinetics of specific biochemical reactions based on time-dependent 2D kinetic profiles. Using these methods, we clarified the pathway for linolenic acid hydrogenation by a gastrointestinal bacterium, Butyrivibrio fibrisolvens. We identified trans11, cis13 conjugated linoleic acid as the intermediate of linolenic acid hydrogenation by B. fibrisolvens, based on the results of (13)C-labeling RT-MT experiments. In addition, we showed that the biohydrogenation of polyunsaturated fatty acids serves as a defense mechanism against their toxic effects. CONCLUSIONS: RT-MT is useful for the characterization of beneficial bacterium that shows potential for use as probiotic by producing bioactive compounds

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Mass-spectrometry-based metabolomics: limitations and recommendations for future progress with particular focus on nutrition research

Author: A Abdel-Sayed
A Bhattacharjee
A Chavez
A Fardet
A Fardet
A Fardet
A Jiye
AD Maher
AD Southam
AM Weljie
AR Pico
Augustin Scalbert
B Kristal
B Ommen van
B Ommen van
B Ommen van
Ben van Ommen
Bruce S. Kristal
C Chen
C Hu
C Rubingh
CA Daykin
CA Smith
CA Smith
CB Clish
CF Taylor
D Broadhurst
David Wishart
DI Ellis
DM Jacobs
DR Stoll
DS Wishart
DS Wishart
DS Wishart
E Altmaier
E Saude
EE Carlson
EE Carlson
EJ Saude
EJ Want
EL Doets
EL Ulrich
Elwin Verheij
ER Miller III
Estelle Pujos-Guillot
FPJ Martin
G Bjelakovic
GG Harrigan
H Kanani
H Shi
HG Gika
HG Gika
HL Shi
HY Huang
J Forshed
J Greef van der
J Greef van der
J Griffin
J Han
J Kopka
J Kopka
J Kuhl
J Lindon
J Saric
J Taylor
J Trygg
J Vogels
J Wang
J Westerhuis
J Wood
J Yang
JJ Jansen
JT Brindle
JW Newman
K Dettmer
K Guo
KA Lawton
KCM Verhoeckx
KE Vigneau-Callahan
KO Boernsen
L Afman
L Mennen
L Sumner
LC Kenny
LI Mennen
Lorraine Brennan
M Assfalg
M Katajamaa
M Katajamaa
M Lauridsen
M Meydani
M Müller
M Reich
M-E Dumas
MC Walsh
MC Walsh
ME Garber
MJ Gibney
MM Koek
MM Koek
MPS Brown
O Fiehn
O Fiehn
O Fiehn
O Hansson
O Shaham
O Teahan
Oliver Fiehn
P Dwivedi
P D’Haeseleer
P Garosi
P Tamayo
Q Shen
Q Sun
Q Sun
R Goodacre
R Goodacre
R Goodacre
R Kleemann
R Laaksonen
R Landberg
RA Berg van den
RA Blanco
RJAN Lamers
RP Tracy
RS Plumb
S Bruschi
S Moco
S Rezzi
S Rozen
S Tiziani
S Wold
S Wopereis
S-A Sansone
SA Sansone
SA Sansone
SJ Bruce
SK Drake
SM Watkins
SO Hagan
SS Wang
SU Bajad
Suzan Wopereis
T Soga
Thomas Hankemeier
U Paolucci
U Paolucci
V Emilsson
VV Tolstikov
VV Tolstikov
W Windig
XL Han
Y Iijima
Y Noguchi
Y Shurubor
Y Tikunov
Y Tominaga
YQ Chen
Publication venue: Springer US
Publication date: 01/01/2009
Field of study

Mass spectrometry (MS) techniques, because of their sensitivity and selectivity, have become methods of choice to characterize the human metabolome and MS-based metabolomics is increasingly used to characterize the complex metabolic effects of nutrients or foods. However progress is still hampered by many unsolved problems and most notably the lack of well established and standardized methods or procedures, and the difficulties still met in the identification of the metabolites influenced by a given nutritional intervention. The purpose of this paper is to review the main obstacles limiting progress and to make recommendations to overcome them. Propositions are made to improve the mode of collection and preparation of biological samples, the coverage and quality of mass spectrometry analyses, the extraction and exploitation of the raw data, the identification of the metabolites and the biological interpretation of the results

Crossref

Harvard University - DASH

Springer - Publisher Connector

HAL Clermont Université

PubMed Central

Universal entropy of word ordering across linguistic families

Author: AD Wyner
C Darwin
CE Shannon
CK Peng
CK Peng
D Zanette
Damián H. Zanette
DH Zanette
E Alvarez-Lacalle
GK Zipf
HS Heaps
I Kontoyiannis
J Maynard Smith
J Ziv
J Ziv
JH Greenberg
JH Greenberg
JW Kantelhardt
M Abramowitz
M Hollander
M Miestamo
M Ruhlen
MA Montemurro
MA Nowak
MA Nowak
Marcelo A. Montemurro
Michael Breakspear
MP Lewis
N Chomsky
QD Atkinson
RD Gray
SV Buldyrev
TM Cover
TM Cover
W Ebeling
W Ebeling
WJ Teahan
Y Gao
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

Background The language faculty is probably the most distinctive feature of our species, and endows us with a unique ability to exchange highly structured information. In written language, information is encoded by the concatenation of basic symbols under grammatical and semantic constraints. As is also the case in other natural information carriers, the resulting symbolic sequences show a delicate balance between order and disorder. That balance is determined by the interplay between the diversity of symbols and by their specific ordering in the sequences. Here we used entropy to quantify the contribution of different organizational levels to the overall statistical structure of language. Methodology/Principal Findings We computed a relative entropy measure to quantify the degree of ordering in word sequences from languages belonging to several linguistic families. While a direct estimation of the overall entropy of language yielded values that varied for the different families considered, the relative entropy quantifying word ordering presented an almost constant value for all those families. Conclusions/Significance Our results indicate that despite the differences in the structure and vocabulary of the languages analyzed, the impact of word ordering in the structure of language is a statistical linguistic universal

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

Open Research Online (The Open University)

PubMed Central

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

The University of Manchester - Institutional Repository

A modified data normalization method for GC-MS-based metabolomics to minimize batch variation

Author: AJ Jauhiainen
AM De Livera
BM Bolstad
C Deport
CD Broeckling
E Engel
FMVD Kloet
H Redestig
HG Gika
HH Kanani
HM Parsons
J Gullberg
LR Crawford
M Katajamaa
M Sysi-Aho
MP Styczynski
O Teahan
RA Tate
RH Liu
S Bijlsma
S Wagner
SE Stein
T Frenzel
T Sangster
TM Annesley
TPJ Linsinger
VM Asiago
W Wang
WB Dunn
WB Dunn
YI Shurubor
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

An Open Interface for Probabilistic Models of Text

Author: John G. Cleary
W. J. Teahan
Publication venue: Society Press
Publication date: 01/01/1999
Field of study

An Application Program Interface (API) for modelling sequential text is described. The API is intended to shield the user from details of the modelling and probability estimation process. This should enable different implementations of models to be replaced transparently in application programs. The motivation for this API is work on the use of textual models for applications in addition to strict data compression, e.g. determination of the source of text, spelling correction or segmentation of text by inserting spaces. The API is probabilistic: that is, it supplies the probability of the next symbol in the sequence. It is general enough to deal accurately with models that include escapes for probabilities. The concepts abstracted by the API are explained together with details of the API calls

CiteSeerX

Crossref