Search CORE

arXiv.org e-Print Archive

Deterministic polarization chaos from a laser diode

Author: A Argyris
A Garfinkel
A Provenzale
A Scirè
A Uchida
A Wolf
AA Andronov
B Nagler
C Bracikowski
CO Weiss
D Rontani
DS Coffey
EN Lorenz
F Albert
F Hopfer
F Meier
FT Arecchi
GD VanWiggeren
GD VanWiggeren
H Haken
H Kawaguchi
H Poincaré
HDI Abarbanel
Hugo Thienpont
I Kanter
I Kanter
J Martin-Regalado
J Renaudier
JR Tredicce
K Fraedrich
K Kubodera
K Otsuka
Krassimir Panajotov
L Larger
L Olejniczak
LR Rabiner
M San Miguel
M Sciamanna
M Sondermann
Marc Sciamanna
Martin Virte
MB Willemsen
N Oliver
NC Gerhardt
P Grassberger
S Hovel
S Steingrube
SF Yu
SH Strogatz
T Adachi
T Mukai
T Yamada
T-Y Li
TB Simpson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Fifty years after the invention of the laser diode and fourty years after the report of the butterfly effect - i.e. the unpredictability of deterministic chaos, it is said that a laser diode behaves like a damped nonlinear oscillator. Hence no chaos can be generated unless with additional forcing or parameter modulation. Here we report the first counter-example of a free-running laser diode generating chaos. The underlying physics is a nonlinear coupling between two elliptically polarized modes in a vertical-cavity surface-emitting laser. We identify chaos in experimental time-series and show theoretically the bifurcations leading to single- and double-scroll attractors with characteristics similar to Lorenz chaos. The reported polarization chaos resembles at first sight a noise-driven mode hopping but shows opposite statistical properties. Our findings open up new research areas that combine the high speed performances of microcavity lasers with controllable and integrated sources of optical chaos.Comment: 13 pages, 5 figure

HAL-CentraleSupelec

Preparation of name and address data for record linkage using hidden Markov models

Author: A McCallum
A Rigo
AP Dempster
B Aldelberg
C Barrett
D Carnall
D Freitag
D Freitag
DL Ellsworth
DM Bikel
G van Rossum
GD Forney
Justin Xi Zhu
K Seymore
Kim Lim
L Gill
L Gill
L Rabiner
LJ Cook
LL Roos
LR Rabiner
MatchWare Technologies
ME Califf
MJ Khoury
National Center for Biotechnology Information
New South Wales Department of Health
P Armitage
P Christen
P-S Laplace
Peter Christen
Public Health Division
S Soderland
SE Levinson
SF Altschul
Tim Churches
TR Leek
V Borkar
WE Winkler
Publication venue: BioMed Central
Publication date: 01/12/2002
Field of study

BACKGROUND: Record linkage refers to the process of joining records that relate to the same entity or event in one or more data collections. In the absence of a shared, unique key, record linkage involves the comparison of ensembles of partially-identifying, non-unique data items between pairs of records. Data items with variable formats, such as names and addresses, need to be transformed and normalised in order to validly carry out these comparisons. Traditionally, deterministic rule-based data processing systems have been used to carry out this pre-processing, which is commonly referred to as "standardisation". This paper describes an alternative approach to standardisation, using a combination of lexicon-based tokenisation and probabilistic hidden Markov models (HMMs). METHODS: HMMs were trained to standardise typical Australian name and address data drawn from a range of health data collections. The accuracy of the results was compared to that produced by rule-based systems. RESULTS: Training of HMMs was found to be quick and did not require any specialised skills. For addresses, HMMs produced equal or better standardisation accuracy than a widely-used rule-based system. However, acccuracy was worse when used with simpler name data. Possible reasons for this poorer performance are discussed. CONCLUSION: Lexicon-based tokenisation and HMMs provide a viable and effort-effective alternative to rule-based systems for pre-processing more complex variably formatted data such as addresses. Further work is required to improve the performance of this approach with simpler data such as names. Software which implements the methods described in this paper is freely available under an open source license for other researchers to use and improve

ANU Digital Collections

The Australian National University

DWT and LPC based feature extraction methods for isolated word recognition

Author: AE Rosenberg
B Kotnik
DS Pallett
F Itakura
H Hermansky
H Hermansky
J Xu
JN Gowdy
K Wang
KP Soman
L Rabiner
M Gupta
M Krishnan
MJF Gales
Navnath S Nehe
NS Nehe
O Farooq
O Farooq
Raghunath S Holambe
S Mallat
SB Davis
SF Boll
Y Hao
Z Tufekci
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Linear predictive coding representation of correlated mutation for protein sequence alignment

Author: A Elofsson
AG Murzin
AS Yang
BC Lee
Chan-seok Jeong
CM Buslje
D Cozzetto
Dongsup Kim
DT Jones
E Neher
ER Tillier
G Shackelford
GJ Bartlett
GM Süel
J Kleinjung
J Kopp
J Söding
JM Chandonia
JP Dekker
LR Rabiner
M Lee
N Siew
O Olmea
S Wu
SD Dunn
SF Altschul
SW Lockless
T Ohlson
T Pham
U Göbel
WR Atchley
Y Qi
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Preferentially Quantized Linker DNA Lengths in Saccharomyces cerevisiae

Author: A Bertin
A Dempster
AB Cohanim
AB Cohanim
BW Silverman
D Lohr
E Segal
Eran Segal
F Strauss
F Strauss
Gary Stormo
GC Yuan
Guei-Feng Tsai
I Albert
J Thomas
J Widom
J Widom
Ji-Ping Wang
Jonathan Widom
JPZ Wang
K Luger
KE van Holde
LE Ulanovsky
Liqun Xi
LR Rabiner
M Kato
PJ Robinson
R Durbin
S Satchwell
S Satchwell
SF Altschul
T Schalch
W Hörz
WK Olson
Y Cui
Yvonne Fondufe-Mittendorf
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

The exact lengths of linker DNAs connecting adjacent nucleosomes specify the intrinsic three-dimensional structures of eukaryotic chromatin fibers. Some studies suggest that linker DNA lengths preferentially occur at certain quantized values, differing one from another by integral multiples of the DNA helical repeat, ∼10 bp; however, studies in the literature are inconsistent. Here, we investigate linker DNA length distributions in the yeast Saccharomyces cerevisiae genome, using two novel methods: a Fourier analysis of genomic dinucleotide periodicities adjacent to experimentally mapped nucleosomes and a duration hidden Markov model applied to experimentally defined dinucleosomes. Both methods reveal that linker DNA lengths in yeast are preferentially periodic at the DNA helical repeat (∼10 bp), obeying the forms 10n+5 bp (integer n). This 10 bp periodicity implies an ordered superhelical intrinsic structure for the average chromatin fiber in yeast

CiteSeerX

Model based dynamics analysis in live cell microtubule images

Author: A Altinok
A Goncalves
A Janulevicius
A Krogh
A Rao
Alphan Altınok
Austin J Peck
B Alberts
BS Manjunath
C Bystroff
D Panda
Erkan Kiris
G Danuser
G Margolin
J Sethian
JC Augustinack
JM Bunker
JM Bunker
K Kamath
K Karplus
K Shafique
Kenneth Rose
L Rabiner
Leslie Wilson
LR Bahl
M Bicego
M Brand
M El-Saban
M Jiang
ML Gupta
P Maddox
R Durbin
RA Crowther
RA Walker
S Hadjidemetriou
SC Feinstein
SF Levy
Stuart C Feinstein
T Crowther
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background: The dynamic growing and shortening behaviors of microtubules are central to the fundamental roles played by microtubules in essentially all eukaryotic cells. Traditionally, microtubule behavior is quantified by manually tracking individual microtubules in time-lapse images under various experimental conditions. Manual analysis is laborious, approximate, and often offers limited analytical capability in extracting potentially valuable information from the data. Results: In this work, we present computer vision and machine-learning based methods for extracting novel dynamics information from time-lapse images. Using actual microtubule data, we estimate statistical models of microtubule behavior that are highly effective in identifying common and distinct characteristics of microtubule dynamic behavior. Conclusion: Computational methods provide powerful analytical capabilities in addition to traditional analysis methods for studying microtubule dynamic behavior. Novel capabilities, such as building and querying microtubule image databases, are introduced to quantify and analyze microtubule dynamic behavior

OpenMETU (Middle East Technical University)

Prediction of protein binding sites in protein structures using hidden Markov support vector machine

Author: A Henschel
A Koike
A Kouranov
A Porollo
A Rossi
AJ Bordner
B Wang
Bin Liu
Buzhou Tang
C Chothia
C Yan
C Yan
C-T Chen
C-W Cheng
H Chen
H Kim
H Neuvirth
H-X Zhou
HX Zhou
I Ezkurdia
I Res
I Tsochantaridis
I Tsochantaridis
J Lafferty
J Song
J Song
J-L Chung
JD Fischer
JL Chung
JR Bradford
JW Torrance
K Henrick
L Holm
L Lo Conte
L Wang
Lei Lin
LR Rabiner
M Gribskov
M Vincent
M Šikić
MH Li
N Li
NJ Burgoyne
P Fariselli
Q Dong
Qiwen Dong
S Ahmad
S Liang
S Qin
SF Altschul
SF Altschul
T Joachims
T Zhang
TH Dang
W Kabsch
WK Kim
X-w Chen
Xiaolong Wang
Xuan Wang
Y Altun
Y Liu
Y Ofran
Y Ofran
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Predicting the binding sites between two interacting proteins provides important clues to the function of a protein. Recent research on protein binding site prediction has been mainly based on widely known machine learning techniques, such as artificial neural networks, support vector machines, conditional random field, etc. However, the prediction performance is still too low to be used in practice. It is necessary to explore new algorithms, theories and features to further improve the performance. Results In this study, we introduce a novel machine learning model hidden Markov support vector machine for protein binding site prediction. The model treats the protein binding site prediction as a sequential labelling task based on the maximum margin criterion. Common features derived from protein sequences and structures, including protein sequence profile and residue accessible surface area, are used to train hidden Markov support vector machine. When tested on six data sets, the method based on hidden Markov support vector machine shows better performance than some state-of-the-art methods, including artificial neural networks, support vector machines and conditional random field. Furthermore, its running time is several orders of magnitude shorter than that of the compared methods. Conclusion The improved prediction performance and computational efficiency of the method based on hidden Markov support vector machine can be attributed to the following three factors. Firstly, the relation between labels of neighbouring residues is useful for protein binding site prediction. Secondly, the kernel trick is very advantageous to this field. Thirdly, the complexity of the training step for hidden Markov support vector machine is linear with the number of training samples by using the cutting-plane algorithm.</p

ScholarBank@NUS

Improved annotation with de novo transcriptome assembly in four social amoeba species

Author: AJ Heidel
AM Waterhouse
B Langmead
B Ma
BJ Haas
C Burge
C Cole
C Gissi
C Schilde
C Trapnell
Christian Cole
Christina Schilde
CP Ponting
EM Quinn
F Jeanmougin
FA Simão
G Glöckner
G Rot
Geoffrey J. Barton
Gernot Glöckner
GS Slater
Hajara M. Lawal
I Letunic
JP Huelsenbeck
JW Nicol
KE Hayer
L Eichinger
LR Rabiner
M Felder
M Stanke
M Yandell
MA Hassan
MD Macmanes
MG Grabherr
MH Schulz
ML Metzker
NJ Schurch
Pauline Schaap
PSG Chain
R Piskol
R Smith-Unna
Reema Singh
RL Chisholm
RS Young
S Kumar
SF Altschul
T Steijger
TBK Reddy
TD Wu
TJ Jentsch
W Xue
WH Majoros
WJ Kent
YL Xie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/05/2016
Field of study

Background: Annotation of gene models and transcripts is a fundamental step in genome sequencing projects. Often this is performed with automated prediction pipelines, which can miss complex and atypical genes or transcripts. RNA sequencing (RNA-seq) data can aid the annotation with empirical data. Here we present de novo transcriptome assemblies generated from RNA-seq data in four Dictyostelid species: D. discoideum, P. pallidum, D. fasciculatum and D. lacteum. The assemblies were incorporated with existing gene models to determine corrections and improvement on a whole-genome scale. This is the first time this has been performed in these eukaryotic species. Results: An initial de novo transcriptome assembly was generated by Trinity for each species and then refined with Program to Assemble Spliced Alignments (PASA). The completeness and quality were assessed with the Benchmarking Universal Single-Copy Orthologs (BUSCO) and Transrate tools at each stage of the assemblies. The final datasets of 11,315-12,849 transcripts contained 5,610-7,712 updates and corrections to >50% of existing gene models including changes to hundreds or thousands of protein products. Putative novel genes are also identified and alternative splice isoforms were observed for the first time in P. pallidum, D. lacteum and D. fasciculatum. Conclusions: In taking a whole transcriptome approach to genome annotation with empirical data we have been able to enrich the annotations of four existing genome sequencing projects. In doing so we have identified updates to the majority of the gene annotations across all four species under study and found putative novel genes and transcripts which could be worthy for follow-up. The new transcriptome data we present here will be a valuable resource for genome curators in the Dictyostelia and we propose this effective methodology for use in other genome annotation projects

Kölner UniversitätsPublikationsServer

University of Dundee Online Publications

Detection of recurrent copy number alterations in the genome: taking among-subject heterogeneity seriously

Author: A Ben-Dor
A Misra
AB Olshen
AJ Aguirre
B Weir
B Ylstra
C Barnes
C Lee
C Lengauer
C Rouveirol
C Xie
D Lipson
D Pinkel
E Douglas
G Tonon
H Willenbrock
I Ionita-Laza
J Bilmes
J Gonzalez
J Huang
J Liu
J Liu
J Pollack
J Sebat
JH Kim
JM Kidd
JMM Korn
JO Korbel
JR Lupski
JS Beckmann
K Nakao
K Wang
KA Frazer
LA Garraway
LDD Wood
LR Rabiner
LV Wain
M Guttman
MAvan de Wiel
NP Carter
O Cappé
OM Rueda
OM Rueda
OM Rueda
Oscar M Rueda
R Beroukhim
R Redon
Ramon Diaz-Uriarte
S Diskin
S Lee
S Scott
S Shah
SA McCarroll
SF Chin
SP Shah
T Laframboise
W Sun
WNN Van Wieringen
WRR Lai
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Se adjunta un fichero pdf con los datos de investigación titulado "Supplementary Material for \Detection of Recurrent Copy Number Alterations in the Genome: taking among-subject heterogeneity seriously"Background: Alterations in the number of copies of genomic DNA that are common or recurrent among diseased individuals are likely to contain disease-critical genes. Unfortunately, defining common or recurrent copy number alteration (CNA) regions remains a challenge. Moreover, the heterogeneous nature of many diseases requires that we search for common or recurrent CNA regions that affect only some subsets of the samples (without knowledge of the regions and subsets affected), but this is neglected by most methods. Results: We have developed two methods to define recurrent CNA regions from aCGH data. Our methods are unique and qualitatively different from existing approaches: they detect regions over both the complete set of arrays and alterations that are common only to some subsets of the samples (i.e., alterations that might characterize previously unknown groups); they use probabilities of alteration as input and return probabilities of being a common region, thus allowing researchers to modify thresholds as needed; the two parameters of the methods have an immediate, straightforward, biological interpretation. Using data from previous studies, we show that we can detect patterns that other methods miss and that researchers can modify, as needed, thresholds of immediate interpretability and develop custom statistics to answer specific research questions. Conclusion: These methods represent a qualitative advance in the location of recurrent CNA regions, highlight the relevance of population heterogeneity for definitions of recurrence, and can facilitate the clustering of samples with respect to patterns of CNA. Ultimately, the methods developed can become important tools in the search for genomic regions harboring disease-critical genesFunding provided by Fundación de Investigación Médica Mutua Madrileña. Publication charges covered by projects CONSOLIDER: CSD2007-00050 of the Spanish Ministry of Science and Innovation and by RTIC COMBIOMED RD07/0067/0014 of the Spanish Health Ministr