
244 research outputs found

Computing vibrational eigenstates with tree tensor network states (TTNS)

Author: Larsson Henrik R.
Publication venue: 'AIP Publishing'
Publication date: 26/11/2019

We present how to compute vibrational eigenstates with tree tensor network states (TTNSs), the underlying ansatz behind the multilayer multiconfiguration time-dependent Hartree (ML-MCTDH) method. The eigenstates are computed with an algorithm that is based on the density matrix renormalization group (DMRG). We apply this to compute the vibrational spectrum of acetonitrile (CH₃CN) to high accuracy and compare TTNSs with matrix product states (MPSs), the ansatz behind the DMRG. The presented optimization scheme converges much faster than ML-MCTDH-based optimization. For this particular system, we found no major advantage of the more general TTNS over MPS. We highlight that for both TTNS and MPS, the usage of an adaptive bond dimension significantly reduces the number of required parameters. We furthermore propose a procedure to find good trees.

arXiv.org e-Print Archive

Caltech Authors

Approaching the taxonomic affiliation of unidentified sequences in public databases – an example from the mycorrhizal fungi

Authors: Kristiansson Erik; Larsson Karl-Henrik; Nilsson R Henrik; Ryberg Martin
Publication venue: BioMed Central
Publication date: 01/01/2005

BACKGROUND: During the last few years, DNA sequence analysis has become one of the primary means of taxonomic identification of species, particularly so for species that are minute or otherwise lack distinct, readily obtainable morphological characters. Although the number of sequences available for comparison in public databases such as GenBank increases exponentially, only a minuscule fraction of all organisms have been sequenced, leaving taxon sampling a momentous problem for sequence-based taxonomic identification. When querying GenBank with a set of unidentified sequences, a considerable proportion typically lack fully identified matches, forming an ever-mounting pile of sequences that the researcher will have to monitor manually in the hope that new, clarifying sequences have been submitted by other researchers. To alleviate these concerns, a project to automatically monitor select unidentified sequences in GenBank for taxonomic progress through repeated local BLAST searches was initiated. Mycorrhizal fungi, a field where species identification is often prohibitively complex, and the much-used ITS locus were chosen as a test bed.
RESULTS: A Perl script package called emerencia is presented. On a regular basis, it downloads select sequences from GenBank, separates the identified sequences from those insufficiently identified, and performs BLAST searches between these two datasets, storing all results in an SQL database. On the accompanying web service, users can monitor the taxonomic progress of insufficiently identified sequences over time, either through active searches or by signing up for e-mail notification upon disclosure of better matches. Other search categories, such as listing all insufficiently identified sequences (and their present best fully identified matches) publication-wise, are also available.
DISCUSSION: The ever-increasing use of DNA sequences for identification purposes largely falls back on the assumption that public sequence databases contain a thorough sampling of taxonomically well-annotated sequences. Taxonomy, held by some to be an old-fashioned trade, has accordingly never been more important. emerencia does not automate the taxonomic process, but it does allow researchers to focus their efforts elsewhere than countless manual BLAST runs and arduous sieving of BLAST hit lists. The emerencia system is available on an open source basis for local installation with any organism and gene group as targets.

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Chalmers Research

Göteborgs universitets publikationer - e-publicering och e-arkiv

galaxieEST: addressing EST identity through automated phylogenetic analysis

Authors: Larsson Karl-Henrik; Nilsson R Henrik; Rajashekar Balaji; Ursing Björn M
Publication venue: BioMed Central
Publication date: 01/01/2004

BACKGROUND: Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactorily annotated public sequence libraries are, however, available only for a limited range of organisms, rendering the absence of sequences and gene structure information a tangible problem for those working with taxa lacking an EST or genome sequencing project. Paralogous genes belonging to the same gene family but distinguished by derived characteristics are particularly prone to misidentification and erroneous annotation; high but incomplete levels of sequence similarity are typically difficult to interpret and have formed the basis of many unsubstantiated assumptions of orthology. In these cases, a phylogenetic study of the query sequence together with the most similar sequences in the database may be of great value to the identification process. In order to facilitate this laborious procedure, a project to employ automated phylogenetic analysis in the identification of ESTs was initiated.
RESULTS: galaxieEST is an open source Perl-CGI script package designed to complement traditional similarity-based identification of EST sequences through employment of automated phylogenetic analysis. It uses a series of BLAST runs as a sieve to retrieve nucleotide and protein sequences for inclusion in neighbour joining and parsimony analyses; the output includes the BLAST output, the results of the phylogenetic analyses, and the corresponding multiple alignments. galaxieEST is available as an on-line web service for identification of fungal ESTs and for download / local installation for use with any organism group.
CONCLUSIONS: By addressing sequence relatedness in addition to similarity, galaxieEST provides an integrative view on EST origin and identity, which may prove particularly useful in cases where similarity searches return one or more pertinent, but not full, matches and additional information on the query EST is needed.

Lund University Publications

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central