Search CORE

328 research outputs found

PonyGE2: Grammatical Evolution in Python

Author: Fagan David
Fenton Michael
Forstenlechner Stefan
Hemberg Erik
McDermott James
O'Neill Michael
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 26/04/2017
Field of study

Grammatical Evolution (GE) is a population-based evolutionary algorithm, where a formal grammar is used in the genotype to phenotype mapping process. PonyGE2 is an open source implementation of GE in Python, developed at UCD's Natural Computing Research and Applications group. It is intended as an advertisement and a starting-point for those new to GE, a reference for students and researchers, a rapid-prototyping medium for our own experiments, and a Python workout. As well as providing the characteristic genotype to phenotype mapping of GE, a search algorithm engine is also provided. A number of sample problems and tutorials on how to use and adapt PonyGE2 have been developed.Comment: 8 pages, 4 figures, submitted to the 2017 GECCO Workshop on Evolutionary Computation Software Systems (EvoSoft

arXiv.org e-Print Archive

Crossref

An evolution of a complete program using XML-based grammar definition

Author: Christopher Hinde (1258023)
Nor Zainah Siau (7169549)
Roger Stone (1256976)
Publication venue
Publication date: 01/01/2012
Field of study

XML technology is a technique to describe structured data that can be manipulated by different types of applications, especially to represent content on the Web. This paper presents a viable approach to automatically evolve a ‘sorting program’ by applying genetic programming and full syntax XML-based grammar definition to map the genotype to phenotype. The genotypes are composed of fixed-length blocks of genes that are made up of a series of integer values. The paper reports that our approach improves the structure of the grammar used in the mapping process, which guarantees that the generated program follows the correct syntax with no repair function, in comparison to earlier work. This allows more structured programs than earlier systems

Loughborough University Institutional Repository

Extending Grammatical Evolution to Evolve Digital Surfaces with Genr8

Author: F. Gruau
G.S. Hornby
J.R. Koza
M. O’Neill
P. Prusinkiewicz
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

Crossref

Building and Documenting Workflows with Python-Based Snakemake

Author: Rahmann Sven
Publication venue: OASIcs - OpenAccess Series in Informatics. German Conference on Bioinformatics 2012
Publication date: 01/01/2012
Field of study

Snakemake is a novel workflow engine with a simple Python-derived workflow definition language and an optimizing execution environment. It is the first system that supports multiple named wildcards (or variables) in input and output filenames of each rule definition. It also allows to write human-readable workflows that document themselves. We have found Snakemake especially useful for building high-throughput sequencing data analysis pipelines and present examples from this area. Snakemake exemplifies a generic way to implement a domain specific language in python, without writing a full parser or introducing syntactical overhead by overloading language features

Dagstuhl Research Online Publication Server

Tools for visualization and analysis of molecular networks, pathways, and -omics data.

Author: Habermann B.
Koti P.
Villaveces J.
Publication venue: 'Dove Medical Press Ltd.'
Publication date: 04/06/2015
Field of study

Biological pathways have become the standard way to represent the coordinated reactions and actions of a series of molecules in a cell. A series of interconnected pathways is referred to as a biological network, which denotes a more holistic view on the entanglement of cellular reactions. Biological pathways and networks are not only an appropriate approach to visualize molecular reactions. They have also become one leading method in -omics data analysis and visualization. Here, we review a set of pathway and network visualization and analysis methods and take a look at potential future developments in the field

PubMed Central

MPG.PuRe

Semantic wikis as flexible database interfaces for biomedical applications

Author: Atzori Manfredo
Corbetta Maurizio
Falda Marco
Publication venue: Digital Commons@Becker
Publication date: 19/01/2023
Field of study

Several challenges prevent extracting knowledge from biomedical resources, including data heterogeneity and the difficulty to obtain and collaborate on data and annotations by medical doctors. Therefore, flexibility in their representation and interconnection is required; it is also essential to be able to interact easily with such data. In recent years, semantic tools have been developed: semantic wikis are collections of wiki pages that can be annotated with properties and so combine flexibility and expressiveness, two desirable aspects when modeling databases, especially in the dynamic biomedical domain. However, semantics and collaborative analysis of biomedical data is still an unsolved challenge. The aim of this work is to create a tool for easing the design and the setup of semantic databases and to give the possibility to enrich them with biostatistical applications. As a side effect, this will also make them reproducible, fostering their application by other research groups. A command-line software has been developed for creating all structures required by Semantic MediaWiki. Besides, a way to expose statistical analyses as R Shiny applications in the interface is provided, along with a facility to export Prolog predicates for reasoning with external tools. The developed software allowed to create a set of biomedical databases for the Neuroscience Department of the University of Padova in a more automated way. They can be extended with additional qualitative and statistical analyses of data, including for instance regressions, geographical distribution of diseases, and clustering. The software is released as open source-code and published under the GPL-3 license at https://github.com/mfalda/tsv2swm

Digital Commons@Becker

Dynamic Protocol Reverse Engineering a Grammatical Inference Approach

Author: DeYoung Mark E.
Publication venue: AFIT Scholar
Publication date: 01/03/2008
Field of study

Round trip engineering of software from source code and reverse engineering of software from binary files have both been extensively studied and the state-of-practice have documented tools and techniques. Forward engineering of protocols has also been extensively studied and there are firmly established techniques for generating correct protocols. While observation of protocol behavior for performance testing has been studied and techniques established, reverse engineering of protocol control flow from observations of protocol behavior has not received the same level of attention. State-of-practice in reverse engineering the control flow of computer network protocols is comprised of mostly ad hoc approaches. We examine state-of-practice tools and techniques used in three open source projects: Pidgin, Samba, and rdesktop . We examine techniques proposed by computational learning researchers for grammatical inference. We propose to extend the state-of-art by inferring protocol control flow using grammatical inference inspired techniques to reverse engineer automata representations from captured data flows. We present evidence that grammatical inference is applicable to the problem domain under consideration

AFTI Scholar (Air Force Institute of Technology)

NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata

Author: Adida
Anurag Priyam
Arlin Stoltzfus
Ashburner
Balhoff
Beaman
Beckett
Benson
Biron
Bisby
Brandes
Cardona
Connelly
Constable
Dahdul
Dahdul
Drummond
Fallside
Felsenstein
Felsenstein
Gkoutos
Gopalan
Han
Hilmar Lapp
Hladish
Hyam
James P. Balhoff
Jason A. Caravas
Jeet Sukumaran
Johnson
Jordan
Leary
Leebens-Mack
Lewis
Li
Maddison
Maddison
Maddison
Maddison
Mark T. Holder
Matthews
McEntire
Miller
Moore
Mungall
O'Leary
Page
Parks
Peter E. Midford
Piel
Prosdocimi
Rausher
Rice
Ronquist
Rutger A. Vos
Sanderson
Schmitt
Sidlauskas
Smits
Stoesser
Sukumaran
Swofford
Taylor
Than
Thompson
Wayne P. Maddison
Whelan
Whitlock
Xia
Xuhua Xia
Zmasek
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveying reusable data are particularly acute in regard to evolutionary comparative analysis, which comprises an ever-expanding list of data types, methods, research aims, and subdisciplines. To facilitate interoperability in evolutionary comparative analysis, we present NeXML, an XML standard (inspired by the current standard, NEXUS) that supports exchange of richly annotated comparative data. NeXML defines syntax for operational taxonomic units, character-state matrices, and phylogenetic trees and networks. Documents can be validated unambiguously. Importantly, any data element can be annotated, to an arbitrary degree of richness, using a system that is both flexible and rigorous. We describe how the use of NeXML by the TreeBASE and Phenoscape projects satisfies user needs that cannot be satisfied with other available file formats. By relying on XML Schema Definition, the design of NeXML facilitates the development and deployment of software for processing, transforming, and querying documents. The adoption of NeXML for practical use is facilitated by the availability of (1) an online manual with code samples and a reference to all defined elements and attributes, (2) programming toolkits in most of the languages used commonly in evolutionary informatics, and (3) input–output support in several widely used software applications. An active, open, community-based development process enables future revision and expansion of NeXML

Crossref

KU ScholarWorks

PubMed Central

Carolina Digital Repository