46,099 research outputs found
Searching for test data with feature diversity
There is an implicit assumption in software testing that more diverse and
varied test data is needed for effective testing and to achieve different types
and levels of coverage. Generic approaches based on information theory to
measure and thus, implicitly, to create diverse data have also been proposed.
However, if the tester is able to identify features of the test data that are
important for the particular domain or context in which the testing is being
performed, the use of generic diversity measures such as this may not be
sufficient nor efficient for creating test inputs that show diversity in terms
of these features. Here we investigate different approaches to find data that
are diverse according to a specific set of features, such as length, depth of
recursion etc. Even though these features will be less general than measures
based on information theory, their use may provide a tester with more direct
control over the type of diversity that is present in the test data. Our
experiments are carried out in the context of a general test data generation
framework that can generate both numerical and highly structured data. We
compare random sampling for feature-diversity to different approaches based on
search and find a hill climbing search to be efficient. The experiments
highlight many trade-offs that needs to be taken into account when searching
for diversity. We argue that recurrent test data generation motivates building
statistical models that can then help to more quickly achieve feature
diversity.Comment: This version was submitted on April 14th 201
Assembling the Tree of Life in Europe (AToLE)
A network of scientists under the umbrella of 'Assembling the Tree of Life in Europe (AToLE)' seeks funding under the FP7-Theme: Cooperation - Environment (including Climate Change and Biodiversity Conservation) programme of the European Commission.

Astrocladistics: Multivariate Evolutionary Analysis in Astrophysics
The Hubble tuning fork diagram, based on morphology and established in the
1930s, has always been the preferred scheme for classification of galaxies.
However, the current large amount of data up to higher and higher redshifts
asks for more sophisticated statistical approaches like multivariate analyses.
Clustering analyses are still very confidential, and do not take into account
the unavoidable characteristics in our Universe: evolution. Assuming branching
evolution of galaxies as a 'transmission with modification', we have shown that
the concepts and tools of phylogenetic systematics (cladistics) can be
heuristically transposed to the case of galaxies. This approach that we call
"astrocladistics", has now successfully been applied on several samples of
galaxies and globular clusters. Maximum parsimony and distance-based approaches
are the most popular methods to produce phylogenetic trees and, like most other
studies, we had to discretize our variables. However, since astrophysical data
are intrinsically continuous, we are contributing to the growing need for
applying phylogenetic methods to continuous characters.Comment: Invited talk at the session: Astrostatistics (Statistical analysis of
data related to Astronomy and Astrophysics
A new reference genome assembly for the microcrustacean Daphnia pulex
Comparing genomes of closely related genotypes from populations with distinct demographic histories can help reveal the impact of effective population size on genome evolution. For this purpose, we present a high quality genome assembly of Daphnia pulex (PA42), and compare this with the first sequenced genome of this species (TCO), which was derived from an isolate from a population with >90% reduction in nucleotide diversity. PA42 has numerous similarities to TCO at the gene level, with an average amino acid sequence identity of 98.8 and >60% of orthologous proteins identical. Nonetheless, there is a highly elevated number of genes in the TCO genome annotation, with similar to 7000 excess genes appearing to be false positives. This view is supported by the high GC content, lack of introns, and short length of these suspicious gene annotations. Consistent with the view that reduced effective population size can facilitate the accumulation of slightly deleterious genomic features, we observe more proliferation of transposable elements (TEs) and a higher frequency of gained introns in the TCO genome
Beyond multiregional and simple out-of-Africa models of human evolution
info:eu-repo/semantics/publishedVersio
Regional Genetic Structure in the Aquatic Macrophyte Ruppia cirrhosa Suggests Dispersal by Waterbirds
The evolutionary history of the genus Ruppia has been shaped by hybridization, polyploidisation and vicariance that have resulted in a problematic taxonomy. Recent studies provided insight into species circumscription, organelle takeover by hybridization, and revealed the importance of verifying species identification to avoid distorting effects of mixing different species, when estimating population connectivity. In the present study, we use microsatellite markers to determine population diversity and connectivity patterns in Ruppia cirrhosa including two spatial scales: (1) from the Atlantic Iberian coastline in Portugal to the Siculo-Tunisian Strait in Sicily and (2) within the Iberian Peninsula comprising the Atlantic-Mediterranean transition. The higher diversity in the Mediterranean Sea suggests that populations have had longer persistence there, suggesting a possible origin and/or refugial area for the species. The high genotypic diversities highlight the importance of sexual reproduction for survival and maintenance of populations. Results revealed a regional population structure matching a continent-island model, with strong genetic isolation and low gene flow between populations. This population structure could be maintained by waterbirds, acting as occasional dispersal vectors. This information elucidates ecological strategies of brackish plant species in coastal lagoons, suggesting mechanisms used by this species to colonize new isolated habitats and dominate brackish aquatic macrophyte systems, yet maintaining strong genetic structure suggestive of very low dispersal.Fundacao para a Cincia e Tecnologia (FCT, Portugal) [PTDC/MAR/119363/2010, BIODIVERSA/0004/2015, UID/Multi/04326/2013]Pew FoundationSENECA FoundationMurcia Government, Spain [11881/PI/09]FCT Investigator Programme-Career Development [IF/00998/2014]Spanish Ministry of Education [AP2008-01209]European Community [00399/2012]info:eu-repo/semantics/publishedVersio
- …