An experimental comparison of a genetic algorithm and a hill-climber for term selection
Purpose – The term selection problem for selecting query terms in information filtering and routing has been investigated using hill-climbers of various kinds, largely through the Okapi experiments in the TREC series of conferences. Although these are simple deterministic approaches which examine the effect of changing the weight of one term at a time, they have been shown to improve the retrieval effectiveness of filtering queries in these TREC experiments. Hill-climbers are, however, likely to get trapped in local optima, and more sophisticated search techniques that attempt to break out of these optima are worth investigating for this problem. To this end, we apply a genetic algorithm (GA) to the same problem.
Design/Methodology/Approach – We use a standard TREC test collection from the TREC-8 filtering track, recording mean average precision and recall measures to allow comparison between the hill-climber and the GA. We also vary elements of the GA, such as the probability of a word being included, the probability of mutation and the population size, in order to measure the effect of these variables. Different strategies such as Elitist and Non-Elitist methods are used, as well as Roulette Wheel and Rank selection.
Findings – The results of tests suggest that both techniques are, on average, better than the baseline, but the implemented GA does not match the overall performance of a hill-climber. The Rank selection algorithm does better on average than the Roulette Wheel algorithm. There is no evidence in this study that varying word inclusion probability, mutation probability or Elitist method makes much difference to the overall results. Small population sizes do not appear to be as effective as larger population sizes.
Research limitations/implications – The evidence provided here suggests that being stuck in a local optimum for the term selection optimization problem does not appear to be detrimental to the overall success of the hill-climber. Term rank order appears to provide extra useful evidence which hill-climbers can use efficiently and effectively to narrow the search space.
Originality/Value – The paper represents the first attempt to compare hill-climbers with GAs on a problem of this type.
"Going back to our roots": second generation biocomputing
Researchers in the field of biocomputing have, for many years, successfully
"harvested and exploited" the natural world for inspiration in developing
systems that are robust, adaptable and capable of generating novel and even
"creative" solutions to human-defined problems. However, in this position paper
we argue that the time has now come for a reassessment of how we exploit
biology to generate new computational systems. Previous solutions (the "first
generation" of biocomputing techniques), whilst reasonably effective, are crude
analogues of actual biological systems. We believe that a new, inherently
inter-disciplinary approach is needed for the development of the emerging
"second generation" of bio-inspired methods. This new modus operandi will
require much closer interaction between the engineering and life sciences
communities, as well as a bidirectional flow of concepts, applications and
expertise. We support our argument by examining, in this new light, three
existing areas of biocomputing (genetic programming, artificial immune systems
and evolvable hardware), as well as an emerging area (natural genetic
engineering) which may provide useful pointers as to the way forward.
Comment: Submitted to the International Journal of Unconventional Computing
Capturing Regular Human Activity through a Learning Context Memory
A learning context memory consisting of two main parts is
presented. The first part performs lossy data compression,
keeping the amount of stored data at a minimum by combining
similar context attributes — the compression rate for the
presented GPS data is 150:1 on average. The resulting data is
stored in an appropriate data structure highlighting the level
of compression. Elements with a high level of compression
are used in the second part to form the start and end points
of episodes capturing common activity consisting of consecutive
events. The context memory is used to investigate how
little context data can be stored while still containing enough
information to capture regular human activity.
A new paradigm for SpeckNets: inspiration from fungal colonies
In this position paper, we propose the development of a new biologically inspired paradigm, based on fungal colonies, for application to pervasive adaptive systems. Fungal colonies have a number of properties that make them an excellent candidate for inspiration for engineered systems. Here we propose the application of such inspiration to a speckled computing platform. We argue that properties of fungal colonies map well to the properties and requirements for controlling SpeckNets, and suggest that an existing mathematical model of a fungal colony can be developed into a new computational paradigm.
Pronouns and identity: A case study from a 1930s working-class community
This article investigates the relationship between certain pronoun uses and identity in a 1930s working-class community. It is based on a corpus of informal conversations drawn from the Mass-Observation archive, a sociological and anthropological study of the Bolton (UK) working class at this time. The article argues that certain pronoun uses in the corpus can only be explained as homophoric reference, a kind of reference which depends on implicit agreement about the intended referent of the pronoun. The article then discusses the basis on which this implicit agreement could operate: shared culture and knowledge and a tight network of social relations. In the conclusion, two particular questions are raised: 1) How far can the homophoric reference described be related to social class? 2) When does (dialect) grammar become pragmatics?
The Influence of Route Characteristics, Train Design and Maintenance Policy on Wheel Tread Damage, Wheel Life and Costs for Multiple-Unit Trains
In the UK, the use of similar vehicle types by a range of privatised operators gives the opportunity to assess the influence of different route conditions and maintenance practices on wheel tread damage, wheelset life and costs. This paper investigates these influences, using data obtained directly from the train operators and maintainers. By disseminating best practice, it is expected that wheelset life can be improved on many fleets, with resultant cost savings.
Recruiting patients to medical research: double blind randomised trial of "opt-in" versus "opt-out" strategies
Objective: To evaluate the effect of opt-in compared with opt-out recruitment strategies on response rate and selection bias. Design: Double blind randomised controlled trial. Setting: Two general practices in England. Participants: 510 patients with angina. Intervention: Patients were randomly allocated to an opt-in (asked to actively signal willingness to participate in research) or opt-out (contacted repeatedly unless they signalled unwillingness to participate) approach for recruitment to an observational prognostic study of patients with angina. Main outcome measures: Recruitment rate and clinical characteristics of patients. Results: The recruitment rate, defined by clinic attendance, was 38% (96/252) in the opt-in arm and 50% (128/258) in the opt-out arm (P = 0.014). Once an appointment had been made, non-attendance at the clinic was similar (20% opt-in arm v 17% opt-out arm; P = 0.86). Patients in the opt-in arm had fewer risk factors (44% v 60%; P = 0.053), less treatment for angina (69% v 82%; P = 0.010), and less functional impairment (9% v 20%; P = 0.023) than patients in the opt-out arm. Conclusions: The opt-in approach to participant recruitment, increasingly required by ethics committees, resulted in lower response rates and a biased sample. We propose that the opt-out approach should be the default recruitment strategy for studies with low risk to participants.