174 research outputs found
Towards low-carbon conferencing : acceptance of virtual conferencing solutions and other sustainability measures in the ALIFE community
The latest report from the Intergovernmental Panel on Climate Change (IPCC) estimated that humanity has a time window of about 12 years in order to prevent anthropogenic climate change of catastrophic magnitude. Green house gas emission from air travel, which is currently rising, is possibly one of the factors that can be most readily reduced. Within this context, we advocate for the re-design of academic conferences in order to decrease their environmental footprint. Today, virtual technologies hold the promise to substitute many forms of physical interactions and increasingly make their way into conferences to reduce the number of travelling delegates. Here, we present the results of a survey in which we gathered the opinion on this topic of academics worldwide. Results suggest there is ample room for challenging the (dangerous) business-as-usual inertia of scientific lifestyle
ROSEFW-RF: the winner algorithm for the ECBDL’14 big data competition: an extremely imbalanced big data bioinformatics problem
The application of data mining and machine learning techniques to biological and biomedicine data continues to be an ubiquitous research theme in current bioinformatics. The rapid advances in biotechnology are allowing us to obtain and store large quantities of data about cells, proteins, genes, etc., that should be processed. Moreover, in many of these problems such as contact map prediction, the problem tackled in this paper, it is difficult to collect representative positive examples. Learning under these circumstances, known as imbalanced big data classification, may not be straightforward for most of the standard machine learning methods.
In this work we describe the methodology that won the ECBDL’14 big data challenge for a bioinformatics big data problem. This algorithm, named as ROSEFW-RF, is based on several MapReduce approaches to (1) balance the classes distribution through random oversampling, (2) detect the most relevant features via an evolutionary feature weighting process and a threshold to choose them, (3) build an appropriate Random Forest model from the pre-processed data and finally (4) classify the test data. Across the paper, we detail and analyze the decisions made during the competition showing an extensive experimental study that characterize the way of working of our methodology. From this analysis we can conclude that this approach is very suitable to tackle large-scale bioinformatics classifications problems
Automated Alphabet Reduction for Protein Datasets
<p>Abstract</p> <p>Background</p> <p>We investigate automated and generic alphabet reduction techniques for protein structure prediction datasets. Reducing alphabet cardinality without losing key biochemical information opens the door to potentially faster machine learning, data mining and optimization applications in structural bioinformatics. Furthermore, reduced but informative alphabets often result in, e.g., more compact and human-friendly classification/clustering rules. In this paper we propose a robust and sophisticated alphabet reduction protocol based on mutual information and state-of-the-art optimization techniques.</p> <p>Results</p> <p>We applied this protocol to the prediction of two protein structural features: contact number and relative solvent accessibility. For both features we generated alphabets of two, three, four and five letters. The five-letter alphabets gave prediction accuracies statistically similar to that obtained using the full amino acid alphabet. Moreover, the automatically designed alphabets were compared against other reduced alphabets taken from the literature or human-designed, outperforming them. The differences between our alphabets and the alphabets taken from the literature were quantitatively analyzed. All the above process had been performed using a primary sequence representation of proteins. As a final experiment, we extrapolated the obtained five-letter alphabet to reduce a, much richer, protein representation based on evolutionary information for the prediction of the same two features. Again, the performance gap between the full representation and the reduced representation was small, showing that the results of our automated alphabet reduction protocol, even if they were obtained using a simple representation, are also able to capture the crucial information needed for state-of-the-art protein representations.</p> <p>Conclusion</p> <p>Our automated alphabet reduction protocol generates competent reduced alphabets tailored specifically for a variety of protein datasets. This process is done without any domain knowledge, using information theory metrics instead. The reduced alphabets contain some unexpected (but sound) groups of amino acids, thus suggesting new ways of interpreting the data.</p
From biochemical markers to molecular endotypes of osteoarthritis: a review on validated biomarkers
\ua9 2024 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.Introduction: Osteoarthritis (OA) affects over 500 million people worldwide. OA patients are symptomatically treated, and current therapies exhibit marginal efficacy and frequently carry safety-risks associated with chronic use. No disease-modifying therapies have been approved to date leaving surgical joint replacement as a last resort. To enable effective patient care and successful drug development there is an urgent need to uncover the pathobiological drivers of OA and how these translate into disease endotypes. Endotypes provide a more precise and mechanistic definition of disease subgroups than observable phenotypes, and a panel of tissue- and pathology-specific biochemical markers may uncover treatable endotypes of OA. Areas covered: We have searched PubMed for full-text articles written in English to provide an in-depth narrative review of a panel of validated biochemical markers utilized for endotyping of OA and their association to key OA pathologies. Expert opinion: As utilized in IMI-APPROACH and validated in OAI-FNIH, a panel of biochemical markers may uncover disease subgroups and facilitate the enrichment of treatable molecular endotypes for recruitment in therapeutic clinical trials. Understanding the link between biochemical markers and patient-reported outcomes and treatable endotypes that may respond to given therapies will pave the way for new drug development in OA
Novel P-in-N Si-Sensor technology for high resolution and high repetition-rate experiments at accelerator facilities
Linear array detectors with high spatial resolution and MHz frame-rates are essential for high-rate experiments at accelerator facilities. KALYPSO, a line array detector with 1024 pixels operating over 1 Mfps has been developed. To improve the spatial resolution and sensitivity at different wavelengths, novel p-in-n Si microstrip sensors based on have been developed with a pitch of 25 micrometer. The efficiency of the sensor has been improved with the use of anti reflecting coating layers optimized for near infrared, visible and near ultraviolet. In this contribution the detector system and the sensors will be presented
- …