3,146 research outputs found

    WormBase - Annotating many nematode genomes

    Get PDF
    WormBase (www.wormbase.org) has been serving the scientific community for over 11 years as the central repository for genomic and genetic information for the soil nematode Caenorhabditis elegans. The resource has evolved from its beginnings as a database housing the genomic sequence and genetic and physical maps of a single species, and now represents the breadth and diversity of nematode research, currently serving genome sequence and annotation for around 20 nematodes. In this article, we focus on WormBase’s role of genome sequence annotation, describing how we annotate and integrate data from a growing collection of nematode species and strains. We also review our approaches to sequence curation, and discuss the impact on annotation quality of large functional genomics projects such as modENCODE

    Improving the Accuracy of Oncology Diagnosis: A Machine Learning-Based Approach to Cancer Prediction

    Get PDF
    Cancer ranks among the most lethal illnesses worldwide, and predicting its onset can be a crucial factor in enhancing people’s quality of life by taking preventive measures to improve treatment and survival. This study conducted comparative research to determine the machine learning model with the highest accuracy for tumor type classification, distinguishing between malignant (cancer) and benign tumors. The models evaluated include decision tree (DT), naive bayes (NB), extra trees classifier (ETM), random forest (RF), K-means clustering (K-means), logistic regression (LR), adaptive boosting (AdaBoost), gradient boosting (GB), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost) to identify the one with the best accuracy. The models were trained using a dataset of 569 records and a total of 32 variables, containing patient information and tumor characteristics. The study was structured into sections, such as related studies, descriptions of the models, case study development, results, discussion, and conclusions. The models’ performance was evaluated based on metrics of precision, sensitivity, accuracy, and F1 score. Following the training, the results positioned the XGBoost model as having the best performance, achieving 98% precision, accuracy, sensitivity, and F1 score

    Evaluation of machine learning algorithms in the early detection of Parkinson's disease: a comparative study

    Get PDF
    Parkinson's is a neurodegenerative disease that generally affects people over 60 years of age. The disease destroys neurons and increases the accumulation of α-synuclein in many parts of the brain stem, although at present its causes remain unknown. It is therefore a priority to identify a method that can detect the disease, and this is where machine learning models become important. This study aims to perform a comparative analysis of machine learning models focused on the early detection of Parkinson's disease. Logistic regression (LR), support vector machines (SVM), decision trees (DT), extra trees classifiers (ETC), K-nearest neighbors (KNN), random forests (RF), adaptive boosting (AdaBoost) and gradient boosting (GB) algorithms are described and developed to identify the one that offers the best performance. In the training stage, we used the Oxford University dataset for Parkinson's disease detection, which has a total of 23 attributes and 195 records on patient voice recordings. The article is structured into six sections, such as introduction, related work, methodology, results, discussions, and conclusions. The metrics of accuracy, sensitivity, F1 count, and precision were used to measure the models' performance. The results position the KNN model as the best predictor with 95% accuracy, precision, sensitivity, and F1 score

    Performance analysis of 10 machine learning models in lung cancer prediction

    Get PDF
    Lung cancer is one of the diseases with the highest incidence and mortality in the world. Machine learning (ML) models can play an important role in the early detection of this disease. This study aims to identify the ML algorithm that has the best performance in predicting lung cancer. The algorithms that were contrasted were logistic regression (LR), decision tree (DT), k-nearest neighbors (KNN), gaussian Naive Bayes (GNB), multinomial Naive Bayes (MNB), support vector classifier (SVC), random forest (RF), extreme gradient boosting (XGBoost), multilayer perceptron (MLP) and gradient boosting (GB). The dataset used was provided by Kaggle, with a total of 309 records and 16 attributes. The study was developed in several phases, such as the description of the ML models and the analysis of the dataset. In addition, the contrast of the models was performed under the metrics of specificity, sensitivity, F1 count, accuracy, and precision. The results showed that the SVC, RF, MLP, and GB models obtained the best performance metrics, achieving 98% accuracy, 98% precision, and 98% sensitivity

    5G Technology for Innovation Education (Sustainable Development Goals 4): A Systematic Review

    Get PDF
    Access to quality education remains a significant global challenge today. In line with the goals outlined by the World Health Organization (WHO) in its 2030 agenda, ensuring access to education is a fundamental objective. Consequently, it is imperative to undertake an investigation into the influence of technological innovation on educational practices. This study examines the impact of incorporating the 5G network into educational settings to improve learning experiences. The analysis covered 134 articles, 62 of which were deemed relevant, classifying the research as ongoing projects or pilot studies for future exploration. The main digital tools identified were artificial intelligence, the Internet of Things, virtual reality, and machine learning. The use of the 5G network appears to have a more significant impact on higher education and universities. Research in this field is mainly concentrated in Europe, America, and Asia. In addition, it is clear that the adoption of 5G technology is influencing pedagogical methods, emphasising immersive learning, e-learning platforms, and flipped classrooms. This study argues for further research into the integration of technology in education, advocating a careful examination of the implementation of 5G infrastructure and its potential to improve access to high-quality education

    Autoencoder-based Online Data Quality Monitoring for the CMS Electromagnetic Calorimeter

    Full text link
    The online Data Quality Monitoring system (DQM) of the CMS electromagnetic calorimeter (ECAL) is a crucial operational tool that allows ECAL experts to quickly identify, localize, and diagnose a broad range of detector issues that would otherwise hinder physics-quality data taking. Although the existing ECAL DQM system has been continuously updated to respond to new problems, it remains one step behind newer and unforeseen issues. Using unsupervised deep learning, a real-time autoencoder-based anomaly detection system is developed that is able to detect ECAL anomalies unseen in past data. After accounting for spatial variations in the response of the ECAL and the temporal evolution of anomalies, the new system is able to efficiently detect anomalies while maintaining an estimated false discovery rate between 10210^{-2} to 10410^{-4}, beating existing benchmarks by about two orders of magnitude. The real-world performance of the system is validated using anomalies found in 2018 and 2022 LHC collision data. Additionally, first results from deploying the autoencoder-based system in the CMS online DQM workflow for the ECAL barrel during Run 3 of the LHC are presented, showing its promising performance in detecting obscure issues that could have been missed in the existing DQM system.Comment: Submitted to the Proceedings of 21st International Workshop on Advanced Computing and Analysis Techniques in Physics Research ACAT 2022 conferenc

    Searching for Intelligent Life in Gravitational Wave Signals Part I: Present Capabilities and Future Horizons

    Full text link
    We show that the Laser Interferometer Gravitational Wave Observatory (LIGO) is a powerful instrument in the Search for Extraterrestrial Intelligence (SETI). LIGO's ability to detect gravitational waves (GWs) from astrophysical sources, such as binary black hole mergers, also provides the potential to detect extraterrestrial mega-technology, such as Rapid and/or Massive Accelerating spacecraft (RAMAcraft). We show that LIGO is sensitive to RAMAcraft of 1 Jupiter mass accelerating to a fraction of the speed of light (e.g. 30\%) from 1010010 - 100\,kpc or a Moon mass from 1101-10\,pc. While existing SETI searches can probe on the order of ten-thousand stars for human-scale technology (e.g. radio waves), LIGO can probe all 1011^{11} stars in the Milky Way for RAMAcraft. Moreover, thanks to the f1f^{-1} scaling of RAMAcraft signals, our sensitivity to these objects will increase as low-frequency detectors are developed and improved, allowing for the detection of smaller masses further from Earth. In particular, we find that DECIGO and the Big Bang Observer (BBO) will be about 100 times more sensitive than LIGO, increasing the search volume by 106^{6}, while LISA and Pulsar Timing Arrays (PTAs) may improve sensitivities to objects with long acceleration periods. In this paper, we calculate the waveforms for linearly-accelerating RAMAcraft in a form suitable for LIGO, Virgo, and KAGRA searches and provide the range for a variety of masses and accelerations. We expect that the current and upcoming GW detectors will soon become an excellent complement to the existing SETI efforts.Comment: 20 pages, 12 figures, submitted to MNRAS, comments welcom

    Development and evaluation of a didactic tool with augmented reality for Quechua language learning in preschoolers

    Get PDF
    "It is important to preserve our cultural identity through the preservation of our mother tongue, contributing to its dissemination. Augmented reality (AR) is a great ally of education that provides efficiency, and productivity and increases the interest of students in their academic activities. An AR application was developed for learning Quechua in preschool children, thus improving their learning, satisfaction, and preference compared to traditional teaching. Previously, learning styles were identified for better coverage of the application; the design thinking methodology was applied for the development of the application, then the respective tests were conducted where it was obtained that the children's performance improved by 28.3% more compared to traditional teaching, with an average satisfaction of 89% of the classrooms, and 81% of students' preference. It was concluded that the proposed application considerably favors the written and audiovisual learning of the Quechua language in preschool students.
    corecore