52 research outputs found

    Comparative Analysis of the Performance of Popular Sorting Algorithms on Datasets of Different Sizes and Characteristics

    Get PDF
    Abstract: The efficiency and performance of sorting algorithms play a crucial role in various applications and industries. In this research paper, we present a comprehensive comparative analysis of popular sorting algorithms on datasets of different sizes and characteristics. The aim is to evaluate the algorithms' performance and identify their strengths and weaknesses under varying scenarios. We consider six commonly used sorting algorithms: QuickSort, TimSort, MergeSort, HeapSort, RadixSort, and ShellSort. These algorithms represent a range of approaches and techniques, including divide-and-conquer, hybrid sorting, and simple comparison-based methods. To assess their performance, we employ a diverse set of datasets, including the Iris dataset (1K), student dataset (5.8K), Wine dataset (6.5K), Uniform (10K), Normal (10K), Exponential (10K), Bimodal (10K), Yelp dataset (10K), MNIST dataset (42K), Uniform (100K), Normal (100K), Exponential (100K), Bimodal (100K), Uniform (500K), Normal (500K), Exponential (500K), Bimodal (500K), Uniform (1M), Normal (1M), Exponential (1M), and Bimodal (1M). These datasets cover a wide range of sizes and characteristics, allowing us to analyze the algorithms' performance across different dimensions. We measure and compare several key metrics, including execution time, memory usage, algorithmic complexity and stability. By analyzing these metrics, we gain insights into the efficiency and suitability of each algorithm for different dataset sizes and characteristics. We also discuss the implications of the findings in practical applications. Our results reveal important trade-offs among the sorting algorithms. While some algorithms excel in certain scenarios, others demonstrate better scalability or memory efficiency. We identify the best-performing algorithms for specific dataset characteristics and highlight their strengths and limitations. This research can assist developers and practitioners in selecting appropriate sorting algorithms based on their specific requirements and dataset characteristics. In conclusion, this comparative analysis provides a valuable contribution to the understanding of sorting algorithm performance. The findings contribute insights into the efficiency and suitability of popular sorting algorithms across datasets of different sizes and characteristics. By evaluating key metrics and discussing the implications, we offer guidance for selecting the most appropriate sorting algorithm in various practical scenarios

    TANGLE: Two-Level Support Vector Regression Approach for Protein Backbone Torsion Angle Prediction from Primary Sequences

    Get PDF
    Protein backbone torsion angles (Phi) and (Psi) involve two rotation angles rotating around the Cα-N bond (Phi) and the Cα-C bond (Psi). Due to the planarity of the linked rigid peptide bonds, these two angles can essentially determine the backbone geometry of proteins. Accordingly, the accurate prediction of protein backbone torsion angle from sequence information can assist the prediction of protein structures. In this study, we develop a new approach called TANGLE (Torsion ANGLE predictor) to predict the protein backbone torsion angles from amino acid sequences. TANGLE uses a two-level support vector regression approach to perform real-value torsion angle prediction using a variety of features derived from amino acid sequences, including the evolutionary profiles in the form of position-specific scoring matrices, predicted secondary structure, solvent accessibility and natively disordered region as well as other global sequence features. When evaluated based on a large benchmark dataset of 1,526 non-homologous proteins, the mean absolute errors (MAEs) of the Phi and Psi angle prediction are 27.8° and 44.6°, respectively, which are 1% and 3% respectively lower than that using one of the state-of-the-art prediction tools ANGLOR. Moreover, the prediction of TANGLE is significantly better than a random predictor that was built on the amino acid-specific basis, with the p-value<1.46e-147 and 7.97e-150, respectively by the Wilcoxon signed rank test. As a complementary approach to the current torsion angle prediction algorithms, TANGLE should prove useful in predicting protein structural properties and assisting protein fold recognition by applying the predicted torsion angles as useful restraints. TANGLE is freely accessible at http://sunflower.kuicr.kyoto-u.ac.jp/~sjn/TANGLE/

    A framework for evolutionary systems biology

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many difficult problems in evolutionary genomics are related to mutations that have weak effects on fitness, as the consequences of mutations with large effects are often simple to predict. Current systems biology has accumulated much data on mutations with large effects and can predict the properties of knockout mutants in some systems. However experimental methods are too insensitive to observe small effects.</p> <p>Results</p> <p>Here I propose a novel framework that brings together evolutionary theory and current systems biology approaches in order to quantify small effects of mutations and their epistatic interactions <it>in silico</it>. Central to this approach is the definition of fitness correlates that can be computed in some current systems biology models employing the rigorous algorithms that are at the core of much work in computational systems biology. The framework exploits synergies between the realism of such models and the need to understand real systems in evolutionary theory. This framework can address many longstanding topics in evolutionary biology by defining various 'levels' of the adaptive landscape. Addressed topics include the distribution of mutational effects on fitness, as well as the nature of advantageous mutations, epistasis and robustness. Combining corresponding parameter estimates with population genetics models raises the possibility of testing evolutionary hypotheses at a new level of realism.</p> <p>Conclusion</p> <p>EvoSysBio is expected to lead to a more detailed understanding of the fundamental principles of life by combining knowledge about well-known biological systems from several disciplines. This will benefit both evolutionary theory and current systems biology. Understanding robustness by analysing distributions of mutational effects and epistasis is pivotal for drug design, cancer research, responsible genetic engineering in synthetic biology and many other practical applications.</p

    Optimal Afforestation Contracts with Asymmetric Information on Private Environmental Benefits

    Full text link

    A comparative study of the reported performance of ab initio protein structure prediction algorithms

    No full text
    Protein structure prediction is one of the major challenges in bioinformatics today. Throughout the past five decades, many different algorithmic approaches have been attempted, and although progress has been made the problem remains unsolvable even for many small proteins. While the general objective is to predict the three-dimensional structure from primary sequence, our current knowledge and computational power are simply insufficient to solve a problem of such high complexity

    Potentials and perspectives of balanced development in Lithuanian forestry

    No full text

    Kinerja Rantai Pasokan Kopi Arabika Java Preanger di Kabupaten Sumedang Jawa Barat

    Full text link
    Kabupaten Sumedang memiliki banyak komoditas unggulan di sektor pertanian seperti kopi Priangan, tetapi masih terdapat banyak masalah didalam sektor pertanian khususnya kopi. Permasalahan lambatnya pertumbuhan sektor pertanian dapat dilihat dari permasalahan utama yang dihadapi oleh petani, yaitu terletak pada keterbatasan lahan yang dikelola dan status petani yang sebagian besar merupakan buruh tani. Hal ini menyebabkan usaha tani mereka menjadi tidak efisien karena tidak mencapai skala ekonomis. Memperhatikan permasalahan dalam komoditi tersebut, didalam meningkatkan daya saing suatu bisnis sangat memerlukan informasi dari aliran produk yang dimulai dari hulu sampai ke hilir agar bisa menilai apakah nanti bisnis yang berjalan memiliki nilai tambah yang di harapkan atau tidak. Metode kualitatif deskriptif peneliti pilih untuk menggambarkan hasil penelitian dari rantai pasokan dan analisis nilai tambahnya. Hasil penelitian masih terdapat kendala yang dihadapi pelaku usaha pada setiap aktivitas rantai pasok, khususnya pada penyediaan agro input dan proses budidaya yang belum mendukung peningkatan produktivitas, keterbatasan alat pengolahan, dan pemasaran yang belum terintegrasi antar pelaku usaha sehingga daya saingnya belum optimal. Diperlukan kebijakan terarah dan integratif dari pemerintah untuk mendukung iklim usaha kopi agar KAJP Manglayang Timur dapat menjadi komoditas unggulan yang mampu mendorong aktivitas perekonomian masyarakat, sehingga kebijakan pemerintah sangat menentukan produktifitas pertanian dibidang kopi. Kata Kunci : kopi, nilai tambah, rantai pasoka

    Privacy lost in online education: analysis of web tracking evolution

    No full text
    Digital tracking poses a significant and multifaceted threat to personal privacy and integrity. Tracking techniques, such as the use of cookies and scripts, are widespread on the World Wide Web and have become more pervasive in the past decade. This paper focuses on the historical analysis of tracking practices specifically on educational websites, which require particular attention due to their often mandatory usage by users, including young individuals who may not adequately assess privacy implications. The paper proposes a framework for comparing tracking activities on a specific domain of websites by contrasting a sample of these sites with a control group consisting of sites with comparable traffic levels, but without a specific functional purpose. This comparative analysis allows us to evaluate the distinctive evolution of tracking on educational platforms against a standard benchmark. Our findings reveal that although educational websites initially demonstrated lower levels of tracking, their growth rate from 2012 to 2021 has exceeded that of the control group, resulting in higher levels of tracking at present. Through our investigation into the expansion of various types of trackers, we suggest that the accelerated growth of tracking on educational websites is partly attributable to the increased use of interactive features, facilitated by third-party services that enable the collection of user data. The paper concludes by proposing ways in which web developers can safeguard their design choices to mitigate user exposure to tracking.Algorithms and the Foundations of Software technolog
    corecore