256 research outputs found

    An empirical analysis of pruning techniques performance, retrievability and bias

    Prior work on using retrievability measures in the evaluation of information retrieval (IR) systems has laid the foundations for investigating the relation between retrieval performance and retrieval bias. While various factors influencing retrievability have been examined, showing how the retrieval model may influence bias, no prior work has examined the impact of the index (and how it is optimized) on retrieval bias. Intuitively, how the documents are represented, and what terms they contain, will influence whether they are retrievable or not. In this paper, we investigate how the retrieval bias of a system changes as the inverted index is optimized for efficiency through static index pruning. In our analysis, we consider four pruning methods and examine how they affect performance and bias on the TREC GOV2 Collection. Our results show that the relationship between these factors is varied and complex, and very much dependent on the pruning algorithm. We find that more pruning results in relatively little change or a slight decrease in bias up to a point, and then a dramatic increase. The increase in bias corresponds to a sharp decrease in early-precision measures such as NDCG@10 and is also indicative of a large decrease in MAP. The findings suggest that the impact of pruning algorithms can be quite varied, but that retrieval bias could be used to guide the pruning process. Further work is required to determine precisely which documents are most affected and how this impacts performance.
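
    The abstract does not say how retrieval bias is quantified. A common convention in retrievability studies is to count how often each document appears above a rank cutoff across a query set and to summarize the inequality of those counts with a Gini coefficient. The sketch below assumes that convention; the function names, the `cutoff` parameter, and the toy data are hypothetical and not taken from the paper.

```python
def retrievability(rankings, collection, cutoff=100):
    """Cumulative retrievability: number of queries for which each
    document appears in the top `cutoff` results (assumed measure)."""
    counts = {doc: 0 for doc in collection}  # unretrieved docs count as 0
    for ranking in rankings:
        for doc in ranking[:cutoff]:
            counts[doc] = counts.get(doc, 0) + 1
    return counts


def gini(values):
    """Gini coefficient of non-negative values: 0 means every document is
    equally retrievable; values near 1 mean a few documents dominate."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum((2 * i - n - 1) * x for i, x in enumerate(xs, start=1))
    return weighted / (n * total)


# Hypothetical toy data: three queries over a four-document collection.
rankings = [["a", "b"], ["a", "c"], ["a", "b"]]
collection = ["a", "b", "c", "d"]
bias_at_1 = gini(retrievability(rankings, collection, cutoff=1).values())
bias_at_2 = gini(retrievability(rankings, collection, cutoff=2).values())
```

    Under these assumptions, computing the same statistic on the rankings produced by each pruned index would give one bias value per pruning level, which can then be set against NDCG@10 or MAP.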

    Factors Affecting Aluminum Dissolve from Acidified Water Purification Sludge

    Recovering resources from water purification sludge (WPS) has gradually been stipulated in environmental protection laws and regulations in many nations. Hence, reusing WPS is becoming an important topic, and recovering alum from WPS is one of many practical alternatives. Most previous research has studied the amphoteric character of aluminum hydroxide to find the optimum pH range for dissolving Al(III) species from WPS, but has lacked discussion of reaction kinetics or mechanisms. Therefore, in this investigation, the WPS solution was broken up by ultrasound to reduce the particle size of the reactants and enlarge their specific surface area. In line with reaction kinetics, these changes increased the quantity of dissolved aluminum salt and accelerated the reaction rate.

    Development of an Intelligent Equipment Lock Management System with RFID Technology

    The equipment lock has been an important tool for the power company to protect electricity metering equipment. However, the conventional equipment lock has two potential problems: vandalism and counterfeiting. To control and track potentially illegal behavior, human labor and paperwork are required for the related operations, consuming a large amount of human resources and maintenance costs. This study focuses on applying RFID technology to the traditional equipment lock; through mobile and electronic technology, the design strengthens the management and operating convenience of the lock and provides solutions for anti-counterfeiting and spoilage detection, so that national energy can be properly protected and fairly distributed.

    Query-level Early Exit for Additive Learning-to-Rank Ensembles

    Search engine ranking pipelines are commonly based on large ensembles of machine-learned decision trees. The tight constraints on query response time have recently motivated researchers to investigate algorithms that speed up the traversal of the additive ensemble or terminate early the evaluation of documents that are unlikely to be ranked among the top-k. In this paper, we investigate the novel problem of query-level early exiting, which aims to decide whether it is profitable to stop the traversal of the ranking ensemble early for all the candidate documents to be scored for a query, simply returning a ranking based on the additive scores computed by a limited portion of the ensemble. Besides the obvious advantage for query latency and throughput, we address the possible positive impact on ranking effectiveness. To this end, we study the actual contribution of incremental portions of the tree ensemble to the ranking of the top-k documents scored for a given query. Our main finding is that queries exhibit different behaviors as scores are accumulated during the traversal of the ensemble and that query-level early stopping can remarkably improve ranking quality. We present a reproducible and comprehensive experimental evaluation, conducted on two public datasets, showing that query-level early exiting achieves an overall gain of up to 7.5% in terms of NDCG@10 with a speedup of the scoring process of up to 2.2x.
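
    The analysis of "incremental portions of the tree ensemble" can be illustrated with a small oracle-style sketch: for one query, rank the candidates using only the first n trees of an additive ensemble, measure NDCG@k against the known relevance labels, and report the exit point with the best score. This is only an illustration of the idea, not the paper's method; the per-tree output matrix, the sentinel positions, and all function names are hypothetical, and a deployed system would need to choose the exit point without access to the labels.

```python
import math


def dcg_at_k(labels, k=10):
    """Discounted cumulative gain with exponential gain 2^rel - 1."""
    return sum((2 ** rel - 1) / math.log2(i + 2)
               for i, rel in enumerate(labels[:k]))


def ndcg_at_k(scores, labels, k=10):
    """NDCG@k of the ranking induced by `scores` for a single query."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    ideal = dcg_at_k(sorted(labels, reverse=True), k)
    if ideal == 0:
        return 0.0
    return dcg_at_k([labels[i] for i in order], k) / ideal


def partial_scores(tree_outputs, n_trees):
    """Additive score of each document using only the first n_trees trees.
    tree_outputs[d][t] is the output of tree t for document d."""
    return [sum(doc[:n_trees]) for doc in tree_outputs]


def best_exit_point(tree_outputs, labels, sentinels, k=10):
    """Oracle choice: the sentinel whose partial ranking has the highest
    NDCG@k; ties are resolved in favor of the earliest exit."""
    best = None
    for n_trees in sentinels:
        quality = ndcg_at_k(partial_scores(tree_outputs, n_trees), labels, k)
        if best is None or quality > best[1]:
            best = (n_trees, quality)
    return best


# Hypothetical query with three candidate documents and a four-tree ensemble.
toy_outputs = [
    [1.0, 0.5, -2.0, -1.0],  # relevant document, hurt by the later trees
    [0.0, 0.1, 1.0, 1.0],    # non-relevant document, boosted by later trees
    [0.5, 0.2, 0.0, 0.0],
]
toy_labels = [2, 0, 1]
exit_point = best_exit_point(toy_outputs, toy_labels, sentinels=[2, 4])
```

    In this toy query the ranking after two trees is already ideal and the full ensemble makes it worse, which is the kind of per-query behavior the abstract describes.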

    Two non-homologous brain diseases-related genes, SERPINI1 and PDCD10, are tightly linked by an asymmetric bidirectional promoter in an evolutionarily conserved manner

    BACKGROUND: Despite the fact that mammalian genomes are far more spacious than prokaryotic genomes, recent nucleotide sequencing data have revealed that many mammalian genes are arranged in a head-to-head orientation and separated by a small intergenic sequence. Extensive studies on some of these neighboring genes, in particular homologous gene pairs, have shown that these genes are often co-expressed in a symmetric manner and regulated by a shared promoter region. Here we report the identification of two non-homologous brain disease-related genes, one coding for a serine protease inhibitor (SERPINI1) and the other for a programmed cell death-related gene (PDCD10), that are tightly linked by an asymmetric bidirectional promoter in an evolutionarily conserved fashion. This asymmetric bidirectional promoter, in cooperation with some cis-acting elements, is responsible for the co-regulation of the gene expression pattern as well as the tissue specificity of SERPINI1 and PDCD10.
    RESULTS: While SERPINI1 is predominantly expressed in normal brain and down-regulated in brain tumors, PDCD10 is ubiquitously expressed in all normal tissues but its gene transcription becomes aberrant in different types of cancers. By measuring the luciferase activity in various cell lysates, their 851-bp intergenic sequence was shown to be capable of driving the reporter gene expression in either direction. A 175-bp fragment from nt 1 to 175 in the vicinity of PDCD10 was further determined to function as a minimal bidirectional promoter. A critical regulatory fragment, from nt 176 to 473 outside the minimal promoter in the intergenic region, was identified to contain a strong repressive element for SERPINI1 and an enhancer for PDCD10. These cis-acting elements may exist to help coordinate the expression and regulation of the two flanking genes.
    CONCLUSION: Among all non-homologous genes that have been described as closely adjacent in mammalian genomes, the intergenic region of the head-to-head PDCD10-SERPINI1 gene pair provides an interesting and informative example of a complex regulatory system that governs the expression of both genes not only through an asymmetric bidirectional promoter, but also through fine-tuned regulation by some cis-acting elements.

    Correlation Between Time Change in Modulus of Short-Period Geomagnetic Variation and Seismicity in Taiwan

    In this study, geomagnetic data from the Lunping observatory from 1993 to 2000 are used to compute the amplitude variation of short-period geomagnetic total intensity data with the complex demodulation method (CD method). In order to compare these time changes with seismicity, earthquakes that occurred within 150 km of Lunping, with magnitude ML greater than 3.0, are located. The total number of those earthquakes, summed month by month, is correlated with the modulus. After removing the seasonal effect, our results show that the modulus at periods of 24, 12, and 8 hr reveals a notable change that seems to be related to the total number of events within the whole study period. One possible precursor is found 6 months prior to the 1999 high seismicity. The modulus at periods of 24, 12, and 8 hr increased gradually from the beginning of 1999 to August 1999. After the earthquake occurred, the modulus decreased again to a normal level. We propose that this notable increase might be related to a preparation process for this strong earthquake.
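
    Complex demodulation, in its textbook form, shifts the period of interest down to zero frequency by multiplying the series by a complex exponential, low-pass filters the result, and reads the time-varying amplitude from the modulus. The sketch below follows that outline for an evenly sampled series. The moving-average filter, its default window of one period, and the synthetic test signal are assumptions made for illustration; the abstract does not state which filter or sampling the authors used.

```python
import cmath
import math


def complex_demodulate(series, period, window=None):
    """Amplitude (modulus) of the component with the given period.

    series : evenly spaced samples
    period : target period in samples (e.g. 24 for hourly data and 24 hr)
    window : moving-average length in samples; defaults to one full period
             (an assumed, deliberately simple low-pass filter)
    """
    if window is None:
        window = int(period)
    omega = 2 * math.pi / period
    # Shift the target frequency to zero.
    shifted = [x * cmath.exp(-1j * omega * t) for t, x in enumerate(series)]
    modulus = []
    for start in range(len(shifted) - window + 1):
        mean = sum(shifted[start:start + window]) / window
        # The factor 2 restores the amplitude of a real-valued cosine.
        modulus.append(2 * abs(mean))
    return modulus


# Synthetic hourly series: a 24 hr cosine of amplitude 3 (hypothetical data).
hourly = [3 * math.cos(2 * math.pi * t / 24 + 0.4) for t in range(96)]
amplitude = complex_demodulate(hourly, period=24)
```

    For this synthetic input every value of `amplitude` is close to 3. With observatory data, the same function could be called with `period` set to 24, 12, and 8 hr (in samples) to obtain the three modulus series the abstract compares with monthly earthquake counts.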