2,041 research outputs found

    Efficient Multi-way Theta-Join Processing Using MapReduce

    Multi-way Theta-join queries are powerful in describing complex relations and therefore widely employed in practice. However, existing solutions from traditional distributed and parallel databases for multi-way Theta-join queries cannot be easily extended to fit a shared-nothing distributed computing paradigm, which is proven to be able to support OLAP applications over immense data volumes. In this work, we study the problem of efficient processing of multi-way Theta-join queries using MapReduce from a cost-effective perspective. Although there have been some works using the (key,value) pair-based programming model to support join operations, efficient processing of multi-way Theta-join queries has never been fully explored. The substantial challenge lies in, given a number of processing units (that can run Map or Reduce tasks), mapping a multi-way Theta-join query to a number of MapReduce jobs and having them executed in a well-scheduled sequence, such that the total processing time span is minimized. Our solution mainly includes two parts: 1) cost metrics for both a single MapReduce job and a number of MapReduce jobs executed in a certain order; 2) the efficient execution of a chain-typed Theta-join with only one MapReduce job. Compared with the query evaluation strategy proposed in [23] and the widely adopted Pig Latin and Hive SQL solutions, our method achieves significant improvement in join processing efficiency. Comment: VLDB201

    Analyses and Concerns in Precision Medicine: A Statistical Perspective

    This article explores the critical role of statistical analysis in precision medicine. It discusses how personalized healthcare is enhanced by statistical methods that interpret complex, multidimensional datasets, focusing on predictive modeling, machine learning algorithms, and data visualization techniques. The paper addresses challenges in data integration and interpretation, particularly with diverse data sources like electronic health records (EHRs) and genomic data. It also delves into ethical considerations such as patient privacy and data security. In addition, the paper highlights the evolution of statistical analysis in medicine, core statistical methodologies in precision medicine, and future directions in the field, emphasizing the integration of artificial intelligence (AI) and machine learning (ML)

    A Comparative Study of Supergrid and Superblock Urban Structure in China and Japan Rethinking the Chinese Superblocks: Learning from Japanese Experience

    The Supergrid and Superblock together constitute a grid-and-cell urban structure that is especially evident in China and Japan. The Supergrid is a large-scale net of wide roads that defines a series of cells or Superblocks, each containing a network of narrower streets. While common in both countries, there are no comparative morphological studies. As a crucial contribution to urban design, this thesis places the structures in their cultural contexts and examines them against a group of post-1960 theories that focus on interrelationships between urban structure and functions. Here, they are synthesized as ‘Interconnection theory’ and the source of qualitative and quantitative methods (including Space Syntax) used to examine form-function interrelationships by understanding levels of Integration, Connection and Interaction in two Superblocks in each country. Particular emphasis is on the relationships between street networks and distribution of functions/activities. Primary research findings indicate that 1) Supergrid/Superblock systems are strongly rooted in Eastern culture, with Supergrid systems providing multi-directional global movement across wide urban areas in both countries. 2) However, the work reveals different types of street structures and functional patterns within Superblocks, with (Chinese) ‘wall’ and (Japanese) ‘floor’ spatial conceptions underlying differences. 3) These differing internal structures within the Superblocks have a deterministic impact on the spatial distribution of human activities. Clear but divergent patterns are displayed in the Superblocks with strong interrelationships between the street network and distribution of activities in the Japanese cases but less distinct ones in the Chinese: this is linked to China’s wall and gate structure that is absent in Japan

    Low-carbon scenario analysis on urban transport of one metropolitan in China in 2020

    Purpose: This paper discussed possible ways of implementing effective energy conservation and GHG emission reduction measures by providing forecasts of the mid-to-long term city-wide carbon emission rate and an analysis of potential low-carbon transport solutions. Design/methodology/approach: According to the characteristics of the transport system in Beijing, and based on a review and application analysis of existing transport energy and GHG emission calculation models, a comprehensive carbon emission calculation model was established. Existing data were utilized with regression analysis to project the prospective traffic data in the baseline scenario at the target year of 2020 to calculate the emission amount. Four low-carbon scenarios were set in accordance with the goal of “low carbon transportation, green trip”, and the effectiveness of each low-carbon scenario was evaluated by comparing it with the baseline scenario in terms of the respective GHG emission rate. Findings: Under the current developing trend in policy environment and technical specifications, the total projected GHG (CO2) emissions from the transport sector in Beijing in 2020 will reach 24.69 million t CO2; private vehicles are the major contributor among all transport modes at 15.96 million t CO2. Practical implications: Limiting the growth in private-vehicle ownership, reducing the frequency of mid-to-long range travel and the average trip distance, and promoting public transit oriented policies are all possible solutions to reduce carbon emission. The most effective practice involves a shift in public travel behavior. Originality/value: This paper presents a method to forecast the mid-to-long term city-wide carbon emission rate and provides some potential low-carbon transport solutions. Peer Reviewed

    Analysis of frequent trading effects of various machine learning models

    In recent years, high-frequency trading has emerged as a crucial strategy in stock trading. This study aims to develop an advanced high-frequency trading algorithm and compare the performance of three different mathematical models: the combination of the cross-entropy loss function and the quasi-Newton algorithm, the FCNN model, and the vector machine. The proposed algorithm employs neural network predictions to generate trading signals and execute buy and sell operations based on specific conditions. By harnessing the power of neural networks, the algorithm enhances the accuracy and reliability of the trading strategy. To assess the effectiveness of the algorithm, the study evaluates the performance of the three mathematical models. The combination of the cross-entropy loss function and the quasi-Newton algorithm is a widely utilized logistic regression approach. The FCNN model, on the other hand, is a deep learning algorithm that can extract and classify features from stock data. Meanwhile, the vector machine is a supervised learning algorithm recognized for achieving improved classification results by mapping data into high-dimensional spaces. By comparing the performance of these three models, the study aims to determine the most effective approach for high-frequency trading. This research makes a valuable contribution by introducing a novel methodology for high-frequency trading, thereby providing investors with a more accurate and reliable stock trading strategy

    Constructing entanglement witnesses for infinite-dimensional systems

    It is shown that every entangled state in an infinite-dimensional composite system has a simple entanglement witness of the form $\alpha I + T$, with $\alpha$ a nonnegative number and $T$ a finite-rank self-adjoint operator. We also provide two methods of constructing entanglement witnesses and apply them to obtain some entangled states that cannot be detected by the PPT criterion and the realignment criterion. Comment: 15 pages
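
    For readers unfamiliar with the term, the witness condition can be written out as below. This is the standard textbook definition applied to the form stated in the abstract, not a formula quoted from the paper; the symbols $W$, $\rho$ and $\sigma$ are notation assumed here for illustration.

    ```latex
    % Assumed notation: W is the witness, \rho the entangled state being
    % detected, \sigma an arbitrary separable state of the composite system.
    \[
      W = \alpha I + T, \qquad \alpha \ge 0, \quad T = T^{\dagger}, \quad \operatorname{rank} T < \infty,
    \]
    \[
      \operatorname{Tr}(W\sigma) \ge 0 \ \text{ for every separable state } \sigma,
      \qquad
      \operatorname{Tr}(W\rho) < 0 .
    \]
    ```

    Since $\operatorname{Tr}(\rho) = 1$, the second condition reads $\alpha + \operatorname{Tr}(T\rho) < 0$, so detection depends only on the expectation value of the finite-rank part $T$.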