2,041 research outputs found
Efficient Multi-way Theta-Join Processing Using MapReduce
Multi-way Theta-join queries are powerful in describing complex relations and
therefore widely employed in real practices. However, existing solutions from
traditional distributed and parallel databases for multi-way Theta-join queries
cannot be easily extended to fit a shared-nothing distributed computing
paradigm, which is proven to be able to support OLAP applications over immense
data volumes. In this work, we study the problem of efficient processing of
multi-way Theta-join queries using MapReduce from a cost-effective perspective.
Although there have been some works using the (key,value) pair-based
programming model to support join operations, efficient processing of multi-way
Theta-join queries has never been fully explored. The substantial challenge
lies in, given a number of processing units (that can run Map or Reduce tasks),
mapping a multi-way Theta-join query to a number of MapReduce jobs and having
them executed in a well scheduled sequence, such that the total processing time
span is minimized. Our solution mainly includes two parts: 1) cost metrics for
both single MapReduce job and a number of MapReduce jobs executed in a certain
order; 2) the efficient execution of a chain-typed Theta-join with only one
MapReduce job. Comparing with the query evaluation strategy proposed in [23]
and the widely adopted Pig Latin and Hive SQL solutions, our method achieves
significant improvement of the join processing efficiency.Comment: VLDB201
Analyses and Concerns in Precision Medicine: A Statistical Perspective
This article explores the critical role of statistical analysis in precision
medicine. It discusses how personalized healthcare is enhanced by statistical
methods that interpret complex, multidimensional datasets, focusing on
predictive modeling, machine learning algorithms, and data visualization
techniques. The paper addresses challenges in data integration and
interpretation, particularly with diverse data sources like electronic health
records (EHRs) and genomic data. It also delves into ethical considerations
such as patient privacy and data security. In addition, the paper highlights
the evolution of statistical analysis in medicine, core statistical
methodologies in precision medicine, and future directions in the field,
emphasizing the integration of artificial intelligence (AI) and machine
learning (ML)
A Comparative Study of Supergrid and Superblock Urban Structure in China and Japan Rethinking the Chinese Superblocks: Learning from Japanese Experience
The Supergrid and Superblock together constitute a grid-and-cell urban structure that is especially evident in China and Japan. The Supergrid is a large-scale net of wide roads that defines a series of cells or Superblocks, each containing a network of narrower streets. While common in both countries, there are no comparative morphological studies. As a crucial contribution to urban design, this thesis places the structures in their cultural contexts and examines them against a group of post-1960 theories that focus on interrelationships between urban structure and functions. Here, they are synthesized as âInterconnection theoryâ and the source of qualitative and quantitative methods (including Space Syntax) used to examine form-function interrelationships by understanding levels of Integration, Connection and Interaction in two Superblocks in each country. Particular emphasis is on the relationships between street networks and distribution of functions/activities. Primary research findings indicate that 1) Supergrid/Superblock systems are strongly rooted in Eastern culture, with Supergrid systems providing multi-directional global movement across wide urban areas in both countries. 2) However, the work reveals different types of street structures and functional patterns within Superblocks, with (Chinese) âwallâ and (Japanese) âfloorâ spatial conceptions underlying differences. 3) These differing internal structures within the Superblocks have a deterministic impact on the spatial distribution of human activities. Clear but divergent patterns are displayed in the Superblocks with strong interrelationships between the street network and distribution of activities in the Japanese cases but less distinct ones in the Chinese: this is linked to Chinaâs wall and gate structure that is absent in Japan
Low-carbon scenario analysis on urban transport of one metropolitan in China in 2020
Purpose: This paper discussed possible ways of implementing effective energy conservation and GHG emission reduction measures by providing: the forecasts of mid-to-long term city-wide carbon emission rate; and the analysis of potential low-carbon transport solutions.
Design/methodology/approach: According to the characteristics of the transport system in Beijing, based on the review and application analysis of existing transport energy and GHG emission calculation models, the comprehensive carbon emission calculation model established. Existing data were utilized with regression analysis to project the prospective traffic data in the baseline scenario at the target year of 2020 to calculate the emission amount. Four low-carbon scenarios were set in accordance with the goal of âlow carbon transportation, green tripâ, and the effectiveness of each low-carbon scenario was evaluated by comparing them with the baseline scenario in terms of the respective GHG emission rate.
Findings: Under the current developing trend in policy environment and technical specifications, the total projected GHG (CO2) emissions from transport sector at 2020 in Beijing will reach 24.69 million t CO2; private-vehicle is the major contributor among all transport modes at 15.96 million t CO2.
Practical implications: Limiting the growth in private-vehicle ownership, reducing the frequency of mid-to-long range travel and the average trip distance, and prompting the public transit oriented policies are all possible solutions to reduce carbon emission. The most effective practice involves a shift in public travel behavior.
Originality/value: This paper presents a method to forecast the mid-to-long term city-wide carbon emission rate; and provides some potential low-carbon transport solutions.Peer Reviewe
Analysis of frequent trading effects of various machine learning models
In recent years, high-frequency trading has emerged as a crucial strategy in
stock trading. This study aims to develop an advanced high-frequency trading
algorithm and compare the performance of three different mathematical models:
the combination of the cross-entropy loss function and the quasi-Newton
algorithm, the FCNN model, and the vector machine. The proposed algorithm
employs neural network predictions to generate trading signals and execute buy
and sell operations based on specific conditions. By harnessing the power of
neural networks, the algorithm enhances the accuracy and reliability of the
trading strategy. To assess the effectiveness of the algorithm, the study
evaluates the performance of the three mathematical models. The combination of
the cross-entropy loss function and the quasi-Newton algorithm is a widely
utilized logistic regression approach. The FCNN model, on the other hand, is a
deep learning algorithm that can extract and classify features from stock data.
Meanwhile, the vector machine is a supervised learning algorithm recognized for
achieving improved classification results by mapping data into high-dimensional
spaces. By comparing the performance of these three models, the study aims to
determine the most effective approach for high-frequency trading. This research
makes a valuable contribution by introducing a novel methodology for
high-frequency trading, thereby providing investors with a more accurate and
reliable stock trading strategy
Constructing entanglement witnesses for infinite-dimensional systems
It is shown that, every entangled state in an infinite-dimensional composite
system has a simple entanglement witness of the form with
a nonnegative number and a finite rank self-adjoint operator. We also
provide two methods of constructing entanglement witness and apply them to
obtain some entangled states that cannot be detected by the PPT criterion and
the realignment criterion.Comment: 15 page
- âŠ