3,466 research outputs found
Buffer Pool Aware Query Scheduling via Deep Reinforcement Learning
In this extended abstract, we propose a new technique for query scheduling
with the explicit goal of reducing disk reads and thus implicitly increasing
query performance. We introduce \system, a learned scheduler that leverages
overlapping data reads among incoming queries and learns a scheduling strategy
that improves cache hits. \system relies on deep reinforcement learning to
produce workload-specific scheduling strategies that focus on long-term
performance benefits while being adaptive to previously-unseen data access
patterns. We present results from a proof-of-concept prototype, demonstrating
that learned schedulers can offer significant performance improvements over
hand-crafted scheduling heuristics. Ultimately, we make the case that this is a
promising research direction in the intersection of machine learning and
databases
Critical Management Issues for Implementing RFID in Supply Chain Management
The benefits of radio frequency identification (RFID) technology in the supply chain are fairly compelling. It has the potential to revolutionise the efficiency, accuracy and security of the supply chain with significant impact on overall profitability. A number of companies are actively involved in testing and adopting this technology. It is estimated that the market for RFID products and services will increase significantly in the next few years. Despite this trend, there are major impediments to RFID adoption in supply chain. While RFID systems have been around for several decades, the technology for supply chain management is still emerging. We describe many of the challenges, setbacks and barriers facing RFID implementations in supply chains, discuss the critical issues for management and offer some suggestions. In the process, we take an in-depth look at cost, technology, standards, privacy and security and business process reengineering related issues surrounding RFID technology in supply chains
A Schedule of Duties in the Cloud Space Using a Modified Salp Swarm Algorithm
Cloud computing is a concept introduced in the information technology era,
with the main components being the grid, distributed, and valuable computing.
The cloud is being developed continuously and, naturally, comes up with many
challenges, one of which is scheduling. A schedule or timeline is a mechanism
used to optimize the time for performing a duty or set of duties. A scheduling
process is accountable for choosing the best resources for performing a duty.
The main goal of a scheduling algorithm is to improve the efficiency and
quality of the service while at the same time ensuring the acceptability and
effectiveness of the targets. The task scheduling problem is one of the most
important NP-hard issues in the cloud domain and, so far, many techniques have
been proposed as solutions, including using genetic algorithms (GAs), particle
swarm optimization, (PSO), and ant colony optimization (ACO). To address this
problem, in this paper, one of the collective intelligence algorithms, called
the Salp Swarm Algorithm (SSA), has been expanded, improved, and applied. The
performance of the proposed algorithm has been compared with that of GAs, PSO,
continuous ACO, and the basic SSA. The results show that our algorithm has
generally higher performance than the other algorithms. For example, compared
to the basic SSA, the proposed method has an average reduction of approximately
21% in makespan.Comment: 15 pages, 6 figures, 2023 IFIP International Internet of Things
Conference. Dallas-Fort Worth Metroplex, Texas, US
FedCSD: A Federated Learning Based Approach for Code-Smell Detection
This paper proposes a Federated Learning Code Smell Detection (FedCSD)
approach that allows organizations to collaboratively train federated ML models
while preserving their data privacy. These assertions have been supported by
three experiments that have significantly leveraged three manually validated
datasets aimed at detecting and examining different code smell scenarios. In
experiment 1, which was concerned with a centralized training experiment,
dataset two achieved the lowest accuracy (92.30%) with fewer smells, while
datasets one and three achieved the highest accuracy with a slight difference
(98.90% and 99.5%, respectively). This was followed by experiment 2, which was
concerned with cross-evaluation, where each ML model was trained using one
dataset, which was then evaluated over the other two datasets. Results from
this experiment show a significant drop in the model's accuracy (lowest
accuracy: 63.80\%) where fewer smells exist in the training dataset, which has
a noticeable reflection (technical debt) on the model's performance. Finally,
the last and third experiments evaluate our approach by splitting the dataset
into 10 companies. The ML model was trained on the company's site, then all
model-updated weights were transferred to the server. Ultimately, an accuracy
of 98.34% was achieved by the global model that has been trained using 10
companies for 100 training rounds. The results reveal a slight difference in
the global model's accuracy compared to the highest accuracy of the centralized
model, which can be ignored in favour of the global model's comprehensive
knowledge, lower training cost, preservation of data privacy, and avoidance of
the technical debt problem.Comment: 17 pages, 7 figures, Journal pape
Bao: Learning to Steer Query Optimizers
Query optimization remains one of the most challenging problems in data
management systems. Recent efforts to apply machine learning techniques to
query optimization challenges have been promising, but have shown few practical
gains due to substantive training overhead, inability to adapt to changes, and
poor tail performance. Motivated by these difficulties and drawing upon a long
history of research in multi-armed bandits, we introduce Bao (the BAndit
Optimizer). Bao takes advantage of the wisdom built into existing query
optimizers by providing per-query optimization hints. Bao combines modern tree
convolutional neural networks with Thompson sampling, a decades-old and
well-studied reinforcement learning algorithm. As a result, Bao automatically
learns from its mistakes and adapts to changes in query workloads, data, and
schema. Experimentally, we demonstrate that Bao can quickly (an order of
magnitude faster than previous approaches) learn strategies that improve
end-to-end query execution performance, including tail latency. In cloud
environments, we show that Bao can offer both reduced costs and better
performance compared with a sophisticated commercial system
- …