154 research outputs found

Performance Evaluation Between HarperDB, Mongo DB and PostgreSQL

Author: Abbasi Maryam
Cardoso Filipe
Figueiredo Figueiredo
Martins Pedro
Rebelo João
Rodrigues Ricardo
Saraiva Goncalo
Wanzeller Cristina
Publication venue: Springer
Publication date: 01/01/2023

Several modern-day problems, such as information overload and big data, require handling large amounts of data. To meet application requirements such as performance and consistency, more and more systems are adapting to these specificities. Processing massive data has become an issue for existing relational database management systems (RDBMSs), because these databases do not cope with data at that scale. NoSQL database management systems make processing massive and/or unstructured data easier because they store data as key-value pairs, collections, or documents instead of tables. Many companies today tend to start a project using NoSQL. HarperDB, however, aims to offer both a relational and a non-relational DBMS, allowing developers to choose between different solutions. This paper aims to show the most relevant differences between HarperDB, MongoDB and PostgreSQL and to compare their performance. Preliminary results show that PostgreSQL performs better with structured data, but HarperDB can integrate NoSQL and SQL, which can be a significant advantage over the other solutions.

Repositório Cientifico do Instituto Politécnico de Santarém

Sustainable system design for gridded, spatio-temporal, agroecosystem forecasting models

Publication venue: 'Journal of Agricultural Informatics'

Crossref

Sustainable system design for gridded, spatio-temporal, agroecosystem forecasting models

Author: Baker Kathleen M.
Roehsner Paul
Publication venue: 'Journal of Agricultural Informatics'
Publication date: 01/01/2016

Repository of the Academy's Library

Optimizing Traffic Mobility Simulations: a Study on Data Processing and Storage Methods

Author: João António Cardoso Vieira e Basto de Sousa
Publication date: 17/07/2023

Repositório Aberto da Universidade do Porto

A comparative analysis of non-relational databases in e-commerce applications

Author: Plechawska-Wójcik Małgorzata
Rożek Grzegorz
Saweczko Kacper
Publication venue: Lublin University of Technology
Publication date: 30/09/2023

In this article, a comparative analysis of non-relational databases was conducted to determine the best database for e-commerce systems. The non-relational systems MongoDB and Apache Cassandra were studied, and the results were compared with a relational PostgreSQL database. The main research criterion was performance, measured as the execution time of several types of queries. To carry out the research, typical e-commerce databases were created and then tested in a .NET test application written by the authors. In addition, the difference in community support between non-relational and relational systems was determined. The research showed that MongoDB is best suited for e-commerce systems.

Lublin University of Technology Journals

MOBANA: A distributed stream-based information system for public transit

Author: Liu Dongmeng
Liu Kaixu
Ma Tianyi
Motta Gianmario Piero Antonio
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016

Abstract: Public transit generates a wide range of diverse data, including static data and high-velocity data streams from sensors. Integrating and processing this big real-time data is a challenge in developing analytical systems for public transit. We propose MOBANA (MOBility ANAlyzer), a distributed stream-based system that provides real-time information to a wide range of users for monitoring and analyzing the performance of public transit. To do so, MOBANA integrates the diverse data sources of public transit and converts them into standard, exchangeable data formats. To manage such diverse data, we propose a layered architecture in which each layer handles a specific kind of data. MOBANA is designed to be efficient; for example, it identifies the real-time position of vehicles by adjusting the planned position with real-time data as needed, thus reducing network load. MOBANA is implemented with a Distributed Stream Processing Engine (DSPE) and a Distributed Messaging System (DMS), which aim for scalable, efficient and reliable real-time processing and analytics. MOBANA was deployed as a pilot in Pavia and tested with real data.

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

A Data-driven Methodology Towards Mobility- and Traffic-related Big Spatiotemporal Data Frameworks

Author: Figueiras Paulo Alves
Publication date: 01/12/2021

The human population is increasing at unprecedented rates, particularly in urban areas. This increase, along with the rise of a more economically empowered middle class, brings new and complex challenges to the mobility of people within urban areas. To tackle such challenges, transportation and mobility authorities and operators are trying to adopt innovative Big Data-driven mobility- and traffic-related solutions. Such solutions will help decision-making processes that aim to ease the load on an already overloaded transport infrastructure. The information collected from day-to-day mobility and traffic can help to mitigate some of these mobility challenges in urban areas. Road infrastructure and traffic management operators (RITMOs) face several limitations in effectively extracting value from the exponentially growing volumes of mobility- and traffic-related Big Spatiotemporal Data (MobiTrafficBD) that are being acquired and gathered. Research on Big Data, Spatiotemporal Data and especially MobiTrafficBD is scattered, and existing literature does not offer a concrete, common methodological approach to set up, configure, deploy and use a complete Big Data-based framework to manage the lifecycle of mobility-related spatiotemporal data, mainly focused on geo-referenced time series (GRTS) and spatiotemporal events (ST Events), extract value from it and support the decision-making processes of RITMOs. This doctoral thesis proposes a data-driven, prescriptive methodological approach towards the design, development and deployment of MobiTrafficBD Frameworks focused on GRTS and ST Events.
Besides a thorough literature review on Spatiotemporal Data, Big Data and the merging of these two fields through MobiTrafficBD, the methodological approach comprises a set of general characteristics, technical requirements, logical components, data flows and technological infrastructure models, as well as guidelines and best practices that aim to guide researchers, practitioners and stakeholders, such as RITMOs, throughout the design, development and deployment phases of any MobiTrafficBD Framework. This work is intended to be a supporting methodological guide, based on widely used Reference Architectures and guidelines for Big Data, but enriched with inherent characteristics and concerns brought about by Big Spatiotemporal Data, such as in the case of GRTS and ST Events. The proposed methodology was evaluated and demonstrated in various real-world use cases that deployed MobiTrafficBD-based Data Management, Processing, Analytics and Visualisation methods, tools and technologies, under the umbrella of several research projects funded by the European Commission and the Portuguese Government.

Repositório da Universidade Nova de Lisboa

Transformation of Matlab analysis programs into database-supported evaluations of SQL and NoSQL database systems

Author: Voigt Jakob
Publication date: 11/02/2020

Universität Rostock, Lehrstuhl Datenbank- und Informationssysteme: Dbis Repository

Performance Comparison of Connection Pooling and Load Balancing Features on PostgreSQL Databases (Perbandingan Performa Fitur Connection Pooling dan Load Balancing pada Database PostgreSQL)

Author: Sondi Sondi
Publication date: 01/01/2023

This study is motivated by the needs of Company X's PostgreSQL database environment regarding connection pooling, the management and termination of idle connections, and excessive resource usage on replicated database servers. These problems are addressed by connection pooling and load balancing. Connection pooling uses PGBOUNCER or PGPOOL-II. The effectiveness of connection pooling and load balancing is tested using transactions per second (TPS) and connection latency data across scenarios that combine PGPOOL-II and PGBOUNCER. To find the best implementation of connection pooling and load balancing among these combinations, an asynchronously replicated PostgreSQL database environment was set up and three scenarios involving PGPOOL-II and PGBOUNCER were tested. Each scenario was tested with three different client counts (900, 500 and 100) using the pgbench tool. Note that only SELECT queries were load balanced. The best scenario was found to be using PGBOUNCER for connection pooling and PGPOOL-II for load balancing only, without enabling PGPOOL-II's connection pooling feature. This scenario had the lowest latency and the highest TPS for every client count. Across the different client counts, its latency was 14% lower and its TPS 15% higher than in the other scenarios. It is therefore recommended that Company X's database environment use the combination of PGBOUNCER for connection pooling and PGPOOL-II for load balancing.

UMN Knowledge Center