Intelligent Computing for Big Data
Recent advances in artificial intelligence have the potential to further develop current big data research. The Special Issue on ‘Intelligent Computing for Big Data’ highlighted a number of recent studies related to the use of intelligent computing techniques in the processing of big data for text mining, autism diagnosis, behaviour recognition, and blockchain-based storage.
Intelligent Management and Efficient Operation of Big Data
This chapter details how Big Data can be used and implemented in networking
and computing infrastructures. Specifically, it addresses three main aspects:
the timely extraction of relevant knowledge from heterogeneous, and very often
unstructured large data sources, the enhancement of the performance of
processing and networking (cloud) infrastructures that are the most important
foundational pillars of Big Data applications or services, and novel ways to
efficiently manage network infrastructures with high-level composed policies
for supporting the transmission of large amounts of data with distinct
requisites (video vs. non-video). A case study involving an intelligent
management solution to route data traffic with diverse requirements in a wide
area Internet Exchange Point is presented, discussed in the context of Big
Data, and evaluated.
Comment: In book Handbook of Research on Trends and Future Directions in Big
Data and Web Intelligence, IGI Global, 201
NEARBY Platform: Algorithm for Automated Asteroids Detection in Astronomical Images
In the past two decades an increasing interest in discovering Near Earth
Objects has been noted in the astronomical community. Dedicated surveys have
been operated for data acquisition and processing, resulting in the present
discovery of over 18,000 objects that are within 30 million miles of
Earth. Nevertheless, recent events have shown that there still are many
undiscovered asteroids that can be on a collision course with Earth. This article
presents an original NEO detection algorithm developed in the NEARBY research
object, that has been integrated into an automated MOPS processing pipeline
aimed at identifying moving space objects based on the blink method. The proposed
solution can be considered a Big Data processing and analysis approach,
implementing visual analytics techniques for rapid human data validation.
Comment: IEEE 14th International Conference on Intelligent Computer
Communication and Processing (ICCP), Sep 6-8, 2018, Cluj-Napoca, Romania
A Large-scale Distributed Video Parsing and Evaluation Platform
Visual surveillance systems have become one of the largest data sources of
Big Visual Data in the real world. However, existing systems for video analysis
still lack scalability and extensibility and remain error-prone,
though great advances have been achieved in a number of visual
recognition tasks and surveillance applications, e.g., pedestrian/vehicle
detection, people/vehicle counting. Moreover, few algorithms explore the
specific values/characteristics in large-scale surveillance videos. To address
these problems in large-scale video analysis, we develop a scalable video
parsing and evaluation platform through combining some advanced techniques for
Big Data processing, including Spark Streaming, Kafka and Hadoop Distributed
Filesystem (HDFS). Also, a Web User Interface is designed in the system, to
collect users' degrees of satisfaction on the recognition tasks so as to
evaluate the performance of the whole system. Furthermore, the highly
extensible platform running on the long-term surveillance videos makes it
possible to develop more intelligent incremental algorithms to enhance the
performance of various visual recognition tasks.
Comment: Accepted by Chinese Conference on Intelligent Visual Surveillance
201
Marketing relations and communication infrastructure development in the banking sector based on big data mining
Purpose: The article aims to study the methodological tools for applying big data mining technologies in the modern digital space, the further implementation of which can become the basis for implementing the marketing relations concept in the banking sector of the Russian Federation's economy. Structure/Methodology/Approach: To develop marketing relations in the banking sector in the digital economy, it seems necessary: firstly, to identify the opportunities and advantages of big data mining in banking marketing; secondly, to identify the sources and methods of processing big data; thirdly, to study examples of the successful use of big data mining by Russian banks and to formulate recommendations on implementing big data technologies in the digital marketing strategy of banks. Findings: The authors' analysis showed that processing open online and offline sources of information with big data technologies significantly increases the amount of data available for intelligent analysis, as a result of which the interaction between the bank and the target client reaches a new level of partnership. Practical Implications: The conclusions and generalizations of the study can be applied in the practice of managing financial institutions. The results of the study can be used by bank management to form a digital marketing strategy for long-term communication. Originality/Value: The main contribution of this study is that the authors have identified the main directions of using big data in relationship marketing to generate additional profit, as well as the possibility of intelligent analysis of the client base, aimed at expanding market share and retaining customers in the banking sector of the economy.
peer-reviewed
Big Data in Smart-Cities: Current Research and Challenges
Smart-cities are an emerging paradigm containing heterogeneous network infrastructure, ubiquitous sensing devices, big-data processing and intelligent control systems. Their primary aim is to improve the quality of life of the citizens by providing intelligent services in a wide variety of aspects like transportation, healthcare, entertainment, environment, and energy. In order to provide such services, the role of big-data and its analysis is extremely important, as it makes it possible to obtain valuable insights into the large data generated by the smart-cities. In this article, we investigate the state-of-the-art research efforts directed towards big-data analytics in a smart-city context. Specifically, we first present a big-data centric taxonomy for the smart-cities to bring forth a generic overview of the importance of the big-data paradigm in a smart-city environment. This is followed by a top-level snapshot of the commonly used big-data analytical platforms. Due to the heterogeneity of data being collected by the smart-cities, often with conflicting processing requirements, suitable analytical techniques depending upon the data type are also suggested. In addition, a generic four-tier big-data framework comprising the sensing hub, storage hub, processing hub and application hub is proposed that can be applied in any smart-city context. This is complemented by the common big-data applications in a smart-city and a presentation of ten selected case studies of smart-cities across the globe. Finally, the open challenges are highlighted in order to give future research directions.
Apache Mahout’s k-Means vs. fuzzy k-Means performance evaluation
(c) 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
The emergence of Big Data as a disruptive technology for the next generation of intelligent systems has raised many issues of how to extract and make use of the knowledge obtained from the data within short times, on a limited budget and under high rates of data generation. The foremost challenge identified here is data processing, and especially mining and analysis for knowledge extraction. As the 'old' data mining frameworks were designed without Big Data requirements, a new generation of such frameworks is being developed, fully implemented in Cloud platforms. One such framework is Apache Mahout, aimed at leveraging fast processing and analysis of Big Data. The performance of such new data mining frameworks is yet to be evaluated and potential limitations are to be revealed. In this paper we analyse the performance of Apache Mahout using large real data sets from the Twitter stream. We exemplify the analysis for the case of two clustering algorithms, namely k-Means and Fuzzy k-Means, using a Hadoop cluster infrastructure for the experimental study.
Peer Reviewed
Postprint (author's final draft)
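As a rough illustration of how the two clustering algorithms named in this abstract differ, the sketch below contrasts the hard assignment used by k-Means with the graded memberships used by Fuzzy k-Means. It is a minimal sketch in plain Python and does not use Apache Mahout or Hadoop; the function names, the example centroids and the fuzziness parameter `m = 2.0` are assumptions chosen for illustration, and the membership rule is the standard textbook fuzzy c-means formula, not code taken from the paper.

```python
import math


def hard_assignment(point, centroids):
    """k-Means style: return the index of the single nearest centroid."""
    distances = [math.dist(point, c) for c in centroids]
    return distances.index(min(distances))


def fuzzy_memberships(point, centroids, m=2.0):
    """Fuzzy k-Means style: return a membership degree per cluster.

    Uses the textbook rule u_i = 1 / sum_j (d_i / d_j) ** (2 / (m - 1)),
    where m > 1 is the (assumed) fuzziness parameter. Degrees sum to 1.
    """
    distances = [math.dist(point, c) for c in centroids]
    # A point lying exactly on a centroid belongs fully to that cluster.
    if 0.0 in distances:
        hit = distances.index(0.0)
        return [1.0 if i == hit else 0.0 for i in range(len(centroids))]
    exponent = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_i / d_j) ** exponent for d_j in distances)
        for d_i in distances
    ]


# Hypothetical data: two centroids and one point closer to the first.
centroids = [(0.0, 0.0), (4.0, 0.0)]
point = (1.0, 0.0)

label = hard_assignment(point, centroids)          # one cluster only
memberships = fuzzy_memberships(point, centroids)  # a degree per cluster
print(label, memberships)
```

With k-Means the point is attributed to exactly one cluster, whereas Fuzzy k-Means keeps a weight for every cluster, which is one reason the two algorithms can be expected to differ in per-iteration cost on large data sets.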