2 research outputs found

    What do we know about the big data researches? A systematic review from 2011 to 2017

    Get PDF
    Big data are defined as a new phenomenon that can be novel step for improving social life and business condition. Analysing the big data’s researches to extract insights by systematic literature review is the main objective of this research. For synthesis systematically, data from 123 articles are extracted and kinds of studies that were usually done on big data area are investigated. The Systematic Review showed: the most studies were published in 2014, also the main journal that published big data’s article was ‘Big Data Research’ and country with highest investigate about big data were ‘United State and China’. Beside, most researches were done with analytic background. The main research method was experimental and major research type was case study. Our study proved that the majority of researches carried out around big data focused on data management, and most of them identify ‘volume and variety’ of as significant challenges of big data. Likewise, ‘business analytics’ was described in the major benefits

    A CCG virtual system for big data application communication costs analysis

    No full text
    Network topology and routing are two important factors in determining the communication costs of big data applications at large scale. As for a given Cluster, Cloud, or Grid system, the network topology is fixed and static or dynamic routing protocols are preinstalled to direct the network traffic. Users cannot change them once the system is deployed. Hence, it is hard for application developers to identify the optimal network topology and routing algorithm for their applications with distinct communication patterns. In this study, we design a CCG virtual system (CCGVS), which first uses container-based virtualization to allow users to create a farm of lightweight virtual machines on a single host. Then, it uses software-defined networking (SDN) technique to control the network traffic among these virtual machines. Users can change the network topology and control the network traffic programmingly, thereby enabling application developers to evaluate their applications on the same system with different network topologies and routing algorithms. The preliminary experimental results through both synthetic big data programs and NPB benchmarks have shown that CCGVS can represent application performance variations caused by network topology and routing algorithm
    corecore