Search CORE

8,883 research outputs found

Finding Top-k Dominance on Incomplete Big Data Using Map-Reduce Framework

Author: Ezatpoor Payam
Publication venue: Digital Scholarship@UNLV
Publication date: 01/05/2017
Field of study

Incomplete data is one major kind of multi-dimensional dataset that has random-distributed missing nodes in its dimensions. It is very difficult to retrieve information from this type of dataset when it becomes huge. Finding top-k dominant values in this type of dataset is a challenging procedure. Some algorithms are present to enhance this process but are mostly efficient only when dealing with a small-size incomplete data. One of the algorithms that make the application of TKD query possible is the Bitmap Index Guided (BIG) algorithm. This algorithm strongly improves the performance for incomplete data, but it is not originally capable of finding top-k dominant values in incomplete big data, nor is it designed to do so. Several other algorithms have been proposed to find the TKD query, such as Skyband Based and Upper Bound Based algorithms, but their performance is also questionable. Algorithms developed previously were among the first attempts to apply TKD query on incomplete data; however, all these had weak performances or were not compatible with the incomplete data. This thesis proposes MapReduced Enhanced Bitmap Index Guided Algorithm (MRBIG) for dealing with the aforementioned issues. MRBIG uses the MapReduce framework to enhance the performance of applying top-k dominance queries on huge incomplete datasets. The proposed approach uses the MapReduce parallel computing approach using multiple computing nodes. The framework separates the tasks between several computing nodes that independently and simultaneously work to find the result. This method has achieved up to two times faster processing time in finding the TKD query result in comparison to previously presented algorithms

University of Nevada, Las Vegas Repository

GraphCombEx: A Software Tool for Exploration of Combinatorial Optimisation Properties of Large Graphs

Author: A Rosete-Suárez
AL Barabási
C Bachmaier
C Binucci
D Brélaz
D Chalupa
D Chalupa
D Chalupa
D Chalupa
D Holten
David Chalupa
DJ Watts
DS Johnson
F Schreiber
G Csardi
I Xenarios
I Xenarios
I Xenarios
J Ellson
J Leskovec
J Leskovec
J Pattillo
JC Culberson
JS Turner
K Sugiyama
Ken A Hawick
L Salwinski
LM Abualigah
LM Abualigah
MEJ Newman
MM Halldórsson
MR Garey
MY Becker
P Bonami
P Csermely
R Albert
R Tamassia
U Brandes
U Brandes
V Chvátal
W Czech
Y Khosiawan
Z Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/01/2018
Field of study

We present a prototype of a software tool for exploration of multiple combinatorial optimisation problems in large real-world and synthetic complex networks. Our tool, called GraphCombEx (an acronym of Graph Combinatorial Explorer), provides a unified framework for scalable computation and presentation of high-quality suboptimal solutions and bounds for a number of widely studied combinatorial optimisation problems. Efficient representation and applicability to large-scale graphs and complex networks are particularly considered in its design. The problems currently supported include maximum clique, graph colouring, maximum independent set, minimum vertex clique covering, minimum dominating set, as well as the longest simple cycle problem. Suboptimal solutions and intervals for optimal objective values are estimated using scalable heuristics. The tool is designed with extensibility in mind, with the view of further problems and both new fast and high-performance heuristics to be added in the future. GraphCombEx has already been successfully used as a support tool in a number of recent research studies using combinatorial optimisation to analyse complex networks, indicating its promise as a research software tool

arXiv.org e-Print Archive

Crossref

VBN

Evolution of galaxy groups in the Illustris simulation

Author: Dariush Ali A.
Khosroshahi Habib G.
Raouf Mojtaba
Publication venue: 'American Astronomical Society'
Publication date: 01/01/2016
Field of study

We present the first study of evolution of galaxy groups in the Illustris simulation. We focus on dynamically relaxed and unrelaxed galaxy groups representing dynamically evolved and evolving galaxy systems, respectively. The evolutionary state of a group is probed from its luminosity gap and separation between the brightest group galaxy and the center of mass of the group members. We find that the Illustris simulation, over-produces large luminosity gap galaxy systems, known as fossil systems, in comparison to observations and the probed semi-analytical predictions. However, this simulation is equally successful in recovering the correlation between luminosity gap and luminosity centroid offset, in comparison to the probed semi-analytic model. We find evolutionary tracks based on luminosity gap which indicate that a large luminosity gap group is rooted in a small luminosity gap group, regardless of the position of the brightest group galaxy within the halo. This simulation helps, for the first time, to explore the black hole mass and its accretion rate in galaxy groups. For a given stellar mass of the brightest group galaxies, the black hole mass is larger in dynamically relaxed groups with a lower rate of mass accretion. We find this consistent with the latest observational studies of the radio activities in the brightest group galaxies in fossil groups. We also find that the IGM in dynamically evolved groups is hotter for a given halo mass than that in evolving groups, again consistent with earlier observational studies.Comment: 10 pages, 10 figures. Accepted for publication in Ap

arXiv.org e-Print Archive

HAL-INSU

Truss Decomposition in Massive Networks

Author: Cheng James
Wang Jia
Publication venue
Publication date: 01/01/2012
Field of study

The k-truss is a type of cohesive subgraphs proposed recently for the study of networks. While the problem of computing most cohesive subgraphs is NP-hard, there exists a polynomial time algorithm for computing k-truss. Compared with k-core which is also efficient to compute, k-truss represents the "core" of a k-core that keeps the key information of, while filtering out less important information from, the k-core. However, existing algorithms for computing k-truss are inefficient for handling today's massive networks. We first improve the existing in-memory algorithm for computing k-truss in networks of moderate size. Then, we propose two I/O-efficient algorithms to handle massive networks that cannot fit in main memory. Our experiments on real datasets verify the efficiency of our algorithms and the value of k-truss.Comment: VLDB201

arXiv.org e-Print Archive

CiteSeerX

DR-NTU (Digital Repository of NTU)

The redshift evolution of the distribution of star formation among dark matter halos as seen in the infrared

Author: Amblard
Amblard
Behroozi
Behroozi
Behroozi
Berta
Bertone
Birnboim
Blain
Béthermin
Béthermin
Béthermin
Béthermin
Béthermin
Béthermin
Cai
Cantalupo
Cattaneo
Chabrier
Challinor
Chapman
Condon
Conroy
Cooray
Cooray
Daddi
Daddi
De Bernardis
Dekel
Dole
Dole
Eisenstein
Elbaz
Emanuele Daddi
Fakhouri
Farrah
Faucher-Giguère
Feruglio
Fiolet
Fixsen
Franceschini
Glenn
Grossan
Gruppioni
Guilaine Lagache
Hajian
Hall
Hanson
Hauser
Hauser
Hervé Aussel
Holder
Ilbert
Ilbert
Jauzac
Kennicutt
Kereš
Kim
Knox
Lagache
Lagache
Lagache
Lagache
Lapi
Larson
Le Borgne
Leauthaud
Limber
Lingyu Wang
Maddox
Magdis
Magliocchetti
Magliocchetti
Mark Sargent
Marsden
Marsden
Matthieu Béthermin
Mazin
More
Morgane Cousin
Moster
Navarro
Nguyen
Noeske
Oliver
Olivier Doré
Ostriker
Pannella
Papovich
Park
Patanchon
Planck Collaboration
Puget
Pénin
Pénin
Rahmati
Rodighiero
Salpeter
Sargent
Schechter
Shang
Silk
Somerville
Stark
Tacconi
Teplitz
Tinker
Tinker
Vale
Valiante
Viero
Viero
Wang
Watson
Weinmann
Xia
Zemcov
Publication venue: 'EDP Sciences'
Publication date: 01/01/2013
Field of study

Recent studies revealed a strong correlation between the star formation rate (SFR) and stellar mass of star-forming galaxies, the so-called star-forming main sequence. An empirical modeling approach (2-SFM) which distinguishes between the main sequence and rarer starburst galaxies is capable of reproducing most statistical properties of infrared galaxies. In this paper, we extend this approach by establishing a connection between stellar mass and halo mass with the technique of abundance matching. Based on a few, simple assumptions and a physically motivated formalism, our model successfully predicts the (cross-)power spectra of the cosmic infrared background (CIB), the cross-correlation between CIB and cosmic microwave background (CMB) lensing, and the correlation functions of bright, resolved infrared galaxies measured by Herschel, Planck, ACT and SPT. We use this model to infer the redshift distribution these observables, as well as the level of correlation between CIB-anisotropies at different wavelengths. We also predict that more than 90% of cosmic star formation activity occurs in halos with masses between 10^11.5 and 10^13.5 Msun. Taking into account subsequent mass growth of halos, this implies that the majority of stars were initially (at z>3) formed in the progenitors of clusters, then in groups at 0.5<z<3 and finally in Milky-Way-like halos at z<0.5. At all redshifts, the dominant contribution to the star formation rate density stems from halos of mass ~10^12 Msun, in which the instantaneous star formation efficiency is maximal (~70%). The strong redshift-evolution of SFR in the galaxies that dominate the CIB is thus plausibly driven by increased accretion from the cosmic web onto halos of this characteristic mass scale

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

HAL-INSU

Caltech Authors

HAL-CEA

Sussex Research Online

Hal-Diderot

Influence of baryonic physics in galaxy simulations: a semi-analytic treatment of the molecular component

Author: Combes F.
Halle A.
Publication venue: 'EDP Sciences'
Publication date: 09/08/2013
Field of study

Recent work in galaxy formation has enlightened the important role of baryon physics, to solve the main problems encountered by the standard theory at the galactic scale, such as the galaxy stellar mass functions, or the missing satellites problem. The present work aims at investigating in particular the role of the cold and dense molecular phase, which could play a role of gas reservoir in the outer galaxy discs, with low star formation efficiency. Through TreeSPH simulations, implementing the cooling to low temperatures, and the inclusion of the molecular hydrogen component, several feedback efficiencies are studied, and results on the gas morphology and star formation are obtained. It is shown that molecular hydrogen allows some slow star formation (with gas depletion times of about 5 Gyr) to occur in the outer parts of the discs. This dense and quiescent phase might be a way to store a significant fraction of dark baryons, in a relatively long time-scale, in the complete baryonic cycle, connecting the galaxy discs to hot gaseous haloes and to the cosmic filaments.Comment: Accepted for publication in Astronomy and Astrophysics, 21 pages, 29 figure

arXiv.org e-Print Archive

CiteSeerX

EDP Sciences OAI-PMH repository (1.2.0)

HAL-INSU

HAL-OBSPM

Hal-Diderot