252 research outputs found

    Parallel Construction of Wavelet Trees on Multicore Architectures

    Get PDF
    The wavelet tree has become a very useful data structure to efficiently represent and query large volumes of data in many different domains, from bioinformatics to geographic information systems. One problem with wavelet trees is their construction time. In this paper, we introduce two algorithms that reduce the time complexity of a wavelet tree's construction by taking advantage of nowadays ubiquitous multicore machines. Our first algorithm constructs all the levels of the wavelet in parallel in O(n)O(n) time and O(nlgσ+σlgn)O(n\lg\sigma + \sigma\lg n) bits of working space, where nn is the size of the input sequence and σ\sigma is the size of the alphabet. Our second algorithm constructs the wavelet tree in a domain-decomposition fashion, using our first algorithm in each segment, reaching O(lgn)O(\lg n) time and O(nlgσ+pσlgn/lgσ)O(n\lg\sigma + p\sigma\lg n/\lg\sigma) bits of extra space, where pp is the number of available cores. Both algorithms are practical and report good speedup for large real datasets.Comment: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sk{\l}odowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 69094

    Técnicas de indexación y recuperación de documentos utilizando referencias geográficas y textuales

    Get PDF
    [Resumen] Internet y la World Wide Web se han convertido en un enorme repositorio de información consultado diariamente por millones de usuarios. Además, otros repositorios de información, como las bases de datos documentales o las bibliotecas digitales, también han aumentado su popularidad considerablemente. Esto ha provocado que la recuperación de información se haya convertido en una de las áreas de investigación más importantes dentro de la informática. Aunque estos repositorios contienen información de distinta naturaleza, la información más habitual es de tipo textual. A menudo, en el texto de un documento se pueden encontrar referencias geográficas que permiten asignar a ese documento una zona del espacio en la cual es relevante. Los usuarios de los sistemas que enumerábamos demandan cada vez más servicios que les permitan situar la información recuperada en un mapa. Además, también está aumentando el interés en consultas que permitan recuperar documentos relevantes no sólo para un tema determinado sino también para una zona determinada. El desarrollo de arquitecturas de sistemas, estructuras de indexación y otros componentes que permitan satisfacer estas necesidades es el objetivo principal de una nueva área de investigación denominada recuperación de información geográfica (GIR). En esta tesis abordamos varios temas de interés en el área. En primer lugar, las estructuras de indexación que permiten recuperar documentos empleando tanto su ámbito textual como su ámbito espacial no tienen en cuenta la naturaleza jerárquica del espacio geográfico ni las relaciones topológicas entre los objetos espaciales que indexan. Por tanto, nuestro primer objetivo es desarrollar una estructura que solucione los problemas debidos a estas limitaciones. Esta estructura constituye la base de la arquitectura para sistemas GIR que proponemos como segundo objetivo de la tesis. Estudiamos las limitaciones de las arquitecturas de los sistemas GIR propuestas hasta la fecha y proponemos una arquitectura genérica, modular y extensible. Además desarrollamos un prototipo de sistema basado en dicha arquitectura. Finalmente, como tercer objetivo de esta tesis proponemos una estructura para indexar objetos geográficos optimizada para las características de la información que se maneja habitualmente en sistemas GIR

    When Edge Computing Meets Compact Data Structures

    Full text link
    Edge computing enables data processing and storage closer to where the data are created. Given the largely distributed compute environment and the significantly dispersed data distribution, there are increasing demands of data sharing and collaborative processing on the edge. Since data shuffling can dominate the overall execution time of collaborative processing jobs, considering the limited power supply and bandwidth resource in edge environments, it is crucial and valuable to reduce the communication overhead across edge devices. Compared with data compression, compact data structures (CDS) seem to be more suitable in this case, for the capability of allowing data to be queried, navigated, and manipulated directly in a compact form. However, the relevant work about applying CDS to edge computing generally focuses on the intuitive benefit from reduced data size, while few discussions about the challenges are given, not to mention empirical investigations into real-world edge use cases. This research highlights the challenges, opportunities, and potential scenarios of CDS implementation in edge computing. Driven by the use case of shuffling-intensive data analytics, we proposed a three-layer architecture for CDS-aided data processing and particularly studied the feasibility and efficiency of the CDS layer. We expect this research to foster conjoint research efforts on CDS-aided edge data analytics and to make wider practical impacts

    Nutrition, Nutritional Status and Functionality

    Get PDF
    [EN] A good quality of life means obtaining adequate nutrition and regular physical ac- tivity. This combination also reduces the risk of developing many chronic diseases while increasing one’s level of physical performance.S

    The Mechanisms of Regulated Cell Death: Structural and Functional Proteomic Pathways Induced or Inhibited by a Specific Protein—A Narrative Review

    Get PDF
    [EN] Billions of cells die in us every hour, and our tissues do not shrink because there is a natural regulation where Cell Death (CD) is balanced with cell division. The process in which cells eliminate themselves in a controlled manner is called Programmed Cell Death (PCD). The PCD plays an important role during embryonic development, in maintaining homeostasis of the body’s tissues, and in the elimination of damaged cells, under a wide range of physiological and developmental stimuli. A multitude of protein mediators of PCD have been identified and signals have been found to utilize common pathways elucidating the proteins involved. This narrative review focuses on caspase-dependent and caspase-independent PCD pathways. Included are studies of caspase-dependent PCD such as Anoikis, Catastrophe Mitotic, Pyroptosis, Emperitosis, Parthanatos and Cornification, and Caspase-Independent PCD as Wallerian Degeneration, Ferroptosis, Paraptosis, Entosis, Methuosis, and Extracellular Trap Abnormal Condition (ETosis), as well as neutrophil extracellular trap abnormal condition (NETosis) and Eosinophil Extracellular Trap Abnormal Condition (EETosis). Understanding PCD from those reported in this review could shed substantial light on the processes of biological homeostasis. In addition, identifying specific proteins involved in these processes is mandatory to identify molecular biomarkers, as well as therapeutic targets. This knowledge could provide the ability to modulate the PCD response and could lead to new therapeutic interventions in a wide range of diseases.S

    Potential Satellite Cell-Linked Biomarkers in Aging Skeletal Muscle Tissue: Proteomics and Proteogenomics to Monitor Sarcopenia

    Get PDF
    [EN] Sarcopenia (Sp) is the loss of skeletal muscle mass associated with aging which causes an involution of muscle function and strength. Satellite cells (Sc) are myogenic stem cells, which are activated by injury or stress, and repair muscle tissue. With advancing age, there is a decrease in the efficiency of the regenerative response of Sc. Diagnosis occurs with the Sp established by direct assessments of muscle. However, the detection of biomarkers in real-time biofluids by liquid biopsy could represent a step-change in the understanding of the molecular biology and heterogeneity of Sp. A total of 13 potential proteogenomic biomarkers of Sp by their physiological and biological interaction with Sc have been previously described in the literature. Increases in the expression of GDF11, PGC-1α, Sirt1, Pax7, Pax3, Myf5, MyoD, CD34, MyoG, and activation of Notch signaling stimulate Sc activity and proliferation, which could modulate and delay Sp progression. On the contrary, intensified expression of GDF8, p16INK4a, Mrf4, and activation of the Wnt pathway would contribute to early Sp development by directly inducing reduced and/or altered Sc function, which would attenuate the restorative capacity of skeletal muscle. Additionally, tissue biopsy remains an important diagnostic tool. Proteomic profiling of aged muscle tissues has shown shifts toward protein isoforms characteristic of a fast-to-slow transition process and an elevated number of oxidized proteins. In addition, a strong association between age and plasma values of growth differentiation factor 15 (GDF-15) has been described and serpin family A member 3 (serpin A3n) was more secreted by atrophied muscle cells. The identification of these new biomarkers holds the potential to change personalized medicine because it could predict in real time the course of Sp by monitoring its evolution and assessing responses to potential therapeutic strategies.S

    Los sistemas de información geográfica en turismo

    Get PDF
    [Resumo] A internet converteuse nun dos lugares máis populares para publicar e buscar case calquera tipo de información. En particular, a información turística gañou moita atención na rede durante os últimos anos, e non só a información sobre viaxes, recursos, lugares, museos ou monumentos, senón tamén sobre turismo cultural. Neste artigo presentamos as posibilidades que ofrecen os sistemas de información xeográfica (SIX) para a publicación de información turística e o acceso a ela, a través de interfaces coa capacidade de xerar mapas interactivos que presenten información asociada a cada elemento de interese que apareza neles. Ademais, describimos como caso de estudo a viaxe virtual que se nos propón na Biblioteca Virtual Galega (http://bvg.udc.es), un sistema accesible a través da web que, por medio de tecnoloxías SIX, permite acceder a calquera información turística ou cultural de Galicia de xeito sinxelo.[Resumen] Internet se ha convertido en uno de los lugares más populares para publicar y buscar casi cualquier tipo de información. En particular, la información turística ha ganado mucha atención en la red durante los últimos años, no sólo información sobre viajes, recursos, lugares, museos o monumentos, sino también sobre turismo cultural. En este artículo presentamos las posibilidades que ofrecen los Sistemas de Información Geográfica (SIG) en la publicación y acceso a información turística, a través de interfaces con capacidades de generación de mapas interactivos con información asociada a cada elemento de interés presentado en los mapas. Además, describimos como caso de estudio el Viaje Virtual de la Biblioteca Virtual Gallega (http://bvg.udc.es), un sistema accesible a través de la Web que, utilizando tecnologías SIG, permite acceder a cualquier información turística o cultural de Galicia de manera sencilla.[Abstract] The Internet has become one of the most popular places to publish and search for almost any type of information. In particular, tourist information has received much attention in the Internet over the past few years, not only information about travel, resources, places, museums or monuments, but also about cultural tourism. In this article we discuss the potential offered by Geographic Information Systems (GIS) in the publication of and access to tourist information, through interfaces capable of generating interactive maps with information associated with each element of interest shown in the maps. In addition, as a case study, we describe the Virtual Trip of the Galician Virtual Library (http://bvg.udc.es), an Internet-accessible system which makes it possible, using GIS technologies, to easily access any tourist or cultural information about Galicia

    Space-Efficient Representations of Raster Time Series

    Get PDF
    Financiado para publicación en acceso aberto: Universidade da Coruña/CISUG[Abstract] Raster time series, a.k.a. temporal rasters, are collections of rasters covering the same region at consecutive timestamps. These data have been used in many different applications ranging from weather forecast systems to monitoring of forest degradation or soil contamination. Many different sensors are generating this type of data, which makes such analyses possible, but also challenges the technological capacity to store and retrieve the data. In this work, we propose a space-efficient representation of raster time series that is based on Compact Data Structures (CDS). Our method uses a strategy of snapshots and logs to represent the data, in which both components are represented using CDS. We study two variants of this strategy, one with regular sampling and another one based on a heuristic that determines at which timestamps should the snapshots be created to reduce the space redundancy. We perform a comprehensive experimental evaluation using real datasets. The results show that the proposed strategy is competitive in space with alternatives based on pure data compression, while providing much more efficient query times for different types of queries.The data used in this study were acquired as part of the mission of NASA’s Earth Science Division and archived and distributed by the Goddard Earth Sciences (GES) Data and Information Services Center (DISC). Funding: CITIC, as Research Center accredited by Galician University System, is funded by “Consellería de Cultura, Educación e Universidade from Xunta de Galicia”, supported in an 80% through ERDF Funds, ERDF Operational Programme Galicia 2014-2020, and the remaining 20% by “Secretaría Xeral de Universidades” (Grant ED431G 2019/01). This work was also supported by Xunta de Galicia/FEDER-UE under Grants [IG240.2020.1.185; IN852A 2018/14]; Ministerio de Ciencia, Innovación y Universidades under Grants [TIN2016-78011-C4-1-R; RTC-2017-5908-7; PID2019- 105221RB-C41/AEI/10.13039/501100011033]; ANID - Millennium Science Initiative Program - Code ICN17_002; Programa Iberoamericano de Ciencia y Tecnología para el Desarrollo (CYTED) [Grant No. 519RT0579]Xunta de Galicia; ED431G 2019/01Xunta de Galicia; IG240.2020.1.185Xunta de Galicia; IN852A 2018/14Chile. Agencia Nacional de Investigación y Desarrollo; ICN17_00
    corecore