
    Exploratory Analysis of Highly Heterogeneous Document Collections

    We present an effective multifaceted system for exploratory analysis of highly heterogeneous document collections. Our system is based on intelligently tagging individual documents in a purely automated fashion and exploiting these tags in a powerful faceted browsing framework. Tagging strategies employed include both unsupervised and supervised approaches based on machine learning and natural language processing. As one of our key tagging strategies, we introduce the KERA algorithm (Keyword Extraction for Reports and Articles). KERA extracts topic-representative terms from individual documents in a purely unsupervised fashion and is shown to be significantly more effective than state-of-the-art methods. Finally, we evaluate our system in its ability to help users locate documents pertaining to military critical technologies buried deep in a large heterogeneous sea of information. Comment: 9 pages; KDD 2013: 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

    Information Technologies for Spatial Inventory of Greenhouse Gases in the Energy Sector of the Silesian Voivodeship

    A GIS technology for the spatial inventory of greenhouse gases (carbon dioxide, methane, etc.) in the energy sector of the Silesian Voivodeship in Poland is presented. Georeferenced databases, GIS software, and international inventory methodologies have been used. Mathematical models have been created for the inventory of carbon dioxide, methane and other greenhouse gases emitted during fuel combustion for electricity production, in the residential sector, in industry and construction, and in transport. These models make it possible to obtain the spatial distribution of total greenhouse gas emissions in the Silesian Voivodeship, taking into account the contribution of each district to the overall emission processes.

    Geoinformation Technologies and Spatial Analysis of GHG Emissions in Polish Regions Bordering Ukraine

    The specifics of the territorial distribution of GHG emission sources have been analyzed for the Polish regions bordering Ukraine. Mathematical models and geoinformation technology have been developed for the spatial analysis of GHG emissions in the energy sector; they take into account the territorial distribution of GHG emission sources and the structure of statistical data for the Lublin and Subcarpathian voivodeships. The results of the spatial analysis for these two voivodeships are presented.

    Geoinformation Technologies and Spatial Analysis of Carbon Dioxide Transport through Border Line

    Geoinformation technologies and methods for the spatial analysis of emissions in border regions have been developed, and GIS-based software has been created for estimating the mass of carbon dioxide (CO2) emissions transported across the border line. The mathematical models described for CO2 emission processes in the energy sector of the western regions of Ukraine take meteorological data into account, namely the wind rose, in order to compute the movement of air masses. A spatial analysis of carbon dioxide transport has been carried out for the Ukrainian-Polish border zone.

    Taking advantage of the UNFCCC Kyoto Policy Process: What can we learn about learning?

    Learning is difficult to anticipate when it happens instantaneously, e.g. in the context of innovations [2]. However, even if learning is anticipated to happen continuously, it is difficult to grasp, e.g. when it occurs outside well-defined lab conditions, because adequate monitoring has not been put in place. Our study is retrospective. It focuses on the emissions of greenhouse gases (GHGs) that have been reported by countries (Parties) under the Kyoto Protocol (KP) to the United Nations Framework Convention on Climate Change (UNFCCC). Discussions range widely on (i) whether the KP is considered a failure [6] or a success [5]; and (ii) whether international climate policy should transition from a centralized model of governance to a 'hybrid' decentralized approach that combines country-level mitigation pledges with common principles for accounting and monitoring [1]. (We consider the KP a potential success story and advocate the hybrid decentralized approach.) Emissions of GHGs (in the following we refer to CO2 emissions from burning fossil fuels at country level, particularly in the case of Austria) provide a perfect means to study learning in a globally relevant context. We are not aware of a comparable data set of global relevance. Our mode of grasping learning is novel, i.e. it may have been referred to in general but, to the best of our knowledge, has not been quantified so far. Learning requires 'measuring' differences or deviations. Here we follow Marland et al. [3], who discuss this issue in the context of emissions accounting: 'Many of the countries and organizations that make estimates of CO2 emissions provide annual updates in which they add another year of data to the time series and revise the estimates for earlier years. Revisions may reflect revised or more complete energy data and ... more complete and detailed understanding of the emissions processes and emissions coefficients. In short, we expect revisions to reflect learning and a convergence toward more complete and accurate estimates.' The UNFCCC requires exactly this to be done. Each year UNFCCC signatory countries are obliged to provide an annual inventory of emissions (and removals) of specified GHGs from five sectors (energy; industrial processes and product use; agriculture; land use, land use change and forestry; and waste) and to revisit the emissions (and removals) for all previous years, back to the country-specific base years (or periods). These data are made available by means of a database [4]. The time series of revised emission estimates reflect learning, but they are 'contaminated' by (i) structural change (e.g., when a coal power plant is replaced by a gas power plant); (ii) changes in consumption; and, rare but possible, (iii) methodological changes in surveying emission-related activities. De-trending the time series of revised emission estimates allows this contamination to be isolated by country, for which we provide three approaches: (I) a parametric approach employing a polynomial trend; (II) a non-parametric approach employing smoothing splines; and (III) an approach in which the most recent estimate is used as the trend. That is, after de-trending, for each year we are left with a set of revisions that reflect 'pure' (uncontaminated) learning, which is expected to be independent of the year under consideration (i.e., identical from year to year). However, we are confronted with two non-negligible problems (P): (P.1) the problem of small numbers: the remaining differences in emissions are small (before and after de-trending); and (P.2) the problem of non-monotonic learning: our knowledge of emission-generating activities and emission factors may not become more accurate from revision to revision.
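The de-trending step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it covers only approach (III) and the simplest case of approach (I), a degree-1 polynomial fitted separately to each revision, which is an assumption because the abstract does not say which series the trend is fitted to. The function names and the emission figures are hypothetical.

```python
from statistics import mean


def linear_trend(years, values):
    """Fit an ordinary least-squares line and return its value at each year."""
    y_bar, v_bar = mean(years), mean(values)
    sxx = sum((y - y_bar) ** 2 for y in years)
    sxy = sum((y - y_bar) * (v - v_bar) for y, v in zip(years, values))
    slope = sxy / sxx
    return [v_bar + slope * (y - y_bar) for y in years]


def detrend_polynomial(years, revisions):
    """Approach (I), degree-1 case (assumed): subtract each revision's own fitted line."""
    residuals = []
    for series in revisions:
        trend = linear_trend(years, series)
        residuals.append([v - t for v, t in zip(series, trend)])
    return residuals


def detrend_latest(revisions):
    """Approach (III): use the most recent revision as the trend for every revision."""
    latest = revisions[-1]
    return [[v - t for v, t in zip(series, latest)] for series in revisions]


# Hypothetical CO2 estimates (arbitrary units); one row per revision, oldest first,
# one column per emission year.
years = [1990, 1991, 1992, 1993]
revisions = [
    [62.0, 65.5, 60.1, 60.4],
    [62.3, 65.9, 60.0, 60.6],
    [62.1, 65.8, 60.2, 60.5],
]

for name, residuals in [
    ("linear trend", detrend_polynomial(years, revisions)),
    ("latest estimate", detrend_latest(revisions)),
]:
    print(name)
    for series in residuals:
        print("  ", [round(r, 3) for r in series])
```

With the latest estimate as the trend, the last row of residuals is zero by construction, and the earlier rows show how far each older revision sat from the current best estimate. Problem (P.1) is visible even in this toy data: the residuals are small relative to the emission levels.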

    Comparative analysis of the human serine hydrolase OVCA2 to the model serine hydrolase homolog FSH1 from S. cerevisiae

    Over 100 metabolic serine hydrolases are present in humans with confirmed functions in metabolism, immune response, and neurotransmission. Among potentially clinically relevant but uncharacterized human serine hydrolases is OVCA2, a serine hydrolase that has been linked with a variety of cancer-related processes. Herein, we developed a heterologous expression system for OVCA2 and determined the comprehensive substrate specificity of OVCA2 against two ester substrate libraries. Based on this analysis, OVCA2 was confirmed as a serine hydrolase with a strong preference for long-chain alkyl ester substrates (>10 carbons) and high selectivity against a variety of short, branched, and substituted esters. Substitutional analysis was used to identify the catalytic residues of OVCA2, a classic Ser117-His206-Asp179 catalytic triad. Comparison of the substrate specificity of OVCA2 to the model homologue FSH1 from Saccharomyces cerevisiae illustrated the tighter substrate selectivity of OVCA2, but also their overlapping substrate preference for extended straight-chain alkyl esters. Confirmation of the overlapping biochemical properties of OVCA2 and FSH1 was used to model structural information about OVCA2. Together, our analysis provides detailed substrate specificity information about a previously uncharacterized human serine hydrolase and begins to define the biological properties of OVCA2.

    Provider-initiated symptom screening for tuberculosis in Zimbabwe: diagnostic value and the effect of HIV status.

    OBJECTIVE: To assess the diagnostic value of provider-initiated symptom screening for tuberculosis (TB) and how HIV status affects it. METHODS: We performed a secondary analysis of randomly selected participants in a community-based TB-HIV prevalence survey in Harare, Zimbabwe. All completed a five-symptom questionnaire and underwent sputum TB culture and HIV testing. We calculated the sensitivity, specificity, and positive and negative predictive values of various symptoms and used regression analysis to investigate the relationship between symptoms and TB disease. FINDINGS: We found one or more symptoms of TB in 21.2% of 1858 HIV-positive (HIV+) and 9.9% of 7121 HIV-negative (HIV-) participants (P < […] ≥ 2 weeks' duration, any symptom and a positive sputum culture had sensitivities of 48%, 81% and 65%, respectively; in HIV- participants, the sensitivities were 45%, 71% and 74%, respectively. Symptoms had a similar sensitivity and specificity in HIV+ and HIV- participants, but in HIV+ participants they had a higher positive and a lower negative predictive value. CONCLUSION: Even smear-positive TB may be missed by provider-initiated symptom screening, especially in HIV+ individuals. Symptom screening is useful for ruling out TB, but better TB diagnostics are urgently needed for resource-poor settings.
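The four measures named in this abstract follow directly from a 2x2 table of screening result against culture result. The sketch below uses the standard definitions; the class, the function, and all counts are hypothetical and are not taken from the study. The second function applies Bayes' rule and shows why a test with similar sensitivity and specificity in two groups yields a higher positive and a lower negative predictive value in the group with higher disease prevalence.

```python
from dataclasses import dataclass


@dataclass
class ScreeningCounts:
    """2x2 table: symptom screen result versus reference standard (e.g. culture)."""
    true_pos: int   # screen positive, disease present
    false_pos: int  # screen positive, disease absent
    false_neg: int  # screen negative, disease present
    true_neg: int   # screen negative, disease absent

    def sensitivity(self) -> float:
        return self.true_pos / (self.true_pos + self.false_neg)

    def specificity(self) -> float:
        return self.true_neg / (self.true_neg + self.false_pos)

    def ppv(self) -> float:
        return self.true_pos / (self.true_pos + self.false_pos)

    def npv(self) -> float:
        return self.true_neg / (self.true_neg + self.false_neg)


def predictive_values(sensitivity: float, specificity: float, prevalence: float):
    """Return (PPV, NPV) for a given prevalence via Bayes' rule."""
    tp = sensitivity * prevalence
    fp = (1 - specificity) * (1 - prevalence)
    fn = (1 - sensitivity) * prevalence
    tn = specificity * (1 - prevalence)
    return tp / (tp + fp), tn / (tn + fn)


# Hypothetical counts for illustration only.
counts = ScreeningCounts(true_pos=40, false_pos=60, false_neg=10, true_neg=890)
print(counts.sensitivity(), counts.specificity(), counts.ppv(), counts.npv())

# Same test characteristics, two hypothetical prevalences.
for prevalence in (0.01, 0.05):
    print(prevalence, predictive_values(0.8, 0.9, prevalence))
```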

    Mathematical Modelling and Uncertainty Analysis of Greenhouse Gas Emissions from Oil Production and Refining in Poland

    The article reviews well-known approaches to estimating greenhouse gas emissions in various branches of economic activity. The specific features of emission processes in oil production and refining are described, and the processes are classified. Mathematical models of these processes are substantiated; they take into account fugitive emissions of carbon dioxide and methane during oil production, transport and refining, as well as emissions of carbon dioxide, methane and nitrous oxide from the use of fossil fuel in the oil refining process. Emission processes have been studied at the level of individual oil fields and refineries in Poland. To carry out the spatial inventory, a digital map of emission sources has been created and a series of computational experiments has been performed, yielding estimates of greenhouse gas emissions. The results of the spatial inventory have been visualised using digital maps, and the uncertainty of the results has been evaluated.