56 research outputs found
Innovative Heuristics to Improve the Latent Dirichlet Allocation Methodology for Textual Analysis and a New Modernized Topic Modeling Approach
Natural Language Processing is a complex method of data mining the vast trove of documents created and made available every day. Topic modeling seeks to identify the topics within textual corpora with limited human input into the process to speed analysis. Current topic modeling techniques used in Natural Language Processing have limitations in the pre-processing steps. This dissertation studies topic modeling techniques, those limitations in the pre-processing, and introduces new algorithms to gain improvements from existing topic modeling techniques while being competitive with computational complexity. This research introduces four contributions to the field of Natural Language Processing and topic modeling. First, this research identifies a requirement for a more robust âstopwordsâ list and proposes a heuristic for creating a more robust list. Second, a new dimensionality-reduction technique is introduced that exploits the number of words within a document to infer importance to word choice. Third, an algorithm is developed to determine the number of topics within a corpus and demonstrated using a standard topic modeling data set. These techniques produce a higher quality result from the Latent Dirichlet Allocation topic modeling technique. Fourth, a novel heuristic utilizing Principal Component Analysis is introduced that is capable of determining the number of topics within a corpus that produces stable sets of topic words
Global, regional, and national under-5 mortality, adult mortality, age-specific mortality, and life expectancy, 1970â2016: a systematic analysis for the Global Burden of Disease Study 2016
BACKGROUND: Detailed assessments of mortality patterns, particularly age-specific mortality, represent a crucial input that enables health systems to target interventions to specific populations. Understanding how all-cause mortality has changed with respect to development status can identify exemplars for best practice. To accomplish this, the Global Burden of Diseases, Injuries, and Risk Factors Study 2016 (GBD 2016) estimated age-specific and sex-specific all-cause mortality between 1970 and 2016 for 195 countries and territories and at the subnational level for the five countries with a population greater than 200 million in 2016.
METHODS: We have evaluated how well civil registration systems captured deaths using a set of demographic methods called death distribution methods for adults and from consideration of survey and census data for children younger than 5 years. We generated an overall assessment of completeness of registration of deaths by dividing registered deaths in each location-year by our estimate of all-age deaths generated from our overall estimation process. For 163 locations, including subnational units in countries with a population greater than 200 million with complete vital registration (VR) systems, our estimates were largely driven by the observed data, with corrections for small fluctuations in numbers and estimation for recent years where there were lags in data reporting (lags were variable by location, generally between 1 year and 6 years). For other locations, we took advantage of different data sources available to measure under-5 mortality rates (U5MR) using complete birth histories, summary birth histories, and incomplete VR with adjustments; we measured adult mortality rate (the probability of death in individuals aged 15-60 years) using adjusted incomplete VR, sibling histories, and household death recall. We used the U5MR and adult mortality rate, together with crude death rate due to HIV in the GBD model life table system, to estimate age-specific and sex-specific death rates for each location-year. Using various international databases, we identified fatal discontinuities, which we defined as increases in the death rate of more than one death per million, resulting from conflict and terrorism, natural disasters, major transport or technological accidents, and a subset of epidemic infectious diseases; these were added to estimates in the relevant years. In 47 countries with an identified peak adult prevalence for HIV/AIDS of more than 0·5% and where VR systems were less than 65% complete, we informed our estimates of age-sex-specific mortality using the Estimation and Projection Package (EPP)-Spectrum model fitted to national HIV/AIDS prevalence surveys and antenatal clinic serosurveillance systems. We estimated stillbirths, early neonatal, late neonatal, and childhood mortality using both survey and VR data in spatiotemporal Gaussian process regression models. We estimated abridged life tables for all location-years using age-specific death rates. We grouped locations into development quintiles based on the Socio-demographic Index (SDI) and analysed mortality trends by quintile. Using spline regression, we estimated the expected mortality rate for each age-sex group as a function of SDI. We identified countries with higher life expectancy than expected by comparing observed life expectancy to anticipated life expectancy on the basis of development status alone.
FINDINGS: Completeness in the registration of deaths increased from 28% in 1970 to a peak of 45% in 2013; completeness was lower after 2013 because of lags in reporting. Total deaths in children younger than 5 years decreased from 1970 to 2016, and slower decreases occurred at ages 5-24 years. By contrast, numbers of adult deaths increased in each 5-year age bracket above the age of 25 years. The distribution of annualised rates of change in age-specific mortality rate differed over the period 2000 to 2016 compared with earlier decades: increasing annualised rates of change were less frequent, although rising annualised rates of change still occurred in some locations, particularly for adolescent and younger adult age groups. Rates of stillbirths and under-5 mortality both decreased globally from 1970. Evidence for global convergence of death rates was mixed; although the absolute difference between age-standardised death rates narrowed between countries at the lowest and highest levels of SDI, the ratio of these death rates-a measure of relative inequality-increased slightly. There was a strong shift between 1970 and 2016 toward higher life expectancy, most noticeably at higher levels of SDI. Among countries with populations greater than 1 million in 2016, life expectancy at birth was highest for women in Japan, at 86·9 years (95% UI 86·7-87·2), and for men in Singapore, at 81·3 years (78·8-83·7) in 2016. Male life expectancy was generally lower than female life expectancy between 1970 and 2016, an
Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network
Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated signals from strong lensing by 1) performing targeted searches for subthreshold signals, 2) calculating the degree of overlap amongst the intrinsic parameters and sky location of pairs of signals, 3) comparing the similarities of the spectrograms amongst pairs of signals, and 4) performing dual-signal Bayesian analysis that takes into account selection effects and astrophysical knowledge. We also search for distortions to the gravitational waveform caused by 1) frequency-independent phase shifts in strongly lensed images, and 2) frequency-dependent modulation of the amplitude and phase due to point masses. None of these searches yields significant evidence for lensing. Finally, we use the non-detection of gravitational-wave lensing to constrain the lensing rate based on the latest merger-rate estimates and the fraction of dark matter composed of compact objects
Search for eccentric black hole coalescences during the third observing run of LIGO and Virgo
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass M>70 Mâ) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities 0<eâ€0.3 at 0.33 Gpcâ3 yrâ1 at 90\% confidence level
Recommended from our members
The TFAP2C-Regulated OCT4 Naive Enhancer Is Involved in Human Germline Formation.
Human primordial germ cells (hPGCs) are the first embryonic progenitors in the germ cell lineage, yet the molecular mechanisms required for hPGC formation are not well characterized. To identify regulatory regions in hPGC development, we used the assay for transposase-accessible chromatin using sequencing (ATAC-seq) to systematically characterize regions of open chromatin in hPGCs and hPGC-like cells (hPGCLCs) differentiated from human embryonic stem cells (hESCs). We discovered regions of open chromatin unique to hPGCs and hPGCLCs that significantly overlap with TFAP2C-bound enhancers identified in the naive ground state of pluripotency. Using CRISPR/Cas9, we show that deleting the TFAP2C-bound naive enhancer at the OCT4 locus (also called POU5F1) results in impaired OCT4 expression and a negative effect on hPGCLC identity
The TFAP2C-Regulated OCT4 Naive Enhancer Is Involved in Human Germline Formation
Summary: Human primordial germ cells (hPGCs) are the first embryonic progenitors in the germ cell lineage, yet the molecular mechanisms required for hPGC formation are not well characterized. To identify regulatory regions in hPGC development, we used the assay for transposase-accessible chromatin using sequencing (ATAC-seq) to systematically characterize regions of open chromatin in hPGCs and hPGC-like cells (hPGCLCs) differentiated from human embryonic stem cells (hESCs). We discovered regions of open chromatin unique to hPGCs and hPGCLCs that significantly overlap with TFAP2C-bound enhancers identified in the naive ground state of pluripotency. Using CRISPR/Cas9, we show that deleting the TFAP2C-bound naive enhancer at the OCT4 locus (also called POU5F1) results in impaired OCT4 expression and a negative effect on hPGCLC identity. : Combining genomics and functional studies, Chen et al. identify the open chromatin state of human primordial germ cells (hPGCs), leading to the discovery that TFAP2C regulates hPGC development through the opening of naive enhancers. Keywords: TFAP2C, OCT4, naive, enhancer, PGC, PGCLC, pluripotenc
Demographic and Disease Characteristics Associated With Non-Hodgkin Lymphoma Survivors' Quality of Life: Does Age Matter?
PURPOSE/OBJECTIVES: To examine demographic and disease characteristics by age and the moderating effect of age on quality of life (QOL) among non-Hodgkin lymphoma (NHL) survivors.
DESIGN: A cross-sectional, secondary analysis study of NHL survivors.
SETTING: Two North Carolina cancer registries.
SAMPLE: 741 NHL survivors with a mean age of 62 years and a mean time since diagnosis of 10 years.
METHODS: Mailed surveys were sent to individuals treated for NHL. All analyses were conducted using SPSSÂź, version 18.0. Multiple regression was used to analyze relationships among demographic and disease characteristics, age, and QOL.
MAIN RESEARCH VARIABLES: Demographic, disease, and clinical characteristics on QOL.
FINDINGS: In relation to QOL, income and gender were moderated by age; for example, younger survivors who earned less than $30,000 annually had a poorer QOL. Women reported a higher QOL than men.
CONCLUSIONS: Age was a moderator for income and an indicator for how income could affect care of younger survivors. Men reported a lower QOL than women and gender-specific resources may be helpful to them.
IMPLICATIONS FOR NURSING: Nursing research should focus on age-sensitive resources targeted for younger NHL survivors.
KNOWLEDGE TRANSLATION: Age is an important characteristic that impacts overall health-related QOL. Oncology nurses are instrumental in identifying patients at all ages who could benefit from age-specific resources
Pharmacological characterisation of a rat model of incisional pain
1. Both clinical and preclinical models of postsurgical pain are being used more frequently in the early evaluation of new chemical entities. In order to assess the validity and reliability of a rat model of postincisional pain, the effects of different classes of clinically effective analgesic drugs were evaluated against multiple behavioural end points. 2. Following surgical incision, under general anaesthesia, of the plantar surface of the rat hind paw, we determined the time course of mechanical hyperalgesia, tactile allodynia and hind limb weight bearing using the RandallâSelitto (paw pressure) assay, electronic von Frey and dual channel weight averager, respectively. Behavioural evaluations began 24 h following surgery, and were continued for 9â14 days. 3. Mechanical hyperalgesia, tactile allodynia and a decrease in weight bearing were present on the affected limb within 1 day of surgery with maximum sensitivity 1â3 days postsurgery. Accordingly, we examined the effect of nonsteroidal antiinflammatory drugs (NSAIDs), morphine and gabapentin, on established hyperalgesia and allodynia, 1 day following plantar incision. 4. In accordance with previous reports, both systemic morphine and gabapentin administration reversed mechanical hyperalgesia and tactile allodynia in the incised rat hind paw. Both drugs were more potent against mechanical hyperalgesia than tactile allodynia. 5. All of the NSAIDs tested, including cyclooxygenase 2 selective inhibitors, reversed mechanical hyperalgesia and tactile allodynia in the incised rat hind paw. The rank order of potency for both hyperalgesia and allodynia was indomethacin > celecoxib > etoricoxib > naproxen. 6. We have investigated the potency and efficacy of different classes of analgesic drugs in a rat model of postincisional pain. The rank order of potency for these drugs reflects their utility in treating postoperative pain in the clinic. As these compounds showed reliable efficacy across two different behavioural end points, the RandallâSelitto (paw pressure) assay and electronic von Frey, these methods may prove useful in the study of postsurgical pain and the assessment of novel treatments
- âŠ