1,782 research outputs found
Recommended from our members
Text-based document geolocation and its application to the digital humanities
This dissertation investigates automatic geolocation of documents (i.e. identification of their location, expressed as latitude/longitude coordinates), based on the text of those documents rather than metadata. I assert that such geolocation can be performed using text alone, at a sufficient accuracy for use in real-world applications. Although in some corpora metadata is found in abundance (e.g. home location, time zone, friends, followers, etc. in Twitter), it is lacking in others, such as many corpora of primary-source documents in the digital humanities, an area to which document geolocation has hardly been applied. To this end, I first develop methods for accurate text-based geolocation and then apply them to newly annotated corpora in the digital humanities. The geolocation methods I develop use both uniform and adaptive (k-d tree) grids over the Earth’s surface, culminating in a hierarchical logistic-regression-based technique that achieves state-of-the-art results on well-known corpora (Twitter user feeds, Wikipedia articles and Flickr image tags). In the second part of the dissertation I develop a new NLP task, text-based geolocation of historical corpora. Because there are no existing corpora to test on, I create and annotate two new corpora of significantly different natures (a 19th-century travel log and a large set of Civil War archives). I show how my methods produce good geolocation accuracy even given the relatively small amount of annotated data available, which can be further improved using domain adaptation. I then use the predictions on the much larger unannotated portion of the Civil War archives to generate and analyze geographic topic models, showing how they can be mined to produce interesting revelations concerning various Civil War-related subjects.
Finally, I develop a new geolocation technique for text-only corpora involving co-training between document-geolocation and toponym-resolution models, using a gazetteer to inject additional information into the training process. To evaluate this technique I develop a new metric, the closest toponym error distance, on which I show improvements compared with a baseline geolocator.
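As a rough illustration of the adaptive (k-d tree) grid mentioned in the abstract above, the sketch below splits a set of (latitude, longitude) points at the median, alternating between the two axes, until each cell holds at most a fixed number of points. Dense regions therefore end up with smaller cells than sparse ones. The function name `kd_cells`, the `bucket_size` parameter, the alternating split rule, and the sample coordinates are all assumptions made for illustration; the abstract does not describe the dissertation's actual implementation.

```python
def kd_cells(points, bucket_size, depth=0):
    """Partition (lat, lon) points into leaf cells of at most bucket_size points.

    Hypothetical helper: splits at the median, alternating latitude and
    longitude by depth, as in a textbook k-d tree.
    """
    if bucket_size < 1:
        raise ValueError("bucket_size must be at least 1")
    if len(points) <= bucket_size:
        return [points]
    axis = depth % 2  # 0 = latitude, 1 = longitude
    ordered = sorted(points, key=lambda p: p[axis])
    mid = len(ordered) // 2
    return (kd_cells(ordered[:mid], bucket_size, depth + 1)
            + kd_cells(ordered[mid:], bucket_size, depth + 1))


# Made-up coordinates, used only to exercise the function.
sample_points = [(30.3, -97.7), (30.2, -97.8), (40.7, -74.0),
                 (51.5, -0.1), (48.9, 2.4)]
cells = kd_cells(sample_points, bucket_size=2)
for cell in cells:
    print(cell)
```

In a grid-based geolocator, each leaf cell would then be treated as one candidate location class; that step is not shown here.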
A phenomenological model for morphology development of disperse polymer blends in complex flows
A social history of industrial strikes and the labour movement in Hong Kong, 1946-1989
The influence of adrenergic receptors on the blood sugar and lactic acid levels in the rat
The protoplanetary disk of FT Tauri: multi-wavelength data analysis and modeling
Investigating the evolution of protoplanetary disks is crucial for our
understanding of star and planet formation. Several theoretical and
observational studies have been performed in the last decades to advance this
knowledge. FT Tauri is a young star in the Taurus star forming region that was
included in a number of spectroscopic and photometric surveys. We investigate
the properties of the star, the circumstellar disk, and the accretion and
ejection processes and propose a consistent gas and dust model that can also serve
as a reference for future observational studies. We performed a multi-wavelength
data analysis to derive the basic stellar and disk properties, as well as mass
accretion/outflow rate from TNG-Dolores, WHT-Liris, NOT-Notcam, Keck-Nirspec,
and Herschel-Pacs spectra. From the literature, we compiled a complete spectral
energy distribution. We then performed detailed disk modeling using the MCFOST
and ProDiMo codes. Multi-wavelength spectroscopic and photometric measurements
were compared with the reddened predictions of the codes in order to constrain
the disk properties. This object can serve as a benchmark for primordial disks
with a significant mass accretion rate, high gas content, and typical size. Comment: 16 pages, 9 figures, accepted for publication in A&
Clinical and Nonclinical Health Care Workers Faced a Similar Risk of Acquiring 2009 Pandemic H1N1 Infection
(See the editorial commentary by Drumright and Holmes, on pages 284-286.) Reporting of confirmed pandemic influenza A virus (pH1N1) 2009 infection was mandatory among health care workers in Hong Kong. Among 1158 confirmed infections, there was no significant difference in incidence among clinical versus nonclinical staff (relative risk, 0.98; 95% confidence interval, 0.78-1.20). Reported community exposure to pH1N1 was common and was similar in both groups.
Double Averaging Analysis Applied to a Large Eddy Simulation of Coupled Turbulent Overlying and Porewater Flow
Freestream turbulence in rivers is a key contributor to the flux of dissolved nutrients, carbon, and other ecologically important solutes into porewater. To advance understanding of turbulent hyporheic exchange and porewater transport, we investigate flow over and through a rough bed of spheres using large eddy simulation (LES). We apply double averaging (combined space and time averaging) to the LES results to determine the mean velocity distribution, momentum balance, and drag forces. Our simulations show large-scale freestream structures interacting strongly with vortices generated at the surfaces of individual spheres to control turbulent momentum fluxes into the bed. The transition between turbulent flow and Darcy flow occurs over the first row of spheres, where turbulence decays rapidly and turbulent kinetic energy, Reynolds stress, and drag forces peak. Below this region, turbulence is only present in the high-velocity flow in open pore throats. Experimental observations suggest that minimum mean porewater velocity occurs in the first open pore space below the transition region, but our results show that the minimum occurs between the first and second pore spaces. The simulation mean porewater velocities are approximately half those captured in measurements because the model resolves the entire flow continuum while measurements can access high-velocity fluid in open pores. The high-resolution dual time-space averaging of the LES resolves both turbulent and mean flow features that are important to interfacial solute and particle fluxes, providing a means to include turbulent hyporheic exchange in upscaled river models, which has not been achieved to date.
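The double averaging named in the abstract above (a time average followed by a spatial average) can be sketched for a single horizontal plane as follows. This is a minimal illustration, not the authors' code. It assumes the spatial average is taken over fluid points only, and it uses made-up velocity samples. The names `double_average`, `samples`, and `fluid_mask` are hypothetical.

```python
def double_average(samples, fluid_mask):
    """Double-average a velocity component over one horizontal plane.

    samples[t][i] is the velocity at time step t and grid point i.
    fluid_mask[i] is True where point i lies in fluid rather than inside
    a sphere (assumption: the spatial mean uses fluid points only).
    """
    n_times = len(samples)
    n_points = len(fluid_mask)
    # Step 1: time average at every grid point.
    time_mean = [sum(frame[i] for frame in samples) / n_times
                 for i in range(n_points)]
    # Step 2: spatial average of the time means over fluid points.
    fluid = [i for i in range(n_points) if fluid_mask[i]]
    double_mean = sum(time_mean[i] for i in fluid) / len(fluid)
    # Spatial deviation of each time mean from the double average.
    form_induced = {i: time_mean[i] - double_mean for i in fluid}
    return double_mean, time_mean, form_induced


# Made-up samples: two time steps, three points, the last inside a sphere.
samples = [[1.0, 3.0, 0.0],
           [3.0, 5.0, 0.0]]
fluid_mask = [True, True, False]
double_mean, time_mean, form_induced = double_average(samples, fluid_mask)
print(double_mean)   # 3.0
print(form_induced)  # {0: -1.0, 1: 1.0}
```

Repeating this at each elevation would give a vertical profile of the double-averaged velocity, the kind of mean velocity distribution the abstract refers to.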
The contribution of daytime sleepiness to impaired quality of life in NAFLD in an ethnically diverse population
Health-related quality of life (HRQoL) is lower in people with NAFLD compared to the general population. Sleep disturbance resulting in daytime sleepiness is common in patients with NAFLD, but the effect of daytime sleepiness on HRQoL in NAFLD is unclear. The prevalence and natural history of NAFLD vary in different ethnic groups, but there has been limited ethnic diversity in HRQoL studies to date. We aimed to assess whether daytime sleepiness is independently associated with reduced HRQoL in an ethnically diverse UK population. We conducted HRQoL assessments using SF-36 version 2 and Epworth Sleepiness Scale (ESS) questionnaires in 192 people with NAFLD. Multivariate linear regression was used to identify factors independently affecting HRQoL scales. People with NAFLD reported significantly reduced physical health-related SF-36 scores compared to the general UK population. South Asian NAFLD patients reported impairment in physical health, but not mental health, approximately a decade before White NAFLD patients. In multivariate linear regression, daytime sleepiness (ESS score > 10) was the most significant independent predictor of reduced physical health. Age, BMI and liver stiffness score were also significantly associated. HRQoL is impaired earlier in patients of South Asian ethnicity. ESS score > 10, indicative of excessive daytime sleepiness, is an independent predictor of reduced HRQoL in people with NAFLD regardless of ethnicity. Daytime sleepiness should be considered as a contributing factor to reduced HRQoL in clinical practice and when evaluating patient-related outcomes in clinical trials. Grant from the Diabetes Wellness and Research Foundation (WA, WKS). WA was supported by a New Investigator Research Grant from the Medical Research Council.
Solar-Driven Reduction of Aqueous Protons Coupled to Selective Alcohol Oxidation with a Carbon Nitride-Molecular Ni Catalyst System.
Solar water-splitting represents an important strategy toward production of the storable and renewable fuel hydrogen. The water oxidation half-reaction typically proceeds with poor efficiency and produces the unprofitable and often damaging product, O2. Herein, we demonstrate an alternative approach and couple solar H2 generation with value-added organic substrate oxidation. Solar irradiation of a cyanamide surface-functionalized melon-type carbon nitride ((NCN)CNx) and a molecular nickel(II) bis(diphosphine) H2-evolution catalyst (NiP) enabled the production of H2 with concomitant selective oxidation of benzylic alcohols to aldehydes in high yield under purely aqueous conditions, at room temperature and ambient pressure. This one-pot system maintained its activity over 24 h, generating products in 1:1 stoichiometry, separated in the gas and solution phases. The (NCN)CNx-NiP system showed an activity of 763 μmol (g CNx)⁻¹ h⁻¹ toward H2 and aldehyde production, a Ni-based turnover frequency of 76 h⁻¹, and an external quantum efficiency of 15% (λ = 360 ± 10 nm). This precious metal-free and nontoxic photocatalytic system displays better performance than an analogous system containing platinum instead of NiP. Transient absorption spectroscopy revealed that the photoactivity of (NCN)CNx is due to efficient substrate oxidation of the material, which outweighs possible charge recombination compared to the nonfunctionalized melon-type carbon nitride. Photoexcited (NCN)CNx in the presence of an organic substrate can accumulate ultralong-lived "trapped electrons", which allow for fuel generation in the dark.
The artificial photosynthetic system thereby catalyzes a closed redox cycle showing 100% atom economy and generates two value-added products, a solar chemical, and solar fuel. This work was supported by the Christian Doppler Research Association (Austrian Federal Ministry of Science, Research, and Economy and the National Foundation for Research, Technology and Development) and the OMV Group (to E.R.), an Oppenheimer PhD scholarship (to B.C.M.M.), a Marie Curie Postdoctoral Fellowship (GAN 624997 to C.A.C.), a FRQNT Postdoctoral Fellowship (to R.G.), and an ERC Starting Grant (B. V. L., Grant No. 639233). This is the final version of the article. It first appeared from American Chemical Society via http://dx.doi.org/10.1021/jacs.6b0432