164 research outputs found
Capacities, Removable Sets andL(p)-Uniqueness on Wiener Spaces
Hinz M, Kang S. Capacities, Removable Sets andL(p)-Uniqueness on Wiener Spaces. Potential Analysis. 2021;54:503–533.We prove the equivalence of two different types of capacities in abstract Wiener spaces. This yields a criterion for theL(p)-uniqueness of the Ornstein-Uhlenbeck operator and its integer powers defined on suitable algebras of functions vanishing in a neighborhood of a given closed set sigma of zero Gaussian measure. To prove the equivalence we show theW(r,p)(B,mu)-boundedness of certain smooth nonlinear truncation operators acting on potentials of nonnegative functions. We discuss connections to Gaussian Hausdorff measures. Roughly speaking, ifL(p)-uniqueness holds then the 'removed' set sigma must have sufficiently large codimension, in the case of the Ornstein-Uhlenbeck operator for instance at least 2p. Forp= 2 we obtain parallel results on truncations, capacities and essential self-adjointness for Ornstein-Uhlenbeck operators with linear drift. These results apply to the time zero Gaussian free field as a prototype example
Stress Processing Sensitivity in Reading Korean and English Words
PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20-22, 200
Automatic Creation of Named Entity Recognition Datasets by Querying Phrase Representations
Most weakly supervised named entity recognition (NER) models rely on
domain-specific dictionaries provided by experts. This approach is infeasible
in many domains where dictionaries do not exist. While a phrase retrieval model
was used to construct pseudo-dictionaries with entities retrieved from
Wikipedia automatically in a recent study, these dictionaries often have
limited coverage because the retriever is likely to retrieve popular entities
rather than rare ones. In this study, we present a novel framework, HighGEN,
that generates NER datasets with high-coverage pseudo-dictionaries.
Specifically, we create entity-rich dictionaries with a novel search method,
called phrase embedding search, which encourages the retriever to search a
space densely populated with various entities. In addition, we use a new
verification process based on the embedding distance between candidate entity
mentions and entity types to reduce the false-positive noise in weak labels
generated by high-coverage dictionaries. We demonstrate that HighGEN
outperforms the previous best model by an average F1 score of 4.7 across five
NER benchmark datasets.Comment: ACL 202
Why are hotel room prices different? Exploring spatially varying relationships between room price and hotel attributes
Despite abundant research on modeling hotel room prices, traditional hedonic pricing models (HPMs) have failed to consider spatial variations in the relationships among hotel room price and attribute variables. This study demonstrates the utility of a spatial HPM (s-HPM) using a geographically weighted regression analysis of 387 hotels in the Chicago area. Specifically, this study explored spatial variations in modeling hotel room prices and further identified spatial clustering patterns of relationships between room price and hotel attributes across market segments. The findings reveal that the s-HPM successfully identified spatially varying relationships between room price and hotel attributes, such as site attributes – size, age, class and service quality – and situation attributes – distances to airports, highways and tourist attractions – across the study area. This study contributes to a better understanding of local patterns of modeling room prices, ultimately providing guidelines for effective location-based hotel room pricing strategies
Simple Questions Generate Named Entity Recognition Datasets
Recent named entity recognition (NER) models often rely on human-annotated
datasets requiring the vast engagement of professional knowledge on the target
domain and entities. This work introduces an ask-to-generate approach, which
automatically generates NER datasets by asking simple natural language
questions to an open-domain question answering system (e.g., "Which disease?").
Despite using fewer training resources, our models solely trained on the
generated datasets largely outperform strong low-resource models by 20.8 F1
score on average across six popular NER benchmarks. Our models also show
competitive performance with rich-resource models that additionally leverage
in-domain dictionaries provided by domain experts. In few-shot NER, we
outperform the previous best model by 5.2 F1 score on three benchmarks and
achieve new state-of-the-art performance.Comment: Code available at https://github.com/dmis-lab/GeNE
Evaluation of public libraries and the urban situation in Seoul
Thesis (M.C.P.)--Massachusetts Institute of Technology, Dept. of Urban Studies and Planning, 2012.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student submitted PDF version of thesis.Includes bibliographical references (p. 116-121).This thesis investigates the current situation of public space in the city of Seoul through public libraries. The public library has been one of the most Important civic spaces since the invention in the 19th century in the US or UK. While roles of public library are changing due to advances in digital technology, the physical and visible presence of public library spaces in the city remains significant in the privatized urban situation. In Seoul, the number of public libraries has significantly increased for the last 15 years due to the new policy for library construction in the entire country. Despite these attempts, however, I argue that the current urban context of the city prevent public libraries from functioning effectively as civic centers. Investigating the reciprocal relationship between architecture and the urban condition, the thesis confirms the discontinuous and impermeable urban form in Seoul that impedes the publicness of the existing public spaces. The urban morphology hinders people from freely navigating and accessing the existing public spaces. Two recent cases of incremental commercial development offer lessons for possible future revitalization strategies that could increase the publicness of the existing urban condition in Seoul without the need for radically painful and costly urban reform. Public spaces like the public library can be used as a strategy to improve the city's physical form and civic realm.by Seunghyun Kang.M.C.P
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing
Contrastive language-image pre-training (CLIP) models have demonstrated
considerable success across various vision-language tasks, such as
text-to-image retrieval, where the model is required to effectively process
natural language input to produce an accurate visual output. However, current
models still face limitations in dealing with linguistic variations in input
queries, such as paraphrases, making it challenging to handle a broad range of
user queries in real-world applications. In this study, we introduce a
straightforward fine-tuning approach to enhance the representations of CLIP
models for paraphrases. Our approach involves a two-step paraphrase generation
process, where we automatically create two categories of paraphrases from
web-scale image captions by leveraging large language models. Subsequently, we
fine-tune the CLIP text encoder using these generated paraphrases while
freezing the image encoder. Our resulting model, which we call ParaCLIP,
exhibits significant improvements over baseline CLIP models across various
tasks, including paraphrased retrieval (with rank similarity scores improved by
up to 2.0% and 5.6%), Visual Genome Relation and Attribution, as well as seven
semantic textual similarity tasks.Comment: EACL 2024 (Findings of the ACL
CVD-grown monolayer MoS2 in bioabsorbable electronics and biosensors
Transient electronics entails the capability of electronic components to dissolve or reabsorb in a controlled manner when used in biomedical implants. Here, the authors perform a systematic study of the processes of hydrolysis, bioabsorption, cytotoxicity and immunological biocompatibility of monolayer MoS2
- …