9,654 research outputs found

    On the Use of Parsing for Named Entity Recognition

    Get PDF
    [Abstract] Parsing is a core natural language processing technique that can be used to obtain the structure underlying sentences in human languages. Named entity recognition (NER) is the task of identifying the entities that appear in a text. NER is a challenging natural language processing task that is essential to extract knowledge from texts in multiple domains, ranging from financial to medical. It is intuitive that the structure of a text can be helpful to determine whether or not a certain portion of it is an entity and if so, to establish its concrete limits. However, parsing has been a relatively little-used technique in NER systems, since most of them have chosen to consider shallow approaches to deal with text. In this work, we study the characteristics of NER, a task that is far from being solved despite its long history; we analyze the latest advances in parsing that make its use advisable in NER settings; we review the different approaches to NER that make use of syntactic information; and we propose a new way of using parsing in NER based on casting parsing itself as a sequence labeling task.Xunta de Galicia; ED431C 2020/11Xunta de Galicia; ED431G 2019/01This work has been funded by MINECO, AEI and FEDER of UE through the ANSWER-ASAP project (TIN2017-85160-C2-1-R); and by Xunta de Galicia through a Competitive Reference Group grant (ED431C 2020/11). CITIC, as Research Center of the Galician University System, is funded by the Consellería de Educación, Universidade e Formación Profesional of the Xunta de Galicia through the European Regional Development Fund (ERDF/FEDER) with 80%, the Galicia ERDF 2014-20 Operational Programme, and the remaining 20% from the Secretaría Xeral de Universidades (Ref. ED431G 2019/01). Carlos Gómez-Rodríguez has also received funding from the European Research Council (ERC), under the European Union’s Horizon 2020 research and innovation programme (FASTPARSE, Grant No. 714150)

    Special Libraries, November 1955

    Get PDF
    Volume 46, Issue 9https://scholarworks.sjsu.edu/sla_sl_1955/1008/thumbnail.jp

    Getting More out of Biomedical Documents with GATE's Full Lifecycle Open Source Text Analytics.

    Get PDF
    This software article describes the GATE family of open source text analysis tools and processes. GATE is one of the most widely used systems of its type with yearly download rates of tens of thousands and many active users in both academic and industrial contexts. In this paper we report three examples of GATE-based systems operating in the life sciences and in medicine. First, in genome-wide association studies which have contributed to discovery of a head and neck cancer mutation association. Second, medical records analysis which has significantly increased the statistical power of treatment/ outcome models in the UK’s largest psychiatric patient cohort. Third, richer constructs in drug-related searching. We also explore the ways in which the GATE family supports the various stages of the lifecycle present in our examples. We conclude that the deployment of text mining for document abstraction or rich search and navigation is best thought of as a process, and that with the right computational tools and data collection strategies this process can be made defined and repeatable. The GATE research programme is now 20 years old and has grown from its roots as a specialist development tool for text processing to become a rather comprehensive ecosystem, bringing together software developers, language engineers and research staff from diverse fields. GATE now has a strong claim to cover a uniquely wide range of the lifecycle of text analysis systems. It forms a focal point for the integration and reuse of advances that have been made by many people (the majority outside of the authors’ own group) who work in text processing for biomedicine and other areas. GATE is available online ,1. under GNU open source licences and runs on all major operating systems. Support is available from an active user and developer community and also on a commercial basis

    Studying Ourselves: The Academic Labor Market

    Get PDF
    [Excerpt] The study of academic labor markets by economists goes back at least to Adam Smith’s suggestion in The Wealth of Nations that a professor’s compensation be tied to the number of students that enrolled in his classes. This paper focuses on three academic labor market issues that students at Cornell and I are currently addressing; the declining salaries of faculty employed at public colleges and universities relative to the salaries of their counterparts employed at private higher education institutions, the growing dispersion of average faculty salaries across academic institutions within both the public and private sectors, and the impacts of the growing importance and costs of science on the academic labor market and universities

    SciTech News Volume 70, No. 2 (2016)

    Get PDF
    Table of Contents: Columns and Reports From the Editor 3 Division News Science-Technology Division 4 New Members 6 Chemistry Division 7 New Members11 Engineering Division 12 Aerospace Section of the Engineering Division 17 Reviews Sci-Tech Book News Reviews 1

    Studying Ourselves: The Academic Labor Market

    Get PDF
    This paper addresses three academic labor market issues; the declining salaries of faculty employed at public colleges and universities relative to their private institution counterparts, the growing dispersion of average faculty salaries across academic institutions within both the public and private sectors, and the impacts of the growing importance and costs of science on the academic labor market and universities. The decline in the salaries of faculty in public institutions relative to their private sector counterparts is attributed primarily to private institutions' tuition levels rising by more in real terms than public institutions' tuition levels. The growing dispersion in average faculty salaries across institutions within each sector is attributed primarily to the growing disperion of endowmentper student levels across private institutions and the growing dispersion of state appropriations per student across public institutions. Finally, controlling for other factors, those universities whose real research expenditures per faculty from institutional funds are growing the most experience the greatest increase in their student/faculty ratio, other variables held constant.

    Sentiment Analysis for Fake News Detection

    Get PDF
    [Abstract] In recent years, we have witnessed a rise in fake news, i.e., provably false pieces of information created with the intention of deception. The dissemination of this type of news poses a serious threat to cohesion and social well-being, since it fosters political polarization and the distrust of people with respect to their leaders. The huge amount of news that is disseminated through social media makes manual verification unfeasible, which has promoted the design and implementation of automatic systems for fake news detection. The creators of fake news use various stylistic tricks to promote the success of their creations, with one of them being to excite the sentiments of the recipients. This has led to sentiment analysis, the part of text analytics in charge of determining the polarity and strength of sentiments expressed in a text, to be used in fake news detection approaches, either as a basis of the system or as a complementary element. In this article, we study the different uses of sentiment analysis in the detection of fake news, with a discussion of the most relevant elements and shortcomings, and the requirements that should be met in the near future, such as multilingualism, explainability, mitigation of biases, or treatment of multimedia elements.Xunta de Galicia; ED431G 2019/01Xunta de Galicia; ED431C 2020/11This work has been funded by FEDER/Ministerio de Ciencia, Innovación y Universidades — Agencia Estatal de Investigación through the ANSWERASAP project (TIN2017-85160-C2-1-R); and by Xunta de Galicia through a Competitive Reference Group grant (ED431C 2020/11). CITIC, as Research Center of the Galician University System, is funded by the Consellería de Educación, Universidade e Formación Profesional of the Xunta de Galicia through the European Regional Development Fund (ERDF/FEDER) with 80%, the Galicia ERDF 2014-20 Operational Programme, and the remaining 20% from the Secretaría Xeral de Universidades (ref. ED431G 2019/01). David Vilares is also supported by a 2020 Leonardo Grant for Researchers and Cultural Creators from the BBVA Foundation. Carlos Gómez-Rodríguez has also received funding from the European Research Council (ERC), under the European Union’s Horizon 2020 research and innovation programme (FASTPARSE, grant No. 714150

    Coatings for Corrosion Protection: Offshore Oil and Gas Operation Facilities, Marine Pipeline and Ship Structures

    Get PDF
    This workshop on Coatings for Corrosion Protection: Offshore Oil and Gas Operation Facilities, Marine Pipelines, Ship Structures, and Port Facilities was held on April 14-16, 2004 in Biloxi, Mississippi. This workshop of 150 attendees drew participation by internationally recognized marine coating experts, material specialists, inspection specialists, coating manufacturers, maintenance engineers, and designers. The workshop was crafted to include multiple viewpoints: industrial, academic, environmental, regulatory, standardization, and certification. Keynote and topic papers were presented to establish a current information base for discussions. Six discussion groups addressed specific issues and identified, prioritized, and recommended specific research and development topics for the government and industries to undertake. The recommendations of this workshop offer a clear identification of research and development issues and create a roadmap for achieving them
    corecore