68 research outputs found

    D6.2 Integrated Final Version of the Components for Lexical Acquisition

    Get PDF
    The PANACEA project has addressed one of the most critical bottlenecks that threaten the development of technologies to support multilingualism in Europe, and to process the huge quantity of multilingual data produced annually. Any attempt at automated language processing, particularly Machine Translation (MT), depends on the availability of language-specific resources. Such Language Resources (LR) contain information about the language\u27s lexicon, i.e. the words of the language and the characteristics of their use. In Natural Language Processing (NLP), LRs contribute information about the syntactic and semantic behaviour of words - i.e. their grammar and their meaning - which inform downstream applications such as MT. To date, many LRs have been generated by hand, requiring significant manual labour from linguistic experts. However, proceeding manually, it is impossible to supply LRs for every possible pair of European languages, textual domain, and genre, which are needed by MT developers. Moreover, an LR for a given language can never be considered complete nor final because of the characteristics of natural language, which continually undergoes changes, especially spurred on by the emergence of new knowledge domains and new technologies. PANACEA has addressed this challenge by building a factory of LRs that progressively automates the stages involved in the acquisition, production, updating and maintenance of LRs required by MT systems. The existence of such a factory will significantly cut down the cost, time and human effort required to build LRs. WP6 has addressed the lexical acquisition component of the LR factory, that is, the techniques for automated extraction of key lexical information from texts, and the automatic collation of lexical information into LRs in a standardized format. The goal of WP6 has been to take existing techniques capable of acquiring syntactic and semantic information from corpus data, improving upon them, adapting and applying them to multiple languages, and turning them into powerful and flexible techniques capable of supporting massive applications. One focus for improving the scalability and portability of lexical acquisition techniques has been to extend exiting techniques with more powerful, less "supervised" methods. In NLP, the amount of supervision refers to the amount of manual annotation which must be applied to a text corpus before machine learning or other techniques are applied to the data to compile a lexicon. More manual annotation means more accurate training data, and thus a more accurate LR. However, given that it is impractical from a cost and time perspective to manually annotate the vast amounts of data required for multilingual MT across domains, it is important to develop techniques which can learn from corpora with less supervision. Less supervised methods are capable of supporting both large-scale acquisition and efficient domain adaptation, even in the domains where data is scarce. Another focus of lexical acquisition in PANACEA has been the need of LR users to tune the accuracy level of LRs. Some applications may require increased precision, or accuracy, where the application requires a high degree of confidence in the lexical information used. At other times a greater level of coverage may be required, with information about more words at the expense of some degree of accuracy. Lexical acquisition in PANACEA has investigated confidence thresholds for lexical acquisition to ensure that the ultimate users of LRs can generate lexical data from the PANACEA factory at the desired level of accuracy

    D6.1: Technologies and Tools for Lexical Acquisition

    Get PDF
    This report describes the technologies and tools to be used for Lexical Acquisition in PANACEA. It includes descriptions of existing technologies and tools which can be built on and improved within PANACEA, as well as of new technologies and tools to be developed and integrated in PANACEA platform. The report also specifies the Lexical Resources to be produced. Four main areas of lexical acquisition are included: Subcategorization frames (SCFs), Selectional Preferences (SPs), Lexical-semantic Classes (LCs), for both nouns and verbs, and Multi-Word Expressions (MWEs)

    D7.4 Third evaluation report. Evaluation of PANACEA v3 and produced resources

    Get PDF
    D7.4 reports on the evaluation of the different components integrated in the PANACEA third cycle of development as well as the final validation of the platform itself. All validation and evaluation experiments follow the evaluation criteria already described in D7.1. The main goal of WP7 tasks was to test the (technical) functionalities and capabilities of the middleware that allows the integration of the various resource-creation components into an interoperable distributed environment (WP3) and to evaluate the quality of the components developed in WP5 and WP6. The content of this deliverable is thus complementary to D8.2 and D8.3 that tackle advantages and usability in industrial scenarios. It has to be noted that the PANACEA third cycle of development addressed many components that are still under research. The main goal for this evaluation cycle thus is to assess the methods experimented with and their potentials for becoming actual production tools to be exploited outside research labs. For most of the technologies, an attempt was made to re-interpret standard evaluation measures, usually in terms of accuracy, precision and recall, as measures related to a reduction of costs (time and human resources) in the current practices based on the manual production of resources. In order to do so, the different tools had to be tuned and adapted to maximize precision and for some tools the possibility to offer confidence measures that could allow a separation of the resources that still needed manual revision has been attempted. Furthermore, the extension to other languages in addition to English, also a PANACEA objective, has been evaluated. The main facts about the evaluation results are now summarized

    Enriching the "Senso Comune" Platform with Automatically Acquired Data

    Get PDF
    International audienceThis paper reports on research activities on automatic methods for the enrichment of the Senso Comune platform. At this stage of development, we will report on two tasks, namely word sense alignment with MultiWordNet and automatic acquisition of Verb Shallow Frames from sense annotated data in the MultiSemCor corpus. The results obtained are satisfying. We achieved a final F-measure of 0.64 for noun sense alignment and a F-measure of 0.47 for verb sense alignment, and an accuracy of 68% on the acquisition of VerbShallow Frames

    Mining the Unmapped Reads in Bovine RNA-Seq Data Reveals the Prevalence of Bovine Herpes Virus-6 in European Dairy Cows and the Associated Changes in Their Phenotype and Leucocyte Transcriptome

    Get PDF
    Microbial RNA is detectable in host samples by aligning unmapped reads from RNA sequencing against taxon reference sequences, generating a score proportional to the microbial load. An RNA-Seq data analysis showed that 83.5% of leukocyte samples from six dairy herds in different EU countries contained bovine herpes virus-6 (BoHV-6). Phenotypic data on milk production, metabolic function, and disease collected during their first 50 days in milk (DIM) were compared between cows with low (1–200 and n = 114) or high (201–1175 and n = 24) BoHV-6 scores. There were no differences in milk production parameters, but high score cows had numerically fewer incidences of clinical mastitis (4.2% vs. 12.2%) and uterine disease (54.5% vs. 62.7%). Their metabolic status was worse, based on measurements of IGF-1 and various metabolites in blood and milk. A comparison of the global leukocyte transcriptome between high and low BoHV-6 score cows at around 14 DIM yielded 485 differentially expressed genes (DEGs). The top pathway from Gene Ontology (GO) enrichment analysis was the immune system process. Down-regulated genes in the high BoHV-6 cows included those encoding proteins involved in viral detection (DDX6 and DDX58), interferon response, and E3 ubiquitin ligase activity. This suggested that BoHV-6 may largely evade viral detection and that it does not cause clinical disease in dairy cows

    3rd EGEE User Forum

    Get PDF
    We have organized this book in a sequence of chapters, each chapter associated with an application or technical theme introduced by an overview of the contents, and a summary of the main conclusions coming from the Forum for the chapter topic. The first chapter gathers all the plenary session keynote addresses, and following this there is a sequence of chapters covering the application flavoured sessions. These are followed by chapters with the flavour of Computer Science and Grid Technology. The final chapter covers the important number of practical demonstrations and posters exhibited at the Forum. Much of the work presented has a direct link to specific areas of Science, and so we have created a Science Index, presented below. In addition, at the end of this book, we provide a complete list of the institutes and countries involved in the User Forum

    Social Marketing in Action

    Get PDF
    This book was written for those interested in creating social change for the greater good. In this book, we provide a wide selection of social marketing cases from which we can learn and teach. The book is intended for both academic and practitioner use. Part I of this book offers a brief yet comprehensive review of social marketing. This provides the reader with the background in social change and marketing necessary to read and analyze the subsequent cases. Parts II, III, IV, and V of this book offer a total of 24 social marketing cases, from a variety of countries, addressing many different issues. For classroom use, this book is written to serve as a stand-alone tool, with Part I providing a concise introduction to social marketing principles and theory. Chapter appendices provide links to further readings on social marketing principles for those wishing to delve deeper. If still further detail on social marketing principles and theory are desired, this book can easily be paired with another social marketing textbook as well. We have aimed the book to be accessible to undergraduate students but also offer sufficient material to challenge students at the graduate level. Advanced students should be encouraged to further explore the references and links provided, to critique the case approaches, and to offer alternative strategies for the cases provided. A separate teaching guide collection is available for the book as well. This collection contains answers to the discussion questions in the cases, as well as suggested activities for inside and outside of the classroom. Additionally, the teaching guides for many cases offer suggestions for further reading and other helpful resources. For practitioner use, Part I offers a refresher on basic social marketing principles. In addition, a variety of references are offered, allowing for further personal study. Parts II through V offer detailed information about a wide variety of actual social marketing programs. You can compare and contrast these cases with your own situation, hopefully gaining insights that will be helpful in your own social marketing efforts

    Design and Management of Manufacturing Systems

    Get PDF
    Although the design and management of manufacturing systems have been explored in the literature for many years now, they still remain topical problems in the current scientific research. The changing market trends, globalization, the constant pressure to reduce production costs, and technical and technological progress make it necessary to search for new manufacturing methods and ways of organizing them, and to modify manufacturing system design paradigms. This book presents current research in different areas connected with the design and management of manufacturing systems and covers such subject areas as: methods supporting the design of manufacturing systems, methods of improving maintenance processes in companies, the design and improvement of manufacturing processes, the control of production processes in modern manufacturing systems production methods and techniques used in modern manufacturing systems and environmental aspects of production and their impact on the design and management of manufacturing systems. The wide range of research findings reported in this book confirms that the design of manufacturing systems is a complex problem and that the achievement of goals set for modern manufacturing systems requires interdisciplinary knowledge and the simultaneous design of the product, process and system, as well as the knowledge of modern manufacturing and organizational methods and techniques
    • …
    corecore