XML content warehousing: Improving sociological studies of mailing lists and web data

Author: Colazzo Dario
Dudouet François-Xavier
Manolescu Ioana
Nguyen Benjamin
Senellart Pierre
Vion Antoine
Publication date: 01/01/2011

In this paper, we present guidelines for an XML-based approach to the sociological study of Web data, such as the analysis of mailing lists or of databases available online. An XML warehouse is a flexible solution for storing and processing this kind of data. We propose an implemented solution and show possible applications through our case study of the profiles of experts involved in W3C standard-setting activity. We illustrate the sociological use of semi-structured databases by presenting our XML Schema for mailing-list warehousing. An XML Schema allows data sources to be added or cross-linked without modifying existing data sets, while still permitting structural evolution. We also show that the existence of hidden data implies increased complexity for traditional SQL users. XML content warehousing allows both exhaustive warehousing and recursive queries over content, with far less dependence on the initial storage. Finally, we present the possibility of exporting the data stored in the warehouse to commonly used advanced software devoted to sociological analysis.

arXiv.org e-Print Archive

Base de publications de l'université Paris-Dauphine

Crossref

INRIA a CCSD electronic archive server

HAL UVSQ

HAL-Rennes 1

Identification of delivery models for the provision of predictive genetic testing in Europe: protocol for a multicentre qualitative study and a systematic review of the literature

Author: Adamo Giovanna
D'Andrea Elvira
De Vito Corrado
Lagerberg Tyra Brita Margareta
Marzuillo Carolina
Pitini Erica
Rosso Annalisa
Unim Brigid Andounimye
Vacchio Maria Rosaria
Villari Paolo
Publication venue: Frontiers Media SA
Publication date: 01/01/2017

Introduction: The appropriate application of genomic technologies in healthcare is surrounded by many concerns. In particular, there is a lack of evidence on what constitutes an optimal genetic service delivery model, which depends on the type of genetic test and healthcare context considered. The present project aims to identify, classify, and evaluate delivery models for the provision of predictive genetic testing in Europe and in selected Anglophone extra-European countries (the USA, Canada, Australia, and New Zealand). It also sets out to survey the European public health community’s readiness to incorporate public health genomics into their practice. Materials and equipment: The project consists of (i) a systematic review of published literature and selected country websites, (ii) structured interviews with health experts on the genetic service delivery models in their respective countries, and (iii) a survey of European Public Health Association (EUPHA) members’ knowledge and attitudes toward genomics applications in clinical practice. The inclusion criteria for the systematic review are that articles be published in the period 2000–2015; be in English or Italian; and be from European countries or from Canada, the USA, Australia, or New Zealand. Additional policy documents will be retrieved from represented countries’ government-affiliated websites. The results of the research will be disseminated through the EUPHA network, the Italian Network for Genomics in Public Health (GENISAP), and seminars and workshops. Expected impact of the study on public health: The transfer of genomic technologies from research to clinical application is influenced not only by several factors inherent to research goals and delivery of healthcare but also by external and commercial interests that may cause the premature introduction of genetic tests in the public and private sectors. Furthermore, current genetic services are delivered without a standardized set of process and outcome measures, which makes the evaluation of healthcare services difficult. The present study will identify and classify delivery models and, subsequently, establish which are appropriate for the provision of predictive genetic testing in Europe by comparing sets of process and outcome measures. In this way, the study will provide a basis for future recommendations to decision makers involved in the financing, delivery, and consumption of genetic services.

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

Archivio istituzionale della ricerca - Università di Ferrara

Archivio della ricerca- Università di Roma La Sapienza

Natural language processing

Author: Adams
Amsler
Bangalore
Barker
Benoît
Bian
Bondale
Carrick
Ceric
Chandrasekar
Chang
Charniak
Chen
Chowdhury
Chowdhury
Costantino
Cowie
Craven
Craven
Craven
Dogru
Evans
Feldman
Fernandez
Gaizauskas
Glasgow
Haas
Hayes
Hayes
Hedlund
Herath
Ide
Isahara
Jelinek
Jeong
Jurafsky
Kazakov
Kehler
Khoo
Kim
King
Lange
Lee
Lehmam
Lehtokangas
Lewis
Liddy
Liddy
Lovis
Ma
Magnini
Mani
Manning
Marquez
Martinez
Martinez
McMurchie
Meyer
Mihalcea
Mock
Moens
Morin
Narita
Nerbonne
Oard
Ogura
Oudet
Owei
Paris
Pasero
Pedersen
Perez-Carballo
Petreley
Pirkola
Poesio
Rosenfield
Roux
Say
Scarlett
Schenker
Silber
Smeaton
Smeaton
Smith
Sokol
Song
Sparck Jones
Staab
Stock
Tolle
Trybula
Tsuda
Vickery
Waldrop
Warner
Weigard
Wilks
Wong
Yang
Yang
Zadrozny
Zweigenbaum
Publication venue: Wiley
Publication date: 01/01/2003

Beginning with the basic issues of NLP, this chapter aims to chart the major research activities in this area since the last ARIST chapter in 1996 (Haas, 1996), including: (i) natural language text processing systems (text summarization, information extraction, information retrieval, etc.), including domain-specific applications; (ii) natural language interfaces; (iii) NLP in the context of the WWW and digital libraries; and (iv) evaluation of NLP systems.

Crossref

University of Strathclyde Institutional Repository

OPUS - University of Technology Sydney

Design and Development of a User Specific Dynamic E-Magazine

Author: Santhalia Vikram
Singh Sanjay
Publication date: 11/01/2014

As the Internet and electronic media gain popularity thanks to their ease and speed, the number of Internet users has increased tremendously. The world is moving faster each day, with several events taking place at once, and the Internet is flooded with information in every field. This information ranges from highly relevant to a given user to less relevant or entirely irrelevant. In such a scenario, retrieving the information most relevant to the user is indispensable for saving time. Our solution is motivated by the idea of optimizing the search for information automatically. The information is delivered to the user through an interactive GUI. The content served is optimized based on the user's social networking profiles and on their reading habits within the proposed solution. The aim is to obtain the user's profile information from a social networking profile, on the assumption that almost every Internet user has one. This helps us personalize the content delivered to the user so as to produce what is most relevant, in the form of a personalized e-magazine. Further, the proposed solution learns the user's reading habits, for example the news items saved or clicked most often, and uses them to decide which content to provide. Comment: 19 pages, 6 figures

arXiv.org e-Print Archive

CiteSeerX

ZENODO

Curating E-Mails; A life-cycle approach to the management and preservation of e-mail messages

Author: Pennock Maureen
Publication date: 01/01/2006

E-mail forms the backbone of communications in many modern institutions and organisations and is a valuable type of organisational, cultural, and historical record. Successful management and preservation of valuable e-mail messages and collections is therefore vital if organisational accountability is to be achieved and historical or cultural memory retained for the future. This requires attention by all stakeholders across the entire life-cycle of the e-mail records. This instalment of the Digital Curation Manual reports on the several issues involved in managing and curating e-mail messages for both current and future use. Although there is no 'one-size-fits-all' solution, this instalment outlines a generic framework for e-mail curation and preservation, provides a summary of current approaches, and addresses the technical, organisational and cultural challenges to successful e-mail management and longer-term curation.

Automatic detection of change in address blocks for reply forms processing

Author: Gray Alison J.
Karthick Keerambur Ramaswami
Marshall Stephen
United Kingdom's Knowledge Transfer Partnership (KTP) Program (Funder)
Publication venue: International Association of Engineers
Publication date: 01/01/2008

This paper presents an automatic method for detecting on-line erasures, scribbles, corrections, and over-writing in the address block of various types of subscription and utility payment forms. The proposed approach employs bottom-up segmentation of the address block. Heuristic rules based on structural features are used to automate the detection process. The algorithm is applied to a large dataset of 5,780 real-world document forms scanned at a resolution of 200 dots per inch. The proposed algorithm performs well, with an average processing time of 108 milliseconds per document and a detection accuracy of 98.96%.

University of Strathclyde Institutional Repository

Links between the personalities, styles and performance in computer programming

Author: Baraani-Dastjerdi Ahmad
Ghasem-Aghaee Nasser
Karimi Zahra
Wagner Stefan
Publication venue: Elsevier BV
Publication date: 01/01/2015

There are repetitive patterns in strategies for manipulating source code. For example, modifying source code before acquiring knowledge of how the code works is a depth-first style, whereas reading and understanding before modifying source code is a breadth-first style. To the best of our knowledge, there is no study on the influence of personality on these styles. The objective of this study is to understand the influence of personality on programming styles. We conducted a correlational study with 65 programmers at the University of Stuttgart. Academic achievement, programming experience, attitude towards programming, and five personality factors were measured via a self-assessment survey. The programming styles were either asked about in the survey or mined from software repositories. Performance in programming was composed of the bug-proneness of programmers, which was mined from software repositories, the grades they received in a software project course, and their estimate of their own programming ability. Our statistical analysis found that Openness to Experience has a positive association with the breadth-first style and Conscientiousness has a positive association with the depth-first style. We also found that, in addition to having more programming experience and better academic achievement, working depth-first and saving coarse-grained revisions are associated with improved performance in programming. Comment: 27 pages, 6 figures

arXiv.org e-Print Archive

Land Shark at the Door? Why and How States Should Regulate Landmen

Author: Eisenberg Ann M.
Publication venue: FLASH: The Fordham Law Archive of Scholarship and History
Publication date: 01/01/2016

bepress Legal Repository

Scholar Commons - Institutional Repository of the University of South Carolina

Fordham University School of Law

The Research Repository @ WVU (West Virginia University)