Search CORE

85 research outputs found

Recommendation and weaving of reusable mashup model patterns for assisted development

Author: Casati Fabio
Chowdhury Soudip Roy
Daniel Florian
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

With this article, we give an answer to one of the open problems of mashup development that users may face when operating a model-driven mashup tool, namely the lack of modeling expertise. Although commonly considered simple applications, mashups can also be complex software artifacts depending on the number and types of Web resources (the components) they integrate. Mashup tools have undoubtedly simplified mashup development, yet the problem is still generally nontrivial and requires intimate knowledge of the components provided by the mashup tool, its underlying mashup paradigm, and of how to apply such to the integration of the components. This knowledge is generally neither intuitive nor standardized across different mashup tools and the consequent lack of modeling expertise affects both skilled programmers and end-user programmers alike. In this article, we show how to effectively assist the users of mashup tools with contextual, interactive recommendations of composition knowledge in the form of reusable mashup model patterns. We design and study three different recommendation algorithms and describe a pattern weaving approach for the one-click reuse of composition knowledge. We report on the implementation of three pattern recommender plugins for different mashup tools and demonstrate via user studies that recommending and weaving contextual mashup model patterns significantly reduces development times in all three cases

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

Semantic technologies: from niche to the mainstream of Web 3? A comprehensive framework for web Information modelling and semantic annotation

Author: Dotsika F.
Dotsika F.
Publication venue
Publication date: 01/01/2012
Field of study

Context: Web information technologies developed and applied in the last decade have considerably changed the way web applications operate and have revolutionised information management and knowledge discovery. Social technologies, user-generated classification schemes and formal semantics have a far-reaching sphere of influence. They promote collective intelligence, support interoperability, enhance sustainability and instigate innovation. Contribution: The research carried out and consequent publications follow the various paradigms of semantic technologies, assess each approach, evaluate its efficiency, identify the challenges involved and propose a comprehensive framework for web information modelling and semantic annotation, which is the thesis’ original contribution to knowledge. The proposed framework assists web information modelling, facilitates semantic annotation and information retrieval, enables system interoperability and enhances information quality. Implications: Semantic technologies coupled with social media and end-user involvement can instigate innovative influence with wide organisational implications that can benefit a considerable range of industries. The scalable and sustainable business models of social computing and the collective intelligence of organisational social media can be resourcefully paired with internal research and knowledge from interoperable information repositories, back-end databases and legacy systems. Semantified information assets can free human resources so that they can be used to better serve business development, support innovation and increase productivity

WestminsterResearch

Assisted Reuse of Pattern-Based Composition Knowledge for Mashup Development

Author: Roy Chowdhury Soudip
Publication venue: University of Trento
Publication date: 22/03/2013
Field of study

First generation of the World Wide Web (WWW) enabled users to have instantaneous access to a large diversity of knowledge. Second generation of the WWW (Web 2.0) brought a fundamental change in the way people interact with and through the World Wide Web. Web 2.0 has made the World Wide Web a platform not only for communication and sharing information but also for software development (e.g., web service composition). Web mashup or mashup development is a Web2.0 development approach in which users are expected to create applications by combining multiple data sources, application logic and UI components from the web to cater for their situational application needs. However, in reality creating an even simple mashup application is a complex task that can only be managed by skilled developers. Examples of ready mashup models are one of the main sources of help for users who don't know how to design a mashup, provided that suitable examples can be found (examples that have an analogy with the modeling situation faced by the user). But also tutorials, expert colleagues or friends, and, of course, Google are typical means to find help. However, searching for help does not always lead to a success, and retrieved information is only seldom immediately usable as it is, since the retrieved pieces of information are not contextual, i.e., immediately applicable to the given modeling problem. Motivated by the development challenges faced by a naive user of existing mashup tools, in this thesis we propose toaid such users by enabling assisted reuse of pattern-based composition knowledge. In this thesis we show how it is possible to effectively assist these users in their development task with contextual, interactive recommendations of composition knowledge in the form of mashup model patterns. We study a set of recommendation algorithms with different levels of performance and describe a flexible pattern weaving approach for the one-click reuse of patterns. We prove the generality of our algorithms and approach by implementing two prototype tools for two different mashup platforms. Finally, we validate the usefulness of our assisted development approach by performing thorough empirical tests and two user studies with our prototype tools

Unitn-eprints PhD

Emerging technologies for learning report (volume 3)

Author: De Freitas Sara
Greenfield Adam
Haller Michael
Oblinger Diana
Tonkin Emma
Van't Hooft Mark
Publication venue: British Educational Communications and Technology Agency (BECTA)
Publication date: 01/01/2008
Field of study

Digital Education Resource Archive

Web Data Extraction, Applications and Techniques: A Survey

Author: Abel
Amalfitano
Balduzzi
Baumgartner
Baumgartner
Baumgartner
Baumgartner
Baumgartner
Baumgartner
Berger
Berthold
Bettencourt
Califf
Catanese
Chang
Chen
Chen
Chen
Collins
Conover
Crandall
Crescenzi
Crescenzi
Dalvi
Dalvi
De Meo
De Meo
Doan
Emilio Ferrara
Ferrara
Ferrara
Ferrara
Ferrara
Ferrara
Flesca
Freitag
Furche
Gatterbauer
Gatterbauer
Giacomo Fiumara
Gjoka
Gkotsis
Gottlob
Gottlob
Hammersley
Han
Hecht
Hsu
Irmak
Khare
Kim
Kinsella
Kleinberg
Kleinberg
Kohlschütter
Kokkoras
Kokkoras
Kokkoras
Krüpl
Kushmerick
Kwak
Laender
Liu
Manning
Masanès
Mathes
Meng
Mislove
Monge
Muslea
Oro
Pan
Pasquale De Meo
Perito
Phan
Plake
Rahm
Rahm
Reis
Robert Baumgartner
Sahuguet
Sarawagi
Schifanella
Selkow
Shi
Soderland
Szomszor
Turmo
Vosecky
Wang
Wang
Weikum
Wilson
Winograd
Yang
Ye
Zafarani
Zanasi
Zhai
Zhang
Zhang
Publication venue: 'Elsevier BV'
Publication date: 09/06/2014
Field of study

Web Data Extraction is an important problem that has been studied by means of different scientific tools and in a broad range of applications. Many approaches to extracting data from the Web have been designed to solve specific problems and operate in ad-hoc domains. Other approaches, instead, heavily reuse techniques and algorithms developed in the field of Information Extraction. This survey aims at providing a structured and comprehensive overview of the literature in the field of Web Data Extraction. We provided a simple classification framework in which existing Web Data Extraction applications are grouped into two main classes, namely applications at the Enterprise level and at the Social Web level. At the Enterprise level, Web Data Extraction techniques emerge as a key tool to perform data analysis in Business and Competitive Intelligence systems as well as for business process re-engineering. At the Social Web level, Web Data Extraction techniques allow to gather a large amount of structured data continuously generated and disseminated by Web 2.0, Social Media and Online Social Network users and this offers unprecedented opportunities to analyze human behavior at a very large scale. We discuss also the potential of cross-fertilization, i.e., on the possibility of re-using Web Data Extraction techniques originally designed to work in a given domain, in other domains.Comment: Knowledge-based System

arXiv.org e-Print Archive

Crossref

Open Personalization: Involving Third Parties in Improving the User Experience of Websites

Author: Arellano Bartolomé Cristóbal
Publication venue
Publication date: 15/03/2013
Field of study

Traditional software development captures the user needs during the requirement analysis. The Web makes this endeavour even harder due to the difficulty to determine who these users are. In an attempt to tackle the heterogeneity of the user base, Web Personalization techniques are proposed to guide the users’ experience. In addition, Open Innovation allows organisations to look beyond their internal resources to develop new products or improve existing processes. This thesis sits in between by introducing Open Personalization as a means to incorporate actors other than webmasters in the personalization of web applications. The aim is to provide the technological basis that builds up a trusty environment for webmasters and companion actors to collaborate, i.e. "an architecture of participation". Such architecture very much depends on these actors’ profile. This work tackles three profiles (i.e. software partners, hobby programmers and end users), and proposes three "architectures of participation" tuned for each profile. Each architecture rests on different technologies: a .NET annotation library based on Inversion of Control for software partners, a Modding Interface in JavaScript for hobby programmers, and finally, a domain specific language for end-users. Proof-of-concept implementations are available for the three cases while a quantitative evaluation is conducted for the domain specific language

Archivo Digital para la Docencia y la Investigación

Query-Time Data Integration

Author: Eberius Julian
Publication venue
Publication date: 10/12/2015
Field of study

Today, data is collected in ever increasing scale and variety, opening up enormous potential for new insights and data-centric products. However, in many cases the volume and heterogeneity of new data sources precludes up-front integration using traditional ETL processes and data warehouses. In some cases, it is even unclear if and in what context the collected data will be utilized. Therefore, there is a need for agile methods that defer the effort of integration until the usage context is established. This thesis introduces Query-Time Data Integration as an alternative concept to traditional up-front integration. It aims at enabling users to issue ad-hoc queries on their own data as if all potential other data sources were already integrated, without declaring specific sources and mappings to use. Automated data search and integration methods are then coupled directly with query processing on the available data. The ambiguity and uncertainty introduced through fully automated retrieval and mapping methods is compensated by answering those queries with ranked lists of alternative results. Each result is then based on different data sources or query interpretations, allowing users to pick the result most suitable to their information need. To this end, this thesis makes three main contributions. Firstly, we introduce a novel method for Top-k Entity Augmentation, which is able to construct a top-k list of consistent integration results from a large corpus of heterogeneous data sources. It improves on the state-of-the-art by producing a set of individually consistent, but mutually diverse, set of alternative solutions, while minimizing the number of data sources used. Secondly, based on this novel augmentation method, we introduce the DrillBeyond system, which is able to process Open World SQL queries, i.e., queries referencing arbitrary attributes not defined in the queried database. The original database is then augmented at query time with Web data sources providing those attributes. Its hybrid augmentation/relational query processing enables the use of ad-hoc data search and integration in data analysis queries, and improves both performance and quality when compared to using separate systems for the two tasks. Finally, we studied the management of large-scale dataset corpora such as data lakes or Open Data platforms, which are used as data sources for our augmentation methods. We introduce Publish-time Data Integration as a new technique for data curation systems managing such corpora, which aims at improving the individual reusability of datasets without requiring up-front global integration. This is achieved by automatically generating metadata and format recommendations, allowing publishers to enhance their datasets with minimal effort. Collectively, these three contributions are the foundation of a Query-time Data Integration architecture, that enables ad-hoc data search and integration queries over large heterogeneous dataset collections

Technische Universität Dresden: Qucosa

Recommended from our members

PowerAqua: Open Question Answering on the Semantic Web

Author: Lopez Vanessa
Publication venue
Publication date: 01/01/2011
Field of study

With the rapid growth of semantic information in the Web, the processes of searching and querying these very large amounts of heterogeneous content have become increasingly challenging. This research tackles the problem of supporting users in querying and exploring information across multiple and heterogeneous Semantic Web (SW) sources. A review of literature on ontology-based Question Answering reveals the limitations of existing technology. Our approach is based on providing a natural language Question Answering interface for the SW, PowerAqua. The realization of PowerAqua represents a considerable advance with respect to other systems, which restrict their scope to an ontology-specific or homogeneous fraction of the publicly available SW content. To our knowledge, PowerAqua is the only system that is able to take advantage of the semantic data available on the Web to interpret and answer user queries posed in natural language. In particular, PowerAqua is uniquely able to answer queries by combining and aggregating information, which can be distributed across heterogeneous semantic resources. Here, we provide a complete overview of our work on PowerAqua, including: the research challenges it addresses; its architecture; the techniques we have realised to map queries to semantic data, to integrate partial answers drawn from different semantic resources and to rank alternative answers; and the evaluation studies we have performed, to assess the performance of PowerAqua. We believe our experiences can be extrapolated to a variety of end-user applications that wish to open up to large scale and heterogeneous structured datasets, to be able to exploit effectively what possibly is the greatest wealth of data in the history of Artificial Intelligence

Open Research Online (The Open University)

OpenGrey Repository

The web 2.0 Internet: Democratized Internet collaborations in the healthcare sector

Author: Hughes Benjamin Alexander Paul
Publication venue: Blanquerna - Universitat Ramon Llull
Publication date: 01/01/2010
Field of study

Les col•laboracions democratitzades a Internet, entenent-les com les eines participatives de la xarxa, o la Web 2.0, afecten en l'actualitat a nombrosos aspectes la nostra vida. Els acadèmics destaquen el potencial de la Web 2.0 per millorar l’aprenentatge o la salut, així com el seu continu impacte en sectors com el de la tecnologia de mitjans de comunicació. També plantegen un gran nombre de qüestions importants als professionals i estudiosos. Per exemple, la consideració crítica de la Web 2.0 com una bombolla o bé com un element més del màrqueting, que necessita d'una determinació del seu abast i naturalesa. Aquest mateix punt és aplicable a l'ús de la Web 2.0 en el sector sanitari, també anomenat com Medicina 2.0 o Sanitat 2.0. Referent a això, considerant el risc que el contingut generat per altres usuaris sigui utilitzat per prendre decisions relatives a la salut, i tenint en compte l'eficàcia no provada de la Web 2.0 com a instrument de la política sanitària, els acadèmics del tema conviden a la definició de millors models que es puguin aplicar a l'ús pràctic d'aquesta eina. Aquesta tesi es centra en l'estudi d'aquestes qüestions fonamentals, en un camp que es mou a gran velocitat, per darrera de la pràctica real, i que requereix la concertació d'una investigació interdisciplinària. Per tant, aquesta tesi incorpora set obres diferents que ofereixen àmplies perspectives sobre l'ús d'eines de col•laboració en la xarxa en el camp de l'atenció sanitària, cadascuna analitzant el tema amb una profunditat suficient com per seguir sent rellevant en un camp en ràpida evolució. Aquestes obres inclouen un examen d'(1) la Web 2.0 i (2) la Medicina 2.0, utilitzant l'anàlisi del contingut de milions de converses de la xarxa per identificar les principals qüestions pràctiques o teòriques i les tensions subjacents a cada concepte. Dos estudis addicionals analitzen (3) com i per què els metges fan servir les eines de la Web 2.0, i (4) com els metges busquen la informació en aquest context en constant moviment com és el d'Internet. Aquests dos estudis es basen en enquestes, diaris i entrevistes amb els metges que treballen en el Servei Nacional de Salut del Regne Unit. Tots dos destaquen resultats importants com ara models per a l'ús de la Medicina 2.0, o contribucions importants a la literatura com la connexió de la recerca cognitiva en la xarxa i la valoració de la informació en xarxa, tots dos camps sense connexió amb anterioritat a aquest treball. Tres estudis addicionals analitzen la web 2.0 des d'una perspectiva organitzacional, incloent (5) un estudi dels models de disseny de l'ús de la Web 2.0 en el sector farmacèutic, el qual detalla els millors models de pràctiques d'ús, i la seva clara relació amb els models de disseny de codi obert, i (6) també les estratègies d'innovació oberta al sector farmacèutic, on les eines de col•laboració en la xarxa permeten aquest tipus d'estratègies. Els dos últims estudis fan servir entrevistes amb 120 executius del sector farmacèutic analitzats a través d'anàlisi temàtic. Tots dos fan contribucions importants a la literatura mitjançant la caracterització de les estratègies d'innovació oberta i les implicacions per generar la capacitat d'absorció en el context d'innovació oberta. L'últim estudi (7) examina la Medicina 2.0 des de la perspectiva dels proveïdors de serveis de salut, per ajudar a la gestió d'ús de la Web 2.0 com un instrument per a millorar l’atenció sanitària. En general, hi ha moltes contribucions importants a la literatura, que en conjunt aconsegueixen ampliar el panorama de la Web 2.0 en l'assistència sanitària, i aporten consideracions especifiques a la literatura que abasta els sistemes d'informació, les ciències de la informació i la informàtica mèdica , així com la innovació oberta i l'estratègia. Las colaboraciones democratizadas en Internet, entendiéndolas como las herramientas participativas de la red o la Web 2.0, afectan en la actualidad a numerosos aspectos nuestra vida. Los académicos destacan el potencial de la Web 2.0 para mejorar el eAprendizaje o la salud, así como su continuo impacto en sectores como el de la tecnología de medios de comunicación. También plantean un gran número de cuestiones importantes a los profesionales y estudiosos. Por ejemplo, la consideración crítica de la Web 2.0 como una burbuja o bien como un elemento más del marketing, que necesita de una determinación de su alcance y naturaleza. Este mismo punto es aplicable al uso de la Web 2.0 en el sector sanitario, también denominado como Medicina 2.0 o Sanitad 2.0. A este respecto y considerando el riesgo de que el contenido generado por otros usuarios sea utilizado para tomar decisiones relativas a la salud, y la eficacia no probada de la Web 2.0 como instrumento de la política sanitaria; los académicos del tema invitan a la definición de mejores modelos que se puedan aplicar al uso práctico de esta herramienta. Esta tesis se centra en el estudio de estas cuestiones fundamentales, en un campo que se mueve a gran velocidad, por detrás de la práctica real, y que requiere la concertación de una investigación interdisciplinaria. Por lo tanto, esta tesis incorpora siete obras distintas que ofrecen amplias perspectivas sobre el uso de herramientas de colaboración en la red en el campo de la atención sanitaria, cada una analizando el tema con una profundidad suficiente como para seguir siendo relevante en un campo en rápida evolución. Estas obras incluyen un examen de (1) la Web 2.0 y (2) la Medicina 2.0, utilizando el análisis del contenido de millones de conversaciones de la red, para identificar las principales cuestiones prácticas o teóricas y las tensiones que subyacen a cada concepto. Dos estudios adicionales analizan (3) cómo y por qué los médicos usan las herramientas de la Web 2.0, y (4) cómo los médicos buscan la información en este contexto en constante movimiento como es el de Internet. Estos dos estudios se basan en encuestas, diarios y entrevistas con los médicos que trabajan en el Servicio Nacional de Salud del Reino Unido. Ambos destacan resultados importantes tales como modelos para el uso de la Medicina 2.0, o contribuciones importantes a la literatura como la conexión de la búsqueda cognitiva en la red y la valoración de la información en red, ambos campos sin conexión con anterioridad al presente trabajo.Tres estudios adicionales analizan la Web 2.0 desde una perspectiva organizacional, incluyendo (5) un estudio de los modelos de diseño del uso de la Web 2.0 en el sector farmacéutico, el cual detalla los mejores modelos de prácticas de uso, y su clara relación con los modelos de diseño de la open source, y (6) y también las estrategias de innovación abierta en el sector farmacéutico donde las herramientas de colaboración en la red permiten este tipo de estrategias. Los dos últimos estudios emplean entrevistas con 120 ejecutivos del sector farmacéutico analizados a través de análisis temático. Ambos hacen contribuciones importantes a la literatura mediante la caracterización de las estrategias de innovación abierta y las implicaciones para generar la capacidad de absorción en el contexto de innovación abierta. El último estudio (7) examina la Medicina 2.0 desde la perspectiva de los proveedores de servicios de salud, para ayudar a la gestión de uso de la Web 2.0 como un instrumento para la gestión de una mejor atención sanitaria. En general, hay muchas contribuciones importantes a la literatura, que en conjunto logran ampliar el panorama de la Web 2.0 en la asistencia sanitaria, y aportan consideraciones específicas a la literatura que abarca los sistemas de información, las ciencias de la información, la informática médica, así como la innovación abierta y la estrategia.Democratized internet collaborations, referring to participatory online tools or Web 2.0, now impact many aspects of people’s lives. Scholars note Web 2.0’s potential to improve eLearning or healthcare, and its ongoing impact in sectors such as tech-media. They also raise a plethora of important questions for practitioners and scholars, such as the criticism of Web 2.0 as hype or marketing term, which necessitates some determination of the scope and nature of Web 2.0. This holds equally for Web 2.0’s use in health care, denoted as Medicine 2.0 or Health 2.0. Moreover, given the risks of people using user-generated content for health decisions, and its unproven effectiveness as a health policy tool, scholars have called for best practice models of use. This thesis addresses these fundamental issues, in a field that is fast moving, behind actual practice, and that requires concerted inter-disciplinary research. Therefore, this thesis incorporates seven distinct works that provide broad perspectives on the use of online collaboration tools in healthcare, each analyzing a specific topic in enough depth to remain relevant in a fast moving field. These works include an examination of (1) Web 2.0 and (2) Medicine 2.0, using content analysis of millions of online conversations to surface the major practical or theoretical issues and tensions that underpin each concept. Two further studies examine (3) how and why doctors use Web 2.0 tools, and (4) how doctors search or forage for information in this evolving internet environment. These two studies rely on surveys, diaries and interviews from doctors working in the UK’s National Health Service (NHS). Both highlight important results, such as models for Medicine 2.0 use, or make important contributions to literature such as connecting the previously separate cognitive online search and internet information judgment literatures. Three further studies examine Web 2.0 from an organizational perspective, including (5) design patterns of Web 2.0’s use in global Pharma, which details best practice models of use and its clear link to Open Source design patterns, and (6) global Pharma’s Open Innovation strategies, where online collaboration tools enable these strategies. The latter two studies employ interviews with 120 pharmaceutical executives analyzed through thematic analysis. They make major contributions to literature by characterizing open innovation strategies and gleaning implications for Absorptive Capacity in the Open Innovation context. The final study (7) examines Medicine 2.0 form the perspective of health service providers, informing management using eHealth as an instrument for improved healthcare management. Overall, there are many major contributions to literature, which together achieve both a broad overview of Web 2.0 in healthcare, but also make specific additions to literature encompassing information systems, information science, medical informatics, and open innovation and strategy

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Tesis Doctorals en Xarxa

A treatise on Web 2.0 with a case study from the financial markets

Author: Martin D. Sykora (7121681)
Publication venue
Publication date: 01/01/2012
Field of study

There has been much hype in vocational and academic circles surrounding the emergence of web 2.0 or social media; however, relatively little work was dedicated to substantiating the actual concept of web 2.0. Many have dismissed it as not deserving of this new title, since the term web 2.0 assumes a certain interpretation of web history, including enough progress in certain direction to trigger a succession [i.e. web 1.0 → web 2.0]. Others provided arguments in support of this development, and there has been a considerable amount of enthusiasm in the literature. Much research has been busy evaluating current use of web 2.0, and analysis of the user generated content, but an objective and thorough assessment of what web 2.0 really stands for has been to a large extent overlooked. More recently the idea of collective intelligence facilitated via web 2.0, and its potential applications have raised interest with researchers, yet a more unified approach and work in the area of collective intelligence is needed. This thesis identifies and critically evaluates a wider context for the web 2.0 environment, and what caused it to emerge; providing a rich literature review on the topic, a review of existing taxonomies, a quantitative and qualitative evaluation of the concept itself, an investigation of the collective intelligence potential that emerges from application usage. Finally, a framework for harnessing collective intelligence in a more systematic manner is proposed. In addition to the presented results, novel methodologies are also introduced throughout this work. In order to provide interesting insight but also to illustrate analysis, a case study of the recent financial crisis is considered. Some interesting results relating to the crisis are revealed within user generated content data, and relevant issues are discussed where appropriate

Loughborough University Institutional Repository