85 research outputs found
Recommendation and weaving of reusable mashup model patterns for assisted development
With this article, we give an answer to one of the open problems of mashup development that users may face when operating a model-driven mashup tool, namely the lack of modeling expertise. Although commonly considered simple applications, mashups can also be complex software artifacts depending on the number and types of Web resources (the components) they integrate. Mashup tools have undoubtedly simplified mashup development, yet the problem is still generally nontrivial and requires intimate knowledge of the components provided by the mashup tool, its underlying mashup paradigm, and of how to apply such to the integration of the components. This knowledge is generally neither intuitive nor standardized across different mashup tools and the consequent lack of modeling expertise affects both skilled programmers and end-user programmers alike. In this article, we show how to effectively assist the users of mashup tools with contextual, interactive recommendations of composition knowledge in the form of reusable mashup model patterns. We design and study three different recommendation algorithms and describe a pattern weaving approach for the one-click reuse of composition knowledge. We report on the implementation of three pattern recommender plugins for different mashup tools and demonstrate via user studies that recommending and weaving contextual mashup model patterns significantly reduces development times in all three cases
Semantic technologies: from niche to the mainstream of Web 3? A comprehensive framework for web Information modelling and semantic annotation
Context: Web information technologies developed and applied in the last decade
have considerably changed the way web applications operate and have
revolutionised information management and knowledge discovery. Social
technologies, user-generated classification schemes and formal semantics have a
far-reaching sphere of influence. They promote collective intelligence, support
interoperability, enhance sustainability and instigate innovation.
Contribution: The research carried out and consequent publications follow the
various paradigms of semantic technologies, assess each approach, evaluate its
efficiency, identify the challenges involved and propose a comprehensive framework for web information modelling and semantic annotation, which is the thesis’ original contribution to knowledge. The proposed framework assists web information
modelling, facilitates semantic annotation and information retrieval, enables system interoperability and enhances information quality.
Implications: Semantic technologies coupled with social media and end-user
involvement can instigate innovative influence with wide organisational implications that can benefit a considerable range of industries. The scalable and sustainable business models of social computing and the collective intelligence of organisational social media can be resourcefully paired with internal research and knowledge from interoperable information repositories, back-end databases and legacy systems.
Semantified information assets can free human resources so that they can be used to better serve business development, support innovation and increase productivity
Assisted Reuse of Pattern-Based Composition Knowledge for Mashup Development
First generation of the World Wide Web (WWW) enabled users to have instantaneous access to a large diversity of knowledge. Second generation of the WWW (Web 2.0) brought a fundamental change in the way people interact with and through the World Wide Web. Web 2.0 has made the World Wide Web a platform not only for communication and sharing information but also for software development (e.g., web service composition). Web mashup or mashup development is a Web2.0 development approach in which users are expected to create applications by combining multiple data sources, application logic and UI components from the web to cater for their situational application needs. However, in reality creating an even simple mashup application is a complex task that can only be managed by skilled developers.
Examples of ready mashup models are one of the main sources of help for users who don't know how to design a mashup, provided that suitable examples can be found (examples that have an analogy with the modeling situation faced by the user). But also tutorials, expert colleagues or friends, and, of course, Google are typical means to find help. However, searching for help does not always lead to a success, and retrieved information is only seldom immediately usable as it is, since the retrieved pieces of information are not contextual, i.e., immediately applicable to the given modeling problem.
Motivated by the development challenges faced by a naive user of existing mashup tools, in this thesis we propose toaid such users by enabling assisted reuse of pattern-based composition knowledge. In this thesis we show how it is possible to effectively assist these users in their development task with contextual, interactive recommendations of composition knowledge in the form of mashup model patterns. We study a set of recommendation algorithms with different levels of performance and describe a flexible pattern weaving approach for the one-click reuse of patterns. We prove the generality of our algorithms and approach by implementing two prototype tools for two different mashup platforms. Finally, we validate the usefulness of our assisted development approach by performing thorough empirical tests and two user studies with our prototype tools
Web Data Extraction, Applications and Techniques: A Survey
Web Data Extraction is an important problem that has been studied by means of
different scientific tools and in a broad range of applications. Many
approaches to extracting data from the Web have been designed to solve specific
problems and operate in ad-hoc domains. Other approaches, instead, heavily
reuse techniques and algorithms developed in the field of Information
Extraction.
This survey aims at providing a structured and comprehensive overview of the
literature in the field of Web Data Extraction. We provided a simple
classification framework in which existing Web Data Extraction applications are
grouped into two main classes, namely applications at the Enterprise level and
at the Social Web level. At the Enterprise level, Web Data Extraction
techniques emerge as a key tool to perform data analysis in Business and
Competitive Intelligence systems as well as for business process
re-engineering. At the Social Web level, Web Data Extraction techniques allow
to gather a large amount of structured data continuously generated and
disseminated by Web 2.0, Social Media and Online Social Network users and this
offers unprecedented opportunities to analyze human behavior at a very large
scale. We discuss also the potential of cross-fertilization, i.e., on the
possibility of re-using Web Data Extraction techniques originally designed to
work in a given domain, in other domains.Comment: Knowledge-based System
Open Personalization: Involving Third Parties in Improving the User Experience of Websites
Traditional software development captures the user needs during the
requirement analysis. The Web makes this endeavour even harder due to
the difficulty to determine who these users are. In an attempt to tackle
the heterogeneity of the user base, Web Personalization techniques are
proposed to guide the users’ experience. In addition, Open Innovation
allows organisations to look beyond their internal resources to develop new
products or improve existing processes.
This thesis sits in between by introducing Open Personalization as a
means to incorporate actors other than webmasters in the personalization
of web applications. The aim is to provide the technological basis that
builds up a trusty environment for webmasters and companion actors to
collaborate, i.e. "an architecture of participation". Such architecture
very much depends on these actors’ profile. This work tackles three
profiles (i.e. software partners, hobby programmers and end users), and
proposes three "architectures of participation" tuned for each profile. Each
architecture rests on different technologies: a .NET annotation library
based on Inversion of Control for software partners, a Modding Interface in
JavaScript for hobby programmers, and finally, a domain specific language
for end-users. Proof-of-concept implementations are available for the three
cases while a quantitative evaluation is conducted for the domain specific
language
Query-Time Data Integration
Today, data is collected in ever increasing scale and variety, opening up enormous potential for new insights and data-centric products. However, in many cases the volume and heterogeneity of new data sources precludes up-front integration using traditional ETL processes and data warehouses. In some cases, it is even unclear if and in what context the collected data will be utilized. Therefore, there is a need for agile methods that defer the effort of integration until the usage context is established.
This thesis introduces Query-Time Data Integration as an alternative concept to traditional up-front integration. It aims at enabling users to issue ad-hoc queries on their own data as if all potential other data sources were already integrated, without declaring specific sources and mappings to use. Automated data search and integration methods are then coupled directly with query processing on the available data. The ambiguity and uncertainty introduced through fully automated retrieval and mapping methods is compensated by answering those queries with ranked lists of alternative results. Each result is then based on different data sources or query interpretations, allowing users to pick the result most suitable to their information need.
To this end, this thesis makes three main contributions. Firstly, we introduce a novel method for Top-k Entity Augmentation, which is able to construct a top-k list of consistent integration results from a large corpus of heterogeneous data sources. It improves on the state-of-the-art by producing a set of individually consistent, but mutually diverse, set of alternative solutions, while minimizing the number of data sources used. Secondly, based on this novel augmentation method, we introduce the DrillBeyond system, which is able to process Open World SQL queries, i.e., queries referencing arbitrary attributes not defined in the queried database. The original database is then augmented at query time with Web data sources providing those attributes. Its hybrid augmentation/relational query processing enables the use of ad-hoc data search and integration in data analysis queries, and improves both performance and quality when compared to using separate systems for the two tasks. Finally, we studied the management of large-scale dataset corpora such as data lakes or Open Data platforms, which are used as data sources for our augmentation methods. We introduce Publish-time Data Integration as a new technique for data curation systems managing such corpora, which aims at improving the individual reusability of datasets without requiring up-front global integration. This is achieved by automatically generating metadata and format recommendations, allowing publishers to enhance their datasets with minimal effort.
Collectively, these three contributions are the foundation of a Query-time Data Integration architecture, that enables ad-hoc data search and integration queries over large heterogeneous dataset collections
Recommended from our members
PowerAqua: Open Question Answering on the Semantic Web
With the rapid growth of semantic information in the Web, the processes of searching and querying these very large amounts of heterogeneous content have become increasingly challenging. This research tackles the problem of supporting users in querying and exploring information across multiple and heterogeneous Semantic Web (SW) sources.
A review of literature on ontology-based Question Answering reveals the limitations of existing technology. Our approach is based on providing a natural language Question Answering interface for the SW, PowerAqua. The realization of PowerAqua represents a considerable advance with respect to other systems, which restrict their scope to an ontology-specific or homogeneous fraction of the publicly available SW content. To our knowledge, PowerAqua is the only system that is able to take advantage of the semantic data available on the Web to interpret and answer user queries posed in natural language. In particular, PowerAqua is uniquely able to answer queries by combining and aggregating information, which can be distributed across heterogeneous semantic resources.
Here, we provide a complete overview of our work on PowerAqua, including: the research challenges it addresses; its architecture; the techniques we have realised to map queries to semantic data, to integrate partial answers drawn from different semantic resources and to rank alternative answers; and the evaluation studies we have performed, to assess the performance of PowerAqua. We believe our experiences can be extrapolated to a variety of end-user applications that wish to open up to large scale and heterogeneous structured datasets, to be able to exploit effectively what possibly is the greatest wealth of data in the history of Artificial Intelligence
The web 2.0 Internet: Democratized Internet collaborations in the healthcare sector
Les col•laboracions democratitzades a Internet, entenent-les com les eines participatives de la xarxa, o la Web 2.0, afecten en l'actualitat a nombrosos aspectes la nostra vida. Els acadèmics destaquen el potencial de la Web 2.0 per millorar l’aprenentatge o la salut, aixà com el seu continu impacte en sectors com el de la tecnologia de mitjans de comunicació. També plantegen un gran nombre de qüestions importants als professionals i estudiosos. Per exemple, la consideració crÃtica de la Web 2.0 com una bombolla o bé com un element més del mà rqueting, que necessita d'una determinació del seu abast i naturalesa. Aquest mateix punt és aplicable a l'ús de la Web 2.0 en el sector sanitari, també anomenat com Medicina 2.0 o Sanitat 2.0. Referent a això, considerant el risc que el contingut generat per altres usuaris sigui utilitzat per prendre decisions relatives a la salut, i tenint en compte l'eficà cia no provada de la Web 2.0 com a instrument de la polÃtica sanità ria, els acadèmics del tema conviden a la definició de millors models que es puguin aplicar a l'ús prà ctic d'aquesta eina. Aquesta tesi es centra en l'estudi d'aquestes qüestions fonamentals, en un camp que es mou a gran velocitat, per darrera de la prà ctica real, i que requereix la concertació d'una investigació interdisciplinà ria. Per tant, aquesta tesi incorpora set obres diferents que ofereixen à mplies perspectives sobre l'ús d'eines de col•laboració en la xarxa en el camp de l'atenció sanità ria, cadascuna analitzant el tema amb una profunditat suficient com per seguir sent rellevant en un camp en rà pida evolució.
Aquestes obres inclouen un examen d'(1) la Web 2.0 i (2) la Medicina 2.0, utilitzant l'anà lisi del contingut de milions de converses de la xarxa per identificar les principals qüestions prà ctiques o teòriques i les tensions subjacents a cada concepte. Dos estudis addicionals analitzen (3) com i per què els metges fan servir les eines de la Web 2.0, i (4) com els metges busquen la informació en aquest context en constant moviment com és el d'Internet. Aquests dos estudis es basen en enquestes, diaris i entrevistes amb els metges que treballen en el Servei Nacional de Salut del Regne Unit. Tots dos destaquen resultats importants com ara models per a l'ús de la Medicina 2.0, o contribucions importants a la literatura com la connexió de la recerca cognitiva en la xarxa i la valoració de la informació en xarxa, tots dos camps sense connexió amb anterioritat a aquest treball. Tres estudis addicionals analitzen la web 2.0 des d'una perspectiva organitzacional, incloent (5) un estudi dels models de disseny de l'ús de la Web 2.0 en el sector farmacèutic, el qual detalla els millors models de prà ctiques d'ús, i la seva clara relació amb els models de disseny de codi obert, i (6) també les estratègies d'innovació oberta al sector farmacèutic, on les eines de col•laboració en la xarxa permeten aquest tipus d'estratègies. Els dos últims estudis fan servir entrevistes amb 120 executius del sector farmacèutic analitzats a través d'anà lisi temà tic. Tots dos fan contribucions importants a la literatura mitjançant la caracterització de les estratègies d'innovació oberta i les implicacions per generar la capacitat d'absorció en el context d'innovació oberta. L'últim estudi (7) examina la Medicina 2.0 des de la perspectiva dels proveïdors de serveis de salut, per ajudar a la gestió d'ús de la Web 2.0 com un instrument per a millorar l’atenció sanità ria.
En general, hi ha moltes contribucions importants a la literatura, que en conjunt aconsegueixen ampliar el panorama de la Web 2.0 en l'assistència sanità ria, i aporten consideracions especifiques a la literatura que abasta els sistemes d'informació, les ciències de la informació i la informà tica mèdica , aixà com la innovació oberta i l'estratègia. Las colaboraciones democratizadas en Internet, entendiéndolas como las herramientas participativas de la red o la Web 2.0, afectan en la actualidad a numerosos aspectos nuestra vida. Los académicos destacan el potencial de la Web 2.0 para mejorar el eAprendizaje o la salud, asà como su continuo impacto en sectores como el de la tecnologÃa de medios de comunicación. También plantean un gran número de cuestiones importantes a los profesionales y estudiosos. Por ejemplo, la consideración crÃtica de la Web 2.0 como una burbuja o bien como un elemento más del marketing, que necesita de una determinación de su alcance y naturaleza. Este mismo punto es aplicable al uso de la Web 2.0 en el sector sanitario, también denominado como Medicina 2.0 o Sanitad 2.0. A este respecto y considerando el riesgo de que el contenido generado por otros usuarios sea utilizado para tomar decisiones relativas a la salud, y la eficacia no probada de la Web 2.0 como instrumento de la polÃtica sanitaria; los académicos del tema invitan a la definición de mejores modelos que se puedan aplicar al uso práctico de esta herramienta. Esta tesis se centra en el estudio de estas cuestiones fundamentales, en un campo que se mueve a gran velocidad, por detrás de la práctica real, y que requiere la concertación de una investigación interdisciplinaria. Por lo tanto, esta tesis incorpora siete obras distintas que ofrecen amplias perspectivas sobre el uso de herramientas de colaboración en la red en el campo de la atención sanitaria, cada una analizando el tema con una profundidad suficiente como para seguir siendo relevante en un campo en rápida evolución.
Estas obras incluyen un examen de (1) la Web 2.0 y (2) la Medicina 2.0, utilizando el análisis del contenido de millones de conversaciones de la red, para identificar las principales cuestiones prácticas o teóricas y las tensiones que subyacen a cada concepto. Dos estudios adicionales analizan (3) cómo y por qué los médicos usan las herramientas de la Web 2.0, y (4) cómo los médicos buscan la información en este contexto en constante movimiento como es el de Internet. Estos dos estudios se basan en encuestas, diarios y entrevistas con los médicos que trabajan en el Servicio Nacional de Salud del Reino Unido. Ambos destacan resultados importantes tales como modelos para el uso de la Medicina 2.0, o contribuciones importantes a la literatura como la conexión de la búsqueda cognitiva en la red y la valoración de la información en red, ambos campos sin conexión con anterioridad al presente trabajo.Tres estudios adicionales analizan la Web 2.0 desde una perspectiva organizacional, incluyendo (5) un estudio de los modelos de diseño del uso de la Web 2.0 en el sector farmacéutico, el cual detalla los mejores modelos de prácticas de uso, y su clara relación con los modelos de diseño de la open source, y (6) y también las estrategias de innovación abierta en el sector farmacéutico donde las herramientas de colaboración en la red permiten este tipo de estrategias. Los dos últimos estudios emplean entrevistas con 120 ejecutivos del sector farmacéutico analizados a través de análisis temático. Ambos hacen contribuciones importantes a la literatura mediante la caracterización de las estrategias de innovación abierta y las implicaciones para generar la capacidad de absorción en el contexto de innovación abierta. El último estudio (7) examina la Medicina 2.0 desde la perspectiva de los proveedores de servicios de salud, para ayudar a la gestión de uso de la Web 2.0 como un instrumento para la gestión de una mejor atención sanitaria.
En general, hay muchas contribuciones importantes a la literatura, que en conjunto logran ampliar el panorama de la Web 2.0 en la asistencia sanitaria, y aportan consideraciones especÃficas a la literatura que abarca los sistemas de información, las ciencias de la información, la informática médica, asà como la innovación abierta y la estrategia.Democratized internet collaborations, referring to participatory online tools or Web 2.0, now impact many aspects of people’s lives. Scholars note Web 2.0’s potential to improve eLearning or healthcare, and its ongoing impact in sectors such as tech-media. They also raise a plethora of important questions for practitioners and scholars, such as the criticism of Web 2.0 as hype or marketing term, which necessitates some determination of the scope and nature of Web 2.0. This holds equally for Web 2.0’s use in health care, denoted as Medicine 2.0 or Health 2.0. Moreover, given the risks of people using user-generated content for health decisions, and its unproven effectiveness as a health policy tool, scholars have called for best practice models of use. This thesis addresses these fundamental issues, in a field that is fast moving, behind actual practice, and that requires concerted inter-disciplinary research. Therefore, this thesis incorporates seven distinct works that provide broad perspectives on the use of online collaboration tools in healthcare, each analyzing a specific topic in enough depth to remain relevant in a fast moving field.
These works include an examination of (1) Web 2.0 and (2) Medicine 2.0, using content analysis of millions of online conversations to surface the major practical or theoretical issues and tensions that underpin each concept. Two further studies examine (3) how and why doctors use Web 2.0 tools, and (4) how doctors search or forage for information in this evolving internet environment. These two studies rely on surveys, diaries and interviews from doctors working in the UK’s National Health Service (NHS). Both highlight important results, such as models for Medicine 2.0 use, or make important contributions to literature such as connecting the previously separate cognitive online search and internet information judgment literatures. Three further studies examine Web 2.0 from an organizational perspective, including (5) design patterns of Web 2.0’s use in global Pharma, which details best practice models of use and its clear link to Open Source design patterns, and (6) global Pharma’s Open Innovation strategies, where online collaboration tools enable these strategies. The latter two studies employ interviews with 120 pharmaceutical executives analyzed through thematic analysis. They make major contributions to literature by characterizing open innovation strategies and gleaning implications for Absorptive Capacity in the Open Innovation context. The final study (7) examines Medicine 2.0 form the perspective of health service providers, informing management using eHealth as an instrument for improved healthcare management.
Overall, there are many major contributions to literature, which together achieve both a broad overview of Web 2.0 in healthcare, but also make specific additions to literature encompassing information systems, information science, medical informatics, and open innovation and strategy
A treatise on Web 2.0 with a case study from the financial markets
There has been much hype in vocational and academic circles surrounding the emergence of
web 2.0 or social media; however, relatively little work was dedicated to substantiating the
actual concept of web 2.0. Many have dismissed it as not deserving of this new title, since the
term web 2.0 assumes a certain interpretation of web history, including enough progress in
certain direction to trigger a succession [i.e. web 1.0 → web 2.0]. Others provided arguments in
support of this development, and there has been a considerable amount of enthusiasm in the
literature. Much research has been busy evaluating current use of web 2.0, and analysis of the
user generated content, but an objective and thorough assessment of what web 2.0 really stands
for has been to a large extent overlooked. More recently the idea of collective intelligence
facilitated via web 2.0, and its potential applications have raised interest with researchers, yet a
more unified approach and work in the area of collective intelligence is needed.
This thesis identifies and critically evaluates a wider context for the web 2.0 environment, and
what caused it to emerge; providing a rich literature review on the topic, a review of existing
taxonomies, a quantitative and qualitative evaluation of the concept itself, an investigation of
the collective intelligence potential that emerges from application usage. Finally, a framework
for harnessing collective intelligence in a more systematic manner is proposed.
In addition to the presented results, novel methodologies are also introduced throughout this
work. In order to provide interesting insight but also to illustrate analysis, a case study of the
recent financial crisis is considered. Some interesting results relating to the crisis are revealed
within user generated content data, and relevant issues are discussed where appropriate
- …