GEORDi: Supporting lightweight end-user authoring and exploration of Linked Data
The US and UK governments have recently made much of the data created by their various departments available on the web as data sets (often as CSV files). Known as "open data", these are valuable assets, yet much of this data remains useless because it is effectively inaccessible to citizens, for the following reasons: (1) it is often a tedious, many-step process for citizens simply to find data relevant to a query; once a candidate data set is located, it often must be downloaded and opened in a separate application simply to see whether it contains the data that may satisfy the query; (2) it is difficult to join related data sets to create richer integrated information; (3) it is particularly difficult to query a single data set, and even harder to query across related data sets; (4) to date, one has had to be well versed in semantic web protocols like SPARQL, RDF and URI formation to integrate and query such sources as reusable linked data. Our goal has been to develop tools that will let regular, non-programmer web citizens make use of this Web of Data. To this end, we present GEORDi, a set of integrated tools and services that lets citizen users identify, explore, query and represent these open data sources over the web via Linked Data mechanisms. In this paper we describe the GEORDi process of authoring new and translating existing open data in a linkable format, GEORDi's lens mechanism for rendering rich, plain language descriptions and views of resources, and the GEORDi link-sliding paradigm for data exploration. With these tools we demonstrate that it is possible to make the Web of open (and linked) data accessible for ordinary web citizen users.
BioCloud Search EnGene: Surfing Biological Data on the Cloud
The massive production and spread of biomedical data around the web introduces new challenges related to identifying computational approaches for providing quality search and browsing of web resources. This paper presents BioCloud Search EnGene (BSE), a cloud application that facilitates searching and integration of the many layers of biological information offered by public large-scale genomic repositories. Grounded in the concept of dataspace, BSE is built on top of a cloud platform that severely curtails issues associated with scalability and performance. Like popular online gene portals, BSE adopts a gene-centric approach: researchers can find their information of interest by means of a simple "Google-like" query interface that accepts standard gene identification as keywords. We present the BSE architecture and functionality and discuss how our strategies contribute to successfully tackling big data problems in querying gene-based web resources. BSE is publicly available at: http://biocloud-unica.appspot.com/
A study of the information needs of the users of a folk music library and the implications for the design of a digital library system
A qualitative study of user information needs is reported, based on a purposive sample of users and potential users of the Vaughan Williams Memorial Library, a small specialist folk music library in North London. The study set out to establish what the information needs of users (both existing and potential) are, so that the library's online service may take them into account in its design. The information needs framework proposed by Nicholas (2000) is used as an analytical tool to achieve this end. The demographics of the users were examined in order to establish four user groups: Performer, Academic, Professional and Enthusiast. Important information needs were found to be based on social interaction, and key resources of the library were its staff, the concentration of the collection and the library's social nature. A collection of broad design requirements is proposed based on the analysis, and this study also provided some insights into the issue of musical relevance, which are discussed.
Integrating musicology's heterogeneous data sources for better exploration
Musicologists have to consult an extraordinarily heterogeneous body of primary and secondary sources during all stages of their research. Many of these sources are now available online, but the historical dispersal of material across libraries and archives has now been replaced by segregation of data and metadata into a plethora of online repositories. This segregation hinders the intelligent manipulation of metadata, and means that extracting large tranches of basic factual information or running multi-part search queries is still enormously and needlessly time consuming. To counter this barrier to research, the "musicSpace" project is experimenting with integrating access to many of musicology's leading data sources via a modern faceted browsing interface that utilises Semantic Web and Web2.0 technologies such as RDF and AJAX. This will make previously intractable search queries tractable, enable musicologists to use their time more efficiently, and aid the discovery of potentially significant information that users did not think to look for. This paper outlines our work to date.
On Birthing Dancing Stars: The Need for Bounded Chaos in Information Interaction
While computers causing chaos is a common social trope, nearly the entirety of the history of computing is dedicated to generating order. Typical interactive information retrieval tasks ask computers to support the traversal and exploration of large, complex information spaces. The implicit assumption is that they are to support users in simplifying the complexity (i.e. in creating order from chaos). But for some types of task, particularly those that involve the creative application or synthesis of knowledge or the creation of new knowledge, this assumption may be incorrect. It is increasingly evident that perfect order, and the systems we create with it, support highly-structured information tasks well, but provide poor support for less-structured tasks. We need digital information environments that help create a little more chaos from order to spark creative thinking and knowledge creation. This paper argues for the need for information systems that offer what we term "bounded chaos", and offers research directions that may support the creation of such interfaces.
Sometimes the Internet reads the question wrong: children's search strategies & difficulties
When children search for information on a given topic, how do they go about searching for and retrieving information? What can their information seeking strategies tell us about the development of search interfaces for children's digital libraries, search engines and information repositories? We interviewed New Zealand (NZ) school children to seek insights into how they conduct information searches during their education.
EviPlant: An efficient digital forensic challenge creation, manipulation and distribution solution
Education and training in digital forensics requires a variety of suitable
challenge corpora containing realistic features including regular
wear-and-tear, background noise, and the actual digital traces to be discovered
during investigation. Typically, the creation of these challenges requires
overly arduous effort on the part of the educator to ensure their viability.
Once created, the challenge image needs to be stored and distributed to a class
for practical training. This storage and distribution step requires significant
time and resources and may not even be possible in an online/distance learning
scenario due to the data sizes involved. As part of this paper, we introduce a
more capable methodology and system as an alternative to current approaches.
EviPlant is a system designed for the efficient creation, manipulation, storage
and distribution of challenges for digital forensics education and training.
The system relies on the initial distribution of base disk images, i.e., images
containing solely base operating systems. In order to create challenges for
students, educators can boot the base system, emulate the desired activity and
perform a "diffing" of the resultant image and the base image. This diffing process
extracts the modified artefacts and associated metadata and stores them in an
"evidence package". Evidence packages can be created for different personae,
different wear-and-tear, different emulated crimes, etc., and multiple evidence
packages can be distributed to students and integrated into the base images. A
number of additional applications in digital forensic challenge creation for
tool testing and validation, proficiency testing, and malware analysis are also
discussed as a result of using EviPlant. Comment: Digital Forensic Research Workshop Europe 201
The Hypertext Internet Connection: E-mail, Online Search, Gopher
In this paper we show how to handle and organize the large amount of information accessible through the Internet or other public communication networks in a hypertext environment. The C(K)onstance-Hypertext-System (KHS) uses typed units to indicate differences in the content and structure of information, comprising text, forms, images and pointers to external information. We show how to embed Internet services, which usually require rather different interaction styles, such as point-to-point communication (e-mail), query formulation (online databases) or browsing (Gopher), into the uniform interaction model of the KHS. The integration of Internet services in an open hypertext environment produces value-adding effects, which are also discussed. (DIPF/Orig.)
Orchestrating musical (meta)data to better address the real-world search queries of musicologists
The dispersal of musicology's diverse array of primary and secondary sources across countless libraries and archives was once an enormous obstacle to conducting research, but this has largely been overcome by the digitisation and online publication of resources in recent years. Yet, while the research process has undoubtedly been revolutionised, the current situation is far from perfect, as the digitisation of resources has often been accompanied by their segregation (according to media type, date of publication, subject, language, copyright holder, etc.) into a myriad of discrete online repositories, often with little thought having been given to interoperability. Given that musicological research typically cuts across such artificial divisions, this segregation of data means that accessing basic factual information or running multi-part search queries remains endlessly complicated, needlessly time consuming, and sometimes impossible. This barrier to tractability is only exacerbated by the limited capabilities of currently deployed search interfaces. There is one seemingly obvious solution to this query dilemma: enable integrated real-time querying over all the available metadata from as many sources as possible, and allow users to use that metadata to guide their queries. This solution implies that all data that could feasibly be construed as useful, but which is buried in the records, is extracted in some way, and that there is an interaction approach that enables metadata to be explored effectively and allows for the formulation of rich compound queries. The musicSpace project has taken a dual approach towards realising this solution.
At the back-end we are developing services to integrate and, where necessary, surface (meta)data from many of musicology's most important online resources, including the British Library Music Collections catalogue, the British Library Sound Archive catalogue, Cecilia, Copac, Grove Music Online, Naxos Music Library, Répertoire International de Littérature Musicale (RILM), and Répertoire International des Sources Musicales (RISM) UK and Ireland. At the front-end, in order to optimise the exploration of this integrated dataset, we are developing a modern web-based faceted browsing interface that utilises Semantic Web and Web2.0 technologies such as RDF and AJAX, and which is based on the existing "mSpace" codebase. Our poster outlines the approach we have taken to importing, enriching and integrating the metadata provided by our data partners, and gives examples of the real-world musicological research questions that musicSpace has enabled.