8 research outputs found
Design and Implementation of World Wide Web Pages for the Computational Information Systems Research Group in the Department of Information Studies, University of Sheffield
This dissertation consists of six chapters, and describes the design and implementation of the World Wide Web pages for the Computational Information Systems Research Group in the Department of Information Studies.
The idea of implementing the World Wide Web pages for the research group is to get the information of the group published on the Internet. The area of networking (Internet) is reviewed particularly with reference to the World Wide Web. A few network tools are discussed. The emphasis is given more on the World Wide Web and the browser used as these form the most important part in
this project. The Web pages were created via the HTML editor called the HTML Writer. The HTML language used in writing the program is described together with a brief description of the functions used in the program. A few number of hyper-editing systems are also discussed, with most attention being given to the hyper-editing system used in this project. The comparison was made between HTML Writer and the Hot-Metal Editor. The design and development of the Web pages is described from the first attempts to write the program to the final version. Lastly, the Web pages are reviewed as the final step in completing this
project
Modelo hipertextual - mhtx: um modelo para organização hipertextual de documentos
Pesquisa aplicada sobre a construção e implementação de um protótipo semanticamente estruturado para auxiliar a organização e representação do conhecimento humano em hipertextos, com base em quatro referenciais: a Teoria da Análise Facetada (TAF), a Teoria dos Mapas Conceituais, a estrutura semântica de links hipertextuais e as normas técnicas da Associação Brasileira de Normas Técnicas (ABNT). O protótipo, denominado Modelo Hipertextual para Organização de Documentos (MHTX), é formado por uma estrutura semântica denominada Mapa Conceitual (MC) e um Sumário Expandido (SE), este último consistindo de um instrumento formado a partir do sumário de uma tese de doutorado selecionada, ao qual se agregaram pontos de acesso. Posteriormente, espera-se que este protótipo seja utilizado para implementar a BTDECI – UFMG (Biblioteca de Teses e Dissertações do Programa de Pós-Graduação da Escola de Ciência da Informação da UFMG)
Semantic linking through spaces for cyber-physical-socio intelligence:a methodology
Humans consciously and subconsciously establish various links, emerge semantic images and reason in mind, learn linking effect and rules, select linked individuals to interact, and form closed loops through links while co-experiencing in multiple spaces in lifetime. Machines are limited in these abilities although various graph-based models have been used to link resources in the cyber space. The following are fundamental limitations of machine intelligence: (1) machines know few links and rules in the physical space, physiological space, psychological space, socio space and mental space, so it is not realistic to expect machines to discover laws and solve problems in these spaces; and, (2) machines can only process pre-designed algorithms and data structures in the cyber space. They are limited in ability to go beyond the cyber space, to learn linking rules, to know the effect of linking, and to explain computing results according to physical, physiological, psychological and socio laws. Linking various spaces will create a complex space — the Cyber-Physical-Physiological-Psychological-Socio-Mental Environment CP3SME. Diverse spaces will emerge, evolve, compete and cooperate with each other to extend machine intelligence and human intelligence. From multi-disciplinary perspective, this paper reviews previous ideas on various links, introduces the concept of cyber-physical society, proposes the ideal of the CP3SME including its definition, characteristics, and multi-disciplinary revolution, and explores the methodology of linking through spaces for cyber-physical-socio intelligence. The methodology includes new models, principles, mechanisms, scientific issues, and philosophical explanation. The CP3SME aims at an ideal environment for humans to live and work. Exploration will go beyond previous ideals on intelligence and computing
Studying, developing, and experimenting contextual advertising systems
The World Wide Web has grown so fast in the last decade and it is today a vital daily part of people. The Internet is used for many purposes by an ever growing number of users, mostly for daily activities, tasks, and services.
To face the needs of users, an efficient and effective access to information is required. To deal with this task, the adoption of Information Retrieval and Information Filtering techniques is continuously growing. Information Re-trieval (IR) is the field concerned with searching for documents, information within documents, and metadata about documents, as well as searching for structured storage, relational databases, and the World Wide Web. Infor-
mation Filtering deals with the problem of selecting relevant information for a given user, according to her/his preferences and interest. Nowadays, Web advertising is one of the major sources of income for a large number of websites. Its main goal is to suggest products and services to the still ever growing population of Internet users. Web advertising is aimed at suggesting products and services to the users. A significant part of Web ad-vertising consists of textual ads, the ubiquitous short text messages usually
marked as sponsored links. There are two primary channels for distributing ads: Sponsored Search (or Paid Search Advertising) and Contextual Ad-vertising (or Content Match). Sponsored Search advertising is the task of
displaying ads on the page returned from a Web search engine following a query. Contextual Advertising (CA) displays ads within the content of a generic, third party, webpage. In this thesis I study, develop, and evaluated novel solutions in the field of Contextual Advertising. In particular, I studied and developed novel text summarization techniques, I adopted a novel semantic approach, I studied and adopted collaborative approaches, I started a conjunct study of Contex-tual Advertising and Geo-Localization, and I study the task of advertising
in the field of Multi-Modal Aggregation. The thesis is organized as follows. In Chapter 1, we briefly describe the
main aspects of Information Retrieval. Following, the Chapter 2 shows the problem of Contextual Advertising and describes the main contributes of the literature. Chapter 3 sketches a typical adopted approach and the eval-uation metrics of a Contextual Advertising system. Chapter 4 is related to the syntactic aspects, and its focus is on text summarization. In Chapter 5 the semantic aspects are taken into account, and a novel approach based on ConceptNet is proposed. Chapter 6 proposes a novel view of CA by the
adoption of a collaborative filtering approach. Chapter 7 shows a prelim-inary study of Geo Location, performed in collaboration with the Yahoo! Research center in Barcelona. The target is to study several techniques
of suggesting localized advertising in the field of mobile applications and search engines. In Chapter 8 is shown a joint work with the RAI Centre for Research and Technological Innovation. The main goal is to study and
propose a system of advertising for Multimodal Aggregation data. Chapter 9 ends this work with conclusions and future directions
Visual analysis of anatomy ontologies and related genomic information
Challenges in scientific research include the difficulty in obtaining overviews of the large
amount of data required for analysis, and in resolving the differences in terminology used
to store and interpret information in multiple, independently created data sets. Ontologies
provide one solution for analysis involving multiple data sources, improving cross-referencing
and data integration.
This thesis looks at harnessing advanced human perception to reduce the cognitive load
in the analysis of the multiple, complex data sets the bioinformatics user group studied use
in research, taking advantage also of users’ domain knowledge, to build mental models of
data that map to its underlying structure. Guided by a user-centred approach, prototypes
were developed to provide a visual method for exploring users’ information requirements
and to identify solutions for these requirements. 2D and 3D node-link graphs were built to
visualise the hierarchically structured ontology data, to improve analysis of individual and
comparison of multiple data sets, by providing overviews of the data, followed by techniques
for detailed analysis of regions of interest.
Iterative, heuristic and structured user evaluations were used to assess and refine the
options developed for the presentation and analysis of the ontology data. The evaluation
results confirmed the advantages that visualisation provides over text-based analysis, and
also highlighted the advantages of each of 2D and 3D for visual data analysis.Overseas Research Students Awards SchemeJames Watt Scholarshi
Selective web information retrieval
This thesis proposes selective Web information retrieval, a framework formulated in terms of statistical decision theory, with the aim to apply an appropriate retrieval approach on a per-query basis. The main component of the framework is a decision mechanism that selects an appropriate retrieval approach on a per-query basis. The selection of a particular retrieval approach is based on the outcome of an experiment, which is performed before the final ranking of the retrieved documents. The experiment is a process that extracts features from a sample of the set of retrieved documents. This thesis investigates three broad types of experiments. The first one counts the occurrences of query terms in the retrieved documents, indicating the extent to which the query topic is covered in the document collection. The second type of experiments considers information from the distribution of retrieved documents in larger aggregates of related Web documents, such as whole Web sites, or directories within Web sites. The third type of experiments estimates the usefulness of the hyperlink structure among a sample of the set of retrieved Web documents. The proposed experiments are evaluated in the context of both informational and navigational search tasks with an optimal Bayesian decision mechanism, where it is assumed that relevance information exists.
This thesis further investigates the implications of applying selective Web information retrieval in an operational setting, where the tuning of a decision mechanism is based on limited existing relevance information and the information retrieval system’s input is a stream of queries related to mixed informational and navigational search tasks. First, the experiments are evaluated using different training and testing query sets, as well as a mixture of different types of queries. Second, query sampling is introduced, in order to approximate the queries that a retrieval system receives, and to tune an ad-hoc decision mechanism with a broad set of automatically sampled queries
A Probabilistic Framework for Information Modelling and Retrieval Based on User Annotations on Digital Objects
Annotations are a means to make critical remarks, to explain and
comment things, to add notes and give opinions, and to relate objects.
Nowadays, they can be found in digital libraries and collaboratories,
for example as a building block for scientific discussion on the one
hand or as private notes on the other. We further find them in product
reviews, scientific databases and many "Web 2.0" applications; even
well-established concepts like emails can be regarded as annotations
in a certain sense. Digital annotations can be (textual) comments,
markings (i.e. highlighted parts) and references to other documents
or document parts. Since annotations convey information which is
potentially important to satisfy a user's information need, this
thesis tries to answer the question of how to exploit annotations for
information retrieval. It gives a first answer to the question if
retrieval effectiveness can be improved with annotations.
A survey of the "annotation universe" reveals some facets of
annotations; for example, they can be content level annotations
(extending the content of the annotation object) or meta level ones
(saying something about the annotated object). Besides the annotations
themselves, other objects created during the process of annotation can
be interesting for retrieval, these being the annotated fragments.
These objects are integrated into an object-oriented model comprising
digital objects such as structured documents and annotations as well
as fragments. In this model, the different relationships among the
various objects are reflected. From this model, the basic data
structure for annotation-based retrieval, the structured annotation
hypertext, is derived.
In order to thoroughly exploit the information contained in structured
annotation hypertexts, a probabilistic, object-oriented logical
framework called POLAR is introduced. In POLAR, structured annotation
hypertexts can be modelled by means of probabilistic propositions and
four-valued logics. POLAR allows for specifying several relationships
among annotations and annotated (sub)parts or fragments. Queries can
be posed to extract the knowledge contained in structured annotation
hypertexts. POLAR supports annotation-based retrieval, i.e. document
and discussion search, by applying an augmentation strategy (knowledge
augmentation, propagating propositions from subcontexts like annotations,
or relevance augmentation, where retrieval status values are propagated)
in conjunction with probabilistic inference, where P(d -> q), the probability
that a document d implies a query q, is estimated.
POLAR's semantics is based on possible worlds and accessibility
relations. It is implemented on top of four-valued probabilistic Datalog.
POLAR's core retrieval functionality, knowledge augmentation with
probabilistic inference, is evaluated for discussion and document
search. The experiments show that all relevant POLAR objects, merged
annotation targets, fragments and content annotations, are able to
increase retrieval effectiveness when used as a context for discussion
or document search. Additional experiments reveal that we can determine
the polarity of annotations with an accuracy of around 80%
Modelos de leitura documentária para indexação
A motivação para a elaboração deste livro surgiu da observação da necessidade de reunir novas pesquisas no cenário da área de Ciência da Informação, que mostram perspectivas inovadoras para a Análise Documentária ou Documental, leitura documentária e modelo de leitura documentária, já que são fatores determinantes para um bom desempenho do trabalho do indexador com diferentes tipos de documentos. O tema deste livro consiste nos estudos sobre a elaboração de modelos de leitura documentária ou documental para indexação, principalmente, no que tange aos seus aspectos de abordagens teóricas interdisciplinares e aplicações dos modelos em diferentes tipos de documentos. Portanto, tem o objetivo de enfatizar os aspectos das teorias interdisciplinares que fundamentam a elaboração do modelo de leitura documentária e as aplicações possÃveis em diferentes documentos, estas que permeiam pesquisas de diferentes autores da atualidade