Search CORE

126,696 research outputs found

Towards a multimedia formatting vocabulary

Author: Geurts J.P.T.M. (Joost)
Hardman L. (Lynda)
Ossenbruggen J.R. (Jacco) van
Rutledge L. (Lloyd)
Publication venue: CWI
Publication date: 01/01/2003
Field of study

Time-based, media-centric Web presentations can be described declaratively in the XML world through the development of languages such as SMIL. It is difficult, however, to fully integrate them in a complete document transformation processing chain. In order to achieve the desired processing of data-driven, time-based, media-centric presentations, the text-flow based formatting vocabularies used by style languages such as XSL, CSS and DSSSL need to be extended. The paper presents a selection of use cases which are used to derive a list of requirements for a multimedia style and transformation formatting vocabulary. The boundaries of applicability of existing text-based formatting models for media-centric transformations are analyzed. The paper then discusses the advantages and disadvantages of a fully-fledged time-based multimedia formatting model. Finally, the discussion is illustrated by describing the key properties of the example multimedia formatting vocabulary currently implemented in the back-end of our Cuypers multimedia transformation engine

CWI's Institutional Repository

Towards a multimedia formatting vocabulary

Author: Geurts J.P.T.M. (Joost)
Hardman L. (Lynda)
Ossenbruggen J.R. (Jacco) van
Rutledge L. (Lloyd)
Publication venue: ACM Press
Publication date: 01/01/2003
Field of study

CWI's Institutional Repository

Ontology Mediated Information Extraction with MASTRO SYSTEM-T

Author: Chiticariu Laura
Giacomo Giuseppe De
Gruber Tom
Lembo Domenico
Lembo Domenico
Popov Borislav
Publication venue
Publication date: 01/01/2020
Field of study

In several data-centric application domains, the need arises to extract valuable information from unstructured text documents. The recent paradigm of Ontology Mediated Information Extraction (OMIE) faces this problem by taking into account the knowledge expressed by a domain ontology, and reasoning over it to improve the quality of extracted data. MASTRO SYSTEM-T is a novel tool for OMIE, developed by Sapienza University and IBM Almaden Research. In this work, we demonstrate its usage for information extraction over real-world financial text documents from the U.S. EDGAR system

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes

Author: Huang Haifeng
Wang Zehan
Zhang Ziang
Zhao Yang
Zhao Zhou
Publication venue
Publication date: 16/08/2023
Field of study

3D scene understanding has gained significant attention due to its wide range of applications. However, existing methods for 3D scene understanding are limited to specific downstream tasks, which hinders their practicality in real-world applications. This paper presents Chat-3D, which combines the 3D visual perceptual ability of pre-trained 3D representations and the impressive reasoning and conversation capabilities of advanced LLMs to achieve the first universal dialogue systems for 3D scenes. Specifically, we align 3D representations into the feature space of LLMs, thus enabling LLMs to perceive the 3D world. Given the scarcity of 3D scene-text data, we propose a three-stage training strategy to efficiently utilize the available data for better alignment. To enhance the reasoning ability and develop a user-friendly interaction scheme, we further construct a high-quality object-centric 3D instruction dataset and design an associated object-centric prompt. Our experiments show that Chat-3D achieves an impressive ability to comprehend diverse instructions for 3D scenes, engage in intricate spatial reasoning, and incorporate external knowledge into its responses. Chat-3D achieves a 75.6% relative score compared with GPT-4 on the constructed instruction dataset.Comment: The project page is \url{https://chat-3d.github.io/

arXiv.org e-Print Archive

Instant Messaging on handhelds: affective feedback

Author: Adesemowo A. Kayode
Tucker William David
Publication venue: 'LPPM Institut Teknologi Telkom Purwokerto'
Publication date: 01/01/2004
Field of study

A text only Instant Messaging (IM) built on the IETF open standard SIP/SIMPLE has been developed in line with our proposed introduction of a user-defined text Hotkey feature. These act as an on-click Affective Gesture (AG): in similitude to Face-to-Face (F2F) expressive gesture-like abilities. Given that text communication possesses expressive discourse with some presence level, we seek to show that one-click text-gesture fast-tracking enhances text communication further. For this study, we are taking a hybrid quantitative and qualitative approach. Initial Pre-trial results have shown that an AG approach is more likely to improve IM chat spontaneity/response rate. Further experimental trials are being undertaken. Mobile devices and networks are becoming more data-centric (evident in Japanese I-mode) even as mobile network voice Average Revenue Per User (ARPU) are declining, new stream of data services are required which must take cognisance of handhelds features albeit their small screen estate and input/output limitation. Given that IM is entrenched in the social space, especially among teenagers and gaining wide adoption in the business place, we believe extensions are required for IM steep uptake in the mobile world, much as SMS has gained prominence. Enhanced input mechanisms for handheld IM system are expected to increase co-presence between handheld users and their desktop-based counterparts while in a synchronous discussion.Telkom, Cisco, THRI

University of the Western Cape Research Repository

A User-Centric and Sentiment Aware Privacy-Disclosure Detection Framework Based on Multi-Input Neural Network

Author: Mehrpouyan Hoda
Nuhil Mehdy A. K. M.
Publication venue: 'IUScholarWorks'
Publication date: 01/01/2020
Field of study

Data and information privacy is a major concern of today’s world. More specifically, users’ digital privacy has become one of the most important issues to deal with, as advancements are being made in information sharing technology. An increasing number of users are sharing information through text messages, emails, and social media without proper awareness of privacy threats and their consequences. One approach to prevent the disclosure of private information is to identify them in a conversation and warn the dispatcher before the conveyance happens between the sender and the receiver. Another way of preventing information (sensitive) loss might be to analyze and sanitize a batch of offline documents when the data is already accumulated somewhere. However, automating the process of identifying user-centric privacy disclosure in textual data is challenging. This is because the natural language has an extremely rich form and structure with different levels of ambiguities. Therefore, we inquire after a potential framework that could bring this challenge within reach by precisely recognizing users’ privacy disclosures in a piece of text by taking into account - the authorship and sentiment (tone) of the content alongside the linguistic features and techniques. The proposed framework is considered as the supporting plugin to help text classification systems more accurately identify text that might disclose the author’s personal or private information

Boise State University - ScholarWorks

A Survey of Location Prediction on Twitter

Author: Han Jialong
Sun Aixin
Zheng Xin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

Locations, e.g., countries, states, cities, and point-of-interests, are central to news, emergency events, and people's daily lives. Automatic identification of locations associated with or mentioned in documents has been explored for decades. As one of the most popular online social network platforms, Twitter has attracted a large number of users who send millions of tweets on daily basis. Due to the world-wide coverage of its users and real-time freshness of tweets, location prediction on Twitter has gained significant attention in recent years. Research efforts are spent on dealing with new challenges and opportunities brought by the noisy, short, and context-rich nature of tweets. In this survey, we aim at offering an overall picture of location prediction on Twitter. Specifically, we concentrate on the prediction of user home locations, tweet locations, and mentioned locations. We first define the three tasks and review the evaluation metrics. By summarizing Twitter network, tweet content, and tweet context as potential inputs, we then structurally highlight how the problems depend on these inputs. Each dependency is illustrated by a comprehensive review of the corresponding strategies adopted in state-of-the-art approaches. In addition, we also briefly review two related problems, i.e., semantic location prediction and point-of-interest recommendation. Finally, we list future research directions.Comment: Accepted to TKDE. 30 pages, 1 figur

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Scenarios and research issues for a network of information

Author: Dannewitz Christian
Pentikousis Kostas
Rembarz René
Renault Éric
Strandberg Ove
Ubillos Javier
Publication venue
Publication date: 01/01/2008
Field of study

This paper describes ideas and items of work within the framework of the EU-funded 4WARD project. We present scenarios where the current host-centric approach to infor- mation storage and retrieval is ill-suited for and explain how a new networking paradigm emerges, by adopting the information-centric network architecture approach, which we call Network of Information (NetInf). NetInf capital- izes on a proposed identifier/locator split and allows users to create, distribute, and retrieve information using a com- mon infrastructure without tying data to particular hosts. NetInf introduces the concepts of information and data ob- jects. Data objects correspond to the particular bits and bytes of a digital object, such as text file, a specific encod- ing of a song or a video. Information objects can be used to identify other objects irrespective of their particular dig- ital representation. After discussing the benefits of such an indirection, we consider the impact of NetInf with respect to naming and governance in the Future Internet. Finally, we provide an outlook on the research scope of NetInf along with items for future work

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

WISER: A Semantic Approach for Expert Finding in Academia based on Entity Linking

Author: Cifariello Paolo
Ferragina Paolo
Ponza Marco
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

We present WISER, a new semantic search engine for expert finding in academia. Our system is unsupervised and it jointly combines classical language modeling techniques, based on text evidences, with the Wikipedia Knowledge Graph, via entity linking. WISER indexes each academic author through a novel profiling technique which models her expertise with a small, labeled and weighted graph drawn from Wikipedia. Nodes in this graph are the Wikipedia entities mentioned in the author's publications, whereas the weighted edges express the semantic relatedness among these entities computed via textual and graph-based relatedness functions. Every node is also labeled with a relevance score which models the pertinence of the corresponding entity to author's expertise, and is computed by means of a proper random-walk calculation over that graph; and with a latent vector representation which is learned via entity and other kinds of structural embeddings derived from Wikipedia. At query time, experts are retrieved by combining classic document-centric approaches, which exploit the occurrences of query terms in the author's documents, with a novel set of profile-centric scoring strategies, which compute the semantic relatedness between the author's expertise and the query topic via the above graph-based profiles. The effectiveness of our system is established over a large-scale experimental test on a standard dataset for this task. We show that WISER achieves better performance than all the other competitors, thus proving the effectiveness of modelling author's profile via our "semantic" graph of entities. Finally, we comment on the use of WISER for indexing and profiling the whole research community within the University of Pisa, and its application to technology transfer in our University

arXiv.org e-Print Archive

Archivio della Ricerca - Università di Pisa