332,401 research outputs found
Parsimonious Language Models for a Terabyte of Text
The aims of this paper are twofold. Our first aim\ud
is to compare results of the earlier Terabyte tracks\ud
to the Million Query track. We submitted a number\ud
of runs using different document representations\ud
(such as full-text, title-fields, or incoming\ud
anchor-texts) to increase pool diversity. The initial\ud
results show broad agreement in system rankings\ud
over various measures on topic sets judged at both\ud
Terabyte and Million Query tracks, with runs using\ud
the full-text index giving superior results on\ud
all measures, but also some noteworthy upsets.\ud
Our second aim is to explore the use of parsimonious\ud
language models for retrieval on terabyte-scale\ud
collections. These models are smaller thus\ud
more efficient than the standard language models\ud
when used at indexing time, and they may also improve\ud
retrieval performance. We have conducted\ud
initial experiments using parsimonious models in\ud
combination with pseudo-relevance feedback, for\ud
both the Terabyte and Million Query track topic\ud
sets, and obtained promising initial results
A characteristics framework for Semantic Information Systems Standards
Semantic Information Systems (IS) Standards play a critical role in the development of the networked economy. While their importance is undoubted by all stakeholders—such as businesses, policy makers, researchers, developers—the current state of research leaves a number of questions unaddressed. Terminological confusion exists around the notions of “business semantics”, “business-to-business interoperability”, and “interoperability standards” amongst others. And, moreover, a comprehensive understanding about the characteristics of Semantic IS Standards is missing. The paper addresses this gap in literature by developing a characteristics framework for Semantic IS Standards. Two case studies are used to check the applicability of the framework in a “real-life” context. The framework lays the foundation for future research in an important field of the IS discipline and supports practitioners in their efforts to analyze, compare, and evaluate Semantic IS Standard
- …