Using Text Segmentation to Enhance the Cluster Hypothesis

Author: B. Levrat
F. Saubion
S. Lamprier
T. Amghar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008

An alternative approach to Information Retrieval, called Passage Retrieval, considers text fragments independently rather than assessing the global relevance of documents. In this setting, a document is not penalized when its relevant information is surrounded by text that deviates from the topic of interest. In this paper, we study the impact of considering these text fragments on a document clustering process. The use of clustering in Information Retrieval is mainly supported by the cluster hypothesis, which states that relevant documents tend to be more similar to each other than to non-relevant documents, and hence that a clustering process is likely to group them together. Previous experiments have shown that clustering the top retrieved documents in response to a user’s query allows Information Retrieval systems to improve their effectiveness. In the clustering process used in these studies, documents were considered globally. However, the assumption that a document can refer to more than one topic or concept may also affect the document clustering process. Considering passages of the retrieved documents separately may make it possible to create clusters that are more representative of the topics addressed. Different approaches have been assessed, and the results show that using text fragments in the clustering process may prove worthwhile.
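The intuition behind passage-level comparison can be illustrated with a minimal sketch. It is not the authors' method: the fixed-width word windows, the bag-of-words cosine measure, and the names split_passages and best_passage_similarity are all assumptions made for illustration. It only shows that two documents sharing one topic can look more similar when compared passage by passage than when compared as wholes, which is the kind of pairwise similarity a clustering step would consume.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercased term-frequency vector for a piece of text."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def split_passages(text, size):
    """Hypothetical segmenter: consecutive windows of `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def best_passage_similarity(doc_a, doc_b, size=4):
    """Highest cosine similarity over all passage pairs of two documents."""
    return max(
        cosine(bag_of_words(p), bag_of_words(q))
        for p in split_passages(doc_a, size)
        for q in split_passages(doc_b, size)
    )


# Toy documents: both mention solar panels, each padded with an unrelated topic.
doc_a = "solar panels convert sunlight football match ended late"
doc_b = "cheese recipe needs milk solar panels convert sunlight"

whole = cosine(bag_of_words(doc_a), bag_of_words(doc_b))
passage = best_passage_similarity(doc_a, doc_b)
print(f"whole-document similarity: {whole:.2f}")
print(f"best passage similarity:   {passage:.2f}")
```

In this toy case the off-topic halves dilute the whole-document score, while the matching passages compare as identical. A real system would use a proper text segmenter and term weighting rather than fixed windows and raw counts.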

Okina

Infants segment words from songs - an EEG study

Author: Benders T.
Fikkert P.
Snijders T.
Publication venue: 'MDPI AG'
Publication date: 09/01/2020

Children’s songs are omnipresent and highly attractive stimuli in infants’ input. Previous work suggests that infants process linguistic–phonetic information from simplified sung melodies. The present study investigated whether infants learn words from ecologically valid children’s songs. Testing 40 Dutch-learning 10-month-olds in a familiarization-then-test electroencephalography (EEG) paradigm, this study asked whether infants can segment repeated target words embedded in songs during familiarization and subsequently recognize those words in continuous speech in the test phase. To replicate previous speech work and compare segmentation across modalities, infants participated in both song and speech sessions. Results showed a positive event-related potential (ERP) familiarity effect to the final compared to the first target occurrences during both song and speech familiarization. No evidence was found for word recognition in the test phase following either song or speech. Comparisons across the stimuli of the present and a comparable previous study suggested that acoustic prominence and speech rate may have contributed to the polarity of the ERP familiarity effect and its absence in the test phase. Overall, the present study provides evidence that 10-month-old infants can segment words embedded in songs, and it raises questions about the acoustic and other factors that enable or hinder infant word segmentation from songs and speech.

MPG.PuRe

Point Source Extraction with MOPEX

Author: David Makovoz
Francine R. Marleau
Juhola M.
Meijering E. H. W.
Nelder J. A.
Publication venue: 'University of Chicago Press'
Publication date: 30/06/2005

MOPEX (MOsaicking and Point source EXtraction) is a package developed at the Spitzer Science Center for astronomical image processing. We report on the point source extraction capabilities of MOPEX. Point source extraction is implemented as a two-step process: point source detection and profile fitting. Non-linear matched filtering of input images can optionally be performed to increase the signal-to-noise ratio and improve detection of faint point sources. Point Response Function (PRF) fitting of point sources produces the final point source list, which includes the fluxes and improved positions of the point sources, along with other parameters characterizing the fit. Passive and active deblending allow for successful fitting of confused point sources. Aperture photometry can also be computed for every extracted point source for an unlimited number of aperture sizes. The PRF is estimated directly from the input images. Efficient methods of background and noise estimation and a modified Simplex algorithm contribute to the computational efficiency of MOPEX. The package is implemented as a loosely connected set of Perl scripts, where each script runs a number of modules written in C/C++. Input parameters are set through namelists, which are ASCII configuration files. We present applications of point source extraction to the mosaic images taken at 24 and 70 microns with the Multiband Imaging Photometer (MIPS) as part of the Spitzer extragalactic First Look Survey and to a Digital Sky Survey image. Completeness and reliability of point source extraction are computed using simulated data. Comment: 20 pages, 13 Postscript figures, accepted for publication in PAS
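The aperture photometry step mentioned in the abstract can be illustrated with a minimal sketch. It is not MOPEX code: the function name aperture_flux, the whole-pixel inclusion rule, and the constant background value are assumptions made for illustration. It sums background-subtracted pixel values inside circular apertures of several sizes around a known source position.

```python
import math


def aperture_flux(image, cx, cy, radius, background=0.0):
    """Sum background-subtracted values of pixels whose centres lie within `radius` of (cx, cy)."""
    total = 0.0
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            # Assumption: a pixel counts fully if its centre is inside the aperture.
            if math.hypot(x - cx, y - cy) <= radius:
                total += value - background
    return total


# Toy 5x5 image: flat background of 1.0 plus a small source centred at (2, 2).
image = [[1.0] * 5 for _ in range(5)]
image[2][2] += 10.0
for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
    image[2 + dy][2 + dx] += 2.0

# The same source measured with several aperture sizes.
fluxes = {r: aperture_flux(image, 2, 2, r, background=1.0) for r in (0.5, 1.0, 2.0)}
for r, flux in fluxes.items():
    print(f"radius {r}: flux {flux}")
```

The measured flux grows with the aperture until the whole source is enclosed and then stays flat, because the background has been subtracted. Production tools typically weight partially covered pixels and estimate the background locally instead of taking it as a constant.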

arXiv.org e-Print Archive

Crossref

CERN Document Server

Automated speech and audio analysis for semantic access to multimedia

Author: Huijbregts Marijn
Jong Franciska de
Ordelman Roeland
Publication venue: Springer Verlag
Publication date: 01/01/2006

The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques will be presented, including the alignment of speech and text resources, large vocabulary speech recognition, key word spotting and speaker classification. The applicability of techniques will be discussed from a media crossing perspective. The added value of the techniques and their potential contribution to the content value chain will be illustrated by the description of two (complementary) demonstrators for browsing broadcast news archives.

University of Twente Research Information

Dynamic spot analysis in the 2D electrophoresis gels images

Author: Polášková Lenka
Publication venue: Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií
Publication date: 01/01/2014

This thesis summarizes the factors and parameters that influence the results of 2D electrophoresis, focusing on image processing as one way to reduce incorrect interpretation of its outputs. As its data source, the image processing primarily uses images from repeated runs of the same experiment, known as multiplicates. By analyzing multiplicate images, it is possible to observe or correct for changes within one experiment and to compare them with the outputs of other experiments. The aim of this work is to support the specialist responsible for describing the properties of the structures found in electrophoretic images.

Digital library of Brno University of Technology

National Repository of Grey Literature

Are words easier to learn from infant- than adult-directed speech? A quantitative corpus-based investigation

Author: Cristia Alejandrina
Dupoux Emmanuel
Guevara-Rukoz Adriana
Ludusan Bogdan
Martin Andrew
Mazuka Reiko
Thiollière Roland
Publication date: 23/12/2017

We investigate whether infant-directed speech (IDS) could facilitate word form learning when compared to adult-directed speech (ADS). To study this, we examine the distribution of word forms at two levels, acoustic and phonological, using a large database of spontaneous speech in Japanese. At the acoustic level we show that, as has been documented before for phonemes, the realizations of words are more variable and less discriminable in IDS than in ADS. At the phonological level, we find an effect in the opposite direction: the IDS lexicon contains more distinctive words (such as onomatopoeias) than the ADS counterpart. Combining the acoustic and phonological metrics in a global discriminability score reveals that the bigger separation of lexical categories in the phonological space does not compensate for the opposite effect observed at the acoustic level. As a result, IDS word forms are still globally less discriminable than ADS word forms, even though the effect is numerically small. We discuss the implication of these findings for the view that the functional role of IDS is to improve language learnability. Comment: Draf

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Image retrieval by hypertext links

Author: Buck C.
M. D. Dunlop
M. Sanderson
V. Harmandas
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/1997

This paper presents a model for retrieval of images from a large World Wide Web based collection. Rather than considering complex visual recognition algorithms, the model presented is based on combining evidence of the text content and hypertext structure of the Web. The paper shows that certain types of query are amply served by this form of representation. It also presents a novel means of gathering relevance judgements.

CiteSeerX

Crossref

University of Strathclyde Institutional Repository

White Rose Research Online