71,366 research outputs found
ON MONITORING LANGUAGE CHANGE WITH THE SUPPORT OF CORPUS PROCESSING
One of the fundamental characteristics of language is that it can change over time. One
method to monitor the change is by observing its corpora: a structured language
documentation. Recent development in technology, especially in the field of Natural
Language Processing allows robust linguistic processing, which support the description of
diverse historical changes of the corpora. The interference of human linguist is inevitable as
it determines the gold standard, but computer assistance provides considerable support by
incorporating computational approach in exploring the corpora, especially historical
corpora. This paper proposes a model for corpus development, where corpus are annotated
to support further computational operations such as lexicogrammatical pattern matching,
automatic retrieval and extraction. The corpus processing operations are performed by local
grammar based corpus processing software on a contemporary Indonesian corpus. This
paper concludes that data collection and data processing in a corpus are equally crucial
importance to monitor language change, and none can be set aside
Adult participation in childrenâs word searches: on the use of prompting, hinting, and supplying a model
Although word searching in children is very common, very little is known about how adults support children in the turns following the childâs search behaviours, an important topic because of the social, educational and clinical implications. This study characterises, in detail, teachersâ use of prompting, hinting and supplying a model. From a classroom dataset of 53 instances, several distinctive patterns emerged. A prompted completion sequence is initiated by a âword retrieval elicitorâ (âfishingâ) and is interpreted as a request to complete the phrase. Non-verbal prompting is accomplished through a combination of gaze and gesture and, also, as a series of prompts. Hinting supplies a verbal clue, typically via a wh-question, or by specifying the nature of the repairable. In contrast, the strategies that supply a linguistic model include both embedded and exposed corrections and offers of candidates. A sequential relationship was found between prompting, hinting and supplying a model which has implications for how clinicians and teachers can foster self-repair
Fast Data in the Era of Big Data: Twitter's Real-Time Related Query Suggestion Architecture
We present the architecture behind Twitter's real-time related query
suggestion and spelling correction service. Although these tasks have received
much attention in the web search literature, the Twitter context introduces a
real-time "twist": after significant breaking news events, we aim to provide
relevant results within minutes. This paper provides a case study illustrating
the challenges of real-time data processing in the era of "big data". We tell
the story of how our system was built twice: our first implementation was built
on a typical Hadoop-based analytics stack, but was later replaced because it
did not meet the latency requirements necessary to generate meaningful
real-time results. The second implementation, which is the system deployed in
production, is a custom in-memory processing engine specifically designed for
the task. This experience taught us that the current typical usage of Hadoop as
a "big data" platform, while great for experimentation, is not well suited to
low-latency processing, and points the way to future work on data analytics
platforms that can handle "big" as well as "fast" data
Classroom Research and the Digital Learning Media
UdostÄpnienie publikacji Wydawnictwa Uniwersytetu ĹĂłdzkiego finansowane w ramach projektu âDoskonaĹoĹÄ naukowa kluczem do doskonaĹoĹci ksztaĹceniaâ. Projekt realizowany jest ze ĹrodkĂłw Europejskiego Funduszu SpoĹecznego w ramach Programu Operacyjnego Wiedza Edukacja RozwĂłj; nr umowy: POWER.03.05.00-00-Z092/17-00
Using the Annotated Bibliography as a Resource for Indicative Summarization
We report on a language resource consisting of 2000 annotated bibliography
entries, which is being analyzed as part of our research on indicative document
summarization. We show how annotated bibliographies cover certain aspects of
summarization that have not been well-covered by other summary corpora, and
motivate why they constitute an important form to study for information
retrieval. We detail our methodology for collecting the corpus, and overview
our document feature markup that we introduced to facilitate summary analysis.
We present the characteristics of the corpus, methods of collection, and show
its use in finding the distribution of types of information included in
indicative summaries and their relative ordering within the summaries.Comment: 8 pages, 3 figure
Visualization of database structures for information retrieval
This paper describes the Book House system, which is designed to support children's information retrieval in libraries as part of their education. It is a shareware program available on CDâROM or floppy disks, and comprises functionality for database searching as well as for classifying and storing book information in the database. The system concept is based on an understanding of children's domain structures and their capabilities for categorization of information needs in connection with their activities in schools, in school libraries or in public libraries. These structures are visualized in the interface by using metaphors and multimedia technology. Through the use of text, images and animation, the Book House encourages children â even at a very early age â to learn by doing in an enjoyable way, which plays on their previous experiences with computer games. Both words and pictures can be used for searching; this makes the system suitable for all age groups. Even children who have not yet learned to read properly can, by selecting pictures, search for and find those books they would like to have read aloud. Thus, at the very beginning of their school life, they can learn to search for books on their own. For the library community, such a system will provide an extended service which will increase the number of children's own searches and also improve the relevance, quality and utilization of the book collections in the libraries. A market research report on the need for an annual indexing service for books in the Book House format is in preparation by the Danish Library Centre A/S
- âŚ