
42,539 research outputs found

OpenAdaptxt: an open source enabling technology for high quality text entry

Authors: De Meo Roberto; Dona Prima; Dunlop Mark; Durga Naveen; Motaparti Sunil
Publication date: 01/01/2012

Modern text entry systems, especially for touch screen phones and novel devices, rely on complex underlying technologies such as error correction and word suggestion. Furthermore, for global deployment a vast number of languages have to be supported. Together these requirements have raised the entry bar for new text entry techniques, making development and testing a longer process and thus stifling innovation. For example, testing a new feedback mechanism in comparison to a stock keyboard now requires researchers to support at least slip correction and probably word suggestion. This paper introduces OpenAdaptxt: an open-source, community-driven text input platform to enable development of higher quality text input solutions. It is the first commercial-grade open-source enabling technology for modern text entry that supports multiple platforms and provides dictionaries for over 50 spoken languages.

CiteSeerX

University of Strathclyde Institutional Repository

Towards Understanding Egyptian Arabic Dialogues

Authors: Abdou Sherif M; Elmadany Abdelrahim A; Gheith Mervat
Publication venue: 'Foundation of Computer Science'
Publication date: 13/07/2015

Labelling a user's utterances to understand the user's intents, known as Dialogue Act (DA) classification, is considered the key component of the dialogue language understanding layer in automatic dialogue systems. In this paper, we propose a novel approach to labelling user utterances in Egyptian spontaneous dialogues and instant messages using a Machine Learning (ML) approach that does not rely on any special lexicons, cues, or rules. Due to the lack of an Egyptian dialect dialogue corpus, the system is evaluated on a multi-genre corpus of 4725 utterances from three domains, collected and annotated manually from Egyptian call-centers. The system achieves an F1 score of 70.36% over all domains. Comment: arXiv admin note: substantial text overlap with arXiv:1505.0308

arXiv.org e-Print Archive

CiteSeerX

Automated tutoring for a database skills training environment

Authors: Bhagat S.; Claire Kenny; Claus Pahl; McLoughlin C.; Mitrovic; Wenger E.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2005

Universities are increasingly offering courses online. Feedback, assessment, and guidance are important features of this online courseware. Together, in the absence of a human tutor, they aid the student in the learning process. We present a programming training environment for a database course. It aims to offer a substitute for classroom-based learning by providing synchronous automated feedback to the student, along with guidance based on a personalized assessment. The automated tutoring system should promote procedural knowledge acquisition and skills training. An automated tutoring feature is an integral part of this system.

CiteSeerX

Crossref

Irish Universities

DCU Online Research Access Service

An Arabic Optical Braille Recognition System

Authors: Al-Salman AbdulMalik; AlKanhal Mohammed; AlOhali Yosef; AlRajih Abdullah
Publication date: 01/04/2007

Technology has shown great promise in providing access to textual information for visually impaired people. Optical Braille Recognition (OBR) allows people with visual impairments to read volumes of typewritten documents with the help of flatbed scanners and OBR software. This project looks at developing a system to recognize an image of embossed Arabic Braille and then convert it to text. In particular, it aims to build a fully functional Optical Arabic Braille Recognition system. The system has two main tasks: the first is to recognize printed Braille cells, and the second is to convert them to regular text. Converting Braille to text is not simply a one-to-one mapping, because one cell may represent one symbol (alphabet letter, digit, or special character), two or more symbols, or part of a symbol. Moreover, multiple cells may represent a single symbol.

Southampton (e-Prints Soton)

King Saud University Repository

More blogging features for author identification

Authors: Ahmed Amr; Mohtasseb Haytham
Publication date: 01/01/2009

In this paper we present a novel improvement in the field of authorship identification in personal blogs. The improvement comes from utilizing a hybrid collection of linguistic features that best capture the style of users in diary blogs. The feature sets comprise LIWC with its psychology background, a collection of syntactic and part-of-speech (POS) features, and misspelling-error features. Furthermore, we analyze the contribution of each feature set to the final result and compare the outcomes of using different combinations of the selected feature sets. Our new categorization of misspelled words, which are mapped into numerical features, noticeably enhances the classification results. The paper also confirms the best ranges of several parameters that affect the final result of authorship identification, such as the number of authors, the number of words in each post, and the number of documents/posts for each author/user. The results and evaluation show that the utilized features are compact, while their performance is highly comparable with other, much larger feature sets.

University of Lincoln Institutional Repository

CiteSeerX

Edge Hill University Research Information Repository