Search CORE

2,962 research outputs found

Language Identification Using Visual Features

Author: Cox S
Newman J
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/04/2012
Field of study

Automatic visual language identification (VLID) is the technology of using information derived from the visual appearance and movement of the speech articulators to iden- tify the language being spoken, without the use of any audio information. This technique for language identification (LID) is useful in situations in which conventional audio processing is ineffective (very noisy environments), or impossible (no audio signal is available). Research in this field is also beneficial in the related field of automatic lip-reading. This paper introduces several methods for visual language identification (VLID). They are based upon audio LID techniques, which exploit language phonology and phonotactics to discriminate languages. We show that VLID is possible in a speaker-dependent mode by discrimi- nating different languages spoken by an individual, and we then extend the technique to speaker-independent operation, taking pains to ensure that discrimination is not due to artefacts, either visual (e.g. skin-tone) or audio (e.g. rate of speaking). Although the low accuracy of visual speech recognition currently limits the performance of VLID, we can obtain an error-rate of < 10% in discriminating between Arabic and English on 19 speakers and using about 30s of visual speech

Crossref

University of East Anglia digital repository

Does Confidence Reporting from the Crowd Benefit Crowdsourcing Performance?

Author: Jyothi Preethi
Karger David R.
Rocker Jana
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/04/2017
Field of study

We explore the design of an effective crowdsourcing system for an

M

-ary classification task. Crowd workers complete simple binary microtasks whose results are aggregated to give the final classification decision. We consider the scenario where the workers have a reject option so that they are allowed to skip microtasks when they are unable to or choose not to respond to binary microtasks. Additionally, the workers report quantized confidence levels when they are able to submit definitive answers. We present an aggregation approach using a weighted majority voting rule, where each worker's response is assigned an optimized weight to maximize crowd's classification performance. We obtain a couterintuitive result that the classification performance does not benefit from workers reporting quantized confidence. Therefore, the crowdsourcing system designer should employ the reject option without requiring confidence reporting.Comment: 6 pages, 4 figures, SocialSens 2017. arXiv admin note: text overlap with arXiv:1602.0057

arXiv.org e-Print Archive

Crossref

Special Libraries, December 1954

Author: Special Libraries Association
Publication venue: SJSU ScholarWorks
Publication date: 01/12/1954
Field of study

Volume 45, Issue 10https://scholarworks.sjsu.edu/sla_sl_1954/1009/thumbnail.jp

SJSU ScholarWorks

Implementation of an Intelligent Robotized GMAW Welding Cell, Part 1: Design and Simulation

Author: I. Davila-Rios
I. Lopez-Juarez
L. M. Torres-Trevino
Luis Martinez-Martinez
Publication venue: 'IntechOpen'
Publication date: 01/03/2010
Field of study

IntechOpen

Crossref

Special Libraries, January 1948

Author: Special Libraries Association
Publication venue: SJSU ScholarWorks
Publication date: 01/01/1947
Field of study

Volume 39, Issue 1https://scholarworks.sjsu.edu/sla_sl_1948/1000/thumbnail.jp

SJSU ScholarWorks

MSE News 2010

Author: Department of Materials Science and Engineering Michigan Technological University
Publication venue: Digital Commons @ Michigan Tech
Publication date: 01/01/2010
Field of study

Table of Contents MSE Welcomes New Professors Professor Emeritus Passes Alumni Achieves Milestone Hydrogen Storage Research Student Accomplishments Staff News Alumni Newshttps://digitalcommons.mtu.edu/materials-annualreports/1002/thumbnail.jp

Michigan Technological University

Automatic Visual Speech Recognition

Author: Alin Chiţu
Léon J.M. Rothkrantz
Publication venue: 'IntechOpen'
Publication date: 03/03/2012
Field of study

Intelligent SystemsElectrical Engineering, Mathematics and Computer Scienc

IntechOpen

Crossref

TU Delft Repository