Search CORE

908,368 research outputs found

Implementation of a Human-Computer Interface for Computer Assisted Translation and Handwritten Text Recognition

Author: Ocampo Sepúlveda Jorge Carlos
Publication venue: 'Universitat Politecnica de Valencia'
Publication date: 11/01/2012
Field of study

A human-computer interface is developed to provide services of computer assisted machine translation (CAT) and computer assisted transcription of handwritten text images (CATTI). The back-end machine translation (MT) and handwritten text recognition (HTR) systems are provided by the Pattern Recognition and Human Language Technology (PRHLT) research group. The idea is to provide users with easy to use tools to convert interactive translation and transcription feasible tasks. The assisted service is provided by remote servers with CAT or CATTI capabilities. The interface supplies the user with tools for efficient local edition: deletion, insertion and substitution.Ocampo Sepúlveda, JC. (2009). Implementation of a Human-Computer Interface for Computer Assisted Translation and Handwritten Text Recognition. http://hdl.handle.net/10251/14318Archivo delegad

RiuNet

Real-time face recognition for human tracking system

Author: Abdul Manaf Mohamad Shukri
Ahmad Nasarudin
Mohd Zaini Muhammad Zharif
Nawawi Sophan Wahyudi
Publication venue: 'Penerbit UTHM'
Publication date: 01/01/2020
Field of study

Computer-based human face detection and recognition systems proliferate in various sectors such as universities, factories and other organisation [1]. This system refers to a branch of computer technology that can recognise human faces in digital images..

UTHM Institutional Repository

Mars Rover imaging systems and directional filtering

Author: Wang Paul P.
Publication venue
Publication date
Field of study

Computer literature searches were carried out at Duke University and NASA Langley Research Center. The purpose is to enhance personal knowledge based on the technical problems of pattern recognition and image understanding which must be solved for the Mars Rover and Sample Return Mission. Intensive study effort of a large collection of relevant literature resulted in a compilation of all important documents in one place. Furthermore, the documents are being classified into: Mars Rover; computer vision (theory); imaging systems; pattern recognition methodologies; and other smart techniques (AI, neural networks, fuzzy logic, etc)

NASA Technical Reports Server

Prepared to Design Future Technologies: Students Study How to Build Computer Systems to Think for Themselves

Author: Dell\u27Agnese Courtney
Publication venue: DOCS@RWU
Publication date: 18/04/2019
Field of study

Have you ever wondered how your cell phone unlocks just by using your face? How is it able to detect the difference between your face and someone else’s? This feature, now prominent in most cell phones, is just one type of machine learning that RWU students are learning how to build. In a computer science course, students went behind the screen to learn how facial recognition computer systems are constructed and how this type of technology can be applied to future devices

DOCS@RWU

HELIN Digital Commons

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks

Author: Gupta Tanmay
Hoiem Derek
Shih Kevin
Singh Saurabh
Publication venue
Publication date: 16/10/2017
Field of study

An important goal of computer vision is to build systems that learn visual representations over time that can be applied to many tasks. In this paper, we investigate a vision-language embedding as a core representation and show that it leads to better cross-task transfer than standard multi-task learning. In particular, the task of visual recognition is aligned to the task of visual question answering by forcing each to use the same word-region embeddings. We show this leads to greater inductive transfer from recognition to VQA than standard multitask learning. Visual recognition also improves, especially for categories that have relatively few recognition training labels but appear often in the VQA setting. Thus, our paper takes a small step towards creating more general vision systems by showing the benefit of interpretable, flexible, and trainable core representations.Comment: Accepted in ICCV 2017. The arxiv version has an extra analysis on correlation with human attentio

arXiv.org e-Print Archive

Crossref