79,860 research outputs found

    Sign language recognition with transformer networks

    Get PDF
    Sign languages are complex languages. Research into them is ongoing, supported by large video corpora of which only small parts are annotated. Sign language recognition can be used to speed up the annotation process of these corpora, in order to aid research into sign languages and sign language recognition. Previous research has approached sign language recognition in various ways, using feature extraction techniques or end-to-end deep learning. In this work, we apply a combination of feature extraction using OpenPose for human keypoint estimation and end-to-end feature learning with Convolutional Neural Networks. The proven multi-head attention mechanism used in transformers is applied to recognize isolated signs in the Flemish Sign Language corpus. Our proposed method significantly outperforms the previous state of the art of sign language recognition on the Flemish Sign Language corpus: we obtain an accuracy of 74.7% on a vocabulary of 100 classes. Our results will be implemented as a suggestion system for sign language corpus annotation

    What the eye does not see: visualizations strategies for the data collection of personal networks

    Get PDF
    The graphic representation of relational data is one of the central elements of social network analysis. In this paper, the author describe the use of visualization in interview-based data collection procedures designed to obtain personal networks information, exploring four main contributions. First, the author shows a procedure by which the visualization is integrated with traditional name generators to facilitate obtaining information and reducing the burden of the interview process. Second, the author describes the reactions and qualitative interpretation of the interviewees when they are presented with an analytical visualization of their personal network. The most frequent strategies consist in identifying the key individuals, dividing the personal network in groups and classifying alters in concentric circles of relative importance. Next, the author explores how the visualization of groups in personal networks facilitates the enumeration of the communities in which individuals participate. This allows the author to reflect on the role of social circles in determining the structure of personal networks. Finally, the author compares the graphic representation obtained through spontaneous, hand-drawn sociograms with the analytical visualizations elicited through software tools. This allows the author to demonstrate that analytical procedures reveal aspects of the structure of personal networks that respondents are not aware of, as well as the advantages and disadvantages of using both modes of data collection. For this, the author presents findings from a study of highly skilled migrants living in Spain (n = 95) through which the author illustrates the challenges, in terms of data reliability, validity and burden on both the researcher and the participants

    A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

    Full text link
    Sequential data often possesses a hierarchical structure with complex dependencies between subsequences, such as found between the utterances in a dialogue. In an effort to model this kind of generative process, we propose a neural network-based generative architecture, with latent stochastic variables that span a variable number of time steps. We apply the proposed model to the task of dialogue response generation and compare it with recent neural network architectures. We evaluate the model performance through automatic evaluation metrics and by carrying out a human evaluation. The experiments demonstrate that our model improves upon recently proposed models and that the latent variables facilitate the generation of long outputs and maintain the context.Comment: 15 pages, 5 tables, 4 figure
    corecore