20,616 research outputs found

    Detecting Family Resemblance: Automated Genre Classification.

    Get PDF
    This paper presents results in automated genre classification of digital documents in PDF format. It describes genre classification as an important ingredient in contextualising scientific data and in retrieving targetted material for improving research. The current paper compares the role of visual layout, stylistic features and language model features in clustering documents and presents results in retrieving five selected genres (Scientific Article, Thesis, Periodicals, Business Report, and Form) from a pool of materials populated with documents of the nineteen most popular genres found in our experimental data set.

    INVESTIGATING THE ROLES OF MECHANORECEPTIVE CHANNELS IN TACTILE APPARENT MOTION PERCEPTION: A VIBROTACTILE STUDY

    Get PDF
    Tactile apparent motion (TAM) is a perceptual phenomenon in which consecutive presentation of multiple tactile stimuli creates an illusion of motion. Employing a novel tactile display device, the Latero, allowed us to investigate this. The current study focused on the Rapidly Adapting (RA) channel and Slowly Adapting I (SAI) channel on the index finger. The experiment implemented vibrotactile masking stimuli to target the mechanoreceptive channels with the goal of gaining better insight into the involvement of mechanoreceptive channels in the perception of TAM. Masking stimuli were used because previous studies have used them to differentiate between different channels; a certain masking stimulus will impact a mechanoreceptive channel more than others. The experiment began by measuring participants’ threshold for TAM stimuli by varying the stimulus intensity in a two-choice task (left vs right); participants received test trials consisting of TAM stimuli with 25 Hz and 6 Hz testing for the RA and SAI channels, respectively. Next, participants performed a series of test trials with vibrotactile masking stimuli that preceded the TAM stimuli mentioned above. The vibrotactile masking stimulus varied in duration (4 seconds vs 8 seconds) and intensity (two times vs three times the intensity of the TAM stimuli). The results suggest that there was no difference in accuracy when testing for the RA and SAI channels. The results also showed that the introduction of the masking stimuli significantly lowered accuracy. Overall, neither the RA nor the SAI channel may be uniquely involved in TAM perception. However, further improvement on the current design may aid in isolating each channel to help better understand the channel’s role in TAM perception

    Examining Variations of Prominent Features in Genre Classification.

    Get PDF
    This paper investigates the correlation between features of three types (visual, stylistic and topical types) and genre classes. The majority of previous studies in automated genre classification have created models based on an amalgamated representation of a document using a combination of features. In these models, the inseparable roles of different features make it difficult to determine a means of improving the classifier when it exhibits poor performance in detecting selected genres. In this paper we use classifiers independently modeled on three groups of features to examine six genre classes to show that the strongest features for making one classification is not necessarily the best features for carrying out another classification.

    Automating Metadata Extraction: Genre Classification

    Get PDF
    A problem that frequently arises in the management and integration of scientific data is the lack of context and semantics that would link data encoded in disparate ways. To bridge the discrepancy, it often helps to mine scientific texts to aid the understanding of the database. Mining relevant text can be significantly aided by the availability of descriptive and semantic metadata. The Digital Curation Centre (DCC) has undertaken research to automate the extraction of metadata from documents in PDF([22]). Documents may include scientific journal papers, lab notes or even emails. We suggest genre classification as a first step toward automating metadata extraction. The classification method will be built on looking at the documents from five directions; as an object of specific visual format, a layout of strings with characteristic grammar, an object with stylo-metric signatures, an object with meaning and purpose, and an object linked to previously classified objects and external sources. Some results of experiments in relation to the first two directions are described here; they are meant to be indicative of the promise underlying this multi-faceted approach.

    Formulating representative features with respect to document genre classification

    Get PDF
    Genre classification (e.g. whether a document is a scientific article or magazine article) is closely bound to the physical and conceptual structure of document as well as the level of depth involved in the text. Hence, it provides a means of ranking documents retrieved by search tools according to metrics other than topical similarity. Moreover, the structural information derived from genre classification can be used to locate target information within the text. In previous studies, the detection of genre classes has been attempted by using some normalised frequency of terms or combinations of terms in the document (here, we are using term as a reference to words, phrases, syntactic units, sentences and paragraphs, as well as other patterns derived from deeper linguistic or semantic analysis). These approaches largely neglect how the term is distributed throughout the document. Here, we report the results of automated experiments based on distributive statistics of words in order to present evidence that term distribution pattern is a better indicator of genre class than term frequency.

    Feature Type Analysis in Automated Genre Classification

    Get PDF
    In this paper, we compare classifiers based on language model, image, and stylistic features for automated genre classification. The majority of previous studies in genre classification have created models based on an amalgamated representation of a document using a multitude of features. In these models, the inseparable roles of different features make it difficult to determine a means of improving the classifier when it exhibits poor performance in detecting selected genres. By independently modeling and comparing classifiers based on features belonging to three types, describing visual, stylistic, and topical properties, we demonstrate that different genres have distinctive feature strengths.

    Building a document genre corpus: a profile of the KRYS I corpus

    Get PDF
    This paper describes the KRYS I corpus, consisting of documents classified into 70 genre classes. It has been constructed as part of an effort to automate document genre classification as distinct from topic detection. Previously there has been very little work on building corpora of texts which have been classified using a nontopical genre palette. The reason for this is partly due to the fact that genre as a concept, is rooted in philosophy, rhetoric and literature, and highly complex and domain dependent in its interpretation ([11]). The usefulness of genre in everyday information search is only now starting to be recognised and there is no genre classification schema that has been consolidated to have applicable value in this direction. By presenting here our experiences in constructing the KRYS I corpus, we hope to shed light on the information gathering and seeking behaviour and the role of genre in these activities, as well as a way forward for creating a better corpus for testing automated genre classification tasks and the application of these tasks to other domains.

    Digital populism in South Korea?

    Get PDF

    Making Meaning out of Learning in the Everyday Lives of Korean Married Women

    Get PDF
    This study examines the significance of the learning experience in the everyday life of married women in Korea and how they manage their learning. In doing so, it reveals a number of distinctive characteristics of Korean married women which differentiate them from western women in pursuing individual and collective values

    Emerging Market Multinationals: the New Model of Internationalization of Firms

    Get PDF
    The conventional global competitive landscape started to change since the 1990s with the emergence of emerging market multinationals (EMNEs). Most of EMNEs were marginal competitors just a decade ago, but, today, they are challenging the world’s most accomplished and establishedmultinational enterprises (MNEs)from advanced markets in a wide variety of industries and markets. EMNEs have adopted different global expansion strategies from advanced market MNEs in the process of internationalization. Therefore, it is timely and critical to question whether the existing theories on internationalization of MNEs that have been developed mainly studying MNEs from advanced markets are relevant to explain the behaviors and strategies of EMNEs. Through this paper, I try to elaborate the behaviors and strategies that have been adopted by EMNEs in their internationalization process to support the theory building efforts to explain the internationalization of EMNEs. Although we need to elaborate new theories to explain internationalization process specifically appropriate to EMNEs, we also need to identify which aspects of the existing theories on internationalization of MNEs are universally vali
    corecore