126 research outputs found

    Use of Coherent Point Drift in computer vision applications

    This thesis presents the novel use of Coherent Point Drift (CPD) to improve the robustness of a number of computer vision applications. The CPD approach includes two methods for registering two images, rigid and non-rigid point set registration, which differ in the transformation model used. The key characteristic of a rigid transformation is that distances between points are preserved up to a uniform scale factor, which means it can be used in the presence of translation, rotation, and scaling. Non-rigid transformations, such as affine transforms, allow registration under non-uniform scaling and skew. The idea is to move one point set coherently to align with the second point set. The CPD method finds both the non-rigid transformation and the correspondence between the two point sets at the same time, without requiring an a priori declaration of the transformation model used. The first part of this thesis focuses on speaker identification in video conferencing. A real-time, audio-coupled, video-based approach is presented, which relies more on video analysis than on audio analysis, which is known to be prone to errors. CPD is used for lip movement detection, and a temporal face detection approach is used to minimise false positives when the face detection algorithm fails. The second part of the thesis focuses on multi-exposure and multi-focus image fusion with compensation for camera shake. The Scale Invariant Feature Transform (SIFT) is first used to detect keypoints in the images being fused. This point set is then reduced by removing outliers with RANSAC (RANdom SAmple Consensus), and finally the point sets are registered using CPD with non-rigid transformations. The registered images are then fused with a Contourlet-based image fusion algorithm that makes use of a novel alpha blending and filtering technique to minimise artefacts. The thesis evaluates the performance of the algorithm against a number of state-of-the-art approaches, including the key commercial products currently on the market, showing significantly improved subjective quality in the fused images. The final part of the thesis presents a novel approach to Vehicle Make & Model Recognition (VMMR) in CCTV video footage. CPD is used to remove the skew of detected vehicles, since CCTV cameras are not specifically configured for the VMMR task and may capture vehicles at different approach angles. A LESH (Local Energy Shape Histogram) feature-based approach is used for vehicle make and model recognition, with the novelty that temporal processing is used to improve reliability. A number of further algorithms are used to maximise the reliability of the final outcome. Experimental results show that the proposed system achieves an accuracy in excess of 95% when tested on real CCTV footage with no prior camera calibration.
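
    The distinction the abstract draws between rigid and affine transformations can be illustrated with a small sketch. This is not the CPD algorithm and not code from the thesis; it only checks whether pairwise distances survive a transformation. The function names (`rigid`, `affine`, `same_distances`) and the sample points are hypothetical, and the rigid case is restricted here to rotation plus translation (no scaling) for simplicity.

```python
import math

def rigid(points, angle, tx, ty):
    """Rotate 2D points by `angle` (radians) and translate by (tx, ty)."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in points]

def affine(points, a, b, c, d, tx, ty):
    """Apply the general 2D affine map [[a, b], [c, d]] plus a translation."""
    return [(a * x + b * y + tx, c * x + d * y + ty) for x, y in points]

def pairwise_distances(points):
    """Distances between every unordered pair of points, in a fixed order."""
    return [math.dist(p, q) for i, p in enumerate(points) for q in points[i + 1:]]

def same_distances(a, b, tol=1e-9):
    """True if two equally sized point sets have matching pairwise distances."""
    da, db = pairwise_distances(a), pairwise_distances(b)
    return all(abs(x - y) <= tol for x, y in zip(da, db))

# Hypothetical sample data: the corners of a unit square.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]

# Rotation by 30 degrees plus a shift: pairwise distances are unchanged.
moved = rigid(square, math.radians(30), 2.0, -1.0)

# Skew plus non-uniform scaling: pairwise distances change.
skewed = affine(square, 1.0, 0.5, 0.0, 2.0, 0.0, 0.0)

print(same_distances(square, moved))
print(same_distances(square, skewed))
```

    A registration method that allows only the first kind of map cannot undo skew, which is why the abstract turns to the non-rigid variant for camera shake and vehicle skew.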

    Irish Machine Vision and Image Processing Conference Proceedings 2017


    Spaces of Multilingualism

    This innovative collection explores critical issues in understanding multilingualism as a defining dimension of identity creation and negotiation in contemporary social life. Reinforcing interdisciplinary conversations on these themes, each chapter is co-authored by two different researchers, often those who have not written together before. The combined effect is a volume showcasing unique and dynamic perspectives on such topics as the rethinking of language policy, the testing of language rights, language pedagogy, meaning-making, and activism in the linguistic landscape. The book explores multilingualism through the lenses of spaces and policies as embodied in Elizabeth Lanza’s body of work in the field, with a focus on the latest research on linguistic landscapes in diverse settings. Taken together, the chapters offer a window into better understanding issues around processes of change in and of languages and societies. This groundbreaking volume will be of particular interest to students and scholars in multilingualism, applied linguistics, and sociolinguistics.

    Desperately seeking depth: global and local narratives of the South African general elections on television news, 1994 - 2014

    Eric Louw, Jesper Strömbäck, and W. Lance Bennett call the trend in late-20th century political journalism "mediatisation", where the televisualisation of Western elections favours episodic, dramatic, fragmented, and event-driven reporting. This "hype-ocracy" results in narrow and shallow frames that entertain rather than enlighten. This thesis, titled "Desperately Seeking Depth", examines this trend in both international and local news about South African elections. While scholarship on Western elections on TV news is blossoming, analyses of news coverage of South African elections are sparse. There is particularly little analysis of the visual dimensions of TV news coverage, which remains a methodological challenge for media and communication scholars. This thesis draws together a comprehensive analysis of South Africa's general elections on international and local television news over two decades. It develops an innovative, multimodal analysis method dedicated to television news and adds meaningful data to the overall study of South African media and politics, and international communication. It combines analysis of previous studies of each election with original analysis of over 150 news broadcasts to uncover the news narratives about the South African general elections between 1994 and 2014. This thesis demonstrates the difference between global and local journalism about South African elections. Restricted by mediatised news values that favour episodic reporting, Western journalists present entangled, contradictory narratives over the years. The fixation on 1994's violent-turned-miracle election narrative ignored the complexities of the new democracy, while an increasingly detached approach to covering the 2009 and 2014 ANC victories left journalists perplexed and unable to explore deeper narratives. Meanwhile, South African channels became progressively more hesitant to investigate controversial topics or criticise the ruling party. Avoidance of important issues such as the 1994 election violence, the AIDS crisis in 2004, and Zuma's Nkandla fiasco in 2014 results in narrow reporting that limits the substantive information available during the election periods. All channels to some extent seek narratives that attempt to explain and explore South Africa's complex democracy, but these narratives are often contradictory. The decline in journalists' engagement with political leaders and citizens means that the full picture of the elections is reduced to a few easily digestible frames that confirm neoliberal news values. This thesis offers a new model for the analysis of TV news coverage of elections that can provide the basis for future studies. "Desperately Seeking Depth" ultimately uncovers a picture of a news industry that, both locally and globally, works as an echo chamber of sound bites focused on elite voices.

    The Impact of the Internationalisation of Higher Education on Scientists’ Multimodal Communication: A case study from Catalonia

    Universities worldwide are urged to engage in the process of ‘internationalisation’ as a hallmark of quality and as a lure to attract students. The current study approaches this issue from the context of Catalan higher education institutions, which face the dilemma of supporting the local language(s) while at the same time embracing multilingualism and especially English. The main aim of this study is to examine the impact of the internationalisation of higher education on the daily communication of scientists. Ethnographic data were collected over a period of 11 months of observation of two multinational research groups (RGs) based in a Catalan state university, and contrasted with data taken from an RG based in Germany and with insights from the researcher’s own RG. A theoretical objective derives from the empirical one: to design and test a suitable theoretical framework for studying the phenomenon holistically. This study aims to contribute to the limited body of research describing scientists’ "informal" and unpublished communicative practices, as well as to the literature on the internationalisation of higher education. On a practical level, this work is intended to aid in the improvement of internationalisation policies of higher education institutions in Catalonia, in Europe and potentially in other contexts worldwide.

    Inferring Complex Activities for Context-aware Systems within Smart Environments

    The rising ageing population worldwide and the prevalence of age-related conditions such as physical fragility, mental impairments and chronic diseases have significantly impacted quality of life and caused a shortage of health and care services. Over-stretched healthcare providers are leading to a paradigm shift in public healthcare provisioning. Thus, Ambient Assisted Living (AAL) using Smart Home (SH) technologies has been rigorously investigated to help address these problems. Human Activity Recognition (HAR) is a critical component of AAL systems, enabling applications such as just-in-time assistance, behaviour analysis, anomaly detection and emergency notifications. This thesis investigates the challenges faced in accurately recognising Activities of Daily Living (ADLs) performed by single or multiple inhabitants within smart environments. Specifically, it explores five complementary research challenges in HAR. The first study contributes to knowledge by developing a semantic-enabled data segmentation approach with user preferences. The second study takes the segmented set of sensor data to investigate and recognise human ADLs at two levels of action granularity: coarse-grained and fine-grained. At the coarse-grained action level, semantic relationships between the sensor, object and ADLs are deduced, whereas at the fine-grained action level, object usage at a satisfactory threshold, with the evidence fused from multimodal sensor data, is leveraged to verify the intended actions. Moreover, because of imprecise or vague interpretations of multimodal sensors and data fusion challenges, fuzzy set theory and the fuzzy web ontology language (fuzzy-OWL) are leveraged. The third study focuses on incorporating uncertainties that arise in HAR from factors such as technological failure, object malfunction, and human error. Hence, existing uncertainty theories and approaches are analysed and, based on the findings, a probabilistic ontology (PR-OWL) based HAR approach is proposed. The fourth study extends the first three studies to distinguish activities conducted by more than one inhabitant in a shared smart environment, using discriminative sensor-based techniques and time-series pattern analysis. The final study investigates a suitable system architecture for a real-time smart environment tailored to AAL systems and proposes a microservices architecture with off-the-shelf and bespoke sensor-based sensing methods. The initial semantic-enabled data segmentation study achieved 100% and 97.8% accuracy in segmenting sensor events under single and mixed activity scenarios, respectively. However, the average classification time taken to segment each sensor event was high: 3971 ms and 62183 ms for single and mixed activity scenarios, respectively. The second study, on detecting fine-grained user actions, was evaluated with 30 and 153 fuzzy rules to detect two fine-grained movements using a pre-collected dataset from the real-time smart environment. The results of the second study indicate good average accuracy of 83.33% and 100%, but with high average durations of 24648 ms and 105318 ms, posing further challenges for the scalability of fusion rule creation. The third study was evaluated by incorporating the PR-OWL ontology with ADL ontologies and the Semantic Sensor Network (SSN) ontology to define four types of uncertainty present in a kitchen-based activity. The fourth study presented a case study extending single-user activity recognition (AR) to multi-user AR by combining RFID tags and fingerprint sensors as discriminative sensors to identify and associate user actions with the aid of time-series analysis. The last study responds to the computational and performance requirements of the four studies by analysing and proposing a microservices-based system architecture for AAL systems. Future research towards adopting fog/edge computing paradigms from cloud computing is discussed, with the aims of higher availability, reduced network traffic, energy use and cost, and a decentralised system. As a result of the five studies, this thesis develops a knowledge-driven framework to estimate and recognise multi-user activities at the level of fine-grained user actions. This framework integrates three complementary ontologies to conceptualise factual, fuzzy and uncertain knowledge about the environment and ADLs, together with time-series analysis and a discriminative sensing environment. Moreover, a distributed software architecture, multimodal sensor-based hardware prototypes, and supporting utility tools such as a simulator and a synthetic ADL data generator were developed to support the evaluation of the proposed approaches. The distributed system is platform-independent and is currently supported by an Android mobile application and web-browser-based client interfaces for retrieving information such as live sensor events and HAR results.

    The identity construction and representation of diasporic Chinese content creators on YouTube

    A significant number of diasporic Chinese content creators have emerged on YouTube in recent years. Unlike their parents, these members of the Chinese diaspora in the Western world have spent most of their lives in the receiving countries and were marginalized by mainstream society while growing up. With the intention of representing their own diasporic identity, they have made a series of videos sharing various culture-related content, ranging from ethnic food preparation and generational relationships to heritage language practices. Many of these videos have already received hundreds of thousands of views, showing their potential to have a large social influence. This study therefore examines how the Chinese diaspora construct and represent their cultural identity on this platform, with a specific focus on the Chinese in Western countries. To understand the topic, this study combines theories of diaspora and transnationalism, cultural identity and semiotics, and representation and power relations, while also considering YouTube’s distinctive “participatory culture” and its commercial attributes. In terms of methodology, this study treats YouTube’s environment as a whole and adopts a series of methods: online observation, semi-structured interviews and textual analysis. The findings are divided into three chapters, each focusing on one cultural element (Chinese food, parents and heritage language), the influence of that element on Western Chinese identity construction and, more importantly, how creators represent these symbols online. Throughout, the power relations behind the representation process are carefully investigated to understand how a hybrid identity is formulated through these online practices.

    SiAM-dp : an open development platform for massively multimodal dialogue systems in cyber-physical environments

    Cyber-physical environments enhance natural environments of daily life such as homes, factories, offices, and cars by connecting the cybernetic world of computers and communication with the real physical world. The possible application areas are wide-ranging. While, under the keyword Industrie 4.0, cyber-physical environments will play a significant role in the next industrial revolution, they will also appear in homes, offices, workshops, and numerous other areas. In this new world, classical interaction concepts in which users interact exclusively with a single stationary device, PC or smartphone become less dominant and make room for new forms of interaction between humans and the environment itself. Furthermore, new technologies and a growing spectrum of applicable modalities broaden the possibilities for interaction designers to include more natural and intuitive verbal and non-verbal communication. The dynamic character of a cyber-physical environment and the mobility of its users confront developers with the challenge of building systems that are flexible with respect to the connected and used devices and modalities. This also implies new opportunities for cross-modal interaction that go beyond the dual-modality interaction that is well known today. This thesis addresses the support of application developers with a platform for the declarative and model-based development of multimodal dialogue applications, with a focus on distributed input and output devices in cyber-physical environments. The main contributions can be divided into three parts:
    - The design of models and strategies for the specification of dialogue applications in a declarative development approach. This includes models for the definition of project resources, dialogue behaviour, speech recognition grammars, and graphical user interfaces, as well as mapping rules that convert the device-specific representation of input and output descriptions to a common representation language.
    - The implementation of a runtime platform that provides a flexible and extendable architecture for the easy integration of new devices and components. The platform realises concepts and strategies of multimodal human-computer interaction and is the basis for full-fledged multimodal dialogue applications for arbitrary device setups, domains, and scenarios.
    - A software development toolkit that is integrated in the Eclipse rich client platform and provides wizards and editors for creating and editing new multimodal dialogue applications.