59 research outputs found

    Information embedding and retrieval in 3D printed objects

    Get PDF
    Deep learning and convolutional neural networks have become the main tools of computer vision. These techniques are good at using supervised learning to learn complex representations from data. In particular, under limited settings, the image recognition model now performs better than the human baseline. However, computer vision science aims to build machines that can see. It requires the model to be able to extract more valuable information from images and videos than recognition. Generally, it is much more challenging to apply these deep learning models from recognition to other problems in computer vision. This thesis presents end-to-end deep learning architectures for a new computer vision field: watermark retrieval from 3D printed objects. As it is a new area, there is no state-of-the-art on many challenging benchmarks. Hence, we first define the problems and introduce the traditional approach, Local Binary Pattern method, to set our baseline for further study. Our neural networks seem useful but straightfor- ward, which outperform traditional approaches. What is more, these networks have good generalization. However, because our research field is new, the problems we face are not only various unpredictable parameters but also limited and low-quality training data. To address this, we make two observations: (i) we do not need to learn everything from scratch, we know a lot about the image segmentation area, and (ii) we cannot know everything from data, our models should be aware what key features they should learn. This thesis explores these ideas and even explore more. We show how to use end-to-end deep learning models to learn to retrieve watermark bumps and tackle covariates from a few training images data. Secondly, we introduce ideas from synthetic image data and domain randomization to augment training data and understand various covariates that may affect retrieve real-world 3D watermark bumps. We also show how the illumination in synthetic images data to effect and even improve retrieval accuracy for real-world recognization applications

    Robust density modelling using the student's t-distribution for human action recognition

    Full text link
    The extraction of human features from videos is often inaccurate and prone to outliers. Such outliers can severely affect density modelling when the Gaussian distribution is used as the model since it is highly sensitive to outliers. The Gaussian distribution is also often used as base component of graphical models for recognising human actions in the videos (hidden Markov model and others) and the presence of outliers can significantly affect the recognition accuracy. In contrast, the Student's t-distribution is more robust to outliers and can be exploited to improve the recognition rate in the presence of abnormal data. In this paper, we present an HMM which uses mixtures of t-distributions as observation probabilities and show how experiments over two well-known datasets (Weizmann, MuHAVi) reported a remarkable improvement in classification accuracy. © 2011 IEEE

    Finding Home for Poetry in a Nomadic World: Joseph Brodsky and \uc1gnes Leh\uf3czky

    Get PDF
    This new line of research has been suggested to me by the life and work of the Russian poet and essayist Joseph Brodsky, who, after his exile from the Soviet Union in 1972, moved to the United States, to lead a culturally \u2018nomadic\u2019 existence, which culminated, in his last years, in the abandonment of the mother tongue for the full adoption of his second language, both for prose and poetry. Departing from Brodsky\u2019s last production and following the steps that directed him to approach and then elect English as his privileged means of expression \u2013 necessary for his personal and artistic evolution \u2013 I have examined his work focused on the urban environment, namely the one located in Venice. I have then tried to see if displacement and repeated cultural travels can be considered a \u2018sought-after\u2019 status of the contemporary writer, starting from the reading of some guiding texts, as Nomadic Subjects by Rosi Braidotti (1994), Cultural Mobility: A Manifesto by Stephen Greenblatt (2010), and Culture in a Liquid Modern World (2011) by Zygmunt Bauman, drawing from the interdisciplinary and rapidly evolving field of Migration Studies. After presenting a quick but exhaustive overview of Brodsky\u2019s work located in Venice, I addressed my research to contemporary English poetry, to which Brodsky was considered to belong, to look for a correspondence with a new author, who also focuses on cultural nomadism, displacement, and the adoption of English as vehicle of artistic creation and I found a thematic resonance in the recent work of \uc1gnes Leh\uf3czky, essayist and poet, Hungarian by birth, and British by adoption, who belongs to the cultural movement of the \u2018British Poetic Revival.\u2019 The focus of my research has then been the investigation of Leh\uf3czky\u2019s \u2018post-avant-garde\u2019 poetry \u2013 still unpublished in Italian \u2013 to highlight some affinities in the works of the two authors, who, although belonging to two generations and two essentially different stylistic registers, find similar ways to explore the reality around them. Leh\uf3czky's texts offer new visions of the urban spaces in the cultural crossroads offered by today's technologized cities, where global relationships and the coexistence of multiple languages contribute to the creation of new identities, but where history must also become a fundamental element in understanding the present. Space, time and language play the main role in building her original, \u2018holistic\u2019 and at the same time \u2018palimpsestic\u2019 view of the world. It is a vision that, while recognizing in the mobility of contemporary man the traces of a nomadism which has always existed, finds in Leh\uf3czky's poems a correspondence in the perspectives of the lyrical observer, to offer the readers visions that span in horizontality and in verticality, for instance from the top of a hill in Budapest, to the catacombs of an English gothic cathedral, according to the principles of 'psychogeography.' English, far from being simply a lingua franca, absorbes the influences of the authors\u2019 mother tongues \u2013 \u2018phagocyting\u2019 in some way these latter \u2013 and is thus enriched with new features, becoming not only a new language, but a \u2018space in-between\u2019 that protects and welcomes the nomadic writers, and forges their new identities. Faced with the impossibility of defining the boundary of language and identity, because of the fluid and nomadic nature of language itself, these authors suggest if not answers, new richer languages and modalities, to extend the boundaries of contemporary literary expression

    Neural Radiance Fields: Past, Present, and Future

    Full text link
    The various aspects like modeling and interpreting 3D environments and surroundings have enticed humans to progress their research in 3D Computer Vision, Computer Graphics, and Machine Learning. An attempt made by Mildenhall et al in their paper about NeRFs (Neural Radiance Fields) led to a boom in Computer Graphics, Robotics, Computer Vision, and the possible scope of High-Resolution Low Storage Augmented Reality and Virtual Reality-based 3D models have gained traction from res with more than 1000 preprints related to NeRFs published. This paper serves as a bridge for people starting to study these fields by building on the basics of Mathematics, Geometry, Computer Vision, and Computer Graphics to the difficulties encountered in Implicit Representations at the intersection of all these disciplines. This survey provides the history of rendering, Implicit Learning, and NeRFs, the progression of research on NeRFs, and the potential applications and implications of NeRFs in today's world. In doing so, this survey categorizes all the NeRF-related research in terms of the datasets used, objective functions, applications solved, and evaluation criteria for these applications.Comment: 413 pages, 9 figures, 277 citation

    Exploiting Spatio-Temporal Coherence for Video Object Detection in Robotics

    Get PDF
    This paper proposes a method to enhance video object detection for indoor environments in robotics. Concretely, it exploits knowledge about the camera motion between frames to propagate previously detected objects to successive frames. The proposal is rooted in the concepts of planar homography to propose regions of interest where to find objects, and recursive Bayesian filtering to integrate observations over time. The proposal is evaluated on six virtual, indoor environments, accounting for the detection of nine object classes over a total of ∼ 7k frames. Results show that our proposal improves the recall and the F1-score by a factor of 1.41 and 1.27, respectively, as well as it achieves a significant reduction of the object categorization entropy (58.8%) when compared to a two-stage video object detection method used as baseline, at the cost of small time overheads (120 ms) and precision loss (0.92).</p

    The wrist, the neck, and the waist : articulations of female sexuality in mid-nineteenth century culture

    Get PDF
    This thesis explores how mid-Victorian representations of the wrist, neck, and waist can be read as expressive of female sexuality. I read the appearance of these pieces of the body for their potential to contradict, challenge, or elude ideologies of nineteenth-century sexual regulation and control of women. In studying how desire could be displaced to portions of the body whose display was sanctioned, I draw together two key mid-Victorian preoccupations: the visibility of female sexuality and the subjectivity of artistic consumption. Successive chapters focus on different art forms between the 1850s and the 1870s, including some of the most popular works of the period, alongside critical and social perspectives on the era. I examine how concepts of agency of expression and interpretation negotiate with the strictures, social and physical, that shaped and curated the display of the female body. In doing so, I perform readings of poetry, painting, illustration, photography, art criticism, fashion journalism, and novels. The first chapter examines the representation of the neck in Christina Rossetti’s Goblin Market and Other Poems, both in the titular poem and illustrations by Dante Gabriel Rossetti. I interpret the neck as a spatially and sensually disruptive element of these works, which can facilitate a subjective physical experience of art by the consumer. In the second chapter I scrutinise the appearance of the waist in the photographs of Lady Clementina Hawarden, and in fashion criticism written by women. I analyse how women exercised creative agency by shaping representations of themselves, through the use of the corset and the camera. The final chapter looks at representations of the wrist and its coverings in George Eliot’s Middlemarch and Daniel Deronda. I read the wrist’s erotic significance in these novels, not as a space of subjugation or repression, but as one of sensual agency

    Literature and the Making of the World

    Get PDF
    This open access book positions itself at the intersection of world literature studies, literary anthropology and philosophical critiques of 'world' and 'globe' concepts. Doing so, it investigates how literature imagines and shapes worlds for its readers through linguistically specific cosmopolitan-vernacular dynamics, both at the level of textual engagement and on a material level of textual production and circulation. Moving from textual analyses in Part One – 'Worlds in Texts' – to combined analyses of texts, media and agents in the literary field in Part Two – 'Texts in Worlds' – the concerns of these nine chapters range from multilingualism, genre and style to material forms such as the little magazine or the scrapbook archive and finally to activities such as travel (as a writing profession) and literary promotion. With this focus on practice – which geographically engages with Constantinople, China, Russia, western Europe, North America, southern Africa and India – contributors demonstrate methodologically how world literature studies can bring the empirically specific detail to bear on global modes of analysis. It is precisely through such a dual optic that the world-making capacity of literature becomes apparent

    Activity in area V3A predicts positions of moving objects

    Get PDF
    No description supplie

    MediaSync: Handbook on Multimedia Synchronization

    Get PDF
    This book provides an approachable overview of the most recent advances in the fascinating field of media synchronization (mediasync), gathering contributions from the most representative and influential experts. Understanding the challenges of this field in the current multi-sensory, multi-device, and multi-protocol world is not an easy task. The book revisits the foundations of mediasync, including theoretical frameworks and models, highlights ongoing research efforts, like hybrid broadband broadcast (HBB) delivery and users' perception modeling (i.e., Quality of Experience or QoE), and paves the way for the future (e.g., towards the deployment of multi-sensory and ultra-realistic experiences). Although many advances around mediasync have been devised and deployed, this area of research is getting renewed attention to overcome remaining challenges in the next-generation (heterogeneous and ubiquitous) media ecosystem. Given the significant advances in this research area, its current relevance and the multiple disciplines it involves, the availability of a reference book on mediasync becomes necessary. This book fills the gap in this context. In particular, it addresses key aspects and reviews the most relevant contributions within the mediasync research space, from different perspectives. Mediasync: Handbook on Multimedia Synchronization is the perfect companion for scholars and practitioners that want to acquire strong knowledge about this research area, and also approach the challenges behind ensuring the best mediated experiences, by providing the adequate synchronization between the media elements that constitute these experiences

    Shortest Route at Dynamic Location with Node Combination-Dijkstra Algorithm

    Get PDF
    Abstract— Online transportation has become a basic requirement of the general public in support of all activities to go to work, school or vacation to the sights. Public transportation services compete to provide the best service so that consumers feel comfortable using the services offered, so that all activities are noticed, one of them is the search for the shortest route in picking the buyer or delivering to the destination. Node Combination method can minimize memory usage and this methode is more optimal when compared to A* and Ant Colony in the shortest route search like Dijkstra algorithm, but can’t store the history node that has been passed. Therefore, using node combination algorithm is very good in searching the shortest distance is not the shortest route. This paper is structured to modify the node combination algorithm to solve the problem of finding the shortest route at the dynamic location obtained from the transport fleet by displaying the nodes that have the shortest distance and will be implemented in the geographic information system in the form of map to facilitate the use of the system. Keywords— Shortest Path, Algorithm Dijkstra, Node Combination, Dynamic Location (key words
    • …
    corecore