
    Mixing Modalities of 3D Sketching and Speech for Interactive Model Retrieval in Virtual Reality

    Sketch and speech are intuitive interaction methods that convey complementary information, and each has been used independently for 3D model retrieval in virtual environments. While sketching has been shown to be an effective retrieval method, not all collections are easily navigable using this modality alone. To demonstrate this, we design a new, challenging database of 3D chairs in which each component (arms, legs, seat, back) is independently colored; to overcome the limitation, we implement a multimodal interface for querying 3D model databases within a virtual environment. We base the sketch modality on the state of the art in 3D sketch retrieval and use a Wizard-of-Oz-style experiment to process the voice input. In this way, we avoid the complexities of natural language processing, which frequently requires fine-tuning to be robust. We conduct two user studies and show that hybrid search strategies emerge from the combination of interactions, exploiting the advantages of both modalities.
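    To make the hybrid query concrete, the loop below is a minimal Python sketch, not the authors' code: a sketch descriptor ranks the database by cosine similarity, and constraints parsed from the speech channel (in the paper, by a Wizard-of-Oz operator rather than an NLP pipeline) filter the candidates by per-component colour. All names, shapes and the attribute encoding are illustrative assumptions.

```python
import numpy as np

def cosine_rank(query_desc: np.ndarray, db_descs: np.ndarray) -> np.ndarray:
    """Return database indices ordered by cosine similarity to the query."""
    q = query_desc / np.linalg.norm(query_desc)
    d = db_descs / np.linalg.norm(db_descs, axis=1, keepdims=True)
    return np.argsort(-(d @ q))

def speech_filter(indices, attributes, constraints):
    """Keep models whose component colours match every spoken constraint.

    `attributes` maps model index -> {"legs": "red", ...}; `constraints`
    is the parsed speech query, e.g. {"legs": "red"}. In the study the
    parsing is done by a Wizard-of-Oz operator, not an NLP system.
    """
    return [i for i in indices
            if all(attributes[i].get(part) == colour
                   for part, colour in constraints.items())]

# Hypothetical usage: 1000 chair models with 128-D sketch descriptors.
rng = np.random.default_rng(0)
db = rng.normal(size=(1000, 128))
attrs = {i: {"legs": str(rng.choice(["red", "blue"])),
             "seat": str(rng.choice(["red", "blue"]))} for i in range(1000)}
ranking = cosine_rank(rng.normal(size=128), db)
top10 = speech_filter(ranking, attrs, {"legs": "red"})[:10]
```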

    Application of Machine Learning within Visual Content Production

    We are living in an era in which digital content is produced at a dazzling pace. Contents and contexts are so heterogeneous that numerous applications have been created to respond to people's and the market's demands. The visual content production pipeline is the generalisation of the process that allows a content editor to create and evaluate their product, such as a video, an image, or a 3D model. Such data is then displayed on one or more devices such as TVs, PC monitors, virtual reality head-mounted displays, tablets, mobiles, or even smartwatches. Content creation can be as simple as clicking a button to film a video and then sharing it on a social network, or as complex as managing a dense user interface full of parameters with keyboard and mouse to generate a realistic 3D model for a VR game. In the second example, such sophistication results in a steep learning curve for beginner-level users, while expert users regularly need to refine their skills via expensive lessons, time-consuming tutorials, or experience. Thus, user interaction plays an essential role in the diffusion of content creation software, primarily when it is targeted at untrained people.

    In particular, the fast spread of virtual reality devices into the consumer market has created new opportunities for designing reliable and intuitive interfaces. Such interactions need to take a step beyond the point-and-click interaction typical of the 2D desktop environment: they need to be smart, intuitive and reliable enough to interpret 3D gestures, and therefore more accurate pattern-recognition algorithms are needed. In recent years, machine learning, and in particular deep learning, has achieved outstanding results in many branches of computer science, such as computer graphics and human-computer interaction, outperforming algorithms that were considered the state of the art; however, there have been only fleeting efforts to translate this into virtual reality.

    In this thesis, we seek to apply and take advantage of deep learning models in two areas of the content production pipeline, embracing the following subjects of interest: advanced methods for user interaction and visual quality assessment. First, we focus on 3D sketching to retrieve models from an extensive database of complex geometries and textures while the user is immersed in a virtual environment. We explore both 2D and 3D strokes as tools for model retrieval in VR and implement a novel system that improves accuracy in searching for a 3D model. We contribute an efficient method to describe models through 3D sketches via iterative descriptor generation, focusing on both accuracy and user experience; to evaluate it, we design a user study comparing different interactions for sketch generation. Second, we explore the combination of sketch input and vocal description to correct and fine-tune the search for 3D models in a database containing fine-grained variation. We analyse sketch and speech queries, identifying a way to incorporate both of them into our system's interaction loop. Third, in the context of the visual content production pipeline, we present a detailed study of visual metrics and propose a novel method for detecting rendering-based artefacts in images, which exploits deep learning algorithms analogous to those used to extract features from sketches.
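    To make the "iterative descriptor generation" idea concrete, here is a minimal, hypothetical Python sketch: each new 3D stroke updates a running query descriptor so retrieval results can be refreshed while the user is still sketching. The thesis uses learned deep descriptors; the voxel-occupancy histogram below is only a simple stand-in encoder, and all names and shapes are assumptions.

```python
import numpy as np

GRID = 8  # 8x8x8 voxel grid -> 512-D descriptor

def voxel_histogram(points: np.ndarray) -> np.ndarray:
    """Map 3D points in [0,1]^3 to a normalised voxel-occupancy descriptor."""
    idx = np.clip((points * GRID).astype(int), 0, GRID - 1)
    hist = np.zeros((GRID, GRID, GRID))
    for i, j, k in idx:
        hist[i, j, k] += 1
    v = hist.ravel()
    return v / (np.linalg.norm(v) + 1e-9)

class IterativeQuery:
    """Accumulates strokes and exposes the descriptor of the sketch so far."""
    def __init__(self):
        self.points = np.empty((0, 3))

    def add_stroke(self, stroke_points: np.ndarray) -> np.ndarray:
        self.points = np.vstack([self.points, stroke_points])
        return voxel_histogram(self.points)  # descriptor after this stroke

# Hypothetical usage: two strokes progressively refine the query.
rng = np.random.default_rng(1)
db = rng.random((500, GRID ** 3))          # pretend model descriptors
q = IterativeQuery()
for stroke in (rng.random((40, 3)), rng.random((40, 3))):
    desc = q.add_stroke(stroke)
    best = np.argmax(db @ desc)            # nearest model so far
```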

    Technology Demonstration: Augmented Design Ideation

    This technology demonstration showcases augmented design ideation tools developed to explore hybrid design environments [1] through novel prototypes that draw on ambient informatics and ubiquitous computing approaches. The tools aim to broaden the creative scope of multi-participant design ideation by providing contextually relevant visual design prompts that support ideation processes and conceptualisation [1].

    To “Sketch-a-Scratch”

    A surface can be harsh and raspy, or smooth and silky, and everything in between. We are used to sensing these features with our fingertips as well as with our eyes and ears: the exploration of a surface is a multisensory experience. Tools, too, are often employed in the interaction with surfaces, since they augment our manipulation capabilities. “Sketch-a-Scratch” is a tool for the multisensory exploration and sketching of surface textures. The user’s actions drive a physical sound model of real materials’ response to interactions such as scraping, rubbing or rolling. Moreover, different input signals can be converted into 2D visual surface profiles, enabling users to experience them visually, aurally and haptically.
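    As an illustration of the signal-to-surface idea, the following Python sketch (an assumption-laden toy, not the published physical sound model) treats any 1-D signal as a height profile and derives a scraping excitation from the slope encountered by a probe moving at a given velocity.

```python
import numpy as np

def profile_from_signal(signal: np.ndarray) -> np.ndarray:
    """Normalise any input signal into a surface height profile in [0, 1]."""
    s = signal - signal.min()
    return s / (s.max() + 1e-9)

def scrape(profile: np.ndarray, velocity: float, sr: int = 44100,
           duration: float = 1.0) -> np.ndarray:
    """Sample the profile at `velocity` units/s and output the slope,
    a crude stand-in for the friction force felt by the probe."""
    t = np.arange(int(sr * duration)) / sr
    pos = (velocity * t) % 1.0                        # probe position, wraps
    heights = np.interp(pos, np.linspace(0, 1, len(profile)), profile)
    return np.diff(heights, prepend=heights[0]) * sr  # rate of height change

# Hypothetical usage: a noisy profile scraped quickly sounds harsher
# than a smooth, low-frequency one scraped slowly.
rng = np.random.default_rng(2)
rough = profile_from_signal(rng.normal(size=2048))
audio = scrape(rough, velocity=0.5)
```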

    Web-based haptic applications for blind people to create virtual graphs

    Haptic technology has great potential in many applications. This paper introduces our work on delivering haptic information via the Web. A multimodal tool has been developed to allow blind people to create virtual graphs independently. Multimodal interactions in the process of graph creation and exploration are provided by a low-cost haptic device, the Logitech WingMan Force Feedback Mouse, and Web audio. The Web-based tool also offers blind people the convenience of receiving information at home. In this paper, we present the development of the tool and the evaluation results, and discuss issues related to the design of similar Web-based haptic applications.
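    The audio side of such a tool can be illustrated with a short Python sketch (not the paper's implementation, which uses Web audio in the browser): each y-value of a graph is mapped to the pitch of a short tone, so a rising curve is heard as a rising sweep. Parameter names and ranges are assumptions.

```python
import numpy as np

def sonify(y: np.ndarray, sr: int = 22050, note_s: float = 0.15,
           f_lo: float = 220.0, f_hi: float = 880.0) -> np.ndarray:
    """Render one short tone per data point, pitch proportional to value."""
    y = (y - y.min()) / (y.max() - y.min() + 1e-9)   # normalise to [0, 1]
    freqs = f_lo + y * (f_hi - f_lo)                 # value -> frequency
    t = np.arange(int(sr * note_s)) / sr
    env = np.hanning(len(t))                         # click-free fades
    return np.concatenate([env * np.sin(2 * np.pi * f * t) for f in freqs])

# Hypothetical usage: a parabola is heard as a falling-then-rising sweep.
x = np.linspace(-2, 2, 24)
audio = sonify(x ** 2)   # write to a WAV file or stream to the browser
```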

    Experiential Role of Artefacts in Cooperative Design

    The role of material artefacts in supporting distributed and co-located work practices has been well acknowledged within HCI and CSCW research. In this paper, we show that in addition to their ecological, coordinative and organizational support, artefacts also play an ‘experiential’ role. In this case, artefacts do not merely improve efficiency or serve a purely functional role (e.g. allowing people to complete tasks quickly); their presence and manifestations bring quality and richness to people’s performance and help them make better sense of their everyday lives. In a domain like industrial design, such artefacts play an important role in supporting creativity and innovation. Based on our prolonged ethnographic fieldwork on the cooperative design practices of industrial design students and researchers, we describe several experiential practices that are supported by mundane artefacts like sketches, drawings, physical models and explorative prototypes – used and developed in designers’ everyday work. Our main intention in carrying out this research is to develop technologies that support designers’ everyday practices. We believe that with the emergence of ubiquitous computing, there is a growing need to focus on the personal, emotional and social side of people’s everyday experiences. By focusing on the experiential practices of designers, we can provide a holistic view in the design of new interactive technologies.

    Reactive Statistical Mapping: Towards the Sketching of Performative Control with Data

    This paper presents the results of our participation in the ninth eNTERFACE workshop on multimodal user interfaces. Our target for this workshop was to bring technologies currently used in speech recognition and synthesis to a new level, i.e. to make them the core of a new HMM-based mapping system. We investigated the idea of statistical mapping, more precisely how to use Gaussian Mixture Models and Hidden Markov Models for realtime, reactive generation of new trajectories from input labels and for realtime regression in a continuous-to-continuous use case. As a result, we developed several proofs of concept, including an incremental speech synthesiser, software for exploring stylistic spaces for gait and facial motion in realtime, a reactive audiovisual laughter system, and a prototype demonstrating the realtime reconstruction of lower-body gait motion strictly from upper-body motion, with conservation of the stylistic properties. This project was an opportunity to formalise HMM-based mapping, integrate several of these innovations into the Mage library, and explore the development of a realtime gesture recognition tool.
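    The continuous-to-continuous case maps cleanly onto Gaussian Mixture Regression. The sketch below, a minimal illustration rather than the workshop code, fits a GMM on joint (input, output) frames and evaluates the conditional expectation E[y|x] = Σ_i h_i(x) (μ_i^y + Σ_i^yx (Σ_i^xx)^{-1} (x − μ_i^x)), where h_i(x) is the posterior responsibility of component i. Library choices and the toy data are assumptions.

```python
import numpy as np
from scipy.stats import multivariate_normal
from sklearn.mixture import GaussianMixture

def gmr(gmm: GaussianMixture, x: np.ndarray, dx: int) -> np.ndarray:
    """Conditional mean of the output dims given the first `dx` input dims."""
    # Responsibilities h_i(x) from the input marginals of each component.
    resp = np.array([
        w * multivariate_normal.pdf(x, m[:dx], c[:dx, :dx])
        for w, m, c in zip(gmm.weights_, gmm.means_, gmm.covariances_)])
    resp /= resp.sum()
    # Mix the per-component conditional means.
    y = np.zeros(gmm.means_.shape[1] - dx)
    for h, m, c in zip(resp, gmm.means_, gmm.covariances_):
        y += h * (m[dx:] + c[dx:, :dx] @ np.linalg.solve(c[:dx, :dx], x - m[:dx]))
    return y

# Hypothetical usage: learn y = sin(x) from noisy joint samples, then query.
rng = np.random.default_rng(3)
x = rng.uniform(0, 2 * np.pi, 2000)
data = np.column_stack([x, np.sin(x) + 0.05 * rng.normal(size=x.size)])
gmm = GaussianMixture(n_components=8, covariance_type="full").fit(data)
print(gmr(gmm, np.array([np.pi / 2]), dx=1))  # close to 1.0
```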