Search CORE

5,004 research outputs found

Design and application of reconfigurable circuits and systems

Author: Cheung Peter
Publication venue: Electrical & Electronic Engineering, Imperial College London
Publication date: 01/12/2015
Field of study

Open Acces

Spiral - Imperial College Digital Repository

Visual Speech Recognition Using a 3D Convolutional Neural Network

Author: Rochford Matthew
Publication venue: DigitalCommons@CalPoly
Publication date: 01/12/2019
Field of study

Main stream automatic speech recognition (ASR) makes use of audio data to identify spoken words, however visual speech recognition (VSR) has recently been of increased interest to researchers. VSR is used when audio data is corrupted or missing entirely and also to further enhance the accuracy of audio-based ASR systems. In this research, we present both a framework for building 3D feature cubes of lip data from videos and a 3D convolutional neural network (CNN) architecture for performing classification on a dataset of 100 spoken words, recorded in an uncontrolled envi- ronment. Our 3D-CNN architecture achieves a testing accuracy of 64%, comparable with recent works, but using an input data size that is up to 75% smaller. Overall, our research shows that 3D-CNNs can be successful in finding spatial-temporal features using unsupervised feature extraction and are a suitable choice for VSR-based systems

DigitalCommons@CalPoly

Recommended from our members

Novel entropy coding and its application of the compression of 3D image and video signals

Author: Amal Mehanna
Publication venue: Brunel University London
Publication date: 01/01/2013
Field of study

This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University LondonThe broadcast industry is moving future Digital Television towards Super high resolution TV (4k or 8k) and/or 3D TV. This ultimately will increase the demand on data rate and subsequently the demand for highly efficient codecs. One of the techniques that researchers found it one of the promising technologies in the industry in the next few years is 3D Integral Image and Video due to its simplicity and mimics the reality, independently on viewer aid, one of the challenges of the 3D Integral technology is to improve the compression algorithms to adequate the high resolution and exploit the advantages of the characteristics of this technology. The research scope of this thesis includes designing a novel coding for the 3D Integral image and video compression. Firstly to address the compression of 3D Integral imaging the research proposes novel entropy coding which will be implemented first on 2D traditional images content in order to compare it with the other traditional common standards then will be applied on 3D Integra image and video. This approach seeks to achieve high performance represented by high image quality and low bit rate in association with low computational complexity. Secondly, new algorithm will be proposed in an attempt to improve and develop the transform techniques performance, initially by using a new adaptive 3D-DCT algorithm then by proposing a new hybrid 3D DWT-DCT algorithm via exploiting the advantages of each technique and get rid of the artifact that each technique of them suffers from. Finally, the proposed entropy coding will be further implemented to the 3D integral video in association with another proposed algorithm that based on calculating the motion vector on the average viewpoint for each frame. This approach seeks to minimize the complexity and reduce the speed without affecting the Human Visual System (HVS) performance. Number of block matching techniques will be used to investigate the best block matching technique that is adequate for the new proposed 3D integral video algorithm

Brunel University Research Archive

Faking Sensor Noise Information

Author: Chang Justin
Publication venue: SJSU ScholarWorks
Publication date: 01/01/2022
Field of study

Noise residue detection in digital images has recently been used as a method to classify images based on source camera model type. The meteoric rise in the popularity of using Neural Network models has also been used in conjunction with the concept of noise residuals to classify source camera models. However, many papers gloss over the details on the methods of obtaining noise residuals and instead rely on the self- learning aspect of deep neural networks to implicitly discover this themselves. For this project I propose a method of obtaining noise residuals (“noiseprints”) and denoising an image, as well as a Generative model that can learn how to reproduce noise resembling a target digital camera model’s noise noiseprint. Applying a noiseprint generated by this model onto a denoised image will be able to fool a discriminating model into classifying the wrong digital camera model. To the best of my knowledge, this is the first work that will explicitly detail denoising methods and noiseprint generation in a 128 by 128 resolution for specific camera models and individual cameras for the goal of fooling a classification model

SJSU ScholarWorks

Application of machine learning to microseismic event detection in distributed acoustic sensing data

Author: Baird Alan F.
Clarke Andy
Horne Steve A.
Kendall J.-Michael
Lapins Sacha
Naldrett Garth
Stork Anna L.
Verdon James P.
Williams Anna
Wookey James
Publication venue: 'Society of Exploration Geophysicists'
Publication date: 17/08/2020
Field of study

Explore Bristol Research

The simultaneity of complementary conditions:re-integrating and balancing analogue and digital matter(s) in basic architectural education

Author: Wiesner Thomas
Publication venue
Publication date: 01/10/2008
Field of study

The actual, globally established, general digital procedures in basic architectural education,producing well-behaved, seemingly attractive up-to-date projects, spaces and first general-researchon all scale levels, apparently present a certain growing amount of deficiencies. These limitations surface only gradually, as the state of things on overall extents is generally deemed satisfactory. Some skills, such as “old-fashioned” analogue drawing are gradually eased-out ofundergraduate curricula and overall modus-operandi, due to their apparent slow inefficiencies in regard to various digital media’s rapid readiness, malleability and unproblematic, quotidian availabilities. While this state of things is understandable, it nevertheless presents a definite challenge. The challenge of questioning how the assessment of conditions and especially their representation,is conducted, prior to contextual architectural action(s) of any kind

Crossref

Directory of Open Access Journals

Research in Architecture, Design and Conservation

ENQUIRY: The ARCC Journal

Efficient Synthesis of Room Acoustics via Scattering Delay Networks

Author: Cvetkovic Zoran
De Sena Enzo
Hacihabiboglu Huseyin
Smith III Julius O.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 09/07/2015
Field of study

An acoustic reverberator consisting of a network of delay lines connected via scattering junctions is proposed. All parameters of the reverberator are derived from physical properties of the enclosure it simulates. It allows for simulation of unequal and frequency-dependent wall absorption, as well as directional sources and microphones. The reverberator renders the first-order reflections exactly, while making progressively coarser approximations of higher-order reflections. The rate of energy decay is close to that obtained with the image method (IM) and consistent with the predictions of Sabine and Eyring equations. The time evolution of the normalized echo density, which was previously shown to be correlated with the perceived texture of reverberation, is also close to that of IM. However, its computational complexity is one to two orders of magnitude lower, comparable to the computational complexity of a feedback delay network (FDN), and its memory requirements are negligible

arXiv.org e-Print Archive

Crossref

King's Research Portal

OpenMETU (Middle East Technical University)