36 research outputs found
Inferring Room Semantics Using Acoustic Monitoring
Having knowledge of the environmental context of the user i.e. the knowledge
of the users' indoor location and the semantics of their environment, can
facilitate the development of many of location-aware applications. In this
paper, we propose an acoustic monitoring technique that infers semantic
knowledge about an indoor space \emph{over time,} using audio recordings from
it. Our technique uses the impulse response of these spaces as well as the
ambient sounds produced in them in order to determine a semantic label for
them. As we process more recordings, we update our \emph{confidence} in the
assigned label. We evaluate our technique on a dataset of single-speaker human
speech recordings obtained in different types of rooms at three university
buildings. In our evaluation, the confidence\emph{ }for the true label
generally outstripped the confidence for all other labels and in some cases
converged to 100\% with less than 30 samples.Comment: 2017 IEEE International Workshop on Machine Learning for Signal
Processing, Sept.\ 25--28, 2017, Tokyo, Japa
Recommended from our members
Audio-Based Semantic Concept Classification for Consumer Video
This paper presents a novel method for automatically classifying consumer video clips based on their soundtracks. We use a set of 25 overlapping semantic classes, chosen for their usefulness to users, viability of automatic detection and of annotator labeling, and sufficiency of representation in available video collections. A set of 1873 videos from real users has been annotated with these concepts. Starting with a basic representation of each video clip as a sequence of mel-frequency cepstral coefficient (MFCC) frames, we experiment with three clip-level representations: single Gaussian modeling, Gaussian mixture modeling, and probabilistic latent semantic analysis of a Gaussian component histogram. Using such summary features, we produce support vector machine (SVM) classifiers based on the Kullback-Leibler, Bhattacharyya, or Mahalanobis distance measures. Quantitative evaluation shows that our approaches are effective for detecting interesting concepts in a large collection of real-world consumer video clips
Studies on binaural and monaural signal analysis methods and applications
Sound signals can contain a lot of information about the environment and the sound sources present in it. This thesis presents novel contributions to the analysis of binaural and monaural sound signals. Some new applications are introduced in this work, but the emphasis is on analysis methods. The three main topics of the thesis are computational estimation of sound source distance, analysis of binaural room impulse responses, and applications intended for augmented reality audio.
A novel method for binaural sound source distance estimation is proposed. The method is based on learning the coherence between the sounds entering the left and right ears. Comparisons to an earlier approach are also made. It is shown that these kinds of learning methods can correctly recognize the distance of a speech sound source in most cases.
Methods for analyzing binaural room impulse responses are investigated. These methods are able to locate the early reflections in time and also to estimate their directions of arrival. This challenging problem could not be tackled completely, but this part of the work is an important step towards accurate estimation of the individual early reflections from a binaural room impulse response.
As the third part of the thesis, applications of sound signal analysis are studied. The most notable contributions are a novel eyes-free user interface controlled by finger snaps, and an investigation on the importance of features in audio surveillance.
The results of this thesis are steps towards building machines that can obtain information on the surrounding environment based on sound. In particular, the research into sound source distance estimation functions as important basic research in this area. The applications presented could be valuable in future telecommunications scenarios, such as augmented reality audio
λ¨μΌ μν₯ μΌμλ₯Ό μ¬μ©νλ λ°μ΄ν° κΈ°λ° λ€μΈ΅ μ² κ·Ό μ½ν¬λ¦¬νΈ 건물 λ΄ μμμ μ’ λ₯μ μμΉ μΆμ
νμλ
Όλ¬Έ (λ°μ¬) -- μμΈλνκ΅ λνμ : 곡과λν μ‘°μ ν΄μ곡νκ³Ό, 2021. 2. μ±μ°μ .The construction of multi-story residential buildings triggers indoor noise. Indoor noise in residential areas has been investigated to ascertain the effect of noise on occupants and to improve their quality of life. In buildings, indoor acoustic noise transmitted from various sources travels through these structures and exerts an unpleasant effect on occupants. Inter-floor noise is identified as a severe type of indoor noise in residential areas. The identification of noise is considered a fundamental step that is essential for studying the challenges of noise pollution. By harnessing a sound level meter, long-term measurement, and site surveying, previous studies have been conducted on the identification of noise in residential areas to estimate the level, type, and position of generated noise. However, it is challenging to identify the source type and position of noise travelling through multi-story residential buildings owing to the difficulty of the human ear in intercepting these sounds. Recent studies on the identification of indoor noise are limited to noise sources and receivers on a single level of the floor, and they require multiple sensor channels to determine the time difference of arrival. Residential buildings, which are usually reinforced concrete structures, are considered to be concrete, steel, and fluid-mixed media with high structural complexity and occupants that have insufficient knowledge of the details of their properties. In this study, we propose a data-driven identification of noise in reinforced concrete buildings via the learning-based localization method using a single sensor. Actual experiments were conducted in a campus building, as well as two apartment buildings. Performance was analyzed according to several source types and positions that apply the deep convolutional neural network (CNN)-based supervised learning. The validations against the datasets obtained in three buildings verified the generalizability of the proposed method. In addition, noise identification data transferred within different floor sections in a single building and between similar buildings were presented in this study. Although indoor noise identification is emphasized in this work, the proposed method can be beneficial for other noise identification methods that employ a single sensor.곡λμ£Όνμ μ¦κ°λ‘ 건물 λ΄ μ΄μ κ°μ μμ λ¬Έμ κ° μ¬νμ μΌλ‘ λλλκ³ μλ€. κ±°μ£Όμμκ² λ
ΈμΆλ μμμ κ±°μ£Όμμ κ±΄κ° λ¬Έμ μ μ§κ²°λ μλ μμΌλ―λ‘ κ±΄λ¬Ό λ΄ μμμ κ΄ν μ¬λ¬ μ°κ΅¬κ° μ§νλμ΄ μλ€. λ€μΈ΅ 건물 λ΄μμ λ°μν μμμ 건물μ ꡬ쑰λ₯Ό λ°λΌ λ€λ₯Έ μΈ΅μΌλ‘ μ λ¬λλ©° μ΄λ¬ν μΈ΅κ°μμμ μ£Όλ³ μ΄μμκ² κ³ ν΅μΌλ‘ λ€κ°μ¬ μ μλ€. μμμμ κ·λͺ
μ μμμ λ€λ£° λ μ νλμ΄μΌ νλ λ° κ±΄λ¬Ό λ΄ μμμ μ€μ, μ’
λ₯, μμΉ νμ
μ κ΄λ ¨λ μ°κ΅¬λ€μ΄ μ§νλμ΄ μλ€. μμμ μ€μλ μμμΈ‘μ κΈ°λ₯Ό μ¬μ©νμ¬ μΈ‘μ μ΄ κ°λ₯νλ 건물μ ꡬ쑰λ₯Ό λ°λΌ μ λ¬λ μμμ μ’
λ₯μ μμΉλ₯Ό νλ³νλ κ²μ μΆμ μ΄ νμν λ¬Έμ μ΄λ©° μ¬λμ μ²λ ₯μ μμ‘΄νμ¬μ νκΈ°λ μ΄λ ΅λ€. μ΅κ·Ό μ°κ΅¬λ κ΄λ ¨ μ°κ΅¬λ₯Ό μ΄ν΄λ³΄λ©΄ 건물 λ΄ μμμ μ’
λ₯λ₯Ό λΆλ₯νλ μ°κ΅¬λ κ±°μ λ€λ€μ§μ§ μμκ³ , μμμ μμΉ μΆμ μ°κ΅¬μ κ²½μ° λμΌ μΈ΅μ μμμκ³Ό μ¬λ¬ μ±λμ μμ κΈ°κ° μμΉν κ²½μ°λ₯Ό λ€μ€μΈ‘λ (multilateration) μ ν΅νμ¬ μ νμ μΌλ‘ λ€λ€λ€. μΌλ°μ μΌλ‘ νλ κ±°μ£Όμ© κ±΄μΆλ¬Όμ λλΆλΆμ μ² κ·Ό μ½ν¬λ¦¬νΈ ꡬ쑰μ΄λ©° μΈ΅κ°μ μμ μ λ¬ νκ²½μ μ½ν¬λ¦¬νΈ, μ² κ·Ό, μ μ²΄κ° νΌμ¬νλ 볡μ‘ν νκ²½μ΄λ€. μΌλ°μΈ κ±°μ£Όμκ° μ΄λ¬ν νκ²½μμμ μμ μ λ¬ νκ²½μ νμ
νκ³ μμμ μ λ¬ λͺ¨λΈμ μΈμ μμμ κ·λͺ
νλ κ²μ μ΄λ ΅λ€. λ³Έ λ
Όλ¬Έμ λͺ¨λ°μΌ μ₯μΉ (mobile device) μ λ¨μΌ μν₯ μΌμλ‘ μΈ‘μ ν μμκ³Ό ν©μ±κ³± μ κ²½λ§μ νμ©νμ¬ λ°μ΄ν° κΈ°λ° (data-driven) μ 건물 λ΄ μμ κ·λͺ
λ°©λ²μ μ μνκ³ ν κ°μ μΊ νΌμ€ 건물과 λ μννΈ κ±΄λ¬Όμμ μ§νν μ€νμ ν΅νμ¬ μ΄ κΈ°λ²μ μ μ©μ±κ³Ό 보νΈμ±μ 보μλ€. λν ν μΈ΅κ°μμ νμ΅ν μμ κ·λͺ
μ§μμ λμΌ κ±΄λ¬Όμ λ€λ₯Έ μΈ΅κ°μμμ μμ κ·λͺ
μ, ν 건물μμ νμ΅ν μμ κ·λͺ
μ§μμ λ€λ₯Έ 건물 λ΄ μμ κ·λͺ
μ νμ© ν μ μμμ 보μλ€. μ μνλ κΈ°λ²μ μμ μ λ¬ νκ²½ νμ
λ° λͺ¨λΈμ μ»κΈ° μ΄λ €μ΄ λΆμΌμμμ μ μ©μλ μ μ©ν κ²μΌλ‘ κΈ°λνλ€.Abstract I
Contents iii
List of Figures vi
List of Tables ix
1 Introduction 2
1.1 Backgrounds 2
1.2 Approach 5
1.2.1 Data-driven noise identification 5
1.2.2 Source type classification and localization 7
1.2.3 Knowledge transfer 10
1.3 Contributions 15
1.4 Outline of the Dissertation 16
2 Source type classification and localization of acoustic noises in a reinforced concrete structure 28
2.1 Introduction 29
2.1.1 Motivation 29
2.1.2 Related literature 29
2.1.3 Approach 30
2.1.4 Contributions of this chapter 31
2.2 Campus building inter-floor noise dataset 32
2.2.1 Selecting source type and source position 32
2.2.2 Generating and collecting inter-floor noise 33
2.3 Supervised learning of inter-floor noises 36
2.3.1 Convolutional neural networks for acoustic scene classification 36
2.3.2 Network architecture 36
2.3.3 Evaluation 40
2.3.4 Source type classification results 41
2.3.5 Localizationresults....................... 41
2.4 Source type classification and localization of inter-floor noises generated on unlearned positions 47
2.4.1 Source type classification of inter-floor noises from unlearned positions 48
2.4.2 Localization of inter-floor noises from unlearned positions 50
2.5 Summary 52
2.6 Acknowledgments 53
3 Knowledge transfer between reinforced concrete structures 61
3.1 Introduction 62
3.1.1 Motivation 62
3.1.2 Related Literature 62
3.1.3 Approach 63
3.1.4 Contributions of this chapter 63
3.2 Apartment building inter-floor noise dataset 64
3.3 Inter-floor noise classification 70
3.3.1 Onset detection 70
3.3.2 Convolutional neural network-based classifier 71
3.3.3 Network training 75
3.3.4 Source type classification and localization tasks 75
3.4 Performance Evaluation 79
3.4.1 Source type classification results in a single apartment building 79
3.4.2 Localization results in a single apartment building 80
3.4.3 Results of knowledge transfer between the apartment buildings 81
3.5 Summary 87
3.6 Acknowledgments 94
4 Conclusions 96
4.1 Findings and limitations 97
4.2 Applications 97
4.2.1 Marine structures 98
4.2.2 Mobile application 98
4.3 Future study 100
4.3.1 Learning with building structure representation 100
4.3.2 Learning with data measured at multiple receiver locations 100
4.3.3 Task oriented algorithm 101
A Precision, recall, and F1 score of the classification results 102
B Data analysis 105
C Using a one-dimensional convolutional neural network and feature visualization 112
Abstract (In Korean) 124Docto
AI augmented Edge and Fog computing: trends and challenges
In recent years, the landscape of computing paradigms has witnessed a gradual yet remarkable shift from monolithic computing to distributed and decentralized paradigms such as Internet of Things (IoT), Edge, Fog, Cloud, and Serverless. The frontiers of these computing technologies have been boosted by shift from manually encoded algorithms to Artificial Intelligence (AI)-driven autonomous systems for optimum and reliable management of distributed computing resources. Prior work focuses on improving existing systems using AI across a wide range of domains, such as efficient resource provisioning, application deployment, task placement, and service management. This survey reviews the evolution of data-driven AI-augmented technologies and their impact on computing systems. We demystify new techniques and draw key insights in Edge, Fog and Cloud resource management-related uses of AI methods and also look at how AI can innovate traditional applications for enhanced Quality of Service (QoS) in the presence of a continuum of resources. We present the latest trends and impact areas such as optimizing AI models that are deployed on or for computing systems. We layout a roadmap for future research directions in areas such as resource management for QoS optimization and service reliability. Finally, we discuss blue-sky ideas and envision this work as an anchor point for future research on AI-driven computing systems