36 research outputs found

    Inferring Room Semantics Using Acoustic Monitoring

    Full text link
    Having knowledge of the environmental context of the user i.e. the knowledge of the users' indoor location and the semantics of their environment, can facilitate the development of many of location-aware applications. In this paper, we propose an acoustic monitoring technique that infers semantic knowledge about an indoor space \emph{over time,} using audio recordings from it. Our technique uses the impulse response of these spaces as well as the ambient sounds produced in them in order to determine a semantic label for them. As we process more recordings, we update our \emph{confidence} in the assigned label. We evaluate our technique on a dataset of single-speaker human speech recordings obtained in different types of rooms at three university buildings. In our evaluation, the confidence\emph{ }for the true label generally outstripped the confidence for all other labels and in some cases converged to 100\% with less than 30 samples.Comment: 2017 IEEE International Workshop on Machine Learning for Signal Processing, Sept.\ 25--28, 2017, Tokyo, Japa

    Audio-Based Semantic Concept Classification for Consumer Video

    Full text link

    Studies on binaural and monaural signal analysis methods and applications

    Get PDF
    Sound signals can contain a lot of information about the environment and the sound sources present in it. This thesis presents novel contributions to the analysis of binaural and monaural sound signals. Some new applications are introduced in this work, but the emphasis is on analysis methods. The three main topics of the thesis are computational estimation of sound source distance, analysis of binaural room impulse responses, and applications intended for augmented reality audio. A novel method for binaural sound source distance estimation is proposed. The method is based on learning the coherence between the sounds entering the left and right ears. Comparisons to an earlier approach are also made. It is shown that these kinds of learning methods can correctly recognize the distance of a speech sound source in most cases. Methods for analyzing binaural room impulse responses are investigated. These methods are able to locate the early reflections in time and also to estimate their directions of arrival. This challenging problem could not be tackled completely, but this part of the work is an important step towards accurate estimation of the individual early reflections from a binaural room impulse response. As the third part of the thesis, applications of sound signal analysis are studied. The most notable contributions are a novel eyes-free user interface controlled by finger snaps, and an investigation on the importance of features in audio surveillance. The results of this thesis are steps towards building machines that can obtain information on the surrounding environment based on sound. In particular, the research into sound source distance estimation functions as important basic research in this area. The applications presented could be valuable in future telecommunications scenarios, such as augmented reality audio

    단일 음ν–₯ μ„Όμ„œλ₯Ό μ‚¬μš©ν•˜λŠ” 데이터 기반 λ‹€μΈ΅ μ² κ·Ό 콘크리트 건물 λ‚΄ μ†ŒμŒμ˜ μ’…λ₯˜μ™€ μœ„μΉ˜ μΆ”μ •

    Get PDF
    ν•™μœ„λ…Όλ¬Έ (박사) -- μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› : κ³΅κ³ΌλŒ€ν•™ 쑰선해양곡학과, 2021. 2. μ„±μš°μ œ.The construction of multi-story residential buildings triggers indoor noise. Indoor noise in residential areas has been investigated to ascertain the effect of noise on occupants and to improve their quality of life. In buildings, indoor acoustic noise transmitted from various sources travels through these structures and exerts an unpleasant effect on occupants. Inter-floor noise is identified as a severe type of indoor noise in residential areas. The identification of noise is considered a fundamental step that is essential for studying the challenges of noise pollution. By harnessing a sound level meter, long-term measurement, and site surveying, previous studies have been conducted on the identification of noise in residential areas to estimate the level, type, and position of generated noise. However, it is challenging to identify the source type and position of noise travelling through multi-story residential buildings owing to the difficulty of the human ear in intercepting these sounds. Recent studies on the identification of indoor noise are limited to noise sources and receivers on a single level of the floor, and they require multiple sensor channels to determine the time difference of arrival. Residential buildings, which are usually reinforced concrete structures, are considered to be concrete, steel, and fluid-mixed media with high structural complexity and occupants that have insufficient knowledge of the details of their properties. In this study, we propose a data-driven identification of noise in reinforced concrete buildings via the learning-based localization method using a single sensor. Actual experiments were conducted in a campus building, as well as two apartment buildings. Performance was analyzed according to several source types and positions that apply the deep convolutional neural network (CNN)-based supervised learning. The validations against the datasets obtained in three buildings verified the generalizability of the proposed method. In addition, noise identification data transferred within different floor sections in a single building and between similar buildings were presented in this study. Although indoor noise identification is emphasized in this work, the proposed method can be beneficial for other noise identification methods that employ a single sensor.κ³΅λ™μ£Όνƒμ˜ μ¦κ°€λ‘œ 건물 λ‚΄ 이웃 κ°„μ˜ μ†ŒμŒ λ¬Έμ œκ°€ μ‚¬νšŒμ μœΌλ‘œ λŒ€λ‘λ˜κ³  μžˆλ‹€. κ±°μ£Όμžμ—κ²Œ λ…ΈμΆœλœ μ†ŒμŒμ€ 거주자의 건강 λ¬Έμ œμ— 직결될 μˆ˜λ„ μžˆμœΌλ―€λ‘œ 건물 λ‚΄ μ†ŒμŒμ— κ΄€ν•œ μ—¬λŸ¬ 연ꡬ가 μ§„ν–‰λ˜μ–΄ μ™”λ‹€. λ‹€μΈ΅ 건물 λ‚΄μ—μ„œ λ°œμƒν•œ μ†ŒμŒμ€ 건물의 ꡬ쑰λ₯Ό 따라 λ‹€λ₯Έ 측으둜 μ „λ‹¬λ˜λ©° μ΄λŸ¬ν•œ μΈ΅κ°„μ†ŒμŒμ€ μ£Όλ³€ μ΄μ›ƒμ—κ²Œ κ³ ν†΅μœΌλ‘œ λ‹€κ°€μ˜¬ 수 μžˆλ‹€. μ†ŒμŒμ›μ˜ 규λͺ…은 μ†ŒμŒμ„ λ‹€λ£° λ•Œ μ„ ν–‰λ˜μ–΄μ•Ό ν•˜λŠ” λ°” 건물 λ‚΄ μ†ŒμŒμ˜ μ€€μœ„, μ’…λ₯˜, μœ„μΉ˜ νŒŒμ•…μ— κ΄€λ ¨λœ 연ꡬ듀이 μ§„ν–‰λ˜μ–΄ μ™”λ‹€. μ†ŒμŒμ˜ μ€€μœ„λŠ” μ†ŒμŒμΈ‘μ •κΈ°λ₯Ό μ‚¬μš©ν•˜μ—¬ 츑정이 κ°€λŠ₯ν•˜λ‚˜ 건물의 ꡬ쑰λ₯Ό 따라 μ „λ‹¬λœ μ†ŒμŒμ˜ μ’…λ₯˜μ™€ μœ„μΉ˜λ₯Ό νŒλ³„ν•˜λŠ” 것은 좔정이 ν•„μš”ν•œ 문제이며 μ‚¬λžŒμ˜ μ²­λ ₯에 μ˜μ‘΄ν•˜μ—¬μ„œ 풀기도 μ–΄λ ΅λ‹€. 졜근 μ—°κ΅¬λœ κ΄€λ ¨ 연ꡬλ₯Ό μ‚΄νŽ΄λ³΄λ©΄ 건물 λ‚΄ μ†ŒμŒμ˜ μ’…λ₯˜λ₯Ό λΆ„λ₯˜ν•˜λŠ” μ—°κ΅¬λŠ” 거의 닀뀄지지 μ•Šμ•˜κ³ , μ†ŒμŒμ› μœ„μΉ˜ μΆ”μ • μ—°κ΅¬μ˜ 경우 동일 측에 μ†ŒμŒμ›κ³Ό μ—¬λŸ¬ μ±„λ„μ˜ μˆ˜μ‹ κΈ°κ°€ μœ„μΉ˜ν•œ 경우λ₯Ό λ‹€μ€‘μΈ‘λŸ‰ (multilateration) 을 ν†΅ν•˜μ—¬ μ œν•œμ μœΌλ‘œ λ‹€λ€˜λ‹€. 일반적으둜 ν˜„λŒ€ 거주용 κ±΄μΆ•λ¬Όμ˜ λŒ€λΆ€λΆ„μ€ μ² κ·Ό 콘크리트 ꡬ쑰이며 μΈ΅κ°„μ˜ μ†ŒμŒ 전달 ν™˜κ²½μ€ 콘크리트, μ² κ·Ό, μœ μ²΄κ°€ ν˜Όμž¬ν•˜λŠ” λ³΅μž‘ν•œ ν™˜κ²½μ΄λ‹€. 일반인 κ±°μ£Όμžκ°€ μ΄λŸ¬ν•œ ν™˜κ²½μ—μ„œμ˜ μ†ŒμŒ 전달 ν™˜κ²½μ„ νŒŒμ•…ν•˜κ³  μ†ŒμŒμ˜ 전달 λͺ¨λΈμ„ μ„Έμ›Œ μ†ŒμŒμ„ 규λͺ…ν•˜λŠ” 것은 μ–΄λ ΅λ‹€. λ³Έ 논문은 λͺ¨λ°”일 μž₯치 (mobile device) 의 단일 음ν–₯ μ„Όμ„œλ‘œ μΈ‘μ •ν•œ μ†ŒμŒκ³Ό ν•©μ„±κ³± 신경망을 ν™œμš©ν•˜μ—¬ 데이터 기반 (data-driven) 의 건물 λ‚΄ μ†ŒμŒ 규λͺ… 방법을 μ œμ•ˆν•˜κ³  ν•œ 개의 캠퍼슀 건물과 두 μ•„νŒŒνŠΈ κ±΄λ¬Όμ—μ„œ μ§„ν–‰ν•œ μ‹€ν—˜μ„ ν†΅ν•˜μ—¬ 이 κΈ°λ²•μ˜ μœ μš©μ„±κ³Ό λ³΄νŽΈμ„±μ„ λ³΄μ˜€λ‹€. λ˜ν•œ ν•œ μΈ΅κ°„μ—μ„œ ν•™μŠ΅ν•œ μ†ŒμŒ 규λͺ… 지식을 동일 건물의 λ‹€λ₯Έ μΈ΅κ°„μ—μ„œμ˜ μ†ŒμŒ 규λͺ…에, ν•œ κ±΄λ¬Όμ—μ„œ ν•™μŠ΅ν•œ μ†ŒμŒ 규λͺ… 지식을 λ‹€λ₯Έ 건물 λ‚΄ μ†ŒμŒ 규λͺ…에 ν™œμš© ν•  수 μžˆμŒμ„ λ³΄μ˜€λ‹€. μ œμ•ˆν•˜λŠ” 기법은 μ†ŒμŒ 전달 ν™˜κ²½ νŒŒμ•… 및 λͺ¨λΈμ„ μ–»κΈ° μ–΄λ €μš΄ λΆ„μ•Όμ—μ„œμ˜ μ μš©μ—λ„ μœ μš©ν•  κ²ƒμœΌλ‘œ κΈ°λŒ€ν•œλ‹€.Abstract I Contents iii List of Figures vi List of Tables ix 1 Introduction 2 1.1 Backgrounds 2 1.2 Approach 5 1.2.1 Data-driven noise identification 5 1.2.2 Source type classification and localization 7 1.2.3 Knowledge transfer 10 1.3 Contributions 15 1.4 Outline of the Dissertation 16 2 Source type classification and localization of acoustic noises in a reinforced concrete structure 28 2.1 Introduction 29 2.1.1 Motivation 29 2.1.2 Related literature 29 2.1.3 Approach 30 2.1.4 Contributions of this chapter 31 2.2 Campus building inter-floor noise dataset 32 2.2.1 Selecting source type and source position 32 2.2.2 Generating and collecting inter-floor noise 33 2.3 Supervised learning of inter-floor noises 36 2.3.1 Convolutional neural networks for acoustic scene classification 36 2.3.2 Network architecture 36 2.3.3 Evaluation 40 2.3.4 Source type classification results 41 2.3.5 Localizationresults....................... 41 2.4 Source type classification and localization of inter-floor noises generated on unlearned positions 47 2.4.1 Source type classification of inter-floor noises from unlearned positions 48 2.4.2 Localization of inter-floor noises from unlearned positions 50 2.5 Summary 52 2.6 Acknowledgments 53 3 Knowledge transfer between reinforced concrete structures 61 3.1 Introduction 62 3.1.1 Motivation 62 3.1.2 Related Literature 62 3.1.3 Approach 63 3.1.4 Contributions of this chapter 63 3.2 Apartment building inter-floor noise dataset 64 3.3 Inter-floor noise classification 70 3.3.1 Onset detection 70 3.3.2 Convolutional neural network-based classifier 71 3.3.3 Network training 75 3.3.4 Source type classification and localization tasks 75 3.4 Performance Evaluation 79 3.4.1 Source type classification results in a single apartment building 79 3.4.2 Localization results in a single apartment building 80 3.4.3 Results of knowledge transfer between the apartment buildings 81 3.5 Summary 87 3.6 Acknowledgments 94 4 Conclusions 96 4.1 Findings and limitations 97 4.2 Applications 97 4.2.1 Marine structures 98 4.2.2 Mobile application 98 4.3 Future study 100 4.3.1 Learning with building structure representation 100 4.3.2 Learning with data measured at multiple receiver locations 100 4.3.3 Task oriented algorithm 101 A Precision, recall, and F1 score of the classification results 102 B Data analysis 105 C Using a one-dimensional convolutional neural network and feature visualization 112 Abstract (In Korean) 124Docto

    Learning Sensory Representations with Minimal Supervision

    Get PDF

    AI augmented Edge and Fog computing: trends and challenges

    Get PDF
    In recent years, the landscape of computing paradigms has witnessed a gradual yet remarkable shift from monolithic computing to distributed and decentralized paradigms such as Internet of Things (IoT), Edge, Fog, Cloud, and Serverless. The frontiers of these computing technologies have been boosted by shift from manually encoded algorithms to Artificial Intelligence (AI)-driven autonomous systems for optimum and reliable management of distributed computing resources. Prior work focuses on improving existing systems using AI across a wide range of domains, such as efficient resource provisioning, application deployment, task placement, and service management. This survey reviews the evolution of data-driven AI-augmented technologies and their impact on computing systems. We demystify new techniques and draw key insights in Edge, Fog and Cloud resource management-related uses of AI methods and also look at how AI can innovate traditional applications for enhanced Quality of Service (QoS) in the presence of a continuum of resources. We present the latest trends and impact areas such as optimizing AI models that are deployed on or for computing systems. We layout a roadmap for future research directions in areas such as resource management for QoS optimization and service reliability. Finally, we discuss blue-sky ideas and envision this work as an anchor point for future research on AI-driven computing systems
    corecore