5,044 research outputs found

    Recent Trends in Computational Intelligence

    Get PDF
    Traditional models struggle to cope with complexity, noise, and the existence of a changing environment, while Computational Intelligence (CI) offers solutions to complicated problems as well as reverse problems. The main feature of CI is adaptability, spanning the fields of machine learning and computational neuroscience. CI also comprises biologically-inspired technologies such as the intellect of swarm as part of evolutionary computation and encompassing wider areas such as image processing, data collection, and natural language processing. This book aims to discuss the usage of CI for optimal solving of various applications proving its wide reach and relevance. Bounding of optimization methods and data mining strategies make a strong and reliable prediction tool for handling real-life applications

    Soft Biometric Analysis: MultiPerson and RealTime Pedestrian Attribute Recognition in Crowded Urban Environments

    Get PDF
    Traditionally, recognition systems were only based on human hard biometrics. However, the ubiquitous CCTV cameras have raised the desire to analyze human biometrics from far distances, without people attendance in the acquisition process. Highresolution face closeshots are rarely available at far distances such that facebased systems cannot provide reliable results in surveillance applications. Human soft biometrics such as body and clothing attributes are believed to be more effective in analyzing human data collected by security cameras. This thesis contributes to the human soft biometric analysis in uncontrolled environments and mainly focuses on two tasks: Pedestrian Attribute Recognition (PAR) and person reidentification (reid). We first review the literature of both tasks and highlight the history of advancements, recent developments, and the existing benchmarks. PAR and person reid difficulties are due to significant distances between intraclass samples, which originate from variations in several factors such as body pose, illumination, background, occlusion, and data resolution. Recent stateoftheart approaches present endtoend models that can extract discriminative and comprehensive feature representations from people. The correlation between different regions of the body and dealing with limited learning data is also the objective of many recent works. Moreover, class imbalance and correlation between human attributes are specific challenges associated with the PAR problem. We collect a large surveillance dataset to train a novel gender recognition model suitable for uncontrolled environments. We propose a deep residual network that extracts several posewise patches from samples and obtains a comprehensive feature representation. In the next step, we develop a model for multiple attribute recognition at once. Considering the correlation between human semantic attributes and class imbalance, we respectively use a multitask model and a weighted loss function. We also propose a multiplication layer on top of the backbone features extraction layers to exclude the background features from the final representation of samples and draw the attention of the model to the foreground area. We address the problem of person reid by implicitly defining the receptive fields of deep learning classification frameworks. The receptive fields of deep learning models determine the most significant regions of the input data for providing correct decisions. Therefore, we synthesize a set of learning data in which the destructive regions (e.g., background) in each pair of instances are interchanged. A segmentation module determines destructive and useful regions in each sample, and the label of synthesized instances are inherited from the sample that shared the useful regions in the synthesized image. The synthesized learning data are then used in the learning phase and help the model rapidly learn that the identity and background regions are not correlated. Meanwhile, the proposed solution could be seen as a data augmentation approach that fully preserves the label information and is compatible with other data augmentation techniques. When reid methods are learned in scenarios where the target person appears with identical garments in the gallery, the visual appearance of clothes is given the most importance in the final feature representation. Clothbased representations are not reliable in the longterm reid settings as people may change their clothes. Therefore, developing solutions that ignore clothing cues and focus on identityrelevant features are in demand. We transform the original data such that the identityrelevant information of people (e.g., face and body shape) are removed, while the identityunrelated cues (i.e., color and texture of clothes) remain unchanged. A learned model on the synthesized dataset predicts the identityunrelated cues (shortterm features). Therefore, we train a second model coupled with the first model and learns the embeddings of the original data such that the similarity between the embeddings of the original and synthesized data is minimized. This way, the second model predicts based on the identityrelated (longterm) representation of people. To evaluate the performance of the proposed models, we use PAR and person reid datasets, namely BIODI, PETA, RAP, Market1501, MSMTV2, PRCC, LTCC, and MIT and compared our experimental results with stateoftheart methods in the field. In conclusion, the data collected from surveillance cameras have low resolution, such that the extraction of hard biometric features is not possible, and facebased approaches produce poor results. In contrast, soft biometrics are robust to variations in data quality. So, we propose approaches both for PAR and person reid to learn discriminative features from each instance and evaluate our proposed solutions on several publicly available benchmarks.This thesis was prepared at the University of Beria Interior, IT Instituto de Telecomunicaรงรตes, Soft Computing and Image Analysis Laboratory (SOCIA Lab), Covilhรฃ Delegation, and was submitted to the University of Beira Interior for defense in a public examination session

    Visual Analysis of Extremely Dense Crowded Scenes

    Get PDF
    Visual analysis of dense crowds is particularly challenging due to large number of individuals, occlusions, clutter, and fewer pixels per person which rarely occur in ordinary surveillance scenarios. This dissertation aims to address these challenges in images and videos of extremely dense crowds containing hundreds to thousands of humans. The goal is to tackle the fundamental problems of counting, detecting and tracking people in such images and videos using visual and contextual cues that are automatically derived from the crowded scenes. For counting in an image of extremely dense crowd, we propose to leverage multiple sources of information to compute an estimate of the number of individuals present in the image. Our approach relies on sources such as low confidence head detections, repetition of texture elements (using SIFT), and frequency-domain analysis to estimate counts, along with confidence associated with observing individuals, in an image region. Furthermore, we employ a global consistency constraint on counts using Markov Random Field which caters for disparity in counts in local neighborhoods and across scales. We tested this approach on crowd images with the head counts ranging from 94 to 4543 and obtained encouraging results. Through this approach, we are able to count people in images of high-density crowds unlike previous methods which are only applicable to videos of low to medium density crowded scenes. However, the counting procedure just outputs a single number for a large patch or an entire image. With just the counts, it becomes difficult to measure the counting error for a query image with unknown number of people. For this, we propose to localize humans by finding repetitive patterns in the crowd image. Starting with detections from an underlying head detector, we correlate them within the image after their selection through several criteria: in a pre-defined grid, locally, or at multiple scales by automatically finding the patches that are most representative of recurring patterns in the crowd image. Finally, the set of generated hypotheses is selected using binary integer quadratic programming with Special Ordered Set (SOS) Type 1 constraints. Human Detection is another important problem in the analysis of crowded scenes where the goal is to place a bounding box on visible parts of individuals. Primarily applicable to images depicting medium to high density crowds containing several hundred humans, it is a crucial pre-requisite for many other visual tasks, such as tracking, action recognition or detection of anomalous behaviors, exhibited by individuals in a dense crowd. For detecting humans, we explore context in dense crowds in the form of locally-consistent scale prior which captures the similarity in scale in local neighborhoods with smooth variation over the image. Using the scale and confidence of detections obtained from an underlying human detector, we infer scale and confidence priors using Markov Random Field. In an iterative mechanism, the confidences of detections are modified to reflect consistency with the inferred priors, and the priors are updated based on the new detections. The final set of detections obtained are then reasoned for occlusion using Binary Integer Programming where overlaps and relations between parts of individuals are encoded as linear constraints. Both human detection and occlusion reasoning in this approach are solved with local neighbor-dependent constraints, thereby respecting the inter-dependence between individuals characteristic to dense crowd analysis. In addition, we propose a mechanism to detect different combinations of body parts without requiring annotations for individual combinations. Once human detection and localization is performed, we then use it for tracking people in dense crowds. Similar to the use of context as scale prior for human detection, we exploit it in the form of motion concurrence for tracking individuals in dense crowds. The proposed method for tracking provides an alternative and complementary approach to methods that require modeling of crowd flow. Simultaneously, it is less likely to fail in the case of dynamic crowd flows and anomalies by minimally relying on previous frames. The approach begins with the automatic identification of prominent individuals from the crowd that are easy to track. Then, we use Neighborhood Motion Concurrence to model the behavior of individuals in a dense crowd, this predicts the position of an individual based on the motion of its neighbors. When the individual moves with the crowd flow, we use Neighborhood Motion Concurrence to predict motion while leveraging five-frame instantaneous flow in case of dynamically changing flow and anomalies. All these aspects are then embedded in a framework which imposes hierarchy on the order in which positions of individuals are updated. The results are reported on eight sequences of medium to high density crowds and our approach performs on par with existing approaches without learning or modeling patterns of crowd flow. We experimentally demonstrate the efficacy and reliability of our algorithms by quantifying the performance of counting, localization, as well as human detection and tracking on new and challenging datasets containing hundreds to thousands of humans in a given scene

    ์ ๋ถ„ ๋ฐ ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ธฐ๋ฒ• ์œตํ•ฉ์„ ์ด์šฉํ•œ ์Šค๋งˆํŠธํฐ ๋‹ค์ค‘ ๋™์ž‘์—์„œ ๋ณดํ–‰ ํ•ญ๋ฒ•

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ๊ธฐ๊ณ„ํ•ญ๊ณต๊ณตํ•™๋ถ€, 2020. 8. ๋ฐ•์ฐฌ๊ตญ.In this dissertation, an IA-PA fusion-based PDR (Pedestrian Dead Reckoning) using low-cost inertial sensors is proposed to improve the indoor position estimation. Specifically, an IA (Integration Approach)-based PDR algorithm combined with measurements from PA (Parametric Approach) is constructed so that the algorithm is operated even in various poses that occur when a pedestrian moves with a smartphone indoors. In addition, I propose an algorithm that estimates the device attitude robustly in a disturbing situation by an ellipsoidal method. In addition, by using the machine learning-based pose recognition, it is possible to improve the position estimation performance by varying the measurement update according to the poses. First, I propose an adaptive attitude estimation based on ellipsoid technique to accurately estimate the direction of movement of a smartphone device. The AHRS (Attitude and Heading Reference System) uses an accelerometer and a magnetometer as measurements to calculate the attitude based on the gyro and to compensate for drift caused by gyro sensor errors. In general, the attitude estimation performance is poor in acceleration and geomagnetic disturbance situations, but in order to effectively improve the estimation performance, this dissertation proposes an ellipsoid-based adaptive attitude estimation technique. When a measurement disturbance comes in, it is possible to update the measurement more accurately than the adaptive estimation technique without considering the direction by adjusting the measurement covariance with the ellipsoid method considering the direction of the disturbance. In particular, when the disturbance only comes in one axis, the proposed algorithm can use the measurement partly by updating the other two axes considering the direction. The proposed algorithm shows its effectiveness in attitude estimation under disturbances through the rate table and motion capture equipment. Next, I propose a PDR algorithm that integrates IA and PA that can be operated in various poses. When moving indoors using a smartphone, there are many degrees of freedom, so various poses such as making a phone call, texting, and putting a pants pocket are possible. In the existing smartphone-based positioning algorithms, the position is estimated based on the PA, which can be used only when the pedestrian's walking direction and the device's direction coincide, and if it does not, the position error due to the mismatch in angle is large. In order to solve this problem, this dissertation proposes an algorithm that constructs state variables based on the IA and uses the position vector from the PA as a measurement. If the walking direction and the device heading do not match based on the pose recognized through machine learning technique, the position is updated in consideration of the direction calculated using PCA (Principal Component Analysis) and the step length obtained through the PA. It can be operated robustly even in various poses that occur. Through experiments considering various operating conditions and paths, it is confirmed that the proposed method stably estimates the position and improves performance even in various indoor environments.๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ €๊ฐ€ํ˜• ๊ด€์„ฑ์„ผ์„œ๋ฅผ ์ด์šฉํ•œ ๋ณดํ–‰ํ•ญ๋ฒ•์‹œ์Šคํ…œ (PDR: Pedestrian Dead Reckoning)์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์•ˆํ•œ๋‹ค. ๊ตฌ์ฒด์ ์œผ๋กœ ๋ณดํ–‰์ž๊ฐ€ ์‹ค๋‚ด์—์„œ ์Šค๋งˆํŠธํฐ์„ ๋“ค๊ณ  ์ด๋™ํ•  ๋•Œ ๋ฐœ์ƒํ•˜๋Š” ๋‹ค์–‘ํ•œ ๋™์ž‘ ์ƒํ™ฉ์—์„œ๋„ ์šด์šฉ๋  ์ˆ˜ ์žˆ๋„๋ก, ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ธฐ๋ฐ˜ ์ธก์ •์น˜๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ์ ๋ถ„ ๊ธฐ๋ฐ˜์˜ ๋ณดํ–‰์ž ํ•ญ๋ฒ• ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ตฌ์„ฑํ•œ๋‹ค. ๋˜ํ•œ ํƒ€์›์ฒด ๊ธฐ๋ฐ˜ ์ž์„ธ ์ถ”์ • ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ตฌ์„ฑํ•˜์—ฌ ์™ธ๋ž€ ์ƒํ™ฉ์—์„œ๋„ ๊ฐ•์ธํ•˜๊ฒŒ ์ž์„ธ๋ฅผ ์ถ”์ •ํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์•ˆํ•œ๋‹ค. ์ถ”๊ฐ€์ ์œผ๋กœ ๊ธฐ๊ณ„ํ•™์Šต ๊ธฐ๋ฐ˜์˜ ๋™์ž‘ ์ธ์‹ ์ •๋ณด๋ฅผ ์ด์šฉ, ๋™์ž‘์— ๋”ฐ๋ฅธ ์ธก์ •์น˜ ์—…๋ฐ์ดํŠธ๋ฅผ ๋‹ฌ๋ฆฌํ•จ์œผ๋กœ์จ ์œ„์น˜ ์ถ”์ • ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค. ๋จผ์ € ์Šค๋งˆํŠธํฐ ๊ธฐ๊ธฐ์˜ ์ด๋™ ๋ฐฉํ–ฅ์„ ์ •ํ™•ํ•˜๊ฒŒ ์ถ”์ •ํ•˜๊ธฐ ์œ„ํ•ด ํƒ€์›์ฒด ๊ธฐ๋ฒ• ๊ธฐ๋ฐ˜ ์ ์‘ ์ž์„ธ ์ถ”์ •์„ ์ œ์•ˆํ•œ๋‹ค. ์ž์„ธ ์ถ”์ • ๊ธฐ๋ฒ• (AHRS: Attitude and Heading Reference System)์€ ์ž์ด๋กœ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ž์„ธ๋ฅผ ๊ณ„์‚ฐํ•˜๊ณ  ์ž์ด๋กœ ์„ผ์„œ์˜ค์ฐจ์— ์˜ํ•ด ๋ฐœ์ƒํ•˜๋Š” ๋“œ๋ฆฌํ”„ํŠธ๋ฅผ ๋ณด์ •ํ•˜๊ธฐ ์œ„ํ•ด ์ธก์ •์น˜๋กœ ๊ฐ€์†๋„๊ณ„์™€ ์ง€์ž๊ณ„๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค. ์ผ๋ฐ˜์ ์œผ๋กœ ๊ฐ€์† ๋ฐ ์ง€์ž๊ณ„ ์™ธ๋ž€ ์ƒํ™ฉ์—์„œ๋Š” ์ž์„ธ ์ถ”์ • ์„ฑ๋Šฅ์ด ๋–จ์–ด์ง€๋Š”๋ฐ, ์ถ”์ • ์„ฑ๋Šฅ์„ ํšจ๊ณผ์ ์œผ๋กœ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•ด ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ํƒ€์›์ฒด ๊ธฐ๋ฐ˜ ์ ์‘ ์ž์„ธ ์ถ”์ • ๊ธฐ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค. ์ธก์ •์น˜ ์™ธ๋ž€์ด ๋“ค์–ด์˜ค๋Š” ๊ฒฝ์šฐ, ์™ธ๋ž€์˜ ๋ฐฉํ–ฅ์„ ๊ณ ๋ คํ•˜์—ฌ ํƒ€์›์ฒด ๊ธฐ๋ฒ•์œผ๋กœ ์ธก์ •์น˜ ๊ณต๋ถ„์‚ฐ์„ ์กฐ์ •ํ•ด์คŒ์œผ๋กœ์จ ๋ฐฉํ–ฅ์„ ๊ณ ๋ คํ•˜์ง€ ์•Š์€ ์ ์‘ ์ถ”์ • ๊ธฐ๋ฒ•๋ณด๋‹ค ์ •ํ™•ํ•˜๊ฒŒ ์ธก์ •์น˜ ์—…๋ฐ์ดํŠธ๋ฅผ ํ•  ์ˆ˜ ์žˆ๋‹ค. ํŠนํžˆ ์™ธ๋ž€์ด ํ•œ ์ถ•์œผ๋กœ๋งŒ ๋“ค์–ด์˜ค๋Š” ๊ฒฝ์šฐ, ์ œ์•ˆํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ๋ฐฉํ–ฅ์„ ๊ณ ๋ คํ•ด ๋‚˜๋จธ์ง€ ๋‘ ์ถ•์— ๋Œ€ํ•ด์„œ๋Š” ์—…๋ฐ์ดํŠธ ํ•ด์คŒ์œผ๋กœ์จ ์ธก์ •์น˜๋ฅผ ๋ถ€๋ถ„์ ์œผ๋กœ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค. ๋ ˆ์ดํŠธ ํ…Œ์ด๋ธ”, ๋ชจ์…˜ ์บก์ณ ์žฅ๋น„๋ฅผ ํ†ตํ•ด ์ œ์•ˆํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์ž์„ธ ์„ฑ๋Šฅ์ด ํ–ฅ์ƒ๋จ์„ ํ™•์ธํ•˜์˜€๋‹ค. ๋‹ค์Œ์œผ๋กœ ๋‹ค์–‘ํ•œ ๋™์ž‘์—์„œ๋„ ์šด์šฉ ๊ฐ€๋Šฅํ•œ ์ ๋ถ„ ๋ฐ ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ธฐ๋ฒ•์„ ์œตํ•ฉํ•˜๋Š” ๋ณดํ–‰ํ•ญ๋ฒ• ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์•ˆํ•œ๋‹ค. ์Šค๋งˆํŠธํฐ์„ ์ด์šฉํ•ด ์‹ค๋‚ด๋ฅผ ์ด๋™ํ•  ๋•Œ์—๋Š” ์ž์œ ๋„๊ฐ€ ํฌ๊ธฐ ๋•Œ๋ฌธ์— ์ „ํ™” ๊ฑธ๊ธฐ, ๋ฌธ์ž, ๋ฐ”์ง€ ์ฃผ๋จธ๋‹ˆ ๋„ฃ๊ธฐ ๋“ฑ ๋‹ค์–‘ํ•œ ๋™์ž‘์ด ๋ฐœ์ƒ ๊ฐ€๋Šฅํ•˜๋‹ค. ๊ธฐ์กด์˜ ์Šค๋งˆํŠธํฐ ๊ธฐ๋ฐ˜ ๋ณดํ–‰ ํ•ญ๋ฒ•์—์„œ๋Š” ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ธฐ๋ฒ•์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์œ„์น˜๋ฅผ ์ถ”์ •ํ•˜๋Š”๋ฐ, ์ด๋Š” ๋ณดํ–‰์ž์˜ ์ง„ํ–‰ ๋ฐฉํ–ฅ๊ณผ ๊ธฐ๊ธฐ์˜ ๋ฐฉํ–ฅ์ด ์ผ์น˜ํ•˜๋Š” ๊ฒฝ์šฐ์—๋งŒ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•˜๋ฉฐ ์ผ์น˜ํ•˜์ง€ ์•Š๋Š” ๊ฒฝ์šฐ ์ž์„ธ ์˜ค์ฐจ๋กœ ์ธํ•œ ์œ„์น˜ ์˜ค์ฐจ๊ฐ€ ํฌ๊ฒŒ ๋ฐœ์ƒํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ ๋ถ„ ๊ธฐ๋ฐ˜ ๊ธฐ๋ฒ•์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์ƒํƒœ๋ณ€์ˆ˜๋ฅผ ๊ตฌ์„ฑํ•˜๊ณ  ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ธฐ๋ฒ•์„ ํ†ตํ•ด ๋‚˜์˜ค๋Š” ์œ„์น˜ ๋ฒกํ„ฐ๋ฅผ ์ธก์ •์น˜๋กœ ์‚ฌ์šฉํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์•ˆํ•œ๋‹ค. ๋งŒ์•ฝ ๊ธฐ๊ณ„ํ•™์Šต์„ ํ†ตํ•ด ์ธ์‹ํ•œ ๋™์ž‘์„ ๋ฐ”ํƒ•์œผ๋กœ ์ง„ํ–‰ ๋ฐฉํ–ฅ๊ณผ ๊ธฐ๊ธฐ ๋ฐฉํ–ฅ์ด ์ผ์น˜ํ•˜์ง€ ์•Š๋Š” ๊ฒฝ์šฐ, ์ฃผ์„ฑ๋ถ„ ๋ถ„์„์„ ํ†ตํ•ด ๊ณ„์‚ฐํ•œ ์ง„ํ–‰๋ฐฉํ–ฅ์„ ์ด์šฉํ•ด ์ง„ํ–‰ ๋ฐฉํ–ฅ์„, ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ธฐ๋ฒ•์„ ํ†ตํ•ด ์–ป์€ ๋ณดํญ์œผ๋กœ ๊ฑฐ๋ฆฌ๋ฅผ ์—…๋ฐ์ดํŠธํ•ด ์คŒ์œผ๋กœ์จ ๋ณดํ–‰ ์ค‘ ๋ฐœ์ƒํ•˜๋Š” ์—ฌ๋Ÿฌ ๋™์ž‘์—์„œ๋„ ๊ฐ•์ธํ•˜๊ฒŒ ์šด์šฉํ•  ์ˆ˜ ์žˆ๋‹ค. ๋‹ค์–‘ํ•œ ๋™์ž‘ ์ƒํ™ฉ ๋ฐ ๊ฒฝ๋กœ๋ฅผ ๊ณ ๋ คํ•œ ์‹คํ—˜์„ ํ†ตํ•ด ์œ„์—์„œ ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์ด ๋‹ค์–‘ํ•œ ์‹ค๋‚ด ํ™˜๊ฒฝ์—์„œ๋„ ์•ˆ์ •์ ์œผ๋กœ ์œ„์น˜๋ฅผ ์ถ”์ •ํ•˜๊ณ  ์„ฑ๋Šฅ์ด ํ–ฅ์ƒ๋จ์„ ํ™•์ธํ•˜์˜€๋‹ค.Chapter 1 Introduction 1 1.1 Motivation and Background 1 1.2 Objectives and Contribution 5 1.3 Organization of the Dissertation 6 Chapter 2 Pedestrian Dead Reckoning System 8 2.1 Overview of Pedestrian Dead Reckoning 8 2.2 Parametric Approach 9 2.2.1 Step detection algorithm 11 2.2.2 Step length estimation algorithm 13 2.2.3 Heading estimation 14 2.3 Integration Approach 15 2.3.1 Extended Kalman filter 16 2.3.2 INS-EKF-ZUPT 19 2.4 Activity Recognition using Machine Learning 21 2.4.1 Challenges in HAR 21 2.4.2 Activity recognition chain 22 Chapter 3 Attitude Estimation in Smartphone 26 3.1 Adaptive Attitude Estimation in Smartphone 26 3.1.1 Indirect Kalman filter-based attitude estimation 26 3.1.2 Conventional attitude estimation algorithms 29 3.1.3 Adaptive attitude estimation using ellipsoidal methods 30 3.2 Experimental Results 36 3.2.1 Simulation 36 3.2.2 Rate table experiment 44 3.2.3 Handheld rotation experiment 46 3.2.4 Magnetic disturbance experiment 49 3.3 Summary 53 Chapter 4 Pedestrian Dead Reckoning in Multiple Poses of a Smartphone 54 4.1 System Overview 55 4.2 Machine Learning-based Pose Classification 56 4.2.1 Training dataset 57 4.2.2 Feature extraction and selection 58 4.2.3 Pose classification result using supervised learning in PDR 62 4.3 Fusion of the Integration and Parametric Approaches in PDR 65 4.3.1 System model 67 4.3.2 Measurement model 67 4.3.3 Mode selection 74 4.3.4 Observability analysis 76 4.4 Experimental Results 82 4.4.1 AHRS results 82 4.4.2 PCA results 84 4.4.3 IA-PA results 88 4.5 Summary 100 Chapter 5 Conclusions 103 5.1 Summary of the Contributions 103 5.2 Future Works 105 ๊ตญ๋ฌธ์ดˆ๋ก 125 Acknowledgements 127Docto

    The Hanabi Challenge: A New Frontier for AI Research

    Full text link
    From the early days of computing, games have been important testbeds for studying how well machines can do sophisticated decision making. In recent years, machine learning has made dramatic advances with artificial agents reaching superhuman performance in challenge domains like Go, Atari, and some variants of poker. As with their predecessors of chess, checkers, and backgammon, these game domains have driven research by providing sophisticated yet well-defined challenges for artificial intelligence practitioners. We continue this tradition by proposing the game of Hanabi as a new challenge domain with novel problems that arise from its combination of purely cooperative gameplay with two to five players and imperfect information. In particular, we argue that Hanabi elevates reasoning about the beliefs and intentions of other agents to the foreground. We believe developing novel techniques for such theory of mind reasoning will not only be crucial for success in Hanabi, but also in broader collaborative efforts, especially those with human partners. To facilitate future research, we introduce the open-source Hanabi Learning Environment, propose an experimental framework for the research community to evaluate algorithmic advances, and assess the performance of current state-of-the-art techniques.Comment: 32 pages, 5 figures, In Press (Artificial Intelligence

    A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community

    Full text link
    In recent years, deep learning (DL), a re-branding of neural networks (NNs), has risen to the top in numerous areas, namely computer vision (CV), speech recognition, natural language processing, etc. Whereas remote sensing (RS) possesses a number of unique challenges, primarily related to sensors and applications, inevitably RS draws from many of the same theories as CV; e.g., statistics, fusion, and machine learning, to name a few. This means that the RS community should be aware of, if not at the leading edge of, of advancements like DL. Herein, we provide the most comprehensive survey of state-of-the-art RS DL research. We also review recent new developments in the DL field that can be used in DL for RS. Namely, we focus on theories, tools and challenges for the RS community. Specifically, we focus on unsolved challenges and opportunities as it relates to (i) inadequate data sets, (ii) human-understandable solutions for modelling physical phenomena, (iii) Big Data, (iv) non-traditional heterogeneous data sources, (v) DL architectures and learning algorithms for spectral, spatial and temporal data, (vi) transfer learning, (vii) an improved theoretical understanding of DL systems, (viii) high barriers to entry, and (ix) training and optimizing the DL.Comment: 64 pages, 411 references. To appear in Journal of Applied Remote Sensin

    Algorithms for Autonomous Personal Navigation Systems

    Get PDF
    Personal positioning is a challenging topic in the area of navigation mainly because of the cost, size and power consumption constraints imposed on the hardware. Satellite based positioning techniques can meet the requirements for many applications, but cover well only outdoor environment. Problems like weak satellite signals make the positioning impossible indoors. Urban canyons are also difficult areas for GNSS based navigation because of large multipath errors and satellite signal outages. Many applications require seamless positioning in all environments. However, there is no overall solution for navigation in GNSS denied environment, which is reliable, accurate, cost effective and quickly installed. Recently developed systems for indoor positioning often require pre-installed infrastructure. Another approach is to use fully autonomous navigation systems based on self-contained sensors and street or indoor maps. This thesis is concerned with autonomous personal navigation devices, which do not rely on the reception of external information, like satellite or terrestrial signals. The three proposed algorithms can be integrated into personal navigation systems. The first algorithm computes positioning for a map aided navigation system designed for land vehicles traveling on road network. The novelty is in application of particle filtering to vehicle navigation using road network database. The second algorithm is aimed at map aided vehicle navigation indoors. The novelty is in the method for correction of position and heading. The third algorithm computes solution for pedestrian navigation system, which is based on body mounted inertial measurement unit and models of human gait
    • โ€ฆ
    corecore