53 research outputs found

    Collaborative Intelligent Cross-Camera Video Analytics at Edge: Opportunities and Challenges

    Full text link
    Nowadays, video cameras are deployed in large scale for spatial monitoring of physical places (e.g., surveillance systems in the context of smart cities). The massive camera deployment, however, presents new challenges for analyzing the enormous data, as the cost of high computational overhead of sophisticated deep learning techniques imposes a prohibitive overhead, in terms of energy consumption and processing throughput, on such resource-constrained edge devices. To address these limitations, this paper envisions a collaborative intelligent cross-camera video analytics paradigm at the network edge in which camera nodes adjust their pipelines (e.g., inference) to incorporate correlated observations and shared knowledge from other nodes' contents. By harassing redundant spatio-temporal to reduce the size of the inference search space in one hand, and intelligent collaboration between video nodes on the other, we discuss how such collaborative paradigm can considerably improve accuracy, reduce latency and decrease communication bandwidth compared to non-collaborative baselines. This paper also describes major opportunities and challenges in realizing such a paradigm.Comment: First International Workshop on Challenges in Artificial Intelligence and Machine Learnin

    Efficient Cross Layer Designs for IEEE 802.11 Wireless Networks

    Get PDF
    Various properties of wireless networks, such as mobility, frequent disconnections and varying channel conditions, have made it a challenging task to design networking protocols for wireless communications. In this dissertation, we address several problems related to both the routing layer and medium access control (MAC) layer in wireless networks aiming to enhance the network performance. First, we study the effect of the channel noise on the network performance. We present mechanisms to compute energy-efficient paths in noisy environments for ad hoc networks by exploiting the IEEE 802.11 fragmentation mechanism. These mechanisms enhance the network performance up to orders of magnitude in terms of energy and throughput. We also enhance the IEEE 802.11 infrastructure networks with a capability to differentiate between different types of unsuccessful transmissions to enhance the network performance. Second, we study the effects of the physical layer capture phenomena on network performance. We modify the IEEE 802.11 protocol in a way to increase the concurrent transmissions by exploiting the capture phenomena. We analytically study the potential performance enhancement of our mechanism over the original IEEE 802.11. The analysis shows that up to 35% of the IEEE 802.11 blocking decisions are unnecessary. The results are verified by simulation in which we show that our enhanced mechanism can achieve up to 22% more throughput. Finally, we exploit the spatial reuse of the directional antenna in the IEEE 802.11 standards by developing two novel opportunistic enhancement mechanisms. The first mechanism augments the IEEE 802.11 protocol with additional information that gives a node the flexibility to transmit data while other transmissions are in its vicinity. The second mechanism changes the access routines of the IEEE 802.11 data queue. We show analytically how the IEEE 802.11 protocol using directional antenna is conservative in terms of assessing channel availability, with as much as 60% of unnecessary blocking assessments and up to 90% when we alter the accessing mechanism of the data queue. By simulation, we show an improvement in network throughput of 40% in the case of applying the first mechanism, and up to 60% in the case of applying the second mechanism

    Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual Speech

    Full text link
    Recognizing emotions in spoken communication is crucial for advanced human-machine interaction. Current emotion detection methodologies often display biases when applied cross-corpus. To address this, our study amalgamates 16 diverse datasets, resulting in 375 hours of data across languages like English, Chinese, and Japanese. We propose a soft labeling system to capture gradational emotional intensities. Using the Whisper encoder and data augmentation methods inspired by contrastive learning, our method emphasizes the temporal dynamics of emotions. Our validation on four multilingual datasets demonstrates notable zero-shot generalization. We publish our open source model weights and initial promising results after fine-tuning on Hume-Prosody.Comment: Accepted as talk at NeurIPS ML for Audio worksho

    Demographic Prediction of Mobile User from Phone Usage

    Get PDF
    In this paper, we describe how we use the mobile phone usage of users to predict their demographic attributes. Using call log, visited GSM cells information, visited Bluetooth devices, visited Wireless LAN devices, accelerometer data, and so on, we predict the gender, age, marital status, job and number of people in household of users. The accuracy of developed classifiers for these classification problems ranges from 45-87% depending upon the particular classification problem

    TrafficView: Towards a Scalable Traffic Monitoring System

    Get PDF
    Vehicles are part of people's life in modern society, into which more and more high-tech devices are integrated, and a common platform for inter-vehicle communication is necessary to realize an intelligent transportation system supporting safe driving, dynamic route scheduling, emergency message dissemination, and traffic condition monitoring. TrafficView, which is a part of the e-Road project, defines a framework to disseminate and gather information about the vehicles on the road. Using such a system will provide a vehicle driver with road traffic information, which helps driving in situations as foggy weather, or finding an optimal route in a trip several miles long. This paper describes the basic design of TrafficView and different algorithms used in the system. (UMIACS-TR-2003-98

    Burnout among surgeons before and during the SARS-CoV-2 pandemic: an international survey

    Get PDF
    Background: SARS-CoV-2 pandemic has had many significant impacts within the surgical realm, and surgeons have been obligated to reconsider almost every aspect of daily clinical practice. Methods: This is a cross-sectional study reported in compliance with the CHERRIES guidelines and conducted through an online platform from June 14th to July 15th, 2020. The primary outcome was the burden of burnout during the pandemic indicated by the validated Shirom-Melamed Burnout Measure. Results: Nine hundred fifty-four surgeons completed the survey. The median length of practice was 10 years; 78.2% included were male with a median age of 37 years old, 39.5% were consultants, 68.9% were general surgeons, and 55.7% were affiliated with an academic institution. Overall, there was a significant increase in the mean burnout score during the pandemic; longer years of practice and older age were significantly associated with less burnout. There were significant reductions in the median number of outpatient visits, operated cases, on-call hours, emergency visits, and research work, so, 48.2% of respondents felt that the training resources were insufficient. The majority (81.3%) of respondents reported that their hospitals were included in the management of COVID-19, 66.5% felt their roles had been minimized; 41% were asked to assist in non-surgical medical practices, and 37.6% of respondents were included in COVID-19 management. Conclusions: There was a significant burnout among trainees. Almost all aspects of clinical and research activities were affected with a significant reduction in the volume of research, outpatient clinic visits, surgical procedures, on-call hours, and emergency cases hindering the training. Trial registration: The study was registered on clicaltrials.gov "NCT04433286" on 16/06/2020
    corecore