Dual Semantic Fusion Network for Video Object Detection

Author: Bertasius Gedas
Chen Yuntao
Girshick Ross
Guo Yundi
Kang Kai
Li Jianan
Liu Mason
Lu Jiwen
Nascimento Jacinto C
Ota Kaoru
Ren Shaoqing
Tang Yuxing
Tao Qingyi
Wang Shiyao
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 16/09/2020

Video object detection is a tough task due to the deteriorated quality of video sequences captured under complex environments. Currently, this area is dominated by a series of feature enhancement based methods, which distill beneficial semantic information from multiple frames and generate enhanced features through fusing the distilled information. However, the distillation and fusion operations are usually performed at either frame level or instance level with external guidance using additional information, such as optical flow and feature memory. In this work, we propose a dual semantic fusion network (abbreviated as DSFNet) to fully exploit both frame-level and instance-level semantics in a unified fusion framework without external guidance. Moreover, we introduce a geometric similarity measure into the fusion process to alleviate the influence of information distortion caused by noise. As a result, the proposed DSFNet can generate more robust features through the multi-granularity fusion and avoid being affected by the instability of external guidance. To evaluate the proposed DSFNet, we conduct extensive experiments on the ImageNet VID dataset. Notably, the proposed dual semantic fusion network achieves, to the best of our knowledge, the best performance of 84.1% mAP among the current state-of-the-art video object detectors with ResNet-101 and 85.4% mAP with ResNeXt-101 without using any post-processing steps.

Comment: 9 pages, 6 figures

arXiv.org e-Print Archive

Crossref

Feasibility study of common electronic equipment for shuttle sortie experiment payloads

A study was conducted to determine the feasibility of using standardized electronic equipment on the space shuttle vehicle in an effort to reduce the cost estimates. The standards for Nuclear Instrument Modules (NIM) and CAMAC electronic equipment are presented and described. It was determined that the CAMAC electronic equipment was more suitable for use with the space shuttle systems. Specific applications of the CAMAC equipment are analyzed. Illustrations of the equipment and circuit diagrams of the subsystems are provided.

NASA Technical Reports Server

Feature based dynamic intra-video indexing

Author: Asghar Muhammad Nabeel
Publication venue: University of Bedfordshire
Publication date: 01/09/2014

A thesis submitted in partial fulfillment for the degree of Doctor of Philosophy.

With the advent of digital imagery and its widespread application in all walks of life, it has become an important component in the world of communication. Video content ranging from broadcast news, sports, personal videos, surveillance, movies and entertainment and similar domains is increasing exponentially in quantity, and it is becoming a challenge to retrieve content of interest from the corpora. This has led to increased interest amongst researchers in investigating concepts of video structure analysis, feature extraction, content annotation, tagging, video indexing, querying and retrieval to fulfil the requirements. However, most of the previous work is confined to specific domains and constrained by quality, processing and storage capabilities. This thesis presents a novel framework agglomerating the established approaches from feature extraction to browsing in one system of content based video retrieval. The proposed framework significantly fills the gap identified while satisfying the imposed constraints of processing, storage, quality and retrieval times. The output entails a framework, methodology and prototype application to allow the user to efficiently and effectively retrieve content of interest such as age, gender and activity by specifying the relevant query. Experiments have shown plausible results with an average precision and recall of 0.91 and 0.92 respectively for face detection using a Haar wavelets based approach. Precision of age ranges from 0.82 to 0.91 and recall from 0.78 to 0.84. The recognition of gender gives better precision with males (0.89) compared to females while recall gives a higher value with females (0.92). Activity of the subject has been detected using the Hough transform and classified using a Hidden Markov Model. A comprehensive dataset to support similar studies has also been developed as part of the research process. A Graphical User Interface (GUI) providing a friendly and intuitive interface has been integrated into the developed system to facilitate the retrieval process. The comparison results of the intraclass correlation coefficient (ICC) show that the performance of the system closely resembles that of the human annotator. The performance has been optimised for time and error rate.

University of Bedfordshire Repository

Anxiolytic effects of propranolol and diphenoxylate on mice and automated stretch-attend posture analysis

Author: Holly Kevin Scott
Publication venue: Louisiana Tech Digital Commons
Publication date: 01/10/2016

The prevention of social anxiety, performance anxiety, and social phobia via the combination of two generic drugs, diphenoxylate HCl (opioid) plus atropine sulfate (anticholinergic) and propranolol HCl (beta blocker), was evaluated in mice through behavioral studies. A patent published on September 8, 2011 by Benjamin D. Holly, US 2011/0218215 A1, prompted the research. The drug combination of diphenoxylate and atropine plus propranolol could be an immediate treatment for patients suffering from acute phobic and social anxiety disorders. Demonstrating the anxiolytic effects of the treatment on mice would validate a mouse model that neuroscientists could use to detect the mechanism of action behind the drug combination. To detect more sensitive anxiety measures in mice, a MATLAB-based software called MATSAP was developed as a quick, consistent, and open source program that provides objective automated analysis of stretch-attend posture in rodent behavioral experiments. Stretch-attend posture occurs during risk assessment and is prevalent in common rodent behavioral tests. Stretch-attend posture is a more sensitive measure of the effects of anxiolytics than traditional spatiotemporal indices. However, quantifying stretch-attend posture using human observers is time consuming, somewhat subjective, and prone to errors. Unlike human observers, MATSAP is not susceptible to fatigue or subjectivity. MATSAP performance was assessed with videos of male Swiss mice moving in an open field box and in an elevated plus maze. MATSAP reliably detected stretch-attend posture on par with human observers. This freely-available program can be broadly used by biologists and psychologists to accelerate neurological, pharmacological, and behavioral studies. To further expand on methods to automate the detection of stretch-attend posture (SAP), EthoStock was developed. It not only can detect SAP, but also has the potential to detect other ethological behaviors such as grooming and rearing.

Louisiana Tech Digital Commons

Context-Aware and Adaptable eLearning Systems

Author: Stoyanov Stanimir
Publication venue: Software Technology Research Laboratory
Publication date: 01/01/2012

The full text file attached to this record contains a copy of the thesis without the author's publications attached. The list of publications that are attached to the complete thesis can be found on pages 6-7 in the thesis.

This thesis proposes solutions to some shortcomings of current eLearning architectures. The proposed DeLC architecture supports context-aware and adaptable provision of eLearning services and electronic content. The architecture is fully distributed and integrates service-oriented development with agent technology. Central to this architecture is the node, our unit of computation (known as an eLearning node), which can have a purely service-oriented architecture, an agent-oriented architecture or a mixed architecture. Three eLearning nodes have been implemented in order to demonstrate the vitality of the DeLC concept. The Mobile eLearning Node uses a three-level communication network, called the InfoStations network, supporting mobile service provision. The services deployed on this node are aware of their context, gather the required learning material and adapt to the learner's request. This is supported through a multi-layered hybrid (service- and agent-oriented) architecture whose kernel is implemented as middleware. For testing of the middleware a simulation environment has been developed. In addition, the DeLC development approach is proposed. The second eLearning node has been implemented as the Education Portal. The architecture of this node is purely service-oriented and it adopts a client-server architecture. The education portal incorporates education services and system services, called engines. The electronic content is kept in Digital Libraries. Furthermore, in order to assist content creators in DeLC, the environment Selbo2 was developed. The environment allows for creating new content, editing available content, as well as generating educational units out of preexisting standardized elements. In the last two years, the portal has been used in actual education at the Faculty of Mathematics and Informatics, University of Plovdiv. The third eLearning node, known as Agent Village, exhibits a purely agent-oriented architecture. The purpose of this node is to provide intelligent assistance to the services deployed on the Education Portal. Currently, two kinds of assistants are implemented in the node: eTesting Assistants and the Refactoring eLearning Environment (ReLE). A more complex architecture, known as the Education Cluster, is presented in this thesis as well. The Education Cluster incorporates two eLearning nodes, namely the Education Portal and the Agent Village. eLearning services and intelligent agents interact in the cluster.

De Montfort University Open Research Archive

Application of CCDs (Charge Coupled Devices) to Polarimetry and Spectropolarimetry

Author: Naghizadeh-Khouei Jaber
Publication venue: ProQuest Dissertations & Theses
Publication date: 01/01/1998

Ideally, we would like to have at our disposal a "universal" instrument which could record all available information, i.e. the full Stokes vector I, Q, U, V as a function of spatial coordinates X and Y and wavelength λ at a given time t. In practice, the spectral dispersers now being used for high resolution observations, narrow band filters or spectrographs, each permit only the measurement of a limited sub-set of these parameters. Filters record I(X,Y) at a fixed wavelength and spectrographs record I(λ,Y) at a fixed X, where X is the direction perpendicular to the slit and Y along it. An important element of any instrumentation is the detector system and, to achieve the above, a detector system must possess 2-dimensional characteristics. The application of CCDs to stellar polarimetry has been slow, one problem being that the polarimetric modulators favoured with photomultiplier detection are not immediately applicable, since instrumental, observational and data reduction techniques are very different in these two detector systems. At the start of this work (1991), high-precision polarimetric data were obtained by polarimeters which employed photomultiplier detectors, with measurement accuracy controlled by photon counting statistics. For low levels of polarization, simple error analysis provides uncertainty estimates for q and u of sigmaq = sigmau =(2/N

Glasgow Theses Service

OpenGrey Repository

Software for DAQ systems

Author: Jones B
Publication venue: CERN
Publication date: 01/01/1995

CERN Document Server

Image Segmentation by Energy and Related Functional Minimization Methods

Author: Mason Eric Hudson
Publication venue: IRL @ UMSL
Publication date: 18/07/2006

Effective and efficient methods for partitioning a digital image into image segments, called "image segmentation," have a wide range of applications that include pattern recognition, classification, editing, rendering, and compressed data for image search. In general, image segments are described by their geometry and similarity measures that identify them. For example, the well-known optimization model proposed and studied in depth by David Mumford and Jayant Shah is based on an L2 total energy functional that consists of three terms that govern the geometry of the image segments, the image fidelity (or closeness to the observed image), and the prior (or image smoothness). Recent work in the field of image restoration suggests that a more suitable choice for the fidelity measure is, perhaps, the l1 norm. This thesis explores that idea applied to the study of image segmentation along the lines of the Mumford and Shah optimization model, but eliminating the need for variational calculus and regularization schemes to derive the approximating Euler-Lagrange equations. The main contribution of this thesis is a formulation of the problem that avoids the need for the calculus of variations. The energy functional represents a global property of an image. It turns out to be possible, however, to predict how localized changes to the segmentation will affect its value. This has been shown previously in the case of the l2 norm, but no similar method is available for other norms. The method described here solves the problem for the l1 norm, and suggests how it would apply to other forms of the fidelity measure. Existing methods rely on a fixed initial condition. This can lead to an algorithm finding local instead of global optima. The solution given here shows how to specify the initial condition based on the content of the image and avoid finding local minima.
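
To make the difference between the l2 and l1 fidelity measures concrete, the following is a minimal sketch and not the method of the thesis. It assumes a piecewise-constant model in which each segment is approximated by a single value; the function name `fidelity` and the sample pixel values are hypothetical. Under that assumption the l2 fidelity of a segment is minimized by its mean and the l1 fidelity by its median, so a single outlier pixel shifts the l2 fit far more than the l1 fit.

```python
import numpy as np

def fidelity(values, norm):
    """Return (best constant, fidelity cost) for one segment.

    Assumes a piecewise-constant approximation (an illustrative
    simplification, not the formulation used in the thesis).
    """
    values = np.asarray(values, dtype=float)
    if norm == "l2":
        c = float(values.mean())        # minimizes sum (v - c)^2
        return c, float(np.sum((values - c) ** 2))
    if norm == "l1":
        c = float(np.median(values))    # minimizes sum |v - c|
        return c, float(np.sum(np.abs(values - c)))
    raise ValueError(f"unknown norm: {norm}")

# Hypothetical segment: four similar pixels and one outlier.
segment = [10, 11, 9, 10, 200]
c2, e2 = fidelity(segment, "l2")
c1, e1 = fidelity(segment, "l1")
print("l2 fit:", c2, "cost:", e2)
print("l1 fit:", c1, "cost:", e1)
```

In this toy segment the l2 fit is pulled toward the outlier while the l1 fit stays at the typical pixel value, which is one common motivation for preferring the l1 norm as a fidelity measure.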

University of Missouri, St. Louis