179 research outputs found
Cars as a tool for monitoring and protecting biodiversity
Modern cars have an array of sensors that allow different objects to be recognized, including large and small animals. They thus have the potential to become a tool for monitoring biodiversity and improving driver safety. But to achieve this various challenges in computing, communications and privacy need to be addressed.info:eu-repo/semantics/publishedVersio
An assistive model of obstacle detection based on deep learning: YOLOv3 for visually impaired people
The World Health Organization (WHO) reported in 2019 that at least 2.2 billion people were visual-impairment or blindness. The main problem of living for visually impaired people have been facing difficulties in moving even indoor or outdoor situations. Therefore, their lives are not safe and harmful. In this paper, we proposed an assistive application model based on deep learning: YOLOv3 with a Darknet-53 base network for visually impaired people on a smartphone. The Pascal VOC2007 and Pascal VOC2012 were used for the training set and used Pascal VOC2007 test set for validation. The assistive model was installed on a smartphone with an eSpeak synthesizer which generates the audio output to the user. The experimental result showed a high speed and also high detection accuracy. The proposed application with the help of technology will be an effective way to assist visually impaired people to interact with the surrounding environment in their daily life
Object Detection in 20 Years: A Survey
Object detection, as of one the most fundamental and challenging problems in
computer vision, has received great attention in recent years. Its development
in the past two decades can be regarded as an epitome of computer vision
history. If we think of today's object detection as a technical aesthetics
under the power of deep learning, then turning back the clock 20 years we would
witness the wisdom of cold weapon era. This paper extensively reviews 400+
papers of object detection in the light of its technical evolution, spanning
over a quarter-century's time (from the 1990s to 2019). A number of topics have
been covered in this paper, including the milestone detectors in history,
detection datasets, metrics, fundamental building blocks of the detection
system, speed up techniques, and the recent state of the art detection methods.
This paper also reviews some important detection applications, such as
pedestrian detection, face detection, text detection, etc, and makes an in-deep
analysis of their challenges as well as technical improvements in recent years.Comment: This work has been submitted to the IEEE TPAMI for possible
publicatio
A Comparison of Image Processing Techniques for Bird Detection
Orchard fruits and vegetable crops are vulnerable to wild birds and animals. These wild birds and animals can cause critical damage to the produce. Traditional methods of scaring away birds such as scarecrows are not long-term solutions but short-term solutions. This is a huge problem especially near areas like San Luis Obispo where there are vineyards. Bird damage can be as high as 50% for grapes being grown in vineyards. The total estimated revenue lost annually in the 10 counties in California due to bird and rodent damage to 22 selected crops ranged from 504 million (in 2009 dollars). A more effective and permanent system needs to be put into place. Monitoring systems in agricultural settings could potentially provide a lot of data for image processing. Most current monitoring systems however don’t focus on image processing but instead really heavily on sensors. Just having sensors for certain systems work, but for birds, monitoring it is not an option because they are not domesticated like pigs, cows etc. in which most these agricultural monitoring systems work on. Birds can fly in and out of the area whereas domesticated animals can be confined to certain physical regions. The most crucial step in a smart scarecrow system would be how a threat would v be detected. Image processing methods can be effectively applied to detecting items in video footage. This paper will focus on bird detection and will analyze motion detection with image subtraction, bird detection with template matching, and bird detection with the Viola-Jones Algorithm. Of the methods considered, bird detection with the Viola-Jones Algorithm had the highest accuracy (87%) with a somewhat low false positive rate. This image processing step would ideally be incorporated with hardware (such as a microcontroller or FPGA, sensors, a camera etc.) to form a smart scarecrow system
Robust Vehicle Detection and Distance Estimation Under Challenging Lighting Conditions
Avoiding high computational costs and calibration issues involved in stereo-vision-based algorithms, this paper proposes real-time monocular-vision-based techniques for simultaneous vehicle detection and inter-vehicle distance estimation, in which the performance and robustness of the system remain competitive, even for highly challenging benchmark datasets. This paper develops a collision warning system by detecting vehicles ahead and, by identifying safety distances to assist a distracted driver, prior to occurrence of an imminent crash. We introduce adaptive global Haar-like features for vehicle detection, tail-light segmentation, virtual symmetry detection, intervehicle distance estimation, as well as an efficient single-sensor multifeature fusion technique to enhance the accuracy and robustness of our algorithm. The proposed algorithm is able to detect vehicles ahead at both day or night and also for short- and long-range distances. Experimental results under various weather and lighting conditions (including sunny, rainy, foggy, or snowy) show that the proposed algorithm outperforms state-of-the-art algorithms
Exploring the frontier of smart video surveillance: novel domains and fine-grain event understanding
PlantDet: A benchmark for Plant Detection in the Three-Rivers-Source Region
The Three-River-Source region is a highly significant natural reserve in
China that harbors a plethora of untamed botanical resources. To meet the
practical requirements of botanical research and intelligent plant management,
we construct a large-scale dataset for Plant detection in the
Three-River-Source region (PTRS). This dataset comprises 6965 high-resolution
images of 2160*3840 pixels, captured by diverse sensors and platforms, and
featuring objects of varying shapes and sizes. Subsequently, a team of
botanical image interpretation experts annotated these images with 21 commonly
occurring object categories. The fully annotated PTRS images contain 122, 300
instances of plant leaves, each labeled by a horizontal rectangle. The PTRS
presents us with challenges such as dense occlusion, varying leaf resolutions,
and high feature similarity among plants, prompting us to develop a novel
object detection network named PlantDet. This network employs a window-based
efficient self-attention module (ST block) to generate robust feature
representation at multiple scales, improving the detection efficiency for small
and densely-occluded objects. Our experimental results validate the efficacy of
our proposed plant detection benchmark, with a precision of 88.1%, a mean
average precision (mAP) of 77.6%, and a higher recall compared to the baseline.
Additionally, our method effectively overcomes the issue of missing small
objects. We intend to share our data and code with interested parties to
advance further research in this field.Comment: 10 pages, 5 figure
Unveiling the frontiers of deep learning: innovations shaping diverse domains
Deep learning (DL) enables the development of computer models that are
capable of learning, visualizing, optimizing, refining, and predicting data. In
recent years, DL has been applied in a range of fields, including audio-visual
data processing, agriculture, transportation prediction, natural language,
biomedicine, disaster management, bioinformatics, drug design, genomics, face
recognition, and ecology. To explore the current state of deep learning, it is
necessary to investigate the latest developments and applications of deep
learning in these disciplines. However, the literature is lacking in exploring
the applications of deep learning in all potential sectors. This paper thus
extensively investigates the potential applications of deep learning across all
major fields of study as well as the associated benefits and challenges. As
evidenced in the literature, DL exhibits accuracy in prediction and analysis,
makes it a powerful computational tool, and has the ability to articulate
itself and optimize, making it effective in processing data with no prior
training. Given its independence from training data, deep learning necessitates
massive amounts of data for effective analysis and processing, much like data
volume. To handle the challenge of compiling huge amounts of medical,
scientific, healthcare, and environmental data for use in deep learning, gated
architectures like LSTMs and GRUs can be utilized. For multimodal learning,
shared neurons in the neural network for all activities and specialized neurons
for particular tasks are necessary.Comment: 64 pages, 3 figures, 3 table
- …