29,398 research outputs found
Automatic plant pest detection and recognition using k-means clustering algorithm and correspondence filters
Plant pest recognition and detection is vital for food security, quality of life and a stable agricultural economy. This research demonstrates the combination of the k-means clustering algorithm and the correspondence filter to achieve pest detection and recognition. The detection of the dataset is achieved by partitioning the data space into Voronoi cells, which tends to find clusters of comparable spatial extents, thereby separating the objects (pests) from the background (pest habitat). The detection is established by extracting the variant distinctive attributes between the pest and its habitat (leaf, stem) and using the correspondence filter to identify the plant pests to obtain correlation peak values for different datasets. This work further establishes that the recognition probability from the pest image is directly proportional to the height of the output signal and inversely proportional to the viewing angles, which further confirmed that the recognition of plant pests is a function of their position and viewing angle. It is encouraging to note that the correspondence filter can achieve rotational invariance of pests up to angles of 360 degrees, which proves the effectiveness of the algorithm for the detection and recognition of plant pests
Unsupervised User Stance Detection on Twitter
We present a highly effective unsupervised framework for detecting the stance
of prolific Twitter users with respect to controversial topics. In particular,
we use dimensionality reduction to project users onto a low-dimensional space,
followed by clustering, which allows us to find core users that are
representative of the different stances. Our framework has three major
advantages over pre-existing methods, which are based on supervised or
semi-supervised classification. First, we do not require any prior labeling of
users: instead, we create clusters, which are much easier to label manually
afterwards, e.g., in a matter of seconds or minutes instead of hours. Second,
there is no need for domain- or topic-level knowledge either to specify the
relevant stances (labels) or to conduct the actual labeling. Third, our
framework is robust in the face of data skewness, e.g., when some users or some
stances have greater representation in the data. We experiment with different
combinations of user similarity features, dataset sizes, dimensionality
reduction methods, and clustering algorithms to ascertain the most effective
and most computationally efficient combinations across three different datasets
(in English and Turkish). We further verified our results on additional tweet
sets covering six different controversial topics. Our best combination in terms
of effectiveness and efficiency uses retweeted accounts as features, UMAP for
dimensionality reduction, and Mean Shift for clustering, and yields a small
number of high-quality user clusters, typically just 2--3, with more than 98\%
purity. The resulting user clusters can be used to train downstream
classifiers. Moreover, our framework is robust to variations in the
hyper-parameter values and also with respect to random initialization
Trajectory Clustering and an Application to Airspace Monitoring
This paper presents a framework aimed at monitoring the behavior of aircraft
in a given airspace. Nominal trajectories are determined and learned using data
driven methods. Standard procedures are used by air traffic controllers (ATC)
to guide aircraft, ensure the safety of the airspace, and to maximize the
runway occupancy. Even though standard procedures are used by ATC, the control
of the aircraft remains with the pilots, leading to a large variability in the
flight patterns observed. Two methods to identify typical operations and their
variability from recorded radar tracks are presented. This knowledge base is
then used to monitor the conformance of current operations against operations
previously identified as standard. A tool called AirTrajectoryMiner is
presented, aiming at monitoring the instantaneous health of the airspace, in
real time. The airspace is "healthy" when all aircraft are flying according to
the nominal procedures. A measure of complexity is introduced, measuring the
conformance of current flight to nominal flight patterns. When an aircraft does
not conform, the complexity increases as more attention from ATC is required to
ensure a safe separation between aircraft.Comment: 15 pages, 20 figure
Automatic Crack Detection in Built Infrastructure Using Unmanned Aerial Vehicles
This paper addresses the problem of crack detection which is essential for
health monitoring of built infrastructure. Our approach includes two stages,
data collection using unmanned aerial vehicles (UAVs) and crack detection using
histogram analysis. For the data collection, a 3D model of the structure is
first created by using laser scanners. Based on the model, geometric properties
are extracted to generate way points necessary for navigating the UAV to take
images of the structure. Then, our next step is to stick together those
obtained images from the overlapped field of view. The resulting image is then
clustered by histogram analysis and peak detection. Potential cracks are
finally identified by using locally adaptive thresholds. The whole process is
automatically carried out so that the inspection time is significantly improved
while safety hazards can be minimised. A prototypical system has been developed
for evaluation and experimental results are included.Comment: In proceeding of The 34th International Symposium on Automation and
Robotics in Construction (ISARC), pp. 823-829, Taipei, Taiwan, 201
Peptide mass fingerprinting using field-programmable gate arrays
The reconfigurable computing paradigm, which exploits the flexibility and versatility of field-programmable gate arrays (FPGAs), has emerged as a powerful solution for speeding up time-critical algorithms. This paper describes a reconfigurable computing solution for processing raw mass spectrometric data generated by MALDI-TOF instruments. The hardware-implemented algorithms for denoising, baseline correction, peak identification, and deisotoping, running on a Xilinx Virtex-2 FPGA at 180 MHz, generate a mass fingerprint that is over 100 times faster than an equivalent algorithm written in C, running on a Dual 3-GHz Xeon server. The results obtained using the FPGA implementation are virtually identical to those generated by a commercial software package MassLynx
Map++: A Crowd-sensing System for Automatic Map Semantics Identification
Digital maps have become a part of our daily life with a number of commercial
and free map services. These services have still a huge potential for
enhancement with rich semantic information to support a large class of mapping
applications. In this paper, we present Map++, a system that leverages standard
cell-phone sensors in a crowdsensing approach to automatically enrich digital
maps with different road semantics like tunnels, bumps, bridges, footbridges,
crosswalks, road capacity, among others. Our analysis shows that cell-phones
sensors with humans in vehicles or walking get affected by the different road
features, which can be mined to extend the features of both free and commercial
mapping services. We present the design and implementation of Map++ and
evaluate it in a large city. Our evaluation shows that we can detect the
different semantics accurately with at most 3% false positive rate and 6% false
negative rate for both vehicle and pedestrian-based features. Moreover, we show
that Map++ has a small energy footprint on the cell-phones, highlighting its
promise as a ubiquitous digital maps enriching service.Comment: Published in the Eleventh Annual IEEE International Conference on
Sensing, Communication, and Networking (IEEE SECON 2014
- …