57 research outputs found
Classification of Large-Scale Fundus Image Data Sets: A Cloud-Computing Framework
Large medical image data sets with high dimensionality require substantial
amount of computation time for data creation and data processing. This paper
presents a novel generalized method that finds optimal image-based feature sets
that reduce computational time complexity while maximizing overall
classification accuracy for detection of diabetic retinopathy (DR). First,
region-based and pixel-based features are extracted from fundus images for
classification of DR lesions and vessel-like structures. Next, feature ranking
strategies are used to distinguish the optimal classification feature sets. DR
lesion and vessel classification accuracies are computed using the boosted
decision tree and decision forest classifiers in the Microsoft Azure Machine
Learning Studio platform, respectively. For images from the DIARETDB1 data set,
40 of its highest-ranked features are used to classify four DR lesion types
with an average classification accuracy of 90.1% in 792 seconds. Also, for
classification of red lesion regions and hemorrhages from microaneurysms,
accuracies of 85% and 72% are observed, respectively. For images from STARE
data set, 40 high-ranked features can classify minor blood vessels with an
accuracy of 83.5% in 326 seconds. Such cloud-based fundus image analysis
systems can significantly enhance the borderline classification performances in
automated screening systems.Comment: 4 pages, 6 figures, [Submitted], 38th Annual International Conference
of the IEEE Engineering in Medicine and Biology Society 201
A Survey of the Trends in Facial and Expression Recognition Databases and Methods
Automated facial identification and facial expression recognition have been
topics of active research over the past few decades. Facial and expression
recognition find applications in human-computer interfaces, subject tracking,
real-time security surveillance systems and social networking. Several holistic
and geometric methods have been developed to identify faces and expressions
using public and local facial image databases. In this work we present the
evolution in facial image data sets and the methodologies for facial
identification and recognition of expressions such as anger, sadness,
happiness, disgust, fear and surprise. We observe that most of the earlier
methods for facial and expression recognition aimed at improving the
recognition rates for facial feature-based methods using static images.
However, the recent methodologies have shifted focus towards robust
implementation of facial/expression recognition from large image databases that
vary with space (gathered from the internet) and time (video recordings). The
evolution trends in databases and methodologies for facial and expression
recognition can be useful for assessing the next-generation topics that may
have applications in security systems or personal identification systems that
involve "Quantitative face" assessments.Comment: 16 pages, 4 figures, 3 tables, International Journal of Computer
Science and Engineering Survey, October, 201
Computer Aided Detection of Anemia-like Pallor
Paleness or pallor is a manifestation of blood loss or low hemoglobin
concentrations in the human blood that can be caused by pathologies such as
anemia. This work presents the first automated screening system that utilizes
pallor site images, segments, and extracts color and intensity-based features
for multi-class classification of patients with high pallor due to anemia-like
pathologies, normal patients and patients with other abnormalities. This work
analyzes the pallor sites of conjunctiva and tongue for anemia screening
purposes. First, for the eye pallor site images, the sclera and conjunctiva
regions are automatically segmented for regions of interest. Similarly, for the
tongue pallor site images, the inner and outer tongue regions are segmented.
Then, color-plane based feature extraction is performed followed by machine
learning algorithms for feature reduction and image level classification for
anemia. In this work, a suite of classification algorithms image-level
classifications for normal (class 0), pallor (class 1) and other abnormalities
(class 2). The proposed method achieves 86% accuracy, 85% precision and 67%
recall in eye pallor site images and 98.2% accuracy and precision with 100%
recall in tongue pallor site images for classification of images with pallor.
The proposed pallor screening system can be further fine-tuned to detect the
severity of anemia-like pathologies using controlled set of local images that
can then be used for future benchmarking purposes.Comment: 4 pages,2 figures, 2 table
Journey of Hallucination-minimized Generative AI Solutions for Financial Decision Makers
Generative AI has significantly reduced the entry barrier to the domain of AI
owing to the ease of use and core capabilities of automation, translation, and
intelligent actions in our day to day lives. Currently, Large language models
(LLMs) that power such chatbots are being utilized primarily for their
automation capabilities for software monitoring, report generation etc. and for
specific personalized question answering capabilities, on a limited scope and
scale. One major limitation of the currently evolving family of LLMs is
'hallucinations', wherein inaccurate responses are reported as factual.
Hallucinations are primarily caused by biased training data, ambiguous prompts
and inaccurate LLM parameters, and they majorly occur while combining
mathematical facts with language-based context. Thus, monitoring and
controlling for hallucinations becomes necessary when designing solutions that
are meant for decision makers. In this work we present the three major stages
in the journey of designing hallucination-minimized LLM-based solutions that
are specialized for the decision makers of the financial domain, namely:
prototyping, scaling and LLM evolution using human feedback. These three stages
and the novel data to answer generation modules presented in this work are
necessary to ensure that the Generative AI chatbots, autonomous reports and
alerts are reliable and high-quality to aid key decision-making processes.Comment: 4 pages, 2 Figure
FuSSI-Net: Fusion of Spatio-temporal Skeletons for Intention Prediction Network
Pedestrian intention recognition is very important to develop robust and safe
autonomous driving (AD) and advanced driver assistance systems (ADAS)
functionalities for urban driving. In this work, we develop an end-to-end
pedestrian intention framework that performs well on day- and night- time
scenarios. Our framework relies on objection detection bounding boxes combined
with skeletal features of human pose. We study early, late, and combined (early
and late) fusion mechanisms to exploit the skeletal features and reduce false
positives as well to improve the intention prediction performance. The early
fusion mechanism results in AP of 0.89 and precision/recall of 0.79/0.89 for
pedestrian intention classification. Furthermore, we propose three new metrics
to properly evaluate the pedestrian intention systems. Under these new
evaluation metrics for the intention prediction, the proposed end-to-end
network offers accurate pedestrian intention up to half a second ahead of the
actual risky maneuver.Comment: 5 pages, 6 figures, 5 tables, IEEE Asilomar SS
Developing regional capacity in operations research and economic evaluation in South Asia
This project contributed significantly to the capacity-building of regional professionals in planning, implementing, and monitoring of reproductive health programs in South Asia. During 2001–05, professionals from 17 countries received training in various aspects of reproductive health in nine workshops, including operations research, economic evaluation, qualitative research methods, proposal writing, and process documentation and enhancing the utilization of research findings in reproductive health programs. Forty-three percent of workshop participants were program managers from government health programs and nongovernmental organizations. Success in leveraging resources from other collaborating agencies and other donors helped the project to organize more workshops than originally planned and train more professionals than expected. A survey of the workshop participants four to 38 months after training revealed that 70 percent of respondents were using their newly acquired skills in programmatic improvement, program development, and conducting operations research
- …
