4 research outputs found
DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition
The task of Visual Relationship Recognition (VRR) aims to identify
relationships between two interacting objects in an image and is particularly
challenging due to the widely-spread and highly imbalanced distribution of
triplets. To overcome the resultant performance
bias in existing VRR approaches, we introduce DiffAugment -- a method which
first augments the tail classes in the linguistic space by making use of
WordNet and then utilizes the generative prowess of Diffusion Models to expand
the visual space for minority classes. We propose a novel hardness-aware
component in diffusion which is based upon the hardness of each triplet
and demonstrate the effectiveness of hardness-aware diffusion in generating
visual embeddings for the tail classes. We also propose a novel subject and
object based seeding strategy for diffusion sampling which improves the
discriminative capability of the generated visual embeddings. Extensive
experimentation on the GQA-LT dataset shows favorable gains in the
subject/object and relation average per-class accuracy using Diffusion
augmented samples
Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit
Automatic group emotion recognition plays an important role in understanding
complex human-human interaction. This paper introduces, Emolysis, a
Python-based, standalone open-source group emotion analysis toolkit for use in
different social situations upon getting consent from the users. Given any
input video, Emolysis processes synchronized multimodal input and maps it to
group level emotion, valence and arousal. Additionally, the toolkit supports
major mobile and desktop platforms (Android, iOS, Windows). The Emolysis
platform also comes with an intuitive graphical user interface that allows
users to select different modalities and target persons for more fine-grained
emotion analysis. Emolysis is freely available for academic research and
encourages application developers to extend it to application specific
environments on top of the existing system. We believe that the extension
mechanism is quite straightforward. Our code models and interface are available
at https://github.com/ControlNet/emolysis.Comment: Accepted by ACII Demo 2024. Both Shreya Ghosh and Zhixi Cai
contributed equally to this researc
FakeBuster:a deepfakes detection tool for video conferencing scenarios
This paper proposes a new DeepFake detector FakeBuster for detecting
impostors during video conferencing and manipulated faces on social media.
FakeBuster is a standalone deep learning based solution, which enables a user
to detect if another person's video is manipulated or spoofed during a video
conferencing based meeting. This tool is independent of video conferencing
solutions and has been tested with Zoom and Skype applications. It uses a 3D
convolutional neural network for predicting video segment-wise fakeness scores.
The network is trained on a combination of datasets such as Deeperforensics,
DFDC, VoxCeleb, and deepfake videos created using locally captured (for video
conferencing scenarios) images. This leads to different environments and
perturbations in the dataset, which improves the generalization of the deepfake
network.Comment: 5 Pages, 3 Figure
Empagliflozin in Patients with Chronic Kidney Disease
Background The effects of empagliflozin in patients with chronic kidney disease who are at risk for disease progression are not well understood. The EMPA-KIDNEY trial was designed to assess the effects of treatment with empagliflozin in a broad range of such patients. Methods We enrolled patients with chronic kidney disease who had an estimated glomerular filtration rate (eGFR) of at least 20 but less than 45 ml per minute per 1.73 m(2) of body-surface area, or who had an eGFR of at least 45 but less than 90 ml per minute per 1.73 m(2) with a urinary albumin-to-creatinine ratio (with albumin measured in milligrams and creatinine measured in grams) of at least 200. Patients were randomly assigned to receive empagliflozin (10 mg once daily) or matching placebo. The primary outcome was a composite of progression of kidney disease (defined as end-stage kidney disease, a sustained decrease in eGFR to < 10 ml per minute per 1.73 m(2), a sustained decrease in eGFR of & GE;40% from baseline, or death from renal causes) or death from cardiovascular causes. Results A total of 6609 patients underwent randomization. During a median of 2.0 years of follow-up, progression of kidney disease or death from cardiovascular causes occurred in 432 of 3304 patients (13.1%) in the empagliflozin group and in 558 of 3305 patients (16.9%) in the placebo group (hazard ratio, 0.72; 95% confidence interval [CI], 0.64 to 0.82; P < 0.001). Results were consistent among patients with or without diabetes and across subgroups defined according to eGFR ranges. The rate of hospitalization from any cause was lower in the empagliflozin group than in the placebo group (hazard ratio, 0.86; 95% CI, 0.78 to 0.95; P=0.003), but there were no significant between-group differences with respect to the composite outcome of hospitalization for heart failure or death from cardiovascular causes (which occurred in 4.0% in the empagliflozin group and 4.6% in the placebo group) or death from any cause (in 4.5% and 5.1%, respectively). The rates of serious adverse events were similar in the two groups. Conclusions Among a wide range of patients with chronic kidney disease who were at risk for disease progression, empagliflozin therapy led to a lower risk of progression of kidney disease or death from cardiovascular causes than placebo
