Gaussian filter based à-trous algorithm for image fusion
Image fusion integrates complementary information from multiple perspectives in order to provide a meaningful interpretation of useful features and textures in multisource images. Here, we present a multiresolution algorithm based on the Stationary Wavelet Transform (SWT) for the fusion of two test images of the same size. The algorithm applies a Gaussian low-pass filter to the high-frequency subbands of the SWT decomposition. The new approach gave sharper edges and better structural enhancement than region-based approaches that compute the energy around salient features. The key feature of Gaussian filtering is the flexibility to use filters with different standard deviations depending on the application and the range of detail required for processing.
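The filtering-and-fusion idea can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it substitutes a simple Gaussian high-pass residual for the SWT detail subbands, and the function and parameter names (`fuse_images`, `sigma`) are hypothetical.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def fuse_images(img_a, img_b, sigma=1.5):
    """Illustrative two-image fusion sketch (hypothetical names).

    Each image is split into a Gaussian low-pass approximation and a
    high-frequency residual (a stand-in for SWT detail subbands). The
    residuals are Gaussian-smoothed, then fused by taking the detail
    with the larger magnitude at each pixel; approximations are averaged.
    """
    low_a = gaussian_filter(img_a, sigma)
    low_b = gaussian_filter(img_b, sigma)
    det_a = img_a - low_a
    det_b = img_b - low_b
    # Gaussian smoothing of the detail bands; the standard deviation is
    # application-dependent, as the abstract notes.
    det_a = gaussian_filter(det_a, sigma=0.5)
    det_b = gaussian_filter(det_b, sigma=0.5)
    fused_low = 0.5 * (low_a + low_b)
    fused_det = np.where(np.abs(det_a) >= np.abs(det_b), det_a, det_b)
    return fused_low + fused_det
```

Varying `sigma` trades off how much fine detail is preserved versus smoothed, mirroring the flexibility the abstract attributes to Gaussian filtering.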
Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)
Music source separation (MSS) aims to extract 'vocals', 'drums', 'bass' and
'other' tracks from a piece of mixed music. While deep learning methods have
shown impressive results, there is a trend toward larger models. In our paper,
we introduce a novel and lightweight architecture called DTTNet, which is based
on Dual-Path Module and Time-Frequency Convolutions Time-Distributed
Fully-connected UNet (TFC-TDF UNet). DTTNet achieves 10.12 dB cSDR on 'vocals'
compared to 10.01 dB reported for Bandsplit RNN (BSRNN) but with 86.7% fewer
parameters. We also assess pattern-specific performance and model
generalization for intricate audio patterns.
Comment: Submitted to ICASSP 202
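For reference, the cSDR figures quoted above are typically computed as the signal-to-distortion ratio per fixed-length chunk, aggregated by the median. A sketch under that assumption follows; the chunk length, sample rate, and exact aggregation used by the paper are not stated here, so treat these values as placeholders.

```python
import numpy as np

def sdr(reference, estimate, eps=1e-8):
    # Signal-to-distortion ratio in dB (assumed definition): power of the
    # reference signal over power of the residual error.
    num = np.sum(reference ** 2)
    den = np.sum((reference - estimate) ** 2) + eps
    return 10.0 * np.log10(num / den + eps)

def chunked_sdr(reference, estimate, sr=44100, chunk_sec=1.0):
    # cSDR sketch: median SDR over non-overlapping fixed-length chunks,
    # a common convention in music source separation evaluation.
    n = int(sr * chunk_sec)
    chunks = [sdr(reference[i:i + n], estimate[i:i + n])
              for i in range(0, len(reference) - n + 1, n)]
    return float(np.median(chunks))
```

A higher cSDR means less residual distortion in the separated track, which is how the 10.12 dB vs. 10.01 dB comparison should be read.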
Dementia Detection from Speech Using Machine Learning and Deep Learning Architectures
Dementia affects the patient’s memory and leads to language impairment. Research has demonstrated that speech and language deterioration is often a clear indication of dementia and plays a crucial role in the recognition process. Although earlier studies have used speech features to recognize subjects suffering from dementia, these features are often combined with linguistic features obtained from transcriptions. This study explores significant standalone speech features for recognizing dementia. The primary contribution of this work is to identify a compact set of speech features that aid the dementia recognition process. The secondary contribution is to leverage machine learning (ML) and deep learning (DL) models for the recognition task. Speech samples from the Pitt corpus in DementiaBank are used in the present study. A critical speech feature set of prosodic, voice quality and cepstral features is proposed for the task. The experimental results demonstrate the superiority of machine learning (87.6 percent) over deep learning (85 percent) models for recognizing dementia using the compact speech feature combination, along with lower time and memory consumption. The results obtained with the proposed approach are promising compared with existing work on dementia recognition using speech.
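As a rough illustration of the pipeline described (standalone acoustic features fed to an ML classifier), the sketch below derives crude cepstral coefficients from a waveform and fits a random-forest classifier on synthetic stand-in data. The feature extraction, the data, and all names here are hypothetical simplifications, not the paper's actual feature set or corpus.

```python
import numpy as np
from scipy.fft import dct
from sklearn.ensemble import RandomForestClassifier

def cepstral_features(signal, n_coeffs=13):
    # Crude cepstral coefficients: DCT of the log power spectrum.
    # A stand-in for the paper's prosodic/voice-quality/cepstral set.
    spectrum = np.abs(np.fft.rfft(signal)) ** 2 + 1e-10
    return dct(np.log(spectrum), norm='ortho')[:n_coeffs]

rng = np.random.default_rng(0)
# Synthetic stand-in data: 40 "speech samples" with binary dementia labels.
X = np.array([cepstral_features(rng.standard_normal(1024)) for _ in range(40)])
y = rng.integers(0, 2, size=40)

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
```

In the paper's setting, `X` would instead hold the compact prosodic, voice-quality and cepstral features extracted from Pitt-corpus recordings, with the ML model compared against a DL counterpart on accuracy, time, and memory.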