20 research outputs found
Interpreting intermediate feature representations of raw-waveform deep CNNs by sonification
Most recent work on the interpretability of raw-waveform deep neural networks (DNNs) for audio processing focuses on spectral and frequency-response information, often limited to visual and signal-theoretic means of interpretation and applied only to the first layer. This work proposes sonification, a method to interpret intermediate feature representations of sound event recognition (SER) 1D convolutional neural networks (1D-CNNs) trained on raw waveforms, by mapping these representations back into the discrete-time input signal domain and highlighting substructures in the input that maximally activate a feature map as intelligible acoustic events. Sonification is used to compare supervised and contrastive self-supervised feature representations, showing that the latter learn more acoustically discernible representations, especially in the deeper layers. A metric to quantify acoustic similarity between the interpretations and their corresponding inputs is proposed, and a layer-by-layer analysis of the trained feature representations using this metric supports these observations.
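Mapping an intermediate feature map back into the discrete-time input domain hinges on the receptive field of the stacked 1D convolutions. The sketch below illustrates that core idea only; it is not the paper's actual sonification procedure, and the max-activation heuristic plus the `kernel_sizes`/`strides` interface are illustrative assumptions.

```python
import numpy as np

def receptive_field_span(t, kernel_sizes, strides):
    # Map an output time index t at the deepest layer back to the
    # inclusive input-sample interval it depends on, walking the
    # conv stack from the deepest layer towards the input.
    start, end = t, t
    for k, s in zip(reversed(kernel_sizes), reversed(strides)):
        start = start * s
        end = end * s + (k - 1)
    return start, end

def max_activating_segment(waveform, feature_map, kernel_sizes, strides):
    # feature_map: 1D activations of one channel at the chosen layer.
    # Return the input substructure that drives its strongest response.
    t = int(np.argmax(feature_map))
    start, end = receptive_field_span(t, kernel_sizes, strides)
    return waveform[start:min(end + 1, len(waveform))]
```

The returned segment can be played back (or saved as audio) to inspect, as an acoustic event, which part of the input a feature map responds to.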
Analysing the consumer preference of fluid milk in province no. 2 of Nepal
Information is an asset for any industry. Some information, such as consumer preference, is hidden deep in the mind of the consumer and is difficult to access. Studies have shown that consumer preferences can be measured effectively, and such research may provide a deeper understanding of the choices consumers make when deciding between one offer and another. Milk is a major component of the diet for people around the globe, and the demand for milk and other dairy products is generally income elastic. The marketing of fluid milk differs from that of other consumer goods: demand for milk and milk products depends considerably on consumption patterns, food habits, geographical region, urbanization and lifestyle. This study analysed consumer preference for fluid milk in Province no. 2 of Nepal. Rautahat and Saptari districts from Province no. 2 were selected, with a total sample of 180 households, of which data from 159 households were usable. Consumer preference was analysed using tabular and percentage analysis, and Garrett's ranking technique was adopted to analyse the reasons households prefer fluid milk. The study showed that almost all households, irrespective of income and other socio-economic factors, preferred fluid milk. Nutritive value was the most important reason for this preference, followed by taste, quality, availability, price and satisfaction. Consumption of fluid milk was found to depend on several socio-economic factors such as education, income and gender. These differences in consumption behaviour provide important inferences for the marketing and promotion strategies of dairy and food products: different promotion strategies based on different consumption determinants may be necessary for effective marketing in a specific area.
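Garrett's ranking technique converts each respondent's rank for a reason into a percent position, 100 × (R − 0.5) / N, looks up the corresponding Garrett score, and ranks reasons by their mean score. The sketch below is a minimal illustration with hypothetical data; the normal-quantile mapping (scale factor 15.7) is an assumption standing in for Garrett and Woodworth's published conversion table, which actual studies use.

```python
from statistics import NormalDist

def garrett_percent_position(rank, n_factors):
    # Percent position for a rank R out of N factors: 100 * (R - 0.5) / N
    return 100.0 * (rank - 0.5) / n_factors

def garrett_score(percent):
    # Approximate the Garrett table with a standard-normal quantile
    # mapping: 50% -> score 50, extreme positions -> near 0 or 100.
    # The 15.7 scale is an assumption, not the published table.
    z = NormalDist().inv_cdf(percent / 100.0)
    return 50.0 - 15.7 * z

# Hypothetical responses: each row is one household ranking 3 reasons
# (1 = most important).
rankings = [
    {"nutritive value": 1, "taste": 2, "price": 3},
    {"nutritive value": 1, "taste": 3, "price": 2},
    {"taste": 1, "nutritive value": 2, "price": 3},
]
n = 3  # number of reasons ranked
scores = {}
for resp in rankings:
    for reason, rank in resp.items():
        pp = garrett_percent_position(rank, n)
        scores.setdefault(reason, []).append(garrett_score(pp))
mean_scores = {r: sum(v) / len(v) for r, v in scores.items()}
ranked = sorted(mean_scores, key=mean_scores.get, reverse=True)
print(ranked)  # reasons ordered by mean Garrett score
```

With the toy data above, "nutritive value" comes out on top, mirroring the study's headline finding.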
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners
In this work, we propose a Multi-Window Masked Autoencoder (MW-MAE) fitted with a novel Multi-Window Multi-Head Attention (MW-MHA) module that facilitates the modelling of local-global interactions in every decoder transformer block through attention heads with several distinct local and global windows. Empirical results on ten downstream audio tasks show that MW-MAEs consistently outperform standard MAEs in overall performance and learn better general-purpose audio representations, while also demonstrating considerably better scaling characteristics. Investigating attention distances and entropies reveals that MW-MAE encoders learn heads with broader local and global attention. Analysing attention-head feature representations through Projection Weighted Canonical Correlation Analysis (PWCCA) shows that attention heads with the same window sizes across the decoder layers of the MW-MAE learn correlated feature representations, enabling each block to independently capture local and global information and leading to a decoupled decoder feature hierarchy. Code for feature extraction and downstream experiments, along with pre-trained models, will be released publicly.
Masked Autoencoders with Multi-Window Attention Are Better Audio Learners
Several recent works have adapted Masked Autoencoders (MAEs) for learning general-purpose audio representations. However, they do not address two key aspects of modelling multi-domain audio data: (i) real-world audio tasks consist of a combination of local and global contexts, and (ii) real-world audio signals are complex compositions of several acoustic elements with different time-frequency characteristics. To address these concerns, this work proposes a Multi-Window Masked Autoencoder (MW-MAE) fitted with a novel Multi-Window Multi-Head Attention module that can capture information at multiple local and global contexts in every decoder transformer block through attention heads with several distinct local and global windows. Empirical results on ten downstream audio tasks show that MW-MAEs consistently outperform standard MAEs in overall performance and learn better general-purpose audio representations, as well as demonstrate considerably better scaling characteristics. Exploratory analyses of the learned representations reveal that MW-MAE encoders learn attention heads with more distinct entropies compared to those learned by MAEs, while attention heads across the different transformer blocks in MW-MAE decoders learn correlated feature representations, enabling each block to independently capture local and global information and leading to a decoupled feature hierarchy. Code for feature extraction and downstream experiments along with pre-trained weights can be found at https://github.com/10997NeurIPS23/10997_mwmae
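The core mechanism both abstracts describe is multi-head attention in which each head is restricted to its own local window (or left global). A minimal NumPy sketch of that idea follows; the masking scheme and function names are illustrative assumptions, not the repository's implementation.

```python
import numpy as np

def local_attention_mask(seq_len, window):
    # window=None -> global head (full attention); otherwise token i
    # may only attend to tokens j with |i - j| <= window // 2.
    if window is None:
        return np.ones((seq_len, seq_len), dtype=bool)
    idx = np.arange(seq_len)
    return np.abs(idx[:, None] - idx[None, :]) <= window // 2

def multi_window_attention(q, k, v, window_sizes):
    # q, k, v: (heads, seq, dim); one window size per head,
    # e.g. [3, 7, None] mixes two local heads with one global head.
    heads, seq, dim = q.shape
    out = np.empty_like(v)
    for h, w in enumerate(window_sizes):
        scores = q[h] @ k[h].T / np.sqrt(dim)
        scores = np.where(local_attention_mask(seq, w), scores, -np.inf)
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        out[h] = weights @ v[h]
    return out
```

Assigning a different window size per head is what lets a single decoder block combine fine-grained local context with global context in one pass.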
Federated learning enables big data for rare cancer boundary detection.
Although machine learning (ML) has shown promise across disciplines, out-of-sample generalizability is concerning. This is currently addressed by sharing multi-site data, but such centralization is challenging or infeasible to scale due to various limitations. Federated ML (FL) provides an alternative paradigm for accurate and generalizable ML by sharing only numerical model updates. Here we present the largest FL study to date, involving data from 71 sites across 6 continents, to generate an automatic tumor boundary detector for the rare disease of glioblastoma, reporting the largest such dataset in the literature (n = 6,314). We demonstrate a 33% delineation improvement for the surgically targetable tumor, and 23% for the complete tumor extent, over a publicly trained model. We anticipate our study to: 1) enable more healthcare studies informed by large and diverse data, ensuring meaningful results for rare diseases and underrepresented populations, 2) facilitate further analyses for glioblastoma by releasing our consensus model, and 3) demonstrate the effectiveness of FL at such scale and task complexity as a paradigm shift for multi-site collaborations, alleviating the need for data sharing.
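"Sharing only numerical model updates" typically means each site trains locally and a server aggregates the resulting weights, for example by a dataset-size-weighted average in the style of FedAvg. The sketch below shows that aggregation step only; the study's actual aggregation strategy may differ, and the dict-of-lists weight format is an illustrative assumption.

```python
def federated_average(site_weights, site_sizes):
    # site_weights: one dict {param_name: list_of_floats} per site.
    # site_sizes: number of local training samples per site, used to
    # weight each site's contribution (FedAvg-style aggregation).
    total = sum(site_sizes)
    aggregated = {}
    for name in site_weights[0]:
        n_params = len(site_weights[0][name])
        aggregated[name] = [
            sum(w[name][i] * n for w, n in zip(site_weights, site_sizes)) / total
            for i in range(n_params)
        ]
    return aggregated
```

Only these aggregated numbers travel between sites; the raw patient data never leaves its institution, which is the privacy argument the abstract makes.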
Author Correction: Federated learning enables big data for rare cancer boundary detection.
DOI: 10.1038/s41467-023-36188-7, Nature Communications, vol. 14