Search CORE

34 research outputs found

Unsupervised Learning of Semantic Audio Representations

Author: Ellis Daniel P. W.
Hershey Shawn
Jansen Aren
Liu Jiayang
Moore R. Channing
Pandya Ratheet
Plakal Manoj
Saurous Rif A.
Publication venue
Publication date: 06/11/2017
Field of study

Even in the absence of any explicit semantic annotation, vast collections of audio recordings provide valuable information for learning the categorical structure of sounds. We consider several class-agnostic semantic constraints that apply to unlabeled nonspeech audio: (i) noise and translations in time do not change the underlying sound category, (ii) a mixture of two sound events inherits the categories of the constituents, and (iii) the categories of events in close temporal proximity are likely to be the same or related. Without labels to ground them, these constraints are incompatible with classification loss functions. However, they may still be leveraged to identify geometric inequalities needed for triplet loss-based training of convolutional neural networks. The result is low-dimensional embeddings of the input spectrograms that recover 41% and 84% of the performance of their fully-supervised counterparts when applied to downstream query-by-example sound retrieval and sound event classification tasks, respectively. Moreover, in limited-supervision settings, our unsupervised embeddings double the state-of-the-art classification performance.Comment: Submitted to ICASSP 201

arXiv.org e-Print Archive

Crossref

CNN Architectures for Large-Scale Audio Classification

Author: Chaudhuri Sourish
Ellis Daniel P. W.
Gemmeke Jort F.
Hershey Shawn
Jansen Aren
Moore R. Channing
Plakal Manoj
Platt Devin
Saurous Rif A.
Seybold Bryan
Slaney Malcolm
Weiss Ron J.
Wilson Kevin
Publication venue
Publication date: 10/01/2017
Field of study

Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 video-level labels. We examine fully connected Deep Neural Networks (DNNs), AlexNet [1], VGG [2], Inception [3], and ResNet [4]. We investigate varying the size of both training set and label vocabulary, finding that analogs of the CNNs used in image classification do well on our audio classification task, and larger training and label sets help up to a point. A model using embeddings from these classifiers does much better than raw features on the Audio Set [5] Acoustic Event Detection (AED) classification task.Comment: Accepted for publication at ICASSP 2017 Changes: Added definitions of mAP, AUC, and d-prime. Updated mAP/AUC/d-prime numbers for Audio Set based on changes of latest Audio Set revision. Changed wording to fit 4 page limit with new addition

arXiv.org e-Print Archive

Crossref

Receptor-Mediated Gonadotropin Action in Ovary

Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/66088/1/j.1432-1033.1981.tb05481.x.pd

Crossref

Deep Blue Documents at the University of Michigan

Recommended from our members

The bii4africa dataset of faunal and floral population intactness estimates across Africa’s major land uses

Author: Abdoulaye Diarrassouba
Aebischer Thierry
Aguirre-Gutiérrez Jesús
Alexander Graham J.
Ali Abdullahi H.
Allan David G.
Amoako Esther E.
Angedakin Samuel
Aruna Edward
Avenant Nico L.
Badjedjea Gabriel
Bakayoko Adama
Bamba-kaya Abraham
Bates Michael F.
Bates Paul J. J.
Belmain Steven R.
Bennitt Emily
Biggs Reinette
Blanchard Ryan
Bonginkosi C. Gumbi Bonginkosi C.
Bradley James
Brewster Chris A.
Brown Michael B.
Brown Michelle
Bryja Josef
Butynski Thomas M.
Carvalho Filipe
Channing Alan
Chapman Colin A.
Child Matthew
Clements Hayley S.
Cohen Callan
Cords Marina
Cramer Jennifer D.
Cronk Nadine
Cunneyworth Pamela M. K.
Dalerum Fredrik
Danquah Emmanuel
Davies-Mostert Harriet T.
de Blocq Andrew D.
De Jong Yvonne A.
De Vos Alta
Demos Terrence C.
Denys Christiane
Djagoun Chabi A. M. S.
Do Linh San Emmanuel
Doherty-Bone Thomas M.
Drouilly Marine
du Toit Johan T.
Ehlers Smith David A.
Ehlers Smith Yvette C.
Eiseb Seth J.
Esler Karen J.
Fashing Peter J.
Ferguson Adam W.
Fernández-García José M.
Finckh Manfred
Fischer Claude
Gandiwa Edson
Gaubert Philippe
Gaugris Jerome Y.
Gibbs Dalton J.
Gil-Sánchez Jose M.
Gilchrist Jason S.
Githitho Anthony N.
Goodman Peter S.
Granjon Laurent
Gvozdik Vaclav
Hamann Maike
Harvey James
Hauptfleisch Morgan
Hayder Firas
Hema Emmanuel M.
Hempson Gareth
Herbst Marna
Houngbédji Mariano
Huntley Brian J.
Hutterer Rainer
Ivande Samuel T.
J. Paul Grobler J. Paul
Jackson Kate
Jongsma Gregory F. M.
Juste Javier
Kadjo Blaise
Kaleme Prince K.
Kamugisha Edwin
Kaplin Beth A.
Kato Humphrey N.
Kiffner Christian
Kimuyu Duncan M.
Kityo Robert M.
Kouamé N’goran G.
Kouete T. Marcel
le Roux Aliza
Lee Alan T. K.
Linden Birthe
Loft Ty
Lykke Anne Mette
Lötter Mervyn C.
MacFadyen Duncan N.
Macharia Gacheru P.
Madikiza Zimkitha J. K.
Mahlaba Themb’alilahlwa A. M.
Mallon David
Mamba Mnqobi L.
Mande Claude
Marchant Rob A.
Maritz Bryan
Maritz Robin A.
Markotter Wanda
McIntyre Trevor
Measey John
Mekonnen Addisu
Meller Paulina
Melville Haemish I.
Mganga Kevin Z.
Mills Michael G. L.
Minnie Liaan
Missoup Alain Didier
Mohammad Abubakr
Moinde Nancy N.
Moise Bakwo Fils E.
Monadjem Ara
Monterroso Pedro
Moore Jennifer F.
Musila Simon
Nago Sedjro Gilles A.
Namoto Maganizo W.
Niang Fatimata
Nicolas Violaine
Nkenku Jerry B.
Nkrumah Evans E.
Nono Gonwouo L.
Norbert Mulavwa M.
Nowak Katarzyna
Obitte Benneth C.
Okoni-Williams Arnold D.
Onongo Jonathan
Osinubi Samuel T.
O’Riain M. Justin
Parker Daniel M.
Parrini Francesca
Peel Mike J. S.
Penner Johannes
Pietersen Darren W.
Plumptre Andrew J.
Ponsonby Damian W.
Porembski Stefan
Power R. John
Radloff Frans G. T.
Rambau Ramugondo V.
Ramesh Tharmalingam
Reyers Belinda
Reynolds Chevonne
Richards Leigh R.
Rollinson Dominic P.
Rovero Francesco
Rödel Mark-Oliver
Saleh Mostafa A.
Schmiedel Ute
Schoeman M. Corrie
Scholte Paul
Selomane Odirilwe
Serfass Thomas L.
Shapiro Julie Teresa
Shema Sidney
Siebert Frances
Siebert Stefan J.
Skowno Andrew L.
Slingsby Jasper A.
Sliwa Alexander
Smit-Robinson Hanneline A.
Sogbohossou Etotepe A.
Somers Michael J.
Spawls Stephen
Stevens Nicola
Streicher Jarryd P.
Swanepoel Lourens
Tanshi Iroro
Taylor Peter J.
Taylor William A.
te Beest Mariska
Telfer Paul T.
Thompson Dave I.
Tobi Elie
Tolley Krystal A.
Tshoke Tshegofatso
Turner Andrew A.
Twine Wayne
Van Cakenberghe Victor
Van de Perre Frederik
van der Merwe Helga
van Niekerk Chris J. G.
van Wyk Pieter C. V.
Venter Jan A.
Verburgt Luke
Veron Geraldine
Vetter Susanne
Vorontsova Maria S.
Wagner Thomas C.
Webala Paul W.
Weber Natalie
Weier Sina M.
White Paula A.
Whitecross Melissa A.
Wigley Benjamin J.
Willems Frank J.
Winterbach Christiaan W.
Woodhouse Galena M.
Publication venue: Nature Research
Publication date: 01/01/2024
Field of study

Sub-Saharan Africa is under-represented in global biodiversity datasets, particularly regarding the impact of land use on species’ population abundances. Drawing on recent advances in expert elicitation to ensure data consistency, 200 experts were convened using a modified-Delphi process to estimate ‘intactness scores’: the remaining proportion of an ‘intact’ reference population of a species group in a particular land use, on a scale from 0 (no remaining individuals) to 1 (same abundance as the reference) and, in rare cases, to 2 (populations that thrive in human-modified landscapes). The resulting bii4africa dataset contains intactness scores representing terrestrial vertebrates (tetrapods: ±5,400 amphibians, reptiles, birds, mammals) and vascular plants (±45,000 forbs, graminoids, trees, shrubs) in sub-Saharan Africa across the region’s major land uses (urban, cropland, rangeland, plantation, protected, etc.) and intensities (e.g., large-scale vs smallholder cropland). This dataset was co-produced as part of the Biodiversity Intactness Index for Africa Project. Additional uses include assessing ecosystem condition; rectifying geographic/ taxonomic biases in global biodiversity indicators and maps; and informing the Red List of Ecosystems

Greenwich Academic Literature Archive

Directory of Open Access Journals

Enlighten

Repository@Napier

Noise-invariant neurons in the avian auditory cortex: hearing the song in noise.

Author: Moore R Channing,
Publication venue
Publication date: 15/05/2020
Field of study

Ezid

Noise-invariant Neurons in the Avian Auditory Cortex: Hearing the Song in Noise

Author: Frédéric E. Theunissen (5651095)
R. Channing Moore (386311)
Tyler Lee (386312)
Publication venue
Publication date: 01/01/2013
Field of study

<div>Given the extraordinary ability of humans and animals to recognize communication signals over a background of noise, describing noise invariant neural responses is critical not only to pinpoint the brain regions that are mediating our robust perceptions but also to understand the neural computations that are performing these tasks and the underlying circuitry. Although invariant neural responses, such as rotation-invariant face cells, are well described in the visual system, high-level auditory neurons that can represent the same behaviorally relevant signal in a range of listening conditions have yet to be discovered. Here we found neurons in a secondary area of the avian auditory cortex that exhibit noise-invariant responses in the sense that they responded with similar spike patterns to song stimuli presented in silence and over a background of naturalistic noise. By characterizing the neurons' tuning in terms of their responses to modulations in the temporal and spectral envelope of the sound, we then show that noise invariance is partly achieved by selectively responding to long sounds with sharp spectral structure. Finally, to demonstrate that such computations could explain noise invariance, we designed a biologically inspired noise-filtering algorithm that can be used to separate song or speech from noise. This novel noise-filtering method performs as well as other state-of-the-art de-noising algorithms and could be used in clinical or consumer oriented applications. Our biologically inspired model also shows how high-level noise-invariant responses could be created from neural responses typically found in primary auditory cortex. </div

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

MPG.PuRe

FigShare

Model STRFs for noise reduction.

Author: Frédéric E. Theunissen (5651095)
R. Channing Moore (386311)
Tyler Lee (386312)
Publication venue
Publication date
Field of study

A. The eight most positively (top) and most negatively (bottom) weighted STRFs from the noise reduction algorithm trained with a background of colony noise. B, Same as in A, but for the model trained with a background of modulation-limited noise. C. The ensemble modulation transfer functions for the top 16 and bottom 16 STRFs for the model trained in colony noise, sorted as in A. D Same as in C, but for the model trained in modulation-limited noise.</p

FigShare