216 research outputs found

    Multi-Label Classifier Chains for Bird Sound

    Full text link
    Bird sound data collected with unattended microphones for automatic surveys, or mobile devices for citizen science, typically contain multiple simultaneously vocalizing birds of different species. However, few works have considered the multi-label structure in birdsong. We propose to use an ensemble of classifier chains combined with a histogram-of-segments representation for multi-label classification of birdsong. The proposed method is compared with binary relevance and three multi-instance multi-label learning (MIML) algorithms from prior work (which focus more on structure in the sound, and less on structure in the label sets). Experiments are conducted on two real-world birdsong datasets, and show that the proposed method usually outperforms binary relevance (using the same features and base-classifier), and is better in some cases and worse in others compared to the MIML algorithms.Comment: 6 pages, 1 figure, submission to ICML 2013 workshop on bioacoustics. Note: this is a minor revision- the blind submission format has been replaced with one that shows author names, and a few corrections have been mad

    New tools for old questions: studying vocal communication in the Zebra Finch (Taeniopygia guttata).

    Get PDF
    The Adult Zebra finches (Taeniopygia guttata) have a crystallised song and different types of calls. However, the exact number of the calls and their function is not completely understood. The pattern of calls might be associated with a specific context and with the kind of relationships between two interacting birds. The aim of this project is: to test the correlation between an experimentally controlled context and the pattern of the calls-songs elicited. The acoustic signals produced by the same pairs of Zebra finches exposed to three different conditions were recorded. Each pair of birds was first kept in a small sound box, then two couples were placed together in a larger aviary and finally nest material was added. The birds were equipped with a miniaturised microphones tied on their back, in order to ascertain the identity of the bird emitting the sound. Video recording was used to correlate the birds’ behaviour with vocalizations. The males were implanted with an electrode suitable for Local Field Potential (LFP) recording placed in Nucleus Robustus of Arcopallium (RA). This nucleus is involved in the modulation of the learned features of songs and calls and in perceptual processing. Quantitative analysis of temporal association between individual calls reveals that are used in bidirectional communication: precise patterns of association of calls are established into the pair. The type of relationship existing between two birds, for instance “members of a couple” or “dominance hierarchy between males”, and the environmental context, for example “being in a favourable breeding condition” are likely to be described by patterns of temporal associations of calls combinations. It was possible to describe the change of activity of the RA during songs\calls production through the analysis of the LFP signal. Moreover the LFP showed a repeatable signal after several days, this demonstrate the suitability of this device for studying the development of long processes, for instance song learning

    Acoustic sequences in non-human animals: a tutorial review and prospectus.

    Get PDF
    Animal acoustic communication often takes the form of complex sequences, made up of multiple distinct acoustic units. Apart from the well-known example of birdsong, other animals such as insects, amphibians, and mammals (including bats, rodents, primates, and cetaceans) also generate complex acoustic sequences. Occasionally, such as with birdsong, the adaptive role of these sequences seems clear (e.g. mate attraction and territorial defence). More often however, researchers have only begun to characterise - let alone understand - the significance and meaning of acoustic sequences. Hypotheses abound, but there is little agreement as to how sequences should be defined and analysed. Our review aims to outline suitable methods for testing these hypotheses, and to describe the major limitations to our current and near-future knowledge on questions of acoustic sequences. This review and prospectus is the result of a collaborative effort between 43 scientists from the fields of animal behaviour, ecology and evolution, signal processing, machine learning, quantitative linguistics, and information theory, who gathered for a 2013 workshop entitled, 'Analysing vocal sequences in animals'. Our goal is to present not just a review of the state of the art, but to propose a methodological framework that summarises what we suggest are the best practices for research in this field, across taxa and across disciplines. We also provide a tutorial-style introduction to some of the most promising algorithmic approaches for analysing sequences. We divide our review into three sections: identifying the distinct units of an acoustic sequence, describing the different ways that information can be contained within a sequence, and analysing the structure of that sequence. Each of these sections is further subdivided to address the key questions and approaches in that area. We propose a uniform, systematic, and comprehensive approach to studying sequences, with the goal of clarifying research terms used in different fields, and facilitating collaboration and comparative studies. Allowing greater interdisciplinary collaboration will facilitate the investigation of many important questions in the evolution of communication and sociality.This review was developed at an investigative workshop, “Analyzing Animal Vocal Communication Sequences” that took place on October 21–23 2013 in Knoxville, Tennessee, sponsored by the National Institute for Mathematical and Biological Synthesis (NIMBioS). NIMBioS is an Institute sponsored by the National Science Foundation, the U.S. Department of Homeland Security, and the U.S. Department of Agriculture through NSF Awards #EF-0832858 and #DBI-1300426, with additional support from The University of Tennessee, Knoxville. In addition to the authors, Vincent Janik participated in the workshop. D.T.B.’s research is currently supported by NSF DEB-1119660. M.A.B.’s research is currently supported by NSF IOS-0842759 and NIH R01DC009582. M.A.R.’s research is supported by ONR N0001411IP20086 and NOPP (ONR/BOEM) N00014-11-1-0697. S.L.DeR.’s research is supported by the U.S. Office of Naval Research. R.F.-i-C.’s research was supported by the grant BASMATI (TIN2011-27479-C04-03) from the Spanish Ministry of Science and Innovation. E.C.G.’s research is currently supported by a National Research Council postdoctoral fellowship. E.E.V.’s research is supported by CONACYT, Mexico, award number I010/214/2012.This is the accepted manuscript. The final version is available at http://dx.doi.org/10.1111/brv.1216
    • 

    corecore