1,038 research outputs found
Efficient Supervised Training of Audio Transformers for Music Representation Learning
In this work, we address music representation learning using convolution-free
transformers. We build on top of existing spectrogram-based audio transformers
such as AST and train our models on a supervised task using patchout training
similar to PaSST. In contrast to previous works, we study how specific design
decisions affect downstream music tagging tasks instead of focusing on the
training task. We assess the impact of initializing the models with different
pre-trained weights, using various input audio segment lengths, using learned
representations from different blocks and tokens of the transformer for
downstream tasks, and applying patchout at inference to speed up feature
extraction. We find that 1) initializing the model from ImageNet or AudioSet
weights and using longer input segments are beneficial both for the training
and downstream tasks, 2) the best representations for the considered downstream
tasks are located in the middle blocks of the transformer, and 3) using
patchout at inference allows faster processing than our convolutional baselines
while maintaining superior performance. The resulting models, MAEST, are
publicly available and obtain the best performance among open models in music
tagging tasks.Comment: Accepted at the 2023 International Society for Music Information
Retrieval Conference (ISMIR'23
INDUSTRIAL CLUSTERS IN THE DEVELOPING COUNTRIES: A CASE STUDY OF THE CHILE FISH CLUSTER
The issue of existence of an industrial cluster should be correlated with the stage of life cycle of basic industry production. The technological cycle of some industries lasts decades, and the life cycle of industrial cluster can be shorter. While developing consequent stages of the life cycle of the cluster it is necessary to consider the factor of availability of resources or the duration of exploitation, created for functioning of this sector of infrastructure.
The development of practically all industrial clusters took a lot of time, the development of the Russian clusters is calculated for 5–10 years. In the world practice there are no cases of creating efficient industrial companies in the developing countries over such periods. There is often a need to use ready objects of infrastructure, but even in this case it may need renovation. Therefore, at the stage of formation, it is desirable to carry out the minimum number of investments that are mainly associated with the creation of the necessary infrastructure and complementary set of institutional conditions that maximize the use of existing infrastructure
VALUE: EMPIRICS AND THEORY
Disclosure of the essence of the category "value" is not a prerequisite for its use: the introduction of this concept is correct and simple a priori. At the same time, it is obviously desirable. Objectives: 1) to show the connection between the concept of "value" and the empirical level; 2)to justify the interpretation of value from the perspective of information theory. Results. Manifestations of the "value" category in economic practice: the notions of "fair price", "reasonable profit", statistics of input-output balances, the world practice of planning of long-term energy production projects, etc. An adequate interpretation of value is the information concept: at the basis of value as the result of labor, and the rarity of a thing is information. Value is an information measure of the object’s worth. The complexity of operationalization does not ensue the unscientific (metaphysical) character of the concept of "value" as such. From the recognition of value as an objective basis for the observable price phenomenon, there appear very specific consequences: 1) feedback through the market must have an objective basis as a starting measure, that is, money must have a standard; 2) in some cases direct pricing (and / or their directive definition) is justified and appropriate
How Low Can You Go? Reducing Frequency and Time Resolution in Current CNN Architectures for Music Auto-tagging
Automatic tagging of music is an important research topic in Music
Information Retrieval and audio analysis algorithms proposed for this task have
achieved improvements with advances in deep learning. In particular, many
state-of-the-art systems use Convolutional Neural Networks and operate on
mel-spectrogram representations of the audio. In this paper, we compare
commonly used mel-spectrogram representations and evaluate model performances
that can be achieved by reducing the input size in terms of both lesser amount
of frequency bands and larger frame rates. We use the MagnaTagaTune dataset for
comprehensive performance comparisons and then compare selected configurations
on the larger Million Song Dataset. The results of this study can serve
researchers and practitioners in their trade-off decision between accuracy of
the models, data storage size and training and inference times.Comment: The 28th European Signal Processing Conference (EUSIPCO
Bad Directions in Cryptographic Hash Functions
A 25-gigabyte "point obfuscation" challenge "using security parameter 60" was announced at the Crypto 2015 rump session; "point obfuscation" is another name for password hashing. This paper shows that the particular matrix-multiplication hash function used in the challenge is much less secure than previous password-hashing functions are believed to be. This paper's attack algorithm broke the challenge in just 19 minutes using a cluster of 21 PCs. Keywords: symmetric cryptography, hash functions, password hashing, point obfuscation, matrix multiplication, meet-in-the-middle attacks, meet-in-many-middles attack
Pre-Training Strategies Using Contrastive Learning and Playlist Information for Music Classification and Similarity
In this work, we investigate an approach that relies on contrastive learning
and music metadata as a weak source of supervision to train music
representation models. Recent studies show that contrastive learning can be
used with editorial metadata (e.g., artist or album name) to learn audio
representations that are useful for different classification tasks. In this
paper, we extend this idea to using playlist data as a source of music
similarity information and investigate three approaches to generate anchor and
positive track pairs. We evaluate these approaches by fine-tuning the
pre-trained models for music multi-label classification tasks (genre, mood, and
instrument tagging) and music similarity. We find that creating anchor and
positive track pairs by relying on co-occurrences in playlists provides better
music similarity and competitive classification results compared to choosing
tracks from the same artist as in previous works. Additionally, our best
pre-training approach based on playlists provides superior classification
performance for most datasets.Comment: Accepted at the 2023 International Conference on Acoustics, Speech,
and Signal Processing (ICASSP'23
Function of the ribosomal E-site: a mutagenesis study
Ribosomes synthesize proteins according to the information encoded in mRNA. During this process, both the incoming amino acid and the nascent peptide are bound to tRNA molecules. Three binding sites for tRNA in the ribosome are known: the A-site for aminoacyl-tRNA, the P-site for peptidyl-tRNA and the E-site for the deacylated tRNA leaving the ribosome. Here, we present a study of Escherichia coli ribosomes with the E-site binding destabilized by mutation C2394G of the 23S rRNA. Expression of the mutant 23S rRNA in vivo caused increased frameshifting and stop codon readthrough. The progression of these ribosomes through the ribosomal elongation cycle in vitro reveals ejection of deacylated tRNA during the translocation step or shortly after. E-site compromised ribosomes can undergo translocation, although in some cases it is less efficient and results in a frameshift. The mutation affects formation of the P/E hybrid site and leads to a loss of stimulation of the multiple turnover GTPase activity of EF-G by deacylated tRNA bound to the ribosome
- …