31 research outputs found
Harmonic analysis of music using combinatory categorial grammar
FP7 grant 249520 (GRAMPLUS)Various patterns of the organization of Western tonal music exhibit hierarchical structure,
among them the harmonic progressions underlying melodies and the metre underlying
rhythmic patterns. Recognizing these structures is an important part of unconscious
human cognitive processing of music. Since the prosody and syntax of natural
languages are commonly analysed with similar hierarchical structures, it is reasonable
to expect that the techniques used to identify these structures automatically in natural
language might also be applied to the automatic interpretation of music.
In natural language processing (NLP), analysing the syntactic structure of a sentence
is prerequisite to semantic interpretation. The analysis is made difficult by the
high degree of ambiguity in even moderately long sentences. In music, a similar sort of
structural analysis, with a similar degree of ambiguity, is fundamental to tasks such as
key identification and score transcription. These and other tasks depend on harmonic
and rhythmic analyses. There is a long history of applying linguistic analysis techniques
to musical analysis. In recent years, statistical modelling, in particular in the
form of probabilistic models, has become ubiquitous in NLP for large-scale practical
analysis of language. The focus of the present work is the application of statistical
parsing to automatic harmonic analysis of music.
This thesis demonstrates that statistical parsing techniques, adapted from NLP with
little modification, can be successfully applied to recovering the harmonic structure
underlying music. It shows first how a type of formal grammar based on one used
for linguistic syntactic processing, Combinatory Categorial Grammar (CCG), can be
used to analyse the hierarchical structure of chord sequences. I introduce a formal
language similar to first-order predicate logical to express the hierarchical tonal harmonic
relationships between chords. The syntactic grammar formalism then serves as
a mechanism to map an unstructured chord sequence onto its structured analysis.
In NLP, the high degree of ambiguity of the analysis means that a parser must
consider a huge number of possible structures. Chart parsing provides an efficient
mechanism to explore them. Statistical models allow the parser to use information
about structures seen before in a training corpus to eliminate improbable interpretations
early on in the process and to rank the final analyses by plausibility. To apply the
same techniques to harmonic analysis of chord sequences, a corpus of tonal jazz chord
sequences annotated by hand with harmonic analyses is constructed. Two statistical
parsing techniques are adapted to the present task and evaluated on their success at recovering the annotated structures. The experiments show that parsing using a statistical
model of syntactic derivations is more successful than a Markovian baseline
model at recovering harmonic structure. In addition, the practical technique of statistical
supertagging serves to speed up parsing without any loss in accuracy.
This approach to recovering harmonic structure can be extended to the analysis of
performance data symbolically represented as notes. Experiments using some simple
proof-of-concept extensions of the above parsing models demonstrate one probabilistic
approach to this. The results reported provide a baseline for future work on the task of
harmonic analysis of performances
A Comparison of Unsupervised Methods for Ad hoc Cross-Lingual Document Retrieval
We address the problem of linking related documents across languages in a multilingual collection. We evaluate three diverse unsupervised methods to represent and compare documents: (1) multilingual topic model; (2) cross-lingual document embeddings; and (3) Wasserstein distance. We test the performance of these methods in retrieving news articles in Swedish that are known to be related to a given Finnish article. The results show that ensembles of the methods outperform the stand-alone methods, suggesting that they capture complementary characteristics of the documents.Peer reviewe
Multilingual Dynamic Topic Model
Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current DTMs are applicable only to monolingual datasets. In this paper we present the multilingual dynamic topic model (ML-DTM), a novel topic model that combines DTM with an existing multilingual topic modeling method to capture crosslingual topics that evolve across time. We present results of this model on a parallel German-English corpus of news articles and a comparable corpus of Finnish and Swedish news articles. We demonstrate the capability of ML-DTM to track significant events related to a topic and show that it finds distinct topics and performs as well as existing multilingual topic models in aligning cross-lingual topics.Peer reviewe
Silo NLP's Participation at WAT2022
This paper provides the system description of "Silo NLP's" submission to the Workshop on Asian Translation (WAT2022). We have participated in the Indic Multimodal tasks (English->Hindi, English->Malayalam, and English->Bengali Multimodal Translation). For text-only translation, we trained Transformers from scratch and fine-tuned mBART-50 models. For multimodal translation, we used the same mBART architecture and extracted object tags from the images to use as visual features concatenated with the text sequence. Our submission tops many tasks including English->Hindi multimodal translation (evaluation test), English->Malayalam text-only and multimodal translation (evaluation test), English->Bengali multimodal translation (challenge test), and English->Bengali text-only translation (evaluation test).Peer reviewe
Silo NLP's Participation at WAT2022
This paper provides the system description of "Silo NLP's" submission to the Workshop on Asian Translation (WAT2022). We have participated in the Indic Multimodal tasks (English->Hindi, English->Malayalam, and English->Bengali Multimodal Translation). For text-only translation, we trained Transformers from scratch and fine-tuned mBART-50 models. For multimodal translation, we used the same mBART architecture and extracted object tags from the images to use as visual features concatenated with the text sequence. Our submission tops many tasks including English->Hindi multimodal translation (evaluation test), English->Malayalam text-only and multimodal translation (evaluation test), English->Bengali multimodal translation (challenge test), and English->Bengali text-only translation (evaluation test).Peer reviewe
Meta4meaning : Automatic Metaphor Interpretation Using Corpus-Derived Word Associations
We propose a novel metaphor interpretation method, Meta4meaning. It provides interpretations for nominal metaphors by generating a list of properties that the metaphor expresses. Meta4meaning uses word associations extracted from a corpus to retrieve an approximation to properties of concepts. Interpretations are then obtained as an aggregation or difference of the saliences of the properties to the tenor and the vehicle. We evaluate Meta4meaning using a set of human-annotated interpretations of 84 metaphors and compare with two existing methods for metaphor interpretation. Meta4meaning significantly outperforms the previous methods on this task.Peer reviewe
Personal Research Assistant for Online Exploration of Historical News
Demostration paperWe present a novel environment for exploratory search in large collections of historical newspapers developed as a part of the News- Eye project. In this paper we focus on the intelligent Personal Research Assistant (PRA) component in the environment and the web interface. The PRA is an interactive exploratory engine that combines results of various text analysis tools in an unsupervised fashion to conduct au- tonomous investigations on the data according to usersâ needs. The PRA is freely available online together with some datasets of European his- torical newspapers. The methods used by the assistant are of potential benefit to other exploratory search applications.Peer reviewe
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification
In the investment industry, it is often essential to carry out fine-grained
company similarity quantification for a range of purposes, including market
mapping, competitor analysis, and mergers and acquisitions. We propose and
publish a knowledge graph, named CompanyKG, to represent and learn diverse
company features and relations. Specifically, 1.17 million companies are
represented as nodes enriched with company description embeddings; and 15
different inter-company relations result in 51.06 million weighted edges. To
enable a comprehensive assessment of methods for company similarity
quantification, we have devised and compiled three evaluation tasks with
annotated test sets: similarity prediction, competitor retrieval and similarity
ranking. We present extensive benchmarking results for 11 reproducible
predictive methods categorized into three groups: node-only, edge-only, and
node+edge. To the best of our knowledge, CompanyKG is the first large-scale
heterogeneous graph dataset originating from a real-world investment platform,
tailored for quantifying inter-company similarity.Comment: Paper (13 pages, 5 figures and 2 tables) + Appendix (18 pages, 4
figures and 5 tables