Search CORE

31 research outputs found

Harmonic analysis of music using combinatory categorial grammar

Author: Granroth-Wilding Mark Thomas
Wilding Mark Thomas Granroth
Publication venue: The University of Edinburgh
Publication date: 28/11/2013
Field of study

FP7 grant 249520 (GRAMPLUS)Various patterns of the organization of Western tonal music exhibit hierarchical structure, among them the harmonic progressions underlying melodies and the metre underlying rhythmic patterns. Recognizing these structures is an important part of unconscious human cognitive processing of music. Since the prosody and syntax of natural languages are commonly analysed with similar hierarchical structures, it is reasonable to expect that the techniques used to identify these structures automatically in natural language might also be applied to the automatic interpretation of music. In natural language processing (NLP), analysing the syntactic structure of a sentence is prerequisite to semantic interpretation. The analysis is made difficult by the high degree of ambiguity in even moderately long sentences. In music, a similar sort of structural analysis, with a similar degree of ambiguity, is fundamental to tasks such as key identification and score transcription. These and other tasks depend on harmonic and rhythmic analyses. There is a long history of applying linguistic analysis techniques to musical analysis. In recent years, statistical modelling, in particular in the form of probabilistic models, has become ubiquitous in NLP for large-scale practical analysis of language. The focus of the present work is the application of statistical parsing to automatic harmonic analysis of music. This thesis demonstrates that statistical parsing techniques, adapted from NLP with little modification, can be successfully applied to recovering the harmonic structure underlying music. It shows first how a type of formal grammar based on one used for linguistic syntactic processing, Combinatory Categorial Grammar (CCG), can be used to analyse the hierarchical structure of chord sequences. I introduce a formal language similar to first-order predicate logical to express the hierarchical tonal harmonic relationships between chords. The syntactic grammar formalism then serves as a mechanism to map an unstructured chord sequence onto its structured analysis. In NLP, the high degree of ambiguity of the analysis means that a parser must consider a huge number of possible structures. Chart parsing provides an efficient mechanism to explore them. Statistical models allow the parser to use information about structures seen before in a training corpus to eliminate improbable interpretations early on in the process and to rank the final analyses by plausibility. To apply the same techniques to harmonic analysis of chord sequences, a corpus of tonal jazz chord sequences annotated by hand with harmonic analyses is constructed. Two statistical parsing techniques are adapted to the present task and evaluated on their success at recovering the annotated structures. The experiments show that parsing using a statistical model of syntactic derivations is more successful than a Markovian baseline model at recovering harmonic structure. In addition, the practical technique of statistical supertagging serves to speed up parsing without any loss in accuracy. This approach to recovering harmonic structure can be extended to the analysis of performance data symbolically represented as notes. Experiments using some simple proof-of-concept extensions of the above parsing models demonstrate one probabilistic approach to this. The results reported provide a baseline for future work on the task of harmonic analysis of performances

Edinburgh Research Archive

Harmonic Analysis of Jazz MIDI Files Using Statistical Parsing

Author: Granroth-Wilding Mark
Steedman Mark
Publication venue
Publication date: 01/01/2012
Field of study

Edinburgh Research Explorer

A Comparison of Unsupervised Methods for Ad hoc Cross-Lingual Document Retrieval

Author: Granroth-Wilding Mark
Pivovarova Lidia
Zosa Elaine
Publication venue: European Language Resources Association (ELRA)
Publication date: 03/05/2020
Field of study

We address the problem of linking related documents across languages in a multilingual collection. We evaluate three diverse unsupervised methods to represent and compare documents: (1) multilingual topic model; (2) cross-lingual document embeddings; and (3) Wasserstein distance. We test the performance of these methods in retrieving news articles in Swedish that are known to be related to a given Finnish article. The results show that ensembles of the methods outperform the stand-alone methods, suggesting that they capture complementary characteristics of the documents.Peer reviewe

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Helsingin yliopiston digitaalinen arkisto

Multilingual Dynamic Topic Model

Author: Granroth-Wilding Mark
Zosa Elaine
Publication venue: INCOMA
Publication date: 01/09/2019
Field of study

Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current DTMs are applicable only to monolingual datasets. In this paper we present the multilingual dynamic topic model (ML-DTM), a novel topic model that combines DTM with an existing multilingual topic modeling method to capture crosslingual topics that evolve across time. We present results of this model on a parallel German-English corpus of news articles and a comparable corpus of Finnish and Swedish news articles. We demonstrate the capability of ML-DTM to track significant events related to a topic and show that it finds distinct topics and performs as well as existing multilingual topic models in aligning cross-lingual topics.Peer reviewe

Crossref

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Helsingin yliopiston digitaalinen arkisto

Cross-Family Similarity Learning for Cognate Identification in Low-Resource Languages

Author: Granroth-Wilding Mark
Soisalon-Soininen Eliel
Publication venue: INCOMA
Publication date: 04/09/2019
Field of study

Peer reviewe

Crossref

Helsingin yliopiston digitaalinen arkisto

Silo NLP's Participation at WAT2022

Author: Granroth-Wilding Mark
Grönroos Stig-Arne
Koistinen Mika
Panda Subhadarshi
Parida Shantipriya
Publication venue: COLING
Publication date: 17/10/2022
Field of study

This paper provides the system description of "Silo NLP's" submission to the Workshop on Asian Translation (WAT2022). We have participated in the Indic Multimodal tasks (English->Hindi, English->Malayalam, and English->Bengali Multimodal Translation). For text-only translation, we trained Transformers from scratch and fine-tuned mBART-50 models. For multimodal translation, we used the same mBART architecture and extracted object tags from the images to use as visual features concatenated with the text sequence. Our submission tops many tasks including English->Hindi multimodal translation (evaluation test), English->Malayalam text-only and multimodal translation (evaluation test), English->Bengali multimodal translation (challenge test), and English->Bengali text-only translation (evaluation test).Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Silo NLP's Participation at WAT2022

Author: Granroth-Wilding Mark
Grönroos Stig-Arne
Koistinen Mika
Panda Subhadarshi
Parida Shantipriya
Publication venue: COLING
Publication date: 02/08/2022
Field of study

arXiv.org e-Print Archive

Helsingin yliopiston digitaalinen arkisto

Meta4meaning : Automatic Metaphor Interpretation Using Corpus-Derived Word Associations

Author: Agres Kat
Alnajjar Khalid
Granroth-Wilding Mark
Toivonen Hannu
Xiao Ping
Publication venue: Sony CSL Paris
Publication date: 01/01/2016
Field of study

We propose a novel metaphor interpretation method, Meta4meaning. It provides interpretations for nominal metaphors by generating a list of properties that the metaphor expresses. Meta4meaning uses word associations extracted from a corpus to retrieve an approximation to properties of concepts. Interpretations are then obtained as an aggregation or difference of the saliences of the properties to the tenor and the vehicle. We evaluate Meta4meaning using a set of human-annotated interpretations of 84 metaphors and compare with two existing methods for metaphor interpretation. Meta4meaning significantly outperforms the previous methods on this task.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Personal Research Assistant for Online Exploration of Historical News

Author: Alnajjar Khalid
Avikainen Jari
Granroth-Wilding Mark
Jean-Caurant Axel
Leppänen Leo
Pivovarova Lidia
Toivonen Hannu
Zosa Elaine
Publication venue: Springer
Publication date: 01/01/2020
Field of study

Demostration paperWe present a novel environment for exploratory search in large collections of historical newspapers developed as a part of the News- Eye project. In this paper we focus on the intelligent Personal Research Assistant (PRA) component in the environment and the web interface. The PRA is an interactive exploratory engine that combines results of various text analysis tools in an unsupervised fashion to conduct au- tonomous investigations on the data according to users’ needs. The PRA is freely available online together with some datasets of European his- torical newspapers. The methods used by the assistant are of potential benefit to other exploratory search applications.Peer reviewe

Crossref

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Helsingin yliopiston digitaalinen arkisto

CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification

Author: Cao Lele
Catovic Armin
Granroth-Wilding Mark
McCornack Andrew
Rocha Dhiana Deva Cavacanti
Stahl Richard Anselmo
von Ehrenheim Vilhelm
Publication venue
Publication date: 25/09/2023
Field of study

In the investment industry, it is often essential to carry out fine-grained company similarity quantification for a range of purposes, including market mapping, competitor analysis, and mergers and acquisitions. We propose and publish a knowledge graph, named CompanyKG, to represent and learn diverse company features and relations. Specifically, 1.17 million companies are represented as nodes enriched with company description embeddings; and 15 different inter-company relations result in 51.06 million weighted edges. To enable a comprehensive assessment of methods for company similarity quantification, we have devised and compiled three evaluation tasks with annotated test sets: similarity prediction, competitor retrieval and similarity ranking. We present extensive benchmarking results for 11 reproducible predictive methods categorized into three groups: node-only, edge-only, and node+edge. To the best of our knowledge, CompanyKG is the first large-scale heterogeneous graph dataset originating from a real-world investment platform, tailored for quantifying inter-company similarity.Comment: Paper (13 pages, 5 figures and 2 tables) + Appendix (18 pages, 4 figures and 5 tables

arXiv.org e-Print Archive