146 research outputs found
Avoiding a Data Science Winter by Keeping the Expectations Low
In this paper we present and discuss some aspects related to what we consider as some of the most important corporate challenges of Data Science, AI and Machine Learning regarding both human talents and business. We examine the case of a discussion that took place over Quora and in particular we focus on an answer we have selected as indicative of a potentially threatening situation for the sustainable development of the Data Science, AI and Machine Learning disciplines as well as the growth of the respective demand and supply sides and the corresponding ecosystem these form. We then make an attempt to examine the setting by means of analyzing the case, using as our guide the provided narrative
Stakeholders of Cardiovascular Innovation Ecosystems in Germany: A First Level Analysis and an Example
This paper aims to provide a first attempt towards analysis innovation ecosystems for cardiovascular pathologies in Germany through the use of a stakeholder model. We present essential stakeholders for the development and deployment of innovations in the field of cardiovascular research and medicine, and the primary functions they fulfill in the context of these innovation ecosystems. The adopted approach consists of the implementation of a multilevel system model for analyzing stakeholders in this particular field. Data acquisition transpired through systematic literature review of multiple articles and studies. Data analysis phases were executed until reaching a point at which the considerable amount of data was discovered, ensuring consistency across various sources. We demonstrate that innovation ecosystems in cardiovascular medicine involve interconnected networks of stakeholders across different fields. Moreover, through an investigation of innovation ecosystems of cardiovascular pathologies particularly in Germany, we present the functions undertaken by each stakeholder, which are essential for the participation in the innovation ecosystems. The findings presented in this paper hold the potential to bring better understanding of cardiovascular pathology innovation ecosystems in Germany. This assertion is substantiated through a comprehensive examination of relevant scientific literature
Content-based access to spoken audio
The amount of archived audio material in digital form is increasing rapidly, as
advantage is taken of the growth in available storage and processing power.
Computational resources are becoming less of a bottleneck to digitally record and
archive vast amounts of spoken material, both television and radio broadcasts and
individual conversations. However, listening to this ever-growing amount of spoken
audio sequentially is too slow, and the bottleneck will become the development of
effective ways to access content in these voluminous archives. The provision of
accurate and efficient computer-mediated content access is a challenging task,
because spoken audio combines information from multiple levels (phonetic, acoustic,
syntactic, semantic and discourse). Most systems that assist humans in accessing
spoken audio content have approached the problem by performing automatic speech
recognition, followed by text-based information access. These systems have
addressed diverse tasks including indexing and retrieving voicemail messages,
searching for broadcast news, and extracting information from recordings of
meetings and lectures. Spoken audio content is far richer than what a simple textual
transcription can capture as it has additional cues that disclose the intended meaning
and speaker’s emotional state. However, the text transcription alone still provides a
great deal of useful information in applications.
This article describes approaches to content-based access to spoken audio with a
qualitative and tutorial emphasis. We describe how the analysis, retrieval and
delivery phases contribute making spoken audio content more accessible, and we
outline a number of outstanding research issues. We also discuss the main
application domains and try to identify important issues for future developments. The
structure of the article is based on general system architecture for content-based access which is depicted in Figure 1. Although the tasks within each processing
stage may appear unconnected, the interdependencies and the sequence with which
they take place vary
Transcription and Summarization of Voicemail Speech
This paper describes the development of a system to transcribe and summarize voicemail messages. The results of the research we present are two-fold. First, a hybrid connectionist approach to the Voicemail transcription task shows that competitive performance can be achieved using a context-independent system with fewer parameters than those based on mixtures of Gaussian likelihoods. Second, an effective and robust combination of statistical with prior knowledge sources for term weighting is used to extract information from the decoder’s output in order to deliver summaries to the message recipients via a GSM Short Message Service (SMS) gateway
Evaluation of extractive voicemail summarization.
This paper is about the evaluation of a system that generates short text summaries of voicemail messages, suitable for transmission as text messages. Our approach to summarization is based on a speech-recognized transcript of the voicemail message, from which a set of summary words is extracted. The system uses a classifier to identify the summary words, with each word being identified by a vector of lexical and prosodic features. The features are selected using Parcel, an ROC-based algorithm. Our evaluations of the system, using a slot error rate metric, have compared manual and automatic summarization, and manual and automatic recognition (using two different recognizers). We also report on two subjective evaluations using mean opinion score of summaries, and a set of comprehension tests. The main results from these experiments were that the perceived difference in quality of summarization was affected more by errors resulting from automatic transcription, than by the automatic summarization process
The role of prosody in a voicemail summarization system
When a speaker leaves a voicemail message there are prosodic cues that emphasize the important points in the message, in addition to lexical content. In this paper we compare and visualize the relative contribution of these two types of features within a voicemail summarization system. We describe the system's ability to generate summaries of two test sets, having trained and validated using 700 messages from the IBM Voicemail corpus. Results measuring the quality of summary artifacts show that combined lexical and prosodic features are at least as robust as combined lexical features alone across all operating conditions
Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models
Abstract. The purpose of extractive summarization is to automatically select indicative sentences, passages, or paragraphs from an original document according to a certain target summarization ratio, and then sequence them to form a concise summary. In this paper, in contrast to conventional approaches, our objective is to deal with the extractive summarization problem under a probabilistic modeling framework. We investigate the use of the hidden Markov model (HMM) for spoken document summarization, in which each sentence of a spoken document is treated as an HMM for generating the document, and the sentences are ranked and selected according to their likelihoods. In addition, the relevance model (RM) of each sentence, estimated from a contemporary text collection, is integrated with the HMM model to improve the representation of the sentence model. The experiments were performed on Chinese broadcast news compiled in Taiwan. The proposed approach achieves noticeable performance gains over conventional summarization approaches
A critical comparison on attitude estimation: from gaussian approximate filters to coordinate-free dual optimal control
This paper conveys attitude and rate estimation without rate sensors by performing a critical comparison, validated by extensive simulations. The two dominant approaches to facilitate attitude estimation are based on stochastic and set-membership reasoning. The first one mostly utilizes the commonly known Gaussian-approximate filters, namely the EKF and UKF. Although more conservative, the latter seems to be more promising as it considers the inherent geometric characteristics of the underline compact state space and accounts—from first principles—for large model errors. The set-theoretic approach from a control point of view is addressed, and it is shown that it can overcome reported deficiencies of the Bayesian architectures related to this problem, leading to coordinate-free optimal filters
The Role of Non-Coding RNAs in Myelodysplastic Neoplasms
Myelodysplastic syndromes or neoplasms (MDS) are a heterogeneous group of myeloid clonal disorders characterized by peripheral blood cytopenias, blood and marrow cell dysplasia, and increased risk of evolution to acute myeloid leukemia (AML). Non-coding RNAs, especially microRNAs and long non-coding RNAs, serve as regulators of normal and malignant hematopoiesis and have been implicated in carcinogenesis. This review presents a comprehensive summary of the biology and role of non-coding RNAs, including the less studied circRNA, siRNA, piRNA, and snoRNA as potential prognostic and/or predictive biomarkers or therapeutic targets in MDS
- …