12 research outputs found

    Intelligent Data Mining using Kernel Functions and Information Criteria

    Get PDF
    Radial Basis Function (RBF) Neural Networks and Support Vector Machines (SVM) are two powerful kernel related intelligent data mining techniques. The current major problems with these methods are over-fitting and the existence of too many free parameters. The way to select the parameters can directly affect the generalization performance(test error) of theses models. Current practice in how to choose the model parameters is an art, rather than a science in this research area. Often, some parameters are predetermined, or randomly chosen. Other parameters are selected through repeated experiments that are time consuming, costly, and computationally very intensive. In this dissertation, we provide a two-stage analytical hybrid-training algorithm by building a bridge among regression tree, EM algorithm, and Radial Basis Function Neural Networks together. Information Complexity (ICOMP) criterion of Bozdogan along with other information based criteria are introduced and applied to control the model complexity, and to decide the optimal number of kernel functions. In the first stage of the hybrid, regression tree and EM algorithm are used to determine the kernel function parameters. In the second stage of the hybrid, the weights (coefficients) are calculated and information criteria are scored. Kernel Principal Component Analysis (KPCA) using EM algorithm for feature selection and data preprocessing is also introduced and studied. Adaptive Support Vector Machines (ASVM) and some efficient algorithms are given to deal with massive data sets in support vector classifications. Versatility and efficiency of the new proposed approaches are studied on real data sets and via Monte Carlo sim- ulation experiments

    Modelling Instrumental Gestures and Techniques: A Case Study of Piano Pedalling

    Get PDF
    PhD ThesisIn this thesis we propose a bottom-up approach for modelling instrumental gestures and techniques, using piano pedalling as a case study. Pedalling gestures play a vital role in expressive piano performance. They can be categorised into di erent pedalling techniques. We propose several methods for the indirect acquisition of sustain-pedal techniques using audio signal analyses, complemented by the direct measurement of gestures with sensors. A novel measurement system is rst developed to synchronously collect pedalling gestures and piano sound. Recognition of pedalling techniques starts by using the gesture data. This yields high accuracy and facilitates the construction of a ground truth dataset for evaluating the audio-based pedalling detection algorithms. Studies in the audio domain rely on the knowledge of piano acoustics and physics. New audio features are designed through the analysis of isolated notes with di erent pedal e ects. The features associated with a measure of sympathetic resonance are used together with a machine learning classi er to detect the presence of legato-pedal onset in the recordings from a speci c piano. To generalise the detection, deep learning methods are proposed and investigated. Deep Neural Networks are trained using a large synthesised dataset obtained through a physical-modelling synthesiser for feature learning. Trained models serve as feature extractors for frame-wise sustain-pedal detection from acoustic piano recordings in a proposed transfer learning framework. Overall, this thesis demonstrates that recognising sustain-pedal techniques is possible to a high degree of accuracy using sensors and also from audio recordings alone. As the rst study that undertakes pedalling technique detection in real-world piano performance, it complements piano transcription methods. Moreover, the underlying relations between pedalling gestures, piano acoustics and audio features are identi ed. The varying e ectiveness of the presented features and models can also be explained by di erences in pedal use between composers and musical eras

    Computational Methods for Assisting Radio Drama Production.

    Get PDF
    PhD Theses.Radio Drama is a theatrical form of art that usually exists solely in the acoustic domain consisting of music, speech, and sound effects and is most often consumed through broadcast radio. This thesis proposes methods for assisting a human creator in producing radio dramas. Much research has been done to aiding creativity using artificial intelligence techniques in storytelling, music composition, the visual arts, and fi lm. Despite that, radio drama is under-represented in such research. Radio drama consists of both literary aspects, such as plot, story characters, or environments, as well as production aspects, such as speech, music, and sound effects. While plenty of research has been examining each of those aspects individually there is currently no research that combines such studies in the context of radio drama production. In this thesis, an interdisciplinary approach to assisting a human creator in radio drama production is developed. The task is explored through the joint prism of natural language processing, music information retrieval, and automatic mixing. We show that individual literary aspects of radio drama can be automatically extracted from a story draft provided by a human creator, by using natural language processing methods. Formal rules can be used to express the aforementioned elements in the form of a script able to be read and altered by both the human creator and the computer. We devise recommender systems for sound, music, and audio effects to retrieve the assets required for production. Rules derived from radio drama literature can then use those recorded assets to produce a radio drama mix in a semi-automatic way. Furthermore, an adaptive reverberation effect suggests reverberation settings for each track based on track content and past user choices. The degree of success for individual tasks in aiding production is demonstrated using examples of radio drama production from raw stories and validated through objective evaluation metrics, and listening tests

    Äriprotsesside ajaliste nĂ€itajate selgitatav ennustav jĂ€lgimine

    Get PDF
    Kaasaegsed ettevĂ”tte infosĂŒsteemid vĂ”imaldavad ettevĂ”tetel koguda detailset informatsiooni Ă€riprotsesside tĂ€itmiste kohta. Eelnev koos masinĂ”ppe meetoditega vĂ”imaldab kasutada andmejuhitavaid ja ennustatavaid lĂ€henemisi Ă€riprotsesside jĂ”udluse jĂ€lgimiseks. Kasutades ennustuslike Ă€riprotsesside jĂ€lgimise tehnikaid on vĂ”imalik jĂ”udluse probleeme ennustada ning soovimatu tegurite mĂ”ju ennetavalt leevendada. TĂŒĂŒpilised kĂŒsimused, millega tegeleb ennustuslik protsesside jĂ€lgimine on “millal antud Ă€riprotsess lĂ”ppeb?” vĂ”i “mis on kĂ”ige tĂ”enĂ€olisem jĂ€rgmine sĂŒndmus antud Ă€riprotsessi jaoks?”. Suurim osa olemasolevatest lahendustest eelistavad tĂ€psust selgitatavusele. Praktikas, selgitatavus on ennustatavate tehnikate tĂ€htis tunnus. Ennustused, kas protsessi tĂ€itmine ebaĂ”nnestub vĂ”i selle tĂ€itmisel vĂ”ivad tekkida raskused, pole piisavad. On oluline kasutajatele seletada, kuidas on selline ennustuse tulemus saavutatud ning mida saab teha soovimatu tulemuse ennetamiseks. Töö pakub vĂ€lja kaks meetodit ennustatavate mudelite konstrueerimiseks, mis vĂ”imaldavad jĂ€lgida Ă€riprotsesse ning keskenduvad selgitatavusel. Seda saavutatakse ennustuse lahtivĂ”tmisega elementaarosadeks. NĂ€iteks, kui ennustatakse, et Ă€riprotsessi lĂ”puni on jÀÀnud aega 20 tundi, siis saame anda seletust, et see aeg on moodustatud kĂ”ikide seni kĂ€sitlemata tegevuste lĂ”petamiseks vajalikust ajast. Töös vĂ”rreldakse omavahel eelmainitud meetodeid, kĂ€sitledes Ă€riprotsesse erinevatest valdkondadest. Hindamine toob esile erinevusi selgitatava ja tĂ€psusele pĂ”hinevale lĂ€henemiste vahel. Töö teaduslik panus on ennustuslikuks protsesside jĂ€lgimiseks vabavaralise tööriista arendamine. SĂŒsteemi nimeks on Nirdizati ning see sĂŒsteem vĂ”imaldab treenida ennustuslike masinĂ”ppe mudeleid, kasutades nii töös kirjeldatud meetodeid kui ka kolmanda osapoole meetodeid. Hiljem saab treenitud mudeleid kasutada hetkel kĂ€ivate Ă€riprotsesside tulemuste ennustamiseks, mis saab aidata kasutajaid reaalajas.Modern enterprise systems collect detailed data about the execution of the business processes they support. The widespread availability of such data in companies, coupled with advances in machine learning, have led to the emergence of data-driven and predictive approaches to monitor the performance of business processes. By using such predictive process monitoring approaches, potential performance issues can be anticipated and proactively mitigated. Various approaches have been proposed to address typical predictive process monitoring questions, such as what is the most likely continuation of an ongoing process instance, or when it will finish. However, most existing approaches prioritize accuracy over explainability. Yet in practice, explainability is a critical property of predictive methods. It is not enough to accurately predict that a running process instance will end up in an undesired outcome. It is also important for users to understand why this prediction is made and what can be done to prevent this undesired outcome. This thesis proposes two methods to build predictive models to monitor business processes in an explainable manner. This is achieved by decomposing a prediction into its elementary components. For example, to explain that the remaining execution time of a process execution is predicted to be 20 hours, we decompose this prediction into the predicted execution time of each activity that has not yet been executed. We evaluate the proposed methods against each other and various state-of-the-art baselines using a range of business processes from multiple domains. The evaluation reaffirms a fundamental trade-off between explainability and accuracy of predictions. The research contributions of the thesis have been consolidated into an open-source tool for predictive business process monitoring, namely Nirdizati. It can be used to train predictive models using the methods described in this thesis, as well as third-party methods. These models are then used to make predictions for ongoing process instances; thus, the tool can also support users at runtime

    Coopération de réseaux de caméras ambiantes et de vision embarquée sur robot mobile pour la surveillance de lieux publics

    Get PDF
    Actuellement, il y a une demande croissante pour le dĂ©ploiement de robots mobile dans des lieux publics. Pour alimenter cette demande, plusieurs chercheurs ont dĂ©ployĂ© des systĂšmes robotiques de prototypes dans des lieux publics comme les hĂŽpitaux, les supermarchĂ©s, les musĂ©es, et les environnements de bureau. Une principale prĂ©occupation qui ne doit pas ĂȘtre nĂ©gligĂ©, comme des robots sortent de leur milieu industriel isolĂ© et commencent Ă  interagir avec les humains dans un espace de travail partagĂ©, est une interaction sĂ©curitaire. Pour un robot mobile Ă  avoir un comportement interactif sĂ©curitaire et acceptable - il a besoin de connaĂźtre la prĂ©sence, la localisation et les mouvements de population Ă  mieux comprendre et anticiper leurs intentions et leurs actions. Cette thĂšse vise Ă  apporter une contribution dans ce sens en mettant l'accent sur les modalitĂ©s de perception pour dĂ©tecter et suivre les personnes Ă  proximitĂ© d'un robot mobile. Comme une premiĂšre contribution, cette thĂšse prĂ©sente un systĂšme automatisĂ© de dĂ©tection des personnes visuel optimisĂ© qui prend explicitement la demande de calcul prĂ©vue sur le robot en considĂ©ration. DiffĂ©rentes expĂ©riences comparatives sont menĂ©es pour mettre clairement en Ă©vidence les amĂ©liorations de ce dĂ©tecteur apporte Ă  la table, y compris ses effets sur la rĂ©activitĂ© du robot lors de missions en ligne. Dans un deuxiĂš contribution, la thĂšse propose et valide un cadre de coopĂ©ration pour fusionner des informations depuis des camĂ©ras ambiant affixĂ© au mur et de capteurs montĂ©s sur le robot mobile afin de mieux suivre les personnes dans le voisinage. La mĂȘme structure est Ă©galement validĂ©e par des donnĂ©es de fusion Ă  partir des diffĂ©rents capteurs sur le robot mobile au cours de l'absence de perception externe. Enfin, nous dĂ©montrons les amĂ©liorations apportĂ©es par les modalitĂ©s perceptives dĂ©veloppĂ©s en les dĂ©ployant sur notre plate-forme robotique et illustrant la capacitĂ© du robot Ă  percevoir les gens dans les lieux publics supposĂ©s et respecter leur espace personnel pendant la navigation.This thesis deals with detection and tracking of people in a surveilled public place. It proposes to include a mobile robot in classical surveillance systems that are based on environment fixed sensors. The mobile robot brings about two important benefits: (1) it acts as a mobile sensor with perception capabilities, and (2) it can be used as means of action for service provision. In this context, as a first contribution, it presents an optimized visual people detector based on Binary Integer Programming that explicitly takes the computational demand stipulated into consideration. A set of homogeneous and heterogeneous pool of features are investigated under this framework, thoroughly tested and compared with the state-of-the-art detectors. The experimental results clearly highlight the improvements the different detectors learned with this framework bring to the table including its effect on the robot's reactivity during on-line missions. As a second contribution, the thesis proposes and validates a cooperative framework to fuse information from wall mounted cameras and sensors on the mobile robot to better track people in the vicinity. Finally, we demonstrate the improvements brought by the developed perceptual modalities by deploying them on our robotic platform and illustrating the robot's ability to perceive people in supposed public areas and respect their personal space during navigation

    Kernel Methods and Measures for Classification with Transparency, Interpretability and Accuracy in Health Care

    Get PDF
    Support vector machines are a popular method in machine learning. They learn from data about a subject, for example, lung tumors in a set of patients, to classify new data, such as, a new patient’s tumor. The new tumor is classified as either cancerous or benign, depending on how similar it is to the tumors of other patients in those two classes—where similarity is judged by a kernel. The adoption and use of support vector machines in health care, however, is inhibited by a perceived and actual lack of rationale, understanding and transparency for how they work and how to interpret information and results from them. For example, a user must select the kernel, or similarity function, to be used, and there are many kernels to choose from but little to no useful guidance on choosing one. The primary goal of this thesis is to create accurate, transparent and interpretable kernels with rationale to select them for classification in health care using SVM—and to do so within a theoretical framework that advances rationale, understanding and transparency for kernel/model selection with atomic data types. The kernels and framework necessarily co-exist. The secondary goal of this thesis is to quantitatively measure model interpretability for kernel/model selection and identify the types of interpretable information which are available from different models for interpretation. Testing my framework and transparent kernels with empirical data I achieve classification accuracy that is better than or equivalent to the Gaussian RBF kernels. I also validate some of the model interpretability measures I propose

    Historical Land use/Land cover classification and its change detection mapping using Different Remotely Sensed Data from LANDSAT (MSS, TM and ETM+) and Terra (ASTER) sensors: a case study of the Euphrates River Basin in Syria with focus on agricultural irrigation projects

    Get PDF
    This thesis deals spatially and regionally with the natural boundaries of the Euphrates River Basin (ERB) in Syria. Scientifically, the research covers the application of remote sensing science (optical remote sensing: LANDSAT-MSS, TM, and ETM+; and TERRA: ASTER); and methodologically, in Land Use/Land Cover (LULC) classification and mapping, automatically and/or semi-automatically; in LULC-change detection; and finally in the mapping of historical irrigation and agricultural projects for the extraction of differing crop types and the estimation of their areas. With regard to time, the work is based on the years 1975, 1987, 2005 and 2007. Initially, preprocessing of the satellite data (geometric- and radiometric- processing, image enhancement, best bands composite selection, transformation, mosaicing and finally subsetting) was carried out. Then, the Land Use/Land Cover Classification System (LCCS) of the Food and Agriculture Organization (FAO) was chosen. The following steps were followed in LULC- classification and change detection mapping: visual interpretation in addition to digital image processing techniques; pixel-based classification methods; unsupervised classification: ISODATA-method; and supervised classification and multistage supervised approaches using the algorithms: Maximum Likelihood Classifier (MLC), Neural Network classifier (NN) and Support Vector Machines (SVM). These were trialed on a test area to determine the optimized classification approach/algorithm for application on the whole study area (ERB) based on the available imagery. Pre- and post- classification change detection methods (comparison approaches) were used to detect changes in land use/land cover-classes (for the years 1975, 1987 and 2007) in the study area. The remote sensing methods show a high potential in mapping historical and present land use/land cover classes and its changes over time. Significant results are also possible for agricultural crop classification in relatively large regional areas (the ERB in Syria is almost 50,335 kmÂČ). Change trends in the study area and period was characterized by land-intensive agricultural expansion. The rapid, more labor- and capital- intensive growth in the agricultural sector was enabled by the introduction of fertilizer, improved access to rural roads and markets, and the expansion of the government irrigation projects. Irrigated areas increased 148 % in the past 32 years from 249,681 ha in 1975 to 596,612 ha in 2007

    Ti plasmids

    No full text
    corecore