23 research outputs found

    Constructive Nonstandard Analysis without Actual Infinity

    Get PDF

    Old Content and Modern Tools : Searching Named Entities in a Finnish OCRed Historical Newspaper Collection 1771–1910

    Get PDF
    Named Entity Recognition (NER), search, classification and tagging of names and name-like informational elements in texts, has become a standard information extraction procedure for textual data. NER has been applied to many types of texts and different types of entities: newspapers, fiction, historical records, persons, locations, chemical compounds, protein families, animals etc. In general, the performance of a NER system is genre- and domain-dependent and also used entity categories vary [Nadeau and Sekine 2007]. The most general set of named entities is usually some version of a tripartite categorization of locations, persons, and organizations. In this paper we report trials and evaluation of NER with data from a digitized Finnish historical newspaper collection (Digi). Experiments, results, and discussion of this research serve development of the web collection of historical Finnish newspapers. Digi collection contains 1,960,921 pages of newspaper material from 1771–1910 in both Finnish and Swedish. We use only material of Finnish documents in our evaluation. The OCRed newspaper collection has lots of OCR errors; its estimated word level correctness is about 70–75 % [Kettunen and Pääkkönen 2016]. Our principal NE tagger is a rule-based tagger of Finnish, FiNER, provided by the FIN-CLARIN consortium. We also show results of limited category semantic tagging with tools of the Semantic Computing Research Group (SeCo) of the Aalto University. Three other tools are also evaluated briefly. This paper reports the first large scale results of NER in a historical Finnish OCRed newspaper collection. Results of this research supplement NER results of other languages with similar noisy data. As the results are also achieved with a small and morphologically rich language, they illuminate the relatively well-researched area of Named Entity Recognition from a new perspective.Peer reviewe

    Identification of Motor Symptoms Related to Parkinson Disease Using Motion-Tracking Sensors at Home (KAVELI) : Protocol for an Observational Case-Control Study

    Get PDF
    Background: Clinical characterization of motion in patients with Parkinson disease (PD) is challenging: symptom progression, suitability of medication, and level of independence in the home environment can vary across time and patients. Appointments at the neurological outpatient clinic provide a limited understanding of the overall situation. In order to follow up these variations, longer-term measurements performed outside of the clinic setting could help optimize and personalize therapies. Several wearable sensors have been used to estimate the severity of symptoms in PD; however, longitudinal recordings, even for a short duration of a few days, are rare. Home recordings have the potential benefit of providing a more thorough and objective follow-up of the disease while providing more information about the possible need to change medications or consider invasive treatments. Objective: The primary objective of this study is to collect a dataset for developing methods to detect PD-related symptoms that are visible in walking patterns at home. The movement data are collected continuously and remotely at home during the normal lives of patients with PD as well as controls. The secondary objective is to use the dataset to study whether the registered medication intakes can be identified from the collected movement data by looking for and analyzing short-term changes in walking patterns. Methods: This paper described the protocol for an observational case-control study that measures activity using three different devices: (1) a smartphone with a built-in accelerometer, gyroscope, and phone orientation sensor, (2) a Movesense smart sensor to measure movement data from the wrist, and (3) a Forciot smart insole to measure the forces applied on the feet. The measurements are first collected during the appointment at the clinic conducted by a trained clinical physiotherapist. Subsequently, the subjects wear the smartphone at home for 3 consecutive days. Wrist and insole sensors are not used in the home recordings. Results: Data collection began in March 2018. Subject recruitment and data collection will continue in spring 2019. The intended sample size was 150 subjects. In 2018, we collected a sample of 103 subjects, 66 of whom were diagnosed with PD. Conclusions: This study aims to produce an extensive movement-sensor dataset recorded from patients with PD in various phases of the disease as well as from a group of control subjects for effective and impactful comparison studies. The study also aims to develop data analysis methods to monitor PD symptoms and the effects of medication intake during normal life and outside of the clinic setting. Further applications of these methods may include using them as tools for health care professionals to monitor PD remotely and applying them to other movement disorders.Peer reviewe

    Modern Tools for Old Content - in Search of Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910

    Get PDF
    Named entity recognition (NER), search, classification and tagging of names and name like frequent informational elements in texts, has become a standard information extraction procedure for textual data. NER has been applied to many types of texts and different types of entities: newspapers, fiction, historical records, persons, locations, chemical compounds, protein families, animals etc. In general a NER system’s performance is genre and domain dependent and also used entity categories vary [1]. The most general set of named entities is usually some version of three partite categorization of locations, persons and organizations. In this paper we report first trials and evaluation of NER with data out of a digitized Finnish historical newspaper collection Digi. Digi collection contains 1,960,921 pages of newspaper material from years 1771– 1910 both in Finnish and Swedish. We use only material of Finnish documents in our evaluation. The OCRed newspaper collection has lots of OCR errors; its estimated word level correctness is about 74–75 % [2]. Our principal NER tagger is a rule-based tagger of Finnish, FiNER, provided by the FIN-CLARIN consortium. We show also results of limited category semantic tagging with tools of the Semantic Computing Research Group (SeCo) of the Aalto University. FiNER is able to achieve up to 60.0 F-score with named entities in the evaluation data. Seco’s tools achieve 30.0–60.0 F-score with locations and persons. Performance of FiNER and SeCo’s tools with the data shows that at best about half of named entities can be recognized even in a quite erroneous OCRed textNamed entity recognition (NER), search, classification and tagging of names and name like frequent informational elements in texts, has become a standard information extraction procedure for textual data. NER has been applied to many types of texts and different types of entities: newspapers, fiction, historical records, persons, locations, chemical compounds, protein families, animals etc. In general a NER system’s performance is genre and domain dependent and also used entity categories vary [1]. The most general set of named entities is usually some version of three partite categorization of locations, persons and organizations. In this paper we report first trials and evaluation of NER with data out of a digitized Finnish historical newspaper collection Digi. Digi collection contains 1,960,921 pages of newspaper material from years 1771– 1910 both in Finnish and Swedish. We use only material of Finnish documents in our evaluation. The OCRed newspaper collection has lots of OCR errors; its estimated word level correctness is about 74–75 % [2]. Our principal NER tagger is a rule-based tagger of Finnish, FiNER, provided by the FIN-CLARIN consortium. We show also results of limited category semantic tagging with tools of the Semantic Computing Research Group (SeCo) of the Aalto University. FiNER is able to achieve up to 60.0 F-score with named entities in the evaluation data. Seco’s tools achieve 30.0–60.0 F-score with locations and persons. Performance of FiNER and SeCo’s tools with the data shows that at best about half of named entities can be recognized even in a quite erroneous OCRed text.Peer reviewe

    Parkinson’s disease detection from 20-step walking tests using inertial sensors of a smartphone: Machine learning approach based on an observational case-control study

    Get PDF
    Parkinson’s disease (PD) is a neurodegenerative disease inducing dystrophy of the motor system. Automatic movement analysis systems have potential in improving patient care by enabling personalized and more accurate adjust of treatment. These systems utilize machine learning to classify the movement properties based on the features derived from the signals. Smartphones can provide an inexpensive measurement platform with their built-in sensors for movement assessment. This study compared three feature selection and nine classification methods for identifying PD patients from control subjects based on accelerometer and gyroscope signals measured with a smartphone during a 20-step walking test. Minimum Redundancy Maximum Relevance (mRMR) and sequential feature selection with both forward (SFS) and backward (SBS) propagation directions were used in this study. The number of selected features was narrowed down from 201 to 4–15 features by applying SFS and mRMR methods. From the methods compared in this study, the highest accuracy for individual steps was achieved with SFS (7 features) and Naive Bayes classifier (accuracy 75.3%), and the second highest accuracy with SFS (4 features) and k Nearest neighbours (accuracy 75.1%). Leave-one-subject-out cross-validation was used in the analysis. For the overall classification of each subject, which was based on the majority vote of the classified steps, k Nearest Neighbors provided the most accurate result with an accuracy of 84.5% and an error rate of 15.5%. This study shows the differences in feature selection methods and classifiers and provides generalizations for optimizing methodologies for smartphone-based monitoring of PD patients. The results are promising for further developing the analysis system for longer measurements carried out in free-living conditions.Peer reviewe

    Identification of Motor Symptoms Related to Parkinson Disease Using Motion-Tracking Sensors at Home (KAVELI): Protocol for an Observational Case-Control Study

    Get PDF
    Background: Clinical characterization of motion in patients with Parkinson disease (PD) is challenging: symptom progression, suitability of medication, and level of independence in the home environment can vary across time and patients. Appointments at the neurological outpatient clinic provide a limited understanding of the overall situation. In order to follow up these variations, longer-term measurements performed outside of the clinic setting could help optimize and personalize therapies. Several wearable sensors have been used to estimate the severity of symptoms in PD; however, longitudinal recordings, even for a short duration of a few days, are rare. Home recordings have the potential benefit of providing a more thorough and objective follow-up of the disease while providing more information about the possible need to change medications or consider invasive treatments.Objective: The primary objective of this study is to collect a dataset for developing methods to detect PD-related symptoms that are visible in walking patterns at home. The movement data are collected continuously and remotely at home during the normal lives of patients with PD as well as controls. The secondary objective is to use the dataset to study whether the registered medication intakes can be identified from the collected movement data by looking for and analyzing short-term changes in walking patterns.Methods: This paper described the protocol for an observational case-control study that measures activity using three different devices: (1) a smartphone with a built-in accelerometer, gyroscope, and phone orientation sensor, (2) a Movesense smart sensor to measure movement data from the wrist, and (3) a Forciot smart insole to measure the forces applied on the feet. The measurements are first collected during the appointment at the clinic conducted by a trained clinical physiotherapist. Subsequently, the subjects wear the smartphone at home for 3 consecutive days. Wrist and insole sensors are not used in the home recordings.Results: Data collection began in March 2018. Subject recruitment and data collection will continue in spring 2019. The intended sample size was 150 subjects. In 2018, we collected a sample of 103 subjects, 66 of whom were diagnosed with PD.Conclusions: This study aims to produce an extensive movement-sensor dataset recorded from patients with PD in various phases of the disease as well as from a group of control subjects for effective and impactful comparison studies. The study also aims to develop data analysis methods to monitor PD symptoms and the effects of medication intake during normal life and outside of the clinic setting. Further applications of these methods may include using them as tools for health care professionals to monitor PD remotely and applying them to other movement disorders

    Mobilemicroservices Architecture for Remote Monitoring of Patients : A Feasibility Study

    Get PDF
    Recent developments in smart mobile devices (SMDs), wearable sensors, the Internet, mobile networks, and computing power provide new healthcare opportunities that are not restricted geographically. This paper aims to introduce Mobilemicroservices Architecture (MMA) based on a study on architectures. In MMA, an HTTP-based Mobilemicroservivce (MM) is allocated to each SMD's sensor. The key benefits are extendibility, scalability, ease of use for the patient, security, and the possibility to collect raw data without the necessity to involve cloud services. Feasibility was investigated in a two-year project, where MMA-based solutions were used to collect motor function data from patients with Parkinson's disease. First, we collected motor function data from 98 patients and healthy controls during their visit to a clinic. Second, we monitored the same subjects in real-time for three days in their everyday living environment. These MMA applications represent HTTP-based business-logic computing in which the SMDs' resources are accessible globally.publishedVersionPeer reviewe

    Digital healthcare platform ecosystem design : A case study of an ecosystem for Parkinson's disease patients

    Get PDF
    This study investigates how a digital health care platform ecosystem and its essential core elements can be constructed considering specific characteristics of health care. A single case study of an ecosystem planned for patients with Parkinson's disease was conducted following a constructive research approach. The study reveals that the keystones for digital platform ecosystem design for health care purposes are as follows: 1) a focus on multi-sided mutual value propositions as a starting point for the design process; 2) understanding the technological core of the platform, including artefacts, interfaces and infrastructure; 3) understanding the actor layer, including actor resources, roles, positions and mutual interactions; and 4) understanding environmental influences in the form of data-related issues, prevailing institutions, practices and regulation. Several recommendations are offered to consider in designing the digital health care platform ecosystem. Managerially, this understanding could facilitate the development of new means of diagnostics, impactful and just-in-time care, functioning data collection and resource coordination in health care.publishedVersionPeer reviewe
    corecore