57 research outputs found

    Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study

    Get PDF
    Background Electronic medical records provide large-scale real-world clinical data for use in developing clinical decision systems. However, sophisticated methodology and analytical skills are required to handle the large-scale datasets necessary for the optimisation of prediction accuracy. Myopia is a common cause of vision loss. Current approaches to control myopia progression are effective but have significant side effects. Therefore, identifying those at greatest risk who should undergo targeted therapy is of great clinical importance. The objective of this study was to apply big data and machine learning technology to develop an algorithm that can predict the onset of high myopia, at specific future time points, among Chinese school-aged children. Methods and findings Real-world clinical refraction data were derived from electronic medical record systems in 8 ophthalmic centres from January 1, 2005, to December 30, 2015. The variables of age, spherical equivalent (SE), and annual progression rate were used to develop an algorithm to predict SE and onset of high myopia (SE ≤ −6.0 dioptres) up to 10 years in the future. Random forest machine learning was used for algorithm training and validation. Electronic medical records from the Zhongshan Ophthalmic Centre (a major tertiary ophthalmic centre in China) were used as the training set. Ten-fold cross-validation and out-of-bag (OOB) methods were applied for internal validation. The remaining 7 independent datasets were used for external validation. Two population-based datasets, which had no participant overlap with the ophthalmic-centre-based datasets, were used for multi-resource validation testing. The main outcomes and measures were the area under the curve (AUC) values for predicting the onset of high myopia over 10 years and the presence of high myopia at 18 years of age. In total, 687,063 multiple visit records (≥3 records) of 129,242 individuals in the ophthalmic-centre-based electronic medical record databases and 17,113 follow-up records of 3,215 participants in population-based cohorts were included in the analysis. Our algorithm accurately predicted the presence of high myopia in internal validation (the AUC ranged from 0.903 to 0.986 for 3 years, 0.875 to 0.901 for 5 years, and 0.852 to 0.888 for 8 years), external validation (the AUC ranged from 0.874 to 0.976 for 3 years, 0.847 to 0.921 for 5 years, and 0.802 to 0.886 for 8 years), and multi-resource testing (the AUC ranged from 0.752 to 0.869 for 4 years). With respect to the prediction of high myopia development by 18 years of age, as a surrogate of high myopia in adulthood, the algorithm provided clinically acceptable accuracy over 3 years (the AUC ranged from 0.940 to 0.985), 5 years (the AUC ranged from 0.856 to 0.901), and even 8 years (the AUC ranged from 0.801 to 0.837). Meanwhile, our algorithm achieved clinically acceptable prediction of the actual refraction values at future time points, which is supported by the regressive performance and calibration curves. Although the algorithm achieved balanced and robust performance, concerns about the compromised quality of real-world clinical data and over-fitting issues should be cautiously considered. Conclusions To our knowledge, this study, for the first time, used large-scale data collected from electronic health records to demonstrate the contribution of big data and machine learning approaches to improved prediction of myopia prognosis in Chinese school-aged children. This work provides evidence for transforming clinical practice, health policy-making, and precise individualised interventions regarding the practical control of school-aged myopia.This study was funded by the National Key R&D Program of China (2018YFC0116500), the National Natural Science Foundation of China (91546101, 81822010), the Guangdong Science and Technology Innovation Leading Talents (2017TX04R031), and Youth Pearl River Scholar in Guangdong (2016)

    Estimation of Photovoltaic Energy in China Based on Global Land High-Resolution Cloud Climatology

    No full text
    As clean, renewable energy, photovoltaic (PV) energy can reduce the ozone-layer loss and climate deterioration caused by the use of traditional types of energy to generate electricity. At present, most PV energy products involve the influence of cloud cover on solar radiation. However, the resolution and precision of most cloud cover data are not fine enough to reflect the actual cloud distribution in local areas. This leads to incorrect distribution results of PV energy in areas with high-spatial-variability clouds. Using high-resolution and high-precision cloud cover data obtained by satellite remote sensing to estimate the distribution of PV energy can solve this problem. In this study, the Global Land High-Resolution Cloud Climatology (GLHCC), a 10-day cloud frequency product with a resolution of 1 km and located in China, was used to construct a cloud-based solar radiation estimation model. Using the inverse relationship between cloud cover and solar radiation, the GLHCC was converted into sunshine percentage data. Using meteorological station data in China, a Least Squares Fit (LSF) and error check were carried out on the A-P, Lqbal, Bahel and Sen Models to determine the optimal solar radiation estimation model (Sen Model). Based on the sunshine percentage data, the Sen Model and terrain shielding factors, the distribution of PV energy in China was estimated. Finally, comparing to the Global Horizontal Irradiance (GHI) of the World Bank and the yearly average global irradiance of the Photovoltaic Geographic Information System (PVGIS), PV energy data in this paper more accurately reflected the distribution of PV energy in China, especially in areas with high-spatial-variability clouds

    Cross-ancestry genome-wide meta-analysis of 61,047 cases and 947,237 controls identifies new susceptibility loci contributing to lung cancer

    No full text
    To identify new susceptibility loci to lung cancer among diverse populations, we performed cross-ancestry genome-wide association studies in European, East Asian and African populations and discovered five loci that have not been previously reported. We replicated 26 signals and identified 10 new lead associations from previously reported loci. Rare-variant associations tended to be specific to populations, but even common-variant associations influencing smoking behavior, such as those with CHRNA5 and CYP2A6, showed population specificity. Fine-mapping and expression quantitative trait locus colocalization nominated several candidate variants and susceptibility genes such as IRF4 and FUBP1. DNA damage assays of prioritized genes in lung fibroblasts indicated that a subset of these genes, including the pleiotropic gene IRF4, potentially exert effects by promoting endogenous DNA damage

    Reciprocal Regulation between lncRNA ANRIL and p15 in Steroid-Induced Glaucoma

    No full text
    Steroid-induced glaucoma (SIG) is the most common adverse steroid-related effect on the eyes. SIG patients can suffer from trabecular meshwork (TM) dysfunction, intraocular pressure (IOP) elevation, and irreversible vision loss. Previous studies have mainly focused on the role of extracellular matrix turnover in TM dysfunction; however, whether the cellular effects of TM cells are involved in the pathogenesis of SIG remains unclear. Here, we found that the induction of cellular senescence was associated with TM dysfunction, causing SIG in cultured cells and mouse models. Especially, we established the transcriptome landscape in the TM tissue of SIG mice via microarray screening and identified ANRIL as the most differentially expressed long non-coding RNA, with a 5.4-fold change. The expression level of ANRIL was closely related to ocular manifestations (IOP elevation, cup/disc ratio, and retinal nerve fiber layer thickness). Furthermore, p15, the molecular target of ANRIL, was significantly upregulated in SIG and was correlated with ocular manifestations in an opposite direction to ANRIL. The reciprocal regulation between ANRIL and p15 was validated using luciferase reporter assay. Through depletion in cultured cells and a mouse model, ANRIL/p15 signaling was confirmed in cellular senescence via cyclin-dependent kinase activity and, subsequently, by phosphorylation of the retinoblastoma protein. ANRIL depletion imitated the SIG phenotype, most importantly IOP elevation. ANRIL depletion-induced IOP elevation in mice can be effectively suppressed by p15 depletion. Analyses of the single-cell atlas and transcriptome dynamics of human TM tissue showed that ANRIL/p15 expression is spatially enriched in human TM cells and is correlated with TM dysfunction. Moreover, ANRIL is colocalized with a GWAS risk variant (rs944800) of glaucoma, suggesting its potential role underlying genetic susceptibility of glaucoma. Together, our findings suggested that steroid treatment promoted cellular senescence, which caused TM dysfunction, IOP elevation, and irreversible vision loss. Molecular therapy targeting the ANRIL/p15 signal exerted a protective effect against steroid treatment and shed new light on glaucoma management

    Multiscale Entropy Analysis of Instantaneous Frequency Variation to Overcome the Cross-Over Artifact in Rhythmic EEG

    No full text
    Generally, for healthy adults, the entropy of electroencephalogram (EEG) signals gradually decreases from wake to sleep stages N1, N2, to N3, and increases during REM. However, some researchers found that multiscale entropy curves of sleep and wakefulness intercept, a cross-over phenomenon whose origin remains unexplored. The objective of the present work is to trace the origin of the cross-over phenomenon and to propose a workaround strategy. We simulated EEG by generating 1/f broadband signal and chirp signals with continuously varying frequencies. We then retrieved the rhythmic component from simulated EEG and real-world EEG and conducted MSE analysis of the instantaneous frequency variation (IFV) of the rhythmic component. The simulation revealed that this interception was ubiquitous in the MSE analysis of simulated EEG with rhythmic components of different frequencies. The cross-over point moved toward larger scale factors with the increasing sampling rate. We found that the MSE curve of IFV from real-world EEG for the wakefulness group was higher than that for sleep, showing no interception. These results suggest that (1) for a rhythmic signal like EEG, MSE analysis of the raw signal is highly affected by the rhythmic component, presenting artificial cross-over curves in sleep EEG study, (2) frequency variation of rhythmic components are complex signal which differs between wakefulness and sleep, in accordance with the complexity loss theory

    Proteomics analysis and proteogenomic characterization of different physiopathological human lenses

    No full text
    Abstract Background The aim of the present study was to identify the proteomic differences among human lenses in different physiopathological states and to screen for susceptibility genes/proteins via proteogenomic characterization. Methods The total proteomes identified across the regenerative lens with secondary cataract (RLSC), congenital cataract (CC) and age-related cataract (ARC) groups were compared to those of normal lenses using isobaric tagging for relative and absolute protein quantification (iTRAQ). The up-regulated proteins between the groups were subjected to biological analysis. Whole exome sequencing (WES) was performed to detect genetic variations. Results The most complete human lens proteome to date, which consisted of 1251 proteins, including 55.2% previously unreported proteins, was identified across the experimental groups. Bioinformatics functional annotation revealed the common involvement of cellular metabolic processes, immune responses and protein folding disturbances among the groups. RLSC-over-expressed proteins were characteristically enriched in the intracellular immunological signal transduction pathways. The CC groups featured biological processes relating to gene expression and vascular endothelial growth factor (VEGF) signaling transduction, whereas the molecular functions corresponding to external stress were specific to the ARC groups. Combined with WES, the proteogenomic characterization narrowed the list to 16 candidate causal molecules. Conclusions These findings revealed common final pathways with diverse upstream regulation of cataractogenesis in different physiopathological states. This proteogenomic characterization shows translational potential for detecting susceptibility genes/proteins in precision medicine
    • …
    corecore