286 research outputs found
Finding Important Terms for Patients in Their Electronic Health Records: A Learning-to-Rank Approach Using Expert Annotations
BACKGROUND: Many health organizations allow patients to access their own electronic health record (EHR) notes through online patient portals as a way to enhance patient-centered care. However, EHR notes are typically long and contain abundant medical jargon that can be difficult for patients to understand. In addition, many medical terms in patients\u27 notes are not directly related to their health care needs. One way to help patients better comprehend their own notes is to reduce information overload and help them focus on medical terms that matter most to them. Interventions can then be developed by giving them targeted education to improve their EHR comprehension and the quality of care.
OBJECTIVE: We aimed to develop a supervised natural language processing (NLP) system called Finding impOrtant medical Concepts most Useful to patientS (FOCUS) that automatically identifies and ranks medical terms in EHR notes based on their importance to the patients.
METHODS: First, we built an expert-annotated corpus. For each EHR note, 2 physicians independently identified medical terms important to the patient. Using the physicians\u27 agreement as the gold standard, we developed and evaluated FOCUS. FOCUS first identifies candidate terms from each EHR note using MetaMap and then ranks the terms using a support vector machine-based learn-to-rank algorithm. We explored rich learning features, including distributed word representation, Unified Medical Language System semantic type, topic features, and features derived from consumer health vocabulary. We compared FOCUS with 2 strong baseline NLP systems.
RESULTS: Physicians annotated 90 EHR notes and identified a mean of 9 (SD 5) important terms per note. The Cohen\u27s kappa annotation agreement was .51. The 10-fold cross-validation results show that FOCUS achieved an area under the receiver operating characteristic curve (AUC-ROC) of 0.940 for ranking candidate terms from EHR notes to identify important terms. When including term identification, the performance of FOCUS for identifying important terms from EHR notes was 0.866 AUC-ROC. Both performance scores significantly exceeded the corresponding baseline system scores (P \u3c .001). Rich learning features contributed to FOCUS\u27s performance substantially.
CONCLUSIONS: FOCUS can automatically rank terms from EHR notes based on their importance to patients. It may help develop future interventions that improve quality of care
Computer-aid molecular docking technology in cereal mycotoxin analysis: Poster
Computer-aid molecular docking is a simulative process that receptors and ligands recognize each other through energy matching and geometric matching. It is widely used in bioactive compounds simulative screening and preliminary exploring the bioactivity and toxicity of molecular, which plays important guiding role in toxicity and bioactivity study of molecular entities. In our study, we used the computer-aid molecular docking software-discovery studio 3.1 client to test the mechanism of aflatoxins such as aflatoxin B1, B2, M1, M2, G1, G2 and the results of our experiment help to illustrate the pathway of aflatoxin’s toxication. We also used this technology to test the preliminary toxicity of zearalenone (ZEN) and its two degradation products: a- zearalenol (a-ZOL) and ß-zearalenol (ß-ZOL), which indicates that these three products possessed significant estrogenic activity. The order of the estrogenic activity is: a-zearalenol > zearalenone >ß-zearalenol.Computer-aid molecular docking is a simulative process that receptors and ligands recognize each other through energy matching and geometric matching. It is widely used in bioactive compounds simulative screening and preliminary exploring the bioactivity and toxicity of molecular, which plays important guiding role in toxicity and bioactivity study of molecular entities. In our study, we used the computer-aid molecular docking software-discovery studio 3.1 client to test the mechanism of aflatoxins such as aflatoxin B1, B2, M1, M2, G1, G2 and the results of our experiment help to illustrate the pathway of aflatoxin’s toxication. We also used this technology to test the preliminary toxicity of zearalenone (ZEN) and its two degradation products: a- zearalenol (a-ZOL) and ß-zearalenol (ß-ZOL), which indicates that these three products possessed significant estrogenic activity. The order of the estrogenic activity is: a-zearalenol > zearalenone >ß-zearalenol
Optimizating Drilling Operating Parameters With Real-Time Surveillance and Mitigation System of Downhole Vibration in Deep Wells
Torsional vibration is the main bottleneck which leads to low efficiency of rock breaking. According to the characteristics of torsional dysfunctions performance, the paper defines three main types of stick-slip, and analyzed the relationship between ROP and energy from bit. Based on Newton’s equations of motion, established frequency domain, single degree of freedom, damped and forced drill string torsional vibration prediction model, with more accuracy for downhole drill string mechanical state description. On this basis, semi-analytical transfer matrix method is adopted to establish drill string response relationship between internal force wave and surface parameters changes in the condition of vibration, which greatly reduce the number of discrete elements and the associated computing time, enabling rapid screening of a large number of design alternatives on a PC. In addition, the response parameters for drillstring stick-slip are integrated into an index (Vibration Strength Estimate, VSE), which is used to quantitatively evaluate downhole stick-slip severity. A three-week pilot test has been conducted in deep wells in Yumen Oilfield in China, with 25% increase of the average ROP and 40% enhancement of the average bit footage compared with offset wells. Validation of field application shows that the downhole torsional vibration evaluation technology is an effective method for further excavate the potential of ROP and reduce drilling cost
Ranking Medical Terms to Support Expansion of Lay Language Resources for Patient Comprehension of Electronic Health Record Notes: Adapted Distant Supervision Approach
BACKGROUND: Medical terms are a major obstacle for patients to comprehend their electronic health record (EHR) notes. Clinical natural language processing (NLP) systems that link EHR terms to lay terms or definitions allow patients to easily access helpful information when reading through their EHR notes, and have shown to improve patient EHR comprehension. However, high-quality lay language resources for EHR terms are very limited in the public domain. Because expanding and curating such a resource is a costly process, it is beneficial and even necessary to identify terms important for patient EHR comprehension first.
OBJECTIVE: We aimed to develop an NLP system, called adapted distant supervision (ADS), to rank candidate terms mined from EHR corpora. We will give EHR terms ranked as high by ADS a higher priority for lay language annotation-that is, creating lay definitions for these terms.
METHODS: Adapted distant supervision uses distant supervision from consumer health vocabulary and transfer learning to adapt itself to solve the problem of ranking EHR terms in the target domain. We investigated 2 state-of-the-art transfer learning algorithms (ie, feature space augmentation and supervised distant supervision) and designed 5 types of learning features, including distributed word representations learned from large EHR data for ADS. For evaluating ADS, we asked domain experts to annotate 6038 candidate terms as important or nonimportant for EHR comprehension. We then randomly divided these data into the target-domain training data (1000 examples) and the evaluation data (5038 examples). We compared ADS with 2 strong baselines, including standard supervised learning, on the evaluation data.
RESULTS: The ADS system using feature space augmentation achieved the best average precision, 0.850, on the evaluation set when using 1000 target-domain training examples. The ADS system using supervised distant supervision achieved the best average precision, 0.819, on the evaluation set when using only 100 target-domain training examples. The 2 ADS systems both performed significantly better than the baseline systems (P \u3c .001 for all measures and all conditions). Using a rich set of learning features contributed to ADS\u27s performance substantially.
CONCLUSIONS: ADS can effectively rank terms mined from EHRs. Transfer learning improved ADS\u27s performance even with a small number of target-domain training examples. EHR terms prioritized by ADS were used to expand a lay language resource that supports patient EHR comprehension. The top 10,000 EHR terms ranked by ADS are available upon request
New results for fractional ordinary differential equations in fuzzy metric space
In this paper, we primarily focused on the existence and uniqueness of the initial value problem for fractional order fuzzy ordinary differential equations in a fuzzy metric space. First, definitions and relevant properties of the Gamma function and Beta function within a fuzzy metric space were provided. Second, by employing the principle of fuzzy compression mapping and Choquet integral of fuzzy numerical functions, we established the existence and uniqueness of solutions to initial value problems for fuzzy ordinary differential equations. Finally, several examples were presented to demonstrate the validity of our obtained results
Detecting Hypoglycemia Incidents Reported in Patients\u27 Secure Messages: Using Cost-Sensitive Learning and Oversampling to Reduce Data Imbalance
BACKGROUND: Improper dosing of medications such as insulin can cause hypoglycemic episodes, which may lead to severe morbidity or even death. Although secure messaging was designed for exchanging nonurgent messages, patients sometimes report hypoglycemia events through secure messaging. Detecting these patient-reported adverse events may help alert clinical teams and enable early corrective actions to improve patient safety.
OBJECTIVE: We aimed to develop a natural language processing system, called HypoDetect (Hypoglycemia Detector), to automatically identify hypoglycemia incidents reported in patients\u27 secure messages.
METHODS: An expert in public health annotated 3000 secure message threads between patients with diabetes and US Department of Veterans Affairs clinical teams as containing patient-reported hypoglycemia incidents or not. A physician independently annotated 100 threads randomly selected from this dataset to determine interannotator agreement. We used this dataset to develop and evaluate HypoDetect. HypoDetect incorporates 3 machine learning algorithms widely used for text classification: linear support vector machines, random forest, and logistic regression. We explored different learning features, including new knowledge-driven features. Because only 114 (3.80%) messages were annotated as positive, we investigated cost-sensitive learning and oversampling methods to mitigate the challenge of imbalanced data.
RESULTS: The interannotator agreement was Cohen kappa=.976. Using cross-validation, logistic regression with cost-sensitive learning achieved the best performance (area under the receiver operating characteristic curve=0.954, sensitivity=0.693, specificity 0.974, F1 score=0.590). Cost-sensitive learning and the ensembled synthetic minority oversampling technique improved the sensitivity of the baseline systems substantially (by 0.123 to 0.728 absolute gains). Our results show that a variety of features contributed to the best performance of HypoDetect.
CONCLUSIONS: Despite the challenge of data imbalance, HypoDetect achieved promising results for the task of detecting hypoglycemia incidents from secure messages. The system has a great potential to facilitate early detection and treatment of hypoglycemia
Newcastle disease virus-vectored Nipah encephalitis vaccines induce B and T cell responses in mice and long-lasting neutralizing antibodies in pigs
AbstractNipah virus (NiV), a member of the Paramyxoviridae family, causes deadly encephalitis in humans and huge economic losses to the pig industry. Here, we generated recombinant avirulent Newcastle disease virus (NDV) LaSota strains expressing the NiV G and F proteins respectively (designated as rLa-NiVG and rLa-NiVF), and evaluated their immunogenicity in mice and pigs. Both rLa-NiVG and rLa-NiVF displayed growth properties similar to those of LaSota virus in chicken eggs. Co-infection of rLa-NiVG and rLa-NiVF caused marked syncytia formation, while intracerebral co-inoculation of these viruses in mice showed they were safe in at least one mammalian species. Animal immunization studies showed rLa-NiVG and rLa-NiVF induced NiV neutralizing antibody responses in mice and pigs, and F protein-specific CD8+ T cell responses in mice. Most importantly, rLa-NiVG and rLa-NiVF administered alone or together, induced a long-lasting neutralizing antibody response in pigs. Recombinant rLa-NiVG/F thus appear to be promising NiV vaccine candidates for pigs and potentially humans
Recommended from our members
Longitudinal Lipidomic Profile of Hypertension in American Indians: Findings From the Strong Heart Family Study.
BACKGROUND: Dyslipidemia is an important risk factor for hypertension and cardiovascular disease. Standard lipid panel cannot reflect the complexity of blood lipidome. The associations of individual lipid species with hypertension remain to be determined in large-scale epidemiological studies, especially in a longitudinal setting. METHODS: Using liquid chromatography-mass spectrometry, we repeatedly measured 1542 lipid species in 3699 fasting plasma samples at 2 visits (1905 at baseline, 1794 at follow-up, ~5.5 years apart) from 1905 unique American Indians in the Strong Heart Family Study. We first identified baseline lipids associated with prevalent and incident hypertension, followed by replication of top hits in Europeans. We then conducted repeated measurement analysis to examine the associations of changes in lipid species with changes in systolic blood pressure, diastolic blood pressure, and mean arterial pressure. Network analysis was performed to identify lipid networks associated with the risk of hypertension. RESULTS: Baseline levels of multiple lipid species, for example, glycerophospholipids, cholesterol esters, sphingomyelins, glycerolipids, and fatty acids, were significantly associated with both prevalent and incident hypertension in American Indians. Some lipids were confirmed in Europeans. Longitudinal changes in multiple lipid species, for example, acylcarnitines, phosphatidylcholines, fatty acids, and triacylglycerols, were significantly associated with changes in blood pressure measurements. Network analysis identified distinct lipidomic patterns associated with the risk of hypertension. CONCLUSIONS: Baseline plasma lipid species and their longitudinal changes are significantly associated with hypertension development in American Indians. Our findings shed light on the role of dyslipidemia in hypertension and may offer potential opportunities for risk stratification and early prediction of hypertension
- …