Search CORE

56 research outputs found

Large AI Models in Health Informatics: Applications, Challenges, and the Future

Author: Dong Yinzhao
Lam Kyle
Li Lin
Lo Benny
Lo Frank P. -W.
Peng Jiachuan
Qiu Jianing
Shi Peilun
Sun Jiankai
Wang Ningli
Xiao Bo
Xu Dong
Yuan Wu
Zhang Ruiyang
Publication venue
Publication date: 24/09/2023
Field of study

Large AI models, or foundation models, are models recently emerging with massive scales both parameter-wise and data-wise, the magnitudes of which can reach beyond billions. Once pretrained, large AI models demonstrate impressive performance in various downstream tasks. A prime example is ChatGPT, whose capability has compelled people's imagination about the far-reaching influence that large AI models can have and their potential to transform different domains of our lives. In health informatics, the advent of large AI models has brought new paradigms for the design of methodologies. The scale of multi-modal data in the biomedical and health domain has been ever-expanding especially since the community embraced the era of deep learning, which provides the ground to develop, validate, and advance large AI models for breakthroughs in health-related areas. This article presents a comprehensive review of large AI models, from background to their applications. We identify seven key sectors in which large AI models are applicable and might have substantial influence, including 1) bioinformatics; 2) medical diagnosis; 3) medical imaging; 4) medical informatics; 5) medical education; 6) public health; and 7) medical robotics. We examine their challenges, followed by a critical discussion about potential future directions and pitfalls of large AI models in transforming the field of health informatics.Comment: This article has been accepted for publication in IEEE Journal of Biomedical and Health Informatic

arXiv.org e-Print Archive

Baichuan 2: Open Large-scale Language Models

Large language models (LLMs) have demonstrated remarkable performance on a variety of natural language tasks based on just a few examples of natural language instructions, reducing the need for extensive feature engineering. However, most powerful LLMs are closed-source or limited in their capability for languages other than English. In this technical report, we present Baichuan 2, a series of large-scale multilingual language models containing 7 billion and 13 billion parameters, trained from scratch, on 2.6 trillion tokens. Baichuan 2 matches or outperforms other open-source models of similar size on public benchmarks like MMLU, CMMLU, GSM8K, and HumanEval. Furthermore, Baichuan 2 excels in vertical domains such as medicine and law. We will release all pre-training model checkpoints to benefit the research community in better understanding the training dynamics of Baichuan 2.Comment: Baichuan 2 technical report. Github: https://github.com/baichuan-inc/Baichuan

arXiv.org e-Print Archive

Soil screening for identifying ecological risk stressors using a battery of in vitro cell bioassays

Author: Arcaro
Broman
Chunxia Wang
Citterio
Donato
Fernandez
Giesy
Guo Yu
Jin
Koh
Legler
Ma
Moore
Oda
Oda
Oh
Ohnishi
Pacheco
Plaza
Quillardet
Routledge
Ruiyang Xiao
Safe
Safe
Salanitro
Snape
Spink
Tao
TEPB (Tianjin Environmental Protection Bureau)
TEPB (Tianjin Environmental Protection Bureau)
Whong
Wu
Xu
Zacharewski
Zijian Wang
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Simulation research on ONAN transformer winding temperature field based on temperature rise test

Author: Chi Cheng
Hua Mingsheng
Xiao Dongping
Xiao Ruiyang
Yang Chun
Yang Fan
Publication venue: 'National Library of Serbia'
Publication date: 01/01/2022
Field of study

Studying the oil-natural-air-natural (ONAN) transformer’s temperature field distribution characteristics and hot spot temperature rise during operation is a key step to evaluate the thermal insulation life of this type of transformer. Firstly, considering the transverse oil passages between the windings and the guiding effect of the oil baffle on the oil flow, this paper takes 35 kV ONAN transformer windings as the research object, which establishes the corresponding electromagnetic-thermal-fluid coupling model and uses the finite element method to calculate the overall distribution of the internal winding temperature field. The calculation results show that when the ONAN transformer reaches thermal equilibrium, the axial temperature distribution of the windings is extremely uneven due to the influence of the transformer oil flow rate, whose difference between both ends is as high as 17.4℃. However, the existence of the oil baffle increases the flow velocity of the inter-turn transverse oil passage to 12 times of the original. Affected by this, the oil temperature difference between the two ends of the oil baffle is as high as 2.42℃. Secondly, this paper conducted a short-circuit temperature rise test on the test prototype, and selected the different axial heights (18%, 72%, 96%, etc.) of the windings on both sides of the transformer as the characteristic sample points for embedding temperature measurement fiber for real-time temperature monitoring. After comparison, it is found that the simulation calculation is basically consistent with experiment data, and the relative error of the two is less than 3.3%

Directory of Open Access Journals

A Mixing Scheme Using a Decentralized Signature Protocol for Privacy Protection in Bitcoin Blockchain

Author: Kim-Kwang Raymond Choo
Ruiyang Xiao
Tianqing Zhu
Wei Ren
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Crossref

Surface modification of fly ash spheroidal particles and their application in the adsorption of phosphorus and chromium(VI) from single and competitive solute systems

Author: Chen Chen
Cheng Ting
Lv Ruiyang
Zhang Minte
Zhang Xiao
Publication venue: Politechnika Wrocławska. Oficyna Wydawnicza Politechniki Wrocławskiej
Publication date: 01/01/2021
Field of study

This work focuses on the surface modification of fly ash spheroidal particles and their application in phosphorus and chromium(VI) adsorption. The results show that through surface modification, amorphous silica-alumina gels precipitated on the spheroidal particle surface (by which the microsurface area of the reaction products is effectively enlarged) and the surface zeta potential was changed to fit for adsorbing anions. During the adsorption experiment (single and competitive solute systems), chromium(VI) was easier to adsorb. The surface zeta potential and the existence of competitive ions should be recognized as two important factors affecting adsorption efficiency. A higher temperature could improve the adsorption efficiencies of the two solute systems. The fitting results of the pseudo-second-order model (single and competitive solute systems) show better agreement than those of the pseudo-first-order model at every temperature. The Langmuir adsorption isotherm equation can better simulate the adsorption process in single solute sy039stems, but only the chromium(VI) adsorption process can be fitted by the competitive Langmuir adsorption isotherm in competitive solute systems

Biblioteka Nauki - repozytorium artykuÅÃ³w

Phenolic metabolites as therapeutic in inflammation and neoplasms: Molecular pathways explaining their efficacy

Author: Bo Liu
Ruiyang Ma
Wenshi Liu
Xiao Cui
Yifan Zhong
Yonghui Xia
Publication venue: Elsevier
Publication date: 01/07/2023
Field of study

Polyphenols, also known as phenolic compounds, are chemical substances containing aromatic rings as well as at least two hydroxyl groups. Natural phenolic compounds exist widely in plants, which protect plants from ultraviolet radiation and other insults. Phenolic compounds have superior pharmacological and nutritional properties (antimicrobial, antibacterial, antiviral, anti-sclerosis, antioxidant, and anti-inflammatory activities), which have been paid more and more attention by the scientific community. Phenols can protect key cellular components from reactive free radical damage, which is mainly due to their property to activate antioxidant enzymes and alleviate oxidative stress and inflammation. It can also inhibit or isolate reactive oxygen species and transfer electrons to free radicals, thereby avoiding cell damage. It has a regulatory role in glucose metabolism, which has a promising prospect in the prevention and intervention of diabetes. It also prevents cardiovascular disease by regulating blood pressure and blood lipids. Polyphenols can inhibit cell proliferation by affecting Erk1/2, CDK, and PI3K/Akt signaling pathways. Polyphenols can function as enhancers of intrinsic defense systems, including superoxide dismutase (SOD) and glutathione peroxidase (GPX). Simultaneously, they can modulate multiple proteins and transcription factors, making them promising candidates in the investigation of anti-cancer medications. This review focuses on multiple aspects of phenolic substances, including their natural origins, production process, disinfection activity, oxidative and anti-inflammatory functions, and the effects of different phenolic substances on tumors

Directory of Open Access Journals

Detection of Advanced Glycosylation End Products in the Cornea Based on Molecular Fluorescence and Machine Learning

Author: Haochen Zhong
Hua Li
Jianming Zhu
Ruiyang Sun
Sifeng Lian
Zhenbang Xiao
Publication venue: 'MDPI AG'
Publication date: 21/01/2023
Field of study

Advanced glycosylation end products (AGEs) are continuously produced and accumulated in the bodies of diabetic patients. To effectively predict disease trends in diabetic patients, a corneal fluorescence detection device was designed based on the autofluorescence properties of AGEs, and corneal fluorescence measurements were performed on 83 volunteers. Multiple linear regression (MLR), extreme gradient boosting (XGBoost), support vector regression (SVR), and back-propagation neural network (BPNN) were used to predict the human AGE content. Physiological parameters which may affect corneal AGE content were collected for a correlation analysis to select the features that had a strong correlation with the corneal concentration of AGEs to participate in modeling. By comparing the predictive effects of the four models in the two cases of a single-input feature and a multi-input feature, it was found that the model with the single-input feature had a better predictive effect. In this case, corneal AGE content was predicted by a single-input SVR model, with the average error rate (AER), mean square error (MSE), and determination coefficient R-squared (R2) of the SVR model calculated as 2.43%, 0.026, and 0.932, respectively. These results proved the potential of our method and device for noninvasive detection of the concentration of AGEs in the cornea

Multidisciplinary Digital Publishing Institute

Factors Influencing Pharmaceutical and Personal Care Product Degradation in Aqueous Solution Using Pulsed Wave Ultrasound

Author: David Diaz-Rivera (1987843)
Linda K. Weavers (1265241)
Ruiyang Xiao (1261182)
Publication venue
Publication date
Field of study

Continuous wave (CW) and pulsed wave (PW) ultrasound were used to degrade pharmaceuticals (carbamazepine, ibuprofen, acetaminophen, sulfamethoxazole, and ciprofloxacin), and personal care products (propyl gallate and diethyl phthalate). These compounds, covering a range of physicochemical properties and a diversity of structures, were explored to determine if and how PW ultrasound is advantageous to CW ultrasound. Degradation rates by PW ultrasound were faster for smaller compounds and slower for larger compounds than that under CW ultrasound. The addition of a bulk solution <sup>•</sup>OH trapping agent, acetic acid, to PPCP solutions indicates that the fraction of degradation occurring in bulk solution is positively correlated with the molar volume of the compound. Overall, smaller PPCP compounds with molar volumes less than 130 mL/mol are able to more readily diffuse to bubble interfaces and are impacted most by pulsing ultrasound

FigShare

AMOM: Adaptive Masking over Masking for Conditional Masked Language Model

Author: Li Juntao
Liu Tie-Yan
Qin Tao
Wu Lijun
Xiao Yisheng
Xu Ruiyang
Zhang Min
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 26/06/2023
Field of study

Transformer-based autoregressive (AR) methods have achieved appealing performance for varied sequence-to-sequence generation tasks, e.g., neural machine translation, summarization, and code generation, but suffer from low inference efficiency. To speed up the inference stage, many non-autoregressive (NAR) strategies have been proposed in the past few years. Among them, the conditional masked language model (CMLM) is one of the most versatile frameworks, as it can support many different sequence generation scenarios and achieve very competitive performance on these tasks. In this paper, we further introduce a simple yet effective adaptive masking over masking strategy to enhance the refinement capability of the decoder and make the encoder optimization easier. Experiments on 3 different tasks (neural machine translation, summarization, and code generation) with 15 datasets in total confirm that our proposed simple method achieves significant performance improvement over the strong CMLM model. Surprisingly, our proposed model yields state-of-the-art performance on neural machine translation (34.62 BLEU on WMT16 EN to RO, 34.82 BLEU on WMT16 RO to EN, and 34.84 BLEU on IWSLT De to En) and even better performance than the AR Transformer on 7 benchmark datasets with at least 2.2x speedup. Our code is available at GitHub

Association for the Advancement of Artificial Intelligence: AAAI Publications