Search CORE

100 research outputs found

An Efficient Source Model Selection Framework in Model Databases

Author: Chen Lu
Du Yuntao
Gao Yunjun
Yang Keyu
Zhao Minjun
Publication venue
Publication date: 24/11/2021
Field of study

With the explosive increase of big data, training a Machine Learning (ML) model becomes a computation-intensive workload, which would take days or even weeks. Thus, reusing an already trained model has received attention, which is called transfer learning. Transfer learning avoids training a new model from scratch by transferring knowledge from a source task to a target task. Existing transfer learning methods mostly focus on how to improve the performance of the target task through a specific source model, and assume that the source model is given. Although many source models are available, it is difficult for data scientists to select the best source model for the target task manually. Hence, how to efficiently select a suitable source model in a model database for model reuse is an interesting but unsolved problem. In this paper, we propose SMS, an effective, efficient, and flexible source model selection framework. SMS is effective even when the source and target datasets have significantly different data labels, and is flexible to support source models with any type of structure, and is efficient to avoid any training process. For each source model, SMS first vectorizes the samples in the target dataset into soft labels by directly applying this model to the target dataset, then uses Gaussian distributions to fit for clusters of soft labels, and finally measures the distinguishing ability of the source model using Gaussian mixture-based metric. Moreover, we present an improved SMS (I-SMS), which decreases the output number of the source model. I-SMS can significantly reduce the selection time while retaining the selection performance of SMS. Extensive experiments on a range of practical model reuse workloads demonstrate the effectiveness and efficiency of SMS

arXiv.org e-Print Archive

SparDL: Distributed Deep Learning Training with Efficient Sparse Communication

Author: Chen Lu
Gao Yunjun
Liu Qing
Mao Yuren
Yin Yichen
Zhao Minjun
Publication venue
Publication date: 23/02/2024
Field of study

Top-k sparsification has recently been widely used to reduce the communication volume in distributed deep learning. However, due to the Sparse Gradient Accumulation (SGA) dilemma, the performance of top-k sparsification still has limitations. Recently, a few methods have been put forward to handle the SGA dilemma. Regrettably, even the state-of-the-art method suffers from several drawbacks, e.g., it relies on an inefficient communication algorithm and requires extra transmission steps. Motivated by the limitations of existing methods, we propose a novel efficient sparse communication framework, called SparDL. Specifically, SparDL uses the Spar-Reduce-Scatter algorithm, which is based on an efficient Reduce-Scatter model, to handle the SGA dilemma without additional communication operations. Besides, to further reduce the latency cost and improve the efficiency of SparDL, we propose the Spar-All-Gather algorithm. Moreover, we propose the global residual collection algorithm to ensure fast convergence of model training. Finally, extensive experiments are conducted to validate the superiority of SparDL

arXiv.org e-Print Archive

Large Language Models are reasoners with Self-Verification

Author: He Shizhu
Li Bin
Liu Kang
Weng Yixuan
Xia Fei
Zhao Jun
Zhu Minjun
Publication venue
Publication date: 29/03/2023
Field of study

When a large language model (LLM) performs complex reasoning by chain of thought (CoT), it can be highly sensitive to individual mistakes. We have had to train verifiers to address this issue. As we all know, after human inferring a conclusion, they often check it by re-verifying it, which can avoid some mistakes. We propose a new method called self-verification that uses the conclusion of the CoT as a condition to build a new sample and asks the LLM to re-predict the original conditions which be masked. We calculate an explainable verification score based on the accuracy. This method can improve the accuracy of multiple arithmetics and logical reasoning datasets when using few-shot learning. we have demonstrated that LLMs can conduct explainable self-verification of their own conclusions and achieve competitive reasoning performance. Extensive experimentals have demonstrated that our method can help multiple large language models with self-verification can avoid interference from incorrect CoT. Code is available at \url{https://github.com/WENGSYX/Self-Verification

arXiv.org e-Print Archive

Very low turn-on voltage and high brightness tris-(8-hydroxyquinoline) aluminum-based organic light-emitting diodes with a MoO(x) p-doping layer

Author: Chen Wen
Liu Minjun
Meng Yanlong
Tao Chen
Wu Fengmin
Xie Guohua
Xue Qin
Zhang Dandan
Zhao Yi
Publication venue: 'AIP Publishing'
Publication date: 04/03/2008
Field of study

University of Queensland eSpace

Metabolic profiling reveals therapeutic effects of Herba Cistanches in an animal model of hydrocortisone-induced 'kidney-deficiency syndrome'

Author: Chen Minjun
Jia Wei
Jiang Jian
Li Xin
Qiu Yunping
Su Mingming
Xie Guoxiang
Zhao Aihua
Zhou Mingmei
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Genome-Wide Expression Analysis in Down Syndrome: Insight into Immunodeficiency

Author: Bai Yun
Chen Qimin
Fu Lijun
Jin Lei
Li Chong
Wang Shengyue
Xiao Huasheng
Yang Minjun
Zhao Guoping
Publication venue
Publication date: 01/01/2012
Field of study

Down syndrome (DS) is caused by triplication of Human chromosome 21 (Hsa21) and associated with an array of deleterious phenotypes, including mental retardation, heart defects and immunodeficiency. Genome-wide expression patterns of uncultured peripheral blood cells are useful to understanding of DS-associated immune dysfunction. We used a Human Exon microarray to characterize gene expression in uncultured peripheral blood cells derived from DS individuals and age-matched controls from two age groups: neonate (N) and child (C). A total of 174 transcript clusters (gene-level) with eight located on Hsa21 in N group and 383 transcript clusters including 56 on Hsa21 in C group were significantly dysregulated in DS individuals. Microarray data were validated by quantitative polymerase chain reaction. Functional analysis revealed that the dysregulated genes in DS were significantly enriched in two and six KEGG pathways in N and C group, respectively. These pathways included leukocyte trans-endothelial migration, B cell receptor signaling pathway and primary immunodeficiency, etc., which causally implicated dysfunctional immunity in DS. Our results provided a comprehensive picture of gene expression patterns in DS at the two developmental stages and pointed towards candidate genes and molecular pathways potentially associated with the immune dysfunction in DS

Philadelphia College of Osteopathic Medicine: DigitalCommons@PCOM

Identification of Dysregulated Complement Activation Pathways Driven by N-Glycosylation Alterations in T2D Patients

Author: Bo Meng
Dan Yan
Dan Yan
Dan Yan
Man Wang
Man Wang
Minjun He
Xiang Fang
Xinhua Dai
Yang Zhao
Ying Gao
You Jiang
Zhichao Xue
Publication venue: 'Frontiers Media SA'
Publication date: 01/06/2021
Field of study

Diabetes has become a major public health concern worldwide, most of which are type 2 diabetes (T2D). The diagnosis of T2D is commonly based on plasma glucose levels, and there are no reliable clinical biomarkers available for early detection. Recent advances in proteome technologies offer new opportunity for the understanding of T2D; however, the underlying proteomic characteristics of T2D have not been thoroughly investigated yet. Here, using proteomic and glycoproteomic profiling, we provided a comprehensive landscape of molecular alterations in the fasting plasma of the 24 Chinese participants, including eight T2D patients, eight prediabetic (PDB) subjects, and eight healthy control (HC) individuals. Our analyses identified a diverse set of potential biomarkers that might enhance the efficiency and accuracy based on current existing biological indicators of (pre)diabetes. Through integrative omics analysis, we showed the capability of glycoproteomics as a complement to proteomics or metabolomics, to provide additional insights into the pathogenesis of (pre)diabetes. We have newly identified systemic site-specific N-glycosylation alterations underlying T2D patients in the complement activation pathways, including decreased levels of N-glycopeptides from C1s, MASP1, and CFP proteins, and increased levels of N-glycopeptides from C2, C4, C4BPA, C4BPB, and CFH. These alterations were not observed at proteomic levels, suggesting new opportunities for the diagnosis and treatment of this disease. Our results demonstrate a great potential role of glycoproteomics in understanding (pre)diabetes and present a new direction for diabetes research which deserves more attention

Directory of Open Access Journals

Causative agent distribution and antibiotic therapy assessment among adult patients with community acquired pneumonia in Chinese urban population

Abstract Background Knowledge of predominant microbial patterns in community-acquired pneumonia (CAP) constitutes the basis for initial decisions about empirical antimicrobial treatment, so a prospective study was performed during 2003–2004 among CAP of adult Chinese urban populations. Methods Qualified patients were enrolled and screened for bacterial, atypical, and viral pathogens by sputum and/or blood culturing, and by antibody seroconversion test. Antibiotic treatment and patient outcome were also assessed. Results Non-viral pathogens were found in 324/610 (53.1%) patients among whom <it>M. pneumoniae </it>was the most prevalent (126/610, 20.7%). Atypical pathogens were identified in 62/195 (31.8%) patients carrying bacterial pathogens. Respiratory viruses were identified in 35 (19%) of 184 randomly selected patients with adenovirus being the most common (16/184, 8.7%). The nonsusceptibility of <it>S. pneumoniae </it>to penicillin and azithromycin was 22.2% (Resistance (R): 3.2%, Intermediate (I): 19.0%) and 79.4% (R: 79.4%, I: 0%), respectively. Of patients (312) from whom causative pathogens were identified and antibiotic treatments were recorded, clinical cure rate with β-lactam antibiotics alone and with combination of a β-lactam plus a macrolide or with fluoroquinolones was 63.7% (79/124) and 67%(126/188), respectively. For patients having mixed <it>M. pneumoniae </it>and/or <it>C. pneumoniae </it>infections, a better cure rate was observed with regimens that are active against atypical pathogens (e.g. a β-lactam plus a macrolide, or a fluoroquinolone) than with β-lactam alone (75.8% vs. 42.9%, <it>p </it>= 0.045). Conclusion In Chinese adult CAP patients, <it>M. pneumoniae </it>was the most prevalent with mixed infections containing atypical pathogens being frequently observed. With <it>S. pneumoniae</it>, the prevalence of macrolide resistance was high and penicillin resistance low compared with data reported in other regions.</p

Crossref

Directory of Open Access Journals

PubMed Central

Genome Sequence of the Versatile Fish Pathogen Edwardsiella tarda Provides Insights into its Adaptation to Broad Host Ranges and Intracellular Niches

Author: A Siddique
AB Blanc-Potard
AD Phillips
AL Delcher
AV Lukashin
B Christen
B Wortham
BR Mohanty
C-H Chiu
CM Waters
D Schnappinger
DA Low
EA Miao
EP Nawrocki
F Padros
GS Chilcott
Guoping Zhao
H Ogata
H Tettelin
Haizhen Wu
HH Jarboe
Huajun Zheng
I Sommerset
J Bendtsen
J Bockemuhl
J Okuda
J Wuyts
JA Hoch
JD Thompson
JD Wong
JF Xiao
JF Xiao
Jingfan Xiao
JL Gardy
JM Janda
K Sun
KD Smith
Lili Xu
M Du
M Margulies
M Wu
M Zhang
Minjun Yang
ML Williams
Niyaz Ahmed
O Emanuelsson
P Sahoo
PP Gardner
PS Chain
PS Rao
PS Srinivasa Rao
Q Tu
Qin Liu
Qiyao Wang
QY Wang
RG Gerlach
RL Tatusov
RL Tatusov
RL Thune
RM Newman
RN Moore
RW Rose
S Ahmad
S Backert
S Guindon
S Schneiker
S Waack
SF Altschul
Shengyue Wang
SHM Ling
SL Abbott
SR Partridge
T Morohoshi
T Sakai
T Sakai
T Tanabe
TM Lowe
UA Ochsner
V Sperandio
X Wang
Xin Wang
Y Zhang
YP Laine MJ Zhang
YP Tan
Yuanxing Zhang
Yuanzhi Lv
Publication venue: Public Library of Science
Publication date: 01/10/2009
Field of study

BACKGROUND:Edwardsiella tarda is the etiologic agent of edwardsiellosis, a devastating fish disease prevailing in worldwide aquaculture industries. Here we describe the complete genome of E. tarda, EIB202, a highly virulent and multi-drug resistant isolate in China. METHODOLOGY/PRINCIPAL FINDINGS:E. tarda EIB202 possesses a single chromosome of 3,760,463 base pairs containing 3,486 predicted protein coding sequences, 8 ribosomal rRNA operons, and 95 tRNA genes, and a 43,703 bp conjugative plasmid harboring multi-drug resistant determinants and encoding type IV A secretion system components. We identified a full spectrum of genetic properties related to its genome plasticity such as repeated sequences, insertion sequences, phage-like proteins, integrases, recombinases and genomic islands. In addition, analysis also indicated that a substantial proportion of the E. tarda genome might be devoted to the growth and survival under diverse conditions including intracellular niches, with a large number of aerobic or anaerobic respiration-associated proteins, signal transduction proteins as well as proteins involved in various stress adaptations. A pool of genes for secretion systems, pili formation, nonfimbrial adhesions, invasions and hemagglutinins, chondroitinases, hemolysins, iron scavenging systems as well as the incomplete flagellar biogenesis might feature its surface structures and pathogenesis in a fish body. CONCLUSION/SIGNIFICANCE:Genomic analysis of the bacterium offered insights into the phylogeny, metabolism, drug-resistance, stress adaptation, and virulence characteristics of this versatile pathogen, which constitutes an important first step in understanding the pathogenesis of E. tarda to facilitate construction of a practical effective vaccine used for combating fish edwardsiellosis

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central