Search CORE

7,875 research outputs found

Mathematical models for vulnerable plaques

Author: Bell J.
Breward C.
Chou T.
Fok P.-W.
Haugh J. M.
Li Q.
Rossi L.
Walter A.
Yang X.
Zemlyanova A.
Zhang N.
Publication venue
Publication date: 01/01/2009
Field of study

A plaque is an accumulation and swelling in the artery walls and typically consists of cells, cell debris, lipids, calcium deposits and fibrous connective tissue. A person is likely to have many plaques inside his/her body even if they are healthy. However plaques may become "vulnerable", "high-risk" or "thrombosis-prone" if the person engages in a high-fat diet and does not exercise regularly. In this study group, we proposed two mathematical models to describe plaque growth and rupture. The first model is a mechanical one that approximately treats the plaque as an inflating elastic balloon. In this model, the pressure inside the core increases and then decreases suggesting that plaque stabilization and prevention of rupture is possible. The second model is a biochemical one that focuses on the role of MMPs in degrading the fibrous plaque cap. The cap stress, MMP concentration, plaque volume and cap thickness are coupled together in a system of phenomenological equations. The equations always predict an eventual rupture since the volume, stresses and MMP concentrations generally grow without bound. The main weakness of the model is that many of the important parameters that control the behavior of the plaque are unknown. The two simple models suggested by this group could serve as a springboard for more realistic theoretical studies. But most importantly, we hope they will motivate more experimental work to quantify some of the important mechanical and biochemical properties of vulnerable plaques

Determining Absorption, Emissivity Reduction, and Local Suppression Coefficients inside Sunspots

Author: A.D. Crouch
A.G. Kosovichev
A.G. Kosovichev
B.W. Hindman
B.W. Lites
D.-Y. Chou
D.-Y. Chou
D.-Y. Chou
D.-Y. Chou
D.-Y. Chou
D.C. Braun
D.C. Braun
H.C. Spruit
J. Hollweg
Junwei Zhao
K.-R. Chen
K.V. Parchevsky
M. Goossens
M. Gordovskyy
N. Hurlburt
O. Burtseva
P.H. Scherrer
P.S. Cally
P.S. Cally
R. Leighton
R. Wachter
S.P. Rajaguru
Stathis Ilonidis
T. Bogdan
T. Sakurai
T.L. Duvall Jr.
Y.-Q. Lou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/10/2010
Field of study

The power of solar acoustic waves is reduced inside sunspots mainly due to absorption, emissivity reduction, and local suppression. The coefficients of these power-reduction mechanisms can be determined by comparing time-distance cross-covariances obtained from sunspots and from the quiet Sun. By analyzing 47 active regions observed by SOHO/MDI without using signal filters, we have determined the coefficients of surface absorption, deep absorption, emissivity reduction, and local suppression. The dissipation in the quiet Sun is derived as well. All of the cross-covariances are width corrected to offset the effect of dispersion. We find that absorption is the dominant mechanism of the power deficit in sunspots for short travel distances, but gradually drops to zero at travel distances longer than about 6 degrees. The absorption in sunspot interiors is also significant. The emissivity-reduction coefficient ranges from about 0.44 to 1.00 within the umbra and 0.29 to 0.72 in the sunspot, and accounts for only about 21.5% of the umbra's and 16.5% of the sunspot's total power reduction. Local suppression is nearly constant as a function of travel distance with values of 0.80 and 0.665 for umbrae and whole sunspots respectively, and is the major cause of the power deficit at large travel distances.Comment: 14 pages, 21 Figure

arXiv.org e-Print Archive

Crossref

On the Decoding Failure Rate of QC-MDPC Bit-Flipping Decoders

Author: A Nilsson
Q Guo
RG Gallager
S Heyse
T Chou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/12/2018
Field of study

International audienceQuasi-cyclic moderate density parity check codes allow the design of McEliece-like public-key encryption schemes with compact keys and a security that provably reduces to hard decoding problems for quasi-cyclic codes.In particular, QC-MDPC are among the most promising code-based key encapsulation mechanisms (KEM) that are proposed to the NIST call for standardization of quantum safe cryptography (two proposals, BIKE and QC-MDPC KEM).The first generation of decoding algorithms suffers from a small, but not negligible, decoding failure rate (DFR in the order of 10⁻⁷ to 10⁻¹⁰). This allows a key recovery attack presented by Guo, Johansson, and Stankovski (GJS attack) at Asiacrypt 2016 which exploits a small correlation between the faulty message patterns and the secret key of the scheme, and limits the usage of the scheme to KEMs using ephemeral public keys. It does not impact the interactive establishment of secure communications (e.g. TLS), but the use of static public keys for asynchronous applications (e.g. email) is rendered dangerous.Understanding and improving the decoding of QC-MDPC is thus of interest for cryptographic applications. In particular, finding parameters for which the failure rate is provably negligible (typically as low as 2⁻⁶⁴ or 2⁻¹²⁸) would allow static keys and increase the applicability of the mentioned cryptosystems.We study here a simple variant of bit-flipping decoding, which we call step-by-step decoding. It has a higher DFR but its evolution can be modeled by a Markov chain, within the theoretical framework of Julia Chaulet's PhD thesis. We study two other, more efficient, decoders. One is the textbook algorithm. The other is (close to) the BIKE decoder. For all those algorithms we provide simulation results, and, assuming an evolution similar to the step-by-step decoder, we extrapolate the value of the DFR as a function of the block length. This will give an indication of how much the code parameters must be increased to ensure resistance to the GJS attack

Crossref

INRIA a CCSD electronic archive server

Cryptology ePrint Archive

Multi-Label Multi-Kernel Transfer Learning for Human Protein Subcellular Localization

Author: A Dijk
A Garg
A Hoglund
A Pierleoni
B Boeckmann
D Barrell
Francisco José Esteban
HB Shen
HB Shen
HB Shen
HB Shen
HB Shen
J Platt
K Lee
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
L Rajendran
L Zhu
M Mak
Q Yang
S Altschul
S Mei
S Mei
S Mei
S Pan
Suyu Mei
T Blum
T Tung
T Wu
W Dai
W Dai
W Huang
W Huang
X Xiao
X Xiao
Y Tu
Publication venue: Public Library of Science
Publication date: 13/06/2012
Field of study

Recent years have witnessed much progress in computational modelling for protein subcellular localization. However, the existing sequence-based predictive models demonstrate moderate or unsatisfactory performance, and the gene ontology (GO) based models may take the risk of performance overestimation for novel proteins. Furthermore, many human proteins have multiple subcellular locations, which renders the computational modelling more complicated. Up to the present, there are far few researches specialized for predicting the subcellular localization of human proteins that may reside in multiple cellular compartments. In this paper, we propose a multi-label multi-kernel transfer learning model for human protein subcellular localization (MLMK-TLM). MLMK-TLM proposes a multi-label confusion matrix, formally formulates three multi-labelling performance measures and adapts one-against-all multi-class probabilistic outputs to multi-label learning scenario, based on which to further extends our published work GO-TLM (gene ontology based transfer learning model for protein subcellular localization) and MK-TLM (multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization) for multiplex human protein subcellular localization. With the advantages of proper homolog knowledge transfer, comprehensive survey of model performance for novel protein and multi-labelling capability, MLMK-TLM will gain more practical applicability. The experiments on human protein benchmark dataset show that MLMK-TLM significantly outperforms the baseline model and demonstrates good multi-labelling ability for novel human proteins. Some findings (predictions) are validated by the latest Swiss-Prot database. The software can be freely downloaded at http://soft.synu.edu.cn/upload/msy.rar

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Predicting Anatomical Therapeutic Chemical (ATC) Classification of Drugs by Integrating Chemical-Chemical Interactions and Similarities

Author: DN Georgiou
GA Watson
GP Zhou
GP Zhou
GP Zhou
H Gurulingappa
H Mohabatkar
H Mohabatkar
IW Althaus
J Andraos
J Lin
Kai-Yan Feng
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
Kuo-Chen Chou
L Hu
Lei Chen
M Dunkel
M Esmaeili
M Hattori
M Kanehisa
M Kanehisa
M Kuhn
Ozlem Keskin
P Jaccard
P Wang
Q Gu
R Sharan
T Huang
U Karaoz
Wei-Ming Zeng
WZ Lin
X Xiao
YD Cai
YD Cai
Yu-Dong Cai
ZC Wu
ZC Wu
Publication venue: Public Library of Science
Publication date: 13/04/2012
Field of study

The Anatomical Therapeutic Chemical (ATC) classification system, recommended by the World Health Organization, categories drugs into different classes according to their therapeutic and chemical characteristics. For a set of query compounds, how can we identify which ATC-class (or classes) they belong to? It is an important and challenging problem because the information thus obtained would be quite useful for drug development and utilization. By hybridizing the informations of chemical-chemical interactions and chemical-chemical similarities, a novel method was developed for such purpose. It was observed by the jackknife test on a benchmark dataset of 3,883 drug compounds that the overall success rate achieved by the prediction method was about 73% in identifying the drugs among the following 14 main ATC-classes: (1) alimentary tract and metabolism; (2) blood and blood forming organs; (3) cardiovascular system; (4) dermatologicals; (5) genitourinary system and sex hormones; (6) systemic hormonal preparations, excluding sex hormones and insulins; (7) anti-infectives for systemic use; (8) antineoplastic and immunomodulating agents; (9) musculoskeletal system; (10) nervous system; (11) antiparasitic products, insecticides and repellents; (12) respiratory system; (13) sensory organs; (14) various. Such a success rate is substantially higher than 7% by the random guess. It has not escaped our notice that the current method can be straightforwardly extended to identify the drugs for their 2nd-level, 3rd-level, 4th-level, and 5th-level ATC-classifications once the statistically significant benchmark data are available for these lower levels

Public Library of Science (PLOS)

Crossref

PubMed Central

FigShare

Electrokinetic behavior of two touching inhomogeneous biological cells and colloidal particles: Effects of multipolar interactions

Author: C. F. Chou
F. A. Sauer
F. F. Becker
G. Fuhr
G. Fuhr
G. Q. Gu
G. W. Milton
H. A. Pohl
H. Morgan
J. P. Huang
J. Voldman
J. Yang
K. L. Chan
K. L. Chan
K. Ratanachoo
K. W. Yu
L. Dong
M. P. Hughes
Mikko Karttunen
T. B. Jones
T. C. Choy
T. C. Choy
W. M. Arnold
Publication venue: 'American Physical Society (APS)'
Publication date: 07/11/2003
Field of study

We present a theory to investigate electro-kinetic behavior, namely, electrorotation and dielectrophoresis under alternating current (AC) applied fields for a pair of touching inhomogeneous colloidal particles and biological cells. These inhomogeneous particles are treated as graded ones with physically motivated model dielectric and conductivity profiles. The mutual polarization interaction between the particles yields a change in their respective dipole moments, and hence in the AC electrokinetic spectra. The multipolar interactions between polarized particles are accurately captured by the multiple images method. In the point-dipole limit, our theory reproduces the known results. We find that the multipolar interactions as well as the spatial fluctuations inside the particles can affect the AC electrokinetic spectra significantly.Comment: Revised version with minor changes: References added and discussion extende

arXiv.org e-Print Archive

Repository TU/e

Crossref

Pure OAI Repository

Gene ontology based transfer learning for protein subcellular localization

Author: A Bateman
A Dijk
A Hoglund
A Hoglund
A Pierleoni
C Chen
C Leslie
C Leslie
DH Haft
E Marcotte
EM Zdobnov
F Corpet
FM Li
G Lanckriet
G Schneider
H Ding
H Lin
H Lin
H Liu
H Rangwala
H Shen
HB Shen
HB Shen
HB Shen
HB Shen
HB Shen
J Cedano
J Schultz
J Shen
JD Qiu
JD Qiu
K Chou
K Chou
K Chou
K Hofmann
K Lee
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
L Nanni
M Ashburner
M Esmaeili
M Mak
M Wang
Q Gu
Q Yang
R Apweiler
R Kuang
R Kuang
S Mei
S Pan
Shuigeng Zhou
Suyu Mei
T Blum
T Tung
TK Attwood
W Dai
W Dai
W Huang
W Huang
Wang Fei
X Jiang
X Xiao
XB Zhou
YH Zeng
YS Ding
YS Ding
Z Lei
Z Lu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as <it>GO</it>, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenated the <it>GO </it>terms into a flat binary vector or applied majority-vote based ensemble learning for protein subcellular localization, both of which can not estimate the individual discriminative abilities of the three aspects of gene ontology. Results In this paper, we propose a Gene Ontology Based Transfer Learning Model (<it>GO-TLM</it>) for large-scale protein subcellular localization. The model transfers the signature-based homologous <it>GO </it>terms to the target proteins, and further constructs a reliable learning system to reduce the adverse affect of the potential false <it>GO </it>terms that are resulted from evolutionary divergence. We derive three <it>GO </it>kernels from the three aspects of gene ontology to measure the <it>GO </it>similarity of two proteins, and derive two other spectrum kernels to measure the similarity of two protein sequences. We use simple non-parametric cross validation to explicitly weigh the discriminative abilities of the five kernels, such that the time & space computational complexities are greatly reduced when compared to the complicated semi-definite programming and semi-indefinite linear programming. The five kernels are then linearly merged into one single kernel for protein subcellular localization. We evaluate <it>GO-TLM </it>performance against three baseline models: <it>MultiLoc, MultiLoc-GO </it>and <it>Euk-mPLoc </it>on the benchmark datasets the baseline models adopted. 5-fold cross validation experiments show that <it>GO-TLM </it>achieves substantial accuracy improvement against the baseline models: 80.38% against model <it>Euk-mPLoc </it>67.40% with <it>12.98% </it>substantial increase; 96.65% and 96.27% against model <it>MultiLoc-GO </it>89.60% and 89.60%, with <it>7.05% </it>and <it>6.67% </it>accuracy increase on dataset <it>MultiLoc plant </it>and dataset <it>MultiLoc animal</it>, respectively; 97.14%, 95.90% and 96.85% against model <it>MultiLoc-GO </it>83.70%, 90.10% and 85.70%, with accuracy increase <it>13.44%</it>, <it>5.8% </it>and <it>11.15% </it>on dataset <it>BaCelLoc plant</it>, dataset <it>BaCelLoc fungi </it>and dataset <it>BaCelLoc animal </it>respectively. For <it>BaCelLoc </it>independent sets, <it>GO-TLM </it>achieves 81.25%, 80.45% and 79.46% on dataset <it>BaCelLoc plant holdout</it>, dataset <it>BaCelLoc plant holdout </it>and dataset <it>BaCelLoc animal holdout</it>, respectively, as compared against baseline model <it>MultiLoc-GO </it>76%, 60.00% and 73.00%, with accuracy increase <it>5.25%</it>, <it>20.45% </it>and <it>6.46%</it>, respectively. Conclusions Since direct homology-based <it>GO </it>term transfer may be prone to introducing noise and outliers to the target protein, we design an explicitly weighted kernel learning system (called Gene Ontology Based Transfer Learning Model, <it>GO-TLM</it>) to transfer to the target protein the known knowledge about related homologous proteins, which can reduce the risk of outliers and share knowledge between homologous proteins, and thus achieve better predictive performance for protein subcellular localization. Cross validation and independent test experimental results show that the homology-based <it>GO </it>term transfer and explicitly weighing the <it>GO </it>kernels substantially improve the prediction performance.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Heterogeneous network embedding enabling accurate disease association predictions.

Author: A Bordes
A Grover
A Ruepp
C Hailin
C Shi
C-H Chou
Chunlei Tang
D Botstein
D Wang
DA Peter
H Chen
J Piñero
J Tang
J Tang
L Bottou
Lu Ruan
M Campillos
M Lu
MA Van Driel
Mengjie Guo
O Vanunu
Q Jiang
Q Zou
Q Zou
S Chang
T Keshava Prasad
T Mikolov
U. M Singh-Blom
V McKusick
W Wang
Wei Wang
X Wu
X Zeng
X Zeng
Xiangnan Kong
Y Fan
Y LeCun
Y Shi
Y Sun
Yangyong Zhu
Yun Xiong
Publication venue: eScholarship, University of California
Publication date: 01/12/2019
Field of study

BackgroundIt is significant to identificate complex biological mechanisms of various diseases in biomedical research. Recently, the growing generation of tremendous amount of data in genomics, epigenomics, metagenomics, proteomics, metabolomics, nutriomics, etc., has resulted in the rise of systematic biological means of exploring complex diseases. However, the disparity between the production of the multiple data and our capability of analyzing data has been broaden gradually. Furthermore, we observe that networks can represent many of the above-mentioned data, and founded on the vector representations learned by network embedding methods, entities which are in close proximity but at present do not actually possess direct links are very likely to be related, therefore they are promising candidate subjects for biological investigation.ResultsWe incorporate six public biological databases to construct a heterogeneous biological network containing three categories of entities (i.e., genes, diseases, miRNAs) and multiple types of edges (i.e., the known relationships). To tackle the inherent heterogeneity, we develop a heterogeneous network embedding model for mapping the network into a low dimensional vector space in which the relationships between entities are preserved well. And in order to assess the effectiveness of our method, we conduct gene-disease as well as miRNA-disease associations predictions, results of which show the superiority of our novel method over several state-of-the-arts. Furthermore, many associations predicted by our method are verified in the latest real-world dataset.ConclusionsWe propose a novel heterogeneous network embedding method which can adequately take advantage of the abundant contextual information and structures of heterogeneous network. Moreover, we illustrate the performance of the proposed method on directing studies in biology, which can assist in identifying new hypotheses in biological investigation

Crossref

eScholarship - University of California