Search CORE

500 research outputs found

Software Crash Study

Author: Zhang Yantao
Publication venue
Publication date: 01/01/2015
Field of study

With the development of personal computers, the user experience has become a vital part of every day work and life of the majority of people on the planet. Hardware components are usually preconfigured and most people tend not to tune them. However, the software environments change much more often because of the configuration by users, the upgrading by vendors and the attacks by hackers. All of those activities can be a factor in the stability of software. In this work, by analyzing a sample of 600,000 machine weeks and around 16,000 applications used on them, we try to derive the relationship between the software environment and the crashes of software. We mainly used association rule mining and analyzed our data on Spark. We also examined the predictability of crashes using the association rules and the difference of predictability between different versions of a same application

Digital Repository at the University of Maryland

Flow induced dissolution of femtoliter surface droplet arrays

Author: Bao Lei
Dyetta Brendan
Lohseb Detlef
Spandan Vamsi
Verziccod Roberto
Yang Yantao
Zhang Xuehua
Publication venue
Publication date: 01/01/2018
Field of study

The dissolution of liquid nanodroplets is a crucial step in many applied processes, such as separation and dispersion in food industry, crystal formation of pharmaceutical products, concentrating and analysis in medical diagnosis, and drug delivery in aerosols. In this work, using both experiments and numerical simulations, we \textit{quantitatively} study the dissolution dynamics of femtoliter surface droplets in a highly ordered array under a uniform flow. Our results show that the dissolution of femoliter droplets strongly depends on their spatial positions relative to the flow direction, drop-to-drop spacing in the array, and the imposed flow rate. In some particular case, the droplet at the edge of the array can dissolve about 30% faster than the ones located near the centre. The dissolution rate of the droplet increases by 60% as the inter-droplet spacing is increased from 2.5

\mu

m to 20

\mu

m. Moreover, the droplets close to the front of flow commence to shrink earlier than those droplets in the center of the array. The average dissolution rate is faster for faster flow. As a result, the dissolution time

T_{i}

decreases with the Reynolds number Re of the flow as

T_{i}\propto Re^{-3/4}

. The experimental results are in good agreement with numerical simulations where the advection-diffusion equation for the concentration field is solved and the concentration gradient on the surface of the drop is computed. The findings suggest potential approaches to manipulate nanodroplet sizes in droplet arrays simply by dissolution controlled by an external flow. The obtained droplets with varying curvatures may serve as templates for generating multifocal microlens in one array

arXiv.org e-Print Archive

Crossref

RMIT Research Repository

ART

University of Twente Research Information

MPG.PuRe

Surgical outcomes of 23-gauge transconjunctival pars plana vitrectomy combined with lensectomy for glaucomatous eyes with extremely shallow anterior chamber and cataract

Author: Shaochong Zhang
Suo Qiu
Xintong Jiang
Yantao Wei
Zhaotian Zhang
Publication venue: Springer Nature
Publication date: 01/01/2016
Field of study

Springer - Publisher Connector

PI: An open-source software package for validation of the SEQUEST result and visualization of mass spectrum

Author: Bu Dongbo
Qiao Yantao
Sun Shiwei
Zhang Hong
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Tandem mass spectrometry (MS/MS) has emerged as the leading method for high- throughput protein identification in proteomics. Recent technological breakthroughs have dramatically increased the efficiency of MS/MS data generation. Meanwhile, sophisticated algorithms have been developed for identifying proteins from peptide MS/MS data by searching available protein sequence databases for the peptide that is most likely to have produced the observed spectrum. The popular SEQUEST algorithm relies on the cross-correlation between the experimental mass spectrum and the theoretical spectrum of a peptide. It utilizes a simplified fragmentation model that assigns a fixed and identical intensity for all major ions and fixed and lower intensity for their neutral losses. In this way, the common issues involved in predicting theoretical spectra are circumvented. In practice, however, an experimental spectrum is usually not similar to its SEQUEST -predicted theoretical one, and as a result, incorrect identifications are often generated. Results Better understanding of peptide fragmentation is required to produce more accurate and sensitive peptide sequencing algorithms. Here, we designed the software PI of novel and exquisite algorithms that make a good use of intensity property of a spectrum. Conclusions We designed the software PI with the novel and effective algorithms which made a good use of intensity property of the spectrum. Experiments have shown that PI was able to validate and improve the results of SEQUEST to a more satisfactory degree.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Microstructural analysis of skeletal muscle force generation during aging.

Author: Basava Ramya R
Chen Jiun-Shyan
He Qizhi
He Xiaolong
Hodgson John
Sinha Shantanu
Sinha Usha
Zhang Yantao
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

Human aging results in a progressive decline in the active force generation capability of skeletal muscle. While many factors related to the changes of morphological and structural properties in muscle fibers and the extracellular matrix (ECM) have been considered as possible reasons for causing age-related force reduction, it is still not fully understood why the decrease in force generation under eccentric contraction (lengthening) is much less than that under concentric contraction (shortening). Biomechanically, it was observed that connective tissues (endomysium) stiffen as ages, and the volume ratio of connective tissues exhibits an age-related increase. However, limited skeletal muscle models take into account the microstructural characteristics as well as the volume fraction of tissue material. This study aims to provide a numerical investigation in which the muscle fibers and the ECM are explicitly represented to allow quantitative assessment of the age-related force reduction mechanism. To this end, a fiber-level honeycomb-like microstructure is constructed and modeled by a pixel-based Reproducing Kernel Particle Method (RKPM), which allows modeling of smooth transition in biomaterial properties across material interfaces. The numerical investigation reveals that the increased stiffness of the passive materials of muscle tissue reduces the force generation capability under concentric contraction while maintains the force generation capability under eccentric contraction. The proposed RKPM microscopic model provides effective means for the cellular-scale numerical investigation of skeletal muscle physiology. NOVELTY STATEMENT: A cellular-scale honeycomb-like microstructural muscle model constructed from a histological cross-sectional image of muscle is employed to study the causal relations between age-associated microstructural changes and age-related force loss using Reproducing Kernel Particle Method (RKPM). The employed RKPM offers an effective means for modeling biological materials based on pixel points in the medical images and allow modeling of smooth transition in the material properties across interfaces. The proposed microstructure-informed muscle model enables quantitative evaluation on how cellular-scale compositions contribute to muscle functionality and explain differences in age-related force changes during concentric, isometric and eccentric contractions

Crossref

eScholarship - University of California

A MapReduce-based nearest neighbor approach for big-data-driven traffic flow prediction

Author: Li Huaqing
Li Yantao
Wang Binfeng
Xia Dawen
Zhang Zili
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

In big-data-driven traffic flow prediction systems, the robustness of prediction performance depends on accuracy and timeliness. This paper presents a new MapReduce-based nearest neighbor (NN) approach for traffic flow prediction using correlation analysis (TFPC) on a Hadoop platform. In particular, we develop a real-time prediction system including two key modules, i.e., offline distributed training (ODT) and online parallel prediction (OPP). Moreover, we build a parallel k-nearest neighbor optimization classifier, which incorporates correlation information among traffic flows into the classification process. Finally, we propose a novel prediction calculation method, combining the current data observed in OPP and the classification results obtained from large-scale historical data in ODT, to generate traffic flow prediction in real time. The empirical study on real-world traffic flow big data using the leave-one-out cross validation method shows that TFPC significantly outperforms four state-of-the-art prediction approaches, i.e., autoregressive integrated moving average, Naïve Bayes, multilayer perceptron neural networks, and NN regression, in terms of accuracy, which can be improved 90.07% in the best case, with an average mean absolute percent error of 5.53%. In addition, it displays excellent speedup, scaleup, and sizeup

Deakin Research Online

An efficient MapReduce-based parallel clustering algorithm for distributed traffic subarea division

Author: Li Yantao
Rong Zhuobo
Wang Binfeng
Xia Dawen
Zhang Zili
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2015
Field of study

Traffic subarea division is vital for traffic system management and traffic network analysis in intelligent transportation systems (ITSs). Since existing methods may not be suitable for big traffic data processing, this paper presents a MapReduce-based Parallel Three-Phase K -Means (Par3PKM) algorithm for solving traffic subarea division problem on a widely adopted Hadoop distributed computing platform. Specifically, we first modify the distance metric and initialization strategy of K -Means and then employ a MapReduce paradigm to redesign the optimized K -Means algorithm for parallel clustering of large-scale taxi trajectories. Moreover, we propose a boundary identifying method to connect the borders of clustering results for each cluster. Finally, we divide traffic subarea of Beijing based on real-world trajectory data sets generated by 12,000 taxis in a period of one month using the proposed approach. Experimental evaluation results indicate that when compared with K -Means, Par2PK-Means, and ParCLARA, Par3PKM achieves higher efficiency, more accuracy, and better scalability and can effectively divide traffic subarea with big taxi trajectory data

Deakin Research Online

Directory of Open Access Journals

An Accurate and Efficient Time Delay Estimation Method of Ultra-High Frequency Signals for Partial Discharge Localization in Substations

Author: Dai Kejie
Jin Yantao
Li Pengfei
Liao Yuan
Liu Yushun
Zhang Tong
Publication venue: UKnowledge
Publication date: 01/10/2018
Field of study

Partial discharge (PD) localization in substations based on the ultra-high frequency (UHF) method can be used to efficiently assess insulation conditions. Localization accuracy is affected by the accuracy of the time delay (TD) estimation, which is critical for PD localization in substations. A review of existing TD estimation methods indicates that there is a need to develop methods that are both accurate and computationally efficient. In this paper, a novel TD estimation method is proposed to improve both accuracy and efficiency. The TD is calculated using an improved cross-correlation algorithm based on full-wavefronts of array UHF signals, which are extracted using the minimum cumulative energy method and zero-crossing points searching methods. The cross-correlation algorithm effectively suppresses the TD error caused by differences between full-wavefronts. To verify the method, a simulated PD source test in a laboratory and a field test in a 220 kV substation were carried out. The results show that the proposed method is accurate even in case of low signal-to-noise ratio, but with greatly improved computational efficiency

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

University of Kentucky

The Devil is the Classifier: Investigating Long Tail Relation Classification with Decoupling Analysis

Author: Chen Huajun
Deng Shumin
Jia Yantao
Yu Haiyang
Yuan Zonggang
Zhang Ningyu
Publication venue
Publication date: 15/09/2020
Field of study

Long-tailed relation classification is a challenging problem as the head classes may dominate the training phase, thereby leading to the deterioration of the tail performance. Existing solutions usually address this issue via class-balancing strategies, e.g., data re-sampling and loss re-weighting, but all these methods adhere to the schema of entangling learning of the representation and classifier. In this study, we conduct an in-depth empirical investigation into the long-tailed problem and found that pre-trained models with instance-balanced sampling already capture the well-learned representations for all classes; moreover, it is possible to achieve better long-tailed classification ability at low cost by only adjusting the classifier. Inspired by this observation, we propose a robust classifier with attentive relation routing, which assigns soft weights by automatically aggregating the relations. Extensive experiments on two datasets demonstrate the effectiveness of our proposed approach. Code and datasets are available in https://github.com/zjunlp/deepke

arXiv.org e-Print Archive