Search CORE

360 research outputs found

Recommended from our members

Formulation of HPV Dry Powder Wafers for Sublingual Vaccination

Author: Chen David Jiyuan
Publication venue: CU Scholar
Publication date: 09/04/2012
Field of study

Vaccination has been used to prevent diseases since the early 1800s.Throughout the history of vaccination, attention has only been paid to the type of virus that can be administered instead of the method of vaccination. Traditional needle-dependent vaccination requires sanitation and refrigeration which is not available in developing countries such as India, where viruses such as the human papillomavirus (HPV) are very prevalent. The goal of this study was to develop a sublingual wafer formulation that is robust and heat-stable. The sublingual wafer delivery method is expected to lower the cost of vaccination because of the ease of transportation and storage. Dry powder vaccine formulations were prepared using the Sievers et al patented Carbon Dioxide Assisted Nebulization with a Bubble Dryer (CAN- BD) drying process. Excipients commonly used in the pharmaceutical industry for oral tablet formation were tested for wafer formation with the CAN-BD processed protein powder. Desired wafer properties were tested, i.e. binding capabilities, target dissociation time

CU Scholar Institutional Repository

Concept learning of text documents

Author: An Jiyuan
Chen Yi-Ping Phoebe
Publication venue: IEEE Xplore
Publication date: 01/01/2004
Field of study

Concept learning of text documents can be viewed as the problem of acquiring the definition of a general category of documents. To definite the category of a text document, the Conjunctive of keywords is usually be used. These keywords should be fewer and comprehensible. A naïve method is enumerating all combinations of keywords to extract suitable ones. However, because of the enormous number of keyword combinations, it is impossible to extract the most relevant keywords to describe the categories of documents by enumerating all possible combinations of keywords. Many heuristic methods are proposed, such as GA-base, immune based algorithm. In this work, we introduce pruning power technique and propose a robust enumeration-based concept learning algorithm. Experimental results show that the rules produce by our approach has more comprehensible and simplicity than by other methods. <br /

Deakin Research Online

Finding short patterns to classify text documents

Author: An Jiyuan
Chen Yi-Ping Phoebe
Publication venue: IEEE Xplore
Publication date: 01/01/2006
Field of study

Many classification methods have been proposed to find patterns in text documents. However, according to Occam\u27s razor principle, "the explanation of any phenomenon should make as few assumptions as possible", short patterns usually have more explainable and meaningful for classifying text documents. In this paper, we propose a depth-first pattern generation algorithm, which can find out short patterns from text document more effectively, comparing with breadth-first algorithm <br /

Deakin Research Online

Finding coverage using incremental attribute combinations

Author: An Jiyuan
Chen Yi-Ping Phoebe
Publication venue: ICIC International
Publication date: 01/05/2009
Field of study

Coverage is the range that covers only positive samples in attribute (or feature) space. Finding coverage is the kernel problem in induction algorithms because of the fact that coverage can be used as rules to describe positive samples. To reflect the characteristic of training samples, it is desirable that the large coverage that cover more positive samples. However, it is difficult to find large coverage, because the attribute space is usually very high dimensionality. Many heuristic methods such as ID3, AQ and CN2 have been proposed to find large coverage. A robust algorithm also has been proposed to find the largest coverage, but the complexities of time and space are costly when the dimensionality becomes high. To overcome this drawback, this paper proposes an algorithm that adopts incremental feature combinations to effectively find the largest coverage. In this algorithm, the irrelevant coverage can be pruned away at early stages because potentially large coverage can be found earlier. Experiments show that the space and time needed to find the largest coverage has been significantly reduced.<br /

Deakin Research Online

Finding rule groups to classify high dimensional gene expression datasets

Author: An Jiyuan
Chen Yi-Ping Phoebe
Publication venue: IEEE Xplore
Publication date: 01/01/2006
Field of study

Microarray data provides quantitative information about the transcription profile of cells. To analyze microarray datasets, methodology of machine learning has increasingly attracted bioinformatics researchers. Some approaches of machine learning are widely used to classify and mine biological datasets. However, many gene expression datasets are extremely high dimensionality, traditional machine learning methods can not be applied effectively and efficiently. This paper proposes a robust algorithm to find out rule groups to classify gene expression datasets. Unlike the most classification algorithms, which select dimensions (genes) heuristically to form rules groups to identify classes such as cancerous and normal tissues, our algorithm guarantees finding out best-k dimensions (genes), which are most discriminative to classify samples in different classes, to form rule groups for the classification of expression datasets. Our experiments show that the rule groups obtained by our algorithm have higher accuracy than that of other classification approaches <br /

Deakin Research Online

Keyword extraction for text categorization

Author: An Jiyuan
Chen Yi-Ping Phoebe
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2005
Field of study

Text categorization (TC) is one of the main applications of machine learning. Many methods have been proposed, such as Rocchio method, Naive bayes based method, and SVM based text classification method. These methods learn labeled text documents and then construct a classifier. A new coming text document\u27s category can be predicted. However, these methods do not give the description of each category. In the machine learning field, there are many concept learning algorithms, such as, ID3 and CN2. This paper proposes a more robust algorithm to induce concepts from training examples, which is based on enumeration of all possible keywords combinations. Experimental results show that the rules produced by our approach have more precision and simplicity than that of other methods.<br /

Deakin Research Online

Exploring Asymmetric Tunable Blind-Spots for Self-supervised Denoising in Real-World Scenarios

Author: Chen Shiyan
Huang Tiejun
Yu Zhaofei
Zhang Jiyuan
Publication venue
Publication date: 29/03/2023
Field of study

Self-supervised denoising has attracted widespread attention due to its ability to train without clean images. However, noise in real-world scenarios is often spatially correlated, which causes many self-supervised algorithms based on the pixel-wise independent noise assumption to perform poorly on real-world images. Recently, asymmetric pixel-shuffle downsampling (AP) has been proposed to disrupt the spatial correlation of noise. However, downsampling introduces aliasing effects, and the post-processing to eliminate these effects can destroy the spatial structure and high-frequency details of the image, in addition to being time-consuming. In this paper, we systematically analyze downsampling-based methods and propose an Asymmetric Tunable Blind-Spot Network (AT-BSN) to address these issues. We design a blind-spot network with a freely tunable blind-spot size, using a large blind-spot during training to suppress local spatially correlated noise while minimizing damage to the global structure, and a small blind-spot during inference to minimize information loss. Moreover, we propose blind-spot self-ensemble and distillation of non-blind-spot network to further improve performance and reduce computational complexity. Experimental results demonstrate that our method achieves state-of-the-art results while comprehensively outperforming other self-supervised methods in terms of image texture maintaining, parameter count, computation cost, and inference time

arXiv.org e-Print Archive

Causal-Based Supervision of Attention in Graph Neural Network: A Better and Simpler Choice towards Powerful Attention

Author: Chen Jiyuan
Du Lun
Fu Qiang
Han Shi
Song Xuan
Wang Hongjun
Publication venue
Publication date: 18/07/2023
Field of study

Recent years have witnessed the great potential of attention mechanism in graph representation learning. However, while variants of attention-based GNNs are setting new benchmarks for numerous real-world datasets, recent works have pointed out that their induced attentions are less robust and generalizable against noisy graphs due to lack of direct supervision. In this paper, we present a new framework which utilizes the tool of causality to provide a powerful supervision signal for the learning process of attention functions. Specifically, we estimate the direct causal effect of attention to the final prediction, and then maximize such effect to guide attention attending to more meaningful neighbors. Our method can serve as a plug-and-play module for any canonical attention-based GNNs in an end-to-end fashion. Extensive experiments on a wide range of benchmark datasets illustrated that, by directly supervising attention functions, the model is able to converge faster with a clearer decision boundary, and thus yields better performances

arXiv.org e-Print Archive

Concept Learning of Text Documents

Author: Jiyuan An
Yi-Ping Phoebe Chen
Publication venue
Publication date: 05/03/2020
Field of study

Abstrac

CiteSeerX