910,191 research outputs found
Automated Stellar Spectral Classification and Parameterization for the Masses
Stellar spectroscopic classification has been successfully automated by a
number of groups. Automated classification and parameterization work best when
applied to a homogeneous data set, and thus these techniques primarily have
been developed for and applied to large surveys. While most ongoing large
spectroscopic surveys target extragalactic objects, many stellar spectra have
been and will be obtained. We briefly summarize past work on automated
classification and parameterization, with emphasis on the work done in our
group. Accurate automated classification in the spectral type domain and
parameterization in the temperature domain have been relatively easy. Automated
parameterization in the metallicity domain, formally outside the MK system, has
also been effective. Due to the subtle effects on the spectrum, automated
classification in the luminosity domain has been somewhat more difficult, but
still successful. In order to extend the use of automated techniques beyond a
few surveys, we present our current efforts at building a web-based automated
stellar spectroscopic classification and parameterization machine. Our proposed
machinery would provide users with MK classifications as well as the
astrophysical parameters of effective temperature, surface gravity, mean
abundance, abundance anomalies, and microturbulence.Comment: 5 pages; to appear in The Garrison Festschrift conference proceeding
Prediction of protein-protein interaction types using association rule based classification
This article has been made available through the Brunel Open Access Publishing Fund - Copyright @ 2009 Park et alBackground: Protein-protein interactions (PPI) can be classified according to their characteristics into, for example obligate or transient interactions. The identification and characterization of these PPI types may help in the functional annotation of new protein complexes and in the prediction of protein interaction partners by knowledge driven approaches. Results: This work addresses pattern discovery of the interaction sites for four different interaction types to characterize and uses them for the prediction of PPI types employing Association Rule Based Classification (ARBC) which includes association rule generation and posterior classification. We incorporated domain information from protein complexes in SCOP proteins and identified 354 domain-interaction sites. 14 interface properties were calculated from amino acid and secondary structure composition and then used to generate a set of association rules characterizing these domain-interaction sites employing the APRIORI algorithm. Our results regarding the classification of PPI types based on a set of discovered association rules shows that the discriminative ability of association rules can significantly impact on the prediction power of classification models. We also showed that the accuracy of the classification can be improved through the use of structural domain information and also the use of secondary structure content. Conclusion: The advantage of our approach is that we can extract biologically significant information from the interpretation of the discovered association rules in terms of understandability and interpretability of rules. A web application based on our method can be found at http://bioinfo.ssu.ac.kr/~shpark/picasso/SHP was supported by the Korea Research Foundation Grant funded by the Korean Government(KRF-2005-214-E00050). JAR has been
supported by the Programme Alβan, the European Union Programme of High level Scholarships for Latin America, scholarship E04D034854CL. SK was supported by Soongsil University Research Fund
Transductive Learning with String Kernels for Cross-Domain Text Classification
For many text classification tasks, there is a major problem posed by the
lack of labeled data in a target domain. Although classifiers for a target
domain can be trained on labeled text data from a related source domain, the
accuracy of such classifiers is usually lower in the cross-domain setting.
Recently, string kernels have obtained state-of-the-art results in various text
classification tasks such as native language identification or automatic essay
scoring. Moreover, classifiers based on string kernels have been found to be
robust to the distribution gap between different domains. In this paper, we
formally describe an algorithm composed of two simple yet effective
transductive learning approaches to further improve the results of string
kernels in cross-domain settings. By adapting string kernels to the test set
without using the ground-truth test labels, we report significantly better
accuracy rates in cross-domain English polarity classification.Comment: Accepted at ICONIP 2018. arXiv admin note: substantial text overlap
with arXiv:1808.0840
- …