Search CORE

3,529 research outputs found

Learning from Multi-View Multi-Way Data via Structural Factorization Machines

Author: Cao Bokai
Ding Hao
He Lifang
Lu Chun-Ta
Yu Philip S.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

Real-world relations among entities can often be observed and determined by different perspectives/views. For example, the decision made by a user on whether to adopt an item relies on multiple aspects such as the contextual information of the decision, the item's attributes, the user's profile and the reviews given by other users. Different views may exhibit multi-way interactions among entities and provide complementary information. In this paper, we introduce a multi-tensor-based approach that can preserve the underlying structure of multi-view data in a generic predictive model. Specifically, we propose structural factorization machines (SFMs) that learn the common latent spaces shared by multi-view tensors and automatically adjust the importance of each view in the predictive model. Furthermore, the complexity of SFMs is linear in the number of parameters, which make SFMs suitable to large-scale problems. Extensive experiments on real-world datasets demonstrate that the proposed SFMs outperform several state-of-the-art methods in terms of prediction accuracy and computational cost.Comment: 10 page

arXiv.org e-Print Archive

Crossref

Developments of Machine Learning Schemes for Dynamic Time-Wrapping-Based Speech Recognition

Author: Chih-Ta Yen
Ing-Jr Ding
Yen-Ming Hsu
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2013
Field of study

Crossref

Directory of Open Access Journals

Optical music recognition of the singer using formant frequency estimation of vocal fold vibration and lip motion with interpolated GMM classifiers

Author: Che-Wei Chang
Chih-Ta Yen
He-Zhong Lin
Ing-Jr Ding
Publication venue: 'JVE International Ltd.'
Publication date: 15/08/2014
Field of study

The main work of this paper is to identify the musical genres of the singer by performing the optical detection of lip motion. Recently, optical music recognition has attracted much attention. Optical music recognition in this study is a type of automatic techniques in information engineering, which can be used to determine the musical style of the singer. This paper proposes a method for optical music recognition where acoustic formant analysis of both vocal fold vibration and lip motion are employed with interpolated Gaussian mixture model (GMM) estimation to perform musical genre classification of the singer. The developed approach for such classification application is called GMM-Formant. Since humming and voiced speech sounds cause periodic vibrations of the vocal folds and then the corresponding motion of the lip, the proposed GMM-Formant firstly operates to acquire the required formant information. Formant information is important acoustic feature data for recognition classification. The proposed GMM-Formant method then uses linear interpolation for combining GMM likelihood estimates and formant evaluation results appropriately. GMM-Formant will effectively adjust the estimated formant feature evaluation outcomes by referring to certain degree of the likelihood score derived from GMM calculations. The superiority and effectiveness of presented GMM-Formant are demonstrated by a series of experiments on musical genre classification of the singer

Journal of Vibroengineering

Identifying Ketamine Responses in Treatment-Resistant Depression Using a Wearable Forehead EEG

Author: Cao Zehong
Chen Mu-Hong
Ding Weiping
Li Cheng-Ta
Lin Chin-Teng
Su Tung-Ping
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

This study explores the responses to ketamine in patients with treatment-resistant depression (TRD) using a wearable forehead electroencephalography (EEG) device. We recruited fifty-five outpatients with TRD who were randomised into three approximately equal-sized groups (A: 0.5 mg/kg ketamine; B: 0.2 mg/kg ketamine; and C: normal saline) under double-blind conditions. The ketamine responses were measured by EEG signals and Hamilton Depression Rating Scale (HDRS) scores. At baseline, responders showed a significantly weaker EEG theta power than did non- responders (p < 0.05). Responders exhibited a higher EEG alpha power but lower EEG alpha asymmetry and theta cordance at post-treatment than at baseline (p < 0.05). Furthermore, our baseline EEG predictor classified responders and non-responders with 81.3 +- 9.5% accuracy, 82.1 +- 8.6% sensitivity and 91.9 +- 7.4% specificity. In conclusion, the rapid antidepressant effects of mixed doses of ketamine are associated with prefrontal EEG power, asymmetry and cordance at baseline and early post-treatment changes. The prefrontal EEG patterns at baseline may account for recognising ketamine effects in advance. Our randomised, double- blind, placebo-controlled study provides information regarding clinical impacts on the potential targets underlying baseline identification and early changes from the effects of ketamine in patients with TRD.Comment: This revised article is submitting to IEEE TBM

arXiv.org e-Print Archive

OPUS - University of Technology Sydney

University of Tasmania Open Access Repository

Developments of Machine Learning Schemes for Dynamic Time-Wrapping-Based Speech Recognition

Author: Chih-Ta Ding Jr
Hsu
Yen-Ming Yen
Publication venue
Publication date: 24/04/2020
Field of study

This paper presents a machine learning scheme for dynamic time-wrapping-based (DTW) speech recognition. Two categories of learning strategies, supervised and unsupervised, were developed for DTW. Two supervised learning methods, incremental learning and priority-rejection learning, were proposed in this study. The incremental learning method is conceptually simple but still suffers from a large database of keywords for matching the testing template. The priority-rejection learning method can effectively reduce the matching time with a slight decrease in recognition accuracy. Regarding the unsupervised learning category, an automatic learning approach, called "most-matching learning, " which is based on priority-rejection learning, was developed in this study. Most-matching learning can be used to intelligently choose the appropriate utterances for system learning. The effectiveness and efficiency of all three proposed machine-learning approaches for DTW were demonstrated using keyword speech recognition experiments

CiteSeerX

$\psi(3770)$ and $B$ meson exclusive decay $B \to \psi(3770) K$ in QCD factorization

Author: Abe)
Adam)
Adams)
Aubert)
Aubert)
Bai)
Ball
Bauer
Beneke
Beneke
Beneke
Bodwin
Buchalla
Ce Meng
Chen
Cheng
Coan)
Colangelo
Ding
Eichten
Eichten
Eidelman)
Gamash)
Gray
Heikkila
Ko
Kuang
Kuang-Ta Chao
Kühn
Liu
Melic
Rosner
Song
Song
Song
Ying-Jia Gao
Yuan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Belle has observed surprisingly copious production of

\psi(3770)

B

meson decay

B\to \psi(3770)K

, of which the rate is comparable to that of

B\to \psi(3686)K

. We study this puzzling process in the QCD factorization approach with the effect of S-D mixing considered. We find that the soft scattering effects in the spectator interactions play an essential role. With a proper parametrization for the higher twist soft end-point singularities associated with kaon, and with the S-D mixing angle

\theta=-12^{\circ}

, the calculated decay rates can be close to the data. Implications of these soft spectator effects to other charmonium production in

B

exclusive decays are also emphasized.Comment: journal versio

arXiv.org e-Print Archive

CiteSeerX

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

CERN Document Server

A Method to Integrate GMM, SVM and DTW for Speaker Recognition

Author: Ding Ing-Jr
Ou Da-Cheng
Yen Chih-Ta
Publication venue: 'Taiwan Association of Engineering and Technology Innovation'
Publication date: 01/01/2014
Field of study

This paper develops an effective and efficient scheme to integrate Gaussian mixture model (GMM), support vector machine (SVM), and dynamic time wrapping (DTW) for automatic speaker recognition. GMM and SVM are two popular classifiers for speaker recognition applications. DTW is a fast and simple template matching method, and it is frequently seen in applications of speech recognition. In this work, DTW does not play a role to perform speech recognition, and it will be employed to be a verifier for verification of valid speakers. The proposed combination scheme of GMM, SVM and DTW, called SVMGMM-DTW, for speaker recognition in this study is a two-phase verification process task including GMM-SVM verification of the first phase and DTW verification of the second phase. By providing a double check to verify the identity of a speaker, it will be difficult for imposters to try to pass the security protection; therefore, the safety degree of speaker recognition systems will be largely increased. A series of experiments designed on door access control applications demonstrated that the superiority of the developed SVMGMM-DTW on speaker recognition accuracy

Taiwan Association of Engineering and Technology Innovation: E-Journals

Possible retardation effects of quark confinement on the meson spectrum

Author: A. B. Henriques
A. Gara
A. Gara
A. Huntley
C. Itzykson
C. Michael
Cong-feng Qiao
D. Barkai
D. Gromes
D. Lurie
D. P. Stanley
E. E. Salpeter
E. E. Salpeter
E. Eichten
E. Eichten
E. Eichten
E. Laermann
H. Pagels
H. Pagels
Han-Wen Huang
J. D. Stack
K. D. Born
K.T. Chao
Kuang-Ta Chao
L. J. Nickisch
L. Montanet
L. P. Fulcher
L. P. Fulcher
M. G. Olsson
N. Byers
P. Ditsas
S. Godfrey
S. N. Gupta
S. Otto
Y. B. Ding
Y.B. Ding
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/1996
Field of study

The reduced Bethe-Salpeter equation with scalar confinement and vector gluon exchange is applied to quark-antiquark bound states. The so called intrinsic flaw of Salpeter equation with static scalar confinement is investigated. The notorious problem of narrow level spacings is found to be remedied by taking into consideration the retardation effect of scalar confinement. Good fit for the mass spectrum of both heavy and light quarkomium states is then obtained.Comment: 14 pages in LaTex for

arXiv.org e-Print Archive

Crossref