225,042 research outputs found
BL-MNE: Emerging Heterogeneous Social Network Embedding through Broad Learning with Aligned Autoencoder
Network embedding aims at projecting the network data into a low-dimensional
feature space, where the nodes are represented as a unique feature vector and
network structure can be effectively preserved. In recent years, more and more
online application service sites can be represented as massive and complex
networks, which are extremely challenging for traditional machine learning
algorithms to deal with. Effective embedding of the complex network data into
low-dimension feature representation can both save data storage space and
enable traditional machine learning algorithms applicable to handle the network
data. Network embedding performance will degrade greatly if the networks are of
a sparse structure, like the emerging networks with few connections. In this
paper, we propose to learn the embedding representation for a target emerging
network based on the broad learning setting, where the emerging network is
aligned with other external mature networks at the same time. To solve the
problem, a new embedding framework, namely "Deep alIgned autoencoder based
eMbEdding" (DIME), is introduced in this paper. DIME handles the diverse link
and attribute in a unified analytic based on broad learning, and introduces the
multiple aligned attributed heterogeneous social network concept to model the
network structure. A set of meta paths are introduced in the paper, which
define various kinds of connections among users via the heterogeneous link and
attribute information. The closeness among users in the networks are defined as
the meta proximity scores, which will be fed into DIME to learn the embedding
vectors of users in the emerging network. Extensive experiments have been done
on real-world aligned social networks, which have demonstrated the
effectiveness of DIME in learning the emerging network embedding vectors.Comment: 10 pages, 9 figures, 4 tables. Full paper is accepted by ICDM 2017,
In: Proceedings of the 2017 IEEE International Conference on Data Mining
A discrete hidden Markov model for SMS spam detection
Many machine learning methods have been applied for short messaging service (SMS) spam detection, including traditional methods such as naive Bayes (NB), vector space model (VSM), and support vector machine (SVM), and novel methods such as long short-term memory (LSTM) and the convolutional neural network (CNN). These methods are based on the well-known bag of words (BoW) model, which assumes documents are unordered collection of words. This assumption overlooks an important piece of information, i.e., word order. Moreover, the term frequency, which counts the number of occurrences of each word in SMS, is unable to distinguish the importance of words, due to the length limitation of SMS. This paper proposes a new method based on the discrete hidden Markov model (HMM) to use the word order information and to solve the low term frequency issue in SMS spam detection. The popularly adopted SMS spam dataset from the UCI machine learning repository is used for performance analysis of the proposed HMM method. The overall performance is compatible with deep learning by employing CNN and LSTM models. A Chinese SMS spam dataset with 2000 messages is used for further performance evaluation. Experiments show that the proposed HMM method is not language-sensitive and can identify spam with high accuracy on both datasets
Diffeomorphic Image Registration with Neural Velocity Field
Diffeomorphic image registration, offering smooth transformation and topology
preservation, is required in many medical image analysis tasks.Traditional
methods impose certain modeling constraints on the space of admissible
transformations and use optimization to find the optimal transformation between
two images. Specifying the right space of admissible transformations is
challenging: the registration quality can be poor if the space is too
restrictive, while the optimization can be hard to solve if the space is too
general. Recent learning-based methods, utilizing deep neural networks to learn
the transformation directly, achieve fast inference, but face challenges in
accuracy due to the difficulties in capturing the small local deformations and
generalization ability. Here we propose a new optimization-based method named
DNVF (Diffeomorphic Image Registration with Neural Velocity Field) which
utilizes deep neural network to model the space of admissible transformations.
A multilayer perceptron (MLP) with sinusoidal activation function is used to
represent the continuous velocity field and assigns a velocity vector to every
point in space, providing the flexibility of modeling complex deformations as
well as the convenience of optimization. Moreover, we propose a cascaded image
registration framework (Cas-DNVF) by combining the benefits of both
optimization and learning based methods, where a fully convolutional neural
network (FCN) is trained to predict the initial deformation, followed by DNVF
for further refinement. Experiments on two large-scale 3D MR brain scan
datasets demonstrate that our proposed methods significantly outperform the
state-of-the-art registration methods.Comment: WACV 202
Eigen-spectrograms: an interpretable feature space for bearing fault diagnosis based on artificial intelligence and image processing
The Intelligent Fault Diagnosis of rotating machinery proposes some
captivating challenges in light of the imminent big data era. Although results
achieved by artificial intelligence and deep learning constantly improve, this
field is characterized by several open issues. Models' interpretation is still
buried under the foundations of data driven science, thus requiring attention
to the development of new opportunities also for machine learning theories.
This study proposes a machine learning diagnosis model, based on intelligent
spectrogram recognition, via image processing. The approach is characterized by
the introduction of the eigen-spectrograms and randomized linear algebra in
fault diagnosis. The eigen-spectrograms hierarchically display inherent
structures underlying spectrogram images. Also, different combinations of
eigen-spectrograms are expected to describe multiple machine health states.
Randomized algebra and eigen-spectrograms enable the construction of a
significant feature space, which nonetheless emerges as a viable device to
explore models' interpretations. The computational efficiency of randomized
approaches further collocates this methodology in the big data perspective and
provides new reading keys of well-established statistical learning theories,
such as the Support Vector Machine (SVM). The conjunction of randomized algebra
and Support Vector Machine for spectrogram recognition shows to be extremely
accurate and efficient as compared to state of the art results.Comment: 14 pages, 13 figure
Recommended from our members
DeepSynergy: predicting anti-cancer drug synergy with Deep Learning.
MOTIVATION: While drug combination therapies are a well-established concept in cancer treatment, identifying novel synergistic combinations is challenging due to the size of combinatorial space. However, computational approaches have emerged as a time- and cost-efficient way to prioritize combinations to test, based on recently available large-scale combination screening data. Recently, Deep Learning has had an impact in many research areas by achieving new state-of-the-art model performance. However, Deep Learning has not yet been applied to drug synergy prediction, which is the approach we present here, termed DeepSynergy. DeepSynergy uses chemical and genomic information as input information, a normalization strategy to account for input data heterogeneity, and conical layers to model drug synergies. RESULTS: DeepSynergy was compared to other machine learning methods such as Gradient Boosting Machines, Random Forests, Support Vector Machines and Elastic Nets on the largest publicly available synergy dataset with respect to mean squared error. DeepSynergy significantly outperformed the other methods with an improvement of 7.2% over the second best method at the prediction of novel drug combinations within the space of explored drugs and cell lines. At this task, the mean Pearson correlation coefficient between the measured and the predicted values of DeepSynergy was 0.73. Applying DeepSynergy for classification of these novel drug combinations resulted in a high predictive performance of an AUC of 0.90. Furthermore, we found that all compared methods exhibit low predictive performance when extrapolating to unexplored drugs or cell lines, which we suggest is due to limitations in the size and diversity of the dataset. We envision that DeepSynergy could be a valuable tool for selecting novel synergistic drug combinations. AVAILABILITY AND IMPLEMENTATION: DeepSynergy is available via www.bioinf.jku.at/software/DeepSynergy. CONTACT: [email protected]. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online
- …