151 research outputs found

    Adversarial Variational Embedding for Robust Semi-supervised Learning

    Full text link
    Semi-supervised learning is sought for leveraging the unlabelled data when labelled data is difficult or expensive to acquire. Deep generative models (e.g., Variational Autoencoder (VAE)) and semisupervised Generative Adversarial Networks (GANs) have recently shown promising performance in semi-supervised classification for the excellent discriminative representing ability. However, the latent code learned by the traditional VAE is not exclusive (repeatable) for a specific input sample, which prevents it from excellent classification performance. In particular, the learned latent representation depends on a non-exclusive component which is stochastically sampled from the prior distribution. Moreover, the semi-supervised GAN models generate data from pre-defined distribution (e.g., Gaussian noises) which is independent of the input data distribution and may obstruct the convergence and is difficult to control the distribution of the generated data. To address the aforementioned issues, we propose a novel Adversarial Variational Embedding (AVAE) framework for robust and effective semi-supervised learning to leverage both the advantage of GAN as a high quality generative model and VAE as a posterior distribution learner. The proposed approach first produces an exclusive latent code by the model which we call VAE++, and meanwhile, provides a meaningful prior distribution for the generator of GAN. The proposed approach is evaluated over four different real-world applications and we show that our method outperforms the state-of-the-art models, which confirms that the combination of VAE++ and GAN can provide significant improvements in semisupervised classification.Comment: 9 pages, Accepted by Research Track in KDD 201

    MDFRCNN: Malware Detection using Faster Region Proposals Convolution Neural Network

    Get PDF
    Technological advancement of smart devices has opened up a new trend: Internet of Everything (IoE), where all devices are connected to the web. Large scale networking benefits the community by increasing connectivity and giving control of physical devices. On the other hand, there exists an increased โ€˜Threatโ€™ of an โ€˜Attackโ€™. Attackers are targeting these devices, as it may provide an easier โ€˜backdoor entry to the usersโ€™ networkโ€™.MALicious softWARE (MalWare) is a major threat to user security. Fast and accurate detection of malware attacks are the sine qua non of IoE, where large scale networking is involved. The paper proposes use of a visualization technique where the disassembled malware code is converted into gray images, as well as use of Image Similarity based Statistical Parameters (ISSP) such as Normalized Cross correlation (NCC), Average difference (AD), Maximum difference (MaxD), Singular Structural Similarity Index Module (SSIM), Laplacian Mean Square Error (LMSE), MSE and PSNR. A vector consisting of gray image with statistical parameters is trained using a Faster Region proposals Convolution Neural Network (F-RCNN) classifier. The experiment results are promising as the proposed method includes ISSP with F-RCNN training. Overall training time of learning the semantics of higher-level malicious behaviors is less. Identification of malware (testing phase) is also performed in less time. The fusion of image and statistical parameter enhances system performance with greater accuracy. The benchmark database from Microsoft Malware Classification challenge has been used to analyze system performance, which is available on the Kaggle website. An overall average classification accuracy of 98.12% is achieved by the proposed method

    ๊ณผ๊ฑฐ ๊ฐ€๊ฒฉ ๋ฐ ํฌ์†Œํ•œ ํŠธ์œ—์„ ์ด์šฉํ•œ ์ฃผ๊ฐ€ ๋ณ€๋™ ์˜ˆ์ธก

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2021.8. ๊ฐ•์œ .Given historical stock prices and sparse tweets mentioning the stocks to predict, how can we precisely predict stock price movement? Many market analysts strive to use a large amount of information for prediction. However, they confront more noise when utilizing larger data for prediction. Thus, existing methods use only historical prices, or those along with a small amount of refined data such as news articles or tweets mentioning target stocks. However, they have the following limitations: 1) using only historical prices gives low performance since they have insufficient information, 2) news articles lack timeliness compared to social medias for predicting stock price movement, and 3) the previous methods using tweets do not handle stocks without tweets mentioning them. In this paper, we propose GLT (Stock Price Movement Prediction using Global and Local Trends of Tweets), an accurate stock price movement prediction method that works without tweets mentioning target stocks. GLT pre-trains both of stock and tweet representations in a self-supervised way. Then, GLT generates global and local tweet trends which represent global public opinion and the local trends related to target stocks, respectively. The trend vectors are combined to accurately predict stock price movement. Experimental results show that GLT provides the state-ofthe-art accuracy for stock price movement prediction.๊ณผ๊ฑฐ ์ฃผ๊ฐ€์™€ ์˜ˆ์ธกํ•  ์ฃผ์‹์„ ์–ธ๊ธ‰ํ•˜๋Š” ํฌ์†Œํ•œ ํŠธ์œ—์ด ์ฃผ์–ด์กŒ์„ ๋•Œ ์ฃผ๊ฐ€ ๋ณ€๋™์„ ์–ด๋–ป๊ฒŒ ์ •ํ™•ํ•˜๊ฒŒ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ์„๊นŒ? ๋งŽ์€ ์‹œ์žฅ ๋ถ„์„๊ฐ€๋“ค์€ ์˜ˆ์ธก์„ ์œ„ํ•ด ๋งŽ์€ ์–‘์˜ ์ • ๋ณด๋ฅผ ์‚ฌ์šฉํ•˜๋ ค๊ณ  ๋…ธ๋ ฅํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ์˜ˆ์ธก์„ ์œ„ํ•ด ๋” ๋งŽ์€ ์–‘์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ• ์ˆ˜ ๋ก ๋” ๋งŽ์€ ๋…ธ์ด์ฆˆ์— ์ง๋ฉดํ•ฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ๊ธฐ์กด ๋ฐฉ๋ฒ•์€ ๊ณผ๊ฑฐ ์ฃผ์‹ ๊ฐ€๊ฒฉ๋งŒ ์‚ฌ์šฉํ•˜๊ฑฐ๋‚˜ ๋‰ด์Šค ๊ธฐ์‚ฌ ํ˜น์€ ๋Œ€์ƒ ์ฃผ์‹์„ ์–ธ๊ธ‰ํ•˜๋Š” ํŠธ์œ—๊ณผ ๊ฐ™์€ ์†Œ๋Ÿ‰์˜ ์ •์ œ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉ ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋“ค์€ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ํ•œ๊ณ„๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค: 1) ๊ณผ๊ฑฐ ์ฃผ์‹ ๊ฐ€๊ฒฉ๋งŒ ์‚ฌ์šฉํ•˜๋ฉด ์ •๋ณด๊ฐ€ ๋ถ€์กฑํ•˜์—ฌ ์„ฑ๋Šฅ์ด ์ €ํ•˜๋˜๊ณ , 2) ๋‰ด์Šค ๊ธฐ์‚ฌ๋Š” ์ฃผ๊ฐ€ ๋ณ€๋™์„ ์˜ˆ์ธกํ•˜๋Š” ๋ฐ ์†Œ์…œ ๋ฏธ๋””์–ด์— ๋น„ํ•ด ์ ์‹œ์„ฑ์ด ๋ถ€์กฑํ•˜๋ฉฐ, 3) ํŠธ์œ—์„ ์‚ฌ์šฉํ•˜๋Š” ์ด์ „ ๋ฐฉ๋ฒ•๋“ค์€ ํŠธ์œ—์ด ์–ธ๊ธ‰ํ•˜์ง€ ์•Š์€ ์ฃผ์‹๋“ค์„ ์ฒ˜๋ฆฌํ•˜์ง€ ๋ชปํ•ฉ๋‹ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๋ชฉํ‘œ ์ฃผ์‹์„ ์–ธ๊ธ‰ํ•˜๋Š” ํŠธ์œ— ์—†์ด๋„ ์ž‘๋™ํ•˜๋Š” ์ •ํ™•ํ•œ ์ฃผ๊ฐ€ ๋ณ€๋™ ์˜ˆ์ธก ๋ฐฉ๋ฒ•์ธ GLT (Stock Price Movement Prediction using Global and Local Trends of Tweets)๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. GLT๋Š” ์ž๊ฐ€ ๊ฐ๋… ๋ฐฉ์‹์„ ํ™œ์šฉํ•˜์—ฌ ์ฃผ์‹ ๋ฐ ํŠธ์œ— ์ž„๋ฒ ๋”ฉ์„ ์‚ฌ์ „ ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฐ ๋‹ค์Œ GLT๋Š” ๊ฐ๊ฐ ๊ธ€๋กœ๋ฒŒ ์—ฌ๋ก ๊ณผ ๋ชฉํ‘œ ์ฃผ์‹๊ณผ ๊ด€๋ จ๋œ ํŠธ๋ Œ๋“œ ๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ๊ธ€๋กœ๋ฒŒ ๋ฐ ๋กœ์ปฌ ํŠธ์œ— ํŠธ๋ Œ๋“œ๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ์ถ”์„ธ ๋ฒกํ„ฐ๋“ค์€ ์ฃผ๊ฐ€ ๋ณ€๋™์„ ์ •ํ™•ํ•˜๊ฒŒ ์˜ˆ์ธกํ•˜๋Š”๋ฐ ๊ธฐ์—ฌํ•ฉ๋‹ˆ๋‹ค. ์‹คํ—˜ ๊ฒฐ๊ณผ์— ๋”ฐ๋ฅด๋ฉด GLT๋Š” ์ฃผ๊ฐ€ ๋ณ€๋™ ์˜ˆ ์ธก์—์„œ ์ตœ๊ณ  ์ˆ˜์ค€์˜ ์ •ํ™•๋„๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.I. Introduction 1 II. Related Work 4 2.1 Stock Price Movement Prediction 4 2.2 Attentive LSTM 4 III. Proposed Method 6 3.1 Overview 6 3.2 Self-supervised Pre-training for Representing Tweets and Stocks 7 3.3 Global Tweet Trend 10 3.4 Local Tweet Trend 11 3.5 Stock Movement Prediction 11 IV. Experiment 13 4.1 Experiment Setting 13 4.2 Classification Performance 15 4.3 Ablation Study 16 4.4 Hyperparameter Robustness 16 V. Conclusion 18 References 19 Abstract in Korean 23์„

    Photonic integrated reconfigurable linear processors as neural network accelerators

    Get PDF
    Reconfigurable linear optical processors can be used to perform linear transformations and are instrumental in effectively computing matrixโ€“vector multiplications required in each neural network layer. In this paper, we characterize and compare two thermally tuned photonic integrated processors realized in silicon-on-insulator and silicon nitride platforms suited for extracting feature maps in convolutional neural networks. The reduction in bit resolution when crossing the processor is mainly due to optical losses, in the range 2.3โ€“3.3 for the silicon-on-insulator chip and in the range 1.3โ€“2.4 for the silicon nitride chip. However, the lower extinction ratio of Machโ€“Zehnder elements in the latter platform limits their expressivity (i.e., the capacity to implement any transformation) to 75%, compared to 97% of the former. Finally, the silicon-on-insulator processor outperforms the silicon nitride one in terms of footprint and energy efficiency

    Spiking Neural Networks for Computational Intelligence:An Overview

    Get PDF
    Deep neural networks with rate-based neurons have exhibited tremendous progress in the last decade. However, the same level of progress has not been observed in research on spiking neural networks (SNN), despite their capability to handle temporal data, energy-efficiency and low latency. This could be because the benchmarking techniques for SNNs are based on the methods used for evaluating deep neural networks, which do not provide a clear evaluation of the capabilities of SNNs. Particularly, the benchmarking of SNN approaches with regards to energy efficiency and latency requires realization in suitable hardware, which imposes additional temporal and resource constraints upon ongoing projects. This review aims to provide an overview of the current real-world applications of SNNs and identifies steps to accelerate research involving SNNs in the future

    Towards Evaluating User Profiling Methods Based on Explicit Ratings on Item Features

    Get PDF
    In order to improve the accuracy of recommendations, many recommender systems nowadays use side information beyond the user rating matrix, such as item content. These systems build user profiles as estimates of users' interest on content (e.g., movie genre, director or cast) and then evaluate the performance of the recommender system as a whole e.g., by their ability to recommend relevant and novel items to the target user. The user profile modelling stage, which is a key stage in content-driven RS is barely properly evaluated due to the lack of publicly available datasets that contain user preferences on content features of items. To raise awareness of this fact, we investigate differences between explicit user preferences and implicit user profiles. We create a dataset of explicit preferences towards content features of movies, which we release publicly. We then compare the collected explicit user feature preferences and implicit user profiles built via state-of-the-art user profiling models. Our results show a maximum average pairwise cosine similarity of 58.07\% between the explicit feature preferences and the implicit user profiles modelled by the best investigated profiling method and considering movies' genres only. For actors and directors, this maximum similarity is only 9.13\% and 17.24\%, respectively. This low similarity between explicit and implicit preference models encourages a more in-depth study to investigate and improve this important user profile modelling step, which will eventually translate into better recommendations
    • โ€ฆ
    corecore