1,542 research outputs found

    Hacking Smart Machines with Smarter Ones: How to Extract Meaningful Data from Machine Learning Classifiers

    Full text link
    Machine Learning (ML) algorithms are used to train computers to perform a variety of complex tasks and improve with experience. Computers learn how to recognize patterns, make unintended decisions, or react to a dynamic environment. Certain trained machines may be more effective than others because they are based on more suitable ML algorithms or because they were trained through superior training sets. Although ML algorithms are known and publicly released, training sets may not be reasonably ascertainable and, indeed, may be guarded as trade secrets. While much research has been performed about the privacy of the elements of training sets, in this paper we focus our attention on ML classifiers and on the statistical information that can be unconsciously or maliciously revealed from them. We show that it is possible to infer unexpected but useful information from ML classifiers. In particular, we build a novel meta-classifier and train it to hack other classifiers, obtaining meaningful information about their training sets. This kind of information leakage can be exploited, for example, by a vendor to build more effective classifiers or to simply acquire trade secrets from a competitor's apparatus, potentially violating its intellectual property rights

    Survey of quantitative investment strategies in the Russian stock market : Special interest in tactical asset allocation

    Get PDF
    Russia’s financial markets have been an uncharted area when it comes to exploring the performance of investment strategies based on modern portfolio theory. In this thesis, we focus on the country’s stock market and study whether profitable investments can be made while at the same time taking uncertainties, risks, and dependencies into account. We also pay particular interest in tactical asset allocation. The benefit of this approach is that we can utilize time series forecasting methods to produce trading signals in addition to optimization methods. We use two datasets in our empirical applications. The first one consists of nine sectoral indices covering the period from 2008 to 2017, and the other includes altogether 42 stocks listed on the Moscow Exchange covering the years 2011 – 2017. The strategies considered have been divided into five sections. In the first part, we study classical and robust mean-risk portfolios and the modeling of transaction costs. We find that the expected return should be maximized per unit expected shortfall while simultaneously requiring that each asset contributes equally to the portfolio’s tail risk. Secondly, we show that using robust covariance estimators can improve the risk-adjusted returns of minimum variance portfolios. Thirdly, we note that robust optimization techniques are best suited for conservative investors due to the low volatility allocations they produce. In the second part, we employ statistical factor models to estimate higher-order comoments and demonstrate the benefit of the proposed method in constructing risk-optimal and expected utility-maximizing portfolios. In the third part, we utilize the Almgren–Chriss framework and sort the expected returns according to the assumed momentum anomaly. We discover that this method produces stable allocations performing exceptionally well in the market upturn. In the fourth part, we show that forecasts produced by VECM and GARCH models can be used profitably in optimizations based on the Black–Litterman, copula opinion pooling, and entropy pooling models. In the final part, we develop a wealth protection strategy capable of timing market changes thanks to the return predictions based on an ARIMA model. Therefore, it can be stated that it has been possible to make safe and profitable investments in the Russian stock market even when reasonable transaction costs have been taken into account. We also argue that market inefficiencies could have been exploited by structuring statistical arbitrage and other tactical asset allocation-related strategies.Venäjän rahoitusmarkkinat ovat olleet kartoittamatonta aluetta tutkittaessa moderniin portfolioteoriaan pohjautuvien sijoitusstrategioiden käyttäytymistä. Tässä tutkielmassa keskitymme maan osakemarkkinoihin ja tarkastelemme, voidaanko taloudellisesti kannattavia sijoituksia tehdä otettaessa samalla huomioon epävarmuudet, riskit ja riippuvuudet. Kiinnitämme erityistä huomiota myös taktiseen varojen kohdentamiseen. Tämän lähestymistavan etuna on, että optimointimenetelmien lisäksi voimme hyödyntää aikasarjaennustamisen menetelmiä kaupankäyntisignaalien tuottamiseksi. Empiirisissä sovelluksissa käytämme kahta data-aineistoa. Ensimmäinen koostuu yhdeksästä teollisuusindeksistä kattaen ajanjakson 2008–2017, ja toinen sisältää 42 Moskovan pörssiin listattua osaketta kattaen vuodet 2011–2017. Tarkasteltavat strategiat on puolestaan jaoteltu viiteen osioon. Ensimmäisessä osassa tarkastelemme klassisia ja robusteja riski-tuotto -portfolioita sekä kaupankäyntikustannusten mallintamista. Havaitsemme, että odotettua tuottoa on syytä maksimoida suhteessa odotettuun vajeeseen edellyttäen samalla, että jokainen osake lisää sijoitussalkun häntäriskiä yhtä suurella osuudella. Toiseksi osoitamme, että minimivarianssiportfolioiden riskikorjattuja tuottoja voidaan parantaa robusteilla kovarianssiestimaattoreilla. Kolmanneksi toteamme robustien optimointitekniikoiden soveltuvan parhaiten konservatiivisille sijoittajille niiden tuottamien matalan volatiliteetin allokaatioiden ansiosta. Toisessa osassa hyödynnämme tilastollisia faktorimalleja korkeampien yhteismomenttien estimoinnissa ja havainnollistamme ehdotetun metodin hyödyllisyyttä riskioptimaalisten sekä odotettua hyötyä maksimoivien salkkujen rakentamisessa. Kolmannessa osassa käytämme Almgren–Chrissin viitekehystä ja asetamme odotetut tuotot suuruusjärjestykseen oletetun momentum-anomalian mukaisesti. Havaitsemme, että menetelmä tuottaa vakaita allokaatioita menestyen erityisen hyvin noususuhdanteessa. Neljännessä osassa osoitamme, että VECM- että GARCH-mallien tuottamia ennusteita voidaan hyödyntää kannattavasti niin Black–Littermanin malliin kuin kopulanäkemysten ja entropian poolaukseenkin perustuvissa optimoinneissa. Viimeisessä osassa laadimme varallisuuden suojausstrategian, joka kykenee ajoittamaan markkinoiden muutoksia ARIMA-malliin perustuvien tuottoennusteiden ansiosta. Voidaan siis todeta, että Venäjän osakemarkkinoilla on ollut mahdollista tehdä turvallisia ja tuottavia sijoituksia myös silloin kun kohtuulliset kaupankäyntikustannukset on huomioitu. Toiseksi väitämme, että markkinoiden tehottomuutta on voitu hyödyntää suunnittelemalla tilastolliseen arbitraasiin ja muihin taktiseen varojen allokointiin pohjautuvia strategioita

    A hybrid of bekk garch with neural network for modeling and forecasting time series

    Get PDF
    Gold prices change rapidly from time to time. The change is not only in the mean, but also in the variability of the series. The Malaysian Kijang Emas (MKE) is the official national bullion gold coin of Malaysia and it is high in demand. The purchase and resale prices of MKE are determined by the prevailing international gold market price. However, the value of Ringgit Malaysia (RM) that is used to purchase MKE is affected by United States (U.S.) dollar. Thus, the purpose of this study is to develop the best model for forecasting international gold prices, U.S. dollar index and MKE prices by investigating their co-movement. In an attempt to find the best model, fifteen years of data for MKE prices, international gold prices in U.S. dollar and U.S. dollar index were used. This study initially considered three standard methods namely bivariate generalized autoregressive conditional heteroskedasticity (GARCH), trivariate GARCH and multilayer feed-forward neural network (MFFNN). Bivariate and trivariate GARCH are from Baba-Engle-Kraft-Kroner (BEKK) GARCH. The current study further hybridized these methods to improve forecasting accuracy. Bivariate and trivariate GARCH were used to examine the relationship between gold prices and U.S. dollar. The trivariate GARCH was modified to develop GARCH-in-mean model due to the existence risk that was expected in the data. Analysis was done by using E-Views software. However, analysis using MFFNN model and hybridized models were carried out using MATLAB software. Analyses of performances were evaluated using mean absolute percentage error (MAPE) and mean square error (MSE). The MAPE for all in and out sample forecasts were less than 1%. The lowest values of MAPE were 0.8% for gold prices and 0.2% for U.S. dollar index. These low values were produced by using trivariate GARCH-in-mean model that was developed by the current study either as a single or hybdridized model with MFFNN. MSE recorded the values when trivariate GARCH-in-mean model was hybridized with MFFNN using 15 hidden nodes

    Hyperspectral imagery combined with machine learning to differentiate genetically modified (GM) and non-GM canola

    Get PDF
    Canola, also known as rapeseed (Brassica napus L.), is an oilseed that produces a healthy food-grade oil, canola meal by-product, and biofuel. It is the fourth most grown grain in Australia. Genetically modified (GM) canola currently represents approximately twenty percent of national canola production; hence, with clashing public and industry perceptions of genetically modified organisms (GMOs), transparency and traceability must be enabled throughout the supply chain to protect markets and relationships with consumers. GM canola must not cross-contaminate non-GM canola as our largest export market, Europe, has extremely strict protocols on GMOs. GM and non-GM canola cannot be differentiated by the human eye, with polymerase chain reaction (PCR) methods currently the main alternative, which is expensive and time-consuming. This thesis evaluates the potential to differentiate GM from non-GM canola using the novel, rapid, and non-destructive technique of hyperspectral imaging combined with machine learning. Hyperspectral imagery captures and processes wavelengths beyond simply red, green, and blue. It has a pre-existing multitude of uses including the characterisation and variety identification of other grains. In this study 500 images each of non-GM and GM canola seeds were captured. Seeds were placed on a black background with two lights sources. Images were captured from the 400nm to 1000nm wavelengths, a total of 80 bands, at a 25-millisecond exposure time. These images were run through a convolutional neural network in Keras for analysis. The high dynamic range and raw files were combined into a NumPy file for the hyperspectral image generator. Contrary to expectations, however, the models using the bitmap image files performed similarly to the models receiving the hyperspectral images. Regardless, both produced high validation accuracies around 90%, indicating a detectable phenotypical difference between the two, and further studies could lead to the development of a new approach to GM canola detection

    Analysing Financial Distress in Malaysian Islamic Banks: Exploring Integrative Predictive Methods

    Get PDF
    Against the background of global financial crisis, some argue in favour of the ‘resilience’ of Islamic finance, while others suggest that Islamic financial institutions are not more prone to distress and crisis than their conventional counterparts. However, there have been a number of cases of Islamic finance and banking distress in recent years, including instances in Malaysia. These cases, hence, motivated this study in terms of emphasising the importance of employing financial distress prediction models for analysing Islamic banks. This study aims at empirically exploring, examining and analysing the financial distress of the Malaysian Islamic banks. In doing so, the effectiveness of the existing early warning statistical insolvency prediction models that have been used in previous studies, and a particular model adapted by Islamic banks in Malaysia were critically evaluated. This study, hence, employed a number of models to predict the financial distress faced by Islamic banks in Malaysia. In addition, an attempt was made at the modification of the existing early warning insolvency prediction models in evaluating and analysing the financial distress of Malaysian Islamic banks. This research is constructed within four empirical chapters by employing three prediction models in assessing the financial distress of Islamic banks. The first empirical chapter analyses the secondary data collected from a sample of Islamic banks, based on selected ratios developed in the literature, whereby a comprehensive description of these selected financial ratios in terms of descriptive statistical analysis for the selected Islamic banks in Malaysia is provided. The second empirical chapter investigates the performance of the ‘emerging market Z-score’, introduced by Altman in predicting the performance of Islamic banks and conventional banks in Malaysia. The study aimed to introduce the EM Z-score as a valuable analytical tool in monitoring the deterioration of the performance of banks as well as looking at the impact of the global financial crisis on the performance of Islamic and conventional banks. This chapter examines thirteen Islamic banks and ten conventional banks during the period of 2005-2010. The results show that the EM Z-score for all banks is well above the cut-off point of 2.6, although for Islamic banks the EM Z-score showed a declining trend whilst for conventional banks it showed an increasing trend. This empirical evidence is important for the banks since it provides a warning signal to the banks’ management as well as the related parties involved in the planning, controlling and decision making process. The third empirical chapter presents the newly constructed integrated predictive model designed to evaluate and analyse the financial distress of Islamic banks in Malaysia, which can be used as an alternative model for regulators in monitoring the performance of Islamic banks that are experiencing any serious financial problems. This paper develops a preliminary model for the prediction of the performance level of Islamic financial institutions for the period of December 2005 to September 2010 by using quarterly data for ten selected Islamic banks in Malaysia. For this, factor analysis and three parametric models (discriminant analysis, logit analysis and probit analysis) are used. The results depict that the first few quarters before the benchmark quarter are the most important period for making a correct prediction and crucial decisions on the survival of Islamic banks. Thus, the results demonstrate the predictive ability of the integrated model to differentiate between the healthy and non-healthy Islamic banks, therefore reducing the expected cost of bank failure. The fourth empirical chapter conducts further exploration in predicting the financial distress position of Islamic banks by introducing new variables such as the funding structure, deposit composition, and macroeconomic variables. Using the same sample and data set for Islamic banks as in the previous chapter, this study shows the relationship between the banks’ funding profiles and other alternative variables, and the Islamic banks’ performance in Malaysia. For this, the logit model is used. Based on the results of all models, this study recommended two final models, which showed an excellent fit for predicting the Islamic banks’ performance. The results indicate that none of the macroeconomic variables included were significant, thus suggesting that the performance of Islamic banks in Malaysia was not affected by the economic conditions throughout the study period. This can perhaps be attributed to efficient regulation and supervision by the relevant authorities in the country

    Facility Location Decision for Global Entrepreneurial Small-to-Medium Enterprises Using Similarity Coefficient-based Clustering Algorithms

    Get PDF
    Decisions on location selection are critical for the survival of small-to-medium entrepreneurial organizations from the time they are established until later stages of operation and expansion. The selection of location for small and medium entrepreneurial businesses requires a selection strategy that incorporates relevant factors, quantifies these factors and develops a methodology that analyzes data for better decision-making. In the era of globalization where borders have become easier to transcend, many small ventures tend to choose more attractive international markets as a potential location for their operations where they can obtain higher returns on their investment. Thus, significant changes in the location decision process of the small and medium entrepreneurial companies have received great attention in the literature about small firms with global orientation as a response to the international entrepreneurship phenomenon. Therefore, consideration should be given to factors and attributes that reinforce the appeal of the international market to new businesses. These factors and attributes will provide the decision maker with an effective methodology for data analysis that will provide a framework for decision-making in the selection of locations for the entrepreneurial organization. In this research, the most frequent and critical attributes to select the best location for the entrepreneurial firms (globally) are extracted from relevant literature. Then, a similarity-based cluster analysis approach is introduced to quantify these attributes based on the existing data of economic metrics, such as technological advancement, expenditures on education, expenditures on research and development, the quality of the labor force, unemployment rates, domestic competitiveness, etc. Subsequently, the resulting outcomes are used to identify groups of prospective sites that fit the needs of the entrepreneurial firm. Last, the validity of the adopted methodology will be tested via numerical examples
    corecore