584 research outputs found

    Detection of Stock Price Manipulation Using Kernel Based Principal Component Analysis and Multivariate Density Estimation

    Get PDF
    Stock price manipulation uses illegitimate means to artificially influence market prices of several stocks. It causes massive losses and undermines investors’ confidence and the integrity of the stock market. Several existing research works focused on detecting a specific manipulation scheme using supervised learning but lacks the adaptive capability to capture different manipulative strategies. This begets the assumption of model parameter values specific to the underlying manipulation scheme. In addition, supervised learning requires the use of labelled data which is difficult to acquire due to confidentiality and the proprietary nature of trading data. The proposed research establishes a detection model based on unsupervised learning using Kernel Principal Component Analysis (KPCA) and applied increased variance of selected latent features in higher dimensions. A proposed Multidimensional Kernel Density Estimation (MKDE) clustering is then applied upon the selected components to identify abnormal patterns of manipulation in data. This research has an advantage over the existing methods in overcoming the ambiguity of assuming values of several parameters, reducing the high dimensions obtained from conventional KPCA and thereby reducing computational complexity. The robustness of the detection model has also been evaluated when two or more manipulative activities occur within a short duration of each other and by varying the window length of the dataset fed to the model. The results show a comprehensive assessment of the model on multiple datasets and a significant performance enhancement in terms of the F-measure values with a significant reduction in false alarm rate (FAR) has been achieved

    WALDATA : Wavelet transform based adversarial learning for the detection of anomalous trading activities

    Get PDF
    Detecting manipulative activities in stock market trading poses a significant challenge due to the complex temporal correlations inherent to the dynamically changing stock price data. This challenge is further exacerbated by the limited availability of labelled anomalous trading data instances. Stock price manipulations, which consist of infrequent anomalies in stock price trading data, are challenging to capture due to their sporadic occurrence and dynamically evolving nature. This scarcity and inherent complexity significantly complicate the creation of labelled datasets hence hinders the development of robust detection of different stock price manipulation schemes through supervised learning methods. Overcoming these challenges is crucial for enhancing our understanding of market dynamics and implementing robust market surveillance systems. To address these challenges, we introduce a novel stock price manipulation detection approach called WALDATA (Wavelet Transform based Adversarial Learning for the Detection of Anomalous Trading Activities). We leverage the Wavelet Transform (WT) to decompose non-stationary stock price time series into informative features and capture multi-scale dynamics within the data. We encode stock price data by transforming it into scalogram images through the Continuous Wavelet Transform, effectively converting stock price time series data into a 2D image representation. Subsequently, we employ a Generative Adversarial Network (GAN) architecture, originally applied to computer vision, to learn the underlying distribution of normal trading behaviour from the encoded images. We then train the discriminator as an anomaly detector for identifying manipulative trading activities in the stock market. The efficacy of WALDATA is rigorously evaluated on diverse real-world stock datasets using 1-level tick data from the LOBSTER project and the experimental results demonstrate the significant performance of our approach achieving an average AUC of 0.99 while maintaining low false alarm rates across various market conditions. These findings not only validate the effectiveness of the proposed WALDATA approach in accurately identifying stock price manipulations but also provide investors and regulators alike with valuable insights for the development of advanced market surveillance systems. This research demonstrates the promising potential of combining wavelet-based feature extraction and stock price time series to image representation with generative adversarial learning frameworks for anomaly detection in financial time series data. The successful implementation of WALDATA contributes to the development of advanced market surveillance systems and paves the way for further advancements in market surveillance, contributing towards a more efficient and robust financial system and a fair market environment

    Limit order books in statistical arbitrage and anomaly detection

    Full text link
    Cette thĂšse propose des mĂ©thodes exploitant la vaste information contenue dans les carnets d’ordres (LOBs). La premiĂšre partie de cette thĂšse dĂ©couvre des inefficacitĂ©s dans les LOBs qui sont source d’arbitrage statistique pour les traders haute frĂ©quence. Le chapitre 1 dĂ©veloppe de nouvelles relations thĂ©oriques entre les actions intercotĂ©es afin que leurs prix soient exempts d’arbitrage. Toute dĂ©viation de prix est capturĂ©e par une stratĂ©gie novatrice qui est ensuite Ă©valuĂ©e dans un nouvel environnement de backtesting permettant l’étude de la latence et de son importance pour les traders haute frĂ©quence. Le chapitre 2 dĂ©montre empiriquement l’existence d’arbitrage lead-lag Ă  haute frĂ©quence. Les relations dites lead-lag ont Ă©tĂ© bien documentĂ©es par le passĂ©, mais aucune Ă©tude n’a montrĂ© leur vĂ©ritable potentiel Ă©conomique. Un modĂšle Ă©conomĂ©trique original est proposĂ© pour prĂ©dire les rendements de l’actif en retard, ce qu’il rĂ©alise de maniĂšre prĂ©cise hors Ă©chantillon, conduisant Ă  des opportunitĂ©s d’arbitrage de courte durĂ©e. Dans ces deux chapitres, les inefficacitĂ©s des LOBs dĂ©couvertes sont dĂ©montrĂ©es comme Ă©tant rentables, fournissant ainsi une meilleure comprĂ©hension des activitĂ©s des traders haute frĂ©quence. La deuxiĂšme partie de cette thĂšse investigue les sĂ©quences anormales dans les LOBs. Le chapitre 3 Ă©value la performance de mĂ©thodes d’apprentissage automatique dans la dĂ©tection d’ordres frauduleux. En raison de la grande quantitĂ© de donnĂ©es, les fraudes sont difficilement dĂ©tectables et peu de cas sont disponibles pour ajuster les modĂšles de dĂ©tection. Un nouveau cadre d’apprentissage profond non supervisĂ© est proposĂ© afin de discerner les comportements anormaux du LOB dans ce contexte ardu. Celui-ci est indĂ©pendant de l’actif et peut Ă©voluer avec les marchĂ©s, offrant alors de meilleures capacitĂ©s de dĂ©tection pour les rĂ©gulateurs financiers.This thesis proposes methods exploiting the vast informational content of limit order books (LOBs). The first part of this thesis discovers LOB inefficiencies that are sources of statistical arbitrage for high-frequency traders. Chapter 1 develops new theoretical relationships between cross-listed stocks, so their prices are arbitrage free. Price deviations are captured by a novel strategy that is then evaluated in a new backtesting environment enabling the study of latency and its importance for high-frequency traders. Chapter 2 empirically demonstrates the existence of lead-lag arbitrage at high-frequency. Lead-lag relationships have been well documented in the past, but no study has shown their true economic potential. An original econometric model is proposed to forecast returns on the lagging asset, and does so accurately out-of-sample, resulting in short-lived arbitrage opportunities. In both chapters, the discovered LOB inefficiencies are shown to be profitable, thus providing a better understanding of high-frequency traders’ activities. The second part of this thesis investigates anomalous patterns in LOBs. Chapter 3 studies the performance of machine learning methods in the detection of fraudulent orders. Because of the large amount of LOB data generated daily, trade frauds are challenging to catch, and very few cases are available to fit detection models. A novel unsupervised deep learning–based framework is proposed to discern abnormal LOB behavior in this difficult context. It is asset independent and can evolve alongside markets, providing better fraud detection capabilities to market regulators

    Modular magnetite-filled nanomicelles for multimodal imaging-guided development of effective anticancer vaccines

    Get PDF
    188 p.Tanto las cĂ©lulas cancerosas como muchos de los patĂłgenos que causan enfermedades crĂłnicas han desarrollado mecanismos para evadir al sistema inmune. Por lo tanto, es necesario desarrollar nuevas tecnologĂ­as que nos permitan conseguir vacunas novedosas, seguras y efectivas contra este tipo de enfermedades.En esta tesis se propone un sistema de trasporte capaz de dirigir antĂ­genos tumorales y/o adyuvantes hasta los Ăłrganos secundarios linfoides (bazo y nĂłdulos linfĂĄticos) con el propĂłsito de generar una respuesta inmune potente y especĂ­fica contra el cĂĄncer. Este tipo de sistema de transporte se basa en nanopartĂ­culas de Ăłxido de hierro recubiertas con un polĂ­mero bio-compatible, lo cual permite conferir propiedades Ășnicas a estas vacunas `sub-unitariasÂż.Gracias al tamaño nanomĂ©trico, estas vacunas son capaces de transportar ambos tipos de molĂ©culas inmunogĂ©nicas a las mismas poblaciones de cĂ©lulas presentes en los nĂłdulos linfĂĄticos y, del mismo modo, permiten la interacciĂłn con receptores/compartimentos celulares especĂ­ficos. Este transporte dirigido implica una mejora considerable de la eficacia de la inmunizaciĂłn, incluso a dosis bajas, permitiendo una respuesta inmune humoral y celular especĂ­fica contra el tumor.Mediante el marcaje de estas nano-vacunas con un fluorĂłforo (rodamina B) y el radioisĂłtopo 67Ga, es posible estudiar su interacciĂłn con las cĂ©lulas, su comportamiento en medios fisiolĂłgicos y su biodistribuciĂłn in vivo, mediante tĂ©cnicas de imagen no-invasivas.Estas nano-vacunas han demostrado ser materiales muy prometedores para el desarrollo de nuevas vacunas contra el cĂĄncer.CIC BiomaGUN

    Role of herpes simplex virus 1 protein ICP47 in antigen presentation and pathogenesis

    Get PDF
    The herpes simplex virus (HSV) immunomodulatory protein, ICP47, conceals infected cells from CD8+ T cells by inhibiting the presentation of peptides on MHC class I. The mechanism by which ICP47 exerts this function is by binding to the transporter associated with antigen processing (TAP) protein, blocking peptide transport and loading onto MHC I molecules in the ER. The earliest studies of ICP47 supported by biochemical and in vitro observations noted marked species specificity with human but not mouse TAP being inhibited by this protein. However, later work demonstrated that ICP47 can contribute to HSV neurovirulence in mice. The discordance between biochemical and in vivo data leaves our understanding of ICP47 and its role in evading CD8+ T cells incomplete. Data from our laboratory suggested that ICP47 is likely to be expressed during the establishment and maintenance of HSV-1 latency, however, its exact function during these stages of infection is unknown. Therefore, in this study, we sought to re-visit the discrepancies discussed above and investigate the role of ICP47 during HSV-1 infection. We utilised different strains of HSV and mice, as well as an alternate infection model and unique methods to quantify the effect of ICP47 on levels of antigen presentation. In our mouse model, where HSV is confined to the peripheral nervous system, deletion of ICP47 from HSV-1 KOS did not alter lesion development, virus load, spread or reactivation. Likewise, latency was unaffected by ICP47 deficiency as determined using a sensitive Cre-marking mouse model. Further observations from the Cre-marking mouse model revealed that unlike the ICP47 promoter inserted in an ectopic locus, native promoters did not induce additional neuronal marking by Cre beyond lytic infection. We evaluated the reasons behind the difference in marking using newly generated recombinant viruses. Subsequent flank infection of ROSA26R mice with these viruses showed that the local genomic context is also important for regulation of gene expression. By contrast to our in vivo pathogenesis data, we were able to show that ICP47 does inhibit antigen presentation significantly on HSV-infected mouse cells using in vitro antigen presentation assays. However, in mouse cells, antigen presentation was ablated by 44%, compared to an 85% reduction in human cells. As CD8+ T cells have been shown to recognize very few peptide-MHC I complexes on the surface of target cells, it is important to consider the efficiency at which ICP47 inhibits human and mouse TAP. Therefore, we used mass spectrometry to identify and quantify MHC I bound peptides derived from HSV-1 during viral infection. We found that more peptide sequences were presented on mouse cells infected with ICP47 null virus compared to those infected with wild-type virus. We quantified the presentation of 14 of these peptides and the contribution of ICP47 to this process in human and mouse cells. We found that ICP47 almost entirely blocks human TAP-mediated peptide presentation, though the degree of inhibition was somewhat peptide-specific. Conversely, the effect of ICP47 on mouse TAP was far less profound, resulting in only up to five-fold reduction in MHC-peptide abundance. In conclusion, this study shows that despite significant inhibition of antigen presentation in mouse cells, ICP47 may not be an effective immune modulator in mice and suggests a need for re-evaluation of suitable mouse models
    • 

    corecore