47 research outputs found

    Fact Check: Analyzing Financial Events from Multilingual News Sources

    Full text link
    The explosion in the sheer magnitude and complexity of financial news data in recent years makes it increasingly challenging for investment analysts to extract valuable insights and perform analysis. We propose FactCheck in finance, a web-based news aggregator with deep learning models, to provide analysts with a holistic view of important financial events from multilingual news sources and extract events using an unsupervised clustering method. A web interface is provided to examine the credibility of news articles using a transformer-based fact-checker. The performance of the fact checker is evaluated using a dataset related to merger and acquisition (M\&A) events and is shown to outperform several strong baselines.Comment: Dem

    Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations

    Full text link
    Precisely recommending candidate news articles to users has always been a core challenge for personalized news recommendation systems. Most recent works primarily focus on using advanced natural language processing techniques to extract semantic information from rich textual data, employing content-based methods derived from local historical news. However, this approach lacks a global perspective, failing to account for users' hidden motivations and behaviors beyond semantic information. To address this challenge, we propose a novel model called GLORY (Global-LOcal news Recommendation sYstem), which combines global representations learned from other users with local representations to enhance personalized recommendation systems. We accomplish this by constructing a Global-aware Historical News Encoder, which includes a global news graph and employs gated graph neural networks to enrich news representations, thereby fusing historical news representations by a historical news aggregator. Similarly, we extend this approach to a Global Candidate News Encoder, utilizing a global entity graph and a candidate news aggregator to enhance candidate news representation. Evaluation results on two public news datasets demonstrate that our method outperforms existing approaches. Furthermore, our model offers more diverse recommendations.Comment: 10 pages, Recsys 202

    NumHTML : numeric-oriented hierarchical transformer model for multi-task financial forecasting

    Get PDF
    Financial forecasting has been an important and active area of machine learning research because of the challenges it presents and the potential rewards that even minor improvements in prediction accuracy or forecasting may entail. Traditionally, financial forecasting has heavily relied on quantitative indicators and metrics derived from structured financial statements. Earnings conference call data, including text and audio, is an important source of unstructured data that has been used for various prediction tasks using deep earning and related approaches. However, current deep learning-based methods are limited in the way that they deal with numeric data; numbers are typically treated as plain-text tokens without taking advantage of their underlying numeric structure. This paper describes a numeric-oriented hierarchical transformer model (NumHTML) to predict stock returns, and financial risk using multi-modal aligned earnings calls data by taking advantage of the different categories of numbers (monetary, temporal, percentages etc.) and their magnitude. We present the results of a comprehensive evaluation of NumHTML against several state-of-the-art baselines using a real-world publicly available dataset. The results indicate that NumHTML significantly outperforms the current state-of-the-art across a variety of evaluation metrics and that it has the potential to offer significant financial gains in a practical trading context

    Lensless polarimetric coded ptychography (pol-CP) for high-resolution, high-throughput birefringence imaging on a chip

    Full text link
    Polarimetric imaging provides valuable insights into the polarization state of light interacting with a sample. It can infer crucial birefringence properties of bio-specimens without using any labels, thereby facilitating the diagnosis of diseases such as cancer and osteoarthritis. In this study, we introduce a novel polarimetric coded ptychography (pol-CP) approach that enables high-resolution, high-throughput birefringence imaging on a chip. Our platform deviates from traditional lens-based polarization systems by employing an integrated polarimetric coded sensor for lensless diffraction data acquisition. Utilizing Jones calculus, we quantitatively determine the birefringence retardance and orientation information of bio-specimens from four recovered intensity images. Our portable pol-CP prototype can resolve the 435-nm linewidth on the resolution target and the imaging field of view for a single acquisition is limited only by the detector size of 41 mm^2. The prototype allows for the acquisition of gigapixel birefringence images with a 180-mm^2 field of view in ~3.5 minutes, achieving an imaging throughput comparable to that of a conventional whole slide scanner. To demonstrate its biomedical applications, we perform high-throughput imaging of malaria-infected blood smears, locating parasites using birefringence contrast. We also generate birefringence maps of label-free thyroid smears to identify thyroid follicles. Notably, the recovered birefringence maps emphasize the same regions as autofluorescence images, indicating the potential for rapid on-site evaluation of label-free biopsies. The reported approach offers a portable, turnkey solution for high-resolution, high-throughput polarimetric analysis without using lenses, with potential applications in disease diagnosis, sample screening, and label-free chemical imaging

    Status of cardiovascular health among adults in a rural area of Northwest China: Results from a cross-sectional study.

    Get PDF
    The aim of this study was to assess the status of cardiovascular health among a rural population in Northwest China and to determine the associated factors for cardiovascular health.A population-based cross-sectional study was conducted in the rural areas of Hanzhong in Northwest China. Interview, physical examination, and fasting blood glucose and lipid measurements were completed for 2693 adults. The construct of cardiovascular health and the definitions of cardiovascular health metrics proposed by the American Heart Association were used to assess cardiovascular health. The proportions of subjects with cardiovascular health metrics were calculated, adjusting for age and sex. The multiple logistic regression model was used to evaluate the association between ideal cardiovascular health and its associated factors.Only 0.5% (0.0% in men vs 0.9% in women, P = 0.002) of the participants had ideal cardiovascular health, whereas 33.8% (18.0% in men vs 50.0% in women, P < 0.001) and 65.7% (82.0% in men vs 49.1% in women, P < 0.001) of the participants had intermediate and poor cardiovascular health, respectively. The prevalence of poor cardiovascular health increased with increasing age (P < 0.001 for trend). Participants fulfilled, on average, 4.4 (95% confidence interval: 4.2-4.7) of the ideal cardiovascular health metrics. Also, 22.2% of the participants presented with 3 or fewer ideal metrics. Only 19.4% of the participants presented with 6 or more ideal metrics. 24.1% of the participants had all 4 ideal health factors, but only 1.1% of the participants had all 4 ideal health behaviors. Women were more likely to have ideal cardiovascular health, whereas adults aged 35 years or over and those who had a family history of hypertension were less likely to have ideal cardiovascular health.The prevalence of ideal cardiovascular health was extremely low among the rural population in Northwest China. Most adults, especially men and the elderly, had a poor cardiovascular health status. To improve cardiovascular health among the rural population, efforts, especially lifestyle improvements, education and interventions to make healthier food choices, reduce salt intake, increase physical activities, and cease smoking, will be required at the individual, population, and social levels

    MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction

    No full text
    The 29th ACM International Conference on Information and Knowledge Management, Virtual Event, (CIKM '20), 19-23 October 2020In the area of natural language processing, various financial datasets have informed recent research and analysis including financial news, financial reports, social media, and audio data from earnings calls. We introduce a new, large-scale multi-modal, text-audio paired, earnings-call dataset named MAEC, based on S&P 1500 companies. We describe the main features of MAEC, how it was collected and assembled, paying particular attention to the text-audio alignment process used. We present the approach used in this work as providing a suitable framework for processing similar forms of data in the future. The resulting dataset is more than six times larger than those currently available to the research community and we discuss its potential in terms of current and future research challenges and opportunities. All resources of this work are available at https://github.com/Earnings-Call-Dataset/Science Foundation IrelandInsight Research Centre2020-10-06 JG: PDF replaced with correct versio

    Leveraging BERT to Improve the FEARS Index for Stock Forecasting

    Get PDF
    The First Workshop on Financial Technology and Natural Language Processing, Macao, China, 12 August 2019Financial and Economic Attitudes Revealed by Search (FEARS) index reflects the attention and sentiment of public investors and is an important factor for predicting stock price return. In this paper, we take into account the semantics of the FEARS search terms by leveraging the Bidirectional Encoder Representations from Transformers (BERT), and further apply a self-attention deep learning model to our refined FEARS seamlessly for stock return prediction. We demonstrate the practical benefits of our approach by comparing to baseline works.Science Foundation Irelan
    corecore