6,044 research outputs found

    All Men Count with You, but None Too Much: Information Aggregation and Learning in Prediction Markets.

    Full text link
    Prediction markets are markets that are set up to aggregate information from a population of traders in order to predict the outcome of an event. In this thesis, we consider the problem of designing prediction markets with discernible semantics of aggregation whose syntax is amenable to analysis. For this, we will use tools from computer science (in particular, machine learning), statistics and economics. First, we construct generalized log scoring rules for outcomes drawn from high-dimensional spaces. Next, based on this class of scoring rules, we design the class of exponential family prediction markets. We show that this market mechanism performs an aggregation of private beliefs of traders under various agent models. Finally, we present preliminary results extending this work to understand the dynamics of related markets using probabilistic graphical model techniques. We also consider the problem in reverse: using prediction markets to design machine learning algorithms. In particular, we use the idea of sequential aggregation from prediction markets to design machine learning algorithms that are suited to situations where data arrives sequentially. We focus on the design of algorithms for recommender systems that are robust against cloning attacks and that are guaranteed to perform well even when data is only partially available.PHDComputer Science and EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/111398/1/skutty_1.pd

    Econometrics meets sentiment : an overview of methodology and applications

    Get PDF
    The advent of massive amounts of textual, audio, and visual data has spurred the development of econometric methodology to transform qualitative sentiment data into quantitative sentiment variables, and to use those variables in an econometric analysis of the relationships between sentiment and other variables. We survey this emerging research field and refer to it as sentometrics, which is a portmanteau of sentiment and econometrics. We provide a synthesis of the relevant methodological approaches, illustrate with empirical results, and discuss useful software

    Forecasting changes in the South African volatility index. A comparison of methods

    Get PDF
    Increased financial regulation with tougher capital standards and additional capital buffers has made understanding volatility in financial markets more imperative. This study investigates various forecasting techniques in their ability to forecast the South African Volatility Index (SAVI). In particular, a time-delay neural network’s forecasting ability is compared to more traditional methods. A comparison of the residual errors of all the forecasting tools used suggests that the time-delay neural network and the historical average model have superior forecasting ability over traditional forecasting models. From a practical perspective, this suggests that the historical average model is the best forecasting tool used in this study, as it is less computationally expensive to implement compared to the neural network.  Furthermore, the results suggest that the SAVI is extremely difficult to forecast, with the volatility index being purely a gauge of investor sentiment in the market, rather than being seen as a potential investment opportunity.&nbsp

    A hybrid algorithm for Bayesian network structure learning with application to multi-label learning

    Get PDF
    We present a novel hybrid algorithm for Bayesian network structure learning, called H2PC. It first reconstructs the skeleton of a Bayesian network and then performs a Bayesian-scoring greedy hill-climbing search to orient the edges. The algorithm is based on divide-and-conquer constraint-based subroutines to learn the local structure around a target variable. We conduct two series of experimental comparisons of H2PC against Max-Min Hill-Climbing (MMHC), which is currently the most powerful state-of-the-art algorithm for Bayesian network structure learning. First, we use eight well-known Bayesian network benchmarks with various data sizes to assess the quality of the learned structure returned by the algorithms. Our extensive experiments show that H2PC outperforms MMHC in terms of goodness of fit to new data and quality of the network structure with respect to the true dependence structure of the data. Second, we investigate H2PC's ability to solve the multi-label learning problem. We provide theoretical results to characterize and identify graphically the so-called minimal label powersets that appear as irreducible factors in the joint distribution under the faithfulness condition. The multi-label learning problem is then decomposed into a series of multi-class classification problems, where each multi-class variable encodes a label powerset. H2PC is shown to compare favorably to MMHC in terms of global classification accuracy over ten multi-label data sets covering different application domains. Overall, our experiments support the conclusions that local structural learning with H2PC in the form of local neighborhood induction is a theoretically well-motivated and empirically effective learning framework that is well suited to multi-label learning. The source code (in R) of H2PC as well as all data sets used for the empirical tests are publicly available.Comment: arXiv admin note: text overlap with arXiv:1101.5184 by other author

    From Social Simulation to Integrative System Design

    Full text link
    As the recent financial crisis showed, today there is a strong need to gain "ecological perspective" of all relevant interactions in socio-economic-techno-environmental systems. For this, we suggested to set-up a network of Centers for integrative systems design, which shall be able to run all potentially relevant scenarios, identify causality chains, explore feedback and cascading effects for a number of model variants, and determine the reliability of their implications (given the validity of the underlying models). They will be able to detect possible negative side effect of policy decisions, before they occur. The Centers belonging to this network of Integrative Systems Design Centers would be focused on a particular field, but they would be part of an attempt to eventually cover all relevant areas of society and economy and integrate them within a "Living Earth Simulator". The results of all research activities of such Centers would be turned into informative input for political Decision Arenas. For example, Crisis Observatories (for financial instabilities, shortages of resources, environmental change, conflict, spreading of diseases, etc.) would be connected with such Decision Arenas for the purpose of visualization, in order to make complex interdependencies understandable to scientists, decision-makers, and the general public.Comment: 34 pages, Visioneer White Paper, see http://www.visioneer.ethz.c

    05011 Abstracts Collection -- Computing and Markets

    Get PDF
    From 03.01.05 to 07.01.05, the Dagstuhl Seminar 05011``Computing and Markets\u27\u27 was held in the International Conference and Research Center (IBFI), Schloss Dagstuhl. During the seminar, several participants presented their current research, and ongoing work and open problems were discussed. Abstracts of the presentations given during the seminar as well as abstracts of seminar results and ideas are put together in this paper. The first section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available

    Impact of Network Connectedness

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์‚ฐ์—…๊ณตํ•™๊ณผ, 2022. 8. ์ด์žฌ์šฑ.๊ธˆ์œต ์ž์‚ฐ์€ ์–ธ์ œ๋‚˜ ๋ฆฌ์Šคํฌ์— ๋…ธ์ถœ๋˜์–ด ์žˆ๋‹ค. ์ด ๋ฆฌ์Šคํฌ์˜ ํฌ๊ธฐ์™€, ๊ฐ ์ž์‚ฐ์ด ๋ฆฌ์Šคํฌ์— ๋Œ€ํ•ด ์–ผ๋งˆ๋‚˜ ๋ณด์ƒ๋ฐ›๋Š” ์ง€๋ฅผ ์ •ํ™•ํžˆ ์ธก์ •ํ•˜๋Š” ๊ฒƒ์€ ์ž์‚ฐ์˜ ํŠน์„ฑ์„ ์ดํ•ดํ•˜๋Š” ๋ฐ ์ค‘์š”ํ•œ ๋ฌธ์ œ์ด๋‹ค. ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜• (asset pricing model)์€ ์ž์‚ฐ์˜ ๋ฆฌ์Šคํฌ์™€ ๊ทธ ๋ณด์ƒ์„ ํ†ตํ•ด์„œ ๊ธˆ์œต ์ž์‚ฐ์˜ ์ˆ˜์ต๋ฅ ์„ ์„ค๋ช…ํ•˜๋ ค ํ•˜๋Š” ๋ชจํ˜•์ด๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์—ฌ๋Ÿฌ ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜•์˜ ํ˜•ํƒœ ์ค‘ ํŒฉํ„ฐ ๋ชจ๋ธ์— ์ง‘์ค‘ํ•˜์˜€๋‹ค. ํŒฉํ„ฐ ๋ชจ๋ธ์€ ์ดˆ๊ณผ ์ˆ˜์ต๋ฅ ์„ ํŒฉํ„ฐ์™€ ๋ฒ ํƒ€๋กœ ๋ถ„๋ฆฌํ•ด์„œ ์„ค๋ช…ํ•˜๋Š” ๋ชจ๋ธ์ด๋‹ค. ์ „ํ†ต์ ์ธ ํŒฉํ„ฐ ๋ชจ๋ธ๋“ค์€ ๊ฑฐ์‹œ ๊ธˆ์œต ๋ณ€์ˆ˜๋‚˜ ๊ธฐ์—… ๋ณ€์ˆ˜ ๋“ฑ์„ ํ†ตํ•˜์—ฌ ํŒฉํ„ฐ์™€ ๋ฒ ํƒ€๋ฅผ ์ถ”์ •ํ•˜๋Š”๋ฐ, ์ด ๋•Œ ์ž์‚ฐ ๊ฐ„์˜ ์—ฐ๊ฒฐ๊ด€๊ณ„๋ฅผ ๊ณ ๋ คํ•˜๋Š” ์—ฐ๊ตฌ๋Š” ๋งŽ์ด ์ง„ํ–‰๋˜์ง€ ์•Š์•˜๋‹ค. ๊ธˆ์œต ์ž์‚ฐ๋“ค์€ ์„œ๋กœ ์˜ํ–ฅ์„ ์ฃผ๋Š” ๊ด€๊ณ„์— ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ๊ฐ๊ฐ์˜ ์ˆ˜์ต๋ฅ  ๋˜ํ•œ ๊ฐœ๋ณ„์ ์ด ์•„๋‹ˆ๋ผ ์ž์‚ฐ ๊ฐ„์˜ ๊ทธ๋ž˜ํ”„ ๊ตฌ์กฐ๋ฅผ ๊ณ ๋ คํ•˜๋ฉฐ ๋™์‹œ์— ํ‰๊ฐ€๋˜์–ด์•ผ ํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ํŒฉํ„ฐ ๋ชจ๋ธ์— ์ž์‚ฐ ๊ฐ„์˜ ์—ฐ๊ฒฐ ๊ตฌ์กฐ๋ฅผ ๋ฐ˜์˜ํ•˜๊ธฐ ์œ„ํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐ๋ฐ˜ ์‹ค์ฆ์  ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜•์„ ์ œ์•ˆํ•œ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ๋จผ์ € ๊ทธ๋ž˜ํ”„ ์ธ๊ณต์‹ ๊ฒฝ๋ง (GNN)์„ ๋ฐ”ํƒ•์œผ๋กœ ํ•œ ๋ฉ€ํ‹ฐ ํŒฉํ„ฐ ๋ชจ๋ธ์„ ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. ์ด ๋•Œ ๋ชจ๋ธ์˜ ๊ตฌ์กฐ๋ฅผ ๊ฒฐ์ •ํ•˜๋Š” ๊ฒƒ ๋งŒํผ์ด๋‚˜ ์ค‘์š”ํ•œ ๊ฒƒ์€ ์ž์‚ฐ ๊ฐ„ ๊ทธ๋ž˜ํ”„ ๊ตฌ์กฐ๋ฅผ ์–ด๋–ป๊ฒŒ ์ •์˜ํ•  ๊ฒƒ์ธ๊ฐ€๋ผ๋Š” ๋ฌธ์ œ์ด๋‹ค. GNN์€ ๊ทธ ์ž…๋ ฅ ๋ณ€์ˆ˜๋กœ์„œ ์ž˜ ์ •์˜๋œ ๊ทธ๋ž˜ํ”„ ๊ตฌ์กฐ๋ฅผ ์š”๊ตฌํ•˜์ง€๋งŒ ์ž์‚ฐ ๊ฐ„์˜ ์—ฐ๊ฒฐ ๊ตฌ์กฐ๋Š” ๋ช…ํ™•ํ•˜๊ฒŒ ์ •์˜๋˜์ง€ ์•Š์•˜๊ธฐ ๋•Œ๋ฌธ์—, ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ž์‚ฐ ๊ฐ„์˜ ์—ฐ๊ฒฐ์„ฑ์„ ํ”ผ์–ด์Šจ ์ƒ๊ด€๊ณ„์ˆ˜๋ฅผ ์ด์šฉํ•˜์—ฌ ์ถ”์ •ํ•˜๊ณ  ์ด๋ฅผ ํŠน์ • ์ž„๊ณ„๊ฐ’์„ ํ†ตํ•ด 0๊ณผ 1๋กœ ์ด์ง„ํ™” ์‹œํ‚ค๋Š” ๋ฐฉ์‹์„ ์‚ฌ์šฉํ–ˆ๋‹ค. ์ œ์•ˆํ•œ ๋ชจ๋ธ์˜ ๊ตฌ์กฐ๋Š” ๋ฒ ํƒ€๋ฅผ ์ถ”์ •ํ•˜๋Š” ๋ถ€๋ถ„๊ณผ ํŒฉํ„ฐ๋ฅผ ์ถ”์ •ํ•˜๋Š” ๋ถ€๋ถ„์œผ๋กœ ๋‚˜๋‰˜์–ด์ง€๋Š”๋ฐ, ๊ฐ๊ฐ ๊ธฐ์—… ๋ณ€์ˆ˜์™€, ์ˆ˜์ต๋ฅ ์„ ์ด์šฉํ•ด์„œ ์ถ”์ •ํ•œ๋‹ค. 1957๋…„๋ถ€ํ„ฐ ๋ฏธ๊ตญ์— ์ƒ์žฅ๋œ ์ฃผ์‹๋“ค์„ ๋Œ€์ƒ์œผ๋กœ ํ•œ ์‹ค์ฆ ์‹คํ—˜ ๊ฒฐ๊ณผ, ์ œ์•ˆํ•œ ๋ชจ๋ธ์€ ์„ค๋ช…๋ ฅ๊ณผ ์˜ˆ์ธก ์„ฑ๋Šฅ ์ธก๋ฉด์—์„œ ๋ฒค์น˜๋งˆํฌ ๋ชจ๋ธ๋“ค๋ณด๋‹ค ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์˜€๋‹ค. ๋˜ํ•œ ํ†ต๊ณ„์  ์„ฑ๋Šฅ ์ด์™ธ์—๋„ ํŒฉํ„ฐ์˜ ๊ฒฝ์ œ์  ์˜๋ฏธ๋ฅผ ์ธก์ •ํ•˜๋Š” ๋ฉด์—์„œ, ์ œ์•ˆํ•œ ๋ชจ๋ธ๋กœ๋ถ€ํ„ฐ ์ถ”์ •ํ•œ ํŒฉํ„ฐ๊ฐ€ ๊ฐ€์žฅ ํšจ์œจ์ ์ธ ํ™•๋ฅ ์  ํ• ์ธ์š”์†Œ (stochastic discount factor)๋ฅผ ์ถ”์ •ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์  ์—ญ์‹œ ํ™•์ธํ•˜์˜€๋‹ค. ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜•์˜ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ๋ชฉ์ ์€ ์ˆ˜์ต๋ฅ ์ด์ง€๋งŒ, ๋ณ€๋™์„ฑ ๋˜ํ•œ ๊ธˆ์œต ์ž์‚ฐ์˜ ์›€์ง์ž„์„ ์„ค๋ช…ํ•˜๋Š” ๋ฐ ์ค‘์š”ํ•œ ์„ฑ์งˆ์ด๋‹ค. ๋งŽ์€ ์‚ฌ์ „ ์—ฐ๊ตฌ์—์„œ ๋ฐํ˜€์กŒ๋“ฏ ์ˆ˜์ต๋ฅ ๊ณผ ๋ณ€๋™์„ฑ ์‚ฌ์ด์—๋Š” ์ƒ๊ด€๊ด€๊ณ„๊ฐ€ ์กด์žฌํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๋ณ€๋™์„ฑ์€ ์ˆ˜์ต๋ฅ ์„ ์„ค๋ช…ํ•˜๋Š” ์š”์ธ์ด ๋  ์ˆ˜ ์žˆ๋‹ค. ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜•์—์„œ์™€ ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ ์ž์‚ฐ๋“ค ๊ฐ„์˜ ์—ฐ๊ฒฐ ๊ตฌ์กฐ๋ฅผ ๊ณ ๋ คํ•˜๋Š” ๊ฒƒ์€ ๋ณ€๋™์„ฑ ์˜ˆ์ธก์—์„œ๋„ ์„ฑ๋Šฅ ํ–ฅ์ƒ์— ํฐ ์˜ํ–ฅ์„ ๋ฏธ์น  ์ˆ˜ ์žˆ๋‹ค. ๋ณ€๋™์„ฑ ๋ถ„์„์—์„œ๋Š” ์—ฌ๋Ÿฌ ์ž์‚ฐ์˜ ๋ณ€๋™์„ฑ์ด ์„œ๋กœ ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š” ๊ฒƒ์„ ์Šคํ•„์˜ค๋ฒ„ (spillover)๋ผ ๋ถ€๋ฅธ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์Šคํ•„์˜ค๋ฒ„ ํšจ๊ณผ๋ฅผ ์ง์ ‘์ ์œผ๋กœ ๋ฐ˜์˜ํ•˜๋Š” ๋ณ€๋™์„ฑ ์˜ˆ์ธก ๋ชจ๋ธ์„ ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. ์ œ์•ˆํ•œ ๋ชจ๋ธ์€ ๋ณ€๋™์„ฑ์˜ ์ธก๋ฉด์—์„œ ์ž์‚ฐ ๊ฐ„ ์—ฐ๊ฒฐ ๊ตฌ์กฐ๋ฅผ ๋ณ€๋™์„ฑ ์Šคํ•„์˜ค๋ฒ„ ์ง€์ˆ˜๋กœ ๊ตฌ์„ฑํ•œ ์ธ์ ‘ํ–‰๋ ฌ๋กœ ์ •์˜ํ•˜๋ฉฐ, ๋ชจ๋ธ์˜ ๊ตฌ์กฐ๋กœ๋Š” ์‹œ๊ณต๊ฐ„์  ๊ทธ๋ž˜ํ”„ ์ธ๊ณต์‹ ๊ฒฝ๋ง (spatial-temporal GNN)๋ฅผ ์‚ฌ์šฉํ•˜์˜€๋‹ค. ๊ธ€๋กœ๋ฒŒ ์‹œ์žฅ ์ง€์ˆ˜๋“ค์— ๋Œ€ํ•œ ์‹ค์ฆ ์‹คํ—˜์„ ํ†ตํ•ด์„œ ์ œ์•ˆํ•œ ๋ชจ๋ธ์€ ๋‹จ๊ธฐ์™€ ์ค‘๊ธฐ ๋ณ€๋™์„ฑ ์˜ˆ์ธก์—์„œ ๋ฒค์น˜๋งˆํฌ ๋ชจ๋ธ์— ๋น„ํ•ด ๊ฐ€์žฅ ์ข‹์€ ์˜ˆ์ธก ์„ฑ๋Šฅ์„ ๋ณด์ด๊ณ , ๋‹ค๋ฅธ ์‹œ์žฅ์— ํฐ ์˜ํ–ฅ์„ ์ฃผ๋Š” ์‹œ์žฅ์„ ์ด์šฉํ•˜์—ฌ ๋‹ค๋ฅธ ์‹œ์žฅ์— ๋Œ€ํ•œ ์˜ˆ์ธก ์„ฑ๋Šฅ์„ ํฌ๊ฒŒ ๋†’์ผ ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€๋‹ค. ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜•์— ๋ณ€๋™์„ฑ์„ ์ง์ ‘์ ์œผ๋กœ ๋ฐ˜์˜ํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋ชจํ˜• ๋‚ด์—์„œ ๋ณ€๋™์„ฑ์ด ์–ด๋–ป๊ฒŒ ์ •์˜๋˜๋Š”๊ฐ€๋ฅผ ๋จผ์ € ์‚ดํŽด๋ณด์•„์•ผ ํ•œ๋‹ค. ๋ณ€๋™์„ฑ์€ ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜• ๋‚ด์—์„œ ์ž”์ฐจ์˜ ํ‘œ์ค€ํŽธ์ฐจ๋กœ ํ•ด์„ํ•  ์ˆ˜ ์žˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์‹œ๊ณ„์—ด ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•๋ก ์„ ์‚ฌ์šฉํ•˜์—ฌ ์ถ”์ •ํ•˜๋Š” ๊ธฐ์กด์˜ ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜•์€ ์‹œ๊ฐ„์— ๋”ฐ๋ผ ๋ถˆ๋ณ€ํ•˜๋Š” ๋ณ€๋™์„ฑ์„ ๊ฐ€์ •ํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์‹œ๊ฐ„์— ๋”ฐ๋ผ ๋ณ€ํ•˜๋Š” ๋ณ€๋™์„ฑ์„ ์˜ˆ์ธก ๋ชจ๋ธ์„ ์ด์šฉํ•˜์—ฌ ์ถ”์ •ํ•˜๊ณ , ์ด๋ฅผ ํŒฉํ„ฐ ๋ชจ๋ธ์˜ ์†์‹คํ•จ์ˆ˜์— ์ •๊ทœํ™”๋กœ ์‚ฌ์šฉํ•จ์œผ๋กœ์จ ์‹œ๊ฐ„์— ๋”ฐ๋ผ ๋ณ€ํ™”ํ•˜๋Š” ๋ณ€๋™์„ฑ์˜ ํŠน์„ฑ์„ ๋ฐ˜์˜ํ•˜๋Š” ํŒฉํ„ฐ ๋ชจ๋ธ์„ ์ œ์•ˆํ•˜์˜€๋‹ค. ๋ฏธ๊ตญ ์ƒ์žฅ ์ฃผ์‹์— ๋Œ€ํ•œ ์‹ค์ฆ ์‹คํ—˜ ๊ฒฐ๊ณผ ์ œ์•ˆํ•œ ๋ชจ๋ธ์€ ์‹œ๊ฐ„ ๋ถˆ๋ณ€ ๋ณ€๋™์„ฑ ์กฐ๊ฑด์„ ์™„ํ™”ํ•˜์ง€ ์•Š์€ ๋ชจ๋ธ์— ๋น„ํ•ด ๋ณ€๋™์„œ์ด ๋‚ฎ์€ ์‹œ๊ธฐ์—์„œ ํ†ต๊ณ„์  ์„ฑ๋Šฅ์ด ํฐ ํญ์œผ๋กœ ์ƒ์Šนํ•จ์„ ํ™•์ธํ•˜์˜€๋‹ค. ํ˜„์žฌ ๋ฌด์‹œํ•  ์ˆ˜ ์—†๋Š” ๊ทœ๋ชจ๋กœ ์„ฑ์žฅํ•œ ๊ฐ€์ƒํ™”ํ ์‹œ์žฅ์—๋Š” ๊ตฌ์กฐ์ ์œผ๋กœ ํ™•์‹คํ•˜๊ฒŒ ์—ฐ๊ฒฐ๋œ ์ž์‚ฐ์ด ์กด์žฌํ•œ๋‹ค. ๊ฐ™์€ ๋ธ”๋ก์ฒด์ธ ์ƒ์— ์กด์žฌํ•˜๋Š” ํ† ํฐ๋“ค์€ ํ•ด๋‹น ๋ธ”๋ก์ฒด์ธ ์œ„์—์„œ ๋ฐœํ–‰๋˜๊ณ  ๊ฑฐ๋ž˜๋˜๋ฏ€๋กœ ๋„คํŠธ์›Œํฌ ๊ตฌ์กฐ ์ƒ์œผ๋กœ ์—ฐ๊ฒฐ์„ฑ์„ ์ง€๋‹Œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์•ž์„œ ์ง„ํ–‰๋œ ์—ฐ๊ตฌ์— ๋Œ€ํ•œ ์‘์šฉ์œผ๋กœ, ๋ช…ํ™•ํžˆ ๊ตฌ์กฐ์ ์œผ๋กœ ์—ฐ๊ฒฐ๋œ ์ž์‚ฐ๋“ค์ด ์ดˆ๊ณผ ์ˆ˜์ต๋ฅ ์„ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ๋Š” ์ธก์ • ๊ฐ€๋Šฅํ•œ ๊ณตํ†ต๋œ ํŒฉํ„ฐ๋ฅผ ๊ฐ€์ง์„ ๋ณด์ด๊ณ ์ž ํ–ˆ๋‹ค. ์—ฐ๊ตฌ์˜ ๋Œ€์ƒ์„ ์ด๋”๋ฆฌ์›€ ๋ธ”๋ก์ฒด์ธ ์ƒ์˜ ํ† ํฐ๋“ค๋กœ ์ œํ•œํ•˜์—ฌ ์‹ค์ฆ ์‹คํ—˜์„ ์ง„ํ–‰ํ•œ ๊ฒฐ๊ณผ, EIP-1559 ์ ์šฉ ์ดํ›„์— ์ด๋”๋ฆฌ์›€ ๊ฐ€์Šค ์ˆ˜์ต๋ฅ ์ด ์‹œ์žฅ ์ˆ˜์ต๋ฅ ๊ณผ ํ•จ๊ป˜ ํ† ํฐ์˜ ์ˆ˜์ต๋ฅ ์„ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ๋Š” ํŒฉํ„ฐ๋กœ์„œ ์ž‘์šฉํ•จ์„ ๋ณด์˜€๋‹ค. ๋˜ํ•œ, ์ด๋”๋ฆฌ์›€ ๊ฐ€์Šค ์ˆ˜์ต๋ฅ ์€ ํ† ํฐ์˜ ๋ณ€๋™์„ฑ์— ์˜ํ–ฅ์„ ์ฃผ๋Š” ์š”์†Œ๋กœ, ํ† ํฐ ๋ณ€๋™์„ฑ ์˜ˆ์ธก์—๋„ ๋„์›€์„ ์ค„ ์ˆ˜ ์žˆ๋Š” ์š”์†Œ์ž„์„ ์Šคํ•„์˜ค๋ฒ„ ๊ธฐ๋ฐ˜ ๋ณ€๋™์„ฑ ์˜ˆ์ธก ๋ชจ๋ธ์„ ํ†ตํ•ด ํ™•์ธํ•˜์˜€๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์ž์‚ฐ ๊ฐ„์˜ ์—ฐ๊ฒฐ์„ฑ์„ ๊ณ ๋ คํ•œ ์ž์‚ฐ๊ฐ€๊ฒฉ๊ฒฐ์ •๋ชจํ˜•์„ ๊ตฌ์„ฑํ•˜์˜€์œผ๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด์„œ ๊ธˆ์œต ์ž์‚ฐ๋“ค์ด ๊ฐ–๋Š” ๊ทธ๋ž˜ํ”„ ๊ตฌ์กฐ๊ฐ€ ์‹ค์งˆ์ ์œผ๋กœ ์ˆ˜์ต๋ฅ ์— ์˜ํ–ฅ์„ ๋ฏธ์นจ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค. ์ด ์—ฐ๊ตฌ๊ฒฐ๊ณผ๋Š” ํ–ฅํ›„ ์ƒˆ๋กœ์šด ๊ธˆ์œต ์‹œ์žฅ์— ๋Œ€ํ•ด์„œ๋„ ์ ์šฉ ๊ฐ€๋Šฅํ•œ ํ™•์žฅ์„ฑ ์žˆ๋Š” ๋ชจ๋ธ์ด๋ฉฐ, ๊ธˆ์œต ์ž์‚ฐ์˜ ํ‰๊ฐ€์— ์žˆ์–ด ์—ฌ๋Ÿฌ ์ž์‚ฐ์„ ๋™์‹œ์— ์ƒ๊ด€๊ด€๊ณ„๋ฅผ ๊ณ ๋ คํ•˜๋ฉฐ ํ‰๊ฐ€ํ•ด์•ผ ํ•œ๋‹ค๋Š” ํ•จ์˜์ ์„ ์ œ๊ณตํ•˜๊ณ  ์žˆ๋‹ค.Financial assets are always exposed to risks. It is important to evaluate the risk properly and figure out how much each asset is compensated for its risk. Asset pricing model explains the behavior of financial asset return by evaluating the risk and risk exposure of asset return. We focused on factor model structure among asset pricing models, which explains excess return through factor and beta coefficients. While conventional factor models estimate factor or beta through various macroeconomic variables or firm-specific variables, there exist fewer studies considering the connectedness between assets. Since financial assets have connected dynamics, asset returns should be priced simultaneously considering the graph structure of assets. In this dissertation, we proposed the AI-based empirical asset pricing model to reflect the connected structure between assets in the factor model. We first proposed the graph neural network-based multi-factor asset pricing model. As important as the structure of the model in constructing an asset pricing model that reflects the structure of the connection between assets is, how to define the connectivity. Graph neural network requires a well-defined graph structure. We defined the connectedness between assets as the binary converted Pearson correlation coefficients of asset returns by the cutoff value. The proposed model consists of a beta estimation part and a factor estimation part, where each part is estimated with firm characteristics and excess returns, respectively. The empirical analysis of U.S equities reveals that the proposed model has more explanatory power and prediction ability than benchmark models. In addition, the most efficient stochastic discount factor can be estimated from the estimated factors. While return is the main object of asset pricing, volatility is also important property for explaining the behavior of financial assets. Volatility can be the factor in explaining return since many studies point out that return and volatility are correlated. As with the asset pricing model, considering the connected structure between assets in volatility prediction can be of great help in explaining the dynamics of assets. In the volatility analysis, what affects between volatility is called spillover. In this aspect, we proposed the volatility prediction model that can directly reflect this spillover effect. We estimated the graph structure between asset volatility using the volatility spillover index and utilized the spatial-temporal graph neural network structure for model construction. From the empirical analysis of global market indices, we confirm that the proposed model shows the best performance in short- and mid-term volatility forecasting. To include volatility in the asset pricing discussion, it is necessary to focus on how volatility is defined in the asset pricing model. In the asset pricing model, volatility can be interpreted as the variance of the residual of the model. However, asset pricing models with time-series estimation mostly have time-unvarying volatility constraints. We constructed an asset pricing model with time-varying volatility by estimating variability using the prediction model and reflecting it in the training loss of the asset pricing model. We identify that the proposed model can improve the statistical performance during the low volatility period through an empirical study of U.S equities. Currently, there are clearly structurally connected assets in the cryptocurrency market, which has grown to a scale that cannot be ignored. All of the same blockchain-based tokens are issued and traded on that blockchain, so they have strong structural connectivity. We tried to identify that an observable factor for explaining excess return exists in such connected tokens as an application of previous studies. We limited the analysis target to Ethereum-based tokens and showed that the Ethereum gas price became a factor for the macroeconomic factor model after the application of EIP-1559. Furthermore, we applied the volatility spillover index-based volatility prediction model using gas return and showed that gas return can increase the prediction performance of certain tokens' volatility.Chapter 1 Introduction 1 1.1 Motivation of the Dissertation 1 1.2 Aims of the Dissertation 10 1.3 Organization of the Dissertation 13 Chapter 2 Graph-based multi-factor asset pricing model 14 2.1 Chapter Overview 14 2.2 Preliminaries 17 2.2.1 Graph Neural Network 17 2.2.2 Graph Convolutional Network 18 2.3 Methodology 19 2.3.1 Multi-factor asset pricing model 19 2.3.2 Proposed method 21 2.3.3 Forward stagewise additive factor modeling 23 2.4 Empirical Studies 24 2.4.1 Data 24 2.4.2 Benchmark models 24 2.4.3 Empirical results 28 2.5 Chapter Summary 33 Chapter 3 Volatility prediction with volatility spillover index 37 3.1 Chapter Overview 37 3.2 Preliminaries 41 3.2.1 Realized Volatility 41 3.2.2 Volatility Spillover Measurements 42 3.2.3 Benchmark Models 45 3.3 Empirical Studies 50 3.3.1 Data 50 3.3.2 Descriptive Statistics 51 3.3.3 Proposed Method 52 3.3.4 Empirical Results 54 3.4 Chapter Summary 61 Chapter 4 Graph-based multi-factor model with time-varying volatility 64 4.1 Chapter overview 64 4.2 Preliminaries 67 4.2.1 Local-linear regression for time-varying parameter estimation 67 4.3 Methodology 68 4.3.1 Time-varying volatility implied loss function 68 4.3.2 Proposed model architecture 70 4.4 Empirical Studies 72 4.4.1 Data 72 4.4.2 Benchmark Models 72 4.4.3 Empirical Results 73 4.5 Chapter Summary 79 Chapter 5 Macroeconomic factor model and spillover-based volatility prediction for ERC-20 tokens 82 5.1 Chapter Overview 82 5.2 Preliminaries 85 5.3 Methodology 86 5.3.1 Relation analysis 86 5.3.2 Factor model analysis 89 5.3.3 Volatility prediction with volatility spillover index 90 5.4 Empirical Studies 90 5.4.1 Data 90 5.4.2 Empirical Results 98 5.5 Chapter Summary 102 Chapter 6 Conclusion 105 6.1 Contributions 105 6.2 Future Work 108 Bibliography 109 ๊ตญ๋ฌธ์ดˆ๋ก 130๋ฐ•

    Essays on Natural Language Processing and Central Banking.

    Get PDF
    Humans generally interact, communicate, and form social structures using natural language. Due to the high dimensionality of language, much of the wealth of information from these interactions has been barred from the economic profession. However, recent technological advancements lead to increasing use of text as an underlying datasource in economic and financial applications. This trend has been further accelerated by Nobel laureate Robert J. Shiller's presidential address to the American Economic Association "Narrative Economics", in which he argues for more elaboration on narratives - stories that affect individual decisions and collective actions - by the economic scientific community. Addressing this gap in the literature, research has been published utilizing textual information to quantify latent variables such as uncertainty, forecasting macroeconomic variables in real time, and asset price predictions. In conjunction with the rise of natural language processing applications, there has been a shift in perspective on monetary policy with regards to central bank transparency and communication. Transitioning from the presumption that monetary policy is limited to interest rate actions, communication has advanced to become a key tool in the central banker's toolbox. Ever since, words are used to anchor expectations and self-enforce the central banks' desired equilibrium path. As a result, research on monetary policy has been relentless in the pursuit of adopting novel techniques as well as incorporating new unstructured data sources such as news-articles, press conference statements, and speeches. This string of literature is regularly complemented by an extension of the traditional empirical toolbox, borrowing novel techniques from the field of machine learning. The here presented cumulative dissertation consists of four essays that touch on all these fields, namely text as data, monetary policy, and machine learning. My primary focus is on the European Central Bank (ECB), but the methodology and ideas can be extended to other central banks as well. Throughout this thesis, textual information is incorporated from different data sources, analyzed using different techniques in order to approximate different latent variables. As a result, text is employed as a dependent variable at times and as an independent variable at other times. Specifically, the first essay leverages the relative frequency of terms used in ECB press statements as anecdotal evidence for the diversity of the central banks' communication with regard to their topics, whereas the second essay counts positive and negative terms in speeches to approximate the latent variable of central bank loss. The third essay examines the impact of linguistic complexity on financial market participants by conducting a readability test on the ECB's introduction statements, and the final essay dives into computational linguistics to develop a novel central bank-specific language model for better quantifying monetary policy communication. The following is a brief summary of the four essays included in this thesis. My first essay analyzes rule-based monetary policy in the euro area before and after the financial crisis. Jonas Gross and I argue that the environment in which policymakers operate is far more complex than traditional model-based analysis of policy rules permits. We complement this view with evidence from ECB press conferences, demonstrating that the central bank discusses a wide range of topics beyond the traditional Taylor-rule variables. Since each variable has the potential to be relevant in understanding the central bank's reaction function, we combine a literature review with natural language processing to identify a set of potential determinants. The traditional approach of selecting a single interest rate response function is then contrasted by applying a Bayesian model averaging approach to these determinants. We account for model uncertainty by including a large number of determinants and estimating a total of 33.000 different model combinations. Our results suggest that in contrast to the ongoing criticism, the ECB primarily reacts to inflation in its interest rate decision. In fact, our analysis finds that inflation is a significant variable in almost all of the examined model combinations. Furthermore, we find that the ECB reacts to changes in economic activity determinants such as unemployment and production as well. These economic activity indicators were a priority for the ECB prior to the financial crisis but have since declined in relevance, suggesting that inflation is the sole driver of monetary policy decisions in the post-crisis period. Finally, we assess our findings with textual evidence from the ECB press conferences, where, in accordance with the previous results, we find the same shift. My second essay focuses on the ECB's objective itself, quantifying the central bank's satisfaction with current economic conditions through textual analysis. By maximizing an implied objective function, the ECB is assumed to pursue inflation targeting with a subordinate focus on supporting the general economic policy of the European Union. I compute the central bank's sentiment using the ECB's public communication by counting the number of positive and negative words in speeches, allowing me to quantify the objective. Assuming a typical functional form for the objective allows me to estimate the optimal levels with respect to inflation and economic activity, i.e. the bliss points in which the central banks communication is the most positive. Using a dictionary approach to estimate the sentiment index yields several interesting results. The most surprising is, unquestionable, a concave inflation objective with an implied inflation target beyond the banks' mandate and best described as 'above, but close to 2%'. Deviations from this bliss point appear to lower the satisfaction, and hence the optimistic language in speeches. With respect to the subordinate objective, I find a convex objective towards output growth and a linear objective towards the unemployment rate. Furthermore, my results suggest that deviations from the primary objective, the inflation rate, appear to have no greater effect on the speeches' language than deviations from either of the subordinate objectives. In fact, in contrast to inflation, both output and unemployment are consistently significant variables. Finally, contrary to findings in the United States, financial market conditions have no significant influence on the ECB's sentiment. In the third essay, Bernd Hayo, Kai Henseler, Marc Steffen Rapp, and I investigate the impact of central bank communication on financial markets. We are particularly interested in the communication's complexity and how it affects financial market trading. To examine this relationship empirically, we employ high-frequency data from European stock index futures during the introductory statement of the ECB's press conferences. A readability test on the introductory statement during the press conference determines the statements' linguistic complexity. In conjunction with the central banks' unique communication design, we are able to separate the effect of verbal complexity on trading during the introductory statements and the subsequent Q&A session. Our sample contains announcements of novel UMPM, enabling us to investigate whether the content of the introductory statements interacts with the reaction of traders to its linguistic complexity. We find that the Q&A sessions are - in terms of linguistic complexity - less complex and thus more comprehensible. When UMPM are announced, contemporaneous trading volumes are negatively correlated with complexity, resulting in a temporal shift of trading towards the less complex Q&A session. This shift is first indication that financial markets respond to linguistic complexity in a context-specific manner. This line of reasoning is strengthened further by the observation that events containing UMPM are less similar in terms of wording to previous statements. As a result, we believe that financial market traders are underreacting to novel complex information in introductory statements regarding UMPM. The subsequent discussion and clarification of the cognitively costly content during the Q&A session mitigates this effect, shifting trading from the introductory statement phase to the Q&A phase of the ECB's press conference. The final essay concerns the quantification of central bank communication, i.e. it explores how text in monetary policy can be effectively summarised and analysed. Martin Baumgรคrtner and I propose a novel language model, build on machine learning, as a tool to quantify central bankers qualitative information. The necessity and feasibility of measuring central bank communication in this manner stems from two major developments in the fields of monetary policy and machine learning over the last two decades. On the one hand, central bankers' communication, as well as its analysis, has increased substantially. This progress necessitates some form of quantification of the qualitative components, a research topic dominated by dictionary approaches. On the other hand, advances at the intersection of linguistics and computer science enabled the use of machine learning to train language models capable of adequately capturing the languages multidimensionality and context-dependence. The resulting models are regularly open source. However, the technical jargon of central bankers renders them generally unsuitable for use in the field. This essay aims to apply computational linguistics research to monetary policy by developing a language model exclusively trained on central bank communication. To accomplish this, we gather a large and diverse text corpus, which we use to compare a number of state-of-the-art machine learning algorithms. Choosing the most promising, we develop a central bank specific language model. Several applications are presented to showcase the broad applicability of our language model. First, we propose a novel technique for comparing central banks, affirming that similarity is driven by mutual objectives. Next, we construct a time-series index that reflects the ECB's willingness to act as a lender of last resort. The index suggests that communication similar to Mario Draghi's 'whatever it takes' speech can calm financial markets during times of high uncertainty. The third application emphasizes the presence of prejudices even in central bankers' technical language. We demonstrate how social patterns, such as occupational gender distribution, are reflected in their communication. The final application is a forecasting exercise that suggests that speeches may be more accurate predictors than previous research suggests
    • โ€ฆ
    corecore