
    Detecting Singleton Review Spammers Using Semantic Similarity

    Online reviews have become an important resource for consumers making purchases. However, it is becoming more and more difficult for people to make well-informed buying decisions without being deceived by fake reviews. Prior work on the opinion spam problem mostly classified fake reviews using behavioral user patterns. It focused on prolific users who write more than a couple of reviews, discarding one-time reviewers. The number of singleton reviewers, however, is expected to be high for many review websites. While behavioral patterns are effective when dealing with elite users, for one-time reviewers the review text needs to be exploited. In this paper we tackle the problem of detecting fake reviews written by the same person using multiple names, posting each review under a different name. We propose two methods to detect similar reviews and show that the results generally outperform the vectorial similarity measures used in prior work. The first method extends the semantic similarity between words to the review level. The second method is based on topic modeling and exploits the similarity of the reviews' topic distributions using two models: bag-of-words and bag-of-opinion-phrases. The experiments were conducted on reviews from three different datasets: Yelp (57K reviews), Trustpilot (9K reviews) and the Ott dataset (800 reviews). Comment: 6 pages, WWW 201
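To make the two kinds of comparison in this abstract concrete, here is a minimal sketch. It assumes a plain bag-of-words cosine similarity as the "vectorial" baseline and a Jensen-Shannon-based score for comparing topic distributions. The abstract does not say which measure the authors use, so both function names and the choice of Jensen-Shannon divergence are illustrative assumptions, and the topic distributions are taken as given rather than inferred by a topic model.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Bag-of-words cosine similarity between two review texts (baseline)."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def js_similarity(p, q):
    """1 minus the Jensen-Shannon divergence (base 2) of two distributions.

    p and q are topic distributions of equal length that each sum to 1.
    The score is 1 for identical distributions and 0 for disjoint ones.
    (Assumed measure; not taken from the paper.)
    """
    m = [(pi + qi) / 2.0 for pi, qi in zip(p, q)]

    def kl(x, y):
        # Terms with x_i == 0 contribute nothing to the divergence.
        return sum(xi * math.log2(xi / yi) for xi, yi in zip(x, y) if xi > 0)

    return 1.0 - 0.5 * (kl(p, m) + kl(q, m))


if __name__ == "__main__":
    print(cosine_similarity("great food and great service",
                            "great service and friendly staff"))
    # Hypothetical topic distributions for two reviews over three topics.
    print(js_similarity([0.7, 0.2, 0.1], [0.6, 0.3, 0.1]))
```

Two reviews with little word overlap can still have close topic distributions, which is the case a topic-based score is meant to catch.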

    In vitro inhibition of growth in Saprolegnia sp. isolated from the eggs of Persian sturgeon Acipenser persicus (Pisces: Acipenseriformes) by Pseudomonas aeroginosa (PTCC:1430)

    Saprolegnia is one of the most important agents decreasing the survival rate of eggs in sturgeon hatcheries. There are some chemical substances for controlling the fungal infection of eggs. In this study, an attempt was made to introduce a Gram-negative bacterium, Pseudomonas aeroginosa (PTCC1430) (Persian Type Culture Collection), as a biocontrol agent against this water mold. Saprolegnia was isolated from the eggs of infected Persian sturgeon, Acipenser persicus, in a sturgeon hatchery and then purified. P. aeroginosa was cultured in Potato Dextrose Agar (PDB) media and then prepared in 5 concentrations (10^3, 10^4, 10^5, 10^6 and 10^7 cfu ml^-1) and challenged with the fungus in petri dishes under laboratory conditions. The results showed that increasing the concentration of the bacteria in the plates reduced hyphal growth of the fungus. The highest concentration of P. aeroginosa (10^7) almost completely stopped fungal growth, and the Minimum Inhibitory Concentration (MIC) was 10^4 cfu ml^-1. The results of this study imply the potential of P. aeroginosa (PTCC1430) as a biological agent for controlling saprolegniosis.

    Distributions of Historic Market Data -- Relaxation and Correlations

    We investigate relaxation and correlations in a class of mean-reverting models for stochastic variances. We derive closed-form expressions for the correlation functions and leverage for a general form of the stochastic term. We also discuss correlation functions and leverage for three specific models -- multiplicative, Heston (Cox-Ingersoll-Ross) and combined multiplicative-Heston -- whose steady-state probability density functions are Gamma, Inverse Gamma and Beta Prime respectively, the latter two exhibiting "fat" tails. For the Heston model, we apply the eigenvalue analysis of the Fokker-Planck equation to derive the correlation function -- in agreement with the general analysis -- and to identify a series of time scales, which are observable in relaxation of cumulants on approach to the steady state. We test our findings on a very large set of historic financial markets data. Comment: 17 pages, 8 figures, 3 table
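As an illustration of relaxation in a mean-reverting variance model, here is a minimal sketch for the Heston (Cox-Ingersoll-Ross) case, written as dv = -gamma (v - theta) dt + kappa sqrt(v) dW. The symbol names, parameter values and the Euler-Maruyama discretization are assumptions made for this example, not taken from the paper. The sketch compares the simulated mean variance with the known exponential relaxation of the first cumulant, E[v_t] = theta + (v0 - theta) exp(-gamma t).

```python
import math
import random


def exact_mean(v0, theta, gamma, t):
    """Mean of the CIR variance at time t: exponential relaxation to theta."""
    return theta + (v0 - theta) * math.exp(-gamma * t)


def simulate_variance(v0, theta, gamma, kappa, t_end, dt, rng):
    """One Euler-Maruyama path of dv = -gamma (v - theta) dt + kappa sqrt(v) dW.

    The square root is taken of max(v, 0) so that a small negative
    excursion caused by discretization does not raise an error.
    """
    v = v0
    steps = int(round(t_end / dt))
    for _ in range(steps):
        noise = rng.gauss(0.0, 1.0)
        v += -gamma * (v - theta) * dt + kappa * math.sqrt(max(v, 0.0) * dt) * noise
    return v


def simulated_mean(n_paths=2000, seed=0, **params):
    """Average the terminal variance over n_paths independent paths."""
    rng = random.Random(seed)
    total = sum(simulate_variance(rng=rng, **params) for _ in range(n_paths))
    return total / n_paths


# Illustrative parameters (assumed, not fitted to any data).
PARAMS = dict(v0=0.09, theta=0.04, gamma=1.0, kappa=0.2, t_end=1.0, dt=0.01)

if __name__ == "__main__":
    print("simulated mean:", simulated_mean(**PARAMS))
    print("exact mean:    ", exact_mean(0.09, 0.04, 1.0, 1.0))
```

The mean relaxes on the single time scale 1/gamma. Higher cumulants would need their own estimators, which this sketch does not attempt.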

    Are there Dragon Kings in the Stock Market?

    We undertake a systematic study of historic market volatility spanning roughly five preceding decades. We focus specifically on the time series of realized volatility (RV) of the S&P500 index and its distribution function. As expected, the largest values of RV coincide with the largest economic upheavals of the period: Savings and Loan Crisis, Tech Bubble, Financial Crisis and Covid Pandemic. We address the question of whether these values belong to one of three categories: Black Swans (BS), that is, they lie on scale-free, power-law tails of the distribution; Dragon Kings (DK), defined as statistically significant upward deviations from BS; or Negative Dragon Kings (nDK), defined as statistically significant downward deviations from BS. In analyzing the tails of the distribution with RV > 40, we observe the appearance of "potential" DK which eventually terminate in an abrupt plunge to nDK. This phenomenon becomes more pronounced as the number of days over which the average RV is calculated increases -- here from daily, n=1, to "monthly," n=21. We fit the entire distribution with a modified Generalized Beta (mGB) distribution function, which terminates at a finite value of the variable but exhibits a long power-law stretch prior to that, as well as with a Generalized Beta Prime (GB2) distribution function, which has a power-law tail. We also fit the tails directly with a straight line on a log-log scale. In order to ascertain BS, DK or nDK behavior, all fits include their confidence intervals, and p-values are evaluated for the data points to check whether they can come from the respective distributions. Comment: 20 pages, 15 figue
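The n-day averaging and the straight-line tail fit on a log-log scale mentioned in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the tail is represented by the empirical complementary CDF of values above a threshold, the line is fitted by ordinary least squares, and the demonstration uses synthetic Pareto-distributed values rather than market data. It does not reproduce the confidence intervals or p-values described by the authors.

```python
import math


def rolling_mean(values, n):
    """Average of each window of n consecutive values (n=1 returns the input)."""
    return [sum(values[i:i + n]) / n for i in range(len(values) - n + 1)]


def loglog_tail_slope(samples, threshold):
    """Least-squares slope of log(CCDF) against log(x) for samples > threshold.

    For a power-law tail with CCDF proportional to x**(-alpha), the slope
    is approximately -alpha.
    """
    tail = sorted(x for x in samples if x > threshold)
    n = len(tail)
    if n < 2:
        raise ValueError("need at least two tail points")
    xs = [math.log(x) for x in tail]
    # Empirical CCDF: fraction of tail points greater than or equal to x.
    ys = [math.log((n - k) / n) for k in range(n)]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def pareto_quantiles(n, x_min, alpha):
    """Deterministic synthetic data: evenly spaced quantiles of a Pareto law."""
    return [x_min * (1.0 - (i + 0.5) / n) ** (-1.0 / alpha) for i in range(n)]


if __name__ == "__main__":
    data = pareto_quantiles(1000, x_min=40.0, alpha=3.0)
    print("fitted slope:", loglog_tail_slope(data, threshold=40.0))
    print("5-day averages:", rolling_mean(data[:10], 5))
```

Points that sit above or below the fitted line at the far end of the tail are the candidates for the DK or nDK labels, but deciding that requires the significance tests the abstract describes.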

    Distributions of Historic Market Data -- Implied and Realized Volatility

    We undertake a systematic comparison between implied volatility, as represented by VIX (new methodology) and VXO (old methodology), and realized volatility. We compare visually and statistically the distributions of realized and implied variance (volatility squared) and study the distribution of their ratio. We find that the ratio is best fitted by heavy-tailed -- lognormal and fat-tailed (power-law) -- distributions, depending on whether the preceding or the concurrent month of realized variance is used. We do not find a substantial difference in accuracy between VIX and VXO. Additionally, we study the variance of theoretical realized variance for Heston and multiplicative models of stochastic volatility and compare those with realized variance obtained from historic market data. Comment: 28 pages, 40 figures, 16 table

    Distribution of Historic Market Data – Implied and Realized Volatility

    We undertake a systematic comparison between implied volatility, as represented by VIX (new methodology) and VXO (old methodology) and realized volatility. We do not find substantial difference in accuracy between VIX and VXO. We compare visually and statistically the distributions of realized and implied variance (volatility squared) and study the distribution of their ratio. The ratio distributions are studied both for the known realized variance (for the current month) and for the predicted realized variance (for the following month). We show that the ratio of the two is best fitted by a Beta Prime distribution, whose shape parameters depend strongly on which of the two months is used