61 research outputs found

    Identification of Opinion Spammers using Reviewer Reputation and Clustering Analysis

    Get PDF
    Online reviews have increasingly become a very important resource before making a purchasing decisions. Unfortunately, malicious sellers try to game the system by hiring a person or team (which is called spammers) to fabricate fake reviews to improve their reputation.Existing methods mainly take the problem as a general binary classification or focus on some heuristic rules. However, supervised learning methods relies heavily on a large number of labeled examples of deceptive and truthful opinions by domain experts, and most of features mentioned in the heuristic strategy ignore the characteristic of the group organization among spammers. In this paper, an effective method of identifying opinion spammers is proposed. Firstly, suspected spammers are detected by means of unsupervised learning based on reviewer’s reputation. We believe that the reviewer’s reputation has a direct relation with the quality of reviews. Generally, review written by user with lower reputation, shows lower quality and higher possibility to be fake. Therefore, the model assigns reputation score to each reviewer wherein the content based factors and activeness of reviewers are employed efficiently. On basis of all suspected spammers, k-center clustering algorithm is performed to further spot the spammers based on the observation of burst of review release time. Experimental results on Amazon’s dataset are encouraging and indicate that our approach poses high accuracy and recall, and good performance is achieved

    Assessing stellar yields in Galaxy chemical evolution: observational stellar abundance patterns

    Full text link
    One-zone Galactic Chemical Evolution (GCE) models have provided useful insights on a great wealth of average abundance patterns in many environments, especially for the Milky Way and its satellites. However, the scatter of such abundance patterns is still a challenging aspect to reproduce. The leading hypothesis is that dynamics is a likely major source of the dispersion. In this work we test another hypothesis, namely that different assumptions on yield modeling may be at play simultaneously. We compare whether the abundance patterns spanned by the models are consistent with those observed in Galactic data. First, we test the performance of recent yield tabulations, and we show which of these tabulations best fit Galactic stellar abundances. We then group the models and test if yield combinations match data scatter and standard deviation. On a fixed Milky-Way-like parametrization of NuPyCEE, we test a selection of yields for the three dominant yield sets: low-to-intermediate mass stars, massive stars, and Type Ia supernovae. We also include the production of r-process elements by neutron star mergers. We explore the statistical properties spanned by such yields. We identify the differences and commonalities among yield sets. We define criteria that estimate whether an element is in agreement with the data, or if the model overestimates or underestimates it in various redshift bins. While it is true that yields are a major source of uncertainty in GCE models, the scatter of abundances in stellar spectra cannot be explained by a simple averaging of runs across yield prescriptions.Comment: 22 pages, 19 figures, accepted for publication in MNRA

    Stellar Stream Candidates in the Solar Neighborhood Found in the LAMOST DR3 and TGAS

    Get PDF
    We have cross-matched the LAMOST DR3 with the Gaia DR1 TGAS catalogs and obtained a sample of 166,827 stars with reliable kinematics. A technique based on the wavelet transform was applied to detect significant overdensities in velocity space among five subsamples divided by spatial position. In total, 16 significant overdensities of stars with very similar kinematics were identified. Among these, four are new stream candidates and the rest are previously known groups. Both the U-V velocity and metallicity distributions of the local sample show a clear gap between the Hercules structure and the Hyades-Pleiades structure. The U-V positions of these peaks shift with the spatial position. Following a description of our analysis, we speculate on possible origins of our stream candidates.Comment: 16 pages, 5 figure

    Satellite galaxies' drag on field stars in the Milky Way

    Full text link
    With Gaia EDR3 data, velocity dispersion of Milky Way field stars around satellite galaxies have been investigated. We have fitted velocity dispersion against distance to satellite galaxy and found the gradient of velocity dispersion is related to the mass of satellite galaxy. With order-of-magnitude approximations, a linear correlation has been fitted between the mass of satellite galaxy and gradient of velocity dispersion caused by its gravitational drag. Though our result is an observational qualitative result, it shows better relation could be obtained with more observations in the future.Comment: 8 pages, 3 figure

    A Stochastic Adjustment Strategy for Coordination Process in Distributed Networks

    Get PDF
    Cloud computing has become a popular basis that integrated into amount of large platforms to support applications (e.g., multimedia, vehicle traffic, and IoT). It is critical to focus on coordinating the part of these applications that execute in the cloud to provide reliable, scalable and available services. Nevertheless, the problem of optimally coordinating the applications is rarely addressed. In this paper, we develop a stochastic model to analyze the fundamental characteristics that occur in ZooKeeper during the coordination process. The model primarily addresses two aspects: demands of followers and the load of a leader. Then, we derive the optimal strategy for provision with deployment of coordinated servers to achieve load balancing based on various factors (e.g. server capacity and network load), so that the overall network performance is optimized. We evaluate our algorithm under realistic settings and reveal the trend of factors such as CPU, memory utilization and network bandwidth with the increasing number of requests. We propose the algorithm that considers how many servers should be deployed and when. Our results demonstrate that the strategy guarantees the performance by making suitable deployment adjustment

    Nearly 30,000 Late-Type Main-Sequence Stars with Stellar Age from LAMOST DR5

    Get PDF
    We construct a sample of nearly 30,000 main-sequence stars with 4500K \u3c Teff \u3c 5000K and stellar ages estimated by the chromospheric activity−age relation. This sample is used to determine the age distribution in the R − Z plane of the Galaxy, where R is the projected Galactocentric distance in the disk midplane and Z is the height above the disk midplane. As |Z| increases, the percentage of old stars becomes larger. It is known that scale-height of Galactic disk increases as R increases, which is called flare. A mild flare from R ∼ 8.0 to 9.0 kpc in stellar age distribution is found. We also find that the velocity dispersion increases with age as confirmed by previous studies. Finally we present spiral-shaped structures in Z − υZ phase space in three stellar age bins. The spiral is clearly seen in the age bin of [0, 1] Gyr, which suggests that a vertical perturbation to the disk probably took place within the last ∼ 1.0 Gyr

    On the Migration Origin of the Hercules Moving Group with GAIA, LAMOST, APOGEE, and GALAH Surveys

    Full text link
    Using Gaia DR3 data and the wavelet transformation technique, we study the substructures of the Hercules moving group (HMG): Hercules 1 (H1) and Hercules 2 (H2). Spectroscopic survey data from LAMOST, APOGEE, and GALAH are used to obtain metallicities and ages of stars belonging to the HMG. Our analysis leads to several key findings as follows: (aa) the HMG is on average richer in metallicity than the Galactic disk, with H2 being metal richer than H1; (bb) the HMG likely has a radial metallicity gradient distinct from that of the disk; (cc) the HMG is on average older than the disk, with H2 being older than H1; (dd) the HMG likely has a radial age gradient distinct from that of the disk; and (ee) the metallicity and age distributions of the HMG depend mainly on the Galactic radius but show no dependence on the azimuthal velocity. Taken all together, we conclude that the HMG is composed primarily of stars undergoing radial migration. We suggest that the HMG is associated with a higher-order dynamical resonance of the bar of the Galaxy.Comment: 15 pages, 6 figures. Accepted by Ap
    corecore