109 research outputs found

    Iterative Matrix Factorization Method for Social Media Data Location Prediction

    Get PDF
    Since some of the location of where the users posted their tweets collected by social media company have varied accuracy, and some are missing. We want to use those tweets with highest accuracy to help fill in the data of those tweets with incomplete information. To test our algorithm, we used the sets of social media data from a city, we separated them into training sets, where we know all the information, and the testing sets, where we intentionally pretend to not know the location. One prediction method that was used in (Dukler, Han and Wang, 2016) requires appending one-hot encoding of the location to the bag of words matrix to do Location Oriented Nonnegative Matrix Factorization (LONMF). We improve further on this algorithm by introducing iterative LONMF. We found that when the threshold and number of iterations are chosen correctly, we can predict tweets location with higher accuracy than using LONMF

    User Behavior Mining in Microblogging

    Get PDF

    Advertising recommendation system based on dynamic data analysis on Turkish speaking Twitter users

    Get PDF
    Online okruženja, a posebno društvene mreže postala su snažna alternative objavljivanju oglasa. Za učinkovito oglašavanje važno je da se sadržaj poistovjećuje s očekivanjima ciljane publike. Uzimajući u obzir da se očekivanja mogu s vremenom promijeniti, potrebno je u realnom vremenu i dinamički prepoznati orijentaciju korisnika. U ovom su se radu u realnom vremenu analizirale poruke turskih korisnika Twittera i identificirala njihova trenutna očekivanja. U tu je svrhu dizajnirana web usluga koja analizira profil korisnika i daje oglase koji najbolje odgovaraju očekivanjima. Za filtriranje odgovarajućeg sadržaja oglašavanja korištena je metoda nazvana heuristička metoda odstranjivanja suvišnog (Heuristic Pruning Method - HPM). Razvijeni sustav je testiran na grupi volontera, aktivnih korisnika Twittera, a učinkovitost sustava se pokazala dobivenom povratnom informacijom-feedbackom.Online environments and especially social networks have become a great alternative to advertisement publishing. In order to accomplish effective advertising it is important that the contents coincide with the expectations of the target audience. Considering that expectations may change over time, it is required to identify the orientation of the users in real time and dynamically. In this study, the messages shared by Turkish Twitter users were analysed in real time and the instant expectations of the users have been identified. To perform this work, a web service was designed which analyses the user’s profile and presents the advertisements that suit best to expectations. A method called Heuristic Pruning Method (HPM) has been revealed in order to filter the most appropriate advertising content. The developed system has been tested on a voluntary participant group who actively uses Twitter, and the effectiveness of the system is demonstrated by the received feedback

    Trajectory Length Prediction for Intelligent Traffic Signaling: A Data-Driven Approach

    Get PDF
    Ship trajectory length prediction is vital for intelligent traffic signaling in the controlled waterways of the Yangtze River. In current intelligent traffic signaling systems (ITSSs), ships are supposed to travel exactly along the central line of the Yangtze River, which is often not a valid assumption and has caused a number of problems. Over the past few years, traffic data have been accumulated exponentially, leading to the big data era. This trend allows more accurate prediction of ships' travel trajectory length based on historical data. In this paper, ships' historical trajectories are first grouped by using the fuzzy c-means clustering algorithm. The relationship between some known factors (i.e., ship speed, loading capacity, self-weight, maximum power, ship length, ship width, ship type, and water level) and the resultant memberships are then modeled using artificial neural networks. The trajectory length is then estimated by the sum of the predicted probabilities multiplied by the trajectory cluster centers' length. To the best of our knowledge, this is the first time to predict the overall trajectory length of manually controlled ships. The experimental results show that the proposed method can reduce the probability of generating incorrect traffic control signals by 74.68% over existing ITSSs. This will significantly improve the efficiency of the Yangtze River traffic management system and increase the traffic capacity by reducing the traveling time
    corecore