92 research outputs found

    Towards a User Privacy-Aware Mobile Gaming App Installation Prediction Model

    Full text link
    Over the past decade, programmatic advertising has received a great deal of attention in the online advertising industry. A real-time bidding (RTB) system is rapidly becoming the most popular method to buy and sell online advertising impressions. Within the RTB system, demand-side platforms (DSP) aim to spend advertisers' campaign budgets efficiently while maximizing profit, seeking impressions that result in high user responses, such as clicks or installs. In the current study, we investigate the process of predicting a mobile gaming app installation from the point of view of a particular DSP, while paying attention to user privacy, and exploring the trade-off between privacy preservation and model performance. There are multiple levels of potential threats to user privacy, depending on the privacy leaks associated with the data-sharing process, such as data transformation or de-anonymization. To address these concerns, privacy-preserving techniques were proposed, such as cryptographic approaches, for training privacy-aware machine-learning models. However, the ability to train a mobile gaming app installation prediction model without using user-level data, can prevent these threats and protect the users' privacy, even though the model's ability to predict may be impaired. Additionally, current laws might force companies to declare that they are collecting data, and might even give the user the option to opt out of such data collection, which might threaten companies' business models in digital advertising, which are dependent on the collection and use of user-level data. We conclude that privacy-aware models might still preserve significant capabilities, enabling companies to make better decisions, dependent on the privacy-efficacy trade-off utility function of each case.Comment: 11 pages, 3 figure

    AN ENSEMBLE MODEL FOR CLICK THROUGH RATE PREDICTION

    Get PDF
    Internet has become the most prominent and accessible way to spread the news about an event or to pitch, advertise and sell a product, globally. The success of any advertisement campaign lies in reaching the right class of target audience and eventually convert them as potential customers in the future. Search engines like the Google, Yahoo, Bing are a few of the most used ones by the businesses to market their product. Apart from this, certain websites like the www.alibaba.com that has more traffic also offer services for B2B customers to set their advertisement campaign. The look of the advertisement, the maximum bill per day, the age and gender of the audience, the bid price for the position and the size of the advertisement are some of the key factors that are available for the businesses to tune. The businesses are predominantly charged based the number of clicks that they received for their advertisement while some websites also bill them with a fixed charge per billing cycle. This creates a necessity for the advertising platforms to analyze and study these influential factors to achieve the maximum possible gain through the advertisements. Additionally, it is equally important for the businesses to customize these factors rightly to achieve the maximum clicks. This research presents a click through rate prediction system that analyzes several of the factors mentioned above to predict if an advertisement will receive a click or not with improvements over the existing systems in terms of the sampling the data, the features used, and the methodologies handled to improve the accuracy. We used the ensemble model with weighted scheme and achieved an accuracy of 0.91 on a unit scale and predicted the probability for an advertisement to receive a click form the user

    Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting

    Get PDF
    The most significant progress in recent years in online display advertising is what is known as the Real-Time Bidding (RTB) mechanism to buy and sell ads. RTB essentially facilitates buying an individual ad impression in real time while it is still being generated from a user’s visit. RTB not only scales up the buying process by aggregating a large amount of available inventories across publishers but, most importantly, enables direct targeting of individual users. As such, RTB has fundamentally changed the landscape of digital marketing. Scientifically, the demand for automation, integration and optimisation in RTB also brings new research opportunities in information retrieval, data mining, machine learning and other related fields. In this monograph, an overview is given of the fundamental infrastructure, algorithms, and technical solutions of this new frontier of computational advertising. The covered topics include user response prediction, bid landscape forecasting, bidding algorithms, revenue optimisation, statistical arbitrage, dynamic pricing, and ad fraud detection

    Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting

    Get PDF
    The most significant progress in recent years in online display advertising is what is known as the Real-Time Bidding (RTB) mechanism to buy and sell ads. RTB essentially facilitates buying an individual ad impression in real time while it is still being generated from a user’s visit. RTB not only scales up the buying process by aggregating a large amount of available inventories across publishers but, most importantly, enables direct targeting of individual users. As such, RTB has fundamentally changed the landscape of digital marketing. Scientifically, the demand for automation, integration and optimisation in RTB also brings new research opportunities in information retrieval, data mining, machine learning and other related fields. In this monograph, an overview is given of the fundamental infrastructure, algorithms, and technical solutions of this new frontier of computational advertising. The covered topics include user response prediction, bid landscape forecasting, bidding algorithms, revenue optimisation, statistical arbitrage, dynamic pricing, and ad fraud detection

    Prediction of Conversion Rates in Online Marketing - A study of the application of logistic regression for predicting conversion rates in online marketing.

    Get PDF
    This thesis was written in collaboration with an anonymous European automotive company, Company X, which uses online marketing as a part of their business model. In online marketing it is of inrest to estimate conversion rates, that is the quota of a population at an initial state that will go on to perform a certain action. The action could be, but is not limited to, clicking on an advertisement, interacting in a certain way with the advertisers webpage, or buying a product. If the advertiser can estimate the value of the performed action, and the conversion rate to the action, the advertiser can then calculate the value of the initial state. In extension, is means that if a company knows the life time value of a customer, and can estimate the conversion rate from someone clicking on one of their advertisements to becoming a customer, they can calculate the value of that click. Generally online marketing space is sold through auctions. Different companies bifor the same given advertising space depending on the expected value of the space and pay for exposure. Exposure is either measured in how many users that has seen the ad (impressions) or how many users that have interacted with the ad (usually measured in clicks). Due to this, if a company can improve the precision of how they estimate the value of an impression or click they can spend their online marketing budget more effectively. Considering the size and rapid growth of the online marketing market, this is of high interest. In this thesis a logistic regression modeling appach was compared to a group average approach for predicting conversion rates. The group average approach is based on grouping different advertisements that have few observations into bigger populations and then using the average of the bigger population. The thesis finds that in most cases logistic regression models seems preferable. However, when the variance of the conversion rates is large, the Group average model can be prefereble

    Click Fraud Detection in Online and In-app Advertisements: A Learning Based Approach

    Get PDF
    Click Fraud is the fraudulent act of clicking on pay-per-click advertisements to increase a site’s revenue, to drain revenue from the advertiser, or to inflate the popularity of content on social media platforms. In-app advertisements on mobile platforms are among the most common targets for click fraud, which makes companies hesitant to advertise their products. Fraudulent clicks are supposed to be caught by ad providers as part of their service to advertisers, which is commonly done using machine learning methods. However: (1) there is a lack of research in current literature addressing and evaluating the different techniques of click fraud detection and prevention, (2) threat models composed of active learning systems (smart attackers) can mislead the training process of the fraud detection model by polluting the training data, (3) current deep learning models have significant computational overhead, (4) training data is often in an imbalanced state, and balancing it still results in noisy data that can train the classifier incorrectly, and (5) datasets with high dimensionality cause increased computational overhead and decreased classifier correctness -- while existing feature selection techniques address this issue, they have their own performance limitations. By extending the state-of-the-art techniques in the field of machine learning, this dissertation provides the following solutions: (i) To address (1) and (2), we propose a hybrid deep-learning-based model which consists of an artificial neural network, auto-encoder and semi-supervised generative adversarial network. (ii) As a solution for (3), we present Cascaded Forest and Extreme Gradient Boosting with less hyperparameter tuning. (iii) To overcome (4), we propose a row-wise data reduction method, KSMOTE, which filters out noisy data samples both in the raw data and the synthetically generated samples. (iv) For (5), we propose different column-reduction methods such as multi-time-scale Time Series analysis for fraud forecasting, using binary labeled imbalanced datasets and hybrid filter-wrapper feature selection approaches

    Reducing information asymmetry in used-car markets by using machine learning models

    Get PDF
    Information asymmetry in used-car markets results from knowledge differences between buyers and sellers about used cars. Naturally, someone who owns a used car for a certain period, develops a deeper understanding of the real value opposed to someone who did not. The goal of this work is to attempt to reduce information asymmetry in used-car markets by using state-of-the-art machine learning models. With data provided by a Polish used-car online marketplace, a price range estimation as well as a point estimation will be made for every car. A Median Absolute Percentage Error of 7.86%and Target Zone of 58.38% are achieved

    Machine learning applications in operations management and digital marketing

    Get PDF
    In this dissertation, I study how machine learning can be used to solve prominent problems in operations management and digital marketing. The primary motivation is to show that the application of machine learning can solve problems in ways that existing approaches cannot. In its entirety, this dissertation is a study of four problems—two in operations management and two in digital marketing—and develops solutions to these problems via data-driven approaches by leveraging machine learning. These four problems are distinct, and are presented in the form of individual self-containing essays. Each essay is the result of collaborations with industry partners and is of academic and practical importance. In some cases, the solutions presented in this dissertation outperform existing state-of-the-art methods, and in other cases, it presents a solution when no reasonable alternatives are available. The problems are: consumer debt collection (Chapter 3), contact center staffing and scheduling (Chapter 4), digital marketing attribution (Chapter 5), and probabilistic device matching (Chapters 6 and 7). An introduction of the thesis is presented in Chapter 1 and some basic machine learning concepts are described in Chapter 2
    • …
    corecore