7,013 research outputs found

    Off-Policy Evaluation of Probabilistic Identity Data in Lookalike Modeling

    Full text link
    We evaluate the impact of probabilistically-constructed digital identity data collected from Sep. to Dec. 2017 (approx.), in the context of Lookalike-targeted campaigns. The backbone of this study is a large set of probabilistically-constructed "identities", represented as small bags of cookies and mobile ad identifiers with associated metadata, that are likely all owned by the same underlying user. The identity data allows to generate "identity-based", rather than "identifier-based", user models, giving a fuller picture of the interests of the users underlying the identifiers. We employ off-policy techniques to evaluate the potential of identity-powered lookalike models without incurring the risk of allowing untested models to direct large amounts of ad spend or the large cost of performing A/B tests. We add to historical work on off-policy evaluation by noting a significant type of "finite-sample bias" that occurs for studies combining modestly-sized datasets and evaluation metrics involving rare events (e.g., conversions). We illustrate this bias using a simulation study that later informs the handling of inverse propensity weights in our analyses on real data. We demonstrate significant lift in identity-powered lookalikes versus an identity-ignorant baseline: on average ~70% lift in conversion rate. This rises to factors of ~(4-32)x for identifiers having little data themselves, but that can be inferred to belong to users with substantial data to aggregate across identifiers. This implies that identity-powered user modeling is especially important in the context of identifiers having very short lifespans (i.e., frequently churned cookies). Our work motivates and informs the use of probabilistically-constructed identities in marketing. It also deepens the canon of examples in which off-policy learning has been employed to evaluate the complex systems of the internet economy.Comment: Accepted by WSDM 201

    Tracking Human Mobility using WiFi signals

    Get PDF
    We study six months of human mobility data, including WiFi and GPS traces recorded with high temporal resolution, and find that time series of WiFi scans contain a strong latent location signal. In fact, due to inherent stability and low entropy of human mobility, it is possible to assign location to WiFi access points based on a very small number of GPS samples and then use these access points as location beacons. Using just one GPS observation per day per person allows us to estimate the location of, and subsequently use, WiFi access points to account for 80\% of mobility across a population. These results reveal a great opportunity for using ubiquitous WiFi routers for high-resolution outdoor positioning, but also significant privacy implications of such side-channel location tracking

    Ambient Intelligence As The Bridge To The Future of Pervasive Computing

    Get PDF
    One prediction about this future of pervasive technology is that people will carry the tools needed to interface with technological resources sprinkled through out the environment. A problem with this vision is the dark side of the network effect: early adopters will end up carrying around interfaces for technology that largely does not yet exist, and building managers will question the value of installing technology with features that almost no one will be able to use. An intermediate solution is that certain buildings with specific needs for efficiency or security (such as hospitals) may become smart, with technology insinuated into particular spaces. Since many, or even most of the people in these spaces will not have the technology to interface directly with the new pervasive resources, we must think of the interaction idiom as initially being closer to the notion of smart environments. These environments will have to sense, interpret, and facilitate the actions of the inhabitants, possibly with very little help from technology attached to the people involved, or even their cooperation. We survey a body of work on perceptual tools for smart buildings, built on the sensor network model, and focused on the idea that statistical methods and population dynamics can provide valuable information even in situations where detection of individual instances of behavior may be difficult to detect. These are some of the tools which will fuel the building optimization applications that will justify the efforts of early adopters to build smart buildings studded with pervasive technology
    corecore