Search CORE

129 research outputs found

Dynamic Ad Allocation: Bandits with Budgets

Author: Slivkins Aleksandrs
Publication venue
Publication date: 01/06/2013
Field of study

We consider an application of multi-armed bandits to internet advertising (specifically, to dynamic ad allocation in the pay-per-click model, with uncertainty on the click probabilities). We focus on an important practical issue that advertisers are constrained in how much money they can spend on their ad campaigns. This issue has not been considered in the prior work on bandit-based approaches for ad allocation, to the best of our knowledge. We define a simple, stylized model where an algorithm picks one ad to display in each round, and each ad has a \emph{budget}: the maximal amount of money that can be spent on this ad. This model admits a natural variant of UCB1, a well-known algorithm for multi-armed bandits with stochastic rewards. We derive strong provable guarantees for this algorithm

arXiv.org e-Print Archive

CiteSeerX

Contextual Bandits with Cross-learning

Author: Balseiro Santiago
Golrezaei Negin
Mahdian Mohammad
Mirrokni Vahab
Schneider Jon
Publication venue
Publication date: 03/01/2020
Field of study

In the classical contextual bandits problem, in each round

t

, a learner observes some context

c

, chooses some action

a

to perform, and receives some reward

r_{a,t}(c)

. We consider the variant of this problem where in addition to receiving the reward

r_{a,t}(c)

, the learner also learns the values of

r_{a,t}(c')

for all other contexts

c'

; i.e., the rewards that would have been achieved by performing that action under different contexts. This variant arises in several strategic settings, such as learning how to bid in non-truthful repeated auctions (in this setting the context is the decision maker's private valuation for each auction). We call this problem the contextual bandits problem with cross-learning. The best algorithms for the classical contextual bandits problem achieve

\tilde{O}(\sqrt{CKT})

regret against all stationary policies, where

C

is the number of contexts,

K

the number of actions, and

T

the number of rounds. We demonstrate algorithms for the contextual bandits problem with cross-learning that remove the dependence on

C

and achieve regret

O(\sqrt{KT})

(when contexts are stochastic with known distribution),

\tilde{O}(K^{1/3}T^{2/3})

(when contexts are stochastic with unknown distribution), and

\tilde{O}(\sqrt{KT})

(when contexts are adversarial but rewards are stochastic).Comment: 48 pages, 5 figure

arXiv.org e-Print Archive

DSpace@MIT

Dataretrieving for varied in different Composition Databases using Content aggregation

Author: Mangesh S. Khode, Mayur S. Dhait
Publication venue: Auricle Global Society of Education and Research
Publication date: 30/06/2016
Field of study

Keeping in mind with a variety of content choices, consumers are exhibiting diverse preferences for content; their preferences often depend on the context in which they consume content as well as various exogenous events. To satisfy the consumers� demand for such diverse content, multimedia content aggregators (CAs) haveemerged which gather content from numerous multimedia sources. A key challenge for such systems is to accurately predict whattype of content each of its consumers prefers in a certain context,and adapt these predictions to the evolving consumers preferences, contexts, and content characteristics This paper addressesgenerate text based file data sets, such as word, text files, image file data sets, and video file data sets, It also extract data from multiple databases, evaluate user preference based query, reduce time complexity by clustering data, and increase fetching speed by using query classification

International Journal on Future Revolution in Computer Science & Communication Engineering