YouPorn is one of the largest providers of adult content on the
web. The video portal is free of charge and lets users not only watch
but also upload, categorize, and comment on pornographic videos.
In this position paper, we point out the challenges of
analyzing the textual data offered with the videos. We report on
first experiments and problems with our YouPorn dataset, which we
extracted from the non-graphical content of the YP website. To gain
some insights, we performed association rule mining on the video
categories and tags, and investigated users' preferences based on
their nicknames. Hoping that future research will be able to build
upon our initial experiences, we make the ready-to-use YP dataset
publicly available.
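
As an illustration of what association rule mining over tag sets involves, the following is a minimal sketch and not the procedure used for the experiments reported here. It assumes each video is represented as a set of tags and derives only single-item rules (A -> B) with their support and confidence. The function name `pairwise_rules`, the thresholds, and the placeholder tags are hypothetical.

```python
from collections import Counter
from itertools import permutations


def pairwise_rules(transactions, min_support=0.5, min_confidence=0.6):
    """Return (antecedent, consequent, support, confidence) for single-item rules."""
    n = len(transactions)
    item_counts = Counter()
    pair_counts = Counter()
    for transaction in transactions:
        items = set(transaction)
        item_counts.update(items)
        # Count every ordered pair (a, b) of distinct items once per transaction.
        pair_counts.update(permutations(sorted(items), 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n                 # share of transactions containing a and b
        confidence = count / item_counts[a]  # share of a-transactions that also contain b
        if support >= min_support and confidence >= min_confidence:
            rules.append((a, b, support, confidence))
    # Strongest rules first; ties broken alphabetically for a stable order.
    return sorted(rules, key=lambda r: (-r[3], -r[2], r[0], r[1]))


# Hypothetical placeholder tag sets, not taken from the dataset.
videos = [
    {"tag_a", "tag_b"},
    {"tag_a", "tag_b", "tag_c"},
    {"tag_a", "tag_c"},
    {"tag_b"},
]
rules = pairwise_rules(videos)
for antecedent, consequent, support, confidence in rules:
    print(f"{antecedent} -> {consequent}: support={support:.2f}, confidence={confidence:.2f}")
```

A full analysis would typically rely on an algorithm such as Apriori or FP-Growth to handle rules with more than one item on either side; the sketch is restricted to item pairs to keep the definitions of support and confidence visible.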