6,556 research outputs found

    Timestamp Estimation From Outdoor Scenes

    Get PDF
    The increasing availability of smartphones allowed people to easily capture and share images on the internet. These images are often associated with metadata, including the image capture time (timestamp) and the location where the image was captured (geolocation). The metadata associated with images provides valuable information to better understand scenes and events presented in these images. The timestamp can be manipulated intentionally to provide false information to convey a twisted version of reality. Images with manipulated timestamps are often used as a cover-up for wrongdoing or broadcasting false claims and competing views on the internet. Estimating the time of capture of a photograph is a challenging task that requires a comprehensive understanding of the scene and its geographical location. In this paper, we propose a learning-based approach based on deep learning to estimate when an outdoor image was captured. We provide a detailed quantitative and qualitative evaluation of the trained models for various settings and show that the proposed approach outperforms baseline methods

    Scripted Stereotypes In Reality TV

    Get PDF
    Diversity, or lack thereof, has always been an issue in both television and film for years. But another great issue that ties in with the lack of diversity is misrepresentation, or a substantial presence of stereotypes in media. While stereotypes often are commonplace in scripted television and film, the possibility of stereotypes appearing in a program that claims to be based on reality seems unfitting. It is commonly known that reality television is not completely “unscripted” and is actually molded by producers and editors. While reality television should not consist of stereotypes, they have curiously made their way onto the screen and into our homes. Through content analysis this thesis focuses on Latina/Hispanic-American and Asian-American contestants on ABCs’ The Bachelor and whether they present stereotypes typically found in scripted programming

    Blind Dates: Examining the Expression of Temporality in Historical Photographs

    Full text link
    This paper explores the capacity of computer vision models to discern temporal information in visual content, focusing specifically on historical photographs. We investigate the dating of images using OpenCLIP, an open-source implementation of CLIP, a multi-modal language and vision model. Our experiment consists of three steps: zero-shot classification, fine-tuning, and analysis of visual content. We use the \textit{De Boer Scene Detection} dataset, containing 39,866 gray-scale historical press photographs from 1950 to 1999. The results show that zero-shot classification is relatively ineffective for image dating, with a bias towards predicting dates in the past. Fine-tuning OpenCLIP with a logistic classifier improves performance and eliminates the bias. Additionally, our analysis reveals that images featuring buses, cars, cats, dogs, and people are more accurately dated, suggesting the presence of temporal markers. The study highlights the potential of machine learning models like OpenCLIP in dating images and emphasizes the importance of fine-tuning for accurate temporal analysis. Future research should explore the application of these findings to color photographs and diverse datasets

    Trust Needs Touch: Understanding the Building of Trust through Social Presence

    Get PDF
    Trust is gaining in importance in today’s digital world where interactions become more and more impersonal. In this context, many studies show that social presence, i.e. the feeling of human contact, has a positive effect on the formation of trust. However, the theoretical explanation of the relationship is still somewhat unexplored in the IS domain. In this study, we draw on psychology literature and derive a comprehensive framework to conceptualize and explain the relationship in detail. Particularly, we identify four mechanisms that were not yet explicated by IS research. Using the developed framework as a structuring device, we then carry out a structured literature review in the IS domain to identify existing studies and their theoretical focus as well as to point out research gaps. We are able to show that there is much more to the relationship between social presence and trust than the IS domain has yet recognized

    Learning to Map the Visual and Auditory World

    Get PDF
    The appearance of the world varies dramatically not only from place to place but also from hour to hour and month to month. Billions of images that capture this complex relationship are uploaded to social-media websites every day and often are associated with precise time and location metadata. This rich source of data can be beneficial to improve our understanding of the globe. In this work, we propose a general framework that uses these publicly available images for constructing dense maps of different ground-level attributes from overhead imagery. In particular, we use well-defined probabilistic models and a weakly-supervised, multi-task training strategy to provide an estimate of the expected visual and auditory ground-level attributes consisting of the type of scenes, objects, and sounds a person can experience at a location. Through a large-scale evaluation on real data, we show that our learned models can be used for applications including mapping, image localization, image retrieval, and metadata verification

    Deep Probabilistic Models for Camera Geo-Calibration

    Get PDF
    The ultimate goal of image understanding is to transfer visual images into numerical or symbolic descriptions of the scene that are helpful for decision making. Knowing when, where, and in which direction a picture was taken, the task of geo-calibration makes it possible to use imagery to understand the world and how it changes in time. Current models for geo-calibration are mostly deterministic, which in many cases fails to model the inherent uncertainties when the image content is ambiguous. Furthermore, without a proper modeling of the uncertainty, subsequent processing can yield overly confident predictions. To address these limitations, we propose a probabilistic model for camera geo-calibration using deep neural networks. While our primary contribution is geo-calibration, we also show that learning to geo-calibrate a camera allows us to implicitly learn to understand the content of the scene
    • …
    corecore