1,132 research outputs found
OperARtistry: An AR-based Interactive Application to Assist the Learning of Chinese Traditional Opera (Xiqu) Makeup
Chinese Traditional Opera (Xiqu) is an important type of intangible cultural
heritage and one key characteristic of Xiqu is its visual effects on face
achieved via makeup. However, Xiqu makeup process, especially the eye-area
makeup process, is complex and time-consuming, which poses a learning challenge
for potential younger inheritors. We introduce OperARtistry, an interactive
application based on Augmented Reality (AR) that offers in-situ Xiqu makeup
guidance for beginners. Our application provides a step-by-step guide for Xiqu
eye-area makeup, incorporating AR effects at each stage. Furthermore, we
conducted an initial user study (n=6) to compare our approach with existing
video-based tutorials to assess the effectiveness and usefulness of our
approach. Our findings show that OperARtisty helped participants achieve
high-quality eye-area makeup effects with less learning time.Comment: 11 pages, 9 figures, In Proceedings of The Eleventh International
Symposium of Chinese CHI (Chinese CHI 2023
LookBook: pioneering Inclusive beauty with artificial intelligence and machine learning algorithms
Technology's imperfections and biases inherited from historical norms are crucial to acknowledge. Rapid perpetuation and amplification of these biases necessitate transparency and proactive measures to mitigate their impact. The online visual culture reinforces Eurocentric beauty ideals through prioritized algorithms and augmented reality filters, distorting reality and perpetuating unrealistic standards of beauty.
Narrow beauty standards in technology pose a significant challenge to overcome. Algorithms personalize content, creating "filter bubbles" that reinforce these ideals and limit exposure to diverse representations of beauty. This cycle compels individuals to conform, hindering the embrace of their unique features and alternative definitions of beauty.
LookBook counters prevalent narrow beauty standards in technology. It promotes inclusivity and representation through self-expression, community engagement, and diverse visibility. LookBook comprises three core sections: Dash, Books, and Community. In Dash, users curate their experience through personalization algorithms. Books allow users to collect curated content for inspiration and creativity, while Community fosters connections with like-minded individuals.
Through LookBook, users create a reality aligned with their unique vision. They control consumed content, nurturing individualism through preferences and creativity. This personalization empowers individuals to break free from narrow beauty standards and embrace their distinctiveness.
LookBook stands out with its algorithmic training and data representation. It offers transparency on how personalization algorithms operate and ensures a balanced and diverse representation of physicalities and ethnicities. By addressing biases and embracing a wide range of identities, LookBook sparks a conversation for a technology landscape that amplifies all voices, fostering an environment celebrating diversity and prioritizing inclusivity
Emojis and the Neoliberal Coding of Diversity
This article focuses on the representation of ethnic diversity in multicultural emojis. Multicultural emojis are interpreted in this study as a neoliberal representation of diversity that has reincorporated white supremacist ideology traits, namely color classification, and the Vitruvian Man body design. Thus, I argue that multicultural emojis primarily reflect a typical Western worldview which supports a Eurocentric monoculturalism. Multicultural emojis can, therefore, be interpreted as serving as a set of body depictions whose façade shows diversity while keeping the privilege of the Caucasian body at its core. In the context of this article, code refers to both formulation in the form of symbols and signs, and the signs and signals of communication. The neoliberal coding of the human body, then, highlights how the human body is translated into neoliberal signals or symbols. Neoliberalism values a global market and embraces diversity within this rationale. I argue that instead of trying to eliminate racism by valuing diverse identities equally, neoliberalism lays the ground for the assimilation of diversity into the Western model of subjectivity, which, at its best, offers partial and biased perspectives. To discuss my point, I investigate two visual codes of multicultural emojis: color categorization and the Vitruvian Man body template. I propose that different ethnicities are displayed in emojis through a Jim Crow-type segregative mindset, which defines identity as ‘color.’ At the heart of this thinking, one can find the association of ‘whiteness’ with pureness, and ‘blackness’ with evilness. Second, the body template in multicultural emojis is limited to a Western body-drawing tradition rooted in the sketches of the Vitruvian Man; an illustration that has traditionally represented the Caucasian body model against whose proportions the body of others should be measured and considered normal/abnormal
The Science of Disguise
Technological advances have made digital cameras ubiquitous, to the point where it is difficult to purchase even a mobile phone without one. Coupled with similar advances in face recognition technology, we are seeing a marked increase in the use of biometrics, such as face recognition, to identify individuals. However, remaining unrecognized in an era of ubiquitous camera surveillance remains desirable to some citizens, notably those concerned with privacy. Since biometrics are an intrinsic part of a person\u27s identity, it may be that the only means of evading detection is through disguise.
We have created a comprehensive database of high-quality imagery that will allow us to explore the effectiveness of disguise as an approach to avoiding unwanted recognition. Using this database, we have evaluated the performance of a variety of automated machine-based face recognition algorithms on disguised faces. Our data-driven analysis finds that for the sample population contained in our database: (1) disguise is effective; (2) there are significant performance differences between individuals and demographic groups; and (3) elements including coverage, contrast, and disguise combination are determinative factors in the success or failure of face recognition algorithms on an image.
In this dissertation, we examine the present-day uses of face recognition and their interplay with privacy concerns. We sketch the capabilities of a new database of facial imagery, unique both in the diversity of the imaged population, and in the diversity and consistency of disguises applied to each subject. We provide an analysis of disguise performance based on both a highly-rated commercial face recognition system and an open-source algorithm available to the FR community. Finally, we put forth hypothetical models for these results, and provide insights into the types of disguises that are the most effective at defeating facial recognition for various demographic populations. As cameras become more sophisticated and algorithms become more advanced, disguise may become less effective. For security professionals, this is a laudable outcome; privacy advocates will certainly feel differently
DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design
We introduce DEsignBench, a text-to-image (T2I) generation benchmark tailored
for visual design scenarios. Recent T2I models like DALL-E 3 and others, have
demonstrated remarkable capabilities in generating photorealistic images that
align closely with textual inputs. While the allure of creating visually
captivating images is undeniable, our emphasis extends beyond mere aesthetic
pleasure. We aim to investigate the potential of using these powerful models in
authentic design contexts. In pursuit of this goal, we develop DEsignBench,
which incorporates test samples designed to assess T2I models on both "design
technical capability" and "design application scenario." Each of these two
dimensions is supported by a diverse set of specific design categories. We
explore DALL-E 3 together with other leading T2I models on DEsignBench,
resulting in a comprehensive visual gallery for side-by-side comparisons. For
DEsignBench benchmarking, we perform human evaluations on generated images in
DEsignBench gallery, against the criteria of image-text alignment, visual
aesthetic, and design creativity. Our evaluation also considers other
specialized design capabilities, including text rendering, layout composition,
color harmony, 3D design, and medium style. In addition to human evaluations,
we introduce the first automatic image generation evaluator powered by GPT-4V.
This evaluator provides ratings that align well with human judgments, while
being easily replicable and cost-efficient. A high-resolution version is
available at
https://github.com/design-bench/design-bench.github.io/raw/main/designbench.pdf?download=Comment: Project page at https://design-bench.github.io
The Ticker, February 8, 2016
The Ticker is the student newspaper of Baruch College. It has been published continuously since 1932, when the Baruch College campus was the School of Business and Civic Administration of the City College of New York
- …