2 research outputs found
Finding image features associated with high aesthetic value by machine learning
A major goal of evolutionary art is to get images of high aesthetic value. We assume that some features of images are associated with high aesthetic value and want to find them. We have taken two image databases that have been rated by humans, a photographic database and one of abstract images generated by evolutionary art software. We have computed 55 features for each database. We have extracted two categories of rankings, the lowest and the highest. Using feature extraction methods from machine learning we have identified the features most associated with differences. For the photographic images the key features are wavelet and texture features. For the abstract images the features are colour based features
심미적 시지각에 대한 인지적고찰
학위논문 (박사)-- 서울대학교 대학원 : 협동과정 인지과학전공, 2015. 8. 장병탁.Answering for the question what is beauty? has emerged as an issue of psychology, neuroscience and computer science during the last decade, after the long history of exploration in the field of philosophy and aesthetics. Especially, in the field of computer science, computational aesthetics pursues implementing an automated aesthetic judgement system based on the low-level features and machine learning techniques, for tangible application such as content recommendation.
In this paper, as an effort of building a computational model of estimating aesthetic value of photos in content recommendation, a hypothesis that surface curvatures of objects and a place in a scene contribute to the estimation is proposed and implemented as a new visual descriptor named Local Slant Cue (LoSC) which represent catching 2.5D information which traditional local descriptors are hard to catch. Experimental results show its comparable performance just with the 30 percent of computational workload of the previous arts. However, comparative study reveals there exist a kind of glass ceiling regardless of feature selection, due to a weird attribute of the mediocre samples, which occupy an absolute majority of any given sample group, in machine learning framework.
Observation to the score distributions of the mediocre group leads to the discovery of significantly high variance in consensus level among human raters for the stimuli. For quantitative validation of the observation, skewness-kurtosis map is adopted as a tool of consensus analysis and applied to a massive photo aesthetics dataset consisting of 225,000 samples, followed by the result of showing validated universality of the observation as one of four patterns, which are incompatible with Gaussianity that has been expected so far.
Several computational models of visual aesthetic perception are proposed and tested from the view of how well they explain the observed patterns, finding the comparative advantage of dynamic systems model. As an effort of elaborating the idea of dynamic systems for the aesthetic perception, a new computational model named as DDM4AP (Drift-Diffusion Model for Aesthetic Perception) is proposed regarding visual aesthetic perception as a result of dynamic interaction between like factors and dislike factors.
While it is concentrating to explain the wide variance in consensus level, the proposed model predicts a significantly longer latency when appreciating photos the mediocre group rather than the good or the bad, regardless of consensus level. Human subject experiments validate the prediction, supporting the model as reflecting important attributes of visual aesthetic perception in human mind.
In conclusion, this study declares computational aesthetics requires new approaches of machine learning and computer vision considering dynamic interaction between two contrastive factors and selecting training data and features in accordance with such mixed data.CHAPTER 1. Introduction
1.1. Background
1.1.1 In Philosophy
1.1.2 In Psychology
1.1.3 In Neuroscience
1.2. Related Works: Computational Aesthetics
CHAPTER 2. Finding Features
2.1. Background
2.2. Local Slant Cue (LoSC)
2.2.1 Representation
2.2.2 Region Description
2.3. Experiments
2.4. Discussion
CHAPTER 3. Data Revisited
3.1. What Makes Glass Ceiling
3.2. Consensus Analysis
3.2.1 Data Set
3.2.2 Method
3.3. Analysis Results: 4 Patterns
3.3.1 Pattern 1: A Wide Kurtosis Range
3.3.2 Pattern 2: Consensus Asymmetry
3.3.3 Pattern 3: The 4/3 Power Law Regime
3.3.4 Pattern 4: Tag Effect
3.4. Discussion
CHAPTER 4. Modeling
4.1. Background
4.2. Static Models
4.3. Dynamic Models (DDM4AP)
4.4. Discussion
CHAPTER 5. Validation
5.1. Background: Prediction from DDM4AP
5.2. Method
5.3. Experimental Results
5.4. Discussion
Conclusion
References
Appendix 1. Free vs. Non-Free Study
Appendix 2. Summary of Skewness and Kurtosis
국문초록Docto