Culture Convergence (CC)

Image Similarity Analysis in Generative AI

Article Information

Abstract

English

In Consciousness Explained, Daniel Dennett argued that consciousness is a phenomenon that emerges from the complex flow of information in the brain and that understanding it requires an objective approach. Although AI increasingly mimics human functions, it is difficult to claim that AI possesses human-like consciousness. Consciousness plays an important role in perception, yet perception does not necessarily require consciousness. This study therefore analyzes how closely the way AI processes visual information, specifically the DALL-E model developed by OpenAI, resembles the structure of human perception. New images were generated with the GPT-4 DALL-E model from five sets of reference images, and the structural similarity between the generated and reference images was measured using SSIM (Structural Similarity Index Measure). The SSIM scores of the generated images ranged from 0.131 to 0.63, which indicates that the AI learned the visual patterns of the reference images to some degree. However, the AI did not generate images that aligned perfectly with human perception, and images containing complex shapes or fine textures recorded lower SSIM scores. Notably, the AI showed limitations in depicting human portraits, suggesting that its perception system is simplified compared with the complexity of human perceptual structures. The study shows that, although the DALL-E model has potential for processing visual information, a clear difference from the complex human perception system remains. These results suggest that AI is still limited in mimicking the way humans process visual information and point to a need for further in-depth research into the independent characteristics of AI perception.
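The SSIM comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not say which SSIM implementation or settings the authors used, so everything below is an assumption: the function name `global_ssim`, the sample pixel values, and the simplification of computing the index once over the whole image. Standard SSIM instead averages the index over local sliding windows, as `skimage.metrics.structural_similarity` in scikit-image does. The constants K1 = 0.01 and K2 = 0.03 are the commonly used defaults.

```python
from statistics import fmean


def global_ssim(x, y, dynamic_range=255.0, k1=0.01, k2=0.03):
    """Single-window SSIM for two equal-length sequences of grayscale pixels.

    Simplified illustration only: the index is computed once from
    whole-image statistics rather than averaged over local windows.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have the same length (at least 2 pixels)")

    # Stabilising constants that keep the ratio defined for flat images.
    c1 = (k1 * dynamic_range) ** 2
    c2 = (k2 * dynamic_range) ** 2

    n = len(x)
    mu_x, mu_y = fmean(x), fmean(y)

    # Unbiased sample variances and covariance of the pixel intensities.
    var_x = sum((a - mu_x) ** 2 for a in x) / (n - 1)
    var_y = sum((b - mu_y) ** 2 for b in y) / (n - 1)
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / (n - 1)

    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Hypothetical flattened grayscale images (not data from the study).
reference = [0, 50, 100, 150, 200, 250]
generated = [10, 40, 120, 140, 190, 255]
inverted = list(reversed(reference))

print("reference vs. itself:   ", global_ssim(reference, reference))
print("reference vs. generated:", global_ssim(reference, generated))
print("reference vs. inverted: ", global_ssim(reference, inverted))
```

Identical inputs give a score of 1, and the score falls as luminance, contrast, or structure diverge. Scores in the range the abstract reports (0.131 to 0.63) therefore correspond to partial structural agreement between generated and reference images.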

Table of Contents

Abstract
1. INTRODUCTION
2. THEORETICAL CONSIDERATIONS
2.1 Perception, Consciousness and AI
2.2 Generative AI
2.3 SSIM (Structural Similarity Index Measure)
3. RESEARCH METHOD
4. DATA ANALYSIS RESULTS
4.1 GPT-4, DALL-E Generated Images
4.2 SSIM Evaluation
5. RESULT
REFERENCES

Author Information

  • Choi Haerin, Master's Student, Department of Design, Pusan National University, South Korea
  • Lee Hyunseok, Professor, Department of Design, Pusan National University, South Korea
