Research on the Generation of Ancient Book Images Using Stability AI

김정인; 고훈; 임광철

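Korean Institute of Next Generation Computing, The Journal of Korean Institute of Next Generation Computing, Vol. 20, No. 6, December 2024, pp. 92-100 (KCI-listed)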


Abstract

English

This paper proposes a method for generating images of historical texts using Stability AI’s latest model, Stable Diffusion 3 Medium. It compares the image generation capabilities of Stable Diffusion 3 Medium and OpenAI’s DALL-E model when prompted with the text of historical documents. In the experiments, the original text, a Korean translation, and an English translation of a historical document were each used as input to both models, and the resulting images were analyzed. Stable Diffusion 3 Medium reflected the content of the historical texts more accurately, and did so most faithfully with the Korean translation. DALL-E, in contrast, tended to emphasize specific parts of the text in its images. Overall, Stable Diffusion 3 Medium provided a more comprehensive interpretation of the text and reproduced visual details better.

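Korean (translated)

This paper proposes a method for generating images of ancient books using Stable Diffusion 3 Medium, the latest model from Stability AI. Stable Diffusion 3 Medium and OpenAI's DALL-E model were compared and evaluated on image generation based on the text of ancient books. In the experiments, the original text, a Korean translation, and an English translation of an ancient book were each used as text input, and the images generated by the two models were analyzed. Stable Diffusion 3 Medium showed superior performance in generating images that more accurately reflect the content of the ancient book. In particular, when the Korean translation was used, it generated images that faithfully reflected that content. DALL-E, in contrast, generated images that emphasized specific parts of the text. Compared with DALL-E, Stable Diffusion 3 Medium tended to generate images through a more general interpretation, and it reproduced the visual details of the ancient book better.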
