웹페이지 분석을 위한 딥러닝 모델 학습과 구현에 관한 연구

김정환; 조재원; 김진산; 이한진

기술 융합(TC)

웹페이지 분석을 위한 딥러닝 모델 학습과 구현에 관한 연구

원문정보

Research on Training and Implementation of Deep Learning Models for Web Page Analysis

김정환, 조재원, 김진산, 이한진

국제문화기술진흥원 The Journal of the Convergence on Culture Technology (JCCT) Vol.10 No.2 2024.03 pp.519-526 KCI 등재

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

This study aims to train and implement a deep learning model for the fusion of website creation and artificial intelligence, in the era known as the AI revolution following the launch of the ChatGPT service. The deep learning model was trained using 3,000 collected web page images, processed based on a system of component and layout classification. This process was divided into three stages. First, prior research on AI models was reviewed to select the most appropriate algorithm for the model we intended to implement. Second, suitable web page and paragraph images were collected, categorized, and processed. Third, the deep learning model was trained, and a serving interface was integrated to verify the actual outcomes of the model. This implemented model will be used to detect multiple paragraphs on a web page, analyzing the number of lines, elements, and features in each paragraph, and deriving meaningful data based on the classification system. This process is expected to evolve, enabling more precise analysis of web pages. Furthermore, it is anticipated that the development of precise analysis techniques will lay the groundwork for research into AI's capability to automatically generate perfect web pages.

한국어

본 연구는 ChatGPT 서비스의 개시 이후 인공지능 혁명이라 일컬어지는 시대적 배경 속에서, 웹사이트의 제작 과 인공지능의 융합을 위해 딥러닝 모델을 학습 및 구현하고자 한다. 딥러닝 모델은 수집한 3,000개의 웹페이지 이미 지를 구성요소와 레이아웃 분류체계 기반의 데이터 가공을 통해 학습하였으며, 다음과 같은 세 가지 단계로 구분하여 진행하였다. 첫째, 인공지능 모델에 관한 선행연구를 조사하여 구현하고자 하는 모델에 가장 적합한 알고리즘을 선택 하였다. 둘째, 적합한 웹페이지 및 단락 이미지를 수집하고 분류 및 가공하였다. 셋째, 딥러닝 모델을 학습시키고 서 빙 인터페이스를 연동해 모델의 실제 결과를 확인하였다. 이렇게 구현된 모델은 실제 웹페이지를 구성하는 복수의 단 락을 탐지하고, 단락별 규모, 요소, 특징을 분석하여 분류체계를 기반으로 의미 있는 데이터를 도출할 것이다. 이 과 정은 점차 발전하여 웹페이지를 보다 정밀하게 분석할 수 있게 될 것이다. 그리고 정밀 분석기법을 역으로 설계하여, 인공지능이 완벽한 웹페이지를 자동으로 생성할 수 있는 연구의 초석이 될 것으로 기대한다.

요약
Abstract
Ⅰ. 서론
Ⅱ. 선행연구 검토
1. 심층학습과 신경망(Deep Learning and Neural Networks)
2. 이미지 분류 대표 알고리즘
3. 객체 탐지 대표 알고리즘
Ⅲ. 모델의 선택
1. 객체 탐지 모델 YOLOv8
2. 이미지 분류 모델 EfficientNetV2
Ⅳ. 수행결과
1. 데이터 수집 및 가공
Ⅴ. 결론
References

키워드

저자정보

김정환 Jung Hwan Kim. 정회원, 고려대학교 디지털경영학과 박사(컴퓨터공학 석사, 주식회사 위븐 대표이사)
조재원 Jae Won Cho. 정회원, 명지전문대학교 산업디자인학과 전문학사
김진산 Jin San Kim. 정회원, 서울사이버대 빅데이터 정보보호학과 학사
이한진 Han Jin Lee. 정회원, 한동대학교 창의융합교육원 교수

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle

웹페이지 분석을 위한 딥러닝 모델 학습과 구현에 관한 연구

원문정보

초록

목차

키워드

저자정보

참고문헌

함께 이용한 논문