earticle

논문검색

Broadcasting

On-Device AI-Based Real-Time Dynamic Subtitle Placement for IPTV Contents

초록

영어

In this paper, we propose a novel on-device AI design strategy for dynamically adjusting subtitle positioning in IPTV content that must be stored within a secure environment to prevent unauthorized distribution. Unlike previous studies that rely on external servers or unsecured environments for video analysis, our approach embeds the AI model directly into the secure zone of the chipset, ensuring privacy and real-time performance. We uniquely utilize on-device hardware resources to enable AI-based frame-by-frame analysis without external transmission. To achieve real-time efficiency, we implement a lightweight and device-optimized AI model by defining a Region of Interest (ROI) for input videos, applying model pruning, and utilizing 8-bit quantization techniques. Additionally, we enhance text recognition performance through data augmentation during training, addressing common challenges such as subtitle overlapping with on-screen graphics or embedded text. We demonstrate that our on-device strategy outperforms conventional models, improving recognition accuracy to 99.7% and processing speed to 60 fps. Through this work, we contribute a practical solution that ensures enhanced subtitle visibility for IPTV viewers using set-top boxes while maintaining content security in a non-trainable trusted execution environment.

목차

Abstract
1. Introduction
2. Related Works and Contributions
2.1 Related Works
2.2 Contributions
3. The Proposed System
3.1 Trust Zone and AI Model Deployment within the STB
3.2 Dynamic Subtitle Replacement Model System Architecture and Network Design
3.3 Lightweight Model for Dynamic Subtitle Replacement
3.4 Data Augmentation and Fine-Tuning for Model Performance Enhancement
3.5 OTA-Based On-Device AI Model Delivery and Maintenance
4. The Experimental Results
4.1 Development Environment Using the Reference Board for Commercial Application
4.2 Model Architecture Optimization and On-device Porting
4.3 Data Augmentation and Fine-Tuning
5. Conclusions
References

저자정보

  • Kyuho Lee Chief Research Engineer, Agent Engineering 2 Team, Technology Development Group, LGUplus
  • MyeungHoon Kim Senior Research Engineer, Agent Engineering 2 Team, Technology Development Group, LGUplus
  • Hansang Lee Chief Research Engineer, Agent Engineering 2 Team, Technology Development Group, LGUplus
  • Daewon Song Executive Director, Group Leader, Technology Development Group, LGUplus

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.