원문정보
초록
영어
In this paper, we propose a novel on-device AI design strategy for dynamically adjusting subtitle positioning in IPTV content that must be stored within a secure environment to prevent unauthorized distribution. Unlike previous studies that rely on external servers or unsecured environments for video analysis, our approach embeds the AI model directly into the secure zone of the chipset, ensuring privacy and real-time performance. We uniquely utilize on-device hardware resources to enable AI-based frame-by-frame analysis without external transmission. To achieve real-time efficiency, we implement a lightweight and device-optimized AI model by defining a Region of Interest (ROI) for input videos, applying model pruning, and utilizing 8-bit quantization techniques. Additionally, we enhance text recognition performance through data augmentation during training, addressing common challenges such as subtitle overlapping with on-screen graphics or embedded text. We demonstrate that our on-device strategy outperforms conventional models, improving recognition accuracy to 99.7% and processing speed to 60 fps. Through this work, we contribute a practical solution that ensures enhanced subtitle visibility for IPTV viewers using set-top boxes while maintaining content security in a non-trainable trusted execution environment.
목차
1. Introduction
2. Related Works and Contributions
2.1 Related Works
2.2 Contributions
3. The Proposed System
3.1 Trust Zone and AI Model Deployment within the STB
3.2 Dynamic Subtitle Replacement Model System Architecture and Network Design
3.3 Lightweight Model for Dynamic Subtitle Replacement
3.4 Data Augmentation and Fine-Tuning for Model Performance Enhancement
3.5 OTA-Based On-Device AI Model Delivery and Maintenance
4. The Experimental Results
4.1 Development Environment Using the Reference Board for Commercial Application
4.2 Model Architecture Optimization and On-device Porting
4.3 Data Augmentation and Fine-Tuning
5. Conclusions
References
