Topics
- 2024 BEITC Proceedings
- Application of 5G in Broadcasting
- Application of Large Language Model (LLM) in Media
- Applications of ATSC 3.0 Technology
- BPS as the Complementary PNT Solution
- Broadcast Facility Design
- Content Creation and Delivery Technology
- Cybersecurity for Broadcasters
- Data Delivery
- Digital Online Operations
- Emerging Technologies in Media Delivery
- Generative AI for Media
- Generative AI Uses and Video Transcoding
- Quantifying Quality in Video Technology
- Radio Topics
- Society of Cable Telecommunications Engineers
- Striving for Efficiency in Video Technology
- The NMCS Concept
- Timing Solutions for Broadcasters
- Video Encoding and Codecs
- Video Technology - Miscellaneous Topics
- 2023 BEITC Proceedings
- 2022 BEITC Proceedings
- 2021 BEITC Proceedings
- 2020 BEITC Proceedings
- Uncategorized
Generative AI Uses and Video Transcoding
Speech Intelligibility and Audio Monitoring in OTT - $15
Date: April 3, 2024Topics: 2024 BEITC Proceedings, Generative AI Uses and Video TranscodingThe paper details ways of controlling speech intelligibility in OTT and broadcast production, primarily based on a new metric, Loudness to Dialog Ratio (LDR), and on calibrated sound monitoring at a moderate listening level. Principles of Loudness normalization are described; as is a study to define and quantify LDR for use in broadcast and OTT. Findings may be applied during mixing or dubbing, used for default settings in NGA delivery, and in automated QC as a key machine learning parameter. The article is facts-based and free of commercial bias.
Thomas Lund | Genelec Inc | Natick, Mass., United States
The Power of Generative AI for Personalizing Video Content - $15
Date: April 3, 2024Topics: 2024 BEITC Proceedings, Generative AI Uses and Video TranscodingIn today’s rapidly evolving landscape of content creation, a brave new world is emerging; video service providers can utilize AI-assisted pipelines to streamline the content creation process. As technology advances, large language model (LLM)-powered tools are ushering in a new era for content personalization and targeting. This paper presents an architecture blueprint to boost the effectiveness of targeted ads by developing tailored messages for specific consumer segments focusing on their core values and beliefs. Central to this are three pillars: automated audience segmentation and identifying deep seated core values, GenAI-aided message crafting aligned with segment values and beliefs, and a closed-loop system for continuous improvement through automated A/B testing. This paper calls for a holistic approach to video personalization that considers the enforcement of ethical — particularly privacy-related — standards and appropriate advertisers’ guardrails, as well as the prevention of biases. By leveraging data-driven insights and continuous feedback, we aim to refine marketing strategies, ensuring messages effectively resonate with targeted groups.
Ofer Weintraub | Viaccess-Orca | Raanana, Israel
Alice Wittenberg | Viaccess-Orca | Raanana, Israel
Alain Nochimowski | Viaccess-Orca | Raanana, Israel
Unravelling the Power of Single-Pass Look-Ahead in Modern Codecs for optimised transcoding deployment - $15
Date: April 3, 2024Topics: 2024 BEITC Proceedings, Generative AI Uses and Video TranscodingWinner of the 2024 NAB BEIT Conference Proceedings Best Student Paper Award
Modern video encoders have evolved into sophisticated pieces of software in which various coding tools interact with each other. In the past, single-pass encoding was not considered for Video-On-Demand (VOD) use-cases. In this work, we evaluate production-ready encoders for H.264 (x264), H.265 (HEVC), AV1 (SVT-AV1) along with direct comparisons to the latest AV1 encoder inside NVIDIA GPUs (40 series), and AWS Mediaconvert’s AV1 implementation. Our experimental results demonstrate single pass encoding inside modern encoder implementations can give us very good quality at reasonable compute cost. The results are presented as three different scenarios targeting High, Medium, Low complexity accounting quality-bitrate-compute load. Finally, a set of recommendations are presented for end-users to help decide which encoder/preset combination might be more suited to their use case.
Vibhoothi | Sigmedia Group, Department of Electrical Engineering | Trinity College Dublin, Ireland
Julien Zouein | Sigmedia Group, Department of Electrical Engineering | Trinity College Dublin, Ireland
François Pitié | Sigmedia Group, Department of Electrical Engineering | Trinity College Dublin, Ireland
Anil Kokaram | Sigmedia Group, Department of Electrical Engineering | Trinity College Dublin, Ireland