Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive Embedding

This paper proposes a novel dual-domain audio watermarking approach based on flexible segmentation and adaptive embedding aimed to improve robustness and imperceptibility. Compared with conventional watermarking strategies, the proposed approach has two advantages. First, a novel audio beat detectio...

Full description

Bibliographic Details
Main Authors: Yifan Luo, Dezhong Peng, Yongsheng Sang, Yong Xiang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8606057/
id doaj-d007fea48d224ebe9fad433565a18060
record_format Article
spelling doaj-d007fea48d224ebe9fad433565a180602021-03-29T22:47:46ZengIEEEIEEE Access2169-35362019-01-017105331054510.1109/ACCESS.2019.28909728606057Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive EmbeddingYifan Luo0https://orcid.org/0000-0001-8136-9333Dezhong Peng1Yongsheng Sang2https://orcid.org/0000-0002-6266-2638Yong Xiang3https://orcid.org/0000-0003-3545-7863Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, ChinaMachine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, ChinaMachine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, ChinaSchool of Information Technology, Deakin University, Burwood, VIC, AustraliaThis paper proposes a novel dual-domain audio watermarking approach based on flexible segmentation and adaptive embedding aimed to improve robustness and imperceptibility. Compared with conventional watermarking strategies, the proposed approach has two advantages. First, a novel audio beat detection approach is designed to flexibly segment the audio, which provides stronger robustness to synchronization attacks. The audio is decomposed by the discrete wavelet packet transform. Then, the covariance relationships of the decomposition coefficients at different time instants are calculated to determine the locations of the beats and to establish a flexible segmentation model. Second, a dual-domain embedding approach is proposed to realize better robustness to compression attacks while maintaining imperceptibility. In each segment, the psychoacoustic model is used to calculate the audio masking threshold, which divides the signals into the masking signal domain and masked signal domain. The signals in the masking signal domain are robust to compression attacks, and the signals in the masked signal domain have better imperceptibility. To combine these advantages, we embed the watermark into the two domains simultaneously by using the distortion-compensated dither modulation quantization approach. To reduce the impact of the watermark on the original audio, the frequency band with the lowest mask-to-noise ratio is selected as the embedding position for each domain. Moreover, the adaptive quantization steps are calculated to control the embedding strength according to the masking effect. The adaptive embedding will improve the robustness to compression attacks without significantly affecting the original audio quality. The effectiveness of our approach is verified through simulation experiments.https://ieeexplore.ieee.org/document/8606057/Audio beatsdual-domainDWPTDC-DMpsychoacoustic model
collection DOAJ
language English
format Article
sources DOAJ
author Yifan Luo
Dezhong Peng
Yongsheng Sang
Yong Xiang
spellingShingle Yifan Luo
Dezhong Peng
Yongsheng Sang
Yong Xiang
Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive Embedding
IEEE Access
Audio beats
dual-domain
DWPT
DC-DM
psychoacoustic model
author_facet Yifan Luo
Dezhong Peng
Yongsheng Sang
Yong Xiang
author_sort Yifan Luo
title Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive Embedding
title_short Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive Embedding
title_full Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive Embedding
title_fullStr Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive Embedding
title_full_unstemmed Dual-Domain Audio Watermarking Algorithm Based on Flexible Segmentation and Adaptive Embedding
title_sort dual-domain audio watermarking algorithm based on flexible segmentation and adaptive embedding
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description This paper proposes a novel dual-domain audio watermarking approach based on flexible segmentation and adaptive embedding aimed to improve robustness and imperceptibility. Compared with conventional watermarking strategies, the proposed approach has two advantages. First, a novel audio beat detection approach is designed to flexibly segment the audio, which provides stronger robustness to synchronization attacks. The audio is decomposed by the discrete wavelet packet transform. Then, the covariance relationships of the decomposition coefficients at different time instants are calculated to determine the locations of the beats and to establish a flexible segmentation model. Second, a dual-domain embedding approach is proposed to realize better robustness to compression attacks while maintaining imperceptibility. In each segment, the psychoacoustic model is used to calculate the audio masking threshold, which divides the signals into the masking signal domain and masked signal domain. The signals in the masking signal domain are robust to compression attacks, and the signals in the masked signal domain have better imperceptibility. To combine these advantages, we embed the watermark into the two domains simultaneously by using the distortion-compensated dither modulation quantization approach. To reduce the impact of the watermark on the original audio, the frequency band with the lowest mask-to-noise ratio is selected as the embedding position for each domain. Moreover, the adaptive quantization steps are calculated to control the embedding strength according to the masking effect. The adaptive embedding will improve the robustness to compression attacks without significantly affecting the original audio quality. The effectiveness of our approach is verified through simulation experiments.
topic Audio beats
dual-domain
DWPT
DC-DM
psychoacoustic model
url https://ieeexplore.ieee.org/document/8606057/
work_keys_str_mv AT yifanluo dualdomainaudiowatermarkingalgorithmbasedonflexiblesegmentationandadaptiveembedding
AT dezhongpeng dualdomainaudiowatermarkingalgorithmbasedonflexiblesegmentationandadaptiveembedding
AT yongshengsang dualdomainaudiowatermarkingalgorithmbasedonflexiblesegmentationandadaptiveembedding
AT yongxiang dualdomainaudiowatermarkingalgorithmbasedonflexiblesegmentationandadaptiveembedding
_version_ 1724190849347092480