TY - JOUR
T1 - S-Faster R-CNN
T2 - Intra-Spectral Similarity Learning for Audio Copy-Move Forgery Localization in IoT Security
AU - Qiu, Xin
AU - Shi, Canghong
AU - Li, Xiaojie
AU - Wu, Min
AU - Abdullahi, Sani M.
AU - Liu, Yong
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2025
Y1 - 2025
N2 - In the Internet of Audio Things, communication security of the audio control terminal is vulnerable to copy-move threats, and detecting and locating audio copy-move forgery remains challenging nowadays. The forgery detection method based on deep learning achieves higher detection accuracy but fails to localize forged regions. To address this issue, this article proposes an S-Faster R-CNN model for audio copy-move forgery detection and localization. We integrate a novel Similarity Computation Module (SCM) into the Faster R-CNN framework, forming the S-Faster R-CNN model. Obtaining the integration of the SCM, which allows the S-Faster R-CNN to precisely localize forgery regions within the spectrogram. Finally, the image coordinate transformation algorithm is used to map these forged regions to the corresponding locations of the original audio waveform, thus completing the audio copy-move forgery detection and localization. Evaluated on three datasets, our method achieves an average recall of 90%, an average precision of 84%, and an average F1-score of 87%, respectively. Experimental results indicate that the S-Faster R-CNN outperforms state-of-the-art methods in both forgery detection accuracy and especially in localization. Moreover, the proposed method shows good robustness under multiple post-processing.
AB - In the Internet of Audio Things, communication security of the audio control terminal is vulnerable to copy-move threats, and detecting and locating audio copy-move forgery remains challenging nowadays. The forgery detection method based on deep learning achieves higher detection accuracy but fails to localize forged regions. To address this issue, this article proposes an S-Faster R-CNN model for audio copy-move forgery detection and localization. We integrate a novel Similarity Computation Module (SCM) into the Faster R-CNN framework, forming the S-Faster R-CNN model. Obtaining the integration of the SCM, which allows the S-Faster R-CNN to precisely localize forgery regions within the spectrogram. Finally, the image coordinate transformation algorithm is used to map these forged regions to the corresponding locations of the original audio waveform, thus completing the audio copy-move forgery detection and localization. Evaluated on three datasets, our method achieves an average recall of 90%, an average precision of 84%, and an average F1-score of 87%, respectively. Experimental results indicate that the S-Faster R-CNN outperforms state-of-the-art methods in both forgery detection accuracy and especially in localization. Moreover, the proposed method shows good robustness under multiple post-processing.
KW - Audio copy-move forgery detection
KW - Audio forensic
KW - Audio forgery
KW - Faster R-CNN
KW - Forgery localization
KW - Hilbert-Huang Spectrogram
U2 - 10.1109/JIOT.2025.3569678
DO - 10.1109/JIOT.2025.3569678
M3 - Journal article
AN - SCOPUS:105005181220
SN - 2327-4662
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
ER -