Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy

Accepted to Interspeech-2025

This repository accompanies the paper “Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy” by Elvir Karimov, Alexander Varlamov, Danil Ivanov, Dmitrii Korzh, and Oleg Y. Rogov. The paper presents a new method for generating Universal Adversarial Patches (UAPs) in the audio domain to protect speaker identity, introducing a novel Exponential Total Variance loss function and a length‐independent UAP insertion procedure.

Abstract

Deep learning voice models are commonly used nowadays, but the safety of processing of personal data, such as human identity and speech content, remains suspicious. To prevent malicious user identification, speaker obfuscation methods were proposed. Current methods, particularly based on universal adversarial patch (UAP) applications, have drawbacks such as significant degradation of audio quality, decreased speech recognition quality, low transferability across different voice biometrics models, and performance dependence on the input audio length. To mitigate these drawbacks, in this work, we introduce and leverage the novel Exponential Total Variance (TV) loss function and provide experimental evidence that it positively affects UAP strength and imperceptibility. Moreover, we present a novel scalable UAP insertion procedure and demonstrate its uniformly high performance for various audio lengths.

Contributions

Incorporation of the Novel Loss Function
We propose a novel Exponential Total Variance (TV) loss function inspired by TV loss from the image domain, designed to preserve the imperceptibility of UAPs.
Length-Independent UAP
We introduce a length-independent UAP generation approach by training on long audio samples with a repeat padding strategy, making it effective for real-world applications.
To the best of our knowledge, this strategy, although being well-known, has not been used in prior UAP training.
Length-Agnostic Evaluation Procedure
We establish a rigorous evaluation protocol that accounts for dataset biases, including variations in loudness levels. Furthermore, a proper padding strategy based on audio repetition is implemented to prevent the UAP from exploiting artificially silent segments, ensuring robustness across different audio lengths.

Citation

@article{karimov2025novel,
  title={Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy},
  author={Karimov, Elvir and Varlamov, Alexander and Ivanov, Danil and Korzh, Dmitrii and Rogov, Oleg Y},
  journal={arXiv preprint arXiv:2505.19951},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
scripts		scripts
src		src
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy

Accepted to Interspeech-2025

Abstract

Contributions

Citation

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

AIRI-Institute/voice-uap

Folders and files

Latest commit

History

Repository files navigation

Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy

Accepted to Interspeech-2025

Abstract

Contributions

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages