Enhanced Speech Processing for Air Traffic Control Using an Optimized Generative Adversarial Network

Authors

  • Shane Henderson
  • Kimberly Adams
  • Andrew Simmons
  • Patrick Green

DOI:

https://doi.org/10.59782/aai.v1i2.297

Keywords:

speech enhancement, generative adversarial network, convolutional neural network, air traffic control speech, low signal-to-noise ratio

Abstract

In order to improve the quality of voice calls during air traffic control, an improved SEGAN air traffic control speech enhancement algorithm is proposed. Aiming at the problem that the traditional speech enhancement algorithm based on generative adversarial network (SEGAN) is drowned under low signal-to-noise ratio conditions, a multi-stage, multi-mapping, multi-dimensional output generator and a multi-scale, multi-discriminator network model are proposed based on the SEGAN network model. First, the speech semantic features are extracted based on the deep neural network structure to complete the semantic segmentation of air traffic control speech. Secondly, multiple generators are set to further optimize the speech signal. Then, a downsampling module is added to the convolution layer to improve the model's utilization of speech information and reduce the loss of speech information. Finally, multi-scale, multiple discriminators are used to learn the distribution law and information of speech samples in multiple directions. The results show that under low signal-to-noise ratio conditions, the improved SEGAN model improves the short-term objective intelligibility (STOI) and the perceptual evaluation of speech quality (PESQ) by and respectively , which can quickly and effectively perform air traffic control speech enhancement and provide preparation for subsequent air traffic control speech recognition.

How to Cite

Henderson, S., Adams, K., Simmons, A., & Green, P. (2024). Enhanced Speech Processing for Air Traffic Control Using an Optimized Generative Adversarial Network. Journal of Applied Artificial Intelligence, 1(2), 144–156. https://doi.org/10.59782/aai.v1i2.297

Issue

Section

Articles