TY - GEN
T1 - Integrating RF-Visual Technologies for Improved Speech Recognition in Hearing Aids
AU - Chen, Zikang
AU - Tang, Chong
AU - Ge, Yao
AU - Imran, Muhammad
AU - Abbasi, Qammer H.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Traditional hearing aids solutions are based on audio and visual information, which has limited effectiveness in challenging scenarios, such as noisy environments and obstacles. Additionally, their level of comfort and privacy protection is often unsatisfactory. In this case, there has been increasing interest in recent years, aiming to develop a contactless and privacy-preserving alternative using radio frequency (RF) signals. However, the RF-based approach requires a large amount of training data to support its reliability and accuracy, which is undoubtedly very time-consuming and labour-intensive. To address these limitations and benefit future hearing aids with RF sensing, the fusion of multiple modalities provides us with a solution. In this paper, we propose an RF - Visual based speech recognition system based on the fusion of visual and RF information, which is based on a multi-input convolutional neural network (CNN) and can achieve up to 87.55% recognition accuracy. We have comprehensively compared and evaluated the system performance with single and multiple modalities, and can conclude the proposed RF-Visual-based SR system has great potential for advancing hearing aid technology.
AB - Traditional hearing aids solutions are based on audio and visual information, which has limited effectiveness in challenging scenarios, such as noisy environments and obstacles. Additionally, their level of comfort and privacy protection is often unsatisfactory. In this case, there has been increasing interest in recent years, aiming to develop a contactless and privacy-preserving alternative using radio frequency (RF) signals. However, the RF-based approach requires a large amount of training data to support its reliability and accuracy, which is undoubtedly very time-consuming and labour-intensive. To address these limitations and benefit future hearing aids with RF sensing, the fusion of multiple modalities provides us with a solution. In this paper, we propose an RF - Visual based speech recognition system based on the fusion of visual and RF information, which is based on a multi-input convolutional neural network (CNN) and can achieve up to 87.55% recognition accuracy. We have comprehensively compared and evaluated the system performance with single and multiple modalities, and can conclude the proposed RF-Visual-based SR system has great potential for advancing hearing aid technology.
KW - RF-based speech recognition
KW - Radio frequency sensing
KW - speech recognition
UR - https://www.scopus.com/pages/publications/85179128880
U2 - 10.1109/IMBioC56839.2023.10305132
DO - 10.1109/IMBioC56839.2023.10305132
M3 - Conference contribution
AN - SCOPUS:85179128880
T3 - 2023 IEEE MTT-S International Microwave Biomedical Conference, IMBioC 2023
SP - 97
EP - 99
BT - 2023 IEEE MTT-S International Microwave Biomedical Conference, IMBioC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE MTT-S International Microwave Biomedical Conference, IMBioC 2023
Y2 - 11 September 2023 through 13 September 2023
ER -