A Novel Multimodal LLM-Driven RF Sensing Method for Human Activity Recognition

  • Muhammad Zakir Khan
  • Muhammad Bilal
  • Hasan Abbas
  • Muhammad Imran
  • Qammer H. Abbasi
  • University of Glasgow
  • National University of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

Human activity recognition (HAR) using radio frequency (RF) sensing has attracted significant attention due to its unobtrusive and privacy-preserving nature. Traditional HAR methods rely on task-specific deep neural networks trained on large labeled datasets, which can be time-consuming and resource-intensive. To address these challenges, we propose a novel approach that leverages multimodal large language models (MLLMs) for RF-based HAR. Specifically, we fine-tune Florence-2, a pre-trained vision-language model (VLM), on RF spectrogram data from the open-source Xethru Radar dataset. Our approach frames activity detection as a question-answering task, allowing the model to associate radar spectrogram features with specific activity classes through prompt-based interactions. Testing on three distinct activities (sitting, bending, and crawling), our fine-tuned model achieves 98% classification accuracy with minimal misclassifications. This work demonstrates the effectiveness of integrating VLMs with RF sensing data for scalable and adaptive HAR applications, opening new research directions for unified, prompt-based models in complex multimodal sensing tasks.
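
The question-answering framing described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the prompt wording, the `QASample` structure, the file paths, and the answer-parsing rule are all assumptions made for illustration, and no model is loaded or fine-tuned here. It only shows how spectrogram images might be paired with a prompt and a class label, and how free-form generated answers might be scored against the three activity classes.

```python
from dataclasses import dataclass
from typing import Optional, Sequence

# Activity classes named in the abstract.
ACTIVITIES = ("sitting", "bending", "crawling")

# Hypothetical prompt wording; the paper's actual prompt is not given here.
QUESTION = "What activity is shown in this radar spectrogram?"


@dataclass(frozen=True)
class QASample:
    """One (image, question, answer) training example (assumed layout)."""
    image_path: str  # path to a spectrogram image; hypothetical
    question: str
    answer: str


def make_sample(image_path: str, label: str) -> QASample:
    """Pair a spectrogram image with the prompt and its class label."""
    if label not in ACTIVITIES:
        raise ValueError(f"unknown activity: {label!r}")
    return QASample(image_path, QUESTION, label)


def parse_answer(generated: str) -> Optional[str]:
    """Map generated text to a class, or None if zero or several classes match."""
    text = generated.strip().lower()
    matches = [a for a in ACTIVITIES if a in text]
    return matches[0] if len(matches) == 1 else None


def accuracy(generated: Sequence[str], labels: Sequence[str]) -> float:
    """Fraction of generated answers that parse to the correct label."""
    if len(generated) != len(labels) or not labels:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(parse_answer(g) == y for g, y in zip(generated, labels))
    return correct / len(labels)
```

In this sketch an answer that names no known class, or more than one, counts as a misclassification, which is one possible convention among several.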

Original language: English
Title of host publication: 2025 2nd International Conference on Microwave, Antennas and Circuits, ICMAC 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798331518424
State: Published - 2025
Externally published: Yes
Event: 2nd International Conference on Microwave, Antennas and Circuits, ICMAC 2025 - Islamabad, Pakistan
Duration: 17 Apr 2025 to 18 Apr 2025

Publication series

Name: 2025 2nd International Conference on Microwave, Antennas and Circuits, ICMAC 2025

Conference

Conference: 2nd International Conference on Microwave, Antennas and Circuits, ICMAC 2025
Country/Territory: Pakistan
City: Islamabad
Period: 17/04/25 to 18/04/25

Keywords

  • Multimodal Vision-Language Models
  • Radio Frequency Sensing
  • Visual Signal Processing
