Skip to main navigation Skip to search Skip to main content

TelcoGPT: A Hybrid Embedding Approach for Telecom-Specific Q&A and Code Retrieval

  • Muhammad Zakir Khan
  • , Yao Ge
  • , Ubaid Ullah
  • , Shuja Ansari
  • , Muhamamd Imran
  • , Qammer H. Abbasi
  • University of Glasgow
  • Department of Computer Science

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents TelcoGPT, a specialised question-answering (Q&A) and code retrieval system for telecommunications that combines retrieval-augmented generation (RAG) with domain-specific optimizations. TelcoGPT introduces three key enhancements: (1) a HybridEmbedding method-ology integrating text-embedding models with telecom-specific filtering mechanisms; (2) an advanced document processing pipeline with adaptive chunking and technical term density scoring; and (3) a dual-path query engine optimized for both question-answering and code retrieval tasks. Evaluation on the RedPajama-Data-1T arxiv subset demonstrates that hybrid embedding approach achieves mean reciprocal rank (MRR) of 0.89 and hit rate (HR) of 0.94 with optimal configuration (thresh-old=0.8, chunk size=12K, k=15), outperforming single-embedding approaches by 5-7%. The hybrid RAG implementation increases MRR by 8.5% (0.82 to 0.89) and HR by 10.6% (0.85 to 0.94). TelcoGPT achieves 95% accuracy in domain-specific Q&A tasks versus 87% for base models, while maintaining higher technical term density scores (0.90 vs 0.81). For code retrieval, our system demonstrates 93% execution success rate with comprehensive error handling, surpassing baseline approaches by 6-8%. Comparative analysis with GPT-3.5, GPT-4, and LLAMA-2/3 shows significant improvements in context relevance (0.92 vs 0.84), information accuracy (0.95 vs 0.89), faithfulness (0.84 to 0.92), and relevancy (0.83 to 0.93), demonstrating the effectiveness of our architecture for telecommunications applications.

Original languageEnglish
Title of host publicationIEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331543709
DOIs
StatePublished - 2025
Externally publishedYes
Event2025 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025 - London, United Kingdom
Duration: 19 May 2025 → …

Publication series

NameIEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025

Conference

Conference2025 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025
Country/TerritoryUnited Kingdom
CityLondon
Period19/05/25 → …

Keywords

  • Retrieval-augmented generation
  • code generation
  • domain adaptation
  • hybrid embeddings
  • telecommunications

Fingerprint

Dive into the research topics of 'TelcoGPT: A Hybrid Embedding Approach for Telecom-Specific Q&A and Code Retrieval'. Together they form a unique fingerprint.

Cite this