OPTIMIZING DOMAIN-SPECIFIC LARGE LANGUAGE MODELS: A COMPARATIVE ANALYSIS OF RETRIEVAL-AUGMENTED GENERATION (RAG) AND FINE-TUNING METHODOLOGIES

Authors

  • Govind Geet Microsoft Certified AI Engineer, India
  • Agarwal Ankit Research Scholar, Malwanchal University, Indore, India
  • Dr. Rajesh D. Associate Professor, CIET-NCERT, India

DOI:

https://doi.org/10.29121/shodhkosh.v7.i7s.2026.7928

Keywords:

LLMS , Rag, Fine-Tuning, Raft, Enterprise AI, Knowledge Limits, Real-Time Data, Domain Specialization

Abstract [English]

Large Language Models (LLMs) demonstrate substantial general-world knowledge derived from large-scale pretraining corpora. However, their utility in enterprise environments is constrained by static training data, temporal knowledge cut-offs, and limited access to proprietary or real-time information. Two principal methodologies have emerged to address these constraints:


Retrieval-Augmented Generation (RAG) and Fine-Tuning. This paper provides a technical examination of both paradigms, analysing their architectures, operational trade-offs, cost profiles, and failure modes. It concludes by advocating for a hybrid framework—Retrieval-Augmented Fine-Tuning (RAFT)—as a robust strategy for domain-specialized enterprise deployments.

References

Devlin, J., et al. (2018). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.

Gao, L., et al. (2023). Retrieval-Augmented Fine-Tuning (RAFT) Frameworks in LLMs.

Hu, E. J., et al. (2021). LoRA: Low-Rank Adaptation of Large Language Models.

Izacard, G., & Grave, E. (2021). Leveraging Passage Retrieval with Generative Models.

Karpukhin, V., et al. (2020). Dense Passage Retrieval for Open-Domain Question Answering.

Lewis, P., et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.

Meta AI (2023). LLaMA: Open and Efficient Foundation Language Models.

OpenAI (2023–2025). Technical reports on GPT models.

Ouyang, L., et al. (2022). Training language models to follow instructions with human feedback.

Vaswani, A., et al. (2017). Attention Is All You Need.

Downloads

Published

2026-05-05

How to Cite

Geet, G., Ankit, A., & D, R. (2026). OPTIMIZING DOMAIN-SPECIFIC LARGE LANGUAGE MODELS: A COMPARATIVE ANALYSIS OF RETRIEVAL-AUGMENTED GENERATION (RAG) AND FINE-TUNING METHODOLOGIES. ShodhKosh: Journal of Visual and Performing Arts, 7(7s), 209–214. https://doi.org/10.29121/shodhkosh.v7.i7s.2026.7928