OPTIMIZING DOMAIN-SPECIFIC LARGE LANGUAGE MODELS: A COMPARATIVE ANALYSIS OF RETRIEVAL-AUGMENTED GENERATION (RAG) AND FINE-TUNING METHODOLOGIES

Govind Geet; Agarwal Ankit; Rajesh D

doi:10.29121/shodhkosh.v7.i7s.2026.7928

Authors

Govind Geet, Microsoft Certified AI Engineer, India
Agarwal Ankit, Research Scholar, Malwanchal University, Indore, India
Dr. Rajesh D., Associate Professor, CIET-NCERT, India

DOI:

https://doi.org/10.29121/shodhkosh.v7.i7s.2026.7928

Keywords:

LLMs, RAG, Fine-Tuning, RAFT, Enterprise AI, Knowledge Limits, Real-Time Data, Domain Specialization

Abstract [English]

Large Language Models (LLMs) demonstrate substantial general-world knowledge derived from large-scale pretraining corpora. However, their utility in enterprise environments is constrained by static training data, temporal knowledge cut-offs, and limited access to proprietary or real-time information. Two principal methodologies have emerged to address these constraints: Retrieval-Augmented Generation (RAG) and Fine-Tuning. This paper provides a technical examination of both paradigms, analysing their architectures, operational trade-offs, cost profiles, and failure modes. It concludes by advocating a hybrid framework, Retrieval-Augmented Fine-Tuning (RAFT), as a robust strategy for domain-specialized enterprise deployments.

Published

2026-05-05

How to Cite

Geet, G., Ankit, A., & D, R. (2026). OPTIMIZING DOMAIN-SPECIFIC LARGE LANGUAGE MODELS: A COMPARATIVE ANALYSIS OF RETRIEVAL-AUGMENTED GENERATION (RAG) AND FINE-TUNING METHODOLOGIES. ShodhKosh: Journal of Visual and Performing Arts, 7(7s), 209–214. https://doi.org/10.29121/shodhkosh.v7.i7s.2026.7928

Issue

Vol. 7 No. 7s (2026): SPECIAL ISSUE ON ART, DESIGN, AND MEDIA CONVERGENCE IN THE DIGITAL ERA: AN ANALYTICAL STUDY OF CREATIVE PRACTICES

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Under the CC-BY licence, authors retain copyright while allowing anyone to download, reuse, reprint, modify, distribute, and/or copy their contribution, provided the work is properly attributed to its author.

It is not necessary to ask for further permission from the author or journal board.

This journal provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.