Home / Papers / Top Research Papers on RAG

Top Research Papers on RAG

Explore our curated list of top research papers on RAG. This comprehensive collection delves into various aspects and innovations surrounding RAG. Whether you're a student, researcher, or enthusiast, these studies will provide valuable insights and deepen your understanding of RAG. Dive into the world of research and discover the breakthroughs shaping the future of RAG.

Looking for research-backed answers?Try AI Search

RAG-Fusion: a New Take on Retrieval-Augmented Generation

45 Citations 2024

Zackary Rackauckas

ArXiv

It is found that RAG-Fusion was able to provide accurate and comprehensive answers due to the generated queries contextualizing the original query from various perspectives, however, some answers strayed off topic when the generated queries' relevance to the original query is insufficient.

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

41 Citations 2025

Aditi Singh, Abul Ehtesham, Saket Kumar + 1 more

ArXiv

This survey provides a comprehensive exploration of Agentic RAG, beginning with its foundational principles and the evolution of RAG paradigms, and presents a detailed taxonomy of Agentic RAG architectures, highlights key applications in industries such as healthcare, finance, and education, and examines practical implementation strategies.

The Journey to a Knowledgeable Assistant with Retrieval-Augmented Generation (RAG)

2 Citations 2024

Xin Luna Dong

Companion of the 2024 International Conference on Management of Data

This talk describes the journey in building a knowledgeable AI assistant by harnessing LLM techniques, and describes a federated Retrieval-Augmented Generation system that integrates external information from both the web and knowledge graphs for trustworthy text generation on real-time topics like stocks and sports, as well as on torso-to-tail entities like local restaurants.

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

87 Citations 2024

Chi-Min Chan, Chunpu Xu, Ruibin Yuan + 4 more

ArXiv

This paper proposes learning to Refine Query for Retrieval Augmented Generation (RQ-RAG), endeavoring to enhance the model by equipping it with capabilities for explicit rewriting, decomposition, and disambiguation, and demonstrates enhanced performance in handling complex, multi-hop QA datasets.

Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers

64 Citations 2024

Kunal Sawarkar, Abhilasha Mangal, S. R. Solanki

2024 IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR)

This paper proposes the ‘Blended RAG’ method of leveraging semantic search techniques, such as Dense Vector indexes and Sparse Encoder indexes, blended with hybrid query strategies, which achieves better retrieval results and sets new benchmarks for IR datasets like NQ and TREC-COVID datasets.

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

47 Citations 2024

Zilong Wang, Zifeng Wang, Long T. Le + 9 more

ArXiv

Speculative RAG is introduced - a framework that leverages a larger generalist LM to efficiently verify multiple RAG drafts produced in parallel by a smaller, distilled specialist LM, which accelerates RAG by delegating drafting to the smaller specialist LM, with the larger generalist LM performing a single verification pass over the drafts.

RAGGED: Towards Informed Design of Scalable and Stable RAG Systems

13 Citations 2024

Jennifer Hsia, Afreen Shaikh, Zhiruo Wang + 1 more

journal unavailable

RAGGED, a framework for analyzing RAG configurations across various DBQA tasks, discovers distinct LM behaviors in response to varying context quantities, context qualities, and retrievers and provides a deeper analysis of these differences.

MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries

110 Citations 2024

Yixuan Tang, Yi Yang

ArXiv

A novel dataset, MultiHop-RAG, which consists of a knowledge base, a large collection of multi-hop queries, their ground-truth answers, and the associated supporting evidence, and it is hoped MultiHop-RAG will be a valuable resource for the community in developing effective RAG systems, thereby facilitating greater adoption of LLMs in practice.

Retrieval-Augmented Generation (RAG) and LLM Integration

2 Citations 2024

Büşra Tural, Zeynep Örpek, Zeynep Destan

2024 8th International Symposium on Innovative Approaches in Smart Technologies (ISAS)

This study emphasizes that the RAG architecture’s ability to retrieve information by dynamically using the learnings obtained from large datasets of LLMs strengthens applications in the field of NLP.

The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)

85 Citations 2024

Shenglai Zeng, Jiankun Zhang, Pengfei He + 8 more

journal unavailable

This work conducts extensive empirical studies with novel attack methods, which demonstrate the vulnerability of RAG systems on leaking the private retrieval database and reveals that RAG can mitigate the leakage of the LLMs' training data.

RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance

2 Citations 2025

Matin Mortaheb, M. A. Khojastepour, S. Chakradhar + 1 more

ArXiv

A novel framework to evaluate the reliability of multi-modal RAG using two performance measures: the relevancy score (RS), assessing the relevance of retrieved entries to the query, and the correctness score (CS), evaluating the accuracy of the generated response.

Mindful-RAG: A Study of Points of Failure in Retrieval Augmented Generation

6 Citations 2024

Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi + 1 more

2024 2nd International Conference on Foundation and Large Language Models (FLLM)

A critical analysis of failure points in existing KG-based RAG methods is presented, identifying eight key areas of concern, including misinterpretation of question context, incorrect relation mapping, and ineffective ambiguity resolution, which aim to improve the reliability and effectiveness of KG-RAG systems.

LegalBench-RAG: A Benchmark for Retrieval-Augmented Generation in the Legal Domain

40 Citations 2024

Nicholas Pipitone, Ghita Houir Alami

ArXiv

LegalBench-RAG is introduced, the first benchmark specifically designed to evaluate the retrieval step of RAG pipelines within the legal space, and serves as a critical tool for companies and researchers focused on enhancing the accuracy and performance of RAG systems in the legal domain.

C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models

23 Citations 2024

Mintong Kang, Nezihe Merve Gurel, Ning Yu + 2 more

ArXiv

C-RAG is proposed, the first framework to certify generation risks for RAG models and it is proved that RAG achieves a lower conformal generation risk than that of a single LLM when the quality of the retrieval model and transformer is non-trivial.

Harnessing Retrieval-Augmented Generation (RAG) for Uncovering Knowledge Gaps

1 Citations 2023

J. Hurtado

ArXiv

The study demonstrates the effectiveness of the RAG system in generating relevant suggestions with a consistent accuracy of 93% and highlights the value of identifying and understanding knowledge gaps to guide future endeavours.

Machine Against the RAG: Jamming Retrieval-Augmented Generation with Blocker Documents

39 Citations 2024

Avital Shafran, R. Schuster, Vitaly Shmatikov

ArXiv

It is demonstrated that RAG systems that operate on databases with untrusted content are vulnerable to denial-of-service attacks the authors call jamming, and a new method based on black-box optimization is described and measured, including a new method based on black-box optimization.

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

12 Citations 2024

Tian Yu, Shaolei Zhang, Yang Feng

ArXiv

Auto-RAG is introduced, an autonomous iterative retrieval model centered on the LLM's powerful decision-making capabilities that can autonomously adjust the number of iterations based on the difficulty of the questions and the utility of the retrieved knowledge, without requiring any human intervention.

Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation

4 Citations 2024

To Eun Kim, Fernando Diaz

ArXiv

This paper presents the first comprehensive study of RAG systems that incorporate fairness-aware rankings, focusing on both ranking fairness and attribution fairness - ensuring equitable exposure of sources cited in the final text.

Class-RAG: Real-Time Content Moderation with Retrieval Augmented Generation

2 Citations 2024

Jianfa Chen, Emily Shen, Trupti Bavalatti + 10 more

journal unavailable

Compared to model fine-tuning, Class-RAG demonstrates flexibility and transparency in decision-making, outperforms on classification and is more robust against adversarial attack, as evidenced by empirical studies.

The Journey to A Knowledgeable Assistant with Retrieval-Augmented Generation (RAG)

No citations 2024

Xin Luna Dong

Proceedings of the 17th ACM International Conference on Web Search and Data Mining

This talk describes the journey in building a knowledgeable AI assistant by harnessing LLM techniques, and constructed a federated Retrieval-Augmented Generation (RAG) system that integrates external information from both the web and knowledge graphs in text generation.

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

11 Citations 2024

Zhuoran Jin, Hongbang Yuan, Tianyi Men + 4 more

ArXiv

The RAG-RewardBench is proposed, the first benchmark for evaluating RMs in RAG settings and reveals that existing trained RALMs show almost no improvement in preference alignment, highlighting the need for a shift towards preference-aligned training.

P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task

1 Citations 2024

Weiye Xu, Min Wang, Wen-gang Zhou + 1 more

journal unavailable

This work proposes a novel approach called Progressive Retrieval Augmented Generation (P-RAG), which not only effectively leverages the powerful language processing capabilities of LLMs but also progressively accumulates task-specific knowledge without ground-truth.

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

41 Citations 2024

Yuanjie Lyu, Zhiyu Li, Simin Niu + 7 more

ACM Transactions on Information Systems

This paper constructs a large-scale and more comprehensive benchmark, and evaluates all the components of RAG systems in various RAG application scenarios, and refers to the CRUD actions that describe interactions between users and knowledge bases, and categorizes the range of RAG applications into four distinct types.

UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems

32 Citations 2024

Hongru Wang, Wenyu Huang, Yang Deng + 6 more

ArXiv

Experiments show that UniMS-RAG achieves state-of-the-art performance on the knowledge source selection and response generation task with itself as a retriever in a unified manner.

ICCA-RAG: Intelligent Customs Clearance Assistant Using Retrieval-Augmented Generation (RAG)

No citations 2025

Rong Hu, Sen Liu, Panpan Qi + 2 more

IEEE Access

Document processing and query generation tasks in customs declaration scenarios face key challenges such as the complexity of multimodal data, adaptability to dynamic regulations, and ambiguity in query semantics. This study proposes a Retrieval-Augmented Generation system (ICCA-RAG) that addresses the core issues of processing complex customs documents and dynamically generating queries through multimodal document parsing, sparse-dense hybrid storage, and context-driven large language model generation. In terms of multimodal document parsing, the system supports comprehensive parsing of PDFs,...

A Research of Challenges and Solutions in Retrieval Augmented Generation (RAG) Systems

No citations 2025

Jiafeng Gu

Highlights in Science, Engineering and Technology

This paper evaluates various advanced solutions proposed in recent literature, comparing their efficacy and discussing the trade-offs involved, and delves into the central architecture of RAG systems, encompassing retrieval components, generative components, and knowledge bases.

To Enhance Graph-Based Retrieval-Augmented Generation (RAG) with Robust Retrieval Techniques

No citations 2024

Maneeha Rani, Bhupesh Kumar Mishra, Dhavalkumar Thakker + 1 more

2024 18th International Conference on Open Source Systems and Technologies (ICOSST)

A novel knowledge graph-based RAG framework with a refined retrieval pipeline, robust chunking mechanism, and source traceability for enhanced diabetes-focused LLM is proposed, demonstrating effective performance in diabetes-focused LLM.

Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization

30 Citations 2024

Hamed Zamani, Michael Bendersky

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

Stochastic RAG casts the retrieval process in RAG as a stochastic sampling without replacement process, and employs straight-through Gumbel-top-k that provides a differentiable approximation for sampling without replacement and enables effective end-to-end optimization for RAG.

GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation

2 Citations 2025

Linhao Luo, Zicheng Zhao, Gholamreza Haffari + 3 more

ArXiv

GFM-RAG is a novel graph foundation model (GFM) for retrieval augmented generation powered by an innovative graph neural network that reasons over graph structure to capture complex query-knowledge relationships, making it the first graph foundation model applicable to unseen datasets for retrieval without any fine-tuning required.

GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation

No citations 2024

B. Rappazzo, Yingheng Wang, Aaron Ferber + 1 more

ArXiv

Graphical Eigen Memories For Retrieval Augmented Generation (GEM-RAG), motivated by human memory encoding and retrieval, is introduced, which aims to improve over standard RAG methods by generating and encoding higher-level information and tagging the chunks by their utility to answer questions.

MAIN-RAG: Multi-Agent Filtering Retrieval-Augmented Generation

4 Citations 2024

Chia-yuan Chang, Zhimeng Jiang, Vineeth Rakesh + 8 more

ArXiv

Multi-Agent Filtering Retrieval-Augmented Generation (MAIN-RAG) is proposed, a training-free RAG framework that leverages multiple LLM agents to collaboratively filter and score retrieved documents and introduces an adaptive filtering mechanism that dynamically adjusts the relevance filtering threshold based on score distributions.

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation

1 Citations 2024

Prakhar Verma, Sukruta Prakash Midigeshi, Gaurav Sinha + 3 more

journal unavailable

Plan*RAG is introduced, a novel framework that enables structured multi-hop reasoning in retrieval-augmented generation (RAG) through test-time reasoning plan generation by isolating the reasoning plan as a directed acyclic graph outside the LM's working memory.

Pistis-RAG: Enhancing Retrieval-Augmented Generation with Human Feedback

3 Citations 2024

Yu Bai, Yukai Miao, Li Chen + 6 more

journal unavailable

Experimental results indicate that Pistis-RAG improves alignment with human preferences relative to the baseline RAG system, showing a 6.06% increase in MMLU (English) and a 7.08% increase in C-EVAL (Chinese) accuracy metrics, highlighting Pistis-RAG's effectiveness in overcoming the limitations associated with traditional RAG approaches.

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

6 Citations 2024

Daniel Fleischer, Moshe Berchansky, Moshe Wasserblat + 1 more

ArXiv

RAG Foundry is introduced, an open-source framework for augmenting large language models for RAG use cases and demonstrates the framework effectiveness by augmenting and fine-tuning Llama-3 and Phi-3 models with diverse RAG configurations, showcasing consistent improvements across three knowledge-intensive datasets.

The Chronicles of RAG: The Retriever, the Chunk and the Generator

33 Citations 2024

Paulo Finardi, Leonardo Avila, Rodrigo Castaldoni + 5 more

ArXiv

This paper presents good practices to implement, optimize, and evaluate RAG for the Brazilian Portuguese language, focusing on the establishment of a simple pipeline for inference and experiments, and presents the complete architecture of the RAG.

Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track

17 Citations 2024

Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam + 5 more

ArXiv

The TREC 2024 RAG Track is proposed, which introduces a web-based user interface for an interactive arena allowing benchmarking pairwise RAG systems by crowdsourcing and opens-source the Ragnar\"ok framework and baselines to achieve a unified standard for future RAG systems.

MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity

15 Citations 2024

Xiaqiang Tang, Q. Gao, Jian Li + 3 more

journal unavailable

The approach leverages a multi-armed bandit algorithm, which treats each retrieval method as a distinct ``arm'' and adapts the selection process by balancing exploration and exploitation, and introduces a dynamic reward function that balances accuracy and efficiency.

Building a Retrieval-Augmented Generation (RAG) System for Academic Papers

No citations year unavailable

Anna Grigoryan, Habet Madoyan

journal unavailable

The RAG system utilizes a 2-step vector search using the vector search with cosine similarity metric on an HNSW index on the paper’s abstracts and the papers itself to pass only relevant information to LLM; this enables enhanced data retrieval and contextually aware text generation.

RAG Picking Helps: Retrieval Augmented Generation for Machine Translation

1 Citations year unavailable

Roee Aharoni, Yoav Goldberg, Unsupervised + 11 more

journal unavailable

It is shown that retrieval-augmented fine-tuning of NMT mod-012 els under the RAGMT framework results in an average improvement of 2.03 BLEU scores over simple fine-tuning approaches on English 015 to German domain-specific translation.

Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation

15 Citations 2023

Eric Melz

ArXiv

ARM-RAG (Auxiliary Rationale Memory for Retrieval Augmented Generation), a system that learns from its successes without incurring high training costs is proposed, and it is demonstrated that the storage and subsequent retrieval of reasoning chains have a positive influence on performance in grade-school math problems.

IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

24 Citations 2024

Diji Yang, Jinmeng Rao, Kezhen Chen + 4 more

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

This work proposes a novel LLM-centric approach that integrates IR systems with LLMs to support multi-round RAG through learning Inner Monologues, and introduces a Refiner that improves the outputs from the Retriever, effectively bridging the gap between the Reasoner and IR modules with varying capabilities and fostering multi-round communications.

DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering

8 Citations 2024

Zijian Hei, Weiling Liu, Wenjie Ou + 5 more

ArXiv

The experimental results on multi-hop QA datasets show that DR-RAG can significantly improve the accuracy of the answers and achieve new progress in QA systems.

Evaluation of Semantic Search and its Role in Retrieved-Augmented-Generation (RAG) for Arabic Language

2 Citations 2024

Ali Mahboub, Muhy Eddin Za'ter, Bashar Alfrou + 3 more

ArXiv

This paper endeavors to establish a straightforward yet potent benchmark for semantic search in Arabic within the framework of retrieval augmented generation (RAG) and examines the effectiveness of metrics and the dataset.

Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems

6 Citations 2025

Fatemeh Nazary, Yashar Deldjoo, T. D. Noia

journal unavailable

Poison-RAG manipulates item metadata, such as tags and descriptions, to influence recommendation outcomes, and indicates that popular items are more susceptible to attacks, whereas long-tail items are harder to manipulate.

Spatial-RAG: Spatial Retrieval Augmented Generation for Real-World Spatial Reasoning Questions

7 Citations 2025

Dazhou Yu, Riyang Bao, Gengchen Mai + 1 more

ArXiv

This work proposes Spatial Retrieval-Augmented Generation (Spatial-RAG), a framework that extends RAG to spatial tasks by integrating sparse spatial retrieval and dense semantic retrieval in Large Language Models.

Mitigating the Privacy Issues in Retrieval-Augmented Generation (RAG) via Pure Synthetic Data

1 Citations 2024

Shenglai Zeng, Jiankun Zhang, Pengfei He + 7 more

ArXiv

This work proposes SAGE, a novel two-stage synthetic data generation paradigm that employs an attribute-based extraction and generation approach to preserve key contextual information from the original data and enhances the privacy properties of the synthetic data through an agent-based iterative refinement process.

LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation

No citations 2024

Steven Song, Anirudh Subramanyam, Irene Madejski + 1 more

journal unavailable

It is demonstrated that LaB-RAG achieves better results across natural language and radiology language metrics compared with other retrieval-based RRG methods, while attaining competitive results compared to other fine-tuned vision-language RRG models.

Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering

192 Citations 2022

Shamane Siriwardhana, Rivindu Weerasekera, Elliott Wen + 3 more

Transactions of the Association for Computational Linguistics

This paper proposes RAG-end2end, an extension to RAG that can adapt to a domain- specific knowledge base by updating all components of the external knowledge base during training and introduces an auxiliary training signal to inject more domain-specific knowledge.

Augmenting Code Sequencing with Retrieval-Augmented Generation (RAG) for Context-Aware Code Synthesis

3 Citations 2024

S. Rani, S. G. Deepika, D. Devdharshini + 1 more

2024 First International Conference on Software, Systems and Information Technology (SSITCON)

This project presents a novel system designed to enhance code generation by leveraging Retrieval-Augmented Generation, Grounding techniques, and Prompt Parameters, which represents a significant advancement in automating code generation from natural language descriptions.

RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots

9 Citations 2024

Philip Feldman, James R. Foulds, Shimei Pan

ArXiv

Results show that RAG increases accuracy in some cases, but can still be misled when prompts directly contradict the model's pre-trained understanding, highlighting the complex nature of hallucinations and the need for more robust solutions to ensure LLM reliability in real-world applications.