TY - GEN
T1 - Enhancing Retrieval-Augmented Generation Accuracy with Dynamic Chunking and Optimized Vector Search
AU - Tanyildiz, Derya
AU - Ayvaz, Serkan
AU - Amasyali, Mehmet Fatih
PY - 2024/12/19
Y1 - 2024/12/19
N2 - Retrieval-Augmented Generation (RAG) architectures depend on the integration of efficient retrieval and ranking mechanisms to enhance response accuracy and relevance. This study investigates a novel approach to improving the response performance of RAG systems, leveraging dynamic chunking for contextual coherence, Sentence-Transformers (all-mpnet-base-v2) for high-quality embeddings, and cross-encoder-based re-ranking for retrieval refinement. Our evaluation utilizes RAGAS metrics to assess key performance metrics, including faithfulness, relevancy, correctness, and context precision. Empirical evaluations highlighted the significant impact of index choice on the performance. Our proposed approach integrates the FAISS HNSW index with re-ranking, resulting in a balanced architecture that improves response fidelity without compromising efficiency. These insights underscore the importance of advanced indexing and retrieval techniques in bridging the gap between large-scale language models and domain-specific information needs. The findings provide a robust framework for future research in optimizing RAG systems, particularly in scenarios requiring high-context preservation and precision.
AB - Retrieval-Augmented Generation (RAG) architectures depend on the integration of efficient retrieval and ranking mechanisms to enhance response accuracy and relevance. This study investigates a novel approach to improving the response performance of RAG systems, leveraging dynamic chunking for contextual coherence, Sentence-Transformers (all-mpnet-base-v2) for high-quality embeddings, and cross-encoder-based re-ranking for retrieval refinement. Our evaluation utilizes RAGAS metrics to assess key performance metrics, including faithfulness, relevancy, correctness, and context precision. Empirical evaluations highlighted the significant impact of index choice on the performance. Our proposed approach integrates the FAISS HNSW index with re-ranking, resulting in a balanced architecture that improves response fidelity without compromising efficiency. These insights underscore the importance of advanced indexing and retrieval techniques in bridging the gap between large-scale language models and domain-specific information needs. The findings provide a robust framework for future research in optimizing RAG systems, particularly in scenarios requiring high-context preservation and precision.
KW - Retrieval-Augmented Generation
KW - Vector Database
KW - Semantic Search
KW - Cross Encoders
KW - Dynamic Chunking
KW - RAG
U2 - 10.56038/oprd.v5i1.516
DO - 10.56038/oprd.v5i1.516
M3 - Conference article
SN - 2980-020X
VL - 5
SP - 215
EP - 225
JO - Orclever Proceedings of Research and Development
JF - Orclever Proceedings of Research and Development
IS - 1
ER -