Please use this identifier to cite or link to this item: http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/3146
Title: Utilizing a RAG-powered LLM for information retrieval in a research repository
Authors: Tan, Timothy Marcus
Keywords: Large Language Model
Information Retrieval
Document Retrieval
Vector Space Model
Natural Language Processing
Text Generation
Research Repository
Keyword Search
Chatbots
Issue Date: Jun-2025
Abstract: Though widely use across many research repositories, keyword search may not be sufficient for people who are becoming more familiar with the use of chatbots like ChatGPT. The proposed system will serve as a search engine for the UPM IRS which is a repository for the university’s theses. The system will utilize the vector space model in retrieving documents by directly embedding the user’s query into a vector to be compared to the vectors stored in a vector store by cosine similarity. Retrieval Augmented Generation (RAG) will then be used as the top documents will be given to a large language model (LLM) to create an overview of the top documents. The combination of a semantic retrieval method and a LLM was able to yield a good user experience and relevant results to the users.
URI: http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/3146
Appears in Collections:BS Computer Science SP

Files in This Item:
File Description SizeFormat 
2025_Tan TM_Utilizing a RAG-powered LLM for Information Retrieval in a Research Repository.pdf
  Until 9999-01-01
1.19 MBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.