Please use this identifier to cite or link to this item: http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/3146
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTan, Timothy Marcus-
dc.date.accessioned2025-08-18T04:46:38Z-
dc.date.available2025-08-18T04:46:38Z-
dc.date.issued2025-06-
dc.identifier.urihttp://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/3146-
dc.description.abstractThough widely use across many research repositories, keyword search may not be sufficient for people who are becoming more familiar with the use of chatbots like ChatGPT. The proposed system will serve as a search engine for the UPM IRS which is a repository for the university’s theses. The system will utilize the vector space model in retrieving documents by directly embedding the user’s query into a vector to be compared to the vectors stored in a vector store by cosine similarity. Retrieval Augmented Generation (RAG) will then be used as the top documents will be given to a large language model (LLM) to create an overview of the top documents. The combination of a semantic retrieval method and a LLM was able to yield a good user experience and relevant results to the users.en_US
dc.subjectLarge Language Modelen_US
dc.subjectInformation Retrievalen_US
dc.subjectDocument Retrievalen_US
dc.subjectVector Space Modelen_US
dc.subjectNatural Language Processingen_US
dc.subjectText Generationen_US
dc.subjectResearch Repositoryen_US
dc.subjectKeyword Searchen_US
dc.subjectChatbotsen_US
dc.titleUtilizing a RAG-powered LLM for information retrieval in a research repositoryen_US
dc.typeThesisen_US
Appears in Collections:BS Computer Science SP

Files in This Item:
File Description SizeFormat 
2025_Tan TM_Utilizing a RAG-powered LLM for Information Retrieval in a Research Repository.pdf
  Until 9999-01-01
1.19 MBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.