Title: A Text Retrieval System Using Latent Semantic Analysis
Authors: Baes, Gregorio B.; Mojar, Editha
Issue Date: Apr-2003
Abstract: A text retrieval system that uses Latent Semantic Analysis (LSA) for indexing is developed. A collection of 106 documents is represented as vectors in a 377-dimensional term space, where the number of dimensions corresponds to the number of content words extracted from all the document titles in the database. The 377-by-106 matrix representing the entire data set is decomposed using singular value decomposition, and the resulting matrices are truncated to 10 orthogonal factors. The recombination of the truncated matrices forms the basis for computing the distance of each document from a query vector, which is obtained by treating the query as a pseudo-document. Results indicate that indexing using LSA is a promising tool for improving retrieval.
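
The pipeline outlined in the abstract (term-by-document matrix, singular value decomposition, truncation to a small number of factors, and a query folded in as a pseudo-document) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names are hypothetical, the whitespace tokenization and raw term counts are assumptions, and cosine similarity stands in for the distance measure, which the abstract does not specify. For the paper's 377-by-106 matrix one would pass k=10.

```python
import numpy as np


def build_term_document_matrix(titles):
    """Return (vocab, A), where A[i, j] counts occurrences of term i in title j.

    Assumption: lowercase whitespace tokenization with raw counts; the abstract
    does not say how content words were extracted or weighted.
    """
    tokenized = [title.lower().split() for title in titles]
    vocab = sorted({word for doc in tokenized for word in doc})
    row = {word: i for i, word in enumerate(vocab)}
    A = np.zeros((len(vocab), len(titles)))
    for j, doc in enumerate(tokenized):
        for word in doc:
            A[row[word], j] += 1.0
    return vocab, A


def truncated_svd(A, k):
    """Decompose A = U S V^T and keep only the k largest factors."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


def fold_in_query(query, vocab, U_k, s_k):
    """Treat the query as a pseudo-document: q_hat = q^T U_k S_k^{-1}.

    Words outside the vocabulary are ignored. Assumes the k retained
    singular values are all nonzero.
    """
    row = {word: i for i, word in enumerate(vocab)}
    q = np.zeros(len(vocab))
    for word in query.lower().split():
        if word in row:
            q[row[word]] += 1.0
    return (q @ U_k) / s_k


def rank_documents(q_hat, Vt_k):
    """Return (order, sims): document indices sorted by cosine similarity.

    Each column of Vt_k holds one document's coordinates in the k-factor space.
    """
    docs = Vt_k.T
    denom = np.linalg.norm(docs, axis=1) * np.linalg.norm(q_hat)
    denom = np.where(denom == 0.0, 1.0, denom)  # avoid division by zero
    sims = (docs @ q_hat) / denom
    return np.argsort(-sims), sims


if __name__ == "__main__":
    # Toy titles for illustration only; they are not from the paper's collection.
    titles = [
        "human computer interaction",
        "computer system interface",
        "graph tree minors",
        "graph minors survey",
    ]
    vocab, A = build_term_document_matrix(titles)
    U_k, s_k, Vt_k = truncated_svd(A, k=2)
    q_hat = fold_in_query("graph tree", vocab, U_k, s_k)
    order, sims = rank_documents(q_hat, Vt_k)
    for j in order:
        print(f"{sims[j]:.3f}  {titles[j]}")
```

Comparing the folded-in query against the rows of V_k is one common convention; scaling both sides by the singular values is another, and the choice can change the ranking.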
Appears in Collections: Computer Science SP

Files in This Item:
File: A Text Retrieval System Using Latent Semantic Analysis.pdf
Size: 458.92 kB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.