Information Retrieval System
Using Python, NLTK, and the Vector Space Model, I built an Information Retrieval system. It processed 322 documents, tokenizing them and removing stop words, creating a substantial inverted index of 450k words. Employing TF-IDF weighting and cosine similarity, the system efficiently ranked documents for 50 test queries, resulting in an effective retrieval system.