An implementation of an information retrieval system on the CACM collection. We implemented the boolean model and the vector space model. Four similarity functions were implemented for the former (dot product, cosinus, Dice, Jaccard). In addition, a GUI was developped to test the vector space model on the query set proposed in the CACM collection. The metrics for the test were : recall, average precision metric and 11pt average precision metric. We also plot the precision-recall curve and the interpolated precision-recall curve. Feel free to check the report (in french) for more details about the implementation.
-
Notifications
You must be signed in to change notification settings - Fork 0
HichemAK/information-retrieval
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
An implementation of the boolean model and the vector space model on the CACM collection
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published