SimpleIRSystem

A simple IR system to index and query a collection.

Collaborators

Charles Du
Michael Kyeyune

Requirements

Python 3.4.3+

Utilising this system

To utilise this system you need to generate a collection from a testbed, index the collection then compare MAP/Avg NDCG results for the modified and unmodified engine.

generate collection from testbed
- python collect.py testbedx - x being the number of the testbed. This will generate a file testbedx_collection
index testbed collection
- python index.py testbedx_collection
analyse modified and unmodified engine performance
- python analyse.py testbedx
finding optimal indicative terms and top k documents to utilise for blind relevance feedback (BRF) for a single testbed
- python optimise.py -s testbedx 200 - 200 being the number of documents to consider in MAP/Avg NDCG calculations
finding optimal indicative terms and top k documents to utilise for BRF for all testbeds
- python optimise.py -a 200

Important Notices

Documents that are not in UTF-8 format are ignored when generating the collection for a testbed
All testbeds must be indexed before attempting to optimise across all testbeds

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
testbed1		testbed1
testbed11		testbed11
testbed12		testbed12
testbed13		testbed13
testbed14		testbed14
testbed15		testbed15
testbed16		testbed16
testbed2		testbed2
testbed3		testbed3
testbed4		testbed4
testbed5		testbed5
testbed6		testbed6
testbed7		testbed7
testbed8		testbed8
testbed9		testbed9
.gitignore		.gitignore
README.md		README.md
analyse.py		analyse.py
ap.py		ap.py
collect.py		collect.py
index.py		index.py
ndcg.py		ndcg.py
optimise.py		optimise.py
parameters.py		parameters.py
porter.py		porter.py
query.py		query.py
stop-word-list.txt		stop-word-list.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SimpleIRSystem

A simple IR system to index and query a collection.

Collaborators

Requirements

Utilising this system

Important Notices

About

Releases

Packages

Contributors 3

Languages

michael-xander/SimpleIRSystem

Folders and files

Latest commit

History

Repository files navigation

SimpleIRSystem

A simple IR system to index and query a collection.

Collaborators

Requirements

Utilising this system

Important Notices

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages