Analyze input text file return most relevant document and tf-idf score based on cosine similarity
undecided
- tokenize,stemming,remove stopwords from text
- generate vector for each doc
- generate vector for query
- compute cosine similary and sort .......