
Materials for Information Retrieval and Advanced Information Retrieval courses


AbdulmueezEmiola/information-retrieval

 
 



Information Retrieval at Innopolis University

This repository aggregates reading materials, lab templates, datasets, and other electronic resources useful for learning about search, recommendations, and other IR topics.

Reading

  • As of 2021, THE BOOK is still both good and old. It is mandatory reading because it covers all the necessary topics. Unfortunately, it was written just before multimedia retrieval, recommender systems, and machine learning became commonplace.

  • Latent space approximation is an important topic; see the discussions of ALS, Word2Vec, and BERT.

  • Indexing is the circulatory system of search. Proximity graphs lie at the base level of the theory. Building on them, you will find NSW and HNSW graphs. Among search trees, don't forget to read about Annoy. For modern inverted indices, please refer to this paper and its predecessor.

  • Written in 2003, the audio retrieval paper by Shazam's creator is still important. Also consider Query by Humming and Hum to Search by Google.

  • Image retrieval starts with low-level features such as SIFT, Haralick, and Xerox features, and continues with machine learning using classifiers and autoencoders.

  • Topic modelling is usually a side topic for IR, but it becomes very important when it comes to clustering, debiasing, and analysis. Two leading approaches are PLSA and LDA. Here are some interesting materials: topic modelling (rus), tutorials, LDA original paper.
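
To make the indexing item above more concrete, here is a minimal sketch of an inverted index with boolean AND search. It is not taken from the course labs: the function names `build_inverted_index` and `search_and`, the whitespace tokenizer, and the toy documents are all assumptions chosen for illustration, and the modern indices discussed in the referenced papers are considerably more involved.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase token to the sorted list of document ids containing it."""
    postings = defaultdict(set)
    for doc_id, text in enumerate(docs):
        # Assumption: naive whitespace tokenization, no stemming or stop words.
        for token in text.lower().split():
            postings[token].add(doc_id)
    return {token: sorted(ids) for token, ids in postings.items()}


def search_and(index, query):
    """Return ids of documents that contain every query token (boolean AND)."""
    tokens = query.lower().split()
    if not tokens:
        return []
    result = set(index.get(tokens[0], []))
    for token in tokens[1:]:
        # Intersect posting lists; a missing token yields an empty result.
        result &= set(index.get(token, []))
    return sorted(result)


# Hypothetical toy corpus, used only to exercise the sketch.
docs = [
    "proximity graphs for nearest neighbour search",
    "inverted indices for text search",
    "audio retrieval with fingerprints",
]
index = build_inverted_index(docs)
```

Calling `search_and(index, "text search")` intersects the posting lists of both tokens, so only documents containing every query term are returned. Real engines typically store compressed, sorted posting lists and merge them without materializing sets.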

Course structure and material links

Please find them on a separate page.

Other courses with slides and labs


Languages

  • Jupyter Notebook 100.0%