Skip to content

Latest commit

 

History

History
42 lines (28 loc) · 2.33 KB

README.md

File metadata and controls

42 lines (28 loc) · 2.33 KB

Open-Hadith-Data-Project

An open Hadith Library that contains full databases of 9 different books (including the Six Books), with the Elaboration of the 'Ahadith', all the following are included:

  • Sahih al-bukhari (صحيح البخاري).
  • Sahih Muslim (صحيح مسلم).
  • Sunan Abu-Dawud (سُنن ابي داود).
  • Jami al-Tirmidhi (جامع الترمذي).
  • Al-Sunan al-Sughra, collected by al-Nasa'i (السُنن الصغري).
  • Sunan ibn Majah (سُنن بن ماجه).
  • Muwatta Malik (موطأ الإمام مالك).
  • Musnad Ahmad ibn Hanbal (مُسند الإمام أحمد بن حنبل).
  • Sunan al-Darimi (سُنن الدارمي).

Each folder contains 2 files:

  1. A (.csv) file of the 'Ahadith' and 'Tafseel' with Arabic diacritics (tashkeel تشكيل), and Elaboration of Hadith (Tafseel تفصيل الحديث).

  2. A (.csv) file of the 'Ahadith' only without Arabic diacritics.

this structure will overcome Arabic Searching problems stated here, so you can search user's input without diacritics, and return results with diacritics.


notes:

  • the version with diacritcs include special RTL character that surrounds special words, such as important words or narrator's names. the original writers did that so developers could highlight these words, more information about how to implement this properly is in this link.

  • if you feel uncomfortable with the RTL character, you can remove it, and you would still be able to highlight some important words, as they are surrounded by 2 spaces at the beginning and another 2 at the end.

  • the vresion without diacritcs is completely clean, no RTL character, no double spaces.

  • please don't use the version without diacritcs as your main database, alwayse show Hadith with diacritcs, so they can be read properly.

  • the code used to remove the diacritics can be found here.

  • UTF-8 encoding is always used.

  • both 'Musnad Ahmad ibn Hanbal' and 'Sunan al-Darimi' don't include elaboration.


The original CSV files can be found in hadith-islamware repository.