python cvs files Leeds University and King Saud University (LK) Hadith Corpus
Bilingual parallel corpus of English-Arabic Islamic Hadith
Extracted from the six canonical Hadith books
The corpus contains 39,038 annotated Hadiths that comprises more than 10 million tokens
Each component of the Hadith is extracted and allocated to a specific column:
Chapter_Number
Chapter_English
Chapter_Arabic
Section_Number
Section_English
Section_Arabic
Hadith_number
English_Hadith
English_Isnad
English_Matn
Arabic_Hadith
Arabic_Isnad
Arabic_Matn
Arabic_Comment
English_Grade
Arabic_Grade