Quran recitations split by Aya #2994
Comments
Wa alaikum assalam wa rahmatullahi wa barakatuh (and upon you be peace, and the mercy and blessings of Allah). If you need full mp3s (i.e. one mp3 per sura) and timing files, I recommend using the Quran.com API. If you want one mp3 per ayah, you can try everyayah.com
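As a rough illustration of how a per-sura mp3 plus a timing file could yield per-ayah segments, here is a minimal sketch. It assumes a hypothetical timing format (a sorted list of ayah start offsets in milliseconds, plus the total duration of the recording); the actual timing files may be shaped differently, and `ayah_intervals` is a name made up for this example.

```python
from typing import List, Tuple


def ayah_intervals(starts_ms: List[int], total_ms: int) -> List[Tuple[int, int]]:
    """Turn ayah start offsets into (start, end) intervals in milliseconds.

    Assumption: starts_ms is sorted ascending and holds one entry per ayah;
    each ayah ends where the next one starts, and the last ends at total_ms.
    """
    if not starts_ms:
        return []
    if any(b <= a for a, b in zip(starts_ms, starts_ms[1:])):
        raise ValueError("start offsets must be strictly increasing")
    if total_ms <= starts_ms[-1]:
        raise ValueError("total duration must exceed the last start offset")
    ends = starts_ms[1:] + [total_ms]
    return list(zip(starts_ms, ends))


# Hypothetical offsets for a three-ayah recording that is 12 seconds long.
intervals = ayah_intervals([0, 4200, 9100], 12000)
```

Each interval could then be handed to whatever audio tool you prefer to cut the corresponding clip out of the sura file.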
May Allah reward you with the best of rewards 😍😍
Typically, this is done using machine learning models. Initially it was this project, but others have built other models since. @nabil6391, maybe you can shed some light, since most of the best recent work on this was done by the gtaf.org team.
Thank you for your help. Let me be clearer about what we are building:
To do this we need two things: data and a machine learning model.
The data
The data is the most important part, and we have no data available ((
Are we done? No. After releasing the model, we need to gather actual data (recitations annotated by expert reciters) to make it better. If you have further suggestions, ideas, or contributions, I will be happy to hear them. @ahmedre @nabil6391
I'll let Nabil answer since he knows this space better, but I will share a few small things. First, the Tarteel dataset used to be open source, but it is closed now. They do link to the DeepSearch-Quran project, which might be interesting to you.
I noticed something strange while calling the API. The full output is:
{
"recitations": [
{
"id": 2,
"reciter_name": "AbdulBaset AbdulSamad",
"style": "Murattal",
"translated_name": {
"name": "AbdulBaset AbdulSamad",
"language_name": "english"
}
},
{
"id": 1,
"reciter_name": "AbdulBaset AbdulSamad",
"style": "Mujawwad",
"translated_name": {
"name": "AbdulBaset AbdulSamad",
"language_name": "english"
}
},
{
"id": 3,
"reciter_name": "Abdur-Rahman as-Sudais",
"style": null,
"translated_name": {
"name": "Abdur-Rahman as-Sudais",
"language_name": "english"
}
},
{
"id": 4,
"reciter_name": "Abu Bakr al-Shatri",
"style": null,
"translated_name": {
"name": "Abu Bakr al-Shatri",
"language_name": "english"
}
},
{
"id": 5,
"reciter_name": "Hani ar-Rifai",
"style": null,
"translated_name": {
"name": "Hani ar-Rifai",
"language_name": "english"
}
},
{
"id": 12,
"reciter_name": "Mahmoud Khalil Al-Husary",
"style": "Muallim",
"translated_name": {
"name": "Mahmoud Khalil Al-Husary",
"language_name": "english"
}
},
{
"id": 6,
"reciter_name": "Mahmoud Khalil Al-Husary",
"style": null,
"translated_name": {
"name": "Mahmoud Khalil Al-Husary",
"language_name": "english"
}
},
{
"id": 7,
"reciter_name": "Mishari Rashid al-`Afasy",
"style": null,
"translated_name": {
"name": "Mishari Rashid al-`Afasy",
"language_name": "english"
}
},
{
"id": 9,
"reciter_name": "Mohamed Siddiq al-Minshawi",
"style": "Murattal",
"translated_name": {
"name": "Mohamed Siddiq al-Minshawi",
"language_name": "english"
}
},
{
"id": 8,
"reciter_name": "Mohamed Siddiq al-Minshawi",
"style": "Mujawwad",
"translated_name": {
"name": "Mohamed Siddiq al-Minshawi",
"language_name": "english"
}
},
{
"id": 10,
"reciter_name": "Sa`ud ash-Shuraym",
"style": null,
"translated_name": {
"name": "Sa`ud ash-Shuraym",
"language_name": "english"
}
},
{
"id": 11,
"reciter_name": "Mohamed al-Tablawi",
"style": null,
"translated_name": {
"name": "Mohamed al-Tablawi",
"language_name": "english"
}
}
]
}
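For anyone consuming a response shaped like the one above, a small sketch of grouping entries by reciter may help, since the same reciter can appear under several ids with different styles and `style` can be null. The sample is trimmed from the output above (three of the twelve entries, with `translated_name` omitted), and `styles_by_reciter` is a hypothetical helper, not part of any API.

```python
import json
from collections import defaultdict

# A shortened copy of the response above (three of the twelve entries).
response_text = """
{
  "recitations": [
    {"id": 2, "reciter_name": "AbdulBaset AbdulSamad", "style": "Murattal"},
    {"id": 1, "reciter_name": "AbdulBaset AbdulSamad", "style": "Mujawwad"},
    {"id": 3, "reciter_name": "Abdur-Rahman as-Sudais", "style": null}
  ]
}
"""


def styles_by_reciter(payload: dict) -> dict:
    """Map each reciter name to a list of (id, style) pairs; style may be None."""
    grouped = defaultdict(list)
    for entry in payload["recitations"]:
        grouped[entry["reciter_name"]].append((entry["id"], entry["style"]))
    return dict(grouped)


grouped = styles_by_reciter(json.loads(response_text))
```

Keying on the pair of id and style, rather than on the name alone, avoids merging two distinct recitations by the same reciter.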
Today they are not synced, but syncing them is one of the things we hope to do in the future, in sha' Allah.
I'm very sorry for this long thread, and I'm truly grateful for your help; may Allah reward you with the best of rewards. I hope to contribute with you in future projects, in sha' Allah (God willing). Could you please give me the recitations metadata used in the Android app (the app's API schema, links, and timings)? More data means better results, and it will be hard for me to wait until it is published on the API, in sha' Allah. @ahmedre
Assalamu Alaikum. MashaAllah brother, you have a lot of knowledge in this, and yes, you are heading in the right direction. I have used https://everyayah.com/ for training the model; obviously I had to filter out ayahs longer than 30s. We at gtaf.org share some of your goals, and we are considering open sourcing our model as well, in sha Allah. If you want to know more and collaborate, just contact me at Nabil@GTAF (nabil6391) on Discord or [email protected].
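The 30-second filter mentioned above could look roughly like the following sketch. It assumes clip durations have already been measured and stored alongside each file; the `Clip` type, the file names, and the durations are all made up for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List

MAX_SECONDS = 30.0  # threshold mentioned in the comment above


@dataclass
class Clip:
    path: str          # hypothetical local path to a per-ayah mp3
    duration_s: float  # clip length in seconds, measured beforehand


def keep_short_clips(clips: Iterable[Clip], max_seconds: float = MAX_SECONDS) -> List[Clip]:
    """Return only the clips whose duration does not exceed max_seconds."""
    return [c for c in clips if c.duration_s <= max_seconds]


clips = [
    Clip("001001.mp3", 6.2),
    Clip("002282.mp3", 171.5),  # made-up duration for a long ayah
    Clip("112001.mp3", 3.1),
]
short = keep_short_clips(clips)
```

Here `short` keeps the first and third clips and drops the long one.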
I have sent you a friend request on Discord; my username is abdullah.aml. @nabil6391
Assalamu alaikum wa rahmatullahi wa barakatuh (peace be upon you, and the mercy and blessings of Allah). I hope everyone is in good health. In general, we hope that all the brothers will intensify their efforts and cooperate to find, and keep developing, a model that converts audio to text and extracts the timing of each word specifically for the Quran.
Peace be upon you.
May Allah reward you with good for this effort.
I am working on an AI Quran recitation teacher project, and I need recitations split by aya as a training dataset for the segmenter part of the project.
Could you please give me access to your recitations split by aya?