Is there an Arabic model in sherpa-ncnn? #321
Comments
No, we don't have one. |
Can I create my own model, and how? |
Please refer to icefall. |
Can I use this code to create my own speech model for sherpa-ncnn that recognizes the numbers from 0 to 13?
!pip install torchaudio==2.0.2
from google.colab import drive
Copy the audio files to a folder in Colab.
!git clone https://github.com/k2-fsa/icefall
Set the environment variable:
import os
Install Lhotse:
!pip install git+https://github.com/lhotse-speech/lhotse
Create metadata:
from lhotse import Recording, SupervisionSegment, CutSet
Create recordings:
recordings = []
Create supervision segments:
supervisions = []
Combine them into a CutSet:
cuts = CutSet.from_manifests(recordings=recordings, supervisions=supervisions)
Save the data:
cuts.to_file("data/cuts.jsonl.gz")
!cd /content/icefall/egs/yesno/ASR && ./prepare.sh
Update dataset configuration:
!export PYTHONPATH=/content/icefall:$PYTHONPATH &&
!export PYTHONPATH=/content/icefall:$PYTHONPATH &&
torch.jit.save(model, "model.pt")
Are the previous steps correct? |
No. Only the 3 models listed in the icefall doc can be converted to sherpa-ncnn. |
You said to train a custom model with icefall and then export it to ncnn using one of these:
Export streaming Zipformer transducer models to ncnn
Export ConvEmformer transducer models to ncnn
Export LSTM transducer models to ncnn
Is that correct? If so, why did you answer "no" to my previous question about creating my own model? The code I copied is from icefall, which contains only two files for Google Colab, and this link: If you have another link, please provide it. |
Because you use tdnn. |
Please reread the doc. tdnn is not one of the 3 supported models. |
Which of the two files in icefall do you recommend I use to create my own model for the numbers from 0 to 13 on Google Colab, so I can use it in my Android project?
yes_no_dataset_recipe_with_CPU.ipynb
ctc_forced_alignment_fst_based_kaldi.ipynb
Also, which option should I choose?
Export streaming Zipformer transducer models to ncnn
Export ConvEmformer transducer models to ncnn
Export LSTM transducer models to ncnn |
Neither of these two files. As said before, only the 3 models listed in the doc can be exported to ncnn, which means you MUST choose one of them to train your own model. |
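The rule stated in the replies above can be sketched as a small check. This is only an illustration: the family labels below are hypothetical strings chosen for this example, not identifiers from icefall or sherpa-ncnn, and the set simply mirrors the three export pages named in this thread.

```python
# Model families that, according to the replies in this thread, can be
# exported to ncnn. The label strings are hypothetical names for this sketch.
NCNN_EXPORTABLE = {
    "streaming-zipformer-transducer",
    "conv-emformer-transducer",
    "lstm-transducer",
}


def can_export_to_ncnn(model_family: str) -> bool:
    """Return True if the given (hypothetical) label is one of the three families."""
    return model_family.strip().lower() in NCNN_EXPORTABLE


if __name__ == "__main__":
    # tdnn (used by the yesno recipe) is not in the list, so it is rejected.
    for family in ("tdnn", "lstm-transducer"):
        print(family, can_export_to_ncnn(family))
```

In other words, a recipe has to train one of these three model types before any of the ncnn export steps apply.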
I want to create a speech recognition model for numbers from 0 to 13 in Arabic using Google Colab. Something like this: https://youtu.be/m-JzldXm9bQ?feature=shared |
I don't see any difficulty in setting up the environment for icefall on a Colab notebook. Please describe your issue in detail. |
models Arabic
I only found this Arabic model for icefall: icefall-asr-mgb2-conformer_ctc-2022-27-06
I want an Arabic model for sherpa-ncnn.