kaldi-offline-transcriber issueshttps://koodivaramu.eesti.ee/taltechnlp/kaldi-offline-transcriber/-/issues2016-02-17T08:02:39Zhttps://koodivaramu.eesti.ee/taltechnlp/kaldi-offline-transcriber/-/issues/11Installing pyfst2016-02-17T08:02:39ZTANEL ALUMÄEInstalling pyfst*Created by: siilats*
On OSX you need:
CPPFLAGS="-I/home/speech/tools/kaldi-trunk/tools/openfst/include -L/home/speech/tools/kaldi-trunk/tools/openfst/lib -stdlib=libstdc++"
pip install pyfst
*Created by: siilats*
On OSX you need:
CPPFLAGS="-I/home/speech/tools/kaldi-trunk/tools/openfst/include -L/home/speech/tools/kaldi-trunk/tools/openfst/lib -stdlib=libstdc++"
pip install pyfst
https://koodivaramu.eesti.ee/taltechnlp/kaldi-offline-transcriber/-/issues/6Attempting to adapt to English2019-02-04T16:19:34ZTANEL ALUMÄEAttempting to adapt to English*Created by: aolney*
I'm a Kaldi noob but interested in using your set up for English. I looked at your other project and the Kaldi discussion boards, and this model seems like a good fit
http://kaldi-asr.org/downloads/build/8/trunk/
...*Created by: aolney*
I'm a Kaldi noob but interested in using your set up for English. I looked at your other project and the Kaldi discussion boards, and this model seems like a good fit
http://kaldi-asr.org/downloads/build/8/trunk/
However I'm not sure how to adapt your Makefile to use the new model. It seems I would need to at least swap out these lines:
```
# Main language model (should be slightly pruned), used for rescoring
LM ?=language_model/pruned.vestlused-dev.splitw2.arpa.gz
# More aggressively pruned LM, used in decoding
PRUNED_LM ?=language_model/pruned6.vestlused-dev.splitw2.arpa.gz
COMPOUNDER_LM ?=language_model/compounder-pruned.vestlused-dev.splitw.arpa.gz
# Vocabulary in dict format (no pronouncation probs for now)
VOCAB?=language_model/vestlused-dev.splitw2.dict
```
but I'm not finding comparable files in Fisher.
https://koodivaramu.eesti.ee/taltechnlp/kaldi-offline-transcriber/-/issues/18README: Speaker ID process clarification2017-09-28T12:42:32ZTANEL ALUMÄEREADME: Speaker ID process clarification*Created by: lkraav*
Perhaps the README could clarify what the expected process output is when speaker ID feature is enabled? What is supposed to look different in the text output compared to disabling speaker ID. Is it possible to give...*Created by: lkraav*
Perhaps the README could clarify what the expected process output is when speaker ID feature is enabled? What is supposed to look different in the text output compared to disabling speaker ID. Is it possible to give speakers names via some transcription configuration file, or is that post-text-editing work?https://koodivaramu.eesti.ee/taltechnlp/kaldi-offline-transcriber/-/issues/13Problem with Makefile.options file2016-09-14T16:55:07ZTANEL ALUMÄEProblem with Makefile.options file*Created by: Wickee*
Created a Makefile.options file with the following:
`KALDI_ROOT=/home/$USER/tools/kaldi`
But when running the makefile, somehow the symlinks created, namely sid, steps and utils, are broken. When I examined the ...*Created by: Wickee*
Created a Makefile.options file with the following:
`KALDI_ROOT=/home/$USER/tools/kaldi`
But when running the makefile, somehow the symlinks created, namely sid, steps and utils, are broken. When I examined the symlinks, instead of the expansion for `$USER, I see`SER`. I do not know why this happens since I am not well versed in programming in general and linux shell in particular. When I change the Makefile.options to use the expanded username instead of the variable, the script runs fine.