How to use espnet/owsm_ctc_v3.1_1B with ESPnet:
from espnet2.bin.asr_inference import Speech2Text model = Speech2Text.from_pretrained( "espnet/owsm_ctc_v3.1_1B" ) speech, rate = soundfile.read("speech.wav") text, *_ = model(speech)[0]
Thanks!
· Sign up or log in to comment