How to use espnet/owsm_v4_base_102M with ESPnet:
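import soundfile
from espnet2.bin.asr_inference import Speech2Text

# Load the pretrained model by its Hub identifier
model = Speech2Text.from_pretrained(
    "espnet/owsm_v4_base_102M"
)

# Read the waveform, then keep the text of the top hypothesis
speech, rate = soundfile.read("speech.wav")
text, *_ = model(speech)[0]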
This PR adds the library_name and pipeline_tag fields to the model card metadata so that the model is easier to search for and discover.
library_name
pipeline_tag
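As a rough illustration of where these two fields live, the sketch below parses a model card's metadata header, the block between the leading `---` lines. The card text, the helper name `read_front_matter`, and both field values (`espnet` and `automatic-speech-recognition`) are assumptions chosen for illustration; the exact values set by this PR are not shown here.

```python
# Hypothetical model card text. The two values are assumptions for
# illustration and are not copied from the PR diff.
CARD_TEXT = """---
library_name: espnet
pipeline_tag: automatic-speech-recognition
---
# Model card body
"""


def read_front_matter(card_text: str) -> dict[str, str]:
    """Parse flat `key: value` pairs from a leading '---' delimited block."""
    lines = card_text.splitlines()
    # A card without a leading '---' line has no metadata header.
    if not lines or lines[0].strip() != "---":
        return {}
    metadata = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing delimiter: end of the metadata header
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    return metadata


metadata = read_front_matter(CARD_TEXT)
print(metadata["library_name"], metadata["pipeline_tag"])
```

This hand-rolled parser only handles flat string fields. Real model cards can carry nested metadata, so a full YAML parser would be the safer choice outside a sketch like this.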