kazoo Speech

This core library governs integrations with Text-to-Speech (TTS) providers and with (Automatic Speech Recognition (ASR) engines)[https://en.wikipedia.org/wiki/Speech_recognition] to allow Kazoo callflows to take advantage of the advanced functionality these provide.

TTS

Kazoo, by default, will use the internal mod_flite to play the text from a TTS callflow action.

There are also modules for iSpeech and VoiceFabric.

ASR

Kazoo currently supports as an ASR engine:

  1. iSpeech
  2. Google Cloud Speech API

Service Plan Definitions

  1. The ASR block defines usage rates for the providers
  2. The transcription block defines rates for having transcribe enabled on a vmbox
"asr": {
       "google": {
           "rate": 1,
           "name": "Google ASR"
       },
       "ispeech": {
           "rate": 1,
           "name": "ispeech ASR"
       }
   },
   "plan": {
   ....
       "voicemails": {
           "mailbox": {
               "name": "Voicemail Box",
               "rate": 1.99,
               "cascade": true
           },
           "transcription": {
               "cascade": true,
               "rate": 1,
               "name": "VMBox Transcription MRC"
           }
       },
....

On this page