Speech-to-text service implementation using Google Cloud’s Speech-to-Text V2 API
GoogleSTTService
provides real-time speech recognition using Google Cloud’s Speech-to-Text V2 API with support for 125+ languages, multiple models, voice activity detection, and advanced features like automatic punctuation and word-level confidence scores.
InputAudioRawFrame
- Raw PCM audio data (16-bit, configurable sample rate, mono)STTUpdateSettingsFrame
- Runtime transcription configuration updatesSTTMuteFrame
- Mute audio input for transcriptionInterimTranscriptionFrame
- Real-time transcription updatesTranscriptionFrame
- Final transcription results with confidence scoresErrorFrame
- Connection or processing errorsModel | Description | Best For |
---|---|---|
latest_long | Optimized for long-form speech | Conversations, meetings, podcasts |
chirp_2 | LLM powered ASR model | Streaming and multilingual |
telephony | Optimized for phone call audio | Call centers, phone interviews |
medical_dictation | Medical terminology optimized | Healthcare dictation |
medical_conversation | Doctor-patient conversation optimized | Medical consultations |
Region | Description | Best For |
---|---|---|
global | Default global endpoint | General use, auto-routing |
us-central1 | US Central region | North American users |
europe-west1 | Europe West region | European users |
asia-northeast1 | Asia Northeast region | Asian users |
View All Supported Languages
Language Code | Description | Service Codes |
---|---|---|
Language.AF | Afrikaans | af-ZA |
Language.SQ | Albanian | sq-AL |
Language.AM | Amharic | am-ET |
Language.AR | Arabic (Egypt) | ar-EG |
Language.AR_AE | Arabic (UAE) | ar-AE |
Language.AR_SA | Arabic (Saudi Arabia) | ar-SA |
Language.HY | Armenian | hy-AM |
Language.AZ | Azerbaijani | az-AZ |
Language.EU | Basque | eu-ES |
Language.BN | Bengali (India) | bn-IN |
Language.BN_BD | Bengali (Bangladesh) | bn-BD |
Language.BS | Bosnian | bs-BA |
Language.BG | Bulgarian | bg-BG |
Language.MY | Burmese | my-MM |
Language.CA | Catalan | ca-ES |
Language.ZH | Chinese (Simplified) | cmn-Hans-CN |
Language.ZH_TW | Chinese (Traditional) | cmn-Hant-TW |
Language.YUE | Chinese (Cantonese) | yue-Hant-HK |
Language.HR | Croatian | hr-HR |
Language.CS | Czech | cs-CZ |
Language.DA | Danish | da-DK |
Language.NL | Dutch | nl-NL |
Language.NL_BE | Dutch (Belgium) | nl-BE |
Language.EN | English (US) | en-US |
Language.EN_AU | English (Australia) | en-AU |
Language.EN_CA | English (Canada) | en-CA |
Language.EN_GB | English (UK) | en-GB |
Language.EN_IN | English (India) | en-IN |
Language.ET | Estonian | et-EE |
Language.FIL | Filipino | fil-PH |
Language.FI | Finnish | fi-FI |
Language.FR | French | fr-FR |
Language.FR_CA | French (Canada) | fr-CA |
Language.GL | Galician | gl-ES |
Language.KA | Georgian | ka-GE |
Language.DE | German | de-DE |
Language.DE_AT | German (Austria) | de-AT |
Language.EL | Greek | el-GR |
Language.GU | Gujarati | gu-IN |
Language.HE | Hebrew | iw-IL |
Language.HI | Hindi | hi-IN |
Language.HU | Hungarian | hu-HU |
Language.IS | Icelandic | is-IS |
Language.ID | Indonesian | id-ID |
Language.IT | Italian | it-IT |
Language.JA | Japanese | ja-JP |
Language.JV | Javanese | jv-ID |
Language.KN | Kannada | kn-IN |
Language.KK | Kazakh | kk-KZ |
Language.KM | Khmer | km-KH |
Language.KO | Korean | ko-KR |
Language.LO | Lao | lo-LA |
Language.LV | Latvian | lv-LV |
Language.LT | Lithuanian | lt-LT |
Language.MK | Macedonian | mk-MK |
Language.MS | Malay | ms-MY |
Language.ML | Malayalam | ml-IN |
Language.MR | Marathi | mr-IN |
Language.MN | Mongolian | mn-MN |
Language.NE | Nepali | ne-NP |
Language.NO | Norwegian | no-NO |
Language.FA | Persian | fa-IR |
Language.PL | Polish | pl-PL |
Language.PT | Portuguese | pt-PT |
Language.PT_BR | Portuguese (Brazil) | pt-BR |
Language.PA | Punjabi | pa-Guru-IN |
Language.RO | Romanian | ro-RO |
Language.RU | Russian | ru-RU |
Language.SR | Serbian | sr-RS |
Language.SI | Sinhala | si-LK |
Language.SK | Slovak | sk-SK |
Language.SL | Slovenian | sl-SI |
Language.ES | Spanish (Spain) | es-ES |
Language.ES_MX | Spanish (Mexico) | es-MX |
Language.ES_US | Spanish (US) | es-US |
Language.SU | Sundanese | su-ID |
Language.SW | Swahili | sw-TZ |
Language.SV | Swedish | sv-SE |
Language.TA | Tamil | ta-IN |
Language.TE | Telugu | te-IN |
Language.TH | Thai | th-TH |
Language.TR | Turkish | tr-TR |
Language.UK | Ukrainian | uk-UA |
Language.UR | Urdu | ur-IN |
Language.UZ | Uzbek | uz-UZ |
Language.VI | Vietnamese | vi-VN |
Language.XH | Xhosa | xh-ZA |
Language.ZU | Zulu | zu-ZA |
Language.EN_US
- English (US) - en-US
Language.ES
- Spanish - es-ES
Language.FR
- French - fr-FR
Language.DE
- German - de-DE
Language.ZH
- Chinese (Simplified) - cmn-Hans-CN
Language.JA
- Japanese - ja-JP
GoogleSTTService
and use it in a pipeline:
STTUpdateSettingsFrame
for the GoogleSTTService
: