Microsoft Server Speech Text to Speech Voice (en-NG, EzinneNeural):
- Name: Ezinne
- Gender: Female
- Locale: English (Nigeria)
- Locale Code: en-NG
- Sample Rate: 48000 Hz
- Voice Type: Neural
- Words Per Minute: 142
See all the supported languages and voices here.
⌖ Lipsync Face Padding
Adjust the detected face bounding box. This often improves results. We recommend a padding of at least 10 for the chin region.
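As a rough illustration of what face padding does, the sketch below grows a detected bounding box by a per-side amount and clamps it to the frame. The function name `pad_face_box`, the `(x1, y1, x2, y2)` box layout, and the use of pixels as the unit are assumptions made for this example, not the tool's actual implementation.

```python
from typing import Tuple

# Hypothetical box layout: (x1, y1, x2, y2), origin at the top-left of the frame.
Box = Tuple[int, int, int, int]


def pad_face_box(
    box: Box,
    frame_w: int,
    frame_h: int,
    top: int = 0,
    bottom: int = 10,  # extra room below the face, i.e. the chin region
    left: int = 0,
    right: int = 0,
) -> Box:
    """Expand a face box by the given padding, clamped to the frame bounds."""
    x1, y1, x2, y2 = box
    return (
        max(0, x1 - left),
        max(0, y1 - top),
        min(frame_w, x2 + right),
        min(frame_h, y2 + bottom),
    )


# A 640x360 frame with a detected face; only the bottom edge moves by default.
print(pad_face_box((100, 50, 200, 180), frame_w=640, frame_h=360))
# (100, 50, 200, 190)
```

Clamping keeps the padded box inside the frame, so a face near the bottom edge simply receives less chin padding than requested.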
📄 Knowledge Base
Create Synthetic Data
To improve answer quality, pick a synthetic data maker workflow to scan and OCR any images in your documents, or to transcribe and translate any videos. It can also synthesize a helpful FAQ. Adds about 2 minutes of one-time processing per file.
Advanced Settings
In general, you should not need to adjust these.
🔠 Language Model Settings