Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
MIT License
Bot releases are visible (Hide)
Published by VRCWizard 10 months ago
{HREmoji (BPM: 0 E: 💀)(BPM: 40-59 E: 💔)(BPM: 60-100 E: ❤️)(BPM: 101-120 E: 💓)}
Published by VRCWizard 10 months ago
{HREmoji (BPM: 0 E: 💀)(BPM: 40-59 E: 💔)(BPM: 60-100 E: ❤️)(BPM: 101-120 E: 💓)}
Published by VRCWizard 10 months ago
Published by VRCWizard 10 months ago
Published by VRCWizard 10 months ago
Published by VRCWizard 10 months ago
Calibration added for deepgram to automatically adjust your silence threshold based on your background noise
VoiceForge TTS voices
Minimum Valid VAD Duration (s)
is used to prevent sending blank outputs to the API (saves your usage)
Silence Scale
can be used to adjust how long the pause is before sending audio. 30000 the default is around a 1 second pauseIt is highly recommended to use noise filtering software in conjunction with TTS Voice Wizard such as Nvidia Broadcast (Broadcast is the continuation of RTX Voice) or Nahimic (software that comes with MSI boards)
Voice Activation Detection (VAD) for Whisper Speech-To-Text
HighQuality
(not a lot of background noise) to VeryAggressive
(you are in an environment with alot of noise or fan sounds)Whisper GPU selection for Whisper (may not work)
{originalText}
, {translatedText}
, {inputLangCode}
, {inputLangName}
, {outputLangCode}
, {outputLangName}
as you choose{nline}
to start a newline[{inputLangCode}] {originalText}{nline}[{outputLangCode}] {translatedText}{nline}{inputLangName} ---> {outputLangName}
cheerful
, empathetic
, neutral
, uncertain
speaking styles added for IBM Watson Expressive
voicesdefault
Published by VRCWizard 10 months ago
Published by VRCWizard 11 months ago
Calibration added for deepgram to automatically adjust your silence threshold based on your background noise
VoiceForge TTS voices
Published by VRCWizard 11 months ago
Minimum Valid VAD Duration (s)
is used to prevent sending blank outputs to the API (saves your usage)
Silence Scale
can be used to adjust how long the pause is before sending audio. 30000 the default is around a 1 second pauseIt is highly recommended to use noise filtering software in conjunction with TTS Voice Wizard such as Nvidia Broadcast (Broadcast is the continuation of RTX Voice) or Nahimic (software that comes with MSI boards)
Published by VRCWizard 11 months ago
Voice Activation Detection (VAD) for Whisper Speech-To-Text
HighQuality
(not a lot of background noise) to VeryAggressive
(you are in an environment with alot of noise or fan sounds)Whisper GPU selection for Whisper (may not work)
{originalText}
, {translatedText}
, {inputLangCode}
, {inputLangName}
, {outputLangCode}
, {outputLangName}
as you choose{nline}
to start a newline[{inputLangCode}] {originalText}{nline}[{outputLangCode}] {translatedText}{nline}{inputLangName} ---> {outputLangName}
Published by VRCWizard 11 months ago
Expressive
voicesdefault
Published by VRCWizard 11 months ago
{progressBar E:◯ L:40}
Published by VRCWizard 12 months ago
{progressBar E:◯ L:40}
Published by VRCWizard 12 months ago
Published by VRCWizard 12 months ago
Published by VRCWizard 12 months ago
Published by VRCWizard 12 months ago
{nline}
for media outputPublished by VRCWizard 12 months ago
{nline}
for media outputPublished by VRCWizard 12 months ago
{nline}
for media outputPublished by VRCWizard about 1 year ago
HRPercent
parameter added (0-256 mapped to float 0.0-1.0)