TTS-Voice-Wizard

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

MIT License

Stars
582
Committers
7

Bot releases are hidden (Show)

TTS-Voice-Wizard - v1.6.9.8 Latest Release

Published by VRCWizard 3 months ago

TTS-Voice-Wizard - v1.6.9.6

Published by VRCWizard 3 months ago

  • Word Boundaries option for Word Replacement tab
    • Isolates whole words so that they can be replaced without replacing parts of other words.
    • enabled by default
TTS-Voice-Wizard - v1.6.9.5

Published by VRCWizard 4 months ago

TTS-Voice-Wizard - v1.6.9

Published by VRCWizard 5 months ago

  • VRChat keyboard will no longer be displayed for whitespace/empty strings (when the option is enabled)
  • Option added to disable speech to text translations from azure
  • Upon an unsafe launch a permanent save is now created for the last good backup file when it is loaded
  • Counters Tab updated
    • large quantities of OSC data will no longer crash TTS Voice Wizard when counters are enabled
    • you now need to click the apply changes button upon changing any of the text fields in the Counter Tab
    • more counter slots
TTS-Voice-Wizard - v1.6.8.5 Lyrics

Published by VRCWizard 6 months ago

Lyrics can now be sent from BigAtomikku/VRC-Spotify-OSC to TTS Voice Wizard

  1. When setting up the config file set the sent port to 4026 (very important step)
    image
  2. In TTS Voice Wizard Activate the OSC Listener on the same port 4026
    image
  3. In TTS Voice Wizard use the {lyrics} variable to output lyrics
  4. Note the chatbox can only update as quick as 1500ms so if a song has really fast lyrics (think Eminem) some lyrics will be skipped
    image
TTS-Voice-Wizard - v1.6.8.3

Published by VRCWizard 6 months ago

  • fixes for using chatgpt integration
TTS-Voice-Wizard - v1.6.8.2

Published by VRCWizard 6 months ago

v1.6.8.2

  • fix for naming inconsistency (same as previous version)

v1.6.8.1

  • minor fixes to chatgpt integration tab

v1.6.8

  • OpenAI TTS
    • Available with Pro and with your own API key
  • ChatGPT Integration (Text to ChatGPT to Speech)
    • Available with Pro and with your own API key. (with pro you can send a prompt and your current message. With your own key you can send full unlimited context)
  • Code optimization to reduce response times, a couple ms have been shaved off of several TTS and STT options
    • Faster responses from pro tts options and deepgram speech to text
  • VRChat chatbox typing indicator toggle
  • Minor UI changes
    • Deepgram silence scale
    • tooltips added for some ui elements
  • Bug fixes
    • fixed issue when switched stt methods when whisper was running

v1.6.7.3

  • Deepgram Aura TTS (english voices only) available on the dev branch of Voice Wizard Pro
  • Added button to add progressbar for media integration
  • Added tooltips to media integration buttons and some general setting buttons
  • Added Albanian language for TTS though Azure/Pro and Speech to Text via Azure.
  • Added "/TTSVoiceWizard/TextToText" OSC endpoint
  • hotkeys no longer trigger multiple times when held down
  • Removed zh-CN-XiaoxuanNeural voice from azure voice list
TTS-Voice-Wizard - v1.6.8.1

Published by VRCWizard 7 months ago

v1.6.8.1

  • minor fixes to chatgpt integration tab

v1.6.8

  • OpenAI TTS
    • Available with Pro and with your own API key
  • ChatGPT Integration (Text to ChatGPT to Speech)
    • Available with Pro and with your own API key. (with pro you can send a prompt and your current message. With your own key you can send full unlimited context)
  • Code optimization to reduce response times, a couple ms have been shaved off of several TTS and STT options
    • Faster responses from pro tts options and deepgram speech to text
  • VRChat chatbox typing indicator toggle
  • Minor UI changes
    • Deepgram silence scale
    • tooltips added for some ui elements
  • Bug fixes
    • fixed issue when switched stt methods when whisper was running

v1.6.7.3

  • Deepgram Aura TTS (english voices only) available on the dev branch of Voice Wizard Pro
  • Added button to add progressbar for media integration
  • Added tooltips to media integration buttons and some general setting buttons
  • Added Albanian language for TTS though Azure/Pro and Speech to Text via Azure.
  • Added "/TTSVoiceWizard/TextToText" OSC endpoint
  • hotkeys no longer trigger multiple times when held down
  • Removed zh-CN-XiaoxuanNeural voice from azure voice list
TTS-Voice-Wizard - v1.6.8

Published by VRCWizard 7 months ago

v1.6.8 (current)

  • OpenAI TTS
    • Available with Pro and with your own API key
  • ChatGPT Integration (Text to ChatGPT to Speech)
    • Available with Pro and with your own API key. (with pro you can send a prompt and your current message. With your own key you can send full unlimited context)
  • Code optimization to reduce response times, a couple ms have been shaved off of several TTS and STT options
    • Faster responses from pro tts options and deepgram speech to text
  • VRChat chatbox typing indicator toggle
  • Minor UI changes
    • Deepgram silence scale
    • tooltips added for some ui elements
  • Bug fixes
    • fixed issue when switched stt methods when whisper was running

v1.6.7.3

  • Deepgram Aura TTS (english voices only) available on the dev branch of Voice Wizard Pro
  • Added button to add progressbar for media integration
  • Added tooltips to media integration buttons and some general setting buttons
  • Added Albanian language for TTS though Azure/Pro and Speech to Text via Azure.
  • Added "/TTSVoiceWizard/TextToText" OSC endpoint
  • hotkeys no longer trigger multiple times when held down
  • Removed zh-CN-XiaoxuanNeural voice from azure voice list
TTS-Voice-Wizard - v1.6.7.3 Deepgram Aura

Published by VRCWizard 8 months ago

  • Deepgram Aura TTS (english voices only) available on the dev branch of Voice Wizard Pro
  • Added button to add progressbar for media integration
  • Added tooltips to media integration buttons and some general setting buttons
  • Added Albanian language for TTS though Azure/Pro and Speech to Text via Azure.
  • Added "/TTSVoiceWizard/TextToText" OSC endpoint
  • hotkeys no longer trigger multiple times when held down
  • Removed zh-CN-XiaoxuanNeural voice from azure voice list
TTS-Voice-Wizard - v1.6.7.1

Published by VRCWizard 8 months ago

TTS-Voice-Wizard - v1.6.7

Published by VRCWizard 8 months ago

v1.6.7 (current)

  • Updated the CoreOSC package to allow for receiving utf8 characters via OSC
  • Changed the elements of the progress bar variable {progressBar E:◯ L:13} to use more fitting utf characters. ┣━━━━┫ vs |-----------|
    • You will likely need to lower the length (L) variable for the change in characters. A length of 13 will fill a line of the VRChat chatbox.
      image

v1.6.6.8

  • 2 optional override bools added for OSC to TTS endpoint /TTSVoiceWizard/TextToSpeech
    • bool useChatbox
    • bool useKillFrenzyAvatarText

The bools allow you to optionally forward text the VRC Chatbox or Killfrenzy Avatar Text without adjusting the in app settings. When sending an OSC message to this endpoint you may still send just a string or both bools (sending only one of the bools will not override the settings).

Example of possible combinations in TouchOSC:
image

v1.6.6.7

  • fixed vrchat counter causing timeout if different counters were activated in quick succession.
  • spotify exceptions should always give a message.

v1.6.6.5

  • minor bug fixes
  • ElevenLabs turbo model added
  • removed voices for TikTok API that no longer functioned (optimus prime voices)
  • New Azure Voices:
es-ES | Spanish (Spain) | XimenaNeural (Female)
fr-CA | French (Canada) | ThierryNeural (Male)
fr-FR | French (France) | VivienneNeural (Female)
it-IT | Italian (Italy) | GiuseppeNeural (Male)
ko-KR | Korean (Korea) | HyunsuNeural (Male)
pt-BR | Portuguese (Brazil) | ThalitaNeural (Female)
zh-CN-XiaoxiaoDialectsNeural
de-DE | German (Germany) | de-DE-FlorianMultilingualNeural (Male)
de-DE | German (Germany) | de-DE-SeraphinaMultilingualNeural (Female)
en-US | English (United States) | en-US-AvaMultilingualNeural (Female)
en-US | English (United States) | en-US-EmmaMultilingualNeural (Female)
fr-FR | French (France) | fr-FR-RemyMultilingualNeural (Male)
en-US | English (United States) | en-US-BrianMultilingualNeural (Male)
en-US | English (United States) | en-US-AndrewMultilingualNeural (Male)
fr-FR | French (France) | fr-FR-VivienneMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaoxiaoMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaochenMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-YunyiMultilingualNeural (Male)

v1.6.5.2

  • uberduck voice selection fix (although some voices in the list no longer function)
  • now using latest version of windows media controller, (report any crashing and include event viewer details)
  • {albumArtist} added for windows media and Spotify media integration

v1.6.5

  • vrchat "/chatbox/typing" indicator added for whisper and deepgram (if VAD is active)
  • fix for whisper timestamps being offset for VAD so super short activations wouldn't be recognized
  • possible fix for group in word replacement tab breaking on some computers
  • hotkeys backend refactored, new hotkeys added for changing voice presets quickly
TTS-Voice-Wizard - v1.6.6.8

Published by VRCWizard 8 months ago

  • 2 optional override bools added for OSC to TTS endpoint /TTSVoiceWizard/TextToSpeech
    • bool useChatbox
    • bool useKillFrenzyAvatarText

The bools allow you to optionally forward text the VRC Chatbox or Killfrenzy Avatar Text without adjusting the in app settings. When sending an OSC message to this endpoint you may still send just a string or both bools (sending only one of the bools will not override the settings).

Example of possible combinations in TouchOSC:
image

TTS-Voice-Wizard - v1.6.6.7

Published by VRCWizard 9 months ago

  • fixed vrchat counter causing timeout if different counters were activated in quick succession.
  • spotify exceptions should always give a message.
TTS-Voice-Wizard - v1.6.6.5

Published by VRCWizard 9 months ago

v1.6.6.5 (current)

  • minor bug fixes
  • ElevenLabs turbo model added
  • removed voices for TikTok API that no longer functioned (optimus prime voices)
  • New Azure Voices:
es-ES | Spanish (Spain) | XimenaNeural (Female)
fr-CA | French (Canada) | ThierryNeural (Male)
fr-FR | French (France) | VivienneNeural (Female)
it-IT | Italian (Italy) | GiuseppeNeural (Male)
ko-KR | Korean (Korea) | HyunsuNeural (Male)
pt-BR | Portuguese (Brazil) | ThalitaNeural (Female)
zh-CN-XiaoxiaoDialectsNeural
de-DE | German (Germany) | de-DE-FlorianMultilingualNeural (Male)
de-DE | German (Germany) | de-DE-SeraphinaMultilingualNeural (Female)
en-US | English (United States) | en-US-AvaMultilingualNeural (Female)
en-US | English (United States) | en-US-EmmaMultilingualNeural (Female)
fr-FR | French (France) | fr-FR-RemyMultilingualNeural (Male)
en-US | English (United States) | en-US-BrianMultilingualNeural (Male)
en-US | English (United States) | en-US-AndrewMultilingualNeural (Male)
fr-FR | French (France) | fr-FR-VivienneMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaoxiaoMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaochenMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-YunyiMultilingualNeural (Male)

v1.6.5.2

  • uberduck voice selection fix (although some voices in the list no longer function)
  • now using latest verson of windows media controller, report any crashing and include event viewer details
  • {albumArtist} added for windows media and spotify media integration

v1.6.5

  • vrchat "/chatbox/typing" indicator added for whisper and deepgram (if VAD is active)
  • fix for whisper timestamps being offset for VAD so super short activations wouldn't be recognized
  • possible fix for group in word replacement tab breaking on some computers
  • hotkeys backend refactored, new hotkeys added for changing voice presets quickly
TTS-Voice-Wizard - v1.6.6.1

Published by VRCWizard 9 months ago

This is a demo of a different whisper library, it has not been thoroughly tested so there are likely bugs

v1.6.6.1 (current)

  • more robust output purification and removal
  • metrics on by default for this version
  • metrics changed to be more visible and distinguishable
  • fixed model download loop for models that were considered too small to be real

v1.6.6

  • Fixed memory leak in previous version (which caused crashing, oopsie)
  • Min VAD Duration (s) and Min Avg Confidence have been added to help fight against hallucinated outputs, you can enable debug text to see everything that gets filtered out

v1.6.5.9

TTS-Voice-Wizard - v1.6.6 New Whisper Library

Published by VRCWizard 9 months ago

This is a demo of a different whisper library, it has not been thoroughly tested so there are likely bugs

v1.6.6 (current)

  • Fixed memory leak in previous version (which caused crashing, oopsie)
  • Min VAD Duration (s) and Min Avg Confidence have been added to help fight against hallucinated outputs, you can enable debug text to see everything that gets filtered out

v1.6.5.9

TTS-Voice-Wizard - v1.6.5.9 Whisper Library Changed

Published by VRCWizard 9 months ago

This is a demo of a different whisper library, it has not been thoroughly tested so there are likely bugs

Warning: With this library/implementation for whisper running larger models will now cause a much more noticeable delay before output

TTS-Voice-Wizard - v1.6.5.2

Published by VRCWizard 9 months ago

  • uberduck voice selection fix (although some voices in the list no longer function)
  • now using latest verson of windows media controller, report any crashing and include event viewer details
  • {albumArtist} added for windows media and spotify media integration
TTS-Voice-Wizard - v1.6.5

Published by VRCWizard 9 months ago

  • vrchat "/chatbox/typing" indicator added for whisper and deepgram (if VAD is active)
  • fix for whisper timestamps being offset for VAD so super short activations wouldn't be recognized
  • possible fix for group in word replacement tab breaking on some computers
  • hotkeys backend refactored, new hotkeys added for changing voice presets quickly