To run the demo for version 0.8.0 beta, import the dependencies, open the `Demo/Demo08` scene, and select the `AIAvatarVRM` object in the scene.
We've boosted response speed with parallel processing and made it easier to customize behavior with your own code. Enjoy faster, more flexible AI conversations!
Adjusts vocal tone dynamically to match the conversation, delivering more engaging and natural interactions.
Microphone control is now more flexible than ever! Easily start/stop devices, mute/unmute, and adjust voice recognition thresholds independently.
Full Changelog: https://github.com/uezo/ChatdollKit/compare/0.7.7...0.8.0
Published by uezo about 2 months ago
We've added support for Text-to-Speech using the StyleBertVits2 API! 🎙️✨ Now, your AI characters can speak with even more expressive and dynamic voices, making them shine brighter than ever! 😎 Get ready to take your character's charm to the next level! 🚀💫
Full Changelog: https://github.com/uezo/ChatdollKit/compare/0.7.6...v0.7.7
Published by uezo 3 months ago
Full Changelog: https://github.com/uezo/ChatdollKit/compare/0.7.5...0.7.6
Published by uezo 4 months ago
Full Changelog: https://github.com/uezo/ChatdollKit/compare/0.7.4...0.7.5
Published by uezo 4 months ago
This update introduces autonomous vision input for Gemini and Claude, and adds vision input support for WebGL. Now, various AIs can offer richer conversational experiences with integrated vision input across different platforms.
Full Changelog: https://github.com/uezo/ChatdollKit/compare/0.7.3...0.7.4
Published by uezo 4 months ago
By adding a SimpleCamera to the scene and including [vision:camera] in the response message, the system will autonomously capture images when visual input is required for a response.
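For illustration, an exchange that triggers an autonomous capture might look like this (the dialogue below is hypothetical; only the `[vision:camera]` tag and SimpleCamera come from this release):

```
User: What am I holding right now?
Assistant: Let me take a look! [vision:camera]
```

When the tag appears in the response message, the system captures an image with SimpleCamera and uses it to generate the final answer.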
Microphone volume sliders and request input forms have been modularized. These can now be used immediately by simply adding the prefabs to the scene without any additional setup.
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.7.2...0.7.3
Published by uezo 7 months ago
🖼️ Support vision
Set an image to the request message as a payload for multimodal conversation.
🚀 Configuration-free demo
Just start without any configuration. Set the API key (and other settings if you like) at runtime.
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.7.1...v0.7.2
Published by uezo 10 months ago
Support changing the avatar at runtime: use `ModelController.SetAvatar` to switch to another avatar. To try it in the editor, set another avatar in the scene on the inspector of `ModelController` and press the `Change Avatar` button (it appears at runtime only).
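A minimal sketch of calling it from a script (the namespace and the way the new avatar is referenced are assumptions for illustration; `SetAvatar` is the API named above):

```csharp
using UnityEngine;
using ChatdollKit.Model;  // namespace assumed from ChatdollKit's layout

public class AvatarSwitcher : MonoBehaviour
{
    [SerializeField] private ModelController modelController;
    [SerializeField] private GameObject anotherAvatar;  // e.g. a second VRM model in the scene

    // Call this from a UI button or your own logic to swap avatars at runtime
    public void Switch()
    {
        modelController.SetAvatar(anotherAvatar);
    }
}
```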
Mute the microphone when not listening for a WakeWord or VoiceRequest. We also added an `IsMuted` property to `DialogController` so that you can control mute/unmute manually. The default is `false`. You can change the value on the inspector or in a script.
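Toggling it from a script might look like this (a sketch; the key binding and namespace are assumptions, while `IsMuted` is the property named above):

```csharp
using UnityEngine;
using ChatdollKit.Dialog;  // namespace assumed

public class MuteToggle : MonoBehaviour
{
    [SerializeField] private DialogController dialogController;

    private void Update()
    {
        // Toggle mute/unmute manually with the M key
        if (Input.GetKeyDown(KeyCode.M))
        {
            dialogController.IsMuted = !dialogController.IsMuted;
        }
    }
}
```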
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.7.0...v0.7.1
Published by uezo 10 months ago
✅Multiple LLMs: ChatGPT / Azure OpenAI Service, Anthropic Claude, Google Gemini Pro and others
✅Agents: Function Calling (ChatGPT / Gemini) or your prompt engineering
✅Multimodal: GPT-4V and Gemini-Pro-Vision are supported
✅Emotions: Autonomous face expression and animation
We've developed a versatile framework that standardizes the processes of Routing, Chatting (including facial expressions and motions), and executing Tools using various Large Language Models (LLMs). This framework allows for easy customization by simply swapping out LLM-specific components.
Additionally, you can support any LLM by creating your own components that implement the `ILLMService` interface.
To use the new LLM-based dialog, attach `LLMRouter`, `LLMContentSkill`, and an `ILLMService` component such as `ChatGPTService` from the `ChatdollKit.LLM` package. Also attach function skills that extend `LLMFunctionSkillBase` if you are building an AI agent. See DemoChatGPT, which works right out of the box🎁.
NOTE: `ChatdollKit.Dialog.Processor.ChatGPT*` components are deprecated.
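As a sketch, wiring the components up from code instead of the inspector could look like this (the component names are from this release; the namespace and configuration details are assumptions, so prefer the inspector in practice):

```csharp
using UnityEngine;
using ChatdollKit.LLM;  // namespace assumed

public class LLMDialogSetup : MonoBehaviour
{
    private void Awake()
    {
        // Router decides what to do, the content skill renders the reply,
        // and ChatGPTService is one ILLMService implementation
        gameObject.AddComponent<LLMRouter>();
        gameObject.AddComponent<LLMContentSkill>();
        gameObject.AddComponent<ChatGPTService>();
        // Configure the service (API key etc.) on the inspector afterwards
    }
}
```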
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.6.6...v0.7.0
Published by uezo 11 months ago
Previously, ChatGPT (the OpenAI API) sometimes did not return a response, which prevented the character from continuing the conversation.😔
With this update, the system automatically retries when no response is received, allowing the conversation to continue.
Register animations before starting a conversation (e.g. in `Start()`):

```csharp
chatGPTContentSkill.RegisterAnimation("waving_arm", new Model.Animation("BaseParam", 10, 3.0f));
chatGPTContentSkill.RegisterAnimation("look_away", new Model.Animation("BaseParam", 6, 3.0f, "AGIA_Layer_look_away_01", "Additive Layer"));
```
List them in the prompt and provide a guide on how to use them.
* You can express your emotions through the following animations:
- waving_arm
- look_away
* If you want to express emotions with gestures, insert the animation into the response message like [anim:waving_arm].
You can have multiple idling modes. The default is `normal`. Add animations and facial expressions to define another mode, such as `sleep`:

```csharp
modelController.AddIdleAnimation(new Model.Animation("BaseParam", 101, 5f), mode: "sleep");
modelController.AddIdleFace("sleep", "Blink");
```

Switch modes using `ChangeIdlingModeAsync`:

```csharp
modelController.ChangeIdlingModeAsync("sleep");
```

The animation for `sleep` starts immediately.
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.6.5...v0.6.6
Published by uezo 12 months ago
To use Azure OpenAI Service, set the following on the inspector of the ChatGPTService component:

- `Chat Completion Url` in the format `https://{your-resource-name}.openai.azure.com/openai/deployments/{deployment-id}/chat/completions?api-version={api-version}`
- Your API key in `Api Key`
- `Is Azure` set to `true`

NOTE: `Model` on the inspector is ignored; the deployment (engine) in the URL is used.
Support multiple languages and allow you to switch between them without changing configurations.
NOTE: Text-to-Speech doesn't support WebGL.
Use `AzureStreamVoiceRequestProvider` to perform real-time speech recognition. This component depends on the Azure Speech SDK.
NOTE: Microphone volume slider doesn't support controlling this component.
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.6.4...v0.6.5
Published by uezo about 1 year ago
By applying eye makeup, I enhanced the attractiveness of her face.🪄
You can create a RAG-based AI agent just by adding a ChatGPT function skill. See the RetrievalQASkill example, which can answer questions based on the OpenAI terms of use.
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.6.3...v0.6.4
Published by uezo about 1 year ago
Minor updates, but they are necessary for building apps.
See Unagirl in dress at vroid hub https://hub.vroid.com/characters/7204218679890800631/models/6961078537805052952
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.6.2...v0.6.3
Published by uezo about 1 year ago
ChatGPT is no longer just one of various skills; it is integrated as the conversation processing system of ChatdollKit.
You can add skills much more easily: all you have to do is define the interface for ChatGPT function calling and implement the logic that handles the result from the function.
See also the example.
https://github.com/uezo/ChatdollKit/blob/master/Examples/ChatGPT/FunctionSkills/WeatherSkill.cs
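A rough sketch of the shape such a function skill takes (the base class name, method names, and signatures below are assumptions for illustration only; see the linked WeatherSkill source for the actual API):

```csharp
// Hypothetical sketch: class and method names are assumed, not ChatdollKit's real API.
// Refer to WeatherSkill.cs linked above for the authoritative example.
public class MyWeatherSkill : ChatGPTFunctionSkillBase  // base class name assumed
{
    // 1. Define the function-calling interface that ChatGPT sees
    public override ChatGPTFunction GetFunction()
    {
        var func = new ChatGPTFunction("get_weather", "Get the current weather for a location");
        func.AddProperty("location", "string", "City name, e.g. Tokyo");  // signature assumed
        return func;
    }

    // 2. Implement the logic that handles the arguments ChatGPT returns
    public override string ExecuteFunction(string argumentsAsJson)  // signature assumed
    {
        // Call your weather API here with the parsed arguments,
        // then return the result as JSON for ChatGPT to phrase naturally
        return "{\"weather\": \"sunny\", \"temperature\": 23}";
    }
}
```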
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.6.1...v0.6.2
Published by uezo over 1 year ago
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.6...v0.6.1
Published by uezo over 1 year ago
Add `ChatGPTEmotionSkill` as an out-of-the-box example that expresses emotions as facial expressions during conversation.
Also add sample motions while thinking (waiting for a response from ChatGPT) and support multiple idle animations (the pose changes periodically).
You can build a rich 3D AI virtual assistant lightning fast⚡️
Support one-shot prompts: a system message and a pair of user/assistant messages as an example turn. You can configure them on the inspector, and even at runtime, since they are public fields.
Support configuration at runtime. This enables you to build apps that use the user's own VRM.
Improve FPS by about 30% (on my machine), and animation control is now more stable. These are mostly internal refactorings, but they include some breaking changes.
```csharp
// ~ v0.5.3
response.AddAnimation("AnimationName");

// v0.6
response.AddAnimation("BaseParam", 6);
```

`PreGap` and `PostGap` are no longer available as arguments of Animation and FaceExpression. Use duration to adjust timing.
Update() by @uezo in https://github.com/uezo/ChatdollKit/pull/211
Full Changelog: https://github.com/uezo/ChatdollKit/compare/v0.5.3...v0.6
Published by uezo over 1 year ago
A skill for contextual conversation with ChatGPT and its demo are included!
You can develop a 3D-model-based virtual agent and start talking with it lightning fast⚡️
Add slider for microphone volume and FPS indicator.
Fix some bugs for WebGL platform.
Published by uezo almost 2 years ago
Separate ChatdollKit components from the avatar object to make them easier to manage for use cases such as setup and changing the 3D model. As of this version, you can complete configuration just by adding the `ChatdollKit` prefab to the hierarchy and configuring some components attached to it. https://github.com/uezo/ChatdollKit/pull/190
To set up uLipSync, select uLipSync on the inspector of ModelController and then run Setup ModelController. https://github.com/uezo/ChatdollKit/pull/191
Published by uezo over 2 years ago
This update includes small internal changes, bugfixes, and Watson Assistant support.
Published by uezo over 2 years ago