Assistive Speech Technology System

Overview

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs. This system integrates multiple modules that specialize in different aspects of speech and audio analysis, catering to a range of applications from assisting individuals with speech impairments to detecting emotional tones and verifying audio authenticity.

Modules

1. Dysarthric ASR (Automatic Speech Recognition)

This module is designed to recognize and transcribe speech from individuals with dysarthria. 
It uses advanced machine learning models to accurately interpret impaired speech patterns and convert them into text.
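
A minimal sketch of what such a transcription step might look like, assuming a pretrained wav2vec 2.0 checkpoint from Hugging Face; the README does not name the actual model or checkpoint the module uses:

    import torch
    import torchaudio
    from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

    # Hypothetical checkpoint; the module's actual model is not documented in this README.
    MODEL_NAME = "facebook/wav2vec2-base-960h"
    processor = Wav2Vec2Processor.from_pretrained(MODEL_NAME)
    model = Wav2Vec2ForCTC.from_pretrained(MODEL_NAME)

    def transcribe(wav_path: str) -> str:
        # Load the clip, mix down to mono, and resample to the 16 kHz rate the model expects.
        waveform, sample_rate = torchaudio.load(wav_path)
        waveform = waveform.mean(dim=0)
        if sample_rate != 16000:
            waveform = torchaudio.functional.resample(waveform, sample_rate, 16000)
        inputs = processor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")
        with torch.no_grad():
            logits = model(inputs.input_values).logits
        predicted_ids = torch.argmax(logits, dim=-1)
        return processor.batch_decode(predicted_ids)[0]

    print(transcribe("sample.wav"))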

2. Emotion Classification

The Emotion Classification module analyzes the speech input to detect and classify the speaker's emotional state. 
It helps in understanding the sentiment behind the spoken words, which can be crucial for various assistive and therapeutic applications.
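
One common approach is to summarize each clip with acoustic features such as MFCCs and train a conventional classifier on labeled recordings. The sketch below illustrates that idea with librosa and scikit-learn; the file names, labels, and choice of classifier are placeholders, not the module's actual pipeline:

    import numpy as np
    import librosa
    from sklearn.svm import SVC

    def mfcc_features(wav_path: str) -> np.ndarray:
        # Average MFCCs over time to get one fixed-length feature vector per clip.
        audio, sr = librosa.load(wav_path, sr=16000)
        mfcc = librosa.feature.mfcc(y=audio, sr=sr, n_mfcc=13)
        return mfcc.mean(axis=1)

    # Placeholder training data; labeled (path, emotion) pairs are not part of this README.
    train_clips = [("happy_01.wav", "happy"), ("sad_01.wav", "sad"), ("angry_01.wav", "angry")]
    X = np.array([mfcc_features(path) for path, _ in train_clips])
    y = [label for _, label in train_clips]

    clf = SVC(kernel="rbf").fit(X, y)
    print(clf.predict([mfcc_features("unknown.wav")]))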

3. Deep Fake Audio Detection

This module focuses on identifying synthetic or manipulated audio, commonly known as deepfake audio.
It uses detection algorithms to identify signs of audio tampering, helping to verify that the speech is authentic.
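
A typical detector of this kind scores a log-mel spectrogram with a small neural network trained to separate genuine from synthetic audio. The PyTorch sketch below only shows the shape of such a pipeline; the architecture and the absence of trained weights are assumptions, not the module's implementation:

    import torch
    import torch.nn as nn
    import torchaudio

    # Toy architecture over log-mel spectrograms; the module's real model and trained
    # weights are not documented here, so the scores below are illustrative only.
    mel = torchaudio.transforms.MelSpectrogram(sample_rate=16000, n_mels=64)

    class FakeAudioDetector(nn.Module):
        def __init__(self):
            super().__init__()
            self.conv = nn.Sequential(
                nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1),
            )
            self.fc = nn.Linear(16, 2)  # two classes: genuine vs. fake

        def forward(self, waveform):
            spec = torch.log1p(mel(waveform)).unsqueeze(1)  # (batch, 1, n_mels, frames)
            return self.fc(self.conv(spec).flatten(1))

    model = FakeAudioDetector()
    waveform, sr = torchaudio.load("clip.wav")   # assumes 16 kHz audio; resample otherwise
    scores = model(waveform).softmax(dim=-1)
    print(scores)  # untrained weights, so this only demonstrates the data flow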

4. Infant Cry Detection

The Infant Cry Detection module is designed to analyze audio signals to detect and classify infant cries. 
It can distinguish between different types of cries (e.g., hunger, discomfort) and provides insights for caregivers or automated monitoring systems.
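
As a rough illustration, crying can be flagged by combining loudness with a pitch estimate, since infant cries tend to carry a strong fundamental of a few hundred hertz. The band limits and thresholds below are assumptions for demonstration, not the module's tuned values:

    import librosa

    def detect_cry_frames(wav_path: str, fmin=250.0, fmax=700.0):
        # Flag frames that are both loud and pitched inside an assumed cry band.
        audio, sr = librosa.load(wav_path, sr=16000)
        f0 = librosa.yin(audio, fmin=80, fmax=1000, sr=sr)
        rms = librosa.feature.rms(y=audio)[0]
        n = min(len(f0), len(rms))
        return (rms[:n] > rms.mean()) & (f0[:n] > fmin) & (f0[:n] < fmax)

    flags = detect_cry_frames("nursery.wav")
    print(f"{flags.mean():.0%} of frames flagged as crying")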

5. Voice Liveness Detection

This module ensures that the speech input is coming from a live person rather than a recording. 
It uses various signal processing techniques to detect signs of voice spoofing, adding a layer of security to voice-based systems.
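
The README does not document which techniques are used; as one toy illustration, replayed or spoofed audio often loses energy in the upper frequency band compared with live speech, which a simple spectral ratio can expose:

    import numpy as np
    import librosa

    def high_band_energy_ratio(wav_path: str, split_hz: float = 4000.0) -> float:
        # Share of spectral energy above split_hz; a toy replay cue, not the module's method.
        audio, sr = librosa.load(wav_path, sr=16000)
        spectrum = np.abs(np.fft.rfft(audio)) ** 2
        freqs = np.fft.rfftfreq(len(audio), d=1.0 / sr)
        return float(spectrum[freqs >= split_hz].sum() / spectrum.sum())

    ratio = high_band_energy_ratio("probe.wav")
    print("possible replay" if ratio < 0.05 else "likely live")  # threshold is illustrative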

Technologies Used

  • Python: The core programming language used for implementing the modules.
  • TensorFlow/PyTorch: For building and training machine learning models.
  • Flask: To create the API for the backend (a minimal endpoint sketch follows this list).
  • HTML/CSS/Bootstrap: For the front-end interface.
  • JavaScript (React): To enhance user interactions and manage state in the front-end.
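
The sketch below shows how a Flask backend might expose one of the modules over HTTP; the route name, form field, and response shape are assumptions, not the repository's actual API:

    from flask import Flask, jsonify, request

    app = Flask(__name__)

    # Hypothetical endpoint; a real deployment would call the Dysarthric ASR module here.
    @app.route("/api/transcribe", methods=["POST"])
    def transcribe_endpoint():
        audio_file = request.files.get("audio")
        if audio_file is None:
            return jsonify({"error": "missing 'audio' file field"}), 400
        audio_file.save("upload.wav")
        return jsonify({"transcript": "placeholder"})

    if __name__ == "__main__":
        app.run(debug=True)

Under this layout, the React front-end would POST recorded audio to such an endpoint and render the returned JSON.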

How to Clone the Repository

To clone this repository, follow the steps below:

  1. Open your terminal and navigate to the directory where you want to clone the project.
  2. Run the following command:
    git clone https://github.com/your-username/assistive-speech-technology.git
    
Related Projects