Python Speech Recognition

Speech Emotion Recognition using Branched-based Convolutional Neural Network Technique

Abstract: Speech Emotion Recognition is a significant pattern recognition of human speech using feature extraction for communication media. This paper aims to recognize speech emotion through the CNN ...

Scientific American

New research reveals how the brain separates speech into words

Speech blurs together unless you know the language; scientists found the brain signal that separates the words ...

North Penn Now

Machine Learning Using Python: A Complete Learning Path With Practical Projects

Machine learning is an essential component of artificial intelligence. Whether it’s powering recommendation engines, fraud detection systems, self-driving cars, generative AI, or any of the countless ...

Blavity on MSN

Spelman students develop PlantGPT, an AI system that allows verbal communication with plants

Students at Spelman College have developed an artificial intelligence system that allows anyone to communicate verbally with ...

eWeek

Mistral AI’s Voxtral Transcribe 2 Launch Breaks Sound Barrier

Voxtral Transcribe 2 consists of two speech-to-text models with transcription quality, diarization, and ultra-low latency.

Awesome DIY Offline Raspberry Pi Al Chatbot is Now Faster

Keep a Raspberry Pi AI chatbot responsive by preloading the LLM and offloading with Docker, reducing first reply lag for ...

OSTechNix

Pocket TTS: High-Quality Local Voice Cloning Without GPU

Pocket TTS delivers high-quality text-to-speech on standard CPUs. No GPU, no cloud APIs. It is the first local TTS with voice ...

IEEE

Convolution-Augmented Transformers for Enhanced Speaker-Independent Dysarthric Speech Recognition

Abstract: Dysarthria is a motor speech disorder characterized by muscle movement difficulties that complicate verbal communication. It poses significant challenges to Automatic Speech Recognition (ASR ...

GitHub

SSMD - Speech Synthesis Markdown

SSMD (Speech Synthesis Markdown) is a lightweight Python library that provides a human-friendly markdown-like syntax for creating SSML (Speech Synthesis Markup Language) documents. It's designed to ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results