Speech Recognition Python

New research reveals how the brain separates speech into words

Speech blurs together unless you know the language; scientists found the brain signal that separates the words ...

XDA Developers on MSN

Whisper transcribes my voice notes faster than I can type, and it runs entirely offline

I'd rather keep voice notes to myself.

Machine Learning Using Python: A Complete Learning Path With Practical Projects

Machine learning is an essential component of artificial intelligence. Whether it’s powering recommendation engines, fraud detection systems, self-driving cars, generative AI, or any of the countless ...

Blavity on MSN

Spelman students develop PlantGPT, an AI system that allows verbal communication with plants

Students at Spelman College have developed an artificial intelligence system that allows anyone to communicate verbally with ...

IEEE

Lip Enhancement and Multi-View Simulation for Robust Visual Speech Recognition in MAVSR 2025

Abstract: In this paper, we present our work for Visual Speech Recognition (VSR) in the Mandarin Audio-Visual Speech Recognition (MAVSR) Challenge 2025, with a particular focus on improving lipreading ...

Awesome DIY Offline Raspberry Pi Al Chatbot is Now Faster

Keep a Raspberry Pi AI chatbot responsive by preloading the LLM and offloading with Docker, reducing first reply lag for ...

OSTechNix

Pocket TTS: High-Quality Local Voice Cloning Without GPU

Pocket TTS delivers high-quality text-to-speech on standard CPUs. No GPU, no cloud APIs. It is the first local TTS with voice ...

IEEE

Convolution-Augmented Transformers for Enhanced Speaker-Independent Dysarthric Speech Recognition

Abstract: Dysarthria is a motor speech disorder characterized by muscle movement difficulties that complicate verbal communication. It poses significant challenges to Automatic Speech Recognition (ASR ...

GitHub

offline-transcription

"An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind ...

GitHub

offline-transcription

An advanced study tool that transforms raw audio recordings and PDF slides into structured, professional LaTeX university notes. Powered by fast local transcription (Whisper) and Google Gemini AI for ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results