Python Text Recognition

Pocket TTS: High-Quality Local Voice Cloning Without GPU

Pocket TTS delivers high-quality text-to-speech on standard CPUs. No GPU, no cloud APIs. It is the first local TTS with voice ...

New AI model enables native speakers and foreign learners to read undiacritized Arabic texts with greater fluency

Reading an Arabic newspaper, a book, or academic prose fluently, whether digital or in print, remains challenging for many ...

GitHub

adelbeke/react-native-speech-to-text

A powerful, easy-to-use React Native library for real-time speech-to-text conversion. Built with the New Architecture (Turbo Modules) for optimal performance on both iOS and Android.

IEEE

Self-Supervised Discovery of Cross-Lingual Shared Knowledge for Continual Text Recognition

Abstract: Incremental multilingual text recognition (IMLTR) aims to advance continual learning by retaining knowledge from previously learned languages while adapting to new ones. Existing methods ...

GitHub

Android OCR Text Recognition Scanner – Optical Character Recognition for Android (ML Kit, Tesseract, Cloud Vision)

Whether you want to build a document scanner, digitize receipts, or add text recognition to your mobile app, this project is a perfect starting point. This project is provided for educational and ...

IEEE

Multi-Level Interaction for Emotion Recognition from Unaligned Speech and Text

Abstract: In multimodal emotion recognition, the diversity and temporal unalignment of speech and text modalities pose significant challenges for effective fusion. To address this issue, Multi-level ...

Frontiers

Deep learning-enabled hybrid systems for accurate recognition of text in seal images

Chinese seals are widely used in various fields within Chinese society as a tool for certifying legal documents. However, recognizing text on these seals presents challenges due to background text, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results