mkdir models && cd models curl -LO https://github.com/thewh1teagle/kokoro-onnx/releases/download/model-files-v1.0/kokoro-v1.0.onnx curl -LO https://github.com ...
Abstract: Segmentation of defects in electroluminescence (EL) images of photovoltaic (PV) cells is critical for enabling automated quality inspection, as such defects directly impact energy yield, ...
Abstract: Multi-speaker text-to-speech (TTS) systems play a crucial role in different applications, such as personalized voice assistants, audiobooks, and multilingual speech synthesis. These systems ...
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including ...