Speech
11 items across the graph — tagged with Speech.
From the graph · 11
ModelScope: bring the notion of Model-as-a-Service to life.
Hugging Face model with 6436 likes. Tags: text-to-speech, en, arxiv:2306.07691, arxiv:2203.02395, base_model:yl4579/StyleTTS2-LJSpeech, base_model:finetune:yl45…
Hugging Face model with 5908 likes. Tags: transformers, pytorch, jax, safetensors, whisper, automatic-speech-recognition, audio, hf-asr-leaderboard, en, zh
Build local voice agents with open-source models
Hugging Face model with 3633 likes. Tags: coqui, text-to-speech, license:other, region:us
Hugging Face model with 3127 likes. Tags: transformers, safetensors, whisper, automatic-speech-recognition, audio, en, zh, de, es, ru
Data manipulation and transformation for audio signal processing, powered by PyTorch
Hugging Face model with 2884 likes. Tags: safetensors, model_hub_mixin, pytorch_model_hub_mixin, text-to-speech, en, arxiv:2305.09636, license:apache-2.0, regio…
说点啥(BiBi Keyboard):一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method keyboard application for the Android platform based on Kotlin
A fully autonomous AI Agent/Python pipeline that utilizes Large Language Models (LLMs) like Gemini to generate content, produce videos, and automatically upload…
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio…
