Topic

Speech

11 items across the graph — tagged with Speech.

From the graph · 11

repo
modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

model
hexgrad/Kokoro-82M

Hugging Face model with 6436 likes. Tags: text-to-speech, en, arxiv:2306.07691, arxiv:2203.02395, base_model:yl4579/StyleTTS2-LJSpeech, base_model:finetune:yl45…

model
openai/whisper-large-v3

Hugging Face model with 5908 likes. Tags: transformers, pytorch, jax, safetensors, whisper, automatic-speech-recognition, audio, hf-asr-leaderboard, en, zh

repo
huggingface/speech-to-speech

Build local voice agents with open-source models

model
coqui/XTTS-v2

Hugging Face model with 3633 likes. Tags: coqui, text-to-speech, license:other, region:us

model
openai/whisper-large-v3-turbo

Hugging Face model with 3127 likes. Tags: transformers, safetensors, whisper, automatic-speech-recognition, audio, en, zh, de, es, ru

repo
pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

model
nari-labs/Dia-1.6B

Hugging Face model with 2884 likes. Tags: safetensors, model_hub_mixin, pytorch_model_hub_mixin, text-to-speech, en, arxiv:2305.09636, license:apache-2.0, regio…

repo
BryceWG/BiBi-Keyboard

说点啥(BiBi Keyboard):一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method keyboard application for the Android platform based on Kotlin

repo
ChaitanyaEswarRajeshJakki/gemini-youtube-automation

A fully autonomous AI Agent/Python pipeline that utilizes Large Language Models (LLMs) like Gemini to generate content, produce videos, and automatically upload…

repo
AssemblyAI/assemblyai-node-sdk

The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio…

Related topics