Topic

Speech

11 items across the graph — tagged with Speech.

From the graph · 11

ModelScope: bring the notion of Model-as-a-Service to life.

Hugging Face model with 6436 likes. Tags: text-to-speech, en, arxiv:2306.07691, arxiv:2203.02395, base_model:yl4579/StyleTTS2-LJSpeech, base_model:finetune:yl45…

→model

openai/whisper-large-v3

Hugging Face model with 5908 likes. Tags: transformers, pytorch, jax, safetensors, whisper, automatic-speech-recognition, audio, hf-asr-leaderboard, en, zh

→repo

huggingface/speech-to-speech

Build local voice agents with open-source models

→model

coqui/XTTS-v2

Hugging Face model with 3633 likes. Tags: coqui, text-to-speech, license:other, region:us

→model

openai/whisper-large-v3-turbo

Hugging Face model with 3127 likes. Tags: transformers, safetensors, whisper, automatic-speech-recognition, audio, en, zh, de, es, ru

→repo

pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

→model

nari-labs/Dia-1.6B

Hugging Face model with 2884 likes. Tags: safetensors, model_hub_mixin, pytorch_model_hub_mixin, text-to-speech, en, arxiv:2305.09636, license:apache-2.0, regio…

→repo

BryceWG/BiBi-Keyboard

说点啥（BiBi Keyboard）:一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method keyboard application for the Android platform based on Kotlin

→repo

ChaitanyaEswarRajeshJakki/gemini-youtube-automation

A fully autonomous AI Agent/Python pipeline that utilizes Large Language Models (LLMs) like Gemini to generate content, produce videos, and automatically upload…

→repo

AssemblyAI/assemblyai-node-sdk

The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio…

→

From the graph · 11

Related topics