Skip to content

Popular repositories

  1. FunASR FunASR Public

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。

    Python 3.7k 431

  2. FunClip FunClip Public

    Open-source, accurate and easy-to-use video clipping tool, LLM based AI clipping intergrated || 开源、精准、方便的视频切片工具,集成了大语言模型AI智能剪辑功能

    Python 1.5k 153

  3. 3D-Speaker 3D-Speaker Public

    A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

    Python 735 55

  4. KAN-TTS KAN-TTS Public

    KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

    Python 442 71

  5. FunCodec FunCodec Public

    FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

    Python 289 23

  6. former3d former3d Public

    Python 94 8

Repositories

Showing 10 of 22 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.