Skip to content
Change the repository type filter

All

    Repositories list

    • icefall

      Public
      Python
      Apache License 2.0
      294000Updated Oct 29, 2024Oct 29, 2024
    • Website and documentation
      HTML
      211612Updated Oct 26, 2024Oct 26, 2024
    • Resources that make every language unique
      Apache License 2.0
      0600Updated Oct 26, 2024Oct 26, 2024
    • Dart
      Apache License 2.0
      4154120Updated Oct 26, 2024Oct 26, 2024
    • vosk-api

      Public
      Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
      Jupyter Notebook
      Apache License 2.0
      1.1k8k45433Updated Oct 24, 2024Oct 24, 2024
    • SDDPM

      Public
      [WACV 2024] Spiking Denoising Diffusion Probabilistic Models
      Python
      6000Updated Oct 9, 2024Oct 9, 2024
    • Russian speech technology links
      Apache License 2.0
      1420900Updated Sep 7, 2024Sep 7, 2024
    • WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
      Python
      Apache License 2.0
      248919766Updated Aug 31, 2024Aug 31, 2024
    • kaldi

      Public
      An official git mirror of Kaldi project SVN repo
      Shell
      Other
      5.3k5102Updated Aug 23, 2024Aug 23, 2024
    • clapack

      Public
      CLAPACK clone for our builds
      C
      Other
      8210Updated Aug 23, 2024Aug 23, 2024
    • openfst

      Public
      Openfst mirror with some fixes
      C++
      Other
      131020Updated Aug 23, 2024Aug 23, 2024
    • Faster Whisper ASR transcription with CTranslate2
      Python
      MIT License
      1k000Updated Aug 19, 2024Aug 19, 2024
    • Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
      C++
      Apache License 2.0
      410500Updated Aug 12, 2024Aug 12, 2024
    • A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
      Apache License 2.0
      226200Updated Aug 11, 2024Aug 11, 2024
    • vosk-tts

      Public
      Text To Speech Synthesis with Vosk
      Python
      Apache License 2.0
      18127170Updated Aug 3, 2024Aug 3, 2024
    • Speech Recognition in Asterisk with Vosk Server
      C
      GNU General Public License v2.0
      41105163Updated Jun 21, 2024Jun 21, 2024
    • RHVoice

      Public
      a free and open source speech synthesizer for Russian and other languages
      C++
      GNU General Public License v2.0
      230200Updated May 28, 2024May 28, 2024
    • Python
      Apache License 2.0
      0000Updated Apr 24, 2024Apr 24, 2024
    • TTS

      Public
      🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
      Python
      Mozilla Public License 2.0
      4.3k100Updated Apr 8, 2024Apr 8, 2024
    • ffmpeg

      Public
      C
      Other
      12k000Updated Apr 1, 2024Apr 1, 2024
    • 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
      Python
      MIT License
      4k100Updated Mar 20, 2024Mar 20, 2024
    • aiortc

      Public
      WebRTC and ORTC implementation for Python using asyncio
      Python
      BSD 3-Clause "New" or "Revised" License
      762000Updated Dec 13, 2023Dec 13, 2023
    • Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
      Python
      Other
      117100Updated Dec 5, 2023Dec 5, 2023
    • aioice

      Public
      asyncio-based Interactive Connectivity Establishment (RFC 5245)
      Python
      BSD 3-Clause "New" or "Revised" License
      51000Updated Nov 27, 2023Nov 27, 2023
    • Offline speech recognition for Android with Vosk library.
      Java
      Apache License 2.0
      203751685Updated Nov 24, 2023Nov 24, 2023
    • Application of MB-iSTFT-VITS components to vits2_pytorch
      Python
      MIT License
      27400Updated Oct 29, 2023Oct 29, 2023
    • Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
      Python
      14000Updated Oct 20, 2023Oct 20, 2023
    • OpenAI Whisper Prompt Examples
      Apache License 2.0
      23900Updated Jul 17, 2023Jul 17, 2023
    • piper

      Public
      A fast, local neural text to speech system
      C++
      MIT License
      470300Updated Jun 15, 2023Jun 15, 2023
    • lhotse

      Public
      Tools for handling speech data in machine learning projects.
      Python
      Apache License 2.0
      217000Updated May 28, 2023May 28, 2023