OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
-
Updated
Jun 12, 2024 - C++
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python SDK for working with Voicegain Speech-to-Text
Official Python SDK for Deepgram's automated speech recognition APIs.
A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.
V.I.S.O.R., my in-development assistant
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!
Generates a continuously morphing dynamic wallpaper from real-time speech input.
A PyTorch-based Speech Toolkit
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
🗣 Discord voice-chat speech recognition
Official repository of my Master's Thesis project: "Developing an AI-Powered Voice Assistant for an iOS Payment App"
Tools for handling speech data in machine learning projects.
Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
Repositori ini berisi proyek deteksi dini depresi menggunakan MFCC dan CNN dalam aplikasi kesehatan mental.
Swift native on-device speech recognition with Whisper for Apple Silicon
Speech to Text and KB input captions for OBS, VRChat, Twitch chat and Discord
In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."