speech-recognition

A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!

osc openvr speech-recognition openai speech-to-text obs whisper vrchat vrchat-avatars vrchat-tool vrchat-sdk3 vrchat-osc openai-whisper

Updated Jun 12, 2024
Python

heypoom / stable-diffusion-from-speech

Star

Generates a continuously morphing dynamic wallpaper from real-time speech input.

typescript ai vue speech-recognition stable-diffusion

Updated Jun 12, 2024
TypeScript

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Jun 12, 2024
Python

huggingface / distil-whisper

Star

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

audio speech-recognition whisper

Updated Jun 12, 2024
Python

jaoafa / ChatWatcher

Star

🗣 Discord voice-chat speech recognition

discord-bot voice-recognition speech-recognition discord-voice

Updated Jun 12, 2024
Java

mariomastrandrea-poli / payments-vocal-assistant

Star

Official repository of my Master's Thesis project: "Developing an AI-Powered Voice Assistant for an iOS Payment App"

nlp swift ios machine-learning text-to-speech deep-learning text-classification tensorflow speech-synthesis speech-recognition speech-to-text ios-development swiftui tensorflow-lite bert-model bert-fine-tuning

Updated Jun 12, 2024
Swift

lhotse-speech / lhotse

Star

Tools for handling speech data in machine learning projects.

audio python data machine-learning ai deep-learning speech pytorch speech-recognition kaldi

Updated Jun 12, 2024
Python

stefantaubert / pinyin-to-ipa

Star

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

linguistics tts speech-synthesis pinyin speech-recognition cyrillic chinese phonetics transcription bopomofo zhuyin international-phonetic-alphabet

Updated Jun 12, 2024
Python

KevKibe / African-Whisper

Star

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Jun 12, 2024
Python

rizkyyanuark / PrediksiDepression-DataSpeech

Sponsor

Star

Repositori ini berisi proyek deteksi dini depresi menggunakan MFCC dan CNN dalam aplikasi kesehatan mental.

speech-recognition cnn-classification mental-health-awareness

Updated Jun 12, 2024
Jupyter Notebook

argmaxinc / WhisperKit

Star

Swift native on-device speech recognition with Whisper for Apple Silicon

macos swift ios watchos transformers inference speech-recognition pretrained-models whisper visionos

Updated Jun 12, 2024
Swift

mmpneo / curses

Star

Speech to Text and KB input captions for OBS, VRChat, Twitch chat and Discord

windows text-to-speech twitch captions speech-recognition speech-to-text obs vrchat tauri

Updated Jun 12, 2024
TypeScript

Amir-Hofo / Speech-commands-Classification

Star

In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.

machine-learning ai deep-learning cnn pytorch artificial-intelligence speech-recognition convolutional-neural-networks speech-to-text audio-classification torchaudio speech-classification

Updated Jun 12, 2024

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-recognition

Here are 4,656 public repositories matching this topic...

openvinotoolkit / openvino

huggingface / transformers

voicegain / python-sdk

deepgram / deepgram-python-sdk

Detilisi / Umbrella

piaseckijulian / Sentinel

Edw590 / VISOR---A-Virtual-Assistant

I5UCC / VRCTextboxSTT

heypoom / stable-diffusion-from-speech

speechbrain / speechbrain

huggingface / distil-whisper

jaoafa / ChatWatcher

mariomastrandrea-poli / payments-vocal-assistant

lhotse-speech / lhotse

stefantaubert / pinyin-to-ipa

KevKibe / African-Whisper

rizkyyanuark / PrediksiDepression-DataSpeech

argmaxinc / WhisperKit

mmpneo / curses

Amir-Hofo / Speech-commands-Classification

Improve this page

Add this topic to your repo