Skip to main content

Search

Items tagged with: whisper


🏴‍☠️ Current source code 👇
https://codeberg.org/TheCoffeMaker/WhisperBot

#XMPP #Whisper #bot


@Mara It's deepl.com. At least I think this is what @Paweł Masarczyk mentioned.
More about #whisper. I like the fact it can all be run locally.


How accurate is transcribing with openAI’s #Whisper technology? I tried it on myself but now want to try it on something more substantial. anybody want to share experiences?


I'm sure everyone who wants to know about this already does but, just in case anyone has, particularly if #blind or #DeafBlind, been looking for a local method of converting speech to text ... Whisper is an ML model from OpenAI which allows doing that. It can be used accessibly with all screen readers on Windows. Obviously, this is great for those of us with impaired hearing, it is certainly far more accurate than any of the speech to text programs I've seen, needs no training, and can handle background noise quite well. The audio duration limits are set by your hard drive space and the amount of time you're willing to put into transcription, I've transcribed several hours of audio without difficulty, it just takes time. It's available on Windows using https://github.com/Softcatala/whisper-ctranslate2 which just seems to need python. A GPU makes it faster, but it's usable on an I5 CPU. The model is also available online at https://freesubtitles.ai though that requires payment or waiting for long periods to transcribe limited amounts of audio. Thanks to @Bryn@mindly.social for the pointer at whisper-ctranslate2. #whisper #SpeechToText


Been dabbling a bit with the amazing #whisper port to C++ coded by @ggerganov. Very good results in both Dutch and English, amazing to see how fast progress has been made on speech to text in the last few years. For those who have got it running and are looking for some tooling: i wrote some Python terminal wrappers for easy use (including converting media using ffmpeg) and converting the SRT files to other formats: https://github.com/hay/audio2text/


Mindblowing 🤯

#Whisper is an #openSource #speechRecognition model written in #Python by #OpenAI. I’ve just seen it in action. Extract an #mp3 from a video, run it through Whisper, and it turns every spoken word into text. It even does a very decent job in #Danish. Perfect for subtitling #TV and #video. I am very impressed.

https://github.com/openai/whisper

#ai #language #transcription #speechToText