LocalVocal: Local Live Captions & Translation On-the-Go

LocalVocal: Local Live Captions & Translation On-the-Go v0.3.5

Much rejoicing! Introducing AMD ROCm / hipBLAS acceleration, as well as CoreML and Metal on Apple!
This was tested to give tremendous performance boosts on Apple and AMD GPUs.

Also in this release:
  • bumping to the latest whisper.cpp
  • fixing the captions stream issue on Twitch and YouTube
  • fixing the translation captions disappearance.
Enjoy and let me know of any problems!

Download:
  • Like
Reactions: Tallicia and uuoocl
Big release! Many improvements, solving a lot of bugs and introducing far better performance.
In points:
  • VAD based segmentation - no more "3 seconds segments"! when speech is detected the transcription kicks in
  • Incremental output: The captions will appear continuously as you speak, a live transcription effect
  • Bumping the version of whisper.cpp
  • Many more options for real-time translation
  • A lot of bug fixing...
Download
Introducing: CUDA for Windows!
Now compiling Whisper.cpp vs. CUDA 11 and 12 for GPU accelerated runtime on Windows, which can result in ~x5-x10 faster inference.
With GPU acceleration you can now go for bigger models like Medium and still have very fast transcription with incredible accuracy in many languages.

Also introducing MacOS Apple Silicon optimizations for M1/M2/M3 processors, which results in performance bumps and utilizes your hardware better.

Enjoy!

Download:
  • Like
Reactions: BenAndo
Top