• Stars
    star
    677
  • Rank 66,694 (Top 2 %)
  • Language
    Swift
  • License
    GNU General Publi...
  • Created almost 2 years ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

Whisperboard

An iOS app for recording and transcribing audio on the go, based on OpenAI's Whisper model.



Twitter: @sa1k0s Commit Activity License Powered by Tuist

Features

  • Easy-to-use voice recording and playback
  • Transcription of recorded audio using Whisper from OpenAI
  • Import and export audio files
  • Select microphone for recording
  • Model selection screen with the ability to download any Whisper model

Future Plans

  • Optimize the transcription process by eliminating silent portions of audio, which can reduce the processing time and improve overall efficiency.
  • Implement resumable transcription so that users can continue transcribing after the app has been terminated during the transcription process.
  • Enable background transcription when the app is minimized, allowing users to perform other tasks while the transcription proceeds.
  • Improve visualization of the current state of the transcription process, providing users with a clearer understanding of its progress.
  • Provide an estimated time remaining for the transcription to complete, helping users plan accordingly.
  • Introduce a transcription queue that allows users to select multiple files for transcription, streamlining the process for bulk transcriptions.
  • Implement real-time transcription using smaller, more efficient models, offering users faster results.
  • Offer an additional remote transcription option to enhance user experience with larger models, potentially providing more accurate transcriptions or faster processing times.

Installation

  1. Clone this repository
  2. Run make
  3. Open the project in Xcode

License

This project is licensed under the GPL-3.0 license.

Links

Buy Me A Coffee