• Stars
    star
    143
  • Rank 257,007 (Top 6 %)
  • Language
    Python
  • License
    MIT License
  • Created about 2 years ago
  • Updated 6 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Podalize: Podcast Transcription and Analysis

Podalize: Podcast Transcription and Analysis

This GitHub repository contains a Streamlit app that allows users to transcribe podcasts and video/audio content, as well as perform text analysis on the transcript. The app uses OpenAI's Whisper for transcription and Pyannote.audio for speaker diarization. Users have the option to manually enter speaker names and the app works with YouTube URLs, audio URLs, and MP3 files. The app outputs spoken time, a word cloud per speaker, and a transcript of the audio, and the results can be downloaded as a PDF file.

Sample episode

usage

How to install

Note: This code was only tested on Ubuntu 20.04.5 LTS.

  • Install Anaconda

  • Clone/download this repo to your local machine.

  • Get a pyannote.adudio access token by following the instructions: here

  • Launch anaconda prompt and navigate to the repo on your local machine

  • Create a conda environment from environment.yml

$ conda env create -f environment.yml
  • Activate the conda environment
$ conda activate podalize
  • Run streamlit app
$ streamlit run podalize_app.py

Tips

Usage

Either upload a .mp3 file or provide a YouTube/Podcast URL for transcription and analysis.

Refrencess

Contributions Welcome

TODo

  • running the app on windows and macos
  • dockerize