This project is a Python-based tool designed to:
- Download audio from YouTube videos.
- Transcribe the audio using the AssemblyAI API.
- Generate a word cloud from the transcription.
- Integrate with Excel for user inputs and outputs using xlwings.
- Run the Python script directly from a button in the Excel interface and display the process in designated cells.
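As a rough illustration of the first step in the list above, downloading audio with pytube might look like the sketch below. The function name `download_audio`, the `youtube_cls` parameter, and the fixed file name `audio.mp3` are assumptions made for this example and are not taken from `transcriber1.py`.

```python
from pathlib import Path


def download_audio(url, output_dir=".", youtube_cls=None):
    """Download the first audio-only stream of a YouTube video.

    Returns a (title, saved_path) tuple.
    """
    if youtube_cls is None:
        # Imported lazily so a stand-in class can be passed in for testing.
        from pytube import YouTube as youtube_cls

    video = youtube_cls(url)
    stream = video.streams.filter(only_audio=True).first()
    if stream is None:
        raise RuntimeError(f"No audio-only stream found for {url}")

    # Assumption: a fixed file name. pytube writes the stream bytes unchanged,
    # so the ".mp3" suffix is a label here, not a real re-encode.
    saved = stream.download(output_path=str(output_dir), filename="audio.mp3")
    return video.title, Path(saved)
```

The returned title is the kind of value that could be written to the title cell in the workbook. If a true MP3 is required, a separate conversion step would be needed, since this sketch only saves the stream as served.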
- YouTube Audio Downloader: Extracts and converts YouTube videos to MP3 audio files.
- Audio Transcription: Uploads audio to AssemblyAI and retrieves a transcription.
- Word Cloud Generator: Creates a graphical word cloud representation of the transcription.
- Excel Integration: Uses an Excel spreadsheet as a simple user interface.
- Excel Macro Integration: Runs the Python script directly from a button in Excel.
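The transcription feature can be sketched as three calls to AssemblyAI's v2 REST API: upload the file, request a transcript, then poll until it is done. This is a minimal sketch, not the project's actual code; the function name `transcribe` and the `session` and `poll_seconds` parameters are assumptions introduced for illustration.

```python
import time

API_BASE = "https://api.assemblyai.com/v2"


def transcribe(audio_path, api_key, session=None, poll_seconds=3.0):
    """Upload a local audio file to AssemblyAI and return the transcript text."""
    if session is None:
        # Imported lazily so a stand-in session can be passed in for testing.
        import requests
        session = requests.Session()
    headers = {"authorization": api_key}

    # 1. Upload the raw bytes; the response carries an upload URL.
    with open(audio_path, "rb") as audio:
        upload = session.post(f"{API_BASE}/upload", headers=headers, data=audio)
    upload.raise_for_status()
    audio_url = upload.json()["upload_url"]

    # 2. Request a transcript for that URL.
    job = session.post(
        f"{API_BASE}/transcript", headers=headers, json={"audio_url": audio_url}
    )
    job.raise_for_status()
    transcript_id = job.json()["id"]

    # 3. Poll until the job completes or reports an error.
    while True:
        poll = session.get(f"{API_BASE}/transcript/{transcript_id}", headers=headers)
        poll.raise_for_status()
        body = poll.json()
        if body["status"] == "completed":
            return body["text"]
        if body["status"] == "error":
            raise RuntimeError(body.get("error", "transcription failed"))
        time.sleep(poll_seconds)
```

Polling keeps the script simple and synchronous, which suits a tool that handles one video per button click. The loop has no overall timeout, so a production version would likely want one.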
Ensure the following Python libraries are installed:
- requests
- xlwings
- pytube
- wordcloud
Install them using:
pip install requests xlwings pytube wordcloud
- Microsoft Excel (for xlwings and macro integration).
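Before opening the workbook, it can help to confirm that the four libraries are importable from the interpreter Excel will use. The helper below is a small sketch for that purpose; `missing_packages` and `REQUIRED` are names chosen for this example.

```python
from importlib.util import find_spec

# The four libraries listed above; each import name matches its pip name.
REQUIRED = ["requests", "xlwings", "pytube", "wordcloud"]


def missing_packages(names=REQUIRED):
    """Return the entries of `names` that cannot be imported in this environment."""
    return [name for name in names if find_spec(name) is None]
```

Calling `missing_packages()` with no arguments returns an empty list when everything is installed, or the names that still need `pip install`.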
- transcriber1.py: Main Python script.
- transcriber1.xlsm: Excel file for user interaction and macro execution.
- Clone the repository:
  git clone https://github.com/sahilgoyal7214/VocalTranscriber
  cd VocalTranscriber
- Install the required Python libraries:
  pip install -r requirements.txt
- Open the Excel file (transcriber1.xlsm). The following features are pre-configured:
  - A button to run the Python script.
  - Cells for inputs and outputs:
    - YOUTUBE_URL: Enter the YouTube video URL.
    - API_KEY: Provide your AssemblyAI API key.
    - TRANSCRIBE: Set to True to enable transcription.
    - WORDCLOUD: Set to True to generate a word cloud.
    - STATUS_CELL: Displays the current status of the process.
    - TITLE_CELL: Displays the YouTube video title.
- Use the button in the Excel interface to trigger the Python script. The process and results will be displayed in the designated cells.
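To make the cell-based interface above concrete, here is a sketch of how a script could read the inputs and report status through an xlwings sheet. The cell addresses in `CELLS` are hypothetical, since the workbook layout is not described here, and the helper names `as_flag`, `read_inputs`, and `set_status` are assumptions made for this example.

```python
# Hypothetical cell addresses; the real workbook layout may differ.
CELLS = {
    "YOUTUBE_URL": "B1",
    "API_KEY": "B2",
    "TRANSCRIBE": "B3",
    "WORDCLOUD": "B4",
    "STATUS_CELL": "B5",
    "TITLE_CELL": "B6",
}


def as_flag(value):
    """Interpret a cell value as a boolean; Excel may hand back True or "True"."""
    if isinstance(value, str):
        return value.strip().lower() == "true"
    return bool(value)


def read_inputs(sheet, cells=CELLS):
    """Read the four input cells from an xlwings-style sheet object."""
    return {
        "url": (sheet.range(cells["YOUTUBE_URL"]).value or "").strip(),
        "api_key": (sheet.range(cells["API_KEY"]).value or "").strip(),
        "transcribe": as_flag(sheet.range(cells["TRANSCRIBE"]).value),
        "wordcloud": as_flag(sheet.range(cells["WORDCLOUD"]).value),
    }


def set_status(sheet, message, cells=CELLS):
    """Write a progress message to the status cell."""
    sheet.range(cells["STATUS_CELL"]).value = message
```

Assuming the button's macro calls Python through xlwings' `RunPython`, the sheet could be obtained with `xw.Book.caller().sheets[0]`, and `set_status(sheet, "Downloading...")` would then update the workbook while the script runs.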
- Enter the required data in the Excel file.
- Click the button in the Excel file to trigger the script.
- Outputs:
  - Transcribed text will be saved as a .txt file in the project directory.
  - Word cloud will be saved as a .png file in the project directory.
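The two output files described above could be produced along these lines. This is a sketch only: naming the files after the video title, the helper names, and the image size and background colour are all assumptions made for this example.

```python
import re
from pathlib import Path


def safe_stem(title):
    """Turn a video title into a file-system-friendly name."""
    stem = re.sub(r"[^A-Za-z0-9._-]+", "_", title).strip("_")
    return stem or "untitled"


def save_transcript(text, title, directory="."):
    """Write the transcription to <title>.txt and return the path."""
    path = Path(directory) / f"{safe_stem(title)}.txt"
    path.write_text(text, encoding="utf-8")
    return path


def save_wordcloud(text, title, directory="."):
    """Render a word cloud of `text` to <title>.png and return the path."""
    # Imported lazily so the text helpers work without wordcloud installed.
    from wordcloud import WordCloud

    path = Path(directory) / f"{safe_stem(title)}.png"
    cloud = WordCloud(width=800, height=400, background_color="white")
    cloud.generate(text).to_file(str(path))
    return path
```

Sanitising the title avoids failures when a video name contains characters such as `/` or `:` that are not valid in file names on some systems.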
- Requires an active internet connection for downloading videos and interacting with AssemblyAI.
- Processes one video at a time.
Contributions are always welcome!
See contributing.md for ways to get started.