Transcribe voice messages to text

Overview

This project automates the transcription and summarization of voice memos into text files. It's designed for people like me who record voice memos while walking and wish to integrate these recordings into a note-keeping app like Obsidian. The solution uses macOS's Automator to execute a script that processes these voice memos, leveraging technologies like ffmpeg, ollama, and whisper.cpp.

Functionality

Voice memos, recorded on either a phone or computer in m4a format, are synced to a macOS computer via iCloud. These files are located in ~/Library/Group Containers/group.com.apple.VoiceMemos.shared/Recordings/. A folder action triggers a Python script which:

Converts the m4a file to wav format.
ranscribes the audio.
.Summarizes the content using ollama.
Saves the output as a text file.

Limitations

Compatibility: Exclusively for macOS.
Testing: Confirmed functionality on MacBook Pro (M3).

Setup Guide

ffmpeg Installation: Run brew install ffmpeg.
Ollama Installation: Download from Ollama.
Whisper.cpp Installation: Follow the [Readme for Core ML support]((https://github.com/ggerganov/whisper.cpp?tab=readme-ov-file#core-ml-support).
Terminal Access Configuration:
- Navigate to System Preferences > Security & Privacy > Privacy.
- Go to Files and Folders.
- Grant Terminal access to the voice memos folder.
Automator Folder Action:
- Open Automator, select File > Open, and choose Transcribe.workflow from this repo.
- Save the workflow.
Script Activation:
- Go to ~/Library/Group Containers/group.com.apple.VoiceMemos.shared/.
- Right-click Recordings, select Services > Folder Actions Setup.
- Choose and confirm the Transcribe workflow.

Now all your voice memos will be transcribed locally.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Transcribe.workflow/Contents		Transcribe.workflow/Contents
tests		tests
.gitignore		.gitignore
README.md		README.md
transcribe.py		transcribe.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transcribe voice messages to text

Overview

Functionality

Limitations

Setup Guide

About

Releases

Packages

Languages

Thimm/transcribe_apple_voice_memos

Folders and files

Latest commit

History

Repository files navigation

Transcribe voice messages to text

Overview

Functionality

Limitations

Setup Guide

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages