Podcast RSS Whisper Fetcher

Data & Research web

Downloads the newest episodes from a podcast RSS feed and transcribes them on-device with whisper.cpp, producing a dataset containing metadata + transcript for each episode.

Live output preview

A plan is required to view this content

Choose a plan to access input format, sample outputs, and live previews.

View Plans →

About the skill

Podcast RSS Whisper Fetcher

A fully local collection pipeline that extracts transcripts from audio-only podcasts. It works from a single input (RSS feed URL + the newest N episodes): it parses the RSS feed (standard XML compatible with Apple/Spotify), downloads each episode's MP3, converts it to 16kHz mono WAV with ffmpeg, and transcribes it on-device with whisper.cpp. On Apple M-series chips, thanks to Metal GPU acceleration, it runs at ~40x real time (43x verified on M2).

When to use: When you want to batch-transcribe a podcast's latest episodes, mine patterns from founder/startup podcasts, archive academic lecture or conference recordings, or transcribe Turkish podcasts (--language tr + ggml-medium).

Output: For each episode, a markdown transcript file with frontmatter (transcripts/<episode_id>__<title>.md) and a _whisper-log.jsonl audit record tracking the collection process. This showcase presents the row-by-row result of a harvest run (episode metadata + transcript summary + performance/status fields) as a single dataset.

Advantage: Quota-free — no OpenAI or Apify API key required, zero API cost. The only cost is disk space (model + temporary WAV ~30MB/episode). With the resume feature it skips previously processed episodes; it automatically discards trailer/intro episodes shorter than 10 minutes.

How do I use this skill?

You don't "run" a skill — after installing it you just tell the agent your task (e.g. ask for the relevant job), and the skill kicks in by itself when its description matches.

Upload the podcast-rss-whisper-fetcher.zip you downloaded as-is — no packaging needed, the format is already correct (folder at root).

Open Settings → Customize → Skills
Upload → select the podcast-rss-whisper-fetcher.zip you downloaded
Claude reads SKILL.md; the name + description appear. Ready ✅

Scripts run in Anthropic's code-execution environment (sandbox) — not on your machine.