Ghost Pepper
Source: matthartman/ghost-pepper | License: MIT | Type: open-source
What It Does
Ghost Pepper is a macOS menu bar application for Apple Silicon that provides fully local, hold-to-talk speech-to-text dictation. The interaction model is simple: hold the Control key to record, release to transcribe and paste into the active application. No data leaves the machine. Transcriptions are never written to disk; debug logs are in-memory only.
The application runs a two-stage pipeline: WhisperKit performs speech recognition (model options from 75 MB to 1.4 GB), and a small Qwen LLM via LLM.swift post-processes the raw transcript to strip filler words and resolve self-corrections before pasting the cleaned result. The cleanup prompt is user-customizable via a Settings panel. Ghost Pepper is positioned as a free, open-source alternative to SuperWhisper and similar commercial dictation tools.
Key Features
- Hold-to-talk interface: Control key triggers recording; release triggers transcription and paste
- Fully local pipeline: WhisperKit (transcription) + Qwen via LLM.swift (cleanup), no cloud API calls
- Multiple WhisperKit model sizes: tiny.en (~75 MB), small.en (~466 MB), multilingual variants, Parakeet v3 (~1.4 GB)
- Multiple Qwen cleanup model sizes: 0.8B (~535 MB, 1–2s), 2B (~1.3 GB, 2–4s), 4B (~2.8 GB, 5–7s)
- Customizable cleanup prompt — can be tuned to specific cleanup styles or professional vocabularies
- Menu bar operation with no dock icon; launches at login by default
- Microphone selection and per-feature toggles in Settings
- Enterprise MDM support: Accessibility permissions can be pre-authorized via MDM profile (bundle ID
com.github.matthartman.ghostpepper, Team IDBBVMGXR9AY) - All model files cached locally after one-time Hugging Face download
Use Cases
- Use case 1: Developers, writers, or journalists who dictate frequently and want no cloud subscription or privacy exposure
- Use case 2: Professionals handling sensitive content (legal, medical, internal communications) where audio data must not leave the device
- Use case 3: Evaluating the WhisperKit + local LLM cleanup pattern for a more polished production app
Adoption Level Analysis
Small teams (<20 engineers): Good fit as a personal productivity tool. Zero cost, MIT license, Apple Silicon requirement is not a barrier for teams standardized on modern Macs. Self-service — no ops required beyond granting microphone and accessibility permissions.
Medium orgs (20–200 engineers): Limited fit as an organizational standard. No MDM distribution packaging beyond the accessibility permission pre-authorization workaround. IT departments will want a signed, notarized app with a clear update story — Ghost Pepper’s update mechanism (Sparkle) is present but the project is a personal side project without a published release cadence.
Enterprise (200+ engineers): Does not fit. No enterprise support, no SLA, no audit logging, single maintainer. Commercial alternatives (SuperWhisper, VoiceInk) or managed STT APIs are more appropriate.
Alternatives
| Alternative | Key Difference | Prefer when… |
|---|---|---|
| SuperWhisper | Polished commercial app, tiered pricing | You need reliability and polish over free-and-rough |
| VoiceInk | $25 one-time, local Whisper + cloud model option | You want a maintained product with one-time payment |
| MacWhisper | File transcription focus, not real-time dictation | Your use case is transcribing recordings, not live dictation |
| Handy | Rust-based infrastructure, multi-platform, auto LLM post-processing | You want a more robust local app with similar cleanup feature |
| TypeWhisper | Similar on-device Whisper dictation, macOS | You want another FOSS option |
Evidence & Sources
- Ghost Pepper GitHub repository
- Hacker News — Show HN: Ghost Pepper community discussion
- Ghost Pepper as open-source SuperWhisper alternative — opensource.builders
- SuperWhisper alternatives comparison 2026 — Voibe
Notes & Caveats
- Single-maintainer, early-stage project. The author self-describes the project as “rough.” No published roadmap, no issue triage SLA, no organizational backing. Do not treat this as production infrastructure.
- Prompt injection failure mode. When transcribed speech resembles an AI instruction (e.g., “create tests and ensure all tests pass”), the Qwen cleanup model may attempt to execute the instruction rather than clean the transcript. The default system prompt does not guard against this. Users in developer contexts are especially susceptible. Customizing the cleanup prompt in Settings partially mitigates this.
- Model download on first use. Models are downloaded from Hugging Face at first launch, which requires internet access and raises supply-chain questions for high-security environments.
- Apple Silicon only. Requires macOS 14.0+ and M1 or newer. Intel Mac users are excluded.
- Saturated market. The macOS Whisper-dictation category is crowded with near-identical apps. Ghost Pepper’s LLM cleanup step is a genuine but small differentiator. The r/macapps community has designated this category “saturated” and requires clear differentiation in new submissions.
- Paste mechanism requires Accessibility permission. Ghost Pepper simulates keystrokes to paste, which requires granting Accessibility permissions — a permission that many enterprise security policies restrict or monitor.