Skip to content

Ghost Pepper

★ New
assess
AI / ML open-source MIT open-source

At a Glance

MIT-licensed macOS menu bar app for Apple Silicon that combines WhisperKit speech recognition and a local Qwen LLM (via LLM.swift) for fully on-device hold-to-talk dictation with filler-word cleanup.

Type
open-source
Pricing
open-source
License
MIT
Adoption fit
small

Ghost Pepper

Source: matthartman/ghost-pepper | License: MIT | Type: open-source

What It Does

Ghost Pepper is a macOS menu bar application for Apple Silicon that provides fully local, hold-to-talk speech-to-text dictation. The interaction model is simple: hold the Control key to record, release to transcribe and paste into the active application. No data leaves the machine. Transcriptions are never written to disk; debug logs are in-memory only.

The application runs a two-stage pipeline: WhisperKit performs speech recognition (model options from 75 MB to 1.4 GB), and a small Qwen LLM via LLM.swift post-processes the raw transcript to strip filler words and resolve self-corrections before pasting the cleaned result. The cleanup prompt is user-customizable via a Settings panel. Ghost Pepper is positioned as a free, open-source alternative to SuperWhisper and similar commercial dictation tools.

Key Features

  • Hold-to-talk interface: Control key triggers recording; release triggers transcription and paste
  • Fully local pipeline: WhisperKit (transcription) + Qwen via LLM.swift (cleanup), no cloud API calls
  • Multiple WhisperKit model sizes: tiny.en (~75 MB), small.en (~466 MB), multilingual variants, Parakeet v3 (~1.4 GB)
  • Multiple Qwen cleanup model sizes: 0.8B (~535 MB, 1–2s), 2B (~1.3 GB, 2–4s), 4B (~2.8 GB, 5–7s)
  • Customizable cleanup prompt — can be tuned to specific cleanup styles or professional vocabularies
  • Menu bar operation with no dock icon; launches at login by default
  • Microphone selection and per-feature toggles in Settings
  • Enterprise MDM support: Accessibility permissions can be pre-authorized via MDM profile (bundle ID com.github.matthartman.ghostpepper, Team ID BBVMGXR9AY)
  • All model files cached locally after one-time Hugging Face download

Use Cases

  • Use case 1: Developers, writers, or journalists who dictate frequently and want no cloud subscription or privacy exposure
  • Use case 2: Professionals handling sensitive content (legal, medical, internal communications) where audio data must not leave the device
  • Use case 3: Evaluating the WhisperKit + local LLM cleanup pattern for a more polished production app

Adoption Level Analysis

Small teams (<20 engineers): Good fit as a personal productivity tool. Zero cost, MIT license, Apple Silicon requirement is not a barrier for teams standardized on modern Macs. Self-service — no ops required beyond granting microphone and accessibility permissions.

Medium orgs (20–200 engineers): Limited fit as an organizational standard. No MDM distribution packaging beyond the accessibility permission pre-authorization workaround. IT departments will want a signed, notarized app with a clear update story — Ghost Pepper’s update mechanism (Sparkle) is present but the project is a personal side project without a published release cadence.

Enterprise (200+ engineers): Does not fit. No enterprise support, no SLA, no audit logging, single maintainer. Commercial alternatives (SuperWhisper, VoiceInk) or managed STT APIs are more appropriate.

Alternatives

AlternativeKey DifferencePrefer when…
SuperWhisperPolished commercial app, tiered pricingYou need reliability and polish over free-and-rough
VoiceInk$25 one-time, local Whisper + cloud model optionYou want a maintained product with one-time payment
MacWhisperFile transcription focus, not real-time dictationYour use case is transcribing recordings, not live dictation
HandyRust-based infrastructure, multi-platform, auto LLM post-processingYou want a more robust local app with similar cleanup feature
TypeWhisperSimilar on-device Whisper dictation, macOSYou want another FOSS option

Evidence & Sources

Notes & Caveats

  • Single-maintainer, early-stage project. The author self-describes the project as “rough.” No published roadmap, no issue triage SLA, no organizational backing. Do not treat this as production infrastructure.
  • Prompt injection failure mode. When transcribed speech resembles an AI instruction (e.g., “create tests and ensure all tests pass”), the Qwen cleanup model may attempt to execute the instruction rather than clean the transcript. The default system prompt does not guard against this. Users in developer contexts are especially susceptible. Customizing the cleanup prompt in Settings partially mitigates this.
  • Model download on first use. Models are downloaded from Hugging Face at first launch, which requires internet access and raises supply-chain questions for high-security environments.
  • Apple Silicon only. Requires macOS 14.0+ and M1 or newer. Intel Mac users are excluded.
  • Saturated market. The macOS Whisper-dictation category is crowded with near-identical apps. Ghost Pepper’s LLM cleanup step is a genuine but small differentiator. The r/macapps community has designated this category “saturated” and requires clear differentiation in new submissions.
  • Paste mechanism requires Accessibility permission. Ghost Pepper simulates keystrokes to paste, which requires granting Accessibility permissions — a permission that many enterprise security policies restrict or monitor.

Related