How to use the voice-to-text assistant

Everything you need to know about using Ultramic

Getting Started with Ultramic

Ultramic is a speech-to-text transcription assistant that works across all your desktop applications. It offers two modes: Cloud (OpenAI) for maximum accuracy and Local (Whisper) for privacy and offline use.

We believe in a pay-as-you-go model for cloud usage, so you only pay for what you use. No subscriptions, no hidden fees. Bring your own API key and you are ready.

Choosing a Provider

Ultramic gives you the freedom to choose how your audio is processed:

  • OpenAI (Cloud): Best for accuracy and speed on all machines. Requires an API key. Small cost per minute.
  • Local Whisper (On-device): Best for privacy and offline use. Runs entirely on your computer. Free forever. Performance depends on your hardware (Mac with Apple Silicon recommended).

Option 1: OpenAI API (Cloud)

To use Ultramic with OpenAI, you need your own API key. Follow these steps to create one:

  1. Go to OpenAI's platform and create an account if you don't already have one.
  2. Once logged in, navigate to the API Keys section in your account dashboard.
  3. Click on "Create new secret key" and give it a name (e.g., "Ultramic").
  4. Copy the generated API key immediately (you won't be able to see it again).
  5. Open Ultramic and paste your API key in the settings.

Note: Keep your API key secure and never share it with others.
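If you want to confirm that a freshly created key is accepted before pasting it into Ultramic, a small script along the following lines may help. This is a minimal sketch, not part of Ultramic: it assumes OpenAI's "list models" endpoint (https://api.openai.com/v1/models) as a cheap authenticated request, and the function names build_key_check_request and key_is_accepted are hypothetical.

```python
import urllib.error
import urllib.request

# OpenAI's "list models" endpoint, used here only as a lightweight check.
MODELS_URL = "https://api.openai.com/v1/models"


def build_key_check_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) an authenticated GET request."""
    key = api_key.strip()  # stray whitespace from copy-paste is a common mistake
    if not key:
        raise ValueError("API key is empty")
    return urllib.request.Request(
        MODELS_URL,
        headers={"Authorization": f"Bearer {key}"},
        method="GET",
    )


def key_is_accepted(api_key: str, timeout: float = 10.0) -> bool:
    """Return True if the API answers with HTTP 200, False on an HTTP error.

    Network failures (no connection, DNS errors) are not caught here and
    will raise urllib.error.URLError.
    """
    request = build_key_check_request(api_key)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except urllib.error.HTTPError:
        return False  # for example, 401 when the key is rejected
```

Read the key from an environment variable or a prompt when you run this, rather than writing it into the script, so it does not end up in a file you might share.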

Why Do You Need Your Own API Key?

We've designed Ultramic with a pay-as-you-go model instead of a subscription for several reasons:

  • Cost Efficiency: You only pay for what you actually use. If you transcribe occasionally, your costs will be minimal.
  • Transparency: You have direct control over your usage and can monitor costs through your OpenAI dashboard.
  • Privacy: Your audio is sent to OpenAI under your own API key and account, giving you more control over your data.
  • No Subscriptions: We believe subscriptions are becoming overwhelming for users. Our approach eliminates another monthly payment.

OpenAI's Whisper API is very cost-effective: at around $0.006 per minute of audio, an hour of transcription typically costs about $0.36.

Option 2: Local Whisper (On-device)

If you prefer not to send data to the cloud or want to use Ultramic offline, you can use the Local Whisper provider. This runs the transcription model directly on your Mac.

  • Private: Your audio never leaves your device.
  • Free: No API keys or costs involved.
  • Offline: Works without an internet connection.

To enable it, select "Local Whisper" in the application settings. The first time you use it, Ultramic will download the necessary model file (approx. 150 MB to 500 MB, depending on the model quality selected).

Note: Local transcription is resource-intensive. It works best on Macs with Apple Silicon (M1/M2/M3 chips).
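For readers curious what on-device transcription looks like in practice, here is a minimal sketch using the open-source openai-whisper Python package. It is an illustration only and does not describe how Ultramic runs the model internally. The function name transcribe_locally and the load_model parameter are hypothetical, and the "base" model name is just one of the sizes that package offers. The package also needs ffmpeg installed to read audio files.

```python
def transcribe_locally(audio_path, model_name="base", load_model=None):
    """Transcribe an audio file on this machine and return the text.

    load_model is an optional hook (hypothetical, mainly for testing);
    by default the loader from the openai-whisper package is used.
    """
    if load_model is None:
        import whisper  # installed with: pip install openai-whisper
        load_model = whisper.load_model

    model = load_model(model_name)         # downloads the weights on first use
    result = model.transcribe(audio_path)  # runs locally, no network request
    return result["text"].strip()
```

Larger models are generally more accurate but slower and bigger to download, which is the same trade-off as the model quality setting described above.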

Shortcuts

Ultramic uses keyboard shortcuts to make transcription easy and accessible. You can keep the default shortcut or set your own:

  • Default Shortcut: Option+Space
  • Custom Shortcut: You can change the shortcut to your liking in the settings.

The shortcut works globally across your system, so you can use it in any application where you need to input text.
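To illustrate what a system-wide shortcut involves, the sketch below registers a global hotkey with the third-party pynput library. This is an assumption made for illustration, not the mechanism Ultramic uses. The helper names to_pynput_hotkey and listen_for_shortcut and the name mapping are hypothetical. On macOS the Option key is called alt in pynput.

```python
# Hypothetical mapping from key names as written in this guide
# to the names pynput expects inside angle brackets.
_PYNPUT_NAMES = {
    "option": "<alt>",
    "alt": "<alt>",
    "command": "<cmd>",
    "cmd": "<cmd>",
    "control": "<ctrl>",
    "ctrl": "<ctrl>",
    "shift": "<shift>",
    "space": "<space>",
}


def to_pynput_hotkey(shortcut: str) -> str:
    """Convert a shortcut such as 'Option+Space' to pynput's '<alt>+<space>'."""
    parts = [part.strip().lower() for part in shortcut.split("+") if part.strip()]
    if not parts:
        raise ValueError("shortcut is empty")
    # Keys without a mapping (plain letters, digits) are passed through as-is.
    return "+".join(_PYNPUT_NAMES.get(part, part) for part in parts)


def listen_for_shortcut(shortcut: str, on_press) -> None:
    """Block and call on_press() every time the shortcut is pressed."""
    from pynput import keyboard  # installed with: pip install pynput

    with keyboard.GlobalHotKeys({to_pynput_hotkey(shortcut): on_press}) as hotkeys:
        hotkeys.join()
```

On macOS, a script like this needs accessibility or input-monitoring permission before it can see key presses in other applications.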

How Ultramic Works

Ultramic simplifies the process of converting speech to text. Here is a typical workflow:

  1. Start Recording: Press the shortcut key (Option+Space by default) to start recording.
  2. Speak Clearly: Say what you want to transcribe. A small indicator will show that Ultramic is listening.
  3. Stop Recording: Press the shortcut key again to stop recording.
  4. Automatic Transcription: Ultramic processes your audio using your selected provider (OpenAI Cloud or Local Whisper).
  5. Text Insertion: The transcribed text is automatically pasted into your active text field, whether it's an email, document, or any other application.

The entire process happens quickly, allowing you to write up to 3× faster than typing.
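The press-to-start, press-to-stop flow above can be pictured as a small toggle. The following is a minimal sketch under stated assumptions: the class DictationSession and its four callables (start_recording, stop_recording, transcribe, insert_text) are hypothetical stand-ins for the recording, provider, and pasting steps, and do not reflect Ultramic's actual code.

```python
from typing import Callable, Optional


class DictationSession:
    """Toggle between idle and recording each time the shortcut fires."""

    def __init__(
        self,
        start_recording: Callable[[], None],
        stop_recording: Callable[[], bytes],
        transcribe: Callable[[bytes], str],
        insert_text: Callable[[str], None],
    ) -> None:
        self._start_recording = start_recording
        self._stop_recording = stop_recording
        self._transcribe = transcribe    # cloud or local provider
        self._insert_text = insert_text  # e.g. paste into the active field
        self.recording = False

    def on_shortcut(self) -> Optional[str]:
        """Handle one shortcut press; return the text once a dictation ends."""
        if not self.recording:
            # First press (step 1): begin capturing audio.
            self._start_recording()
            self.recording = True
            return None

        # Second press (steps 3 to 5): stop, transcribe, insert.
        audio = self._stop_recording()
        self.recording = False
        text = self._transcribe(audio)
        self._insert_text(text)
        return text
```

Keeping the provider behind a single transcribe callable is one way to let the same flow serve both the OpenAI and Local Whisper options.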

Frequently Asked Questions

What languages does Ultramic support?

Ultramic supports 98 languages, including English, Spanish, French, German, Czech, and many more, thanks to the Whisper model (used in both Cloud and Local modes).

How much does it cost to use Ultramic?

It depends on the provider you choose:

  • Local Whisper: Completely free.
  • OpenAI: You pay for your usage of the OpenAI API, which is typically around $0.006 per minute of audio.
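To get a feel for what the OpenAI option costs, you can multiply your minutes of audio by the per-minute rate. The sketch below uses the approximate $0.006 figure quoted above. The function name estimate_cost_usd is hypothetical, the rate may change, and any rounding the provider applies when billing is ignored.

```python
RATE_USD_PER_MINUTE = 0.006  # approximate figure from this guide; may change


def estimate_cost_usd(seconds_of_audio: float,
                      rate_per_minute: float = RATE_USD_PER_MINUTE) -> float:
    """Estimate the cloud transcription cost for a given amount of audio."""
    if seconds_of_audio < 0:
        raise ValueError("seconds_of_audio must not be negative")
    return seconds_of_audio / 60 * rate_per_minute


# One hour of dictation:
print(f"${estimate_cost_usd(60 * 60):.2f}")      # $0.36
# Five minutes a day for 30 days (150 minutes in total):
print(f"${estimate_cost_usd(5 * 60 * 30):.2f}")  # $0.90
```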

Can I use Ultramic for long recordings?

Ultramic is designed for short to medium-length dictations. For very long recordings, you might want to use a specialized transcription service.

What if I encounter issues with transcription accuracy?

Try speaking clearly and at a moderate pace. Ensure you're in a quiet environment with minimal background noise for best results.