Skip to main content
“og:title”: “Stenox Docs”

Fast, High-Quality Cloud Setup

Cloud providers offer the fastest processing and highest accuracy for Stenox dictation. This guide shows you the recommended configurations for optimal results.
Most recommended: Stenox Cloud — fine-tuned for Stenox, zero configuration, fastest setup. Included with Plus and Lifetime Plus.

Why Use Cloud Providers?

Ultra-fast processing

Sub-second transcription and enhancement (5-10x faster than local).

Higher accuracy

State-of-the-art models optimized for production use.

No local storage

No need to download multi-GB models.

Works on any Mac

Intel and Apple Silicon, old and new.
Best for: Fastest setup, best out-of-the-box experience Configuration:
  • Transcription: Stenox Cloud Transcription
  • Enhancement: Stenox Cloud AI Enhancement
Setup time: 2 minutes (just activate your license) Processing time: ~1-2 seconds total Availability: Plus and Lifetime Plus plans
1

Get Plus or Lifetime Plus

Visit stenox.app/pricing and choose your plan
2

Activate License

Settings → License → Sign in with your purchase email
3

Select Stenox Cloud

Settings → Transcription → Provider: Stenox Cloud Settings → AI Enhancement → Provider: Stenox Cloud
4

Start Dictating

That’s it — no API keys, no configuration, just works
Why Stenox Cloud?
  • Fine-tuned for Stenox — optimized specifically for voice dictation workflows
  • Zero configuration — no API keys to manage, no accounts to create
  • Consistent quality — we maintain and update the models for best results
  • Fast and accurate — production-grade speed and accuracy

Learn More

Full Stenox Cloud details

Option 2: Maximum Free Credits (BYOK)

Best for: Getting started free, maximum free usage Configuration:
  • Transcription: DeepGram Nova-3 ($200 free credits)
  • Enhancement: Google Gemini 2.5 Flash (1,500/day free)
Setup time: 10 minutes Processing time: ~1-2 seconds total Free usage: 6-12 months typical
1

Get DeepGram API key

  1. Visit console.deepgram.com
  2. Sign up (no credit card required)
  3. Create API key
  4. Get $200 in free credits
2

Configure DeepGram in Stenox

  1. Settings → Transcription tab
  2. Provider: DeepGram
  3. Paste API key
  4. Model: Nova-3 (best accuracy)
3

Get Google Gemini API key

  1. Visit aistudio.google.com
  2. Sign in with Google (no credit card)
  3. Create API key
  4. Get 1,500 free requests/day
4

Configure Gemini in Stenox

  1. Settings → AI Enhancement tab
  2. Provider: Google Gemini
  3. Paste API key
  4. Model: Gemini 2.5 Flash (fastest)
Get detailed setup instructions →

Option 3: Groq Stack (Single API Key)

Best for: Simplicity, one account for everything Configuration:
  • Transcription: Groq Whisper v3-turbo
  • Enhancement: Groq Llama 3.3 70B or DeepSeek R1
Setup time: 5 minutes Processing time: ~1-2 seconds total Free usage: Ongoing free tier with rate limits
1

Get Groq API key

  1. Visit console.groq.com
  2. Sign up (no credit card required)
  3. Create API key
  4. One key for both transcription and enhancement!
2

Configure Groq Transcription

  1. Settings → Transcription tab
  2. Provider: Groq
  3. Paste API key
  4. Model: whisper-large-v3-turbo
3

Configure Groq Enhancement

  1. Settings → AI Enhancement tab
  2. Provider: Groq
  3. Paste same API key
  4. Model: Llama 3.3 70B Versatile
Benefits:
  • Single API key management
  • Ultra-fast LPU inference
  • Ongoing free tier (doesn’t expire)
  • Simple setup
Get detailed setup instructions →

Option 4: Best Quality (Paid BYOK)

Best for: Professional use, maximum accuracy Configuration:
  • Transcription: DeepGram Nova-3 or AssemblyAI Universal-1
  • Enhancement: Google Gemini 2.5 Pro
Cost: ~$0.75-1.50 per hour of dictation Processing time: ~1-2 seconds total When to use:
  • Professional transcription services
  • Critical accuracy requirements
  • Business or enterprise use
  • After free credits are exhausted

Performance Comparison

ConfigurationSpeedAccuracySetupCostBest For
Stenox Cloud⚡⚡⚡⭐⭐⭐⭐⭐InstantPlus/Lifetime PlusBest experience, zero config
DeepGram + Gemini⚡⚡⚡⭐⭐⭐⭐10 min$$ after freeMaximum free credits
Groq Stack⚡⚡⚡⭐⭐⭐⭐5 minFree tierOne API key simplicity
AssemblyAI + Gemini⚡⚡⚡⭐⭐⭐⭐10 min$Accent optimization
Local (WhisperKit + MLX)⭐⭐⭐5 minFree foreverPrivacy, offline

Detailed Provider Configurations

DeepGram Nova-3 Settings

Recommended configuration:
SettingValueWhy
ModelNova-3Latest, best accuracy
LanguageEnglish (or auto-detect)Optimize for your language
StreamingEnabled (automatic)Fastest results
PunctuationEnabled (automatic)Auto-punctuation
In Stenox:
  • Provider: DeepGram
  • Model: Nova-3 or Nova-3-English
  • Language: English or Auto-detect
DeepGram full guide →

Google Gemini 2.5 Flash Settings

Recommended configuration:
SettingValueWhy
ModelGemini 2.5 FlashFastest with excellent quality
Custom promptDefault or ProfessionalBased on use case
TemperatureDefaultStenox handles this
In Stenox:
  • Provider: Google Gemini
  • Model: Gemini 2.5 Flash
  • Custom prompt: (optional, see below)
Google Gemini full guide →

Enhancement Prompts

Fine-tuned out of the box. Both Stenox Cloud and local providers come with fine-tuned enhancements that we continue to refine — no configuration needed for most users.
For BYOK providers, you can configure custom enhancement behaviors:
Fix grammar and punctuation errors.
Keep the original meaning and tone unchanged.
Use for: General dictation, notes

Multi-Profile Cloud Setup

For maximum flexibility, create profiles for different scenarios:

Profile Strategy

1

Work Email Profile

  • Name: “Work Email”
  • Transcription: DeepGram Nova-3
  • Enhancement: Gemini 2.5 Flash
  • Custom prompt: Professional email tone
  • Auto-activate: When Gmail or Outlook is active
2

Quick Notes Profile

  • Name: “Quick Notes”
  • Transcription: Groq Whisper (fastest)
  • Enhancement: None (skip for speed)
  • Auto-activate: When Notes app is active
3

Documentation Profile

  • Name: “Documentation”
  • Transcription: DeepGram Nova-3
  • Enhancement: Gemini 2.5 Flash
  • Custom prompt: Technical documentation
  • Auto-activate: When VSCode or similar is active
4

Fallback Local Profile

  • Name: “Offline”
  • Transcription: WhisperKit
  • Enhancement: MLX or None
  • Manual activation: When internet unavailable
Learn about Profiles →

Usage Optimization

Maximize Free Credits

DeepGram ($200 credits):
  • Use for important dictation (work emails, documents)
  • Switch to Groq or local for casual notes
  • Track usage in DeepGram console
Google Gemini (1,500/day):
  • More than enough for most users
  • If you hit limit, create “no enhancement” profile
  • Resets daily at midnight PST
Groq (free tier):
  • Use as primary option — no API costs
  • Rate limits refresh quickly
  • Good for privacy-conscious users who prefer not to create accounts

Cost Control

After free tiers, control costs:
Groq’s ongoing free tier is sufficient for most personal use. Only upgrade if you consistently hit rate limits.
Create a profile without AI enhancement for quick personal notes. Saves on Gemini API calls.
If you dictate heavily (2+ hours/day), switch to WhisperKit + MLX to avoid costs.
AssemblyAI costs only $0.0009/min after free credits - very affordable for continued use.

Troubleshooting Cloud Setup

  • Check your internet connection speed
  • Try switching to different model (e.g., Flux instead of Nova-3)
  • Check provider status pages for outages
  • Test with local provider to isolate issue
  • Ensure you’re using Nova-3 or latest models
  • Check microphone input device (Settings → Audio tab)
  • Reduce background noise
  • Speak more clearly or use better microphone
  • Try AssemblyAI if you have strong accent
  • Verify API keys are correct (no extra spaces)
  • Check API key status in provider dashboards
  • Ensure you have remaining credits/quota
  • Try creating new API keys
  • Create local fallback profile (WhisperKit + MLX)
  • Set up multiple cloud providers and rotate
  • Consider upgrading to paid tier if needed

Internet Connection Requirements

Minimum requirements:
  • Speed: 1 Mbps upload, 1 Mbps download
  • Latency: < 100ms preferred
  • Stability: Consistent connection (not intermittent)
Optimal:
  • Speed: 5+ Mbps upload/download
  • Latency: < 50ms
  • Type: Wi-Fi or Ethernet (not cellular for best results)
Cloud processing sends ~1 MB per minute of audio. Most home internet connections are more than sufficient.

Next Steps

Free Tier Guide

Maximize free usage across all cloud providers.

Create Profiles

Set up different profiles for different scenarios.

DeepGram Setup

Complete DeepGram configuration guide.

Gemini Setup

Complete Google Gemini configuration guide.
Best setup: Stenox Cloud — fine-tuned for Stenox, zero configuration, just works. Get started at stenox.app/pricing.