Installation Guide

Last updated: November 2025, Version 3.1

System Requirements

Before installing Subtitle It, make sure your device meets these requirements:

  • iOS 15.0 or later
  • iPhone 8 or newer (for best performance)
  • At least 150MB of free storage
  • Microphone access (required for transcription)

Tip: The app works best on iPhone 11 and newer models due to improved Neural Engine capabilities for on-device transcription.

Download from App Store

Follow these steps to download Subtitle It from the App Store:

Open the App Store

Tap the App Store icon on your iPhone's home screen.

Search for "Subtitle It"

Use the search tab at the bottom and type "Subtitle It" or "SpeakySee".

Download the App

Tap the "Get" button, then authenticate with Face ID, Touch ID, or your Apple ID password.

Wait for Installation

The app will download and install automatically. This typically takes 30-60 seconds.

First Launch Setup

When you first open Subtitle It, you'll go through a quick setup process:

Welcome Screen

You'll see our welcome screen with the app's key features. Tap "Continue" to proceed.

Choose Your Appearance

Choose the Light, Dark, or System theme. You can change this later at:

Settings → Accessibility → Appearance → Theme

Text Size and Display

The app will ask you to choose your preferred text size. Pick the size that is most comfortable for reading transcriptions in real time.

Important: Make sure to grant microphone permission when prompted. The app cannot function without it.

Permissions Setup

Subtitle It requires certain permissions to work properly:

Required Permissions

  • Microphone: Essential for capturing audio to transcribe
  • Speech Recognition: Enables on-device transcription using the iOS Speech framework

Optional Permissions

  • Notifications: For transcription reminders and completion alerts
  • Files: To save and export transcriptions to your preferred location
  • Network: Only if using cloud transcription services (OpenAI, AssemblyAI)

Privacy First: By default, all transcription happens on your device. Your conversations never leave your iPhone unless you explicitly enable cloud transcription services.

Verify Installation

To ensure everything is working correctly:

  1. Open the app
  2. Tap the red record button at the bottom
  3. Say "Testing, one, two, three"
  4. You should see your words appear on screen within 1-2 seconds

Success! If you see your words transcribed, you're all set! Check out our First Transcription Guide to learn more about using the app effectively.

Your First Transcription

Now that you've installed the app, let's create your first transcription:

Start Recording

  1. Tap the large red microphone button at the bottom of the screen
  2. The button will turn into a pulsing red indicator showing recording is active
  3. Start speaking naturally - the app will begin transcribing immediately

Stop Recording

When you're finished, tap the red button again to stop recording. Your transcription will be saved automatically.

Tips for Best Results

  • Speak clearly and at a normal pace
  • Minimize background noise when possible
  • Hold your iPhone at a comfortable distance (6-12 inches from your mouth)
  • Use the microphone test feature (Settings → Audio & Performance → Test Microphone) to optimize your setup

Setting Up Permissions

Proper permissions are essential for Subtitle It to function correctly. Here's how to manage them:

Microphone Permission

The app will request microphone access when you first try to record. If you accidentally denied it:

  1. Open iOS Settings
  2. Scroll down and tap "Subtitle It"
  3. Tap "Microphone" and toggle it ON
  4. Return to the app and try recording again

Speech Recognition Permission

iOS will ask for speech recognition permission separately. This allows the app to use Apple's on-device speech recognition:

iOS Settings → Privacy & Security → Speech Recognition → Subtitle It (ON)

Troubleshooting Permissions

Common Issue: If transcription isn't working, check both Microphone AND Speech Recognition permissions. Both must be enabled.

Quick Start Guide

Get started with Subtitle It in under 2 minutes:

Launch the App

Open Subtitle It from your home screen

Grant Permissions

Allow microphone and speech recognition when prompted

Tap to Record

Tap the large red microphone button at the bottom

Start Speaking

Speak naturally; transcription appears in real time

Stop Recording

Tap the red button again when finished

Pro Tips

  • Test First: Use the microphone test (Settings → Audio & Performance → Test Microphone) to ensure optimal audio quality
  • Choose Language: Set your language in Settings → Language and Translation before your first recording
  • Quiet Environment: Start in a quiet room for best results
  • Clear Speech: Speak at a normal pace with clear enunciation

Real-Time Transcription

Subtitle It provides instant transcription as you speak, with minimal latency:

How It Works

The app uses Apple's Speech Recognition framework to process audio on your device in real time. As you speak:

  • Audio is captured through your iPhone's microphone
  • Speech is processed instantly using the Neural Engine
  • Text appears on screen with ~1-2 second latency
  • Transcription refines itself as you continue speaking

Customize Display

Adjust text appearance for optimal readability:

  • Text Size: Choose from System, Small, Medium, Large, or Custom sizes
  • Themes: Light, Dark, or High Contrast modes
  • Typeface: System, Serif, Monospaced, Rounded, or Dyslexia Friendly fonts
  • Bold Text: Enable for extra clarity

Recording Controls

While recording, you can:

  • Start/Stop Recording: Tap the large record button to begin or end transcription
  • Clear Text: Tap the clear button to erase the current transcription and start fresh
  • Share: Export your transcription at any time via the share button

Best For: Meetings, lectures, conversations, interviews, or any situation where you need live captions.

Offline Mode

Subtitle It works completely offline by default - no internet connection required!

How Offline Mode Works

All processing happens on your iPhone using Apple's on-device Speech Recognition:

  • No Data Sent: Your audio never leaves your device
  • Complete Privacy: Everything stays local
  • Works Anywhere: Airplane mode, remote locations, no problem
  • No Costs: No API fees or subscription charges

Offline Translation

On iOS 18+, translation also works offline:

  1. Download language packs in iOS Settings → General → Language & Region → Translation Languages
  2. Select your language pair in Subtitle It settings
  3. Enjoy offline real-time translation

When Internet IS Available

You can optionally enable cloud transcription services (OpenAI or AssemblyAI) for:

  • Enhanced accuracy in noisy environments
  • Support for more languages
  • Specialized vocabulary recognition

Privacy: Cloud services are disabled by default and require your explicit opt-in and API key.

Language Support

Subtitle It supports transcription in multiple languages using iOS's built-in Speech Recognition framework:

Supported Languages

The app supports all languages available in iOS Speech Recognition, including:

  • English (US, UK, Australian, Indian, and more)
  • Spanish (Spain, Mexico, Latin America)
  • French, German, Italian, Portuguese
  • Chinese (Mandarin, Cantonese)
  • Japanese, Korean, Russian
  • Arabic, Hebrew, Hindi
  • And many more...

Changing Language

To change your transcription language:

Settings → Language and Translation → Source Language

Translation Features

On iOS 18 and later, Subtitle It can translate your transcriptions in real time using Apple's Translation framework.

Enable Translation

Open Translation Settings

Navigate to Settings → Language and Translation

Select Languages

Choose your source language (what you're speaking) and target language (what you want to see)

Download Language Packs

Translation works offline, but requires downloading language packs. Tap "How to Download Language Packs" for instructions.

Display Options: You can choose to show both original and translated text, or translation only. Configure this in Settings → Language and Translation → Show Original Text.

Export Options

Save and share your transcriptions:

Export Methods

  • Share Sheet: Tap the share button to send via Messages, Mail, Notes, or save to Files
  • Text File (.txt): Automatically creates a timestamped file (e.g., "Transcript_2025-11-29_14-30-45.txt")
  • Universal Compatibility: Plain text format works with all apps and platforms
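
The timestamped filename in the example above can be reproduced with a short script. This is a minimal sketch in Python, assuming the pattern is exactly what the example suggests (Transcript_YYYY-MM-DD_HH-MM-SS.txt); the function name `transcript_filename` is hypothetical and not part of the app.

```python
from datetime import datetime


def transcript_filename(moment: datetime) -> str:
    """Build a filename that follows the documented example pattern."""
    # Pattern inferred from the example: Transcript_YYYY-MM-DD_HH-MM-SS.txt
    return moment.strftime("Transcript_%Y-%m-%d_%H-%M-%S.txt")


print(transcript_filename(datetime(2025, 11, 29, 14, 30, 45)))
# prints Transcript_2025-11-29_14-30-45.txt
```

Because the date and time fields run from largest to smallest unit, files named this way sort chronologically in any file browser.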

How to Export

Finish Recording

Stop your transcription by tapping the record button

Tap Share

Tap the share icon (square with arrow pointing up)

Choose Destination

Select where to save or send your transcription - Messages, Mail, Notes, Files, or any sharing-enabled app

Tip: Transcripts are exported as plain text (.txt files) with automatic timestamped filenames. If you need to share both the original and translated text, switch the translation view to show both languages before exporting.

Cloud Transcription

For enhanced accuracy, you can optionally use cloud-based AI services:

Supported Providers

Subtitle It supports two major cloud transcription services:

OpenAI Whisper

  • Accuracy: Industry-leading transcription quality
  • Languages: 98+ languages supported
  • Pricing: $0.006 per minute
  • Best For: High accuracy requirements, multilingual content

AssemblyAI

  • Accuracy: Excellent for English and major languages
  • Languages: 99+ languages supported
  • Pricing: $0.00025 per second (~$0.015/minute)
  • Best For: Real-time streaming, speaker diarization
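
To compare the two providers for a given session length, the quoted rates can be turned into a rough estimate. The sketch below is written in Python and uses only the prices listed in this guide, which may change; the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Rates as quoted in this guide; confirm current pricing with each provider.
OPENAI_PER_MINUTE = Decimal("0.006")
ASSEMBLYAI_PER_SECOND = Decimal("0.00025")


def estimate_cost(minutes: int) -> dict[str, Decimal]:
    """Estimated cost in US dollars for a session of the given length."""
    return {
        "OpenAI Whisper": OPENAI_PER_MINUTE * minutes,
        "AssemblyAI": ASSEMBLYAI_PER_SECOND * 60 * minutes,
    }


for provider, cost in estimate_cost(60).items():
    print(f"{provider}: ${cost:.2f}")
# prints:
# OpenAI Whisper: $0.36
# AssemblyAI: $0.90
```

Decimal is used instead of float so that small per-second rates multiply out without rounding noise.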

Set Up Cloud Transcription

Get API Key

Sign up at OpenAI or AssemblyAI and obtain an API key

Enable in Settings

Go to Settings → Remote Transcription → Enable Remote Transcription

Choose Provider

Select your preferred service and enter your API key

Select Mode

Choose Server Only (cloud only) or AI Enhanced (hybrid local + cloud)

Privacy Notice: When cloud transcription is enabled, your audio is sent to the selected third-party service for processing. Review their privacy policy before enabling.

Transcription Modes

  • Local Only: All processing on-device (default, no internet needed)
  • Server Only: Uses cloud service exclusively
  • AI Enhanced: Combines local and cloud for best results

VoiceOver Support

Subtitle It is fully compatible with iOS VoiceOver for blind and low-vision users:

VoiceOver Features

  • Full Navigation: All UI elements properly labeled
  • Live Transcription Announcements: VoiceOver reads new transcribed text automatically
  • Gesture Support: Standard VoiceOver gestures work throughout the app
  • Rotor Actions: Quick access to common functions via the rotor

Enable VoiceOver

iOS Settings → Accessibility → VoiceOver → ON

Using with VoiceOver

  • Start Recording: Double-tap the "Record" button
  • Hear Transcription: Swipe right to navigate to transcribed text
  • Stop Recording: Double-tap the "Stop" button

High Contrast Mode

Enhance readability with WCAG 2.1 compliant high contrast display options:

Color Schemes (All on Black Background)

All colors meet at least the WCAG 2.1 AA contrast requirement for normal text (4.5:1), and three of the four exceed the AAA requirement (7:1):

  • Green on Black: 15.3:1 contrast ratio - Exceeds AAA standard (default, classic terminal style)
  • Blue on Black: 8.2:1 contrast ratio - Exceeds AAA standard (colorblind friendly)
  • Red on Black: 5.7:1 contrast ratio - Meets AA standard
  • Yellow on Black: 19.6:1 contrast ratio - Exceeds AAA standard (highest contrast)
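
The ratios above follow the WCAG 2.1 contrast formula, which compares the relative luminance of the text color and the background. The Python sketch below shows the calculation. It assumes pure green (0, 255, 0) and pure yellow (255, 255, 0), which reproduce the two figures listed; the app's exact color values are not documented here, and the blue and red ratios imply lighter shades than the pure primaries (pure blue on black is only about 2.4:1).

```python
def relative_luminance(rgb):
    """WCAG 2.1 relative luminance of an 8-bit sRGB color."""
    def linearize(channel):
        c = channel / 255
        return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4

    r, g, b = (linearize(v) for v in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(foreground, background):
    """WCAG 2.1 contrast ratio between two colors, from 1.0 up to 21.0."""
    lighter, darker = sorted(
        (relative_luminance(foreground), relative_luminance(background)),
        reverse=True,
    )
    return (lighter + 0.05) / (darker + 0.05)


BLACK = (0, 0, 0)
print(round(contrast_ratio((0, 255, 0), BLACK), 1))    # pure green: 15.3
print(round(contrast_ratio((255, 255, 0), BLACK), 1))  # pure yellow: 19.6
```

A ratio of at least 4.5 satisfies AA for normal text, and at least 7 satisfies AAA.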

Enable High Contrast

Settings → Accessibility → Appearance → Theme → High Contrast

Then select your preferred color:

Settings → Accessibility → Appearance → High Contrast Color

Accessibility: All colors exceed minimum WCAG 2.1 AA standards (4.5:1). Blue is specially optimized for red-green colorblindness. High contrast mode is excellent for outdoor use, presentations, or users with visual impairments.

Text Size Options

Customize text size for optimal readability:

Size Presets

  • System: Follows iOS Dynamic Type settings
  • Small: Compact text for more content on screen (14pt)
  • Medium: Balanced size (17pt, default)
  • Large: Easier to read from a distance (20pt)
  • Custom: Use the accessibility text scale slider for fine control

Accessibility Text Scale

When "Custom" text size is selected, fine-tune with a slider (1.0x to 3.0x):

  • 1.0x: 20pt (same as Large)
  • 2.0x: 40pt
  • 3.0x: 60pt (maximum)

Settings → Accessibility → Text → Text Size → Custom

Settings → Accessibility → Text → Accessibility Text Scale
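
The three slider values listed above are consistent with multiplying the 20pt Large size by the scale factor. The Python sketch below works under that assumption: linear scaling between the listed points is inferred rather than documented, and `custom_text_size` is a hypothetical name.

```python
LARGE_PRESET_PT = 20  # 1.0x equals the Large preset, per the list above


def custom_text_size(scale: float) -> float:
    """Point size for a slider value, clamped to the documented 1.0x-3.0x range."""
    clamped = max(1.0, min(3.0, scale))
    return LARGE_PRESET_PT * clamped


for scale in (1.0, 2.0, 3.0):
    print(scale, custom_text_size(scale))
```

Under this assumption, a slider value of 1.5x would give 30pt.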

Large Icons

Enable larger UI buttons and controls (140% scale):

Settings → Accessibility → Text → Large Icons

Accessibility Gestures

Subtitle It supports accessibility gestures for easier one-handed operation:

Haptic Feedback

Enable haptic feedback for button presses and actions:

Settings → Accessibility → Haptic Feedback → Enable Haptic Feedback

Custom Haptic Actions

Choose which buttons provide haptic feedback:

  • Record button
  • Settings button
  • Export button
  • Clear text button

Voice Control

Use iOS Voice Control to operate the app hands-free:

iOS Settings → Accessibility → Voice Control → ON

Audio Quality Settings

Optimize audio capture for your environment:

Audio Enhancement

Enable the audio enhancement pipeline for better quality:

Settings → Audio & Performance → Enable Audio Enhancement

Enhancement Features

  • Noise Reduction: Filters background noise (Off/Light/Moderate/Aggressive)
  • Volume Normalization: Automatic/Adaptive/Manual gain control
  • Speech Frequency Boost: Enhances clarity of human speech (300-3400 Hz)
  • Dynamic Range Compression: Balances loud and quiet sounds

Audio Buffer Size

Adjust latency vs. quality trade-off:

  • Small (512 samples): Lowest latency, use for real-time needs
  • Medium (1024 samples): Balanced (recommended)
  • Large (2048 samples): Better quality, slightly higher latency
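
To get a feel for the latency side of this trade-off, a buffer size in samples converts to time by dividing by the sample rate. The Python sketch below assumes a 48 kHz sample rate, which this guide does not state, and `buffer_duration_ms` is a hypothetical helper name.

```python
def buffer_duration_ms(samples: int, sample_rate_hz: int) -> float:
    """Time needed to fill one audio buffer, in milliseconds."""
    return samples / sample_rate_hz * 1000


# 48 kHz is an assumed sample rate, not a documented app setting.
for name, samples in (("Small", 512), ("Medium", 1024), ("Large", 2048)):
    print(f"{name}: {buffer_duration_ms(samples, 48_000):.1f} ms at 48 kHz")
# prints:
# Small: 10.7 ms at 48 kHz
# Medium: 21.3 ms at 48 kHz
# Large: 42.7 ms at 48 kHz
```

These figures cover only the time to fill one buffer. The overall on-screen latency also includes speech recognition time.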

Noise Reduction

Reduce background noise for clearer transcriptions:

Noise Reduction Levels

  • Off: No processing (cleanest in quiet environments)
  • Light: Subtle reduction, preserves natural sound
  • Moderate: Balanced noise removal (recommended)
  • Aggressive: Maximum noise suppression for very noisy environments

When to Adjust

  • Quiet room: Use "Off" or "Light"
  • Office/café: Use "Moderate"
  • Loud environment: Use "Aggressive"
  • Outdoor/windy: Use "Aggressive" + enable "Remove Silence Gaps"

Pro Tip: Start with "Moderate" and adjust based on your results. Too much noise reduction can affect speech clarity.

Battery Optimization

Manage power consumption for longer transcription sessions:

Battery Impact Factors

  • Transcription Mode: Local mode uses more CPU; cloud modes use more network
  • Audio Enhancement: Processing increases battery usage
  • Screen Brightness: The display consumes more power at higher brightness
  • Background Processing: Transcription keeps running when the app is in the background

Power Saving Tips

  1. Disable audio enhancement if not needed
  2. Use larger buffer sizes (less CPU intensive)
  3. Enable "Pause Processing in Background"
  4. Lower screen brightness
  5. Disable unnecessary haptic feedback

Estimated Battery Life

Use the microphone test to see estimated battery impact:

Settings → Audio & Performance → Test Microphone

Microphone Testing

Test your microphone setup before important recordings:

Running a Test

  1. Go to Settings → Audio & Performance
  2. Tap "Test Microphone"
  3. Tap "Start Test"
  4. Speak for 5-10 seconds
  5. Review the results

Test Results

The test provides:

  • Audio Waveform: Real-time visualization of your voice
  • Volume Levels: Peak and average volume indicators
  • Battery Impact: Estimated power consumption
  • Quality Score: Overall audio quality rating

Best Practice: Run a microphone test whenever you change your environment or audio settings.