CamoVoice User Guide

Fully Private, Offline Speech-to-Text

Looking for CamoVoice International?

CamoVoice International supports transcription in English, Español, Deutsch, Français, Italiano, Português, Nederlands, Polski, Svenska, and Русский.

Jump to International Guide

Automating transcription from the terminal?

CamoVoice CLI transcribes audio files headlessly from scripts, scheduled jobs, and folder watchers using the same local engine as the desktop app.

Jump to CLI Section

Overview

CamoVoice is a desktop speech-to-text application designed with one core principle: your voice stays on your device. Unlike cloud-based transcription services, CamoVoice processes everything locally using bundled AI models — no internet connection required, no data leaves your computer, and absolutely zero telemetry.

Privacy & Architecture

100% Offline: All transcription happens on your machine using faster-whisper, which runs entirely locally
Zero Telemetry: CamoVoice collects nothing. No usage analytics, no crash reports, no audio samples
No Cloud Dependencies: Once installed, the app works without any network connection
Local Settings: Your preferences are stored in a simple settings.json file on your device and auto-loaded at open
English Optimized: Purpose-built for English transcription with maximum accuracy
Headless CLI: The optional camovoice command-line tool transcribes files from a terminal or automation workflow with the same offline stack (see CamoVoice CLI and the CLI documentation)

This architecture makes CamoVoice ideal for transcribing sensitive notes, legal dictation, personal journals, or any audio where privacy matters.

User Interface

CamoVoice features a high-contrast, dark-themed interface designed for simplicity and accessibility.

Main Window Layout

┌─────────────────────────────────────────────────────────────┐
│  [Load Audio File]          🎙 Record Button     Status     │
│  [Settings]        "Click or hold spacebar"     🎤 Device   │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│                   Transcription Output                      │
│                      (editable text)                        │
│                                                             │
├─────────────────────────────────────────────────────────────┤
│  [Copy Text] [▶ Play] [Save As]       [Undo Clear] [Clear] │
├─────────────────────────────────────────────────────────────┤
│  [Input Audio: ████████░░] ← (appears during recording)     │
├─────────────────────────────────────────────────────────────┤
│  Text Size: ═══════●═══════    Mode: [Fast|Thinking]        │
└─────────────────────────────────────────────────────────────┘

 ● REC  ▌▌▌▌▌░░   ← live status pill (top-left of primary screen
                    while the global voice-typing hotkey is active)

Recording Audio

There are two ways to record:

Click the Microphone Button: Click once to start, click again to stop
Hold Spacebar: Hold spacebar to record, release to stop (push-to-talk style)
- A brief 200ms delay prevents accidental recordings from quick taps
- If you're typing in the text area, spacebar works normally

During recording, the microphone button pulses and fills with the accent color. When you stop, transcription begins automatically.

Global Voice Typing Hotkey (New)

CamoVoice can also record and paste transcriptions into any other application — no need to have the CamoVoice window focused. In Settings, enable “Enable global hotkey (Ctrl+` / ⌃`)”, then:

Place your cursor in any text field (email, chat, document, browser, IDE, etc.).
Press Ctrl + ` on Windows or ⌃ + ` on macOS to start recording.
A small dark REC pill appears in the top-left of your primary display with a blinking indicator and a live audio-level bar, so you always know the mic is hot.
Press the same shortcut again to stop. CamoVoice transcribes locally, copies the result to your clipboard, and automatically pastes it at your cursor.

The global hotkey is disabled by default and only activates while CamoVoice is running. The default combo is Ctrl+` (backtick) — chosen rather than Ctrl+Space or Ctrl+Shift+Space so it doesn't collide with macOS Spotlight, the input-source switcher, or common editor shortcuts.

Custom hotkey (new): You can change the combo. In Settings, click Change Hotkey, then press any combination of Ctrl, Alt, or Shift plus one non-modifier key. The new combo is shown next to the enable checkbox and is saved with your settings, so it persists across restarts.

On macOS, global voice typing also requires one-time Accessibility and Input Monitoring permission (steps in Settings → macOS Permissions for Global Hotkey below).

Privacy note: Transcription still happens entirely on your device. Nothing about the global hotkey changes CamoVoice's offline-first architecture — there is no background listening when the hotkey is idle, and no audio is ever transmitted.

Recording Duration Timer

For longer recordings, a duration timer appears after 1 minute of live recording, showing your elapsed time and the mode’s limit (e.g., 3:45 / 15:00 in Fast mode). The timer changes color as you approach the limit:

Gray: Normal — plenty of time remaining
Orange: Warning — less than 1 minute remaining
Red: Critical — less than 30 seconds remaining

This helps you pace your recordings and avoid hitting the automatic cutoff unexpectedly.
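The timer rules above can be sketched in a few lines of Python. This is an illustration only, not CamoVoice's actual code: the function names are hypothetical, and the behaviour at the exact boundaries (for example, precisely 60 seconds remaining) is an assumption.

```python
# Mode limits in seconds, taken from this guide (Fast 15 min, Thinking 7 min).
MODE_LIMIT_S = {"fast": 900, "thinking": 420}


def format_clock(seconds):
    """Format a number of seconds as M:SS, e.g. 225 -> '3:45'."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes}:{secs:02d}"


def timer_state(elapsed_s, mode):
    """Return (label, color), or None while the timer is still hidden.

    Hypothetical helper: the timer shows after 1 minute of recording,
    turns orange under 1 minute remaining and red under 30 seconds.
    """
    limit = MODE_LIMIT_S[mode]
    if elapsed_s < 60:
        return None  # timer not shown during the first minute
    remaining = limit - elapsed_s
    if remaining < 30:
        color = "red"
    elif remaining < 60:
        color = "orange"
    else:
        color = "gray"
    return f"{format_clock(elapsed_s)} / {format_clock(limit)}", color


print(timer_state(225, "fast"))
```

With 3 minutes 45 seconds elapsed in Fast mode, the sketch yields the label 3:45 / 15:00 in gray, matching the example in the text.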

Transcription Animation

When CamoVoice is processing your audio, you'll see an animated "Transcribing..." status with cycling dots, and the text area border will pulse with the accent color. This provides clear visual feedback that your audio is being processed.

Audio Level Indicator

While recording, CamoVoice displays a thin Input Audio meter below the action buttons that shows real-time audio levels:

Orange gradient bar: Shows current audio input level
Yellow peak indicator: Marks the highest audio peaks
Automatic display: Appears when recording starts
Smart hiding: Disappears 1.5 seconds after recording stops

This visual feedback confirms your microphone is working and picking up audio.

Loading Audio & Video Files (New)

Click Load Audio File or drag and drop a supported file onto the CamoVoice window to transcribe existing audio or the audio track of a video. CamoVoice supports:

WAV — Any sample rate or bit depth
MP3 — Decoded natively with the bundled miniaudio library (no ffmpeg required)
M4A / AAC / FLAC / OGG
MP4 / WEBM / MOV / AVI / MKV / M4V / WMV video — The audio track is extracted; a clear error is shown if a video has no audio track

The app automatically resamples to the required 16 kHz and converts stereo to mono. Files within both the mode’s duration and size limits are transcribed in a single pass; larger or longer files are split into sequential chunks (see Audio Duration & Size Limits).
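To make the "stereo to mono, then 16 kHz" step concrete, here is a minimal pure-Python sketch. It is an assumption about the general technique, not the app's implementation: the function names are hypothetical, and plain linear interpolation is used for brevity where a production resampler would normally apply a low-pass filter before downsampling.

```python
TARGET_RATE = 16_000  # sample rate the guide says transcription requires


def to_mono(frames):
    """Average the channels of each frame, e.g. (left, right) -> one sample."""
    return [sum(frame) / len(frame) for frame in frames]


def resample_linear(samples, src_rate, dst_rate=TARGET_RATE):
    """Resample a mono signal by linear interpolation (illustrative only)."""
    if src_rate == dst_rate or not samples:
        return list(samples)
    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # source samples advanced per output sample
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * step
        lo = min(int(pos), last)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


stereo = [(1.0, 0.0), (0.5, 0.5)]
print(to_mono(stereo))
print(len(resample_linear([0.0] * 48_000, 48_000)))
```

One second of 48 kHz audio comes out as 16,000 samples, which is the shape the transcription engine expects.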

Note: All compressed-audio and video support works fully offline after installation.

To transcribe many files without opening the GUI, use CamoVoice CLI in batch mode (camovoice --input-dir). The full command reference is in the CamoVoice CLI Guide.

Transcription Output

The large text area displays your transcriptions. Each new recording or file appends to existing text, making it easy to build up longer documents.

Working with Text

Edit freely: Click into the text area to make corrections or additions
Copy Text: Copies all text to your clipboard (button flashes green to confirm)
▶ Play: Reads the transcription aloud using text-to-speech (see Playback Voice below)
Save As: Opens a save dialog with timestamp options (see below)
Undo Clear: Restores the most recently cleared text (grayed out when nothing to undo)
Clear: Removes all text and timestamps

Undo Clear

If you accidentally clear your text, you have two options:

Click the Undo Clear button (located to the left of Clear)
Press Ctrl+Z (Windows) or ⌘+Z (macOS)

This restores both the text and any associated timestamps from the most recent clear action. The Undo Clear button is grayed out when there's nothing to restore.

If the undo itself was a mistake and you want to re-clear the restored text, press Ctrl+Shift+Z (Windows) or ⇧+⌘+Z (macOS). The text and timestamps remain available for undo afterward, so you can freely toggle between the two states.
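The clear, undo, and redo behaviour described above amounts to a two-state toggle. The following sketch shows one way to model it; the class and attribute names are hypothetical and the app's internals may differ.

```python
class ClearHistory:
    """Illustrative model of Clear / Undo Clear / Redo Clear."""

    def __init__(self):
        self.text = ""
        self.timestamps = []
        self._stash = None      # (text, timestamps) saved by the last clear
        self._restored = False  # True right after an undo, enabling redo

    @property
    def can_undo(self):
        """Mirrors the Undo Clear button: grayed out when this is False."""
        return self._stash is not None and not self._restored

    def clear(self):
        if not self.text and not self.timestamps:
            return  # nothing to clear, keep any earlier stash
        self._stash = (self.text, list(self.timestamps))
        self.text, self.timestamps = "", []
        self._restored = False

    def undo_clear(self):
        if not self.can_undo:
            return False
        self.text, self.timestamps = self._stash[0], list(self._stash[1])
        self._restored = True
        return True

    def redo_clear(self):
        if not self._restored:
            return False
        self.clear()  # re-stash the current content so undo works again
        return True


history = ClearHistory()
history.text = "Meeting notes"
history.timestamps = ["09-Jan-2026 14:32:15"]
history.clear()
history.undo_clear()
print(history.text, history.timestamps)
```

Because redo stashes the content again before clearing, the two shortcuts can be alternated indefinitely without losing text or timestamps.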

Save As Options

When you click Save As (or press Ctrl+S / ⌘S), a dialog appears with options:

Include date at top of document: Adds a header with the current date
Include timestamp after each transcription: Adds a [Recorded: DD-Mon-YYYY HH:MM:SS] line after each transcription segment (e.g., [Recorded: 09-Jan-2026 14:32:15])
Include edits to transcription: Saves the text exactly as it appears in the app, including any spelling corrections or edits you've made. When this is enabled, timestamps are disabled since edited text may no longer match the original transcription segments.

Export Formats

CamoVoice supports three export formats in the desktop app, plus JSON output from the CLI:

Format	Extension	Best For
Text File	`.txt`	Universal compatibility, email, plain text editors
Word Document	`.docx`	Professional documents, further editing in Microsoft Word
PDF Document	`.pdf`	Sharing, archiving, printing (read-only format)
JSON (CLI only)	`.json`	Automation, structured metadata, scriptable workflows

Simply select your desired format from the "Save as type" dropdown in the file dialog. From the terminal, CamoVoice CLI can also write .txt, .docx, .pdf, and .json outputs — see the CLI documentation for --format and batch options.

Example output with both timestamp options enabled:

Transcription Record - 09-Jan-2026
========================================

This is my first transcription segment.
[Recorded: 09-Jan-2026 14:32:15]

This is my second transcription segment.
[Recorded: 09-Jan-2026 14:35:42]
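For scripts that need to produce or parse files in the same shape, the layout above can be reproduced as follows. This is a sketch inferred from the example output, not the exporter's real code: the function names are hypothetical, the rule width is an assumption, and month names are hard-coded so the result does not depend on the system locale.

```python
from datetime import datetime

MONTHS = ("Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec")


def format_date(when):
    """DD-Mon-YYYY with a fixed English month abbreviation."""
    return f"{when.day:02d}-{MONTHS[when.month - 1]}-{when.year:04d}"


def recorded_line(when):
    """Build the per-segment line, e.g. [Recorded: 09-Jan-2026 14:32:15]."""
    return f"[Recorded: {format_date(when)} {when:%H:%M:%S}]"


def render_export(segments, today, include_date=True, include_timestamps=True):
    """segments is a list of (text, recorded_at) pairs."""
    parts = []
    if include_date:
        # Rule width of 40 characters is an assumption for illustration.
        parts.append(f"Transcription Record - {format_date(today)}\n" + "=" * 40)
    for text, recorded_at in segments:
        block = text
        if include_timestamps:
            block += "\n" + recorded_line(recorded_at)
        parts.append(block)
    return "\n\n".join(parts) + "\n"


segments = [
    ("This is my first transcription segment.", datetime(2026, 1, 9, 14, 32, 15)),
    ("This is my second transcription segment.", datetime(2026, 1, 9, 14, 35, 42)),
]
print(render_export(segments, today=datetime(2026, 1, 9)))
```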

Transcription Modes

CamoVoice offers two transcription modes, selectable at the bottom of the window:

Mode	Speed	Accuracy	Best For
Fast	★★★★★	★★★☆☆	Quick notes, short recordings, real-time feel
Thinking	★★★☆☆	★★★★★	Important transcriptions, longer recordings

Fast mode: Uses the compact base model (~140MB) with greedy decoding for maximum speed. Great for quick voice memos, short dictation, or when you want near-instant results. Trades some accuracy for responsiveness.
Thinking mode: Uses the full large model (~1.4GB) with beam search and voice activity detection. Better accuracy for complex vocabulary, accents, background noise, or professional transcription work.

Your mode selection is automatically saved. The model loads on first use of each mode, so the first transcription after opening the app or switching modes may take slightly longer.

Audio Duration & Size Limits

The limits below apply to both live recording and single-pass file loading. If a file exceeds either the duration or size threshold for your current mode, it is treated as oversized.

Single-pass thresholds (Fast vs Thinking)

Mode	Max duration (single pass)	Max file size (single pass)
Fast	15 minutes (900 s)	90 MB (~39 min as uncompressed WAV, or ~90 min as 128 kbps MP3)
Thinking	7 minutes (420 s)	45 MB (~19 min as WAV, or ~45 min as 128 kbps MP3)

Duration and size are separate checks. A small but very long file can exceed the time limit. A short but high-bitrate file can exceed the size limit. Either one triggers the oversized-file path.
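The two independent checks can be expressed as a small function. The sketch below uses the thresholds from the table; the names are hypothetical, and treating 1 MB as 1,000,000 bytes and "exceeds" as strictly greater than are assumptions.

```python
from dataclasses import dataclass

MB = 1_000_000  # assumption: decimal megabytes


@dataclass(frozen=True)
class ModeLimits:
    max_duration_s: int
    max_size_bytes: int


# Single-pass thresholds from the table above.
LIMITS = {
    "fast": ModeLimits(max_duration_s=900, max_size_bytes=90 * MB),
    "thinking": ModeLimits(max_duration_s=420, max_size_bytes=45 * MB),
}


def oversize_reasons(duration_s, size_bytes, mode):
    """Return which limits a file exceeds; an empty list means single pass."""
    limits = LIMITS[mode]
    reasons = []
    if duration_s > limits.max_duration_s:
        reasons.append("duration")
    if size_bytes > limits.max_size_bytes:
        reasons.append("size")
    return reasons


# A 10-minute, 10 MB file fits Fast mode but is too long for Thinking mode.
print(oversize_reasons(600, 10 * MB, "fast"))
print(oversize_reasons(600, 10 * MB, "thinking"))
```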

Loaded audio files

Without chunking (file fits both limits): the entire file is loaded and transcribed in one pass.

With chunking (file exceeds size or duration):

There is no coded maximum total file length or total file size.
The app splits the file into sequential chunks and transcribes each chunk one after another until the whole file is done.
GUI (Load Audio File / drag-and-drop): a confirmation dialog shows size, length, estimated chunk count, and chunk length. Declining cancels transcription with an error message; approving continues with no upper cap on total length.
CLI (camovoice): oversized files are auto-split with no prompt.

Chunk size

Each chunk stays within the mode limits:

Duration cap per chunk: ~90% of the mode’s max recording time — Fast: ~13.5 min (810 s, floor 60 s); Thinking: ~6.3 min (378 s, floor 60 s).
Size cap per chunk: if the file is large for its duration (high bitrate), chunks may be shortened further (minimum 30 s per chunk).
Each chunk is sized to satisfy both duration and size constraints.
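As a rough illustration of how these rules combine, the sketch below estimates chunk length and count for a file. It is a reading of the rules above, not the shipped algorithm: the function names are hypothetical, bitrate is assumed constant across the file, and 1 MB is assumed to be 1,000,000 bytes.

```python
import math

MB = 1_000_000  # assumption: decimal megabytes
# (max duration in seconds, max size in bytes) per mode, from this guide.
MODE_LIMITS = {"fast": (900, 90 * MB), "thinking": (420, 45 * MB)}


def chunk_seconds(duration_s, size_bytes, mode):
    """Chunk length that satisfies both the duration and the size cap."""
    max_duration_s, max_size_bytes = MODE_LIMITS[mode]
    # Duration cap: 90% of the mode limit, never below 60 s.
    duration_cap = max(60, max_duration_s * 9 // 10)
    # Size cap: how many seconds of this file fit in the size limit.
    bytes_per_second = size_bytes / duration_s
    size_cap = max_size_bytes / bytes_per_second
    # High-bitrate files shorten the chunk, but never below 30 s.
    return max(30, min(duration_cap, size_cap))


def plan_chunks(duration_s, size_bytes, mode):
    """Return sequential (start_s, end_s) spans covering the whole file."""
    length = chunk_seconds(duration_s, size_bytes, mode)
    count = math.ceil(duration_s / length)
    return [(i * length, min((i + 1) * length, duration_s)) for i in range(count)]


# Two hours of 128 kbps audio (16,000 bytes per second) in Fast mode.
spans = plan_chunks(7200, 7200 * 16_000, "fast")
print(len(spans), spans[0], spans[-1])
```

For the two-hour example, the duration cap (810 s) is the binding constraint, so the file is covered by nine sequential chunks.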

Live microphone recording

Live recording has a hard stop at the mode limit — no chunking, no continuation:

Fast: 15 minutes, then auto-stop and transcribe
Thinking: 7 minutes, then auto-stop and transcribe

A duration timer appears after 1 minute, showing elapsed time and the limit (see Recording Duration Timer above).

Minimum and quality limits

These apply to all audio sources (live, loaded, and CLI):

Minimum clip length: 0.2 seconds
Near-silent audio is skipped (returns empty / “No speech detected”) using peak and average-amplitude thresholds
Very short repetitive Whisper hallucinations (e.g. repeated “you”) are filtered out
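The guide does not publish the exact thresholds, so the following is only a sketch of what such pre- and post-filters typically look like. Every constant except the 0.2-second minimum is a hypothetical placeholder, as are the function names and the rule used to spot repeated-word output.

```python
import re

MIN_CLIP_S = 0.2        # from this guide
PEAK_THRESHOLD = 0.01   # hypothetical value, samples assumed in [-1.0, 1.0]
MEAN_THRESHOLD = 0.001  # hypothetical value


def is_near_silent(samples):
    """True when either the peak or the average amplitude is very low."""
    if not samples:
        return True
    peak = max(abs(s) for s in samples)
    mean = sum(abs(s) for s in samples) / len(samples)
    return peak < PEAK_THRESHOLD or mean < MEAN_THRESHOLD


def should_transcribe(samples, sample_rate=16_000):
    """Skip clips that are too short or effectively silent."""
    if len(samples) / sample_rate < MIN_CLIP_S:
        return False
    return not is_near_silent(samples)


def looks_like_hallucination(text, max_words=6):
    """Flag short output made of one word repeated, e.g. 'you you you'."""
    words = re.findall(r"[a-z']+", text.lower())
    return 1 < len(words) <= max_words and len(set(words)) == 1


print(should_transcribe([0.0] * 16_000))
print(looks_like_hallucination("You. You. You."))
```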

Practical limits

These are not hard-coded caps, but you may still encounter them:

Processing time scales with total audio length; many chunks mean a long wait (Thinking mode is slower per minute than Fast).
RAM: if a file is over the duration limit but under the size limit, the GUI loads the entire decoded audio into memory first, then splits in memory. Very long low-bitrate files can still use a lot of RAM. Files over the size limit use streaming chunk reads and are kinder to memory.
Disk / patience: there is no absolute hour cap in code, but multi-hour files mean dozens of sequential model runs.

Bottom line: Single-pass limit = both size and duration (mode-dependent). Chunking removes the total-length cap for loaded files — limits apply per chunk, not to the whole file. Live recording always stops at 15 min (Fast) or 7 min (Thinking).

Settings

Click the Settings button to open the settings panel.

macOS Permissions for Global Hotkey

After downloading the DMG and installing CamoVoice, the first time you record you will be prompted to allow microphone access. Click Allow. In-app recording should then work immediately using your default microphone (or the input device you selected in Settings).

To enable global hotkey recording / voice typing in text fields outside the app (⌃+` or Ctrl+` depending on keyboard), complete these one-time macOS steps:

Open System Settings → Privacy & Security → Accessibility.
Click +, select CamoVoice from Applications, and ensure it is toggled on.
Open System Settings → Privacy & Security → Input Monitoring.
Click +, select CamoVoice from Applications, and ensure it is toggled on.
Quit and re-open CamoVoice.
Test hotkey recording in a text window outside the app.

Input Device

Select which microphone to use for recording. Options include:

Default: Uses your system's default input device
All detected input devices are listed with their audio API

If you have multiple microphones (e.g., headset, webcam, USB mic), select the one you want to use here.

Playback Voice

Choose the voice used for the ▶ Play feature, which reads your transcription aloud.

These are system voices installed on your device, not voices bundled with CamoVoice. To add or remove voices:

Windows: Settings → Time & Language → Speech → Manage voices
macOS: System Settings → Accessibility → Spoken Content → System Voice → Manage Voices

The available voices depend on what you've installed on your operating system.

Show Timestamps in Transcriptions

When enabled, each transcription segment displays its recording time directly in the app, using a 24-hour clock and an unambiguous month abbreviation in the style of legal and medical records:

Your transcribed text here.
[Recorded: 09-Jan-2026 14:32:15]

For live in-app recordings, the timestamp reflects when the recording was made. When you load or drag and drop an audio file for transcription, CamoVoice uses the timestamp embedded in the audio file (when available) instead of the time of transcription.

This is useful for keeping track of when recordings were made. Timestamps are always stored internally — this setting just controls whether they're visible in the app. When you save, you can choose whether to include timestamps regardless of this display setting.

Keep Window Always on Top

When enabled, the CamoVoice window stays visible above other applications. This is useful for:

Dictating while reading documents or browsing
Keeping CamoVoice accessible during multitasking
Hands-free workflows where you can't easily switch windows

Toggle this in Settings → "Keep window always on top".

Custom Words

CamoVoice allows you to add custom vocabulary — specialized terms, names, phrases, jargon, or unusual words that you frequently use. This helps the speech recognition model recognize these words more accurately.

To manage custom words:

Open Settings
Click Custom Words (n) (where n is your current word count)
In the Custom Words window:
- Type a word or phrase and click Add (or press Enter)
- View all your current custom words in the scrollable list
- Click the ✕ next to any word to remove it
- Click Clear All to remove all custom words
- Click Done when finished

Important notes:

Custom words are suggestions that increase recognition likelihood — they are not guaranteed to be transcribed exactly
You can add up to 100 custom words
Words are case-preserved but duplicate-checked case-insensitively
Custom words are saved automatically and persist between sessions
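The list rules in these notes (a 100-entry cap, case preserved, duplicates detected case-insensitively) can be sketched as below. The function name is hypothetical and the app's own validation may include checks not described in this guide.

```python
MAX_CUSTOM_WORDS = 100  # limit stated in this guide


def add_custom_word(words, candidate):
    """Append candidate to words if allowed; return True when it was added."""
    word = candidate.strip()
    if not word:
        return False  # ignore empty input
    if len(words) >= MAX_CUSTOM_WORDS:
        return False  # list is full
    if any(existing.casefold() == word.casefold() for existing in words):
        return False  # duplicate, ignoring case
    words.append(word)  # stored exactly as typed
    return True


words = []
print(add_custom_word(words, "CamoVoice"))
print(add_custom_word(words, "camovoice"))
print(words)
```

The second call is rejected as a duplicate, and the list keeps the original capitalisation of the first entry.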

Best uses for custom words:

Proper names (people, companies, products)
Technical jargon or industry-specific terms
Acronyms that should be recognized as spoken
Unusual spellings or foreign words used in English context

Custom words saved in the GUI are read by CamoVoice CLI by default. You can also pass words for a single CLI run with --custom-word or --custom-words-json (documented in the CamoVoice CLI Guide).

Accessibility Features

Scalable Interface

The Text Size slider adjusts the font size throughout the entire application — not just the transcription area. This includes:

Transcription text
All buttons and labels
Status messages
Mode selector

Slide right for larger text, left for smaller. Your preference is automatically saved.

This makes CamoVoice usable for:

Users with visual impairments
High-DPI displays
Presentations or screen sharing
Personal comfort preferences

Keyboard Shortcuts

Action	Windows	macOS
Hold to record (in-app)	`Spacebar` (hold)	`Spacebar` (hold)
Global voice-typing toggle (system-wide)	`Ctrl`+`` ` ``	`⌃`+`` ` ``
Save As	`Ctrl`+`S`	`⌘`+`S`
Undo clear	`Ctrl`+`Z`	`⌘`+`Z`
Redo clear (revert undo)	`Ctrl`+`Shift`+`Z`	`⇧`+`⌘`+`Z`
Close dialogs	`Escape`	`Escape`

Note: The Escape key closes Settings and Save As dialogs without saving changes.

CamoVoice CLI (New)

CamoVoice CLI is the headless command-line mode for transcribing supported audio files locally from a terminal, script, scheduled job, folder watcher, or automation agent. It uses the same bundled local transcription stack as the desktop app — no cloud APIs, accounts, telemetry, or internet connection required.

On Windows, the installer can add the camovoice command to your PATH. After installation, verify with:

camovoice --version

Common tasks include single-file transcription, batch folder processing, and machine-readable output for scripts:

camovoice -i .\meeting.wav -o .\meeting.txt
camovoice --input-dir .\recordings --output-dir .\transcripts --format txt --recursive
camovoice -i .\meeting.wav -o .\meeting.txt --json

The CLI reads the same settings.json as the GUI (including Fast/Thinking mode and Custom Words) unless you pass --no-settings. Oversized files are auto-split using the same per-mode thresholds and chunk sizing as the desktop app, with no confirmation prompt. See Audio Duration & Size Limits for details.
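For scheduled jobs or folder watchers written in Python, the batch command can be wrapped with the standard subprocess module. This sketch uses only flags that appear in this guide; the helper names are hypothetical, and combining --no-settings with batch mode is an assumption. The meaning of the exit code is documented in the CamoVoice CLI Guide, not here.

```python
import subprocess


def batch_command(input_dir, output_dir, fmt="txt", recursive=True, use_settings=True):
    """Build the argument list for a batch transcription run."""
    cmd = [
        "camovoice",
        "--input-dir", str(input_dir),
        "--output-dir", str(output_dir),
        "--format", fmt,
    ]
    if recursive:
        cmd.append("--recursive")
    if not use_settings:
        cmd.append("--no-settings")  # ignore the GUI's settings.json
    return cmd


def run_batch(input_dir, output_dir, **options):
    """Run the batch and return the process exit code."""
    completed = subprocess.run(batch_command(input_dir, output_dir, **options))
    return completed.returncode


print(batch_command("recordings", "transcripts"))
```

Passing the arguments as a list avoids shell quoting problems with folder names that contain spaces.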

For the complete command reference, installation notes, exit codes, and automation examples, see the CamoVoice CLI Guide on GitHub.

How It Works (Technical)

CamoVoice is built on several key technologies:

Speech Recognition

Two model options: Fast mode uses base.en (~140MB), Thinking mode uses distil-large-v3 (~1.4GB)
faster-whisper: CTranslate2-based reimplementation of Whisper; its maintainers report up to 4x faster inference than the original OpenAI implementation at the same accuracy, with lower memory usage
Models are bundled with the application — no download required after installation
All processing uses your CPU with int8 quantization for efficiency

Audio Capture

SoundDevice: Cross-platform audio recording
Records at 16kHz mono (Whisper's native sample rate)
Audio is processed directly in memory — no temporary files

Text-to-Speech

pyttsx3: Interfaces with your operating system's speech synthesis
Uses voices installed on your system (SAPI5 on Windows, NSSpeechSynthesizer on macOS)

User Interface

CustomTkinter: Modern-looking Python GUI framework
Dark theme with orange accent colors
Responsive layout that scales with window size (minimum 640×480)

Troubleshooting

"No voices available" for Playback

Your system may not have TTS voices installed. Install voices through your operating system's speech settings.

Recording doesn't start

Check that your microphone is connected and selected in Settings
Ensure no other application is exclusively using the microphone
Try selecting a different Input Device

Transcription is inaccurate

Speak clearly and at a moderate pace
Reduce background noise
Try switching to Thinking mode for better accuracy
Ensure you're speaking English (CamoVoice is optimized for English)

App is slow on first transcription

The first transcription may take slightly longer as the model initializes. Subsequent transcriptions will be faster. Switching modes also requires loading a different model.

`camovoice` is not recognized (CLI)

Open a new terminal after installation. On Windows, confirm the installer option Add camovoice command line tool to PATH was selected. See the CamoVoice CLI Guide for PATH and troubleshooting steps.

Tips for Best Results

Speak clearly with natural pacing — the model handles conversational speech well
Minimize background noise — even with noise handling, clearer audio = better results
Use Fast mode for quick notes or short back-to-back dictations, and Thinking mode for important, uninterrupted, longer transcriptions
Edit as you go — freely adjust the text manually to fix mistakes or add/remove text before or after transcriptions

Enterprise Customization & Expansion Options

17 input languages: English, Spanish, French, German, Italian, Portuguese, Dutch, Russian, Chinese, Japanese, Korean, Polish, Turkish, Ukrainian, Arabic, Hindi, and Vietnamese
Option to auto-detect input language
Option to auto-translate to English
Additional file formats for input sounds and output files
Enhanced audit trail features and export formats

Privacy Commitment

✓ No internet connection required after installation
✓ No accounts, no sign-ups, no authentication
✓ No analytics or usage tracking
✓ No audio sent to any server
✓ Settings stored locally in plain JSON
✓ Models bundled locally — no APIs, no third party exposure

CamoVoice v1.0.0

CamoVoice International User Guide

Fully Private, Offline Multilingual Speech-to-Text

Overview

CamoVoice International is a desktop speech-to-text application designed with one core principle: your voice stays on your device. Unlike cloud-based transcription services, CamoVoice processes everything locally using a bundled multilingual AI model — no internet connection required, no data leaves your computer, and absolutely zero telemetry.

The English desktop build also includes CamoVoice CLI for headless file transcription; see the CLI documentation for command-line usage.

Privacy & Architecture

100% Offline: All transcription happens on your machine using faster-whisper, which runs entirely locally
Zero Telemetry: CamoVoice collects nothing. No usage analytics, no crash reports, no audio samples
No Cloud Dependencies: Once installed, the app works without any network connection
Local Settings: Your preferences are stored in a simple settings.json file on your device and auto-loaded at open
Multilingual: Transcribes English, Spanish, German, French, Italian, Portuguese, Dutch, Polish, Swedish, and Russian — or auto-detects the language from your audio

This architecture makes CamoVoice ideal for transcribing sensitive notes, legal dictation, personal journals, or any audio where privacy matters.

Multilingual Support

CamoVoice International supports transcription in ten languages out of the box, plus automatic language detection from the audio itself.

Supported Languages

Language	Code	Interface Language Available
Auto-detect	—	— (UI stays in English)
English	en	✓
Español	es	✓
Deutsch	de	✓
Français	fr	✓
Italiano	it	—
Português	pt	—
Nederlands	nl	—
Polski	pl	—
Svenska	sv	—
Русский	ru	—

Choosing a Language

Open Settings and use the Input Language dropdown. The selection does two things:

Transcription: The selected language is passed to faster-whisper so it knows which phonemes to expect. This generally improves accuracy over auto-detect when you know the language up front.
Interface language: If you choose English, Spanish, German, or French, the main-window buttons, Settings window, Custom Words window, Save As dialog, and common status messages re-label into that language. Other languages transcribe correctly but keep the interface in English.
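The two effects of the dropdown can be summarised as a small mapping. The sketch below is illustrative: the function name and the "auto" sentinel are hypothetical, while the language codes and the four translated interfaces come from the table above.

```python
# Languages whose interface translation is available, per the table above.
UI_LANGUAGES = {"en", "es", "de", "fr"}
TRANSCRIPTION_LANGUAGES = ("en", "es", "de", "fr", "it", "pt", "nl", "pl", "sv", "ru")


def resolve_languages(selection):
    """Map a dropdown choice to (transcription_language, ui_language).

    A transcription language of None means the engine should auto-detect.
    """
    if selection == "auto":
        return None, "en"  # UI stays in English for auto-detect
    if selection not in TRANSCRIPTION_LANGUAGES:
        raise ValueError(f"unsupported language code: {selection}")
    ui_language = selection if selection in UI_LANGUAGES else "en"
    return selection, ui_language


print(resolve_languages("de"))
print(resolve_languages("sv"))
print(resolve_languages("auto"))
```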

Auto-Detect

If you leave Input Language on Auto-detect, faster-whisper inspects the first thirty seconds of audio and decides the language on its own. This is convenient for multilingual teams or when loading an unfamiliar audio file, but transcription starts slightly slower because detection runs before decoding.

User Interface

CamoVoice features a high-contrast, dark-themed interface designed for simplicity and accessibility.

Main Window Layout

┌─────────────────────────────────────────────────────────────┐
│  [Load Audio File]          🎙 Record Button     Status     │
│  [Settings]        "Click or hold spacebar"     🎤 Device   │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│                   Transcription Output                      │
│                      (editable text)                        │
│                                                             │
├─────────────────────────────────────────────────────────────┤
│  [Copy Text] [▶ Play] [Save As]       [Undo Clear] [Clear] │
├─────────────────────────────────────────────────────────────┤
│  [Input Audio: ████████░░] ← (appears during recording)     │
├─────────────────────────────────────────────────────────────┤
│  Text Size: ═══════●═══════    Mode: [Fast|Thinking]        │
└─────────────────────────────────────────────────────────────┘

Recording Audio

There are two ways to record:

Click the Microphone Button: Click once to start, click again to stop
Hold Spacebar: Hold spacebar to record, release to stop (push-to-talk style)
- A brief 200 ms delay prevents accidental recordings from quick taps
- If you're typing in the text area, spacebar works normally

During recording, the microphone button pulses and fills with the accent color. When you stop, transcription begins automatically.

Global Dictation Hotkey

CamoVoice can also record and paste transcriptions into any other application, with no need to have the CamoVoice window focused. Enable "Enable global hotkey (Ctrl+` / ⌃`)" in Settings, then:

Place your cursor in any text field (email, chat, document, etc.).
Press Ctrl + ` on Windows or ⌃ + ` on macOS to start recording.
A small dark REC pill appears in the top-left of your primary screen with a blinking dot and a live audio-level bar so you always know the mic is hot.
Press the same shortcut again to stop. CamoVoice transcribes in your chosen input language, copies the result to your clipboard, and automatically pastes it where your cursor is.

The global hotkey is disabled by default and only activates while the app is running. The default combo is Ctrl+` (backtick) — chosen rather than Ctrl+Space or Ctrl+Shift+Space so it doesn't collide with macOS Spotlight, the input-source switcher, or common editor shortcuts.

Custom hotkey: Click Change Hotkey in Settings and press any combination of Ctrl, Alt, or Shift plus one non-modifier key to rebind the shortcut. The new combo is shown next to the enable checkbox and is saved with your settings.

Recording Duration Timer

For longer recordings, a duration timer appears after 1 minute of recording, showing your elapsed time and the mode's limit (e.g., 3:45 / 15:00). The timer changes color as you approach the limit:

Gray: Normal — plenty of time remaining
Orange: Warning — less than 1 minute remaining
Red: Critical — less than 30 seconds remaining

This helps you pace your recordings and avoid hitting the automatic cutoff unexpectedly.

Transcription Animation

When CamoVoice is processing your audio, you'll see an animated "Transcribing..." status with cycling dots (translated into your UI language), and the text area border will pulse with the accent color. This provides clear visual feedback that your audio is being processed.

Audio Level Indicator

While recording, CamoVoice displays a thin Input Audio meter below the action buttons that shows real-time audio levels:

Orange gradient bar: Shows current audio input level
Yellow peak indicator: Marks the highest audio peaks
Automatic display: Appears when recording starts
Smart hiding: Disappears 1.5 seconds after recording stops

This visual feedback confirms your microphone is working and picking up audio.

Loading Audio & Video Files

Click Load Audio File or drag and drop a supported file onto the CamoVoice window to transcribe existing audio or the audio track of a video. CamoVoice supports:

WAV — Any sample rate or bit depth
MP3 — Decoded natively with the bundled miniaudio library
M4A / AAC / FLAC / OGG
MP4 / WEBM / MOV / AVI / MKV / M4V / WMV video — The audio track is extracted; a clear error is shown if a video has no audio track

Note: All compressed-audio and video support works fully offline after installation.

Transcription Output

The large text area displays your transcriptions. Each new recording or file appends to existing text, making it easy to build up longer documents.

Working with Text

Edit freely: Click into the text area to make corrections or additions
Copy Text: Copies all text to your clipboard (button flashes green to confirm)
▶ Play: Reads the transcription aloud using text-to-speech (see Playback Voice below)
Save As: Opens a save dialog with timestamp options (see below)
Undo Clear: Restores the most recently cleared text (grayed out when nothing to undo)
Clear: Removes all text and timestamps

Undo Clear

If you accidentally clear your text, you have two options:

Click the Undo Clear button (located to the left of Clear)
Press Ctrl+Z (Windows) or ⌘Z (macOS)

This restores both the text and any associated timestamps from the most recent clear action. The Undo Clear button is grayed out when there's nothing to restore.

If the undo itself was a mistake and you want to re-clear the restored text, press Ctrl+Shift+Z (Windows) or ⇧⌘Z (macOS). The text and timestamps remain available for undo afterward, so you can freely toggle between the two states.

Save As Options

When you click Save As (or press Ctrl+S / ⌘S), a dialog appears with options:

Include date at top of document: Adds a header with the current date
Include timestamp after each transcription: Adds a [Recorded: DD-Mon-YYYY HH:MM:SS] line after each transcription segment
Include edits to transcription: Saves the text exactly as it appears in the app, including any spelling corrections or edits you've made. When this is enabled, timestamps are disabled since edited text may no longer match the original transcription segments.

Export Formats

Format	Extension	Best For
Text File	`.txt`	Universal compatibility, email, plain text editors
Word Document	`.docx`	Professional documents, further editing in Microsoft Word
PDF Document	`.pdf`	Sharing, archiving, printing (read-only format)

Select your desired format from the "Save as type" dropdown in the file dialog.

Transcription Modes

CamoVoice International offers two transcription modes, selectable at the bottom of the window. Both modes share the same multilingual base model (~74 MB covering 99 languages). They differ only in decoding parameters — so switching modes is instant and never reloads the model.

Mode	Speed	Accuracy	Best For
Fast	★★★★★	★★★☆☆	Quick notes, short recordings, real-time feel
Thinking	★★★☆☆	★★★★★	Important transcriptions, longer recordings

Fast mode: Greedy decoding (beam_size=1) with no voice-activity detection. Great for quick voice memos, short dictation, or when you want near-instant results.
Thinking mode: Beam-search decoding (beam_size=5) with voice-activity detection. Better accuracy for complex vocabulary, accents, and noisy audio.
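Because both modes share one loaded model, switching between them only changes the arguments passed at transcription time. A minimal sketch, assuming an already loaded faster_whisper.WhisperModel: beam_size, vad_filter, and language are real parameters of its transcribe method, while the DECODING_OPTIONS table and the transcribe helper are hypothetical names for this example.

```python
# Decoding parameters per mode, taken from the descriptions above.
DECODING_OPTIONS = {
    "fast": {"beam_size": 1, "vad_filter": False},      # greedy, no VAD
    "thinking": {"beam_size": 5, "vad_filter": True},   # beam search with VAD
}


def transcribe(model, audio, mode="fast", language=None):
    """Transcribe audio with the options for the chosen mode.

    model: a loaded faster_whisper.WhisperModel (shared by both modes).
    audio: a file path or a 16 kHz mono float32 waveform.
    language: a code such as "en", or None for automatic detection.
    """
    options = DECODING_OPTIONS[mode]
    segments, _info = model.transcribe(audio, language=language, **options)
    # segments is a generator; joining it is what actually runs the decoding.
    return " ".join(segment.text.strip() for segment in segments)
```

No model reload appears anywhere in the helper, which is why a mode switch can be instant.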

Live recording hard-stops at 15 minutes (Fast) or 7 minutes (Thinking) with no chunking. Loaded files that exceed either the duration or size limit are split automatically; see Audio Duration & Size Limits above.
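The exact splitting rules live in the limits section, so the following is only a sketch of the general idea. It assumes, without confirmation from the guide, that a loaded file is cut into consecutive pieces no longer than the live-recording limit of the active mode; the names MAX_MINUTES and chunk_bounds are hypothetical.

```python
SAMPLE_RATE = 16_000                       # samples per second after resampling
MAX_MINUTES = {"fast": 15, "thinking": 7}  # per-mode duration limits


def chunk_bounds(total_samples, mode):
    """Return (start, end) sample ranges that each fit within the mode's limit."""
    limit = MAX_MINUTES[mode] * 60 * SAMPLE_RATE
    return [(start, min(start + limit, total_samples))
            for start in range(0, total_samples, limit)]
```

For example, a 20-minute file in Thinking mode would yield three ranges: two full 7-minute pieces and a 6-minute remainder.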

Settings

Click the Settings button to open the settings panel.

Input Device

Select which microphone to use for recording. Options include:

Default: Uses your system's default input device
All detected input devices are listed with their audio API

Input Language

Choose the language of the audio you will transcribe, or leave it on Auto-detect. If you choose English, Spanish, German, or French, the app's interface text also switches to that language. Other languages transcribe correctly but keep the interface in English. See the Multilingual Support section above for details.
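The two effects of this setting can be sketched as a small lookup. The language codes are the standard Whisper codes, and passing None as the language is how faster-whisper is asked to detect it. The function name resolve_language and the choice to keep the interface in English under Auto-detect are assumptions for illustration; the guide does not state the latter.

```python
LANGUAGE_CODES = {
    "English": "en", "Spanish": "es", "German": "de", "French": "fr",
    "Italian": "it", "Portuguese": "pt", "Dutch": "nl", "Polish": "pl",
    "Swedish": "sv", "Russian": "ru",
}
LOCALIZED_UI = {"en", "es", "de", "fr"}  # languages with a translated interface


def resolve_language(selection):
    """Map a dropdown choice to (transcription_language, interface_language)."""
    if selection == "Auto-detect":
        return None, "en"  # None means detect; the English UI is an assumption
    code = LANGUAGE_CODES[selection]
    ui_language = code if code in LOCALIZED_UI else "en"
    return code, ui_language
```

The first value would be passed as the language argument when transcribing; the second only affects which interface strings are shown.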

Playback Voice

Choose the voice used for the ▶ Play feature, which reads your transcription aloud.

These are system voices installed on your device, not voices bundled with CamoVoice. To add or remove voices:

Windows: Settings → Time & Language → Speech → Manage voices
macOS: System Settings → Accessibility → Spoken Content → System Voice → Manage Voices
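Since playback relies on voices installed in the operating system, choosing one comes down to picking an entry from the list the speech engine reports. A minimal sketch using pyttsx3 (init, getProperty("voices"), setProperty("voice", ...), say, and runAndWait are real pyttsx3 calls); the helper names pick_voice and speak are hypothetical, and matching by a name fragment is an assumption.

```python
def pick_voice(voices, preferred_name):
    """Return the id of the first voice whose name contains preferred_name."""
    wanted = preferred_name.lower()
    for voice in voices:
        if wanted in (voice.name or "").lower():
            return voice.id
    return None


def speak(text, preferred_name=""):
    """Read text aloud with a system voice, falling back to the default."""
    import pyttsx3  # imported here so pick_voice can be used without it

    engine = pyttsx3.init()
    if preferred_name:
        voice_id = pick_voice(engine.getProperty("voices"), preferred_name)
        if voice_id is not None:
            engine.setProperty("voice", voice_id)
    engine.say(text)
    engine.runAndWait()  # blocks until playback has finished
```

If the engine reports an empty voice list, there is nothing to select, which corresponds to the "No voices available" case in Troubleshooting.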

Show Timestamps in Transcriptions

When enabled, each transcription segment displays its recording time directly in the app:

Your transcribed text here. [Recorded: 09-Jan-2026 14:32:15]

Timestamps are always stored internally — this setting controls only whether they're visible in the app. You can still include or exclude them when saving.

Keep Window Always on Top

When enabled, the CamoVoice window stays visible above other applications. Useful for dictating while reading documents or multitasking.

Enable Global Hotkey

Toggles the system-wide dictation hotkey (Ctrl+` on Windows, ⌃` on macOS) described in the Global Dictation Hotkey section.

Custom Words

CamoVoice allows you to add custom vocabulary — specialized terms, names, jargon, or unusual words. This helps the model recognise these words more accurately.

To manage custom words:

Open Settings
Click Custom Words (n) (where n is your current word count)
In the Custom Words window:
Type a word or phrase and click Add (or press Enter)
View all your current custom words in the scrollable list
Click the ✕ next to any word to remove it
Click Clear All to remove all custom words
Click Done when finished

Notes:

Custom words are suggestions that increase recognition likelihood; they are not guaranteed to be transcribed exactly
You can add up to 100 custom words
Words are case-preserved but duplicate-checked case-insensitively
Custom words are saved automatically and persist between sessions
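The rules for the custom word list (a 100-entry cap, original casing kept, duplicates detected regardless of case) can be expressed in a few lines. This is a sketch of those rules only; the function name add_custom_word is hypothetical, and how the words are handed to the model is not covered here.

```python
MAX_CUSTOM_WORDS = 100


def add_custom_word(words, candidate):
    """Append candidate to words if the rules allow it; return True if added."""
    word = candidate.strip()
    if not word or len(words) >= MAX_CUSTOM_WORDS:
        return False
    # Compare case-insensitively so "Kubernetes" and "kubernetes" count as one.
    if word.casefold() in {existing.casefold() for existing in words}:
        return False
    words.append(word)  # stored exactly as typed
    return True
```

Using casefold rather than lower keeps the duplicate check sensible for non-English entries as well.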

Accessibility Features

Scalable Interface

The Text Size slider adjusts the font size throughout the entire application — transcription text, buttons, labels, status messages, and the mode selector. Slide right for larger text, left for smaller. Your preference is automatically saved.

Keyboard Shortcuts

Action	Windows	macOS
Hold to record	Spacebar (hold)	Spacebar (hold)
Global dictation toggle (system-wide)	Ctrl+`	⌃`
Save As	Ctrl+S	⌘S
Undo clear	Ctrl+Z	⌘Z
Redo clear (revert undo)	Ctrl+Shift+Z	⇧⌘Z
Close dialogs	Escape	Escape

Note: The Escape key closes Settings and Save As dialogs without saving changes.

How It Works (Technical)

Model: A single multilingual base model (~74 MB, 99 languages). Both Fast and Thinking modes use the same weights.
faster-whisper: CTranslate2-based implementation that its maintainers describe as up to 4× faster than the original Whisper, with lower memory usage.
CPU with int8 quantisation: No GPU required.
Audio: Recorded at 16 kHz mono (Whisper's native sample rate) via SoundDevice, processed in memory — no temporary files.
TTS: pyttsx3 drives SAPI5 on Windows and NSSpeechSynthesizer on macOS.
UI: CustomTkinter, dark theme, minimum window 640×480.
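To make the stack above concrete, here is a hedged sketch of loading the model on CPU with int8 quantisation and capturing 16 kHz mono audio in memory. WhisperModel and its device and compute_type arguments, as well as sounddevice's rec and wait, are real library calls; the constant and function names are hypothetical, and a bundled build would presumably pass a local model directory rather than the "base" identifier used here.

```python
SAMPLE_RATE = 16_000  # Hz, mono (Whisper's native input rate)

MODEL_SETTINGS = {
    "model_size_or_path": "base",  # assumption: a bundled app would use a local path
    "device": "cpu",               # no GPU required
    "compute_type": "int8",        # int8 quantisation
}


def frames_for(seconds):
    """Number of audio frames needed for a recording of the given length."""
    return int(seconds * SAMPLE_RATE)


def load_model():
    from faster_whisper import WhisperModel  # imported lazily for this sketch

    return WhisperModel(**MODEL_SETTINGS)


def record(seconds):
    """Record from the default input device and return a 1-D float32 array."""
    import sounddevice as sd

    audio = sd.rec(frames_for(seconds), samplerate=SAMPLE_RATE,
                   channels=1, dtype="float32")
    sd.wait()           # block until the recording is complete
    return audio[:, 0]  # drop the channel axis; nothing is written to disk
```

The array returned by record can be passed straight to the model's transcribe method, which is how the audio stays in memory from capture to text.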

Troubleshooting

Transcription is in the wrong language

Open Settings → Input Language and pick the language you are speaking, or leave on Auto-detect for mixed-language audio.

"No voices available" for Playback

Your system may not have TTS voices installed. Install voices through your operating system's speech settings.

Recording doesn't start

Check that your microphone is connected and selected in Settings
Ensure no other application is exclusively using the microphone
Try selecting a different Input Device

Transcription is inaccurate

Speak clearly and at a moderate pace
Reduce background noise
Try switching to Thinking mode for better accuracy
Make sure Input Language matches the actual language in the audio

App is slow on first transcription

The first transcription may take slightly longer as the model initialises. Subsequent transcriptions will be faster.

Tips for Best Results

Speak clearly with natural pacing: the model handles conversational speech well
Minimise background noise: clearer audio gives better results
Use Fast mode for quick notes, Thinking mode for important transcriptions
Edit as you go: correct mistakes or rework the text around each transcription directly in the text area
Set Input Language when you know it up front: this skips the auto-detection pass and can improve accuracy

Privacy Commitment

✓ No internet connection required after installation
✓ No accounts, no sign-ups, no authentication
✓ No analytics or usage tracking
✓ No audio sent to any server
✓ Settings stored locally in plain JSON
✓ Model bundled locally — no APIs, no third-party exposure

Guía del usuario (Español)

Reconocimiento de voz multilingüe, privado y sin conexión

Descripción general

CamoVoice International es una aplicación de reconocimiento de voz para escritorio diseñada con un principio fundamental: tu voz permanece en tu dispositivo. A diferencia de los servicios de transcripción en la nube, CamoVoice procesa todo localmente mediante un modelo de IA multilingüe incluido — sin conexión a internet, sin que los datos salgan de tu equipo y sin ninguna telemetría.

Privacidad y arquitectura

100 % sin conexión: toda la transcripción se realiza en tu equipo con faster-whisper, que funciona localmente
Cero telemetría: CamoVoice no recopila nada. Sin análisis de uso, sin informes de errores, sin muestras de audio
Sin dependencias en la nube: una vez instalado, la aplicación funciona sin conexión
Configuración local: tus preferencias se guardan en un archivo settings.json en tu dispositivo y se cargan automáticamente al abrir
Multilingüe: transcribe inglés, español, alemán, francés, italiano, portugués, neerlandés, polaco, sueco y ruso — o detecta el idioma automáticamente

Compatibilidad multilingüe

CamoVoice International admite la transcripción en diez idiomas, además de la detección automática del idioma en el audio.

Elegir un idioma

Abre Configuración y usa el desplegable Idioma de entrada. La selección tiene dos efectos:

Transcripción: el idioma elegido se pasa a faster-whisper para que sepa qué fonemas esperar. Esto suele mejorar la precisión frente a la detección automática cuando conoces el idioma de antemano.
Idioma de la interfaz: si eliges inglés, español, alemán o francés, los botones de la ventana principal, la ventana de Configuración, la ventana de Palabras personalizadas, el cuadro de diálogo de Guardar como y los mensajes de estado habituales se muestran en ese idioma. Los demás idiomas se transcriben correctamente, pero la interfaz permanece en inglés.

Detección automática

Si dejas el idioma de entrada en Detección automática, faster-whisper analiza los primeros treinta segundos de audio y determina el idioma. Es útil para equipos multilingües o para archivos desconocidos, aunque la transcripción comienza algo más lenta porque la detección se ejecuta antes de la decodificación.

Interfaz

CamoVoice usa una interfaz oscura de alto contraste pensada para la sencillez y la accesibilidad.

Grabar audio

Botón del micrófono: haz clic una vez para iniciar, haz clic de nuevo para detener.
Mantener la barra espaciadora: mantén la barra espaciadora para grabar, suéltala para detener (estilo pulsar-para-hablar).
Un retardo breve de 200 ms evita grabaciones accidentales al pulsar rápidamente.
Si estás escribiendo en el área de texto, la barra espaciadora funciona con normalidad.

Atajo global de dictado

CamoVoice también puede grabar y pegar transcripciones en cualquier otra aplicación. Activa «Activar atajo global (Ctrl+` / ⌃`)» en Configuración y:

Coloca el cursor en cualquier campo de texto.
Pulsa Ctrl+` (Windows) o ⌃` (macOS) para empezar a grabar.
Aparece una pequeña píldora oscura REC en la esquina superior izquierda con un punto parpadeante y una barra de nivel de audio en directo.
Pulsa Ctrl+` / ⌃` de nuevo para detener. CamoVoice transcribe en el idioma seleccionado, copia el resultado al portapapeles y lo pega donde esté el cursor.

Cargar archivos de audio

Haz clic en Cargar archivo. Se admiten audio WAV, MP3, M4A, AAC, FLAC y OGG, además de vídeo MP4, WEBM, MOV, AVI, MKV, M4V y WMV (se transcribe la pista de audio). Los archivos se remuestrean automáticamente a 16 kHz mono. Los límites son 90 MB (Rápido) y 45 MB (Preciso).

Trabajar con el texto

Editar libremente: haz clic en el área de texto para hacer correcciones o añadidos
Copiar texto: copia todo el texto al portapapeles
▶ Reproducir: lee la transcripción en voz alta con texto a voz
Guardar como: abre el cuadro de diálogo con opciones de marcas de tiempo
Deshacer borrado: restaura el último texto borrado (Ctrl+Z / ⌘Z). Para rehacerlo, Ctrl+Shift+Z / ⇧⌘Z.

Opciones de Guardar como

Al pulsar Guardar como (Ctrl+S / ⌘S) aparece un cuadro con opciones para incluir la fecha, incluir una marca de tiempo tras cada transcripción o incluir las ediciones manuales. Formatos de exportación: TXT, DOCX y PDF.

Modos de transcripción

Ambos modos usan el mismo modelo multilingüe base (~74 MB, 99 idiomas) y se diferencian sólo en la decodificación:

Modo	Velocidad	Precisión	Mejor para
Rápido	★★★★★	★★★☆☆	Notas rápidas, grabaciones cortas
Preciso	★★★☆☆	★★★★★	Grabaciones importantes o largas

Las grabaciones tienen un límite (15 minutos en Rápido, 7 minutos en Preciso); se detienen y transcriben automáticamente al llegar al límite.

Configuración

Dispositivo de entrada: elige el micrófono.
Idioma de entrada: elige el idioma o Detección automática. También cambia el idioma de la interfaz para en/es/de/fr.
Voz de reproducción: voces del sistema operativo (no incluidas con la app).
Mostrar marcas de tiempo en las transcripciones: muestra fecha y hora tras cada transcripción.
Mantener la ventana siempre visible: mantiene CamoVoice por encima de otras aplicaciones.
Activar atajo global (Ctrl+`): activa el dictado a nivel de sistema.
Palabras personalizadas: hasta 100 términos, nombres o jerga para mejorar el reconocimiento.

Atajos de teclado

Acción	Windows	macOS
Mantener para grabar	Espacio (mantener)	Espacio (mantener)
Atajo global de dictado	Ctrl+`	⌃`
Guardar como	Ctrl+S	⌘S
Deshacer borrado	Ctrl+Z	⌘Z
Rehacer borrado	Ctrl+Shift+Z	⇧⌘Z
Cerrar diálogos	Esc	Esc

Solución de problemas

La transcripción está en otro idioma: abre Configuración → Idioma de entrada y selecciona el idioma correcto o déjalo en Detección automática.
"No hay voces disponibles": instala voces a través de la configuración de voz de tu sistema operativo.
No inicia la grabación: comprueba el micrófono, que no esté bloqueado por otra aplicación, y prueba otro dispositivo de entrada.
Transcripción inexacta: habla con claridad, reduce el ruido de fondo, prueba el modo Preciso y confirma el idioma de entrada.

Compromiso de privacidad

✓ Sin conexión a internet tras la instalación
✓ Sin cuentas ni registros
✓ Sin análisis ni seguimiento
✓ Sin envío de audio a ningún servidor
✓ Configuración guardada localmente en JSON
✓ Modelo incluido localmente, sin APIs de terceros

Benutzerhandbuch (Deutsch)

Vollständig private, offline funktionierende mehrsprachige Sprache-zu-Text-Anwendung

Überblick

CamoVoice International ist eine Desktop-Sprache-zu-Text-Anwendung mit einem Kernprinzip: Ihre Stimme bleibt auf Ihrem Gerät. Im Gegensatz zu Cloud-Transkriptionsdiensten verarbeitet CamoVoice alles lokal mit einem mitgelieferten mehrsprachigen KI-Modell — keine Internetverbindung nötig, keine Daten verlassen Ihren Computer, keinerlei Telemetrie.

Datenschutz und Architektur

100 % offline: Die gesamte Transkription läuft lokal über faster-whisper
Keine Telemetrie: CamoVoice sammelt nichts. Keine Nutzungsdaten, keine Absturzberichte, keine Audiobeispiele
Keine Cloud-Abhängigkeiten: Nach der Installation arbeitet die App ohne Netzverbindung
Lokale Einstellungen: Ihre Präferenzen werden in einer settings.json-Datei auf Ihrem Gerät gespeichert
Mehrsprachig: Transkribiert Englisch, Spanisch, Deutsch, Französisch, Italienisch, Portugiesisch, Niederländisch, Polnisch, Schwedisch und Russisch — oder erkennt die Sprache automatisch

Mehrsprachigkeit

CamoVoice International unterstützt zehn Sprachen sowie automatische Spracherkennung.

Sprache wählen

Öffnen Sie Einstellungen und nutzen Sie das Dropdown-Menü Eingabesprache. Die Auswahl hat zwei Effekte:

Transkription: Die gewählte Sprache wird an faster-whisper übergeben, damit es die richtigen Phoneme erwartet. Das verbessert die Genauigkeit gegenüber der automatischen Erkennung, wenn Sie die Sprache kennen.
Oberflächensprache: Bei Englisch, Spanisch, Deutsch oder Französisch wechseln Hauptfenster-Schaltflächen, Einstellungen, Fenster für eigene Wörter, Speichern-unter-Dialog und gängige Statusmeldungen in diese Sprache. Andere Sprachen werden korrekt transkribiert, die Oberfläche bleibt jedoch auf Englisch.

Automatische Erkennung

Bei Automatisch erkennen prüft faster-whisper die ersten dreißig Sekunden des Audios und ermittelt die Sprache. Praktisch für mehrsprachige Teams oder unbekannte Dateien; die Transkription startet etwas langsamer.

Oberfläche

Aufnehmen

Mikrofon-Schaltfläche: Klicken zum Starten, erneut klicken zum Stoppen.
Leertaste halten: Leertaste halten zum Aufnehmen, loslassen zum Stoppen.
Eine kurze Verzögerung von 200 ms verhindert versehentliche Aufnahmen.
Beim Tippen im Textfeld funktioniert die Leertaste normal.

Globales Tastenkürzel

CamoVoice kann Transkriptionen auch in jede andere Anwendung einfügen. Aktivieren Sie „Globales Tastenkürzel aktivieren (Ctrl+` / ⌃`)" in den Einstellungen und:

Cursor in ein beliebiges Textfeld setzen.
Ctrl+` (Windows) oder ⌃` (macOS) drücken, um die Aufnahme zu starten.
Eine kleine dunkle REC-Pille erscheint oben links mit blinkendem Punkt und Live-Pegelanzeige.
Erneut Ctrl+` / ⌃` drücken, um zu stoppen. CamoVoice transkribiert, kopiert das Ergebnis in die Zwischenablage und fügt es am Cursor ein.

Audiodateien laden

Klicken Sie auf Datei laden. Unterstützt werden WAV, MP3, M4A, AAC, FLAC und OGG (Audio) sowie MP4, WEBM, MOV, AVI, MKV, M4V und WMV (Video; die Tonspur wird transkribiert). Dateien werden automatisch auf 16 kHz Mono konvertiert. Größenlimits: 90 MB (Schnell) und 45 MB (Präzise).

Text bearbeiten

Frei bearbeiten: In das Textfeld klicken, um Korrekturen oder Ergänzungen vorzunehmen
Text kopieren: Kopiert den gesamten Text in die Zwischenablage
▶ Abspielen: Liest die Transkription per TTS vor
Speichern unter: Öffnet den Speichern-Dialog mit Zeitstempel-Optionen
Löschen rückgängig: Stellt den zuletzt gelöschten Text wieder her (Ctrl+Z / ⌘Z). Wiederherstellen mit Ctrl+Shift+Z / ⇧⌘Z.

Speichern-unter-Optionen

Bei Speichern unter (Ctrl+S / ⌘S) erscheint ein Dialog mit Optionen: Datum am Anfang einfügen, Zeitstempel nach jeder Transkription einfügen, Bearbeitungen einbeziehen. Formate: TXT, DOCX, PDF.

Transkriptionsmodi

Beide Modi verwenden dasselbe mehrsprachige base-Modell (~74 MB, 99 Sprachen) und unterscheiden sich nur in den Dekodierungsparametern:

Modus	Geschwindigkeit	Genauigkeit	Am besten für
Schnell	★★★★★	★★★☆☆	Kurze Notizen, schnelle Aufnahmen
Präzise	★★★☆☆	★★★★★	Wichtige oder längere Aufnahmen

Aufnahmezeitlimits: 15 Minuten (Schnell), 7 Minuten (Präzise); das Aufnehmen stoppt automatisch am Limit.

Einstellungen

Eingabegerät: Mikrofon wählen.
Eingabesprache: Sprache oder Automatisch erkennen. Ändert auch die Oberflächensprache für en/es/de/fr.
Wiedergabestimme: Systemstimmen (nicht von CamoVoice bereitgestellt).
Zeitstempel in Transkriptionen anzeigen: zeigt Datum/Uhrzeit nach jeder Transkription.
Fenster immer im Vordergrund: hält CamoVoice über anderen Anwendungen.
Globales Tastenkürzel aktivieren (Ctrl+`): aktiviert systemweites Diktat.
Eigene Wörter: bis zu 100 Begriffe, Namen oder Fachausdrücke zur Erkennungsverbesserung.

Tastenkürzel

Aktion	Windows	macOS
Halten zum Aufnehmen	Leertaste (halten)	Leertaste (halten)
Globales Diktat	Ctrl+`	⌃`
Speichern unter	Ctrl+S	⌘S
Löschen rückgängig	Ctrl+Z	⌘Z
Wiederholen	Ctrl+Shift+Z	⇧⌘Z
Dialoge schließen	Esc	Esc

Fehlerbehebung

Falsche Sprache: Einstellungen → Eingabesprache auf die gesprochene Sprache setzen oder auf Automatisch belassen.
„Keine Stimmen verfügbar": TTS-Stimmen über die Systemeinstellungen installieren.
Aufnahme startet nicht: Mikrofon prüfen, anderes Eingabegerät testen.
Ungenaue Transkription: deutlich sprechen, Hintergrundgeräusche reduzieren, Präzise-Modus versuchen, Eingabesprache prüfen.

Datenschutzversprechen

✓ Keine Internetverbindung nach der Installation
✓ Keine Konten, keine Anmeldung
✓ Keine Analytik, kein Tracking
✓ Kein Audio an Server gesendet
✓ Einstellungen lokal in JSON
✓ Modell lokal gebündelt, keine Drittanbieter-APIs

Guide de l'utilisateur (Français)

Reconnaissance vocale multilingue, privée et hors ligne

Aperçu

CamoVoice International est une application de reconnaissance vocale de bureau conçue autour d'un principe : votre voix reste sur votre appareil. Contrairement aux services de transcription en ligne, CamoVoice traite tout localement avec un modèle d'IA multilingue intégré — aucune connexion internet, aucune donnée ne quitte votre ordinateur, aucune télémétrie.

Confidentialité et architecture

100 % hors ligne : toute la transcription se fait sur votre appareil via faster-whisper
Zéro télémétrie : CamoVoice ne collecte rien. Pas d'analyse, pas de rapport d'incident, pas d'échantillon audio
Aucune dépendance cloud : après installation, l'application fonctionne sans réseau
Paramètres locaux : vos préférences sont dans un fichier settings.json chargé automatiquement
Multilingue : transcrit anglais, espagnol, allemand, français, italien, portugais, néerlandais, polonais, suédois et russe — ou détecte la langue automatiquement

Prise en charge multilingue

CamoVoice International prend en charge dix langues, ainsi que la détection automatique de la langue à partir de l'audio.

Choisir une langue

Ouvrez Paramètres et utilisez la liste déroulante Langue d'entrée. Cette sélection a deux effets :

Transcription : la langue choisie est transmise à faster-whisper pour orienter la reconnaissance phonétique. Cela améliore généralement la précision par rapport à la détection automatique.
Langue de l'interface : si vous choisissez anglais, espagnol, allemand ou français, les boutons de la fenêtre principale, la fenêtre Paramètres, la fenêtre Mots personnalisés, le dialogue Enregistrer sous et les messages d'état courants passent dans cette langue. Les autres langues sont transcrites correctement, mais l'interface reste en anglais.

Détection automatique

En mode Détection automatique, faster-whisper analyse les trente premières secondes de l'audio et détermine la langue. Utile pour les équipes multilingues ou les fichiers inconnus ; la transcription démarre un peu plus lentement.

Interface

Enregistrer

Bouton du microphone : un clic pour démarrer, un clic pour arrêter.
Maintenir la barre d'espace : maintenez la barre d'espace pour enregistrer, relâchez pour arrêter.
Un délai bref de 200 ms évite les enregistrements accidentels.
Lors de la saisie dans la zone de texte, la barre d'espace fonctionne normalement.

Raccourci de dictée global

CamoVoice peut également dicter dans n'importe quelle autre application. Activez « Activer le raccourci global (Ctrl+` / ⌃`) » dans Paramètres, puis :

Placez le curseur dans un champ de texte.
Appuyez sur Ctrl+` (Windows) ou ⌃` (macOS) pour démarrer l'enregistrement.
Une petite pastille sombre REC apparaît en haut à gauche avec un point clignotant et un indicateur de niveau audio.
Appuyez à nouveau sur Ctrl+` / ⌃` pour arrêter. CamoVoice transcrit, copie le résultat dans le presse-papiers et le colle à l'emplacement du curseur.

Charger des fichiers audio

Cliquez sur Charger un fichier. Formats pris en charge : audio WAV, MP3, M4A, AAC, FLAC et OGG, ainsi que vidéo MP4, WEBM, MOV, AVI, MKV, M4V et WMV (la piste audio est transcrite). Les fichiers sont automatiquement rééchantillonnés à 16 kHz mono. Limites de taille : 90 Mo (Rapide) et 45 Mo (Précis).

Travailler avec le texte

Modifier librement : cliquez dans la zone de texte pour corriger ou ajouter
Copier le texte : copie tout le texte dans le presse-papiers
▶ Lire : lit la transcription à voix haute
Enregistrer sous : ouvre le dialogue avec options d'horodatage
Annuler l'effacement : restaure le dernier texte effacé (Ctrl+Z / ⌘Z). Rétablir : Ctrl+Shift+Z / ⇧⌘Z.

Options d'enregistrement

À Enregistrer sous (Ctrl+S / ⌘S), un dialogue propose d'inclure la date en haut du document, un horodatage après chaque transcription ou les modifications manuelles. Formats : TXT, DOCX, PDF.

Modes de transcription

Les deux modes utilisent le même modèle multilingue base (~74 Mo, 99 langues) et ne diffèrent que par les paramètres de décodage :

Mode	Vitesse	Précision	Idéal pour
Rapide	★★★★★	★★★☆☆	Notes rapides, enregistrements courts
Précis	★★★☆☆	★★★★★	Transcriptions importantes ou longues

Limites d'enregistrement : 15 minutes (Rapide), 7 minutes (Précis) ; l'enregistrement s'arrête automatiquement à la limite.

Paramètres

Périphérique d'entrée : choisissez le micro.
Langue d'entrée : la langue ou Détection automatique. Change aussi la langue de l'interface pour en/es/de/fr.
Voix de lecture : voix du système d'exploitation (non fournies par CamoVoice).
Afficher les horodatages dans les transcriptions : affiche date/heure après chaque transcription.
Garder la fenêtre toujours au premier plan : maintient CamoVoice au-dessus des autres applications.
Activer le raccourci global (Ctrl+`) : active la dictée système.
Mots personnalisés : jusqu'à 100 termes, noms ou jargon pour améliorer la reconnaissance.

Raccourcis clavier

Action	Windows	macOS
Maintenir pour enregistrer	Espace (maintenir)	Espace (maintenir)
Dictée globale	Ctrl+`	⌃`
Enregistrer sous	Ctrl+S	⌘S
Annuler l'effacement	Ctrl+Z	⌘Z
Rétablir	Ctrl+Shift+Z	⇧⌘Z
Fermer les dialogues	Échap	Échap

Dépannage

Transcription dans la mauvaise langue : Paramètres → Langue d'entrée, choisir la langue correcte ou laisser sur Détection automatique.
« Aucune voix disponible » : installez des voix TTS via les paramètres de voix du système.
L'enregistrement ne démarre pas : vérifiez le micro, essayez un autre périphérique d'entrée.
Transcription imprécise : articulez clairement, réduisez le bruit, essayez le mode Précis, vérifiez la langue d'entrée.

Engagement de confidentialité

✓ Aucune connexion internet requise après installation
✓ Aucun compte, aucune inscription
✓ Aucune analytique, aucun suivi
✓ Aucun audio envoyé à un serveur
✓ Paramètres stockés localement au format JSON
✓ Modèle intégré localement, sans API tierce

Guida utente (Italiano)

Riconoscimento vocale multilingue, privato e offline

Panoramica

CamoVoice International è un'applicazione desktop di riconoscimento vocale progettata intorno a un principio fondamentale: la tua voce resta sul tuo dispositivo. A differenza dei servizi di trascrizione cloud, CamoVoice elabora tutto localmente tramite un modello di IA multilingue integrato — nessuna connessione internet, nessun dato lascia il computer, nessuna telemetria.

Privacy e architettura

100 % offline: tutta la trascrizione avviene sul tuo dispositivo tramite faster-whisper
Zero telemetria: CamoVoice non raccoglie nulla. Nessuna analisi, nessun rapporto, nessun campione audio
Nessuna dipendenza cloud: dopo l'installazione, l'app funziona senza rete
Impostazioni locali: le preferenze sono in settings.json sul dispositivo
Multilingue: trascrive inglese, spagnolo, tedesco, francese, italiano, portoghese, olandese, polacco, svedese e russo — oppure rileva la lingua automaticamente

Supporto multilingue

CamoVoice International supporta dieci lingue e il rilevamento automatico della lingua.

Scegliere una lingua

Apri Impostazioni e usa il menu a tendina Lingua di ingresso. La selezione ha due effetti:

Trascrizione: la lingua scelta viene passata a faster-whisper per orientare il riconoscimento fonetico, migliorando di solito la precisione rispetto al rilevamento automatico.
Lingua dell'interfaccia: se scegli inglese, spagnolo, tedesco o francese, i pulsanti della finestra principale, la finestra Impostazioni, Parole personalizzate, Salva con nome e i messaggi di stato comuni passano in quella lingua. Le altre lingue vengono trascritte correttamente, ma l'interfaccia resta in inglese.

Scegliendo Rilevamento automatico, faster-whisper analizza i primi trenta secondi dell'audio per determinare la lingua.

Interfaccia

Registrare

Pulsante del microfono: clic per iniziare, clic per fermare.
Tieni premuta la barra spaziatrice: tieni premuto per registrare, rilascia per fermare.
Un breve ritardo di 200 ms evita registrazioni accidentali.

Tasto di dettatura globale

Attiva « Abilita tasto globale (Ctrl+` / ⌃`) » in Impostazioni. Premi Ctrl+` / ⌃` per avviare e fermare; CamoVoice incolla la trascrizione nel punto del cursore.

Caricare file audio

Formati: audio WAV, MP3, M4A, AAC, FLAC, OGG; video MP4, WEBM, MOV, AVI, MKV, M4V, WMV (viene trascritta la traccia audio). Ricampionati automaticamente a 16 kHz mono. Limiti: 90 MB (Veloce) e 45 MB (Preciso).

Lavorare con il testo

Modifica libera: clicca nell'area di testo per correggere o aggiungere
Copia testo: copia tutto negli appunti
▶ Riproduci: legge la trascrizione ad alta voce
Salva con nome: dialogo con opzioni di timestamp (data, timestamp per segmento, includere modifiche)
Annulla cancellazione: ripristina l'ultimo testo cancellato (Ctrl+Z / ⌘Z). Ripeti: Ctrl+Shift+Z / ⇧⌘Z.

Formati di esportazione: TXT, DOCX, PDF.

Modalità di trascrizione

Entrambe le modalità usano lo stesso modello multilingue base (~74 MB, 99 lingue):

Modalità	Velocità	Precisione
Veloce	★★★★★	★★★☆☆
Preciso	★★★☆☆	★★★★★

Limiti di tempo: 15 minuti (Veloce), 7 minuti (Preciso).

Impostazioni

Dispositivo di ingresso: scegli il microfono.
Lingua di ingresso: lingua o Rilevamento automatico.
Voce di riproduzione: voci del sistema operativo.
Mostra timestamp nelle trascrizioni.
Mantieni la finestra sempre in primo piano.
Abilita tasto globale (Ctrl+`).
Parole personalizzate: fino a 100 termini per migliorare il riconoscimento.

Scorciatoie da tastiera

Azione	Windows	macOS
Tenere premuto per registrare	Spazio	Spazio
Dettatura globale	Ctrl+`	⌃`
Salva con nome	Ctrl+S	⌘S
Annulla cancellazione	Ctrl+Z	⌘Z
Ripeti	Ctrl+Shift+Z	⇧⌘Z
Chiudi dialoghi	Esc	Esc

Risoluzione dei problemi

Trascrizione nella lingua sbagliata: Impostazioni → Lingua di ingresso.
"Nessuna voce disponibile": installa voci TTS dal sistema.
Registrazione non parte: controlla il microfono e prova un altro dispositivo.
Trascrizione imprecisa: parla chiaramente, riduci il rumore, prova Preciso.

Impegno per la privacy

✓ Nessuna connessione internet dopo l'installazione
✓ Nessun account
✓ Nessuna analisi o tracciamento
✓ Nessun audio inviato a server
✓ Impostazioni in JSON locale
✓ Modello integrato localmente

Guia do utilizador (Português)

Reconhecimento de voz multilingue, privado e offline

Visão geral

CamoVoice International é uma aplicação de desktop de reconhecimento de voz concebida em torno de um princípio: a sua voz fica no seu dispositivo. Ao contrário dos serviços de transcrição na nuvem, o CamoVoice processa tudo localmente através de um modelo de IA multilingue incluído — sem ligação à internet, sem que os dados saiam do computador, sem telemetria.

Privacidade e arquitetura

100 % offline: toda a transcrição é feita localmente via faster-whisper
Zero telemetria: o CamoVoice não recolhe nada. Sem análises, sem relatórios de falhas, sem amostras de áudio
Sem dependências na nuvem: após a instalação, a aplicação funciona sem rede
Definições locais: as preferências são guardadas num ficheiro settings.json
Multilingue: transcreve inglês, espanhol, alemão, francês, italiano, português, neerlandês, polaco, sueco e russo — ou deteta a língua automaticamente

Suporte multilingue

Abra Configurações e escolha no menu Idioma de entrada. A seleção determina:

Transcrição: a língua escolhida é passada ao faster-whisper para orientar o reconhecimento fonético, normalmente melhorando a precisão face à deteção automática.
Idioma da interface: se escolher inglês, espanhol, alemão ou francês, os botões da janela principal, a janela Configurações, Palavras personalizadas, Guardar como e as mensagens de estado comuns passam para essa língua. As outras línguas são transcritas corretamente, mas a interface mantém-se em inglês.

Com Deteção automática, o faster-whisper analisa os primeiros trinta segundos do áudio.

Interface

Gravar

Botão do microfone: clique para iniciar, clique para parar.
Manter a barra de espaço: manter para gravar, largar para parar.

Tecla de atalho global

Em Configurações ative « Ativar atalho global (Ctrl+` / ⌃`) ». Prima Ctrl+` / ⌃` para começar e parar; o CamoVoice cola a transcrição na posição do cursor.

Carregar ficheiros de áudio

Formatos: áudio WAV, MP3, M4A, AAC, FLAC e OGG; vídeo MP4, WEBM, MOV, AVI, MKV, M4V, WMV (a faixa de áudio é transcrita). Reamostrados automaticamente a 16 kHz mono. Limites: 90 MB (Rápido) e 45 MB (Preciso).

Trabalhar com o texto

Editar livremente: clique na área de texto para corrigir ou acrescentar
Copiar texto: copia tudo para a área de transferência
▶ Reproduzir: lê a transcrição em voz alta
Guardar como: abre o diálogo com opções de data e marcas temporais
Anular limpeza: restaura o último texto apagado (Ctrl+Z / ⌘Z). Refazer: Ctrl+Shift+Z / ⇧⌘Z.

Formatos de exportação: TXT, DOCX, PDF.

Modos de transcrição

Ambos os modos usam o mesmo modelo multilingue base (~74 MB, 99 línguas):

Modo	Velocidade	Precisão
Rápido	★★★★★	★★★☆☆
Preciso	★★★☆☆	★★★★★

Limites: 15 min (Rápido), 7 min (Preciso).

Configurações

Dispositivo de entrada: escolher o microfone.
Idioma de entrada: língua ou Deteção automática.
Voz de reprodução: vozes do sistema operativo.
Mostrar marcas temporais nas transcrições.
Manter a janela sempre visível.
Ativar atalho global (Ctrl+`).
Palavras personalizadas: até 100 termos.

Atalhos de teclado

Ação	Windows	macOS
Manter para gravar	Espaço	Espaço
Ditado global	Ctrl+`	⌃`
Guardar como	Ctrl+S	⌘S
Anular limpeza	Ctrl+Z	⌘Z
Refazer	Ctrl+Shift+Z	⇧⌘Z
Fechar diálogos	Esc	Esc

Resolução de problemas

Transcrição na língua errada: Configurações → Idioma de entrada.
"Nenhuma voz disponível": instale vozes de TTS nas definições do sistema.
A gravação não começa: verifique o microfone, tente outro dispositivo.
Transcrição imprecisa: fale claramente, reduza o ruído, experimente o modo Preciso.

Compromisso de privacidade

✓ Sem ligação à internet após a instalação
✓ Sem contas
✓ Sem análise ou rastreio
✓ Sem áudio enviado a servidores
✓ Definições em JSON local
✓ Modelo incluído localmente

Gebruikershandleiding (Nederlands)

Volledig privé, offline meertalige spraak-naar-tekst

Overzicht

CamoVoice International is een desktopapplicatie voor spraak-naar-tekst met één kernprincipe: uw stem blijft op uw apparaat. In tegenstelling tot clouddiensten verwerkt CamoVoice alles lokaal met een meegeleverd meertalig AI-model — geen internetverbinding vereist, geen data verlaat uw computer, geen telemetrie.

Privacy en architectuur

100 % offline: alle transcriptie gebeurt lokaal via faster-whisper
Nul telemetrie: CamoVoice verzamelt niets
Geen cloud-afhankelijkheden: na installatie werkt de app zonder netwerk
Lokale instellingen: voorkeuren staan in settings.json op uw apparaat
Meertalig: transcribeert Engels, Spaans, Duits, Frans, Italiaans, Portugees, Nederlands, Pools, Zweeds en Russisch — of detecteert de taal automatisch

Meertalige ondersteuning

Open Instellingen en gebruik het vervolgkeuzemenu Invoertaal. Deze keuze bepaalt:

Transcriptie: de gekozen taal wordt doorgegeven aan faster-whisper om de fonetische herkenning te sturen, wat doorgaans de nauwkeurigheid verbetert ten opzichte van automatische detectie.
Interfacetaal: bij Engels, Spaans, Duits of Frans schakelen de knoppen van het hoofdvenster, het venster Instellingen, Eigen woorden, Opslaan als en veelvoorkomende statusmeldingen naar die taal. Andere talen transcriberen correct, maar de interface blijft in het Engels.

Met Automatisch detecteren analyseert faster-whisper de eerste dertig seconden.
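
Ter illustratie volgt een minimale schets in Python van hoe een taalkeuze aan faster-whisper kan worden doorgegeven. De parameter `language` van `WhisperModel.transcribe` bestaat in faster-whisper; de namen `TAALCODES`, `taal_argument` en `transcribeer` en de koppeling van weergavenaam aan taalcode zijn aannames en beschrijven niet de interne werking van CamoVoice.

```python
from typing import Optional

# Aanname: weergavenaam -> tweeletterige taalcode zoals Whisper die verwacht.
TAALCODES = {
    "Engels": "en", "Spaans": "es", "Duits": "de", "Frans": "fr",
    "Italiaans": "it", "Portugees": "pt", "Nederlands": "nl",
    "Pools": "pl", "Zweeds": "sv", "Russisch": "ru",
}


def taal_argument(keuze: str) -> Optional[str]:
    """Geeft de taalcode terug, of None voor automatische detectie."""
    if keuze == "Automatisch detecteren":
        return None  # zonder taalcode detecteert faster-whisper de taal zelf
    return TAALCODES[keuze]


def transcribeer(pad: str, keuze: str, model_pad: str = "base") -> str:
    """Transcribeert een audiobestand; vereist het pakket faster-whisper."""
    # Lokaal geïmporteerd, zodat taal_argument ook zonder faster-whisper werkt.
    from faster_whisper import WhisperModel

    # Aanname: model_pad verwijst naar een lokaal beschikbaar model. Bij alleen
    # een modelnaam kan faster-whisper proberen het model te downloaden.
    model = WhisperModel(model_pad, device="cpu", compute_type="int8")
    segmenten, info = model.transcribe(pad, language=taal_argument(keuze))
    return " ".join(segment.text.strip() for segment in segmenten)
```

Met `taal_argument("Nederlands")` krijgt faster-whisper de code `nl` mee; met "Automatisch detecteren" blijft het argument leeg en bepaalt het model de taal zelf.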

Interface

Opnemen

Microfoonknop: klik om te starten, klik om te stoppen.
Spatiebalk ingedrukt houden: ingedrukt houden om op te nemen, loslaten om te stoppen.

Globale dicteersneltoets

Activeer « Globale sneltoets inschakelen (Ctrl+` / ⌃`) » in Instellingen. Druk op Ctrl+` (⌃` op macOS) om te starten en te stoppen; CamoVoice plakt de transcriptie op de cursorpositie.

Audiobestanden laden

Formaten: audio WAV, MP3, M4A, AAC, FLAC en OGG; video MP4, WEBM, MOV, AVI, MKV, M4V, WMV (het audiospoor wordt getranscribeerd). Automatische herbemonstering naar 16 kHz mono. Limieten: 90 MB (Snel) en 45 MB (Nauwkeurig).

Werken met tekst

Vrij bewerken: klik in het tekstgebied
Tekst kopiëren: kopieert alles naar het klembord
▶ Afspelen: leest de transcriptie voor
Opslaan als: dialoog met tijdstempelopties
Wissen ongedaan maken: herstelt laatst gewiste tekst (Ctrl+Z / ⌘Z). Opnieuw: Ctrl+Shift+Z / ⇧⌘Z.

Exportformaten: TXT, DOCX, PDF.

Transcriptiemodi

Beide modi gebruiken hetzelfde meertalige base-model (~74 MB, 99 talen):

Modus	Snelheid	Nauwkeurigheid
Snel	★★★★★	★★★☆☆
Nauwkeurig	★★★☆☆	★★★★★

Tijdslimieten: 15 minuten (Snel), 7 minuten (Nauwkeurig).

Instellingen

Invoerapparaat: kies de microfoon.
Invoertaal: taal of Automatisch detecteren.
Afspeelstem: systeemstemmen.
Tijdstempels in transcripties tonen.
Venster altijd op de voorgrond houden.
Globale sneltoets inschakelen (Ctrl+`).
Eigen woorden: tot 100 termen.

Sneltoetsen

Actie	Windows	macOS
Vasthouden om op te nemen	Spatie	Spatie
Globaal dicteren	Ctrl+`	⌃`
Opslaan als	Ctrl+S	⌘S
Wissen ongedaan maken	Ctrl+Z	⌘Z
Opnieuw	Ctrl+Shift+Z	⇧⌘Z
Dialogen sluiten	Esc	Esc

Problemen oplossen

Transcriptie in verkeerde taal: Instellingen → Invoertaal.
"Geen stemmen beschikbaar": installeer TTS-stemmen via de systeemspraakinstellingen.
Opname start niet: controleer microfoon, probeer een ander invoerapparaat.
Onnauwkeurige transcriptie: spreek duidelijk, verminder ruis, probeer Nauwkeurig.

Privacybelofte

✓ Geen internet na installatie
✓ Geen accounts
✓ Geen analyses of tracking
✓ Geen audio naar servers
✓ Instellingen lokaal in JSON
✓ Model lokaal meegeleverd

Podręcznik użytkownika (Polski)

W pełni prywatne, wielojęzyczne rozpoznawanie mowy offline

Przegląd

CamoVoice International to aplikacja do rozpoznawania mowy na komputer, zbudowana wokół jednej zasady: Twój głos pozostaje na Twoim urządzeniu. W odróżnieniu od usług chmurowych CamoVoice przetwarza wszystko lokalnie, korzystając z dołączonego wielojęzycznego modelu AI — bez połączenia z internetem, bez wysyłania danych, bez telemetrii.

Prywatność i architektura

100 % offline: cała transkrypcja odbywa się lokalnie przez faster-whisper
Zero telemetrii: CamoVoice nic nie zbiera
Brak zależności od chmury: po instalacji aplikacja działa bez sieci
Ustawienia lokalne: preferencje zapisane w pliku settings.json
Wielojęzyczność: transkrybuje angielski, hiszpański, niemiecki, francuski, włoski, portugalski, niderlandzki, polski, szwedzki i rosyjski — lub wykrywa język automatycznie

Obsługa wielojęzyczna

Otwórz Ustawienia i użyj listy rozwijanej Język wejściowy. Wybór określa:

Transkrypcja: wybrany język trafia do faster-whisper, co zwykle zwiększa dokładność w porównaniu z wykrywaniem automatycznym.
Język interfejsu: wybór angielskiego, hiszpańskiego, niemieckiego lub francuskiego tłumaczy przyciski głównego okna, Ustawienia, Własne słowa, Zapisz jako i typowe komunikaty statusu. Inne języki są poprawnie transkrybowane, ale interfejs pozostaje po angielsku.

W trybie Wykrywanie automatyczne faster-whisper analizuje pierwsze trzydzieści sekund audio.

Interfejs

Nagrywanie

Przycisk mikrofonu: kliknięcie uruchamia, kliknięcie zatrzymuje.
Przytrzymanie spacji: przytrzymaj, aby nagrywać; puść, aby zatrzymać.

Globalny skrót dyktowania

Włącz « Włącz globalny skrót (Ctrl+` / ⌃`) » w Ustawieniach. Naciśnij Ctrl+` (⌃` w systemie macOS), aby rozpocząć i zatrzymać; CamoVoice wkleja transkrypcję w miejscu kursora.

Wczytywanie plików audio

Formaty: audio WAV, MP3, M4A, AAC, FLAC, OGG; wideo MP4, WEBM, MOV, AVI, MKV, M4V, WMV (transkrybowana jest ścieżka dźwiękowa). Automatyczne przepróbkowanie do 16 kHz mono. Limity: 90 MB (Szybki) i 45 MB (Precyzyjny).

Praca z tekstem

Swobodna edycja: kliknij w obszarze tekstu
Kopiuj tekst: kopiuje wszystko do schowka
▶ Odtwórz: czyta transkrypcję na głos
Zapisz jako: okno z opcjami znaczników czasu
Cofnij wyczyszczenie: przywraca ostatnio wyczyszczony tekst (Ctrl+Z / ⌘Z). Ponów: Ctrl+Shift+Z / ⇧⌘Z.

Formaty eksportu: TXT, DOCX, PDF.

Tryby transkrypcji

Oba tryby używają tego samego wielojęzycznego modelu base (~74 MB, 99 języków):

Tryb	Szybkość	Dokładność
Szybki	★★★★★	★★★☆☆
Precyzyjny	★★★☆☆	★★★★★

Limity czasu: 15 min (Szybki), 7 min (Precyzyjny).
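
Poniżej minimalny szkic w Pythonie, który sprawdza, czy nagranie WAV mieści się w powyższych limitach czasu. Nazwy `LIMITY_MIN`, `czas_trwania_wav` i `miesci_sie_w_limicie` są hipotetyczne i nie pochodzą z CamoVoice; szkic obsługuje wyłącznie pliki WAV, ponieważ korzysta tylko ze standardowego modułu `wave`.

```python
import wave

# Limity czasu z tego podręcznika, w minutach dla każdego trybu.
LIMITY_MIN = {"Szybki": 15, "Precyzyjny": 7}


def czas_trwania_wav(sciezka: str) -> float:
    """Zwraca długość pliku WAV w sekundach (liczba ramek / częstotliwość)."""
    with wave.open(sciezka, "rb") as plik:
        return plik.getnframes() / plik.getframerate()


def miesci_sie_w_limicie(sekundy: float, tryb: str) -> bool:
    """Zwraca True, jeśli nagranie nie przekracza limitu czasu danego trybu."""
    return sekundy <= LIMITY_MIN[tryb] * 60
```

Przykładowo nagranie trwające 8 minut (480 s) mieści się w limicie trybu Szybki, ale nie trybu Precyzyjny.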

Ustawienia

Urządzenie wejściowe: wybór mikrofonu.
Język wejściowy: język lub Wykrywanie automatyczne.
Głos odtwarzania: głosy systemu operacyjnego.
Pokaż znaczniki czasu w transkrypcjach.
Trzymaj okno zawsze na wierzchu.
Włącz globalny skrót (Ctrl+`).
Własne słowa: do 100 terminów.

Skróty klawiaturowe

Akcja	Windows	macOS
Przytrzymaj, aby nagrywać	Spacja	Spacja
Globalne dyktowanie	Ctrl+`	⌃`
Zapisz jako	Ctrl+S	⌘S
Cofnij wyczyszczenie	Ctrl+Z	⌘Z
Ponów	Ctrl+Shift+Z	⇧⌘Z
Zamknij okna dialogowe	Esc	Esc

Rozwiązywanie problemów

Transkrypcja w złym języku: Ustawienia → Język wejściowy.
„Brak dostępnych głosów": zainstaluj głosy TTS w ustawieniach systemu.
Nagrywanie nie startuje: sprawdź mikrofon, wypróbuj inne urządzenie.
Niedokładna transkrypcja: mów wyraźnie, zmniejsz hałas, spróbuj trybu Precyzyjnego.

Zobowiązanie prywatności

✓ Brak połączenia z internetem po instalacji
✓ Brak kont
✓ Brak analityki ani śledzenia
✓ Brak dźwięku wysyłanego na serwery
✓ Ustawienia lokalnie w JSON
✓ Model dołączony lokalnie

Användarhandbok (Svenska)

Helt privat, flerspråkig tal-till-text offline

Översikt

CamoVoice International är en skrivbordsapp för tal-till-text byggd kring en grundprincip: din röst stannar på din enhet. Till skillnad från molntjänster bearbetar CamoVoice allt lokalt med en medföljande flerspråkig AI-modell — ingen internetanslutning krävs, inga data lämnar din dator, ingen telemetri.

Integritet och arkitektur

100 % offline: all transkribering sker lokalt via faster-whisper
Ingen telemetri: CamoVoice samlar inte in något
Inga molnberoenden: efter installation fungerar appen utan nätverk
Lokala inställningar: dina preferenser sparas i settings.json
Flerspråkig: transkriberar engelska, spanska, tyska, franska, italienska, portugisiska, nederländska, polska, svenska och ryska — eller detekterar språket automatiskt

Flerspråkigt stöd

Öppna Inställningar och använd rullgardinsmenyn Inmatningsspråk. Valet styr:

Transkribering: det valda språket skickas till faster-whisper för att styra den fonetiska igenkänningen, vilket oftast ger högre träffsäkerhet än automatisk detektering.
Gränssnittsspråk: vid val av engelska, spanska, tyska eller franska översätts huvudfönstrets knappar, Inställningar, Egna ord, Spara som och vanliga statusmeddelanden till det språket. Övriga språk transkriberar korrekt men gränssnittet är på engelska.

Med Automatisk identifiering analyserar faster-whisper de första trettio sekunderna.

Gränssnitt

Spela in

Mikrofonknappen: ett klick för att starta, ett klick för att stoppa.
Håll ner mellanslag: håll för att spela in, släpp för att stoppa.

Global diktatgenväg

Aktivera « Aktivera global genväg (Ctrl+` / ⌃`) » i Inställningar. Tryck på Ctrl+` (⌃` på macOS) för att starta och stoppa; CamoVoice klistrar in transkriberingen vid markörpositionen.

Läsa in ljudfiler

Format: ljud WAV, MP3, M4A, AAC, FLAC och OGG; video MP4, WEBM, MOV, AVI, MKV, M4V, WMV (ljudspåret transkriberas). Automatisk omsampling till 16 kHz mono. Gränser: 90 MB (Snabb) och 45 MB (Noggrann).

Arbeta med texten

Redigera fritt: klicka i textfältet
Kopiera text: kopierar allt till urklipp
▶ Spela upp: läser upp transkriberingen
Spara som: dialog med tidsstämpelalternativ
Ångra rensning: återställer senast rensad text (Ctrl+Z / ⌘Z). Gör om: Ctrl+Shift+Z / ⇧⌘Z.

Exportformat: TXT, DOCX, PDF.

Transkriberingslägen

Båda lägena använder samma flerspråkiga base-modell (~74 MB, 99 språk):

Läge	Hastighet	Noggrannhet
Snabb	★★★★★	★★★☆☆
Noggrann	★★★☆☆	★★★★★

Tidsgränser: 15 min (Snabb), 7 min (Noggrann).

Inställningar

Inmatningsenhet: välj mikrofon.
Inmatningsspråk: språk eller Automatisk identifiering.
Uppspelningsröst: systemröster.
Visa tidsstämplar i transkriberingar.
Håll fönstret alltid överst.
Aktivera global genväg (Ctrl+`).
Egna ord: upp till 100 termer.

Kortkommandon

Åtgärd	Windows	macOS
Håll för inspelning	Mellanslag	Mellanslag
Global diktering	Ctrl+`	⌃`
Spara som	Ctrl+S	⌘S
Ångra rensning	Ctrl+Z	⌘Z
Gör om	Ctrl+Shift+Z	⇧⌘Z
Stäng dialoger	Esc	Esc

Felsökning

Transkribering på fel språk: Inställningar → Inmatningsspråk.
"Inga röster tillgängliga": installera TTS-röster via systemets röstinställningar.
Inspelning startar inte: kontrollera mikrofonen, prova en annan inmatningsenhet.
Oprecis transkribering: tala tydligt, minska bakgrundsbrus, prova Noggrann.

Integritetsåtagande

✓ Ingen internetanslutning krävs efter installation
✓ Inga konton
✓ Ingen analys eller spårning
✓ Inget ljud skickas till servrar
✓ Inställningar lokalt i JSON
✓ Modell medföljer lokalt

Руководство пользователя (Русский)

Полностью приватное, многоязычное распознавание речи без подключения к сети

Обзор

CamoVoice International — это настольное приложение распознавания речи, построенное вокруг одного принципа: ваш голос остаётся на вашем устройстве. В отличие от облачных сервисов, CamoVoice обрабатывает всё локально с помощью встроенной многоязычной ИИ-модели — без интернета, без передачи данных, без телеметрии.

Конфиденциальность и архитектура

100 % офлайн: вся расшифровка выполняется локально через faster-whisper
Без телеметрии: CamoVoice ничего не собирает
Без облачных зависимостей: после установки приложение работает без сети
Локальные настройки: параметры хранятся в файле settings.json
Многоязычность: расшифровывает английский, испанский, немецкий, французский, итальянский, португальский, нидерландский, польский, шведский и русский — или определяет язык автоматически

Поддержка языков

Откройте Настройки и выберите Язык ввода. Выбор определяет:

Расшифровка: выбранный язык передаётся faster-whisper для настройки фонетической модели, что обычно повышает точность по сравнению с автоматическим определением.
Язык интерфейса: при выборе английского, испанского, немецкого или французского кнопки главного окна, окно Настройки, Пользовательские слова, диалог Сохранить как и типичные сообщения статуса переводятся на этот язык. Остальные языки расшифровываются корректно, но интерфейс остаётся на английском.

В режиме Автоопределение faster-whisper анализирует первые тридцать секунд аудио.

Интерфейс

Запись

Кнопка микрофона: клик — начать, клик — остановить.
Удержание пробела: удерживайте, чтобы записывать; отпустите, чтобы остановить.

Глобальная горячая клавиша диктовки

Включите « Включить глобальную горячую клавишу (Ctrl+` / ⌃`) » в Настройках. Нажмите Ctrl+` (⌃` в macOS), чтобы начать и остановить; CamoVoice вставит расшифровку в месте курсора.

Загрузка аудиофайлов

Форматы: аудио WAV, MP3, M4A, AAC, FLAC и OGG; видео MP4, WEBM, MOV, AVI, MKV, M4V, WMV (транскрибируется звуковая дорожка). Автоматическая передискретизация до 16 кГц моно. Лимиты: 90 МБ (Быстрый) и 45 МБ (Точный).

Работа с текстом

Свободное редактирование: щёлкните по текстовой области
Копировать текст: копирует всё в буфер обмена
▶ Воспроизвести: читает расшифровку вслух
Сохранить как: диалог с настройками меток времени
Отменить очистку: восстанавливает последний очищенный текст (Ctrl+Z / ⌘Z). Повтор: Ctrl+Shift+Z / ⇧⌘Z.

Форматы экспорта: TXT, DOCX, PDF.

Режимы расшифровки

Оба режима используют одну и ту же многоязычную модель base (~74 МБ, 99 языков):

Режим	Скорость	Точность
Быстрый	★★★★★	★★★☆☆
Точный	★★★☆☆	★★★★★

Ограничения по времени: 15 мин (Быстрый), 7 мин (Точный).

Настройки

Устройство ввода: выбор микрофона.
Язык ввода: язык или Автоопределение.
Голос воспроизведения: голоса операционной системы.
Показывать метки времени в расшифровках.
Держать окно поверх других.
Включить глобальную горячую клавишу (Ctrl+`).
Пользовательские слова: до 100 терминов.

Горячие клавиши

Действие	Windows	macOS
Удерживать для записи	Пробел	Пробел
Глобальная диктовка	Ctrl+`	⌃`
Сохранить как	Ctrl+S	⌘S
Отменить очистку	Ctrl+Z	⌘Z
Повторить	Ctrl+Shift+Z	⇧⌘Z
Закрыть диалоги	Esc	Esc

Устранение неполадок

Расшифровка на неправильном языке: Настройки → Язык ввода.
«Голоса недоступны»: установите TTS-голоса через настройки речи ОС.
Запись не начинается: проверьте микрофон, попробуйте другое устройство ввода.
Неточная расшифровка: говорите чётко, уменьшите фоновый шум, попробуйте режим Точный.

Обязательство о конфиденциальности

✓ Интернет не требуется после установки
✓ Без учётных записей
✓ Без аналитики и отслеживания
✓ Без отправки аудио на серверы
✓ Настройки локально в JSON
✓ Модель встроена локально

CamoVoice International v1.0.0
