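Global Video Content Localisation
A content-processing system designed for repeatable, high-throughput workflows. It removes hardcoded subtitles from video, converts PPT or PDF slides into narrated videos, and uses Refine Skills to keep subtitles accurate and consistent at scale.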

What is EchoSubs?
A desktop AI app for subtitle removal, captions, and slide-to-video — all offline.
EchoSubs is a desktop AI application for macOS and Windows that removes hardcoded subtitles from videos using local AI inpainting, generates accurate captions in 60+ languages, and converts PPT/PDF presentations into narrated videos — all running 100% offline on your machine.
- Platforms: macOS 12+, Windows 10+
- Processing: 100% offline, no upload
- Pricing: From $5.99/mo or $69 lifetime
- Trial: Free with watermark
- Languages: 6 UI / 60+ subtitle
- Licence: One-time lifetime available
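As a rough guide to choosing between the two prices quoted above, the break-even point can be worked out directly. This is a minimal sketch: it assumes the lowest advertised monthly price of $5.99 applies for the whole period and ignores taxes, discounts, and price changes. The function name `break_even_months` is illustrative only.

```python
import math

MONTHLY_USD = 5.99    # lowest advertised monthly price ("From $5.99/mo")
LIFETIME_USD = 69.00  # one-time lifetime price


def break_even_months(monthly: float, lifetime: float) -> int:
    """Return the first whole month at which total subscription
    spend reaches or exceeds the one-time lifetime price."""
    return math.ceil(lifetime / monthly)


months = break_even_months(MONTHLY_USD, LIFETIME_USD)
print(months)  # 12
```

Under these assumptions, the lifetime licence costs less than subscribing once you expect to use the app for about a year or longer.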
Best for
- Teams handling NDA-protected or client-confidential video content
- Creators batch-processing dozens to hundreds of videos
- Educators converting slide decks into narrated video lessons
- Localisation workflows that need multi-language subtitle generation
- Organisations under GDPR or data-residency constraints that rule out cloud SaaS
Not ideal for
- One-off subtitle removal where a free online tool is sufficient
- Workflows that need a real-time cloud API — EchoSubs is a desktop app, not a SaaS endpoint
- Live transcription — EchoSubs is built for post-production
Security & Privacy
Your videos never leave your machine.
EchoSubs is a desktop application — not a SaaS. Every operation, from subtitle detection to AI inpainting to PPT-to-video rendering, executes on your local CPU/GPU. There is no cloud upload path at all, even as an opt-in. That makes EchoSubs the only realistic option for NDA-protected, client-confidential, or regulated video work.
100% offline processing
Every frame of inpainting, every audio transcription, every TTS render runs on your CPU/GPU. Internet is only used to validate your licence.
Zero cloud upload
Source files never leave your filesystem. There is no S3 bucket, no temp-upload endpoint, no opt-in cloud mode — by design.
No telemetry on content
EchoSubs does not log filenames, durations, frame contents, audio tracks, or transcripts. Anonymous app metrics (crash, version) can be disabled.
GDPR-friendly by architecture
Because videos and subtitles never traverse our servers, EchoSubs has no controller/processor role over your content. Self-contained workflow, no DPA needed.
Alternatives & Comparison
How EchoSubs compares to Veed.io, CapCut, and GiliSoft.
A quick scan of where EchoSubs wins, where it ties, and where another tool may serve you better. We try to be honest about the trade-offs — there are workflows where a free online tool genuinely is the right call.
| Capability | EchoSubs | Veed.io | CapCut | GiliSoft |
|---|---|---|---|---|
| Runs 100% offline | | | | |
| AI inpainting (no blur) | | | | |
| Lossless passthrough encoding | | | | |
| PPT/PDF → narrated video | | | | |
| Batch (folder) processing | | | | |
| One-time purchase option | | | | |
| No watermark on free trial | | | | |
Pick EchoSubs if…
You handle NDA-protected, client-confidential, or regulated video, run batches of 10+ files at a time, or need PPT/PDF → narrated MP4 in the same tool.
Pick Veed.io or CapCut if…
You need to clean up one or two short videos a year, the content isn't sensitive, and you'd rather not install desktop software.
Pick GiliSoft if…
You're on Windows-only and already own a GiliSoft licence — quality is decent but it re-encodes every video, costing more time and quality than EchoSubs' lossless pipeline.
Proof
Real videos, processed locally, on real hardware.
EchoSubs ships a working product, not a waitlist. Below are the only numbers we publish — we'd rather under-state and back the claim with the actual binary than oversell.
Measured on the internal benchmark set of 200 mixed-language clips.
On Apple M2 / RTX 3060-class hardware; CPU-only is 3–4× slower.
No upload, no queue, no re-encode — measured against Veed and Vmake on the same 5-minute 1080p clip.
We publish customer case studies only with written, on-the-record permission. If EchoSubs has saved your team meaningful hours and you'd be open to a short quote, please get in touch — we'd love to credit you here.
Getting Started
Get your first cleaned video in under 5 minutes.
No account, no cloud upload, no waiting in a queue. Install once and you're processing on your own hardware. Here's the entire flow:
1. Download EchoSubs
   macOS .dmg or Windows installer. No account required for the trial.
   Time: 60 sec
2. Drag your video into the app
   Drop a single file or an entire folder. Supports MP4, MKV, MOV, AVI, WebM, FLV, and more.
   Time: 30 sec
3. Let AI suggest detection parameters
   EchoSubs auto-detects the subtitle region, guesses the spoken language, and recommends an inpainting model. Accept the defaults or tweak any of them.
   Time: ~5 sec, automatic
4. Click Remove (or Generate, or Convert)
   Processing runs on your local GPU. A 5-minute 1080p video takes about 2–3 minutes on M2 / RTX 3060-class hardware.
   Time: 3–5 min
Time to first result: ~5 minutes. You'll see the cleaned video appear in the same folder as the original. No account needed for the trial.
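To plan a larger batch, the figures quoted on this page (about 2–3 minutes of processing per 5 minutes of 1080p video on M2 / RTX 3060-class hardware, and 3–4× slower on CPU only) can be turned into a rough estimate. The sketch below assumes processing time scales linearly with video length, which is an assumption and not something the page states. `estimate_minutes` is a hypothetical helper, not part of EchoSubs.

```python
def estimate_minutes(video_minutes: float, cpu_only: bool = False) -> tuple[float, float]:
    """Return a (low, high) processing-time range in minutes.

    Based on the quoted 2-3 minutes per 5 minutes of 1080p video,
    assuming linear scaling with duration (an assumption).
    """
    low = video_minutes * 2 / 5
    high = video_minutes * 3 / 5
    if cpu_only:
        # The quoted CPU-only penalty is 3-4x, applied to each end of the range.
        low, high = low * 3, high * 4
    return low, high


print(estimate_minutes(5))                 # (2.0, 3.0)
print(estimate_minutes(60))                # (24.0, 36.0)
print(estimate_minutes(5, cpu_only=True))  # (6.0, 12.0)
```

Treat the result as a planning range only. Real timings depend on resolution, the chosen inpainting model, and how much of each frame the subtitles cover.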
Frequently asked
The questions buyers ask before they commit.
Security, price, migration, integrations, onboarding time, trial limits, offline capability, hardware. Direct answers — no jargon, no upsell.
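Fine-grained subtitle correction (Skills)
Go beyond a single transcription pass. Refine Skills applies structured post-processing to improve timing alignment, sentence boundaries, and consistency, keeping subtitle quality high across long-form video.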
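To make "structured post-processing" concrete, here is a minimal sketch of two generic subtitle clean-up passes: trimming overlapping cues, and merging cues that split a sentence. It illustrates the general technique only. It is not EchoSubs' implementation, and the `Cue` type, the function names, and the 0.5-second gap threshold are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    start: float  # seconds
    end: float    # seconds
    text: str


SENTENCE_END = (".", "!", "?", "。", "!", "?")


def clamp_overlaps(cues: list[Cue]) -> list[Cue]:
    """Trim each cue so it ends no later than the next cue starts."""
    fixed = []
    for i, cue in enumerate(cues):
        end = cue.end
        if i + 1 < len(cues):
            end = min(end, cues[i + 1].start)
        fixed.append(Cue(cue.start, end, cue.text))
    return fixed


def merge_split_sentences(cues: list[Cue], max_gap: float = 0.5) -> list[Cue]:
    """Join a cue onto the previous one when the previous cue does not
    end a sentence and the pause between them is at most max_gap seconds."""
    merged: list[Cue] = []
    for cue in cues:
        prev = merged[-1] if merged else None
        if (prev is not None
                and not prev.text.rstrip().endswith(SENTENCE_END)
                and cue.start - prev.end <= max_gap):
            merged[-1] = Cue(prev.start, cue.end,
                             prev.text.rstrip() + " " + cue.text.lstrip())
        else:
            merged.append(cue)
    return merged


raw = [
    Cue(0.0, 2.2, "Welcome to the"),
    Cue(2.0, 4.0, "first lesson."),
    Cue(4.5, 6.0, "Let's begin."),
]
refined = merge_split_sentences(clamp_overlaps(raw))
for cue in refined:
    print(f"{cue.start:.1f} -> {cue.end:.1f}  {cue.text}")
```

The overlap pass runs first so that the merge pass measures gaps on corrected timings. A production pipeline would also need to cap the length of merged cues so they stay readable.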

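Plans tailored to every creator
Free
- ✓ 30 minutes of subtitle time per month
- ✓ Basic background themes and voice library
- ✗ Exports carry a watermark
- ✗ No AI translation or subtitle refinement
- ✗ No AI processing
- ✗ No hardcoded-subtitle removal
Standard
- ✓ 5 hours of subtitle time per month
- ✓ Basic background themes and voice library
- ✓ Watermark-free export
- ✓ Hardcoded-subtitle removal
- ✗ No AI translation or subtitle refinement
- ✗ No AI processing
Pro Creator
- ✓ 30 hours of subtitle time per month
- ✓ More background themes and voices
- ✓ Watermark-free export
- ✓ Hardcoded-subtitle removal
- ✓ Includes 3 hours of AI translation/subtitle refinement
- ✓ 3 hours of AI processing per month
Enterprise
- ✓ Unlimited subtitle time
- ✓ More background themes and voices
- ✓ Watermark-free export
- ✓ All features unlocked
- ✓ Includes 15 hours of AI translation/subtitle refinement
- ✓ 15 hours of AI processing per month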
🪙 Token Packs
Pay-as-you-go packs of AI processing time. Use them to add extra AI generation time to your plan.
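The plan quotas above can be written down as a small lookup that picks the smallest tier covering a given monthly workload. This is a sketch under stated assumptions: the `Plan` type and `smallest_plan` function are illustrative names, the tier names are English renderings, the AI allowance is modelled as a single monthly figure, and Enterprise is assumed to include hardcoded-subtitle removal because it lists all features as unlocked.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    subtitle_hours: float   # monthly subtitle quota; inf means unlimited
    ai_hours: float         # monthly AI processing allowance
    hard_sub_removal: bool
    watermark_free: bool


# Ordered from smallest to largest tier.
PLANS = [
    Plan("Free", 0.5, 0, False, False),
    Plan("Standard", 5, 0, True, True),
    Plan("Pro Creator", 30, 3, True, True),
    Plan("Enterprise", float("inf"), 15, True, True),
]


def smallest_plan(subtitle_hours: float, ai_hours: float = 0.0,
                  needs_removal: bool = False) -> Optional[Plan]:
    """Return the first tier whose quotas cover the workload,
    or None if no tier's included AI hours are enough."""
    for plan in PLANS:
        if (plan.subtitle_hours >= subtitle_hours
                and plan.ai_hours >= ai_hours
                and (plan.hard_sub_removal or not needs_removal)):
            return plan
    return None


print(smallest_plan(4, needs_removal=True).name)  # Standard
print(smallest_plan(10, ai_hours=2).name)         # Pro Creator
print(smallest_plan(100, ai_hours=20))            # None
```

A result of `None` means the workload needs more AI hours than any tier includes, which is the case the Token Packs are meant to cover.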
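Customer feedback
For teams that need to finish video work faster
The feedback below comes from creators and teams who process courses, training material, and client videos every day.
I mainly use EchoSubs on older videos that already have hardcoded subtitles. I used to re-edit them from scratch; now I clean up the picture first and then build the new subtitle version.
Maya Chen
YouTube education creator
Hardcoded-subtitle cleanup for course clips
We had a lot of internal training decks sitting in folders. Now we can turn them into narrated videos on the desktop, without uploading the material to another website.
Daniel Brooks
Learning Operations Manager
Offline PPT-to-video workflow
The most noticeable change is batch processing. We queue client videos in the evening, check the results the next morning, and spend time only on the clips that need manual work.
Sofia Ramirez
Localisation Producer
Batch subtitle processing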
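Our vision
Local-first processing
Serving global content
What sets EchoSubs apart
Local execution architecture
All processing runs on your local machine. No cloud upload, no queueing delay, no network dependency. Content control and operational stability are built into the architecture.
Refine Skills precision refinement
Subtitles are not a one-shot output. Refine Skills applies structured post-processing to fix timing drift, sentence boundaries, and contextual errors, which matters most for long videos.
Throughput first, not timeline editing
Not a traditional video editor. EchoSubs handles presentation-to-video, subtitle removal, and localisation as a structured pipeline, so hours of video are often finished in minutes.
Enterprise-grade reliability
Deterministic, repeatable output. GPU-accelerated processing with consistent quality across runs, built for educators, media teams, and international operations.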
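We're recruiting partners
We are looking for global resellers, distribution channels, content studios, and integration partners to grow EchoSubs' worldwide market together.
If your business serves creators, educational institutions, or enterprises, we'd welcome working with you.
Contact us
Interested in enterprise plans, partnerships, or investment?
contact@echosubs.com
Seattle, Washington, USA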


