Accessibility
Reach viewers watching with the sound off, in noisy environments, or who are deaf or hard of hearing. Captions make your content easy to understand.
AI Auto Captions
KineMaster automatically transcribes spoken audio from your video or audio track into editable caption layers. All processing runs locally on your phone or tablet — nothing is uploaded. Works offline.
What it does
Auto Captions uses on-device AI to transcribe the singing or narration in your video or audio track and creates synchronized caption layers on your timeline. Caption layers are batch-editable, formattable, and movable.
Download the language pack that matches your audio and generate captions entirely on-device.
Each language asset is approximately 400–450 MB. Can be used offline after downloading.
How to use
Five steps from raw footage to synchronized caption layers.
Open your KineMaster project. To caption the entire project, tap the AI button on the horizontal toolbar. To caption a single track, select the video or audio track that contains the spoken audio.
Tap Auto Captions in the Tool Panel.
Select the language that matches your audio and tap Download.
~400–450 MB per languageAI processes your audio entirely on your device. Nothing is uploaded. Processing speed depends on device performance. Requires at least 4GB of RAM on device.
Caption layers appear on your timeline as individual text layers. Tap any caption to edit text, font, color, size, or position.
Use cases
From accessibility to Reels retention, captions can open your content to more viewers.
Reach viewers watching with the sound off, in noisy environments, or who are deaf or hard of hearing. Captions make your content easy to understand.
Styled captions hold attention through slow moments and boost watch-time on short-form platforms where most users scroll with content on mute.
FAQ
Quick answers about pricing, offline use, language assets, device compatibility, and caption styling.
Yes. Auto Captions is free for all KineMaster users. There is no subscription required.
36 languages, including English, Korean, Hindi, Portuguese, Spanish, Arabic, Indonesian, Thai, Japanese, French, German, and more. Each language is downloaded as a separate asset.
Yes. After downloading the language asset, Auto Captions run 100% on-device. No internet connection is required during generation, and your media is never uploaded anywhere.
Approximately 400 to 450 MB per language. Download each asset once; after that, you can use it every time you caption content in that language.
Exporting captions as a separate .srt file is not supported. Captions are burned into the final, saved video. However, you can import an existing .srt file into KineMaster.
Yes! Caption layers can be batch edited to change the font, color, size, and position.
Auto Captions is optimized for spoken language. It may produce errors when generating subtitles for songs. Caption layers are easy to edit with the Text Input tool after generation, however.
Auto Captions works on all iOS devices running KineMaster and Android devices with 4 GB or more of RAM. It has been available since KineMaster version 6.4.
Auto Captions rely on your phone or tablet's processor to generate. Longer clips and older devices take more time.
Caption layer tools like Transform, Color, Glow, and Drop Shadow apply to all caption layers at once.
Related features
Combine Auto Captions with other KineMaster features to take your editing to the next level.
Download KineMaster free on iOS and Android.