Kapwing
Kapwing for creators: text on video, captions, and the social video toolset 2026
How creators use Kapwing to put text on video, add captions, and where "text to video" fits among its text and caption tools.
Kapwing and the meme / social video market
Kapwing is a browser-based video editor focused on memes, social formats, and quick collaborative editing. Its strength is the friction-free workflow — log in, drag in a video, add text or captions, export, share — with minimal learning curve. For users searching "kapwing text to video," "putting text on video," "writing text on video," "text on video," "text over video," "adding text to video," "adding text to video online," or "adding text to video iphone," Kapwing is one of the most-recommended starting points.
Kapwing distinguishes itself from Veed and Descript by leaning into collaboration (multiple editors on the same project), a large meme template library, and a "good enough" approach to advanced features that keeps the UX simple. For transcription specifically, Kapwing has competent auto-subtitles; for text overlays, it has hundreds of preset styles and animations.
Kapwing text overlays — the core feature
For "text on video" / "text over video" / "writing text on video" / "putting text on video," the Kapwing workflow is: upload video → click the Text tool → choose a preset style or custom text → drag onto the timeline at the right moment → adjust font, size, color, animation. Kapwing has hundreds of preset text styles including memes, captions, lower-thirds, scrolling marquees, and animated titles. Position text anywhere on the frame; animate in / out; adjust per-segment timing.
1. Upload your video to kapwing.com.
2. Click the Text tool in the left panel.
3. Choose a preset style (Meme, Title, Lower-third, Caption, etc.) or click Plain Text.
4. Drag the text element onto the timeline at the moment it should appear.
5. Customise font, size, color, alignment, and animation in / out.
6. Repeat for additional text segments.
7. Export the video.
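For readers who think in data rather than timelines, the workflow above can be pictured as a list of text elements, each with its own start time, end time, and style. The sketch below is purely illustrative: `TextOverlay` and `visible_at` are hypothetical names invented here, not part of Kapwing or any Kapwing API.

```python
from dataclasses import dataclass


@dataclass
class TextOverlay:
    """One text element on the timeline (hypothetical model, not Kapwing's)."""
    text: str
    start: float  # seconds at which the text appears
    end: float    # seconds at which the text disappears
    style: str = "Plain Text"


def visible_at(overlays, t):
    """Return the text of every overlay shown at time t (start inclusive, end exclusive)."""
    return [o.text for o in overlays if o.start <= t < o.end]


overlays = [
    TextOverlay("Intro", 0.0, 3.0, style="Title"),
    TextOverlay("Subscribe", 2.0, 5.0, style="Lower-third"),
]
print(visible_at(overlays, 2.5))  # both overlays overlap at 2.5 s
```

Each pass through steps 3 to 5 adds one more entry to that list, which is why per-segment timing can be adjusted independently.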
Kapwing auto-captions for spoken-word video
The phrase "kapwing text to video" is ambiguous. Most users mean either (a) generating captions from spoken audio in a video (transcription) or (b) generating a full video from a written prompt (AI text-to-video generation). Kapwing primarily does (a): auto-captions from spoken video. For (b), Kapwing has been adding AI generation features, but its primary positioning is editor plus captions, not full text-to-video generation.
For auto-captions: Subtitles tab → Auto Subtitles → pick language → Generate. Captions appear on the timeline, where you can edit them inline and style them with the inspector. Export with burnt-in captions or download an .srt file. Quality is comparable to other browser editors: fine for social-format work, but less suited than dedicated transcription tools to multi-speaker content.
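If you have not opened an .srt file before, it is plain text: a running index, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time range, the caption text, and a blank line between cues. The sketch below shows that layout in general terms. The `Cue`, `srt_timestamp`, and `to_srt` names are assumptions made for illustration; this is not Kapwing's exporter.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One caption cue (hypothetical helper type)."""
    start: float  # seconds
    end: float    # seconds
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm form used in .srt files."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues) -> str:
    """Join cues into .srt text: index, time range, caption, blank line between cues."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        time_range = f"{srt_timestamp(cue.start)} --> {srt_timestamp(cue.end)}"
        blocks.append(f"{index}\n{time_range}\n{cue.text}\n")
    return "\n".join(blocks)


print(to_srt([Cue(0.0, 1.5, "Welcome back."), Cue(1.5, 4.0, "Today: text on video.")]))
```

Because the format is this simple, a downloaded .srt can be corrected in any text editor before being uploaded elsewhere.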
Adding text to video on iPhone
"Adding text to video iphone" / "adding text to iphone video" specifically asks about adding text overlays to videos on an iPhone. Two paths: (1) Use the iPhone Photos app — open a video → tap Edit → choose the markup tool → add text. This is good for single static text overlays on short clips. (2) Use a dedicated mobile editor — iMovie (free, Apple), CapCut (free, ByteDance), Splice (free), VLLO, or Kapwing's mobile app. For more than one text overlay or any animation, a dedicated editor is better.
For users who want to add text to iPhone videos through a web browser rather than installing an app, Kapwing's web editor works in Safari on iPhone: open kapwing.com, upload via the browser, edit, and export. It is slightly clunkier than the desktop experience but functional.
Closing: Kapwing for fast social video text
For social-format creators who need to add text or captions to videos quickly without installing anything, Kapwing is one of the strongest no-install options. For higher-stakes transcription with named speaker diarization, dedicated transcription tools are better. For dedicated text-to-video AI generation, Pictory, Synthesia, or Runway are more focused.