Glossary

Definitions for terms used throughout Video Tap and these docs.

A

AI Content Level: the quality tier of generated content (Good, Better, Best). Higher tiers use stronger models and produce more polished output. Available levels depend on your plan.

Aspect ratio: the shape of a video frame. 16:9 is standard horizontal, 9:16 is vertical (TikTok / Reels), 4:5 is portrait, 1:1 is square (Instagram feed).

B

Blog prompt: the instructions you can write in Settings → Blog Prompt that Video Tap applies to every blog post it generates. Used to enforce house tone, length, structure, and CTAs.

Burn-in captions: captions that are rendered directly into the video file. Visible in any player. Opposite: sidecar captions (SRT / VTT files).

C

Chapter: an auto-detected section of your video, with a title and timestamp. Used for YouTube descriptions and blog-post structure.

Clip: a short segment of a longer video, suitable for short-form platforms (TikTok, Reels, Shorts). Generated automatically or created manually from the transcript.

Clip score: an internal rating of how strong a clip is as a stand-alone piece (hook quality, completeness, clarity). Used to rank auto-generated clips.

F

Fit / Fill / Split: the three layout options for a scene when converting between aspect ratios (set in the Scene Reframing Editor). Fit keeps the full source visible with black bars filling the gap. Fill crops the source to fill the frame with no bars. Split pins a manual crop region. See Reframing.
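
The difference between Fit and Fill comes down to which scale factor is used when the source is resized to the target frame. The sketch below illustrates that geometry only; it is not Video Tap's implementation, and the function name and parameters are hypothetical.

```python
def scale_to_frame(src_w: int, src_h: int, dst_w: int, dst_h: int, mode: str) -> tuple[int, int]:
    """Return the scaled source size (width, height) for a Fit or Fill layout.

    Hypothetical helper for illustration; not part of Video Tap.
    """
    scale_w = dst_w / src_w
    scale_h = dst_h / src_h
    if mode == "fit":
        # Smaller factor: the whole source stays visible, bars fill the gap.
        scale = min(scale_w, scale_h)
    elif mode == "fill":
        # Larger factor: the source covers the frame, the overflow is cropped.
        scale = max(scale_w, scale_h)
    else:
        raise ValueError(f"unsupported mode: {mode!r}")
    return round(src_w * scale), round(src_h * scale)


# A 16:9 source (1920x1080) placed in a 9:16 frame (1080x1920).
fit_size = scale_to_frame(1920, 1080, 1080, 1920, "fit")
fill_size = scale_to_frame(1920, 1080, 1080, 1920, "fill")
```

With Fit, the scaled source is no larger than the frame in either dimension, so bars appear along the shorter side. With Fill, it is at least as large as the frame in both dimensions, so part of the picture falls outside the frame and is cropped.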

Folder: a way to organize videos in your dashboard. Each upload can be assigned to a folder.

M

Minutes: your monthly processing quota. Based on video duration, not processing time. A 10-minute video uses 10 minutes of quota.
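
As a rough illustration of the rule above, quota usage can be estimated by summing video durations. This sketch assumes partial minutes are counted fractionally, which the definition does not confirm; the function name is hypothetical.

```python
def quota_minutes_used(video_durations_seconds: list[float]) -> float:
    """Estimate quota usage from video durations alone.

    Assumption: partial minutes count fractionally (no rounding up).
    Processing time plays no part in the calculation.
    """
    return sum(video_durations_seconds) / 60


# One 10-minute video (600 seconds) uses 10 minutes of quota.
used = quota_minutes_used([600])
```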

P

Processing: the full pipeline, from upload through transcription to content generation. The current status is shown on each video card.

R

Reframing: converting a video between aspect ratios (e.g. horizontal podcast to vertical TikTok). See Reframing.

Rendering: the final step that bakes your selected aspect ratio, captions, and styling into a downloadable MP4.

S

Scene: an auto-detected segment within a clip. Scenes are the unit for per-scene layouts (Fit / Fill / Split) in the Scene Reframing Editor.

Scene Reframing Editor: the per-scene panel where you set Fit / Fill / Split and (optionally) drag a manual crop region for one scene.

Smart Reframe: AI-driven reframing that analyzes each scene and applies a Fill crop automatically. Triggered from the Switch to 9:16 or 1:1 button in the Reframe sidebar.

SRT / VTT: sidecar caption file formats. SRT works almost everywhere; VTT is preferred for HTML5 video players.
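
One visible difference between the two formats is the timestamp: SRT separates milliseconds with a comma, while VTT uses a period (a VTT file also begins with a WEBVTT header line). A minimal sketch of that difference, using a hypothetical helper name:

```python
def format_timestamp(seconds: float, fmt: str) -> str:
    """Format a time offset as an SRT or VTT cue timestamp (HH:MM:SS + milliseconds)."""
    if fmt not in ("srt", "vtt"):
        raise ValueError(f"unsupported format: {fmt!r}")
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    # SRT uses a comma before the milliseconds; VTT uses a period.
    sep = "," if fmt == "srt" else "."
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{sep}{ms:03d}"


print(format_timestamp(62.5, "srt"))  # 00:01:02,500
print(format_timestamp(62.5, "vtt"))  # 00:01:02.500
```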

Spelling correction: a word-replacement rule (e.g. iphone → iPhone) applied to every transcription. Different from vocabulary, which only tells the engine to listen for a word.
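
Conceptually, a spelling correction behaves like a whole-word find-and-replace over the finished transcript. The sketch below shows that idea under stated assumptions: matching is case-insensitive and limited to whole words, neither of which is confirmed for Video Tap, and the function name is hypothetical.

```python
import re


def apply_corrections(text: str, rules: dict[str, str]) -> str:
    """Apply word-replacement rules to transcript text.

    Assumptions (not confirmed by Video Tap): matches are whole-word
    and case-insensitive.
    """
    for wrong, right in rules.items():
        pattern = re.compile(rf"\b{re.escape(wrong)}\b", re.IGNORECASE)
        # A function replacement inserts `right` literally, without
        # interpreting backslashes as regex escapes.
        text = pattern.sub(lambda _match, r=right: r, text)
    return text


print(apply_corrections("my iphone and my IPHONE case", {"iphone": "iPhone"}))
# my iPhone and my iPhone case
```

A vocabulary entry, by contrast, would not rewrite any text; it only makes the engine more likely to recognize the word in the first place.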

T

Transcription: the text version of your video’s audio, with word-level timestamps. The foundation for all other content types.

V

Vocabulary: custom words you can add in your workspace settings to help the transcription engine catch tricky terms. See Custom vocabulary.

W

Workspace: your account container. Holds videos, team members, settings, and billing. Each workspace is independent.