/scenefixerBeta

Technical guide

How Scene Fixer works

Scene Fixer is the continuity supervisor for AI video. It splits your sequence into shots, uses Claude Opus 4.8 to catch every inconsistency — wardrobe, lighting, props, eyeline — then inpaints each fix frame-by-frame with Runway Aleph. Works on output from any AI video model. Here's exactly what happens under the hood.

1

Upload your video

Drag and drop a video file — MP4, MOV, or most common formats. The file is uploaded directly to Firebase Storage over a signed URL, so it never passes through our web servers. Maximum file size is 200MB.

If you already know what the error is, describe it in the hint field above the drop zone before uploading — e.g. “coffee cup on the table in front of the queen”. The hint is passed to Claude alongside the keyframes and steers detection toward the region you flagged.

No account required to try it. First-time visitors get one free beta fix — tracked with an anonymous UUID stored in your browser, no email needed.

2

Shot decomposition

PySceneDetectFFmpeg

The video is split into individual shots using PySceneDetect, which detects cuts by analyzing frame-level histogram differences. For each shot, FFmpeg extracts 5 evenly-spaced keyframes — these become the visual input for detection.

Why keyframes? Continuity errors live in individual frames. Sending full video to a vision model would be 100× more expensive for the same detection quality.
Scene Fixer splitting video into shots — pipeline status showing step 2 active
3

Continuity error detection

Claude Opus 4.8Vision

Every adjacent shot pair is sent to Claude Opus 4.8, Anthropic's most capable vision model. It receives the last keyframe of shot A and the first keyframe of shot B and identifies continuity errors across six categories: props, wardrobe, hair & makeup, lighting, set dressing, and eyeline.

For each inconsistency, Claude returns a structured JSON object with the error type, severity (low / medium / high), a human-readable description of what doesn't match and roughly where it is, and a fix suggestion. It does not return precise pixel coordinates — that's the next step. In the review UI you confirm the exact region yourself (or adjust it in the Adjust location modal), which is what actually guides Aleph.

Example output:
{
  "type": "prop",
  "severity": "high",
  "description": "Modern disposable coffee cup visible on the table in front of the character in Shot B.",
  "fix_suggestion": "Remove the coffee cup from the table.",
  "object_query": "modern coffee cup",
  "fix_target_shot": "B"
}
Scene Fixer running Claude Opus 4.8 detection — pipeline at step 3
4

You review and confirm

Before anything is fixed, every detected error is shown side-by-side with its two keyframes. A red bounding box highlights the exact region. You choose for each one: Remove (erase and fill in the background) or Replace with… (swap for something you describe).

If the auto-detection bounding box isn't quite right, you can open the Adjust location modal — pick the keyframe where the error is clearest and drag to draw a precise box. You can also manually mark errors the AI missed using + Add another fix.

Scene Fixer error review — Remove and Replace with options, red bounding box on keyframe
Adjust location modal — drawing a tight bounding box around the continuity error
Error confirmed — Will remove and fill in the background, Fix 1 error button
5

AI inpainting with Runway Gen-4 Aleph

Runway Gen-4 AlephVideo inpainting

For each confirmed error, the pipeline extracts the clip segment around that shot, composites a red mask over the flagged region, and sends it to Runway Gen-4 Aleph. The text prompt is generated automatically from your Remove / Replace instruction.

Aleph is video-native — it understands camera motion, lighting, and temporal consistency across frames. It fills the masked region across the full clip duration while keeping surrounding content identical. Output is 1280×720 at 24fps.

The fix runs as a background job — you can safely close your browser and come back later. Each Runway call takes 30–90 seconds. Up to 8 errors are processed per session.

Scene Fixer fixing with Runway Aleph — background job notice, fixing status active
6

Verification

Claude Opus 4.8Vision

After inpainting, each fixed clip is passed back to Claude Opus 4.8 for independent verification. It compares the original and fixed frames and returns one of three verdicts: Verified fixed, Error still visible, or Inconclusive — plus a confidence rating and a short explanation.

This isn't a formality. Runway occasionally misses faint or textured objects — the verification step catches those cases so you can decide whether to retry or accept. The full result is visible on your job page, including the marker frame that was sent to Aleph.

Fixed error detail — Before/After clips, marker frame sent to Aleph, Verified fixed result
7

Stitch, scale & download

FFmpeg

FFmpeg stitches the fixed clips back into the original video timeline — only replacing the modified shots, leaving everything else frame-identical to your source. Audio is preserved throughout.

The output is then scaled to your plan's quality using bicubic resampling:

PlanOutputWatermark
Free480p (downscaled)"Fixed with Scene Fixer"
Starter720p (native Aleph output)None
Pro / Studio1080p (bicubic upscale)None
Pay-per-fix720pNone
Scene Fixer download ready — 720p badge, Download button, Before/After diff player

Full pipeline at a glance

UploadFirebase Storage (signed URL, direct from browser)
Shot detectionPySceneDetect → 5 keyframes / shot
Error detectionClaude Opus 4.8 vision — flags what's inconsistent and describes where
User reviewConfirm each flag, mark the exact region, or add missed errors manually
InpaintingRunway Gen-4 Aleph (per confirmed error, sequential)
VerificationClaude Opus 4.8 vision (per fixed clip)
Stitch + scaleFFmpeg → 480p / 720p / 1080p depending on plan

The whole pipeline runs in the cloud — no software to install, no GPU required.

Try it free →