Correct for different camera viewpoints without needing manual calibration.
This file is used to prove that the architecture can: 21206mp4
A visual "heatmap" or mask overlaying the video, showing that the AI successfully located the change requested in the text. Technical Significance 21206mp4