Multicam tracks are aligned, but playback not in sync


The two multicam tracks are visually aligned (the audio waveform looks aligned) but on playback it is not

This is coming from the fcpxml import

What might be the issue?

This is the fcpxml start (just the resources tag where the multicam is declared)


<resources>

<media id="r0-multicam" name="Untitled Multicam Clip">

<multicam format="rf-28b546c4-c779-4c6a-839c-962d13cd13d2" tcStart="0s">

<mc-angle

angleID="0"

name="Angle 1">

<asset-clip

ref="r28b546c4-c779-4c6a-839c-962d13cd13d2"

offset="0/24000s"

duration="8480472/24000s"

start="0/360000s"

audioRole="dialogue"

>

<adjust-transform scale="1 1" anchor="0 0" position="0 0" />

</asset-clip>


<asset-clip

ref="r4f90e4cb-57bf-4780-986a-270de21c1f77"

offset="8480472/24000s"

duration="8288280/24000s"

start="0/360000s"

audioRole="dialogue"

>

<adjust-transform scale="1 1" anchor="0 0" position="0 0" />

</asset-clip>

</mc-angle>


<mc-angle

angleID="1"

name="Angle 2">

<asset-clip

ref="ra3d4fdb9-4ac3-4742-8b23-21680dedbeb9"

offset="162162/24000s"

duration="16469760/24000s"

start="0/48000s"

audioRole="dialogue"

>

<adjust-transform scale="1 1" anchor="0 0" position="0 0" />

</asset-clip>

</mc-angle>

</multicam>

</media>

<asset

id="r28b546c4-c779-4c6a-839c-962d13cd13d2"

start="0/360000s"

duration="127207080/360000s"

hasVideo="1"

hasAudio="1"

format="rf-28b546c4-c779-4c6a-839c-962d13cd13d2"

audioSources="1"

audioChannels="2"

audioRate="48000"

>

<media-rep kind="original-media" src="file:///Users/yonatanbd/Downloads/P1000152-001.MP4"/>

</asset>

<format

id="rf-28b546c4-c779-4c6a-839c-962d13cd13d2"

height="2160"

width="3840"

frameDuration="1001/24000s"

name="FFVideoFormat3840x2160p2398"/>

<asset

id="r4f90e4cb-57bf-4780-986a-270de21c1f77"

start="0/360000s"

duration="124324200/360000s"

hasVideo="1"

hasAudio="1"

format="rf-4f90e4cb-57bf-4780-986a-270de21c1f77"

audioSources="1"

audioChannels="2"

audioRate="48000"

>

<media-rep kind="original-media" src="file:///Users/yonatanbd/Downloads/P1000153-002.MP4"/>

</asset>

<format

id="rf-4f90e4cb-57bf-4780-986a-270de21c1f77"

height="2160"

width="3840"

frameDuration="1001/24000s"

name="FFVideoFormat3840x2160p2398"/>

<asset

id="ra3d4fdb9-4ac3-4742-8b23-21680dedbeb9"

start="0/48000s"

duration="32939519/48000s"

hasVideo="0"

hasAudio="1"

audioSources="1"

audioChannels="2"

audioRate="48000"

>

<media-rep kind="original-media" src="file:///Users/yonatanbd/Downloads/Aan%20het%20opnemen%20(57).wav"/>

</asset>

<format

id="rf-a3d4fdb9-4ac3-4742-8b23-21680dedbeb9"

height="1080"

width="1920"

frameDuration="1/60s"

name="FFVideoFormat1080p60"/>


<effect id="titleEffect" name="Basic Title" uid=".../Titles.localized/Bumper:Opener.localized/Basic Title.localized/Basic Title.moti"/>

</resources>

MacBook Pro 14″, macOS 15.5

Posted on Jul 1, 2025 7:38 AM

Reply
Question marked as Top-ranking reply

Posted on Jul 9, 2025 12:19 PM

I ran the Gling fcpxml (v1.9) and the FCP-generated fcpxml (v1.13) through Claude and described the audio sync problem with instructions to inspect them for meaningful differences. Of course Claude can be wrong and/or BS us, but here's the response:


"After carefully analyzing both FCPXML files, I've identified the root cause of the 18-frame audio sync issue. The problem lies in how the 3rd party tool calculated the audio offset and timeline synchronization.


The main issues are:

Missing Gap Compensation: The working Info.fcpxml uses a gap element (<gap name="Gap" offset="0s" start="86400314/24000s" duration="18018/24000s"/>) to properly align the audio timeline before the audio clip starts. The broken Moose 29.fcpxml completely omits this gap.


Incorrect Offset Calculation:

    • Info.fcpxml: Uses 36036/48000s = 0.75075s for audio offset
    • Moose 29.fcpxml: Uses 17017/24000s = 0.709041667s for audio offset
    • The difference combined with the missing gap creates a timing shortfall of approximately 0.79 seconds, which equals about 19 frames at 24fps - very close to your reported 18 frames.


Sample Rate Inconsistency: The Moose 29.fcpxml uses a 24000Hz denominator for the audio offset timing, but the audio is actually 48kHz. This fundamental mismatch in how time is calculated for audio vs video elements causes the synchronization to break.


The fix would be for the 3rd party tool to:

  • Include the proper gap element before the audio clip
  • Calculate the audio offset using the correct sample rate (48kHz)
  • Properly account for the timeline synchronization needed between video and audio-only angles


This explains why the audio waveforms appear incorrectly positioned and why the playhead doesn't snap properly - the timeline calculations are fundamentally flawed in the generated FCPXML."

I then asked Claude to double-check the audio asset definitions between the two fcpxmls in case they are referencing the audio differently.


"The audio asset definitions are functionally identical in terms of timing - both start at 3600 seconds and have the same duration (1998848/48000s). The synchronization issue is NOT caused by differences in the asset definitions themselves.


This confirms that the problem lies entirely in how the multicam angles are constructed and synchronized, specifically:

  1. The missing gap element in the Moose 29.fcpxml
  2. The incorrect offset calculation (17017/24000s vs 36036/48000s)
  3. The improper timeline alignment between video and audio-only angles

The asset definitions are correct - it's the multicam implementation that's broken in the 3rd party generated file."

17 replies
Question marked as Top-ranking reply

Jul 9, 2025 12:19 PM in response to reynoldsm89

I ran the Gling fcpxml (v1.9) and the FCP-generated fcpxml (v1.13) through Claude and described the audio sync problem with instructions to inspect them for meaningful differences. Of course Claude can be wrong and/or BS us, but here's the response:


"After carefully analyzing both FCPXML files, I've identified the root cause of the 18-frame audio sync issue. The problem lies in how the 3rd party tool calculated the audio offset and timeline synchronization.


The main issues are:

Missing Gap Compensation: The working Info.fcpxml uses a gap element (<gap name="Gap" offset="0s" start="86400314/24000s" duration="18018/24000s"/>) to properly align the audio timeline before the audio clip starts. The broken Moose 29.fcpxml completely omits this gap.


Incorrect Offset Calculation:

    • Info.fcpxml: Uses 36036/48000s = 0.75075s for audio offset
    • Moose 29.fcpxml: Uses 17017/24000s = 0.709041667s for audio offset
    • The difference combined with the missing gap creates a timing shortfall of approximately 0.79 seconds, which equals about 19 frames at 24fps - very close to your reported 18 frames.


Sample Rate Inconsistency: The Moose 29.fcpxml uses a 24000Hz denominator for the audio offset timing, but the audio is actually 48kHz. This fundamental mismatch in how time is calculated for audio vs video elements causes the synchronization to break.


The fix would be for the 3rd party tool to:

  • Include the proper gap element before the audio clip
  • Calculate the audio offset using the correct sample rate (48kHz)
  • Properly account for the timeline synchronization needed between video and audio-only angles


This explains why the audio waveforms appear incorrectly positioned and why the playhead doesn't snap properly - the timeline calculations are fundamentally flawed in the generated FCPXML."

I then asked Claude to double-check the audio asset definitions between the two fcpxmls in case they are referencing the audio differently.


"The audio asset definitions are functionally identical in terms of timing - both start at 3600 seconds and have the same duration (1998848/48000s). The synchronization issue is NOT caused by differences in the asset definitions themselves.


This confirms that the problem lies entirely in how the multicam angles are constructed and synchronized, specifically:

  1. The missing gap element in the Moose 29.fcpxml
  2. The incorrect offset calculation (17017/24000s vs 36036/48000s)
  3. The improper timeline alignment between video and audio-only angles

The asset definitions are correct - it's the multicam implementation that's broken in the 3rd party generated file."

Jul 8, 2025 6:58 PM in response to reynoldsm89

I brought the assets in and manually created a multicam, then exported that out as an fcpxml and compared it to the Gling fcpxml. The audio asset durations and start times differ between the two fcpxmls.


In FCP, the imported Gling multicam audio is 18 frames off. The audio angle also behaves oddly where the playhead doesn't snap to the beginning, like it's confused about where the clip actually starts. There appears to be a phantom snapping point at about 18 frames into the gap area before the audio clip starts. Also, the audio angle waveform is off by around 18 frames, meaning the visual display doesn't match the actual audio.


I'm thinking there's a miscalculation of the audio asset, but I didn't have time to dig into it further. Maybe tomorrow.

Jul 8, 2025 8:39 AM in response to yonatan166

I'm experiencing this issue as well. I've made a Loom that shows the problem in action. The imported xml file is visually showing the files in sync, but audibly they are not. Even when I replace the audio in the angle editor and sync to monitoring angle again, it is still out of sync. Only when I delete the angle entirely, add a new angle, and sync the audio to monitoring angle does it finally sync. https://www.loom.com/share/17689855873c4fc29b9875672e1bab11?sid=83887c56-a74d-41c4-93eb-67e708650163

Jul 8, 2025 1:23 PM in response to yonatan166

I've been talking with their support team for a couple of weeks on this, which lead me to here. I think they're trying to figure out of this is a Gling issue or a FCP issue. Their CTO is the author of this thread. Here's a link to the XML file and the original media if that happens to be helpful too. https://drive.google.com/drive/folders/1vwgUzn_yoBLGP0gEPodpfwmzUfV-JsWZ?usp=sharing

This thread has been closed by the system or the community team. You may vote for any posts you find helpful, or search the Community for additional answers.

Multicam tracks are aligned, but playback not in sync

Welcome to Apple Support Community
A forum where Apple customers help each other with their products. Get started with your Apple Account.