tma2mma[PIPE_DEPTH]
→
TMA signals MMA: SMEM data ready for stage k
Type: TMABar (TMA arrive with byte count)
mma2tma[PIPE_DEPTH]
→
MMA signals TMA: SMEM buffer free, can reuse stage k
Type: TCGen05Bar (tcgen05.commit)
mma2ld
→
MMA signals Writeback: TMEM accumulation complete
Type: TCGen05Bar (tcgen05.commit)
ld2mma
→
Writeback signals MMA: TMEM read complete, safe to overwrite
Type: MBarrier (128 threads arrive)