arrive.expect_tx(4096) decrements pending to 0 and sets tx_count. cp.async.bulk triggers TMA transfer.
Executes cp.async.bulk dispatched from T0. Auto-decrements tx_count on completion.
try_wait suspends until pending==0 AND tx==0. HW auto-decrements tx_count when transfer completes.