4 calls to ALTALLTOALLV

TRANSPOSE_BLOCK_TO_CHUNK
TRANSPOSE_BLOCK_TO_CHUNK
TRANSPOSE_CHUNK_TO_BLOCK
TRANSPOSE_CHUNK_TO_BLOCK