COSTA is a communication-optimal, highly-optimised algorithm for data
redistribution accross multiple processors, using MPI and OpenMP and
offering the possibility to transpose and scale some or all data.
