blas
richBlas
broadcastLHSColOpFromBinOp
CuMatrixFuns
broadcastRHSColOpFromBinOp
CuMatrixFuns