We show that n^2-problem solvers and level-3 BLAS routines for matrix multiplication can be implemented on all types of parallel computers using a 1-dimensional systolic loop scheme. The computations can be sped up considerably by using the recently introduced hyper-systolic algorithm.
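
As an illustration of the 1-dimensional systolic loop idea, the following minimal sketch simulates a ring of p processors computing C = A B in a single Python process. It is not the hyper-systolic algorithm and not the implementation referred to above; the function names `matmul_block` and `systolic_ring_matmul`, the row-block data layout, and the shift direction are assumptions chosen for the example. Each simulated processor keeps one row block of A fixed, while the row blocks of B circulate around the ring, so that after p steps every processor has seen every block of B.

```python
def matmul_block(a, b):
    """Dense product of two list-of-lists matrices (local computation)."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def systolic_ring_matmul(A, B, p):
    """Simulate C = A @ B on a ring of p processors (hypothetical layout).

    Assumes A is n x n, B is n x m, and p divides n.
    """
    n = len(A)
    assert len(A[0]) == n and len(B) == n and n % p == 0
    nb = n // p            # rows per block
    m = len(B[0])

    # Processor i owns row block i of A (stationary) and, initially,
    # row block i of B (circulating).
    a_rows = [A[i * nb:(i + 1) * nb] for i in range(p)]
    b_rows = [B[i * nb:(i + 1) * nb] for i in range(p)]
    owner = list(range(p))  # owner[i]: index of the B block held by processor i
    c_rows = [[[0] * m for _ in range(nb)] for _ in range(p)]

    for _ in range(p):
        # Local phase: every processor multiplies the matching column
        # slice of its A block with the B block it currently holds.
        for i in range(p):
            k = owner[i]
            a_slice = [row[k * nb:(k + 1) * nb] for row in a_rows[i]]
            partial = matmul_block(a_slice, b_rows[i])
            for r in range(nb):
                for c in range(m):
                    c_rows[i][r][c] += partial[r][c]
        # Communication phase: one shift along the ring, so that
        # processor i receives the block held by processor i + 1.
        b_rows = b_rows[1:] + b_rows[:1]
        owner = owner[1:] + owner[:1]

    # Gather the row blocks of C in processor order.
    return [row for block in c_rows for row in block]
```

In this sketch each of the p steps costs one nearest-neighbour shift per processor, which is the communication pattern that a hyper-systolic scheme aims to reduce.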