The International Conference for High Performance Computing, Networking, Storage and Analysis
Optimization and Highly Parallel Implementation of Domain Decomposition Based Algorithms.
Authors: Lubomir Riha (IT4Innovations), Tomas Brzobohaty (IT4Innovations), Alexandros Markopoulos (IT4Innovations), Marta Jarosova (IT4Innovations), Tomas Kozubek (IT4Innovations)
Abstract: We describe an implementation and scalability results of a hybrid FETI (Finite Element Tearing and Interconnecting) solver based on our variant of the FETI type domain decomposition method called Total FETI. In our approach a small
number of neighboring subdomains is aggregated into clusters, which results into a smaller coarse problem. Current implementation of the solver is focused on the optimal performance of the main CG solver, including: implementation of ommunication hiding and avoiding techniques for global communications; optimization of the nearest neighbor communication - multiplication with global gluing matrix; and optimization of the parallel CG algorithm to iterate over local Lagrange multipliers only. The performance is demonstrated on a linear elasticity synthetic 3D cube and real world benchmarks.