Very low parallel efficiency (10 times slower) for multi-nodes computing

LuckyAustin · April 15, 2020, 7:30pm

I was using FEniCS to solve a 3D linear wave propagation problem in frequency domain with direct solver “MUMPS”.

Due to the large-system requirement (~7M unknowns), I used 4 nodes and 56 cores/node, parallel run with command “ibrun”. The time consumption was 5 h 25 min, and memory comsumption was bout 630 GB.

However, if I used 112 cores in 1 node, it only took about 28 m 43 s.

Therefore, multi-nodes computing was at least 10 times slower than a single-node computing. Could anyone give me any suggestions? Thank you!

Topic		Replies	Views
Parallel computation between Intel i7 and M2 pro mesh	2	290	August 30, 2023
FEniCS + MPI on docker inefficient?	12	2556	September 12, 2020
Mesh Node Numbering mesh	2	610	July 15, 2019
The programme is slower in HPC computer installation mpi	13	111	March 6, 2025
Parallelisation issue with FenicsX dolfinx dolfinx , mpi	0	210	January 4, 2024

Very low parallel efficiency (10 times slower) for multi-nodes computing

Related topics