Mumps in parallel slower than in serial

nice-butt · April 24, 2019, 12:19am

Hello

I am playing with the Hyperelasticity demo, using MUMPS as the linear solver on a 40x40x40 mesh. When I run:
mpirun -np xxx python demo.py
with xxx being 1, the computing time (from the end of meshing until the simulation ends) is 153 s. When I increase xxx to 4, 6, 8, 10, the timing goes to 175, 161, 188, 199 s.
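To put the numbers above in terms of speedup, here is a small sketch that only uses the timings quoted in this post. The helper name `speedup_table` is made up for illustration; efficiency is taken as speedup divided by the number of processes.

```python
# Wall-clock times (seconds) reported in the post, keyed by number of MPI processes.
timings = {1: 153.0, 4: 175.0, 6: 161.0, 8: 188.0, 10: 199.0}


def speedup_table(timings):
    """Return {nprocs: (speedup, efficiency)} relative to the 1-process run."""
    serial = timings[1]
    return {
        n: (serial / t, serial / (t * n))
        for n, t in sorted(timings.items())
    }


table = speedup_table(timings)
for n, (speedup, efficiency) in table.items():
    print(f"np={n:2d}  speedup={speedup:.2f}  efficiency={efficiency:.2f}")
```

Every parallel run has a speedup below 1, so adding processes makes the run slower rather than just scaling poorly.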

I set ghost_mode to shared_facet and use ParMETIS (I don't think these are relevant, though).

Has anyone else observed this, i.e. is this just how it is, or did I miss an important setting?

Thanks
Victor

plugged · April 24, 2019, 7:42am

Hey,

I had a similar problem, which was due to PETSc (or in particular its MUMPS solver) using thread parallelism via OpenMP. Let us say your PC has n threads available.
Then, when you run your program in serial, it already uses n threads (afaik this is the default).

Using mpirun with m instances now lets each of these create another n OpenMP threads, so you have m*n OpenMP threads in total, although your system can only support n. Therefore, you are slower.
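The m*n arithmetic can be sketched like this. It is just an illustration of the reasoning, not a measurement: the function names are made up, and `os.cpu_count()` is used as a stand-in for n, assuming each process would spawn that many OpenMP threads by default.

```python
import os


def total_threads(mpi_ranks, omp_threads_per_rank):
    """Total OpenMP threads when every MPI rank spawns its own team."""
    return mpi_ranks * omp_threads_per_rank


def oversubscription(mpi_ranks, omp_threads_per_rank, hardware_threads):
    """Ratio of requested threads to available hardware threads (> 1 means oversubscribed)."""
    return total_threads(mpi_ranks, omp_threads_per_rank) / hardware_threads


# Assumption: n is the number of logical CPUs reported by the OS.
n = os.cpu_count() or 1
for m in (1, 4, 6, 8, 10):
    default = oversubscription(m, n, n)   # each rank spawns n threads
    limited = oversubscription(m, 1, n)   # each rank limited to 1 thread
    print(f"m={m:2d}  default: {default:.1f}x  one thread per rank: {limited:.2f}x")
```

With the default, the load factor equals m, so every run with more than one MPI process asks for more threads than the machine has.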

You can fix this behaviour by setting the OMP_NUM_THREADS environment variable like this:

export OMP_NUM_THREADS=1

in bash before you run your program (note that there must be no spaces around the = sign). This will make each MPI instance create only a single OpenMP thread.
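If you would rather keep the setting inside the script, a minimal sketch is to set the variable at the very top of demo.py. This assumes the OpenMP runtime reads OMP_NUM_THREADS when it is first initialised, so the assignment has to come before importing dolfin or any other library that may start OpenMP; the helper name `limit_omp_threads` is made up for this example.

```python
import os


def limit_omp_threads(n=1):
    """Set OMP_NUM_THREADS for this process and return the value that was set."""
    os.environ["OMP_NUM_THREADS"] = str(n)
    return os.environ["OMP_NUM_THREADS"]


# Call this before importing dolfin / petsc4py (assumption: the OpenMP
# runtime has not been initialised yet at this point).
value = limit_omp_threads(1)
print("OMP_NUM_THREADS =", value)

# ... the rest of demo.py (imports, mesh, solve) would follow here ...
```

Exporting the variable in the shell is the more robust option, since it is guaranteed to be in place before Python starts.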
