Different results between repeated runs in parallel

I am solving a linear elasticity problem in every loop of an optimization procedure. Even tiny differences can grow into noticeably different results after many loops, which makes it difficult to reproduce optimization results.

Running mpirun -np 1 python3 demo_elasticity.py gives identical results between repeated runs, as confirmed by the output:

Solution vector norm: 0.05007291838351104
Solution vector norm: 0.05007291838351104

Naively, I would have expected the results to also be identical between repeated runs in parallel. However, mpirun -np 3 python3 demo_elasticity.py shows tiny differences (in the last digit) between repeated runs, as confirmed by the output:

Solution vector norm: 0.05007291839575202
Solution vector norm: 0.05007291839575208

What is the reason for this non-deterministic behaviour in parallel, and is there anything that can be done about it?

The difference is less than 1e-16, i.e. on the order of machine precision. It is numerical noise that arises because floating-point addition is not associative: when contributions are accumulated from different processes, the order of the additions can change between runs, and with it the last digits of the result.
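
As a minimal illustration (plain Python, independent of FEniCSx/PETSc), summing the same numbers in a different order already changes the last bits of the result, which is essentially what a parallel reduction over several ranks does:

```python
# Floating-point addition is not associative: the same values summed in a
# different order can differ in the last bits, which is effectively what a
# parallel reduction over several MPI ranks does.
import random

random.seed(0)
vals = [random.uniform(-1.0, 1.0) for _ in range(100_000)]

s1 = sum(vals)           # one summation order
random.shuffle(vals)
s2 = sum(vals)           # same values, different order

print(s1, s2, s1 - s2)   # the difference is typically O(1e-16) relative to s1
```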

Also note that the elasticity demo uses an iterative solver with far lower precision (rtol=1e-8), and the GAMG preconditioner depends on the mesh partitioning.
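
If you want to reduce the run-to-run spread, you can tighten the Krylov tolerance through the PETSc options. Below is a self-contained petsc4py sketch (not the demo itself; a 1D Laplacian stands in for the elasticity operator) showing where rtol and the GAMG preconditioner are configured. Even with a tight tolerance the last digits can still differ between parallel runs, because the reduction order still varies.

```python
# Self-contained petsc4py sketch: a 1D Laplacian stands in for the elasticity
# operator. Shows where the Krylov tolerance and GAMG preconditioner are set.
# Run with e.g. mpirun -np 3.
from mpi4py import MPI
from petsc4py import PETSc

n = 1000  # global problem size, arbitrary for illustration

A = PETSc.Mat().createAIJ([n, n], comm=MPI.COMM_WORLD)
A.setUp()
# Allow insertion without exact preallocation (fine for a small example).
A.setOption(PETSc.Mat.Option.NEW_NONZERO_ALLOCATION_ERR, False)

rstart, rend = A.getOwnershipRange()
for i in range(rstart, rend):
    A.setValue(i, i, 2.0)
    if i > 0:
        A.setValue(i, i - 1, -1.0)
    if i < n - 1:
        A.setValue(i, i + 1, -1.0)
A.assemble()

b = A.createVecLeft()
b.set(1.0)
x = A.createVecRight()

ksp = PETSc.KSP().create(comm=MPI.COMM_WORLD)
ksp.setOperators(A)
ksp.setType("cg")
ksp.getPC().setType("gamg")      # partitioning-dependent, like in the demo
ksp.setTolerances(rtol=1e-12)    # tighter than the demo's rtol=1e-8
ksp.solve(b, x)

unorm = x.norm()                 # collective reduction over all ranks
if MPI.COMM_WORLD.rank == 0:
    print("Solution vector norm:", unorm)
```

A direct solver (e.g. pc_type lu) would take the iterative tolerance out of the picture entirely, but the parallel factorization and the reductions are still partitioning-dependent, so bitwise reproducibility across runs is not guaranteed either way.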
