I know that I could use mpirun, but I am not sure that I want to run ncore instances of the same program in parallel.
Besides duplicating plots, etc., this would not be efficient, as it is commented, for instance, in this post.
I actually want to exploit the multi-thread capability of linear algebra solvers that works fine on the workstation and I don’t understand why the same doesn’t work on the laptop.
To be more explicit, a sequential run of a code on the workstation uses all 40 cores when solving a linear system. As far as I understand, this should be the default behavior.