You should tag each individual tetrahedron with a cell marker, and use these markers to determine which Young's modulus is required for the cells on each process.
See for instance:
You could create this marker with a serial run, and then load it in a slightly modified parallel program.
Usually, mesh markers are either determined geometrically or loaded as input from the mesh generator.
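As a rough illustration of the geometric approach, here is a minimal sketch in plain Python that does not use any mesh library. The midpoint coordinates, the split at x = 0.5, the material values, and the names `tag_cell`, `modulus_for_cells` and `local_cells` are all hypothetical and only there to show the idea: tag every cell once, then let each process look up the modulus for the cells it owns.

```python
# Hypothetical data: midpoints (x, y, z) of six tetrahedra.
cell_midpoints = [
    (0.10, 0.5, 0.5), (0.30, 0.5, 0.5), (0.45, 0.5, 0.5),
    (0.55, 0.5, 0.5), (0.70, 0.5, 0.5), (0.90, 0.5, 0.5),
]

# Assumed material table: marker value -> Young's modulus in Pa.
YOUNGS_MODULUS = {0: 210e9, 1: 70e9}


def tag_cell(midpoint, x_split=0.5):
    """Geometric rule: marker 0 left of x_split, marker 1 otherwise."""
    return 0 if midpoint[0] < x_split else 1


def modulus_for_cells(local_cells, markers):
    """Look up Young's modulus for the cells owned by one process."""
    return {cell: YOUNGS_MODULUS[markers[cell]] for cell in local_cells}


# One marker per cell, indexed by global cell number.
markers = [tag_cell(m) for m in cell_midpoints]

# Pretend this process owns cells 2, 3 and 5 after partitioning.
local_cells = [2, 3, 5]
local_E = modulus_for_cells(local_cells, markers)
```

In a real program the markers would be stored with the mesh and distributed along with it, so each process only sees the entries for its own cells.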
Note that as long as your mesh only has six cells, there is not going to be any speedup from running in parallel.