How do I handle the vector reordering caused by parallel mesh partitioning?

Dear all,
Mesh partitioning reorders the entries of a vector differently on each process. How can I obtain the corresponding indices (a global-to-local map)?
My question is: how can I extract the rows belonging to u from the block vector b while running in parallel?
A minimal example is as follows:

from mpi4py import MPI

try:
    from petsc4py import PETSc

    import dolfinx

    if not dolfinx.has_petsc:
        print("This demo requires DOLFINx to be compiled with PETSc enabled.")
        exit(0)
except ModuleNotFoundError:
    print("This demo requires petsc4py.")
    exit(0)

import numpy as np

import ufl
from basix.ufl import element, mixed_element
from dolfinx import default_real_type, fem, la
from dolfinx.fem import (
    Constant,
    Function,
    dirichletbc,
    extract_function_spaces,
    form,
    functionspace,
    locate_dofs_topological,
)
from dolfinx.fem.petsc import (
    apply_lifting, assemble_matrix_block, assemble_vector_block, create_matrix,
    create_matrix_block, create_vector, create_vector_block, set_bc,
)
from dolfinx.io import XDMFFile
from dolfinx.mesh import CellType, create_rectangle, locate_entities_boundary
from ufl import div, dx, grad, inner

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()


msh = create_rectangle(
    MPI.COMM_WORLD, [np.array([0, 0]), np.array([1, 1])], [32, 32], CellType.triangle
)

P2 = element("Lagrange", msh.basix_cell(), 2, shape=(msh.geometry.dim,), dtype=default_real_type)
P1 = element("Lagrange", msh.basix_cell(), 1, dtype=default_real_type)
V, Q = functionspace(msh, P2), functionspace(msh, P1)

noslip = np.zeros(msh.geometry.dim, dtype=PETSc.ScalarType) 

# Define variational problem
(u, p) = ufl.TrialFunction(V), ufl.TrialFunction(Q)
(v, q) = ufl.TestFunction(V), ufl.TestFunction(Q)

x = ufl.SpatialCoordinate(msh)
f1=ufl.as_vector([x[0],x[1]])
f2=ufl.as_vector([x[1]**2,x[0]**2])
a = form([[inner(grad(u), grad(v)) * dx, inner(p, div(v)) * dx], [inner(div(u), q) * dx, None]])
L = form([inner(f1, v) * dx, inner(x[0] + x[1], q) * dx])  # the second block must be tested with q

bb = create_vector_block(L)
assemble_vector_block(bb, L, a, bcs=[])

ccb = bb.getArray()[:]

print(f"rank {rank}: local size is {len(ccb)}, values are", ccb)

As explained in DOLFINx: The next generation FEniCS problem solving environment (Section 4.1), DOLFINx uses the concept of an IndexMap to describe the parallel layout of data.

In the case of a single function space, you can look at the index map of the dofmap, in particular its local_range, multiplied by the number of vector components, i.e. the product of all entries in the value shape (np.prod(V.element.value_shape)).

Example:

from mpi4py import MPI

import numpy as np

import ufl
import dolfinx.fem.petsc
from basix.ufl import element

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()


msh = dolfinx.mesh.create_unit_square(comm, 10, 10)

P2 = element("Lagrange", msh.basix_cell(), 2, shape=(msh.geometry.dim,), dtype=dolfinx.default_real_type)
P1 = element("Lagrange", msh.basix_cell(), 1, dtype=dolfinx.default_real_type)
V = dolfinx.fem.functionspace(msh, P2)
Q = dolfinx.fem.functionspace(msh, P1)

# Owned index range of each index map; multiply by the block size to get the dof range
local_V_range = np.asarray(V.dofmap.index_map.local_range)
local_Q_range = np.asarray(Q.dofmap.index_map.local_range)
print(rank, local_V_range * V.dofmap.index_map_bs, local_Q_range * Q.dofmap.index_map_bs)

yields

0 [  0 298] [ 0 40]
1 [298 584] [40 77]
2 [584 882] [ 77 121]

A blocked vector is ordered such that all the dofs of the first block come first, then all the dofs of the second block, i.e.
(dofs_V, dofs_Q).
However, when you call getArray() on a process, you only get the owned entries:

num_owned_V_dofs = (local_V_range[1] - local_V_range[0])*V.dofmap.index_map_bs
num_owned_Q_dofs = (local_Q_range[1] - local_Q_range[0])*Q.dofmap.index_map_bs

(v, q) = ufl.TestFunction(V), ufl.TestFunction(Q)

one = dolfinx.fem.Constant(msh,  dolfinx.default_scalar_type(1.0))
one_v = ufl.as_vector([one, one])
L = dolfinx.fem.form([ufl.inner(one_v, v) * ufl.dx, ufl.inner(one, q) * ufl.dx])  

bb = dolfinx.fem.petsc.create_vector_block(L)

ccb = bb.getArray()[:]

# The local array holds the owned V dofs followed by the owned Q dofs
assert len(ccb) == num_owned_V_dofs + num_owned_Q_dofs

i.e. ccb = (V_dofs_local, Q_dofs_local).
Thus, you can get the global indices by:

V_global_start = 0
Q_global_start = V.dofmap.index_map_bs * V.dofmap.index_map.size_global

global_indices = np.zeros(num_owned_V_dofs + num_owned_Q_dofs, dtype=np.int64)
global_indices[:num_owned_V_dofs] = np.arange(
    V_global_start + local_V_range[0] * V.dofmap.index_map_bs,
    V_global_start + local_V_range[1] * V.dofmap.index_map_bs,
    dtype=np.int64,
)
global_indices[num_owned_V_dofs:] = np.arange(
    Q_global_start + local_Q_range[0] * Q.dofmap.index_map_bs,
    Q_global_start + local_Q_range[1] * Q.dofmap.index_map_bs,
    dtype=np.int64,
)
print(global_indices)
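
Back to the original question: once the block vector has been assembled (as in your first snippet), the owned rows belonging to u are simply the first num_owned_V_dofs entries of the process-local array, and global_indices tells you where they sit in the assembled block system. A minimal sketch, reusing the variables defined above:

# Reusing bb, num_owned_V_dofs and global_indices from the snippets above;
# bb should be assembled with assemble_vector_block before the values are meaningful.
local_array = bb.getArray()
u_rows_local = local_array[:num_owned_V_dofs]  # owned rows belonging to u (the V block)
p_rows_local = local_array[num_owned_V_dofs:]  # owned rows belonging to p (the Q block)

# Global row numbers of the owned u entries in the blocked system
u_global_rows = global_indices[:num_owned_V_dofs]
print(f"rank {rank}: {len(u_rows_local)} owned u rows, first global rows {u_global_rows[:5]}")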

Coincidentally, I was just reading the cited section as this reply came up, so maybe I can ask a related follow-up question here.

How is it possible that index maps are always contiguous? This seemed like a ‘special case’ to me, but from the paper I understand it is a basic property. E.g., the nodes in a mesh are partitioned and ownership is distributed over the processes; naturally, an optimal partitioning does not yield contiguous sets of (a priori defined) node numbers. Is the numbering of the nodes in the mesh then updated a posteriori? It seems similar issues should arise all the time.

For creating structures in DOLFINx, there are several things at play.

  1. Mesh data is read in (cell-node connectivity and node coordinates) and distributed over the processes with a graph partitioner, which tries to find a partitioning of the cells that minimizes communication.
  2. Once the cells have been partitioned (and ghost information has been distributed), ownership of the mesh nodes is assigned and the nodes are renumbered, such that those owned by process 0 are numbered [0, N0-1], those owned by process 1 are numbered [N0, N0+N1-1], etc.
    Nodes that are present on a process but not owned by it are added to the index map as ghosts (see the sketch after this list).
  3. A similar procedure is applied to the vertices (a subset of the mesh nodes if the geometry is higher order).
  4. Any other structure that needs an index map is based off these initial partitions, with a suitable amount of ghosts included.
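
To make the renumbering concrete, here is a small sketch (assuming the dolfinx v0.9.x Python API) that inspects the cell and vertex index maps of a freshly partitioned mesh; each process reports a contiguous owned range plus the global indices of its ghosts:

from mpi4py import MPI
import dolfinx

msh = dolfinx.mesh.create_unit_square(MPI.COMM_WORLD, 8, 8)
tdim = msh.topology.dim

cell_map = msh.topology.index_map(tdim)  # index map for cells
vertex_map = msh.topology.index_map(0)   # index map for vertices

print(
    f"rank {msh.comm.rank}: owned cell range {cell_map.local_range}, "
    f"{cell_map.num_ghosts} ghost cells, owned vertex range {vertex_map.local_range}, "
    f"ghost vertices {vertex_map.ghosts}"
)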

Thus, not every index map is optimally partitioned; they are all derived from the initial mesh partitioning. In particular, you may create any index map you want in DOLFINx by using the relevant constructors:
https://docs.fenicsproject.org/dolfinx/v0.9.0/python/generated/dolfinx.common.html#dolfinx.common.IndexMap

This is for instance done when creating point-cloud meshes, where there is no ghosting in the construction.
It is also used (with ghosting) in the internals of DOLFINx MPC, where additional ghost information is added between processes depending on the multi-point constraint and the partitioning.
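
For reference, here is a hedged sketch of using that constructor directly (signature as documented for dolfinx v0.9.x: the communicator, the number of locally owned indices and, optionally, arrays of ghost indices and their owning ranks):

from mpi4py import MPI
import numpy as np
import dolfinx.common

comm = MPI.COMM_WORLD

# Each process owns 10 indices; ownership is contiguous by construction,
# i.e. rank 0 owns [0, 10), rank 1 owns [10, 20), and so on.
imap = dolfinx.common.IndexMap(comm, 10)
print(f"rank {comm.rank}: local_range {imap.local_range}, size_global {imap.size_global}")

# With ghosting: every rank except the last also ghosts the first index owned by the next rank
if comm.rank < comm.size - 1:
    ghosts = np.array([(comm.rank + 1) * 10], dtype=np.int64)
    owners = np.array([comm.rank + 1], dtype=np.int32)
else:
    ghosts = np.empty(0, dtype=np.int64)
    owners = np.empty(0, dtype=np.int32)
imap_ghosted = dolfinx.common.IndexMap(comm, 10, ghosts, owners)
print(f"rank {comm.rank}: num_ghosts {imap_ghosted.num_ghosts}")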

To get the original input numbering of the cells and geometry nodes, one uses original_cell_index and input_global_indices, as for instance sketched out in: How to apply a boundary condition to a range of DOFs? - #7 by dokken
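
A brief sketch of reading those two arrays (both are plain NumPy arrays on the Python side, assuming dolfinx v0.9.x):

from mpi4py import MPI
import dolfinx

msh = dolfinx.mesh.create_unit_square(MPI.COMM_WORLD, 4, 4)

# Input-file index of each locally stored cell, in the order the cells appear after partitioning
orig_cells = msh.topology.original_cell_index

# Input global index of each locally stored geometry node
orig_nodes = msh.geometry.input_global_indices

print(f"rank {msh.comm.rank}: first cells {orig_cells[:5]}, first nodes {orig_nodes[:5]}")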

Let me know if this helps your understanding.
