Extremely slow solver when using coordinate transform

sbhasan · June 21, 2024, 9:37pm

Hi,

I am trying to introduce spatial transform in a domain. This causes the solver to become extremely slow in compiling or assembly stage. I will appreciate if someone knowledgeable could help me understand how to possibly define the problem so as to avoid it.

Below is an MWE executed in dolfinx 0.8 installed in WSL through conda:

import dolfinx, gmsh, ufl, mpi4py, time
from dolfinx.io import gmshio
import numpy as np
from dolfinx.fem.petsc import LinearProblem

def magnetic_potential_model(mesh, ct, ft, space_transform=False):

    # gmsh domain tags, by visible inspection
    magnet_tag, shell_tag, vacuum_tag = 1, 2, 3

    Omega = dolfinx.fem.functionspace(mesh, ("CG", fem_degree))
    u, v = ufl.TrialFunction(Omega), ufl.TestFunction(Omega)

    dx = ufl.Measure('dx', domain=mesh, subdomain_data=ct)

    F = ufl.dot(ufl.grad(u), ufl.grad(v))*dx((magnet_tag,vacuum_tag)) # whole domain except the outer shell
    # add remnanet flux
    F += -ufl.dot(ufl.as_vector((0, 1, 0)), ufl.grad(v))*dx(magnet_tag) # only inside the magnet

    if space_transform:
        x, y, z = ufl.SpatialCoordinate(mesh)
        r = ufl.sqrt(x**2 + y**2)
        grad_x = lambda u, jac, r_: (1/jac*x**2/r**2 + y**2/(r*r_))*u.dx(0) + (1/jac*x*y/r**2 - x*y/(r*r_))*u.dx(1)
        grad_y = lambda u, jac, r_: (1/jac*x*y/r**2 - x*y/(r*r_))*u.dx(0) + (x**2/(r*r_) + 1/jac*y**2/r**2)*u.dx(1)

        r0_ie = fov_r - t_shell

        # IE: polynomial stretching: r_ = r0 + exi**beta*L
        beta = 3
        alpha = 1000*t_shell
        exi = (r-r0_ie)/t_shell
        r_ = r0_ie + (exi**beta)*alpha
        jac = alpha*beta/t_shell*exi**(beta-1)

        # gradients in the layer
        grad_x_u = grad_x(u, jac, r_)
        grad_y_u = grad_y(u, jac, r_)
        grad_x_test = grad_x(v, jac, r_)
        grad_y_test = grad_y(v, jac, r_)

        grad_ie_trial = ufl.as_vector((grad_x_u, grad_y_u))
        grad_ie_test = ufl.as_vector((grad_x_test, grad_y_test))

        F += ufl.dot(grad_ie_trial, grad_ie_test)*jac*(r_/r)*dx(shell_tag)
    else:
        F += ufl.dot(ufl.grad(u), ufl.grad(v))*dx((shell_tag))
    
    a = ufl.lhs(F)
    L = ufl.rhs(F)

    mesh.topology.create_connectivity(mesh.topology.dim-1, mesh.topology.dim)
    boundary_facets = dolfinx.mesh.exterior_facet_indices(mesh.topology)
    boundary_dofs = dolfinx.fem.locate_dofs_topological(Omega, mesh.topology.dim-1, boundary_facets)

    u_dbc = dolfinx.fem.Function(Omega)
    u_dbc.x.array[:] = 0.0
    dbc = [dolfinx.fem.dirichletbc(u_dbc, boundary_dofs)]
        
    ## solve
    petsc_options={"ksp_type": "preonly", "pc_type": "lu", "pc_factor_mat_solver_type":"mumps"}
    linear_problem = LinearProblem(a, L, bcs = dbc, petsc_options=petsc_options)
    uh = linear_problem.solve()
    uh.x.scatter_forward()
    
    print("Solved for {} dofs. Min, max of the solved potential is {}, {}".format(uh.x.array.size, uh.x.array.min(), uh.x.array.max()))

    return uh

fov_r = 1
t_shell = 0.1
gdim = 3
fem_degree = 2

gmsh.initialize()
gmsh.option.setNumber("General.Terminal", 0)
gmsh.clear()
occ = gmsh.model.occ

mag = occ.add_box(-0.5, -0.25, -0.25, 1, 0.5, 0.5)
fov = occ.add_cylinder(0, 0, -0.5, 0, 0, 1, fov_r)
shell = occ.add_cylinder(0, 0, -0.5, 0, 0, 1, fov_r-t_shell)
frag, _ = occ.fragment([(gdim, fov)], [(gdim, mag), (gdim, shell)])
occ.synchronize()

all_doms = occ.get_entities(gdim)
for j, dom in enumerate(all_doms):
    gmsh.model.addPhysicalGroup(dom[0], [dom[1]], j + 1)  # create the main group/node

all_edges = gmsh.model.getEntities(gdim - 1)
for j, edge in enumerate(all_edges):
    gmsh.model.addPhysicalGroup(edge[0], [edge[1]], edge[1])  # create the main group/node

gmsh.model.mesh.generate(gdim)
gmsh.model.mesh.refine()

# gmsh.fltk.run()
mesh, ct, ft = gmshio.model_to_mesh(gmsh.model, mpi4py.MPI.COMM_WORLD, 0, gdim)

start_time = time.time()
uh = magnetic_potential_model(mesh, ct, ft, space_transform=False)
print("Solve time without spatial transform: {}".format(time.time()-start_time))

start_time = time.time()
uh_st = magnetic_potential_model(mesh, ct, ft, space_transform=True)
print("Solve time with spatial transform: {}".format(time.time()-start_time))

This leads to the following output on my device:

Solve time without spatial transform: 3.008222818374634s
Solve time with spatial transform: 124.13119649887085s

francesco-ballarin · June 22, 2024, 5:00am

I may be reading the gmsh part wrong, but it seems to me that your domain includes the origin of the (x, y) plane. In that case, your transformation which has 1/r becomes singular.

sbhasan · June 22, 2024, 11:15am

Thanks a lot for your feedback!

The origin x=y=z=0 should be inside the smaller box (magnet) which is located at the origin. The spatial transform is for the outer shell where r>0. The domain tags were found visibly and I did notice in the past that they changed at least for edges/facets from one execution environment (version?) to another. I do not have a quick idea at the moment to make them permanent but at least I have edited the code above by adding variable names to make the intent clear to the reader.

dokken · June 22, 2024, 11:39am

The problem is the automatic estimation of the quadrature degree for your integrals.
Setting:


    dx = ufl.Measure(
        "dx",
        domain=mesh,
        subdomain_data=ct,
        metadata={"quadrature_degree": 8},
    )

you get:

Solved for 16433 dofs. Min, max of the solved potential is -0.11544615271350633, 0.1157284028550102
Solve time without spatial transform: 1.4832050800323486
Solved for 16433 dofs. Min, max of the solved potential is -0.11654400959639583, 0.11686836195473972
Solve time with spatial transform: 1.1220996379852295

This is because the estimated quadrature degree is 62, as you can see with:

    from ufl.algorithms.compute_form_data import estimate_total_polynomial_degree

    print(
        f"{space_transform=} , {estimate_total_polynomial_degree(a)}, {estimate_total_polynomial_degree(L)}"
    )

yielding

space_transform=False , 2, 1
Solved for 16433 dofs. Min, max of the solved potential is -0.11544615271350633, 0.1157284028550102
Solve time without spatial transform: 0.6003363132476807
space_transform=True , 62, 1
Solved for 16433 dofs. Min, max of the solved potential is -0.11654400959639583, 0.11686836195473972
Solve time with spatial transform: 0.6055417060852051

You can set it up to about 15 without any huge performance hit, as there are special quadrature rules for such cases, see: basix/cpp/basix/quadrature.cpp at main · FEniCS/basix · GitHub

sbhasan · June 22, 2024, 12:03pm

Thanks a lot for your feedback!

For my understanding, is it a corner case in the automatic determination algorithm or it is somehow merited by the variational form I have in at least some situations?

dokken · June 22, 2024, 12:31pm

It is due to all the polynomials and fractions in

sbhasan:

  x, y, z = ufl.SpatialCoordinate(mesh)
        r = ufl.sqrt(x**2 + y**2)
        grad_x = lambda u, jac, r_: (1/jac*x**2/r**2 + y**2/(r*r_))*u.dx(0) + (1/jac*x*y/r**2 - x*y/(r*r_))*u.dx(1)
        grad_y = lambda u, jac, r_: (1/jac*x*y/r**2 - x*y/(r*r_))*u.dx(0) + (x**2/(r*r_) + 1/jac*y**2/r**2)*u.dx(1)

        r0_ie = fov_r - t_shell

        # IE: polynomial stretching: r_ = r0 + exi**beta*L
        beta = 3
        alpha = 1000*t_shell
        exi = (r-r0_ie)/t_shell
        r_ = r0_ie + (exi**beta)*alpha
        jac = alpha*beta/t_shell*exi**(beta-1)

        # gradients in the layer
        grad_x_u = grad_x(u, jac, r_)
        grad_y_u = grad_y(u, jac, r_)
        grad_x_test = grad_x(v, jac, r_)
        grad_y_test = grad_y(v, jac, r_)

        grad_ie_trial = ufl.as_vector((grad_x_u, grad_y_u))
        grad_ie_test = ufl.as_vector((grad_x_test, grad_y_test))

Topic		Replies	Views
PETSc NonlinearProblem running slow dolfinx	6	399	November 27, 2023
Dolfinx seems much slower than dolfin in solving nonlinear mechanics dolfinx dolfinx	6	129	June 22, 2025
Super slow code (3D elasticity problem)	6	2855	July 11, 2019
Using null space to eliminate translations and rotations in dolfinx dolfinx	10	138	July 12, 2024
FFCx compilation speed decreased? General	11	576	September 12, 2023

Extremely slow solver when using coordinate transform

Related topics