A first NCM

We have, so far, covered the basic input file schema for COMMET in Hello world: running a simulation without an NCM and the approach for defining custom material models in Defining material models using Pytorch. In this example, we demonstrate the following:

  • How to create a basic NCM.

  • How to train an NCM on some data in a supervised manner.

  • How to then use that NCM in a COMMET simulation.

Example files

The files for running the example can be downloaded as a single zip file here; it contains the following files:

.
├── ncm.torchscript
├── training.py
└── a_first_ncm.jsonc

Problem definition

The problem definition is similar to that of the Hello World example (see Hello world: running a simulation without an NCM) in terms of geometry and boundary conditions. However, we will model the material behaviour using an NCM that has been trained on synthetic data.

Defining the NCM

The NCM will use isochoric invariants as its inputs (the kinematic layer) and a monotonically increasing input convex neural network (MICNN) as the inner network; see the COMMET paper for more details. More specifically, we define the isochoric right Cauchy-Green tensor as

\tilde{\mathbf{C}} := \mathbf{C} (\det{\mathbf{C}})^{-1/3}\,,

and the isochoric first and second invariants, respectively, as

\tilde{I}_1 := \text{tr}(\tilde{\mathbf{C}})\,,\qquad \tilde{I}_2 := \frac{1}{2}\left(\text{tr}(\tilde{\mathbf{C}})^2 -\text{tr}(\tilde{\mathbf{C}}^2)\right)\,.

We then define the kinematic layer as follows:

\mathcal{K}(\mathbf{F}) = \begin{bmatrix} \tilde{I}_1 - 3 \\ \tilde{I}_2^{3/2} - 3^{3/2} \\ (\det(\mathbf{C})^{1/2}-1)^{2} \end{bmatrix}

Here, we use \tilde{I}_2^{3/2} rather than \tilde{I}_2 itself since the former is polyconvex in \mathbf{F} and the latter is not (see e.g. the Can KAN CANs paper and the references therein for a discussion of this point).
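
As a quick numerical sanity check, the kinematic layer defined above vanishes in the undeformed state \mathbf{F} = \mathbf{I}, since then \tilde{I}_1 = \tilde{I}_2 = 3 and \det(\mathbf{C}) = 1. The following standalone sketch (not part of training.py) verifies this directly:

import torch

# Evaluate the kinematic layer at F = I; all three entries should vanish.
F = torch.eye(3)
C = F.T @ F
I3 = torch.det(C)
C_iso = C * I3 ** (-1.0 / 3.0)   # isochoric right Cauchy-Green tensor
I1 = torch.trace(C_iso)
I2 = 0.5 * (I1 ** 2 - torch.trace(C_iso @ C_iso))
J = torch.sqrt(I3)
K = torch.stack([I1 - 3, I2 ** 1.5 - 3 ** 1.5, (J - 1) ** 2])
print(K)  # all three entries are zero up to floating-point round-off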

Since the inputs to the inner network are themselves polyconvex, we require the inner network to be convex and monotonically increasing with respect to its inputs for the resulting model to be polyconvex. One such network is obtained from an input convex neural network, or ICNN (see the ICNN paper or the NN-EUCLID paper), by constraining the weights in the skip connections to be positive, in addition to the weights in the pass-through layers.

Overall, this leads to the NCM being defined as in the following listing, which is a snippet from the training.py file:

import torch
from torch import nn
from typing import Union, List, Callable, Optional


class ConvexLinear(nn.Module):

    def __init__(self,
                 size_in: int,
                 size_out: int,
                 use_bias=True):
        super(ConvexLinear, self).__init__()
        self.size_in: int = size_in
        self.size_out: int = size_out
        weights: torch.Tensor = torch.Tensor(size_out, size_in)
        self.weights = torch.nn.Parameter(weights)

        self.use_bias = use_bias

        if self.use_bias:
            self.bias = torch.nn.Parameter(torch.zeros(size_out))
        else:
            self.bias = None

        torch.nn.init.kaiming_uniform_(self.weights, a=5**0.5)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Softplus maps the raw weights to strictly positive values, so the layer
        # is convex and monotonically increasing in its input.
        result = torch.mm(x, torch.nn.functional.softplus(self.weights.t()))
        if self.bias is not None:
            result = result + self.bias

        return result


class MICNN(nn.Module):
    def __init__(self,
                 n_inputs: int,
                 n_outputs: int,
                 hidden_architecture: List[int] = [],
                 activation_function: Callable = torch.nn.functional.softplus):
        super(MICNN, self).__init__()

        self.n_inputs: int = n_inputs
        self.n_outputs: int = n_outputs
        self.activation_function: Callable = activation_function

        self.architecture = [self.n_inputs] + \
            hidden_architecture + [self.n_outputs]

        self.n_hidden_layers = len(hidden_architecture) - 1

        self.first_layer = ConvexLinear(self.n_inputs, self.architecture[1])

        # Pass-through layers between consecutive hidden layers.
        self.layers = torch.nn.ModuleList([ConvexLinear(hidden_architecture[i],
                                                        hidden_architecture[i+1])
                                           for i in range(self.n_hidden_layers)])
        self.last_layer = ConvexLinear(self.architecture[-2], self.n_outputs)

        # Skip connections from the input to each hidden layer; their weights are
        # also constrained to be positive, unlike in a standard ICNN.
        self.skip_layers = torch.nn.ModuleList([ConvexLinear(self.n_inputs,
                                                             hidden_architecture[i+1],
                                                             use_bias=False)
                                                for i in range(self.n_hidden_layers)])
        self.last_skip = ConvexLinear(self.n_inputs, self.n_outputs)

    def forward(self, x: torch.Tensor):
        z = self.first_layer(x)

        for layer, skip_layer in zip(self.layers, self.skip_layers):
            z = self.activation_function(layer(z) + skip_layer(x))

        z = self.last_layer(z) + self.last_skip(x)

        return z


class InvariantBasedNCM(nn.Module):
    def __init__(self,
                 inner_network: nn.Module):
        super(InvariantBasedNCM, self).__init__()
        self.inner_network: nn.Module = inner_network

    def get_K_from_C(self, C: torch.Tensor) -> torch.Tensor:
        # Symmetrise C to guard against round-off error.
        C = 0.5*(C + C.transpose(1, 2))

        I3 = torch.det(C)
        C = C*I3[:, None, None]**(-1/3)

        I1 = torch.sum(C[:, [0, 1, 2], [0, 1, 2]], dim=1)

        C2 = torch.bmm(C, C)
        I2 = 0.5*(I1**2 - torch.sum(C2[:, [0, 1, 2], [0, 1, 2]], dim=1))

        J = torch.sqrt(I3)

        # The kinematic layer: each entry is polyconvex and vanishes at F = I.
        return torch.stack([
            I1-3,
            I2 ** (3/2) - (3 ** (3/2)),
            (J-1) ** 2
        ],
            dim=1)

    @torch.jit.export
    def W_NN_from_C(self,
                    C: torch.Tensor,
                    structural_vectors: Optional[torch.Tensor] = None) -> torch.Tensor:
        return self.inner_network(self.get_K_from_C(C))

    @torch.jit.export
    def W_NN_from_F(self,
                    F: torch.Tensor,
                    structural_vectors: Optional[torch.Tensor] = None) -> torch.Tensor:
        C = torch.bmm(F.transpose(1, 2), F)
        return self.W_NN_from_C(C, structural_vectors)

    def forward(self, F: torch.Tensor):
        return self.W_NN_from_F(F)


if __name__ == "__main__":

    inner_network = MICNN(n_inputs=3,
                          n_outputs=1,
                          hidden_architecture=[16, 16])
    ncm = InvariantBasedNCM(inner_network)
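
Continuing from the listing above, a quick way to convince yourself that the softplus-constrained weights behave as intended is to numerically spot-check that the inner network is monotonically increasing in each of its inputs. This is a minimal sketch, not part of training.py:

# Perturb each input coordinate upwards and check the output never decreases.
x = torch.rand(10, 3)
y = inner_network(x)
for j in range(3):
    x_plus = x.clone()
    x_plus[:, j] += 0.1
    assert torch.all(inner_network(x_plus) >= y)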

Training the NCM

As a didactic example, we will train the NCM on synthetic data obtained from a Gent-Thomas material model. The Gent-Thomas model is implemented similarly to the Neohookean model in Defining material models using Pytorch, but with the strain energy density given by

\Psi(\mathbf{F}) =  c_1(\tilde{I}_1-3) + c_2\log(\tilde{I}_2/3) + \frac{1}{2}\kappa(\det(\mathbf{C})^{1/2}-1)^2
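
The GentThomas class used below lives in training.py. As a guide, a minimal sketch of what such an implementation can look like is given here, mirroring the invariant computations in InvariantBasedNCM.get_K_from_C; the parameter values c1, c2 and kappa are illustrative assumptions only and need not match those used in training.py.

class GentThomas(nn.Module):
    # A minimal sketch of the reference model; the material parameters below are
    # illustrative assumptions, not necessarily those used in training.py.
    def __init__(self, c1: float = 1.0, c2: float = 1.0, kappa: float = 10.0):
        super(GentThomas, self).__init__()
        self.c1, self.c2, self.kappa = c1, c2, kappa

    def W_NN_from_F(self, F: torch.Tensor) -> torch.Tensor:
        C = torch.bmm(F.transpose(1, 2), F)
        I3 = torch.det(C)
        C_iso = C * I3[:, None, None] ** (-1/3)
        I1 = torch.sum(C_iso[:, [0, 1, 2], [0, 1, 2]], dim=1)
        C2 = torch.bmm(C_iso, C_iso)
        I2 = 0.5 * (I1 ** 2 - torch.sum(C2[:, [0, 1, 2], [0, 1, 2]], dim=1))
        J = torch.sqrt(I3)
        return self.c1 * (I1 - 3) + self.c2 * torch.log(I2 / 3) \
            + 0.5 * self.kappa * (J - 1) ** 2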

We create some synthetic training data with the following code snippet.

def get_stress(ncm: Union[InvariantBasedNCM, GentThomas],
               F: torch.Tensor) -> torch.Tensor:
    # First Piola-Kirchhoff stress: derivative of the strain energy with respect to F.
    F = F.detach().requires_grad_(True)
    energy = ncm.W_NN_from_F(F)
    P = torch.autograd.grad(energy,
                            F,
                            torch.ones_like(energy),
                            create_graph=True)[0]
    return P


gent_thomas_model = GentThomas()


dim = 3

# 100 uniaxial-stretch deformation gradients with F_11 ranging from 0.8 to 1.2.
F_uni = torch.zeros(100, dim, dim)
for i in range(dim):
    F_uni[:, i, i] = 1

F_uni[:, 0, 0] = torch.linspace(.8, 1.2, F_uni.shape[0])


# Corrupt the reference stresses with multiplicative Gaussian noise.
noise_scale = 0.1
P_data = get_stress(gent_thomas_model, F_uni)
P_data_noisy = P_data + P_data*torch.randn(P_data.shape)*noise_scale

Finally, we instantiate, train, and export the NCM to torchscript, and plot its predictions against the data, with the following code snippet:

import matplotlib.pyplot as plt

inner_network = MICNN(n_inputs=3,
                      n_outputs=1,
                      hidden_architecture=[16, 16])
ncm = InvariantBasedNCM(inner_network)


optim = torch.optim.Adam(ncm.parameters(), lr=0.5)

# Supervised training: fit the NCM stresses to the noisy uniaxial data.
for i in range(4000):
    optim.zero_grad()
    P_pred = get_stress(ncm, F_uni)
    loss = torch.nn.functional.mse_loss(P_pred, P_data_noisy.detach())
    print(f"{i=}\t{loss.item()=}")
    loss.backward()
    optim.step()


# Export the trained NCM to torchscript so that COMMET can load it.
traced = torch.jit.trace(ncm, (F_uni, ))
traced.save("ncm.torchscript")

P_pred_uni = get_stress(ncm, F_uni)


plt.plot(F_uni[:, 0, 0].detach().cpu(), P_data[:, 0, 0].detach().cpu(), label="Ground truth")
plt.plot(F_uni[:, 0, 0].detach().cpu(), P_data_noisy[:, 0, 0].detach().cpu(), label="Noisy training data", marker='s', linewidth=0)
plt.plot(F_uni[:, 0, 0].detach().cpu(), P_pred_uni[:, 0, 0].detach().cpu(), label="Prediction")

plt.xlabel("$F_{11}$ [mm/mm]")
plt.ylabel("$P_{11}$ [kPa]")
plt.grid()
plt.legend()

The resulting behaviour is displayed alongside the data in the figure below.

[Figure: result.png — ground truth, noisy training data, and the NCM prediction for P_11 versus F_11.]
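
Once exported, it can be worth checking that the torchscript file loads and evaluates cleanly outside of the training script, since this is the file that COMMET will load at run time. The following is a minimal sketch, not part of training.py, which assumes ncm.torchscript sits in the current directory:

import torch

# Reload the exported model and evaluate the traced forward pass (the stored-energy
# density) on a small batch of identity deformation gradients.
loaded = torch.jit.load("ncm.torchscript")
F_test = torch.eye(3).repeat(5, 1, 1)
print(loaded(F_test).shape)  # torch.Size([5, 1])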

The input file

The input file is almost identical to that in Defining material models using Pytorch; all that changes is the path to the torchscript file: "path_to_torchscript":"ncm.torchscript".

Running the simulation

If you are using Docker, you can run the example by going to the directory containing the input file and running

$ docker run --rm -v ./:/data -w /data commetcode/commet_solve mpirun -n <n_procs_to_use> commet_solve a_first_ncm.jsonc

If, instead, you have built COMMET locally and it is in your path, you can run

$ mpirun -np <n_procs_to_use> commet_solve a_first_ncm.jsonc