Next: About this document ...
Parallelization in PUMA
- Implements implicit time integration schemes on distributed
memory computers.
- Each compute node reads its own portion
of the grid file at startup.
- Cells are divided among the active compute nodes at runtime
based on cell ID (shown in figure)
and only faces associated with local cells are read.
- Faces on the interface surface between adjacent computational
domains are duplicated in both domains. Fluxes through these faces
are computed in both domains.
- Solution variables are communicated between domains at every
timestep which ensures that the computed solution is independent
of the number of compute nodes.
- Communication of the solution across domains is all that is
required for first-order spatial accuracy, since QL and QR
are simply cell averages to the first order.
- If the left and right states are computed to higher-order, then
QL and QR are shared explicitly with all adjacent domains. The
fluxes through each face are then computed in each domain to obtain
the residual for each local cell.
Anirudh Modi
2/26/1998