DNDSR

Executable	Model
`euler` / `euler3D`	Compressible Navier–Stokes
`eulerSA` / `eulerSA3D`	Spalart–Allmaras RANS (IDDES)
`euler2EQ` / `euler2EQ3D`	k-ω two-equation RANS
`eulerEX` / `eulerEX3D`	Reactive / multi-species

Module	Directory	Role	LoC
DNDS	`src/DNDS`	MPI arrays, serialization (JSON + HDF5), profiling, CUDA, config	large
Geom	`src/Geom`	Unstructured mesh, CGNS I/O, Metis/ParMetis partitioning	large
CFV	`src/CFV`	Compact Finite Volume, Variational Reconstruction, limiters	medium
Euler	`src/Euler`	Compressible N-S, SA, k-ω, dual-time orchestration	large
EulerP	`src/EulerP`	Alternative CUDA-optimized evaluator	medium
Solver	`src/Solver`	ODE integrators + Krylov — header-only	small

`_row_size`	`_row_max`	Layout	Use case
`>= 0`	—	`TABLE_StaticFixed`	Cell volume (1 real), Euler state (5 reals)
`DynamicSize`	—	`TABLE_Fixed`	VR coefficients (order decided at runtime)
`NonUniformSize`	`>= 0`	`TABLE_StaticMax`	Per-face node counts for a single element type
`NonUniformSize`	`DynamicSize`	`TABLE_Max`	Padded variable rows, runtime max
`NonUniformSize`	`NonUniformSize`	`CSR`	`cell2node`, `cell2cell`, wide-stencil adjacency

Type	`operator[](i)` returns	Use
`ArrayAdjacency<rs, rm>`	`AdjacencyRow` — lightweight span	mesh topology (`cell2node`, …)
`ArrayEigenVector<N>`	`Eigen::Map<Vector<real, N>>`	node coordinates (`coords`)
`ArrayEigenMatrix<M, N>`	`Eigen::Map<Matrix<real, M, N>>`	per-cell Jacobians, gradients
`ArrayEigenUniMatrixBatch<M, N>`	`j`-th matrix of a per-row batch	quadrature-point data

Alias	Purpose
`ArrayAdjacencyPair<rs, rm>`	mesh connectivity
`ArrayEigenVectorPair<N>`	coords
`ArrayEigenMatrixPair<M, N>`	per-entity matrices
`ArrayEigenUniMatrixBatchPair<M,N>`	quadrature data

Layer	File	State-aware?
DSL	`MeshConnectivity.hpp`
Checked wrappers	`MeshConnectivity_StateChecked.hpp`	asserts `idx.state()`
`UnstructuredMesh`	`Mesh.cpp`	owns `AdjPairTracked` members

Primitive	Signature	What it does
`Inverse<cone_rs>`	`(cone, nToLocal, mpi, fromL2G, toL2G, toGlobalMap) → tAdjPair`	A→B cone to B→A support, MPI push-back
`Compose<rs_AB, rs_BC, out_rs>`	`(AB, BC, ...) → tAdjPair`	A→B ∘ B→C → A→C
`ComposeFiltered`	`... pred, matchExtra=nullptr`	Compose with `SharedCountPredicate` filter
`Interpolate<p2n_rs>`	`(parent2node, SubEntityQuery, nParent, nNode, mpi)`	Local-only sub-entity extraction
`InterpolateGlobal<p2n_rs, e2p_rs>`		N-parent distributed interpolation with pbi-aware dedup
`evaluateGhostTree`	`(tree, mpi) → GhostResult`	BFS ghost evaluation

Mode	Meaning
`Unknown`	Auto-detect from `rank_offsets`
`Parts`	`MPI_Scan` over local sizes
`One`	Rank 0 owns the whole dataset
`EvenSplit`	Read-time split into `~N/np`
(explicit)	`isDist()` → `true`; `{localSize, globalStart}`

Variant	Entropy-fix / eigenvalue scheme
`Roe`	standard Roe + Harten–Yee
`Roe_M1`	cLLF (central + Local Lax–Friedrichs)
`Roe_M2`	Lax–Friedrichs
`Roe_M3`	LD Roe (low-dissipation)
`Roe_M4`	ID Roe (intermediate dissipation)
`Roe_M5`	LD cLLF
`Roe_M6`	H-correction only
`Roe_M7`	Harten–Yee only, no H-correction
`Roe_M8`	H-correction + Harten–Yee
`Roe_M9`	Reserved (eigScheme 9, currently asserts false)
`HLLC`	Harten–Lax–van Leer–Contact
`HLLEP`	HLLE with pressure fix
`HLLEP_V1`	HLLEP variant 1

`odeCode`	Class	Scheme
`103`	`ImplicitEulerDualTimeStep`	Backward Euler
`0`	`ImplicitBDFDualTimeStep`	BDF2 / BDF-k
—	`ImplicitVBDFDualTimeStep`	Variable-step BDF-k
`1`	`ImplicitSDIRK4DualTimeStep` (`schemeCode` 0…4)	SDIRK-4 · ESDIRK2/3 · Trapezoidal
`101`	(alias for `1`)	(backward-compat `odeCode`)
`401`	`ImplicitHermite3SimpleJacobianDualStep`	HM3 + p-Multigrid
`2`	`ExplicitSSPRK3TimeStepAsImplicitDualTimeStep`	SSP-RK3

Layer	Responsibility
`SerializerBase`	Abstract scalar / vector / byte-array interface
`SerializerH5`	MPI-parallel HDF5 (collective I/O)
`SerializerJSON`	Per-rank JSON (`IsPerRank() == true`), no MPI coordination
`Array`	Per-array metadata, structure tags, flat data buffer
`ParArray`	Global offsets, `EvenSplit`, CSR global row-starts
`ArrayPair`	Father-son bundle · `ReadSerializeRedistributed`
`ArrayRedistributor`	Rendezvous redistribution via `ArrayTransformer`

Sentinel	Meaning
`Unknown`	Auto-detect from companion `rank_offsets` dataset
`Parts`	Compute offset via `MPI_Scan` over local sizes
`One`	Rank 0 writes / reads the whole dataset
`EvenSplit`	Read: each rank gets `~nGlobal / nRanks` rows
`isDist()`	Explicit `{localSize, globalStart}`

DNDSR — CFD Research Code

`Configuration` — everything that tunes a run

Every sub-section uses DNDS_DECLARE_CONFIG so the full JSON schema is auto-generated.

TimeMarchControl — dtImplicit, nTimeStep, steadyQuit, useRestart, useImplicitPP, odeCode, odeSetting1..4, odeSettingsExtra (opaque JSON), dtCFLLimitScale, …
ImplicitReconstructionControl — useExplicit, nInternalRecStep, recLinearScheme (0 = SOR, 1 = GMRES), nGmresSpace/Iter, fpcgReset*, recThreshold.
OutputControl — outputIntervalStep, outputFormat (VTK, PLT, VTKHDF, series), parallel vs serial write.
CFLControl — initial / max CFL, ramping schedule.

ConvergenceControl — residual thresholds, monitor variables.
DataIOControl — read/write paths, restart checkpointing.
BoundaryDefinition — per-face-zone BC types, free-stream state.
LimiterControl — limiterProcedure, usePPRecLimiter, WBAP order.
LinearSolverControl — gmresCode, Krylov sub-space, iterations.
TimeAverageControl — long-time averaging for statistics.
EvaluatorSettings wraps EulerEvaluatorSettings<model>.
VFVSettings wraps VRSettings.

--emit-schema dumps the entire tree as a single JSON Schema document — euler_schema.json / eulerSA3D_schema.json / etc., each ~107 KB.

BC	Use
`BCWall`	No-slip wall (adiabatic)
`BCWallIsothermal`	No-slip wall at fixed temperature
`BCWallInvis`	Slip / symmetry
`BCSym`	Explicit symmetry plane
`BCFarField`	Riemann-invariant farfield
`BCIn`	Specified inflow
`BCOut` / `BCOutP`	Specified outflow / pressure-outflow
`BCPeriodic`	Standard periodic
`BCPeriodicRot`	Rotating periodic (turbomachinery)
`BCProfileIn`	Tabulated profile (boundary layer, RANS)
`BCActuator`	Actuator disk source term

Module	C++ executables	test cases	Python tests	np values
DNDS	8	249	9	1, 2, 4, 8
Geom	9	193	2	1, 2, 4, 8
CFV	4	67	43	1, 2, 4, 8
Euler	4	62	4	1, 2, 4, 8
Solver	4	29	—	1

Trigger	Time
No-op rebuild	< 1 s
Markdown-only edit	~10 s
Full (Doxygen + Sphinx)	~2.5 min

Series	Solver
BSSCA	DNDSR /BSSCA	64 → 10240 ranks
BSSCT	DNDSR /BSSCA	96 → 1920 ranks
CS	DNDSR /JS	32 → 256 ranks

DNDSR

A C++17 / Python CFD Research Code

Opening

Motivation · feature set · positioning

Why another CFD code?

DNDSR at a glance

Capabilities

Solver executables

Project shape in numbers

Code

Tests

Docs

The one-slide map

From zero to a running solver in six commands

Architecture

Arrays, MPI, ghosts, state machines

Six modules — responsibilities

Delayed abstraction ⇒ independent comm patterns

Array<T, rs, rm> — five layouts in one template

CSR has two internal modes

Decompressed mode

Compressed mode

ArrayView

ArrayTransformer — anatomy

Father / son addressing

Typed wrappers: ArrayDerived

ArrayPair<TArray> — the convenience bundle

Common type aliases

ArrayDof — the solver's vector space

Operations (CPU + CUDA specializations)

CFV aliases

Host / CUDA dispatch for DOF ops

State-tracked mesh adjacency (1 / 2)

State-tracked mesh adjacency (2 / 2)

AdjIndexInfo — private state + target map

AdjPairTracked<TPair>

Geometry pipeline

Elements · mesh build · ghosts · DSL

Supported elements — O1 / O2 pairs

UnstructuredMesh — what it owns

Mesh build pipeline — end-to-end

Partitioning — PartitionOptions

Two partitioners

Determinism

Ordering

The ghost specification DSL — types

The ghost DSL — compile & evaluate

Evaluator pseudocode (BFS per level)

GhostResult

DSL primitives on MeshConnectivity

SharedCountPredicate

Adjacency registry

Order elevation & bisection

O1 → O2 elevation

O2 → O1 bisection

Wall-distance computation

Options

Strategies

Cross-np restart

Offset sentinels

ReadSerializeRedistributed — three cases

Numerics

CFV · VR · flux · limiters · ODE · Krylov

Compact Finite Volume — the reconstruction

Variational Reconstruction — the functional

VR setup — the three Construct* calls

What ConstructMetrics builds

What ConstructBaseAndWeight builds

What ConstructRecCoeff builds

FiniteVolume — the metric cache

13 Riemann solvers

RoePreamble — the shared middle

Flux signature

Why this factoring

Limiters — the FWBAP L2 family

Multi-way (≥ 2 directions)

Biway (pair)

VR's own limiter — WBAP with characteristic transform

Flow

Smoothness indicators

`Array<T, rs, rm>` — five layouts in one template

`ArrayTransformer` — anatomy

Typed wrappers: `ArrayDerived`

`ArrayPair<TArray>` — the convenience bundle

`ArrayDof` — the solver's vector space

`AdjIndexInfo` — private state + target map

`AdjPairTracked<TPair>`

`UnstructuredMesh` — what it owns

Partitioning — `PartitionOptions`

`GhostResult`

DSL primitives on `MeshConnectivity`

`SharedCountPredicate`

Cross-`np` restart

`ReadSerializeRedistributed` — three cases

VR setup — the three `Construct*` calls

What `ConstructMetrics` builds

What `ConstructBaseAndWeight` builds

What `ConstructRecCoeff` builds

`FiniteVolume` — the metric cache

`RoePreamble` — the shared middle

Other `SDIRK4` codes

`HIndexed` — default

`InSituPack`

`BorrowGGIndexing` — avoid collective setup twice

CUDA path — `DeviceTransferable` CRTP

`SerializerBase` — the public interface

`ArrayGlobalOffset` — five offset modes

Typed JSON configs — `DNDS_DECLARE_CONFIG`

Model enum (`EulerModel`)

`EulerSolver` — the top-level conductor

`Configuration` — everything that tunes a run

`EulerEvaluator<model>` — the spatial operator