The underlying space tree

Every node in a GoFish scenegraph carries two pieces of information about its spatial structure: the kind of data space the node has established on each of its two axes (x and y), and any per-axis Monotonic that captures how visual size depends on a scale factor. Together these form an intermediate representation called the underlying space tree.

The data structure lives in src/ast/underlyingSpace.ts. The traversal that builds it is resolveUnderlyingSpace() in _node.ts. Layout, axis rendering, posScale construction, and ordinal scale building all consume the tree afterwards.

This doc explains what the tree is, why it exists, what each space kind means, and where to look in the code. If you're adding an operator that introduces or transforms an axis, this is the abstraction you're working with.

What and why, in brief

A data-driven graphic maps data space to visual space. Typically data space is described by a data schema like {lake: string, count: number}. Visual space is typically described using shapes and screen positions (i.e., SVG or Canvas attributes).

Most of the logic in GoFish lives in between data and visual space, for example computing scales and performing layout. The underlying space tree keeps that logic organized. Here are some of the questions about a graphic that it helps us answer:

If we overlay a scatterplot and a line chart in the same region of the screen (such as drawing a regression line), what should the axis domains be? What about when the two charts have different data spaces on one axis (like in a dual axis chart)?
If we draw a bar chart with vertically centered bars, what is the y-axis?
If we create faceted chart regions, how should those faceted regions relate to each other?
What if an operator arranges shapes in free space, but those objects have data-driven sizes that need to be scaled to fit the available screen space? (As when using the spread operator.)

In all of these cases, we have some information about data spaces and their encodings to positions and sizes of shapes. Operators compose this information together to create more complex relationships between data and visual space. Underlying space keeps track of this information explicitly so that we can more easily write algorithms that resolve scales and draw axes. For example, to resolve scale domains in the case of the overlaid scatterplot and line chart, we first have to determine whether the two charts' domains can be merged and then we can merge the domains. This information is later used to draw axes for the combined chart. We need to store intermediate results about these domains, and that's basically the role of the underlying space data structure.

Why an explicit IR

Conventional grammars of graphics treat a scale as a function from a data domain to a visual range. Quantitative x-scale: [30, 50] mpg → [0, 100] px. Color scale: species name → palette entry. Convenient — but too unstructured. If scales are arbitrary functions, the system can change their domains and ranges freely, slot them in anywhere, and inference doesn't know which combinations are meaningful.

In practice every visualization system relies on stronger invariants than "function from domain to range" can express. Domains can be merged only when they're compatible. Spatial continuous ranges aren't independent parameters at all — they're derived from available layout space. Some extents have meaningful origins; others only have meaningful differences. Some operators glue subspaces together; others separate them. Coordinate transforms preserve, warp, or erase parts of the underlying structure.

Discrete position scales make the mismatch concrete. D3 and Vega-Lite use point and band scales to handle categorical positions. Operationally, a band scale gives each category a continuous position together with a uniform bandwidth, so the scale abstraction is already carrying layout information indirectly. It also breaks down for bar-like charts whose elements have different widths, because the allocation of space is no longer a uniform function of category.

This kind of richer semantics shows up in the implementation of every serious grammar system, even when it isn't reified:

Vega-Lite parses each child view recursively, assigns scale-resolution policies (shared vs independent), and conditionally merges child scale components when their types are compatible. Compatibility groups several scale types together (e.g. temporal + ordinal-position). The merged result is a flat record keyed by channel — the tree structure of view composition guides merging, then disappears.
Observable Plot distributes inference across channels (fill, stroke, opacity, symbol first infer which named scale they should use), a scale-name registry, scale-type inference (using user-specified types, mark-imposed channel types, explicit domains, channel values, color schemes, special defaults like r getting a sqrt scale), domain-union inference, and range inference that depends on both domain and scale kind. Modular, but no single spatial IR owns the accumulated semantics — Plot's stack transform, for example, rewrites a length channel into y1/y2 so they can later participate in ordinary scale inference.

Each piece can be clean in isolation, but without an explicit source of truth for the inferred spatial semantics, scale and domain facts have to be passed around and reconstructed across the implementation. That's particularly limiting in GoFish, where users define new operators and new spaces — not just new marks inside a fixed scale-resolution pipeline.

GoFish's solution is to give the inference an explicit shared data structure to contribute to. Marks introduce local spatial facts; operators merge or separate them; coordinate transforms annotate them; and later passes consume the tree for layout, scale construction, and guide generation.

The five space kinds

Each axis (x and y) of each node carries one of:

kind	meaning	guide interpretation	example source
`position`	absolute positions are meaningful; the space carries an interval domain	conventional quantitative axis (distances and positions both work)	scatterplot x-position, y-axis of a stacked bar chart
`difference`	relative differences meaningful; absolute positions not	magnitude guide; an axis with an arbitrary zero would be misleading	a streamgraph after baseline shifting
`size`	data-driven extent, not yet placed in a shared position space	legend / measurement guide; a position axis is premature	a bar's height before stacking
`ordinal`	discrete keys; layout will assign positions	labels at laid-out keys; no continuous baseline necessarily implied	bars separated by category, facets
`undefined`	no data-space contribution on this axis	no guide	a purely aesthetic dimension or a decorative literal-pixel rect

These kinds map closely to Stevens's levels of measurement (nominal, ordinal, interval, ratio), which is probably not a coincidence, but the exact relationship isn't clear yet. They deliberately separate facts that a scale-as-function model collapses. size and position may both eventually use numeric values and continuous mappings, but they mean different things: size is an unplaced extent; position is an extent embedded in a shared coordinate space. ordinal isn't "a band scale"; it's a statement that the values are discrete keys whose spatial allocation is the responsibility of layout.

A few additional notes on the individual kinds:

POSITION represents data-driven positions. Each position space has a domain (interval) that maps data values to screen positions.
DIFFERENCE represents spaces where differences/distances are meaningful, but absolute locations are not. This is a weakening of POSITION — once a space is DIFFERENCE, it cannot be converted back to POSITION. (Speculative: DIFFERENCE may be aesthetic position + data-driven size, whereas POSITION is data-driven position. This is not yet confirmed and should not be used for implementation.)
SIZE represents shapes with data-driven sizes but undetermined positions. SIZE tracks a single numeric value (which can be negative, e.g., for negative bars). Unlike DIFFERENCE, SIZE spaces can be merged into POSITION spaces when alignment is determined (e.g., when bars are aligned to a baseline). Example: individual bars in a bar chart have SIZE, but the stack operator merges them into POSITION space for baseline alignment.
ORDINAL represents nominal/ordinal spaces where relative positions (above, below, left of, right of) are meaningful, but the distances between them are not quantitatively meaningful.
UNDEFINED represents spaces with no data-driven information.

The data definitions:

// underlyingSpace.ts
export type POSITION_TYPE   = { kind: "position";   domain: Interval; ... };
export type DIFFERENCE_TYPE = { kind: "difference"; width: number;   ... };
export type SIZE_TYPE       = { kind: "size";       domain: Monotonic; ... };
export type ORDINAL_TYPE    = { kind: "ordinal";    domain?: string[]; ... };
export type UNDEFINED_TYPE  = { kind: "undefined";  ... };

SIZE_TYPE.domain is a Monotonic (util/monotonic.ts) — a function that describes how the visual extent depends on a scale factor. For a data-bound rect (rect({ h: "count" })), each rect emits SIZE(Monotonic.linear(value, 0)). Operators compose them (Monotonic.add, Monotonic.adds(spacing), Monotonic.smul(scale), Monotonic.max). At layout time, a parent that needs a shared scale factor calls space.domain.inverse(canvas_size) to solve for the scale factor that makes the subtree fit.
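
As a rough illustration of the composition and inversion described above, here is a minimal sketch. It does not use the real util/monotonic.ts API: the type Mono and the helpers monoLinear, monoAdd, monoAdds, monoMax, and monoInverse are hypothetical stand-ins, the (slope, intercept) reading of linear is an assumption, and bisection is just one simple way to solve for the scale factor.

```typescript
// Hypothetical stand-in for the Monotonic algebra; the real API in
// util/monotonic.ts may differ. A Mono is a non-decreasing function
// from a scale factor to a visual extent in pixels.
type Mono = (scaleFactor: number) => number;

// Assumed reading of linear(slope, intercept): extent = slope * s + intercept.
const monoLinear = (slope: number, intercept: number): Mono =>
  (s) => slope * s + intercept;

// Sum of two extents (two shapes laid end to end).
const monoAdd = (a: Mono, b: Mono): Mono => (s) => a(s) + b(s);

// Add a constant number of pixels (spacing that does not scale).
const monoAdds = (a: Mono, pixels: number): Mono => (s) => a(s) + pixels;

// Larger of two extents (shapes overlaid in the same region).
const monoMax = (a: Mono, b: Mono): Mono => (s) => Math.max(a(s), b(s));

// Solve m(s) = target by bisection. Assumes m is non-decreasing; if even
// scale factor 0 overshoots the target, the result approaches 0.
function monoInverse(m: Mono, target: number): number {
  let lo = 0;
  let hi = 1;
  while (m(hi) < target && hi < 1e12) hi *= 2; // grow the bracket
  for (let i = 0; i < 100; i++) {
    const mid = (lo + hi) / 2;
    if (m(mid) < target) lo = mid;
    else hi = mid;
  }
  return (lo + hi) / 2;
}

// Three data-bound bars (10, 20, 30) spread with 8px gaps between them.
const bars = [10, 20, 30].map((v) => monoLinear(v, 0));
const row = monoAdds(bars.reduce(monoAdd), 2 * 8);

// Fit the row into 316px: 60 * s + 16 = 316, so s is 5.
const fitted = monoInverse(row, 316);
console.log(fitted.toFixed(3)); // 5.000
```

Representing a Mono as a plain function keeps max and other non-linear compositions easy to express, at the cost of a numeric rather than closed-form inverse.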

The contract

Each node implements _resolveUnderlyingSpace:

type ResolveUnderlyingSpace = (
  childSpaces: Size<UnderlyingSpace>[], // one [x, y] tuple per child
  childNodes: GoFishAST[],
  shared: Size<boolean> // [shared on x, shared on y]
) => FancySize<UnderlyingSpace>;

The function returns the node's own [xSpace, ySpace], computed bottom-up from the already-resolved child spaces. The traversal is driven by resolveUnderlyingSpace() in _node.ts and is memoized.
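
The bottom-up, memoized traversal can be sketched as follows. This is an illustration under simplifying assumptions, not the code in _node.ts: SketchNode, ResolveOwn, and the kind-only spaces are hypothetical, and the real resolver also receives the child nodes and the shared flags.

```typescript
// Hypothetical, simplified node. Spaces are reduced to their kind only.
type SpaceKind = "position" | "difference" | "size" | "ordinal" | "undefined";
type XY<T> = [T, T];
type ResolveOwn = (childSpaces: XY<SpaceKind>[]) => XY<SpaceKind>;

class SketchNode {
  readonly children: SketchNode[];
  private readonly resolveOwn: ResolveOwn;
  private cached: XY<SpaceKind> | undefined = undefined;
  resolveCalls = 0; // counts how often the node-specific resolver runs

  constructor(children: SketchNode[], resolveOwn: ResolveOwn) {
    this.children = children;
    this.resolveOwn = resolveOwn;
  }

  // Driver: resolve children first, then this node, and cache the result.
  resolveUnderlyingSpace(): XY<SpaceKind> {
    if (this.cached === undefined) {
      const childSpaces = this.children.map((c) => c.resolveUnderlyingSpace());
      this.resolveCalls += 1;
      this.cached = this.resolveOwn(childSpaces);
    }
    return this.cached;
  }
}

// A leaf with a data-driven height, and a stack-like parent that reports
// position on y when every child is size on y.
const leafNode = new SketchNode([], () => ["undefined", "size"]);
const parentNode = new SketchNode([leafNode, leafNode], (kids) => [
  "undefined",
  kids.every(([, y]) => y === "size") ? "position" : "undefined",
]);

const firstResult = parentNode.resolveUnderlyingSpace();
parentNode.resolveUnderlyingSpace(); // second call hits the cache
console.log(firstResult.join(","), leafNode.resolveCalls, parentNode.resolveCalls);
// undefined,position 1 1
```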

Three patterns cover most operators:

Leaf shapes (rect, ellipse, petal, text, image) decide the kind from their props. A rect with data-bound h emits SIZE(Monotonic.linear(value, 0)) on y; the same rect with literal y and y2 emits POSITION([y, y2]). Constants (no data-bound dim) emit UNDEFINED — the literal pixel value is handled at layout time by computeAesthetic, not via the underlying-space tree.

Compositional operators (spread, stack, layer, enclose) combine children's spaces. spread({ glue: false }) keeps SIZE composition along the stack direction so a parent can solve for shared scale factors via Monotonic.inverse. spread({ glue: true }) (i.e. stack) sums children's SIZE values into a POSITION([0, sum]) — the operator commits the data-driven extents to a positional axis. layer and overlay-style operators use unionChildSpaces (alignment.ts), which preserves SIZE when every child is SIZE and otherwise unions intervals.

Coordinate-transform operators (coord) annotate the resulting space with the transform that will later map underlying positions to display positions, but otherwise pass the kind through.

Worked example: stacked bar chart

Chart(seafood)
  .flow(spread({ by: "lake", dir: "x" }), stack({ by: "species", dir: "y" }))
  .mark(rect({ h: "count", fill: "species" }));

Each rect starts with a data-driven height and no data-driven y position: [UNDEFINED, SIZE(Monotonic.linear(count, 0))].

The vertical stack (which is spread({ glue: true, dir: "y" })) glues each lake's species rects together. Its stack-direction children are all-SIZE, so it sums their domains at scale 1 and emits POSITION([0, total_lake_sum]) on y. The alignment direction (x) of the stack is UNDEFINED because each rect's x is UNDEFINED.

The horizontal spread separates lakes. Its children are now stacks with [UNDEFINED on x, POSITION([0, total]) on y]. Stack direction (x): no children are SIZE, but they're named (the "by" key produces lake keys) → ORDINAL(["Lake A", ..., "Lake F"]). Alignment direction (y): all children are POSITION → POSITION(unionAll([0, total_i])) = POSITION([0, max_total]).

So the root underlying space is [ORDINAL(lakes), POSITION([0, max_total])]. The y-axis renders quantitative ticks (POSITION); the x-axis renders ordinal labels at laid-out positions (ORDINAL); both follow from the tree, with no special "bar chart" rule.

The stack's size → position transition is the important step. A single rect with a data-driven height doesn't by itself establish where that height lives in a shared coordinate system — it only says it has a quantitative extent. The stack gives those extents a common origin and glues them edge-to-edge, producing a position space from zero to the bar total. The spread doesn't glue; it separates.
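
The walk-through above can be replayed in a few lines. The sketch below is a simplification under stated assumptions: Space, stackY, and spreadX are hypothetical names, SIZE carries a plain number instead of a Monotonic, and the counts are made up for illustration.

```typescript
// Simplified, hypothetical spaces: SIZE carries just the data value here,
// where the real SIZE_TYPE carries a Monotonic.
type Span = [number, number];
type Space =
  | { kind: "position"; domain: Span }
  | { kind: "size"; value: number }
  | { kind: "ordinal"; domain: string[] }
  | { kind: "undefined" };

const UNDEF: Space = { kind: "undefined" };

// stack (glued spread) along y: all-SIZE children sum into POSITION([0, sum]).
function stackY(children: [Space, Space][]): [Space, Space] {
  const ys = children.map(([, y]) => y);
  const total = ys.reduce((acc, y) => acc + (y.kind === "size" ? y.value : 0), 0);
  const allSize = ys.every((y) => y.kind === "size");
  return [UNDEF, allSize ? { kind: "position", domain: [0, total] } : UNDEF];
}

// spread along x: keyed children give ORDINAL on x; POSITION children
// union their intervals on y.
function spreadX(keys: string[], children: [Space, Space][]): [Space, Space] {
  let lo = Infinity;
  let hi = -Infinity;
  for (const [, y] of children) {
    if (y.kind === "position") {
      lo = Math.min(lo, y.domain[0]);
      hi = Math.max(hi, y.domain[1]);
    }
  }
  const ySpace: Space = lo <= hi ? { kind: "position", domain: [lo, hi] } : UNDEF;
  return [{ kind: "ordinal", domain: keys }, ySpace];
}

// Made-up counts per lake and species, for illustration only.
const lakes = [
  { lake: "Lake A", counts: [3, 5] },
  { lake: "Lake B", counts: [10, 2, 4] },
];

const stacks = lakes.map((l) =>
  stackY(l.counts.map((count): [Space, Space] => [UNDEF, { kind: "size", value: count }])),
);
const rootSpace = spreadX(lakes.map((l) => l.lake), stacks);
console.log(JSON.stringify(rootSpace));
// [{"kind":"ordinal","domain":["Lake A","Lake B"]},{"kind":"position","domain":[0,16]}]
```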

Size resolution

To map data to screen space, we need to figure out how to scale it to fit. As a rule of thumb, we want all of underlying space to be visible. As a consequence, bar charts should never be truncated, because each bar is fully embedded in the underlying space. On the other hand, a scatterplot's points may be truncated on the edges of the frame since their sizes are not embedded in the underlying space of the graphic.

Continuous space resolution. For position and difference spaces, we are basically mapping some interval of minimum and maximum values to available physical space. This can be performed by a traditional scale function. For now, we assume these scales are always linear and lean on data pre-processing and coordinate transforms to introduce non-linearities.
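
For the continuous case, a linear scale is all that is needed. The following is a minimal sketch, not the project's computePosScale: linearPosScale and Extent are hypothetical names, and mapping onto [0, pixels] is an assumption.

```typescript
// Hypothetical helper; the real computePosScale may differ in shape.
type Extent = [number, number];

// Map a data interval linearly onto [0, pixels].
function linearPosScale(domain: Extent, pixels: number): (value: number) => number {
  const [lo, hi] = domain;
  const width = hi - lo;
  // A degenerate interval has no slope; place everything at the start.
  if (width === 0) return () => 0;
  return (value) => ((value - lo) / width) * pixels;
}

// The [30, 50] mpg to [0, 100] px example from earlier in this doc.
const mpgToPx = linearPosScale([30, 50], 100);
console.log(mpgToPx(30), mpgToPx(40), mpgToPx(50)); // 0 50 100
```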

Discrete space resolution. Layouts like spread arrange shapes using pixel-based spacing (for example, 8 pixels between bars), so we can't compute a scale function right away. Instead, we assume we are looking for a single linear scale factor (the data may already have been transformed by a non-linear function before this). We build a function from a candidate scale factor to the total output size that results from using it, then solve for the scale factor whose output size fits the available space.

A shape can have three kinds of sizes:

fixed (e.g., rect({w: 10}))
inferred (e.g., rect({w: undefined}))
data-driven (e.g., rect({w: 'foo'}))

These correspond to three kinds of intrinsic sizes:

fixed: constant, non-zero size, no dependency on scale factor
inferred: constant, zero size, no dependency on scale factor (this seems a bit weird and may be changed later)
data-driven: size depends on scale factor

In truth, data-driven sizes seem to act like the inferred case as well, because they can take on any size given to them (although they sometimes have a minimum size, such as a spread operator where even if the shapes have 0 size, the spacing between the shapes yields some minimum overall size).
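
When every intrinsic size is linear in the scale factor, the solve has a closed form. The sketch below illustrates this under assumptions: SizeSpec, intrinsic, and solveScaleFactor are hypothetical names, and data-driven values are assumed to be already looked up as numbers.

```typescript
// Hypothetical size spec mirroring the three cases in the text.
type SizeSpec =
  | { type: "fixed"; pixels: number } // rect({ w: 10 })
  | { type: "inferred" } // rect({ w: undefined })
  | { type: "data"; value: number }; // rect({ w: "foo" }), value already looked up

// Intrinsic size written as slope * scaleFactor + constant.
function intrinsic(spec: SizeSpec): { slope: number; constant: number } {
  switch (spec.type) {
    case "fixed":
      return { slope: 0, constant: spec.pixels };
    case "inferred":
      return { slope: 0, constant: 0 };
    case "data":
      return { slope: spec.value, constant: 0 };
  }
}

// Solve for the scale factor that makes a spaced row fill `available` pixels.
function solveScaleFactor(
  specs: SizeSpec[],
  spacing: number,
  available: number,
): number | undefined {
  const parts = specs.map(intrinsic);
  const slope = parts.reduce((acc, p) => acc + p.slope, 0);
  const constant =
    parts.reduce((acc, p) => acc + p.constant, 0) +
    spacing * Math.max(0, specs.length - 1);
  // With no data-driven child there is nothing for a scale factor to change.
  if (slope === 0) return undefined;
  return (available - constant) / slope;
}

const rowSpecs: SizeSpec[] = [
  { type: "data", value: 4 },
  { type: "fixed", pixels: 10 },
  { type: "data", value: 6 },
];
// 10 * s + 10 + 2 * 8 = 126, so s is 10.
console.log(solveScaleFactor(rowSpecs, 8, 126)); // 10
```

The constant term is the minimum overall size mentioned above: even if the data-driven shapes shrink to zero, the fixed sizes and the spacing remain.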

Layout dispatch

After resolveUnderlyingSpace, layout proceeds on the principle that SIZE space drives Monotonic composition; POSITION space drives position scales. The two pipelines are mutually exclusive on a per-node per-axis basis:

gofish.tsx (root):
  if root[axis].kind === "position"  → build a posScale via computePosScale
  if root[axis].kind === "size"      → invert the Monotonic against the canvas
                                       to seed the root scale factor
  pass both downward as (scaleFactors, posScales)

spread.layout (each spread/stack node):
  if shared[axis]:
    if myUSpace[axis].kind === "size"       → space.domain.inverse(size[axis])
    if myUSpace[axis].kind === "position"   → size[axis] / Interval.width(domain)
    if myUSpace[axis].kind === "difference" → size[axis] / space.width
    else → undefined (ORDINAL/UNDEFINED don't need a continuous scale factor)
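
The per-node branch of this dispatch might look roughly like the following in TypeScript. It is a sketch rather than the code in spread.tsx: LayoutSpace and computeScaleFactorSketch are hypothetical, SIZE carries a ready-made inverse function in place of a Monotonic, and returning undefined when the axis is not shared is an assumption.

```typescript
type NumInterval = [number, number];

// Hypothetical, simplified spaces for illustration.
type LayoutSpace =
  | { kind: "position"; domain: NumInterval }
  | { kind: "difference"; width: number }
  | { kind: "size"; inverse: (pixels: number) => number }
  | { kind: "ordinal" }
  | { kind: "undefined" };

// Scale factor for one axis of one node, given the pixels available to it.
function computeScaleFactorSketch(
  space: LayoutSpace,
  shared: boolean,
  pixels: number,
): number | undefined {
  if (!shared) return undefined; // assumption: nothing to compute when not shared
  switch (space.kind) {
    case "size":
      return space.inverse(pixels); // solve the Monotonic for this extent
    case "position":
      return pixels / (space.domain[1] - space.domain[0]);
    case "difference":
      return pixels / space.width;
    default:
      return undefined; // ORDINAL and UNDEFINED need no continuous factor
  }
}

console.log(computeScaleFactorSketch({ kind: "position", domain: [0, 50] }, true, 200)); // 4
console.log(computeScaleFactorSketch({ kind: "ordinal" }, true, 200)); // undefined
```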

Leaf shapes never need to compute their own scale factors — they receive them via the scaleFactors parameter and apply them in computeSize.

This dispatch is the practical embodiment of the underlying-space-kind distinction. It also happens to make the rendering pipeline more readable: once you know the kind, you know which arithmetic applies.

Axis inference

Conceptually, axis inference splits into two independent questions:

1. What guide could this space support? Answered by the kind. POSITION permits a quantitative axis. ORDINAL permits labels at laid-out keys. DIFFERENCE permits a magnitude guide but not an axis with a meaningful zero. SIZE wants a legend or measurement guide; a position axis would be premature. UNDEFINED contributes nothing.
2. Should that guide be drawn here? Independent of the kind. The root of a stacked bar may have a POSITION y-space that permits a quantitative axis; a nested stack inside a more complex diagram might have the same kind without deserving its own visible axis. Conversely, a facet operator might explicitly request labels for the ORDINAL spaces it creates.

The current implementation only answers the first question, and only at the root. gofish.tsx's render() takes a chart-level axes: boolean | { x?, y? } option and renders an axis when both the option is on and the root underlying space is POSITION (quantitative ticks), DIFFERENCE (a magnitude guide, currently limited), or ORDINAL (labels at laid-out positions). The space kind determines the axis style; the axes option controls visibility for each axis across the whole chart.
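
The root-level rule can be sketched as a small lookup. This is illustrative only: GuideStyle, guideForKind, axisEnabled, and rootGuide are hypothetical names, the x and y entries of the axes option are assumed to be booleans, and a missing entry is assumed to mean "off".

```typescript
type Kind = "position" | "difference" | "size" | "ordinal" | "undefined";
type GuideStyle = "quantitative-ticks" | "magnitude-guide" | "ordinal-labels";
type AxesOption = boolean | { x?: boolean; y?: boolean };

// Which axis style the root kind supports; SIZE and UNDEFINED get no axis.
function guideForKind(kind: Kind): GuideStyle | undefined {
  switch (kind) {
    case "position":
      return "quantitative-ticks";
    case "difference":
      return "magnitude-guide";
    case "ordinal":
      return "ordinal-labels";
    default:
      return undefined;
  }
}

// Assumption: a missing per-axis entry means the axis is off.
function axisEnabled(axes: AxesOption, axis: "x" | "y"): boolean {
  return typeof axes === "boolean" ? axes : axes[axis] === true;
}

// An axis is drawn only when the option is on and the kind supports one.
function rootGuide(kind: Kind, axes: AxesOption, axis: "x" | "y"): GuideStyle | undefined {
  return axisEnabled(axes, axis) ? guideForKind(kind) : undefined;
}

// Root of the stacked bar chart: [ORDINAL, POSITION].
const rootKinds: { x: Kind; y: Kind } = { x: "ordinal", y: "position" };
console.log(rootGuide(rootKinds.x, true, "x"), rootGuide(rootKinds.y, true, "y"));
// ordinal-labels quantitative-ticks
console.log(rootGuide(rootKinds.x, { y: true }, "x")); // undefined
```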

What's not implemented: per-node axis annotations on the underlying-space tree. There's no way for an inner operator to mark "this nested POSITION space deserves its own visible axis" or for an outer operator to suppress an axis its child would otherwise produce. Today that's not a problem because GoFish charts have a single overall coordinate space at the root and axes are decided once at the chart level.

When this matters — for nested coordinate spaces, faceting with per-facet axes, or charts that want different guide kinds on different parts of the same axis — the natural extension is to tag nodes in the tree with { axis?: "auto" | "show" | "hide", title?: ... } and have guide selection walk the tree as its own pass. Future work; tracked informally as "axis-tag follow-up" until a chart actually needs it.

Discrete non-position channels

The tree is for spatial channels (x and y). Discrete non-position channels — color, symbol, texture, stroke pattern, marker shape — don't create an underlying spatial structure and aren't represented here. They still need shared resolution (categories should map consistently across a graphic; users should be able to override defaults; operators should be able to introduce or delimit scopes), but the right model may be closer to a theming API than to axis inference: a discrete color or symbol channel resolves by looking up a category in an inherited theme scope, with local operators or marks able to override the palette.

The current code does this with a unit.color map on scaleContext (seeded by resolveColorScale in _node.ts), which is enough for GoFish today but is not yet a general theming system. Future work. See Color Scale Resolution for what is implemented today.
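
To make the theme-scope idea concrete, here is a minimal sketch of an inherited lookup with local overrides. It is purely hypothetical and does not describe the unit.color map or resolveColorScale: ColorScope, its methods, and the category names and colors are invented for illustration.

```typescript
// Hypothetical theme scope: a local palette plus an optional parent scope.
class ColorScope {
  private readonly local = new Map<string, string>();
  private readonly parentScope: ColorScope | undefined;

  constructor(parentScope?: ColorScope) {
    this.parentScope = parentScope;
  }

  // Bind a category to a color in this scope only.
  set(category: string, color: string): this {
    this.local.set(category, color);
    return this;
  }

  // Look up locally first, then fall back to enclosing scopes.
  lookup(category: string): string | undefined {
    return this.local.get(category) ?? this.parentScope?.lookup(category);
  }
}

// Chart-wide defaults, with one facet overriding a single category.
const chartScope = new ColorScope().set("salmon", "#fa8072").set("trout", "#4682b4");
const facetScope = new ColorScope(chartScope).set("trout", "#2e8b57");

console.log(facetScope.lookup("salmon"), facetScope.lookup("trout"), chartScope.lookup("trout"));
// #fa8072 #2e8b57 #4682b4
```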

Adding a new operator

Three things to consider:

What kinds of children does it expect? If your operator only ever sees POSITION children, you don't need to handle SIZE composition. If it can be the parent of a data-driven stack, you do.
What kind does it produce? Pick the most informative kind that honestly describes the result. A spread-style operator that lays children out side-by-side without summing should keep SIZE composition along its stack direction. An operator that fixes children to specific coordinates should produce POSITION. An operator that introduces a categorical axis should produce ORDINAL.
Does it transform spaces or merely pass them through? A coord transform annotates without changing the kind. enclose and wrap-style overlays use unionChildSpaces. position is a pass-through. Match the existing patterns in graphicalOperators/ and don't reinvent the merge logic per-operator.

If your operator is layout-time-only (no contribution to the kind tree), return [UNDEFINED, UNDEFINED] and rely on the children to drive inference upward through your wrapper (e.g. via unionChildSpaces from a parent layer).

Prior art

The general lesson — that graphical structure determines scale structure — is shared with Vega-Lite's resolver, Observable Plot's distributed inference, and Atom's recursive layout (Park et al. 2017). GoFish's contribution is generalizing that lesson into an explicit per-node intermediate representation rather than a set of operator-specific conventions. Anyone can add an operator that contributes, transforms, or consumes underlying-space facts; nothing in the layout, posScale, or guide pipelines is privileged.

The design also borrows from compiler architecture, especially typed intermediate representations and the value of an explicit elaboration pass that turns a convenient surface specification into a more precise representation that later passes can consume without re-inferring the same facts.

For a longer treatment, see the "Underlying Space Tree" section of GoFish's thesis chapter (parts/theory/underlying-space.typ in the companion thesis repo).

Pointers

The data definitions and constructors: src/ast/underlyingSpace.ts.
The traversal driver: _node.ts's resolveUnderlyingSpace().
Per-shape resolvers: src/ast/shapes/{rect,ellipse,petal,text,image}.tsx.
Per-operator resolvers (each colocated with the operator): src/ast/graphicalOperators/{spread,layer,scatter,enclose,porterDuff,position,connect,arrow,table,coord}.tsx.
Overlay union helpers: src/ast/graphicalOperators/alignment.ts.
The Monotonic algebra used by SIZE composition: src/util/monotonic.ts.
Layout consumption: gofish.tsx's layout() for root-level dispatch; spread.tsx's layout for the per-node computeScaleFactor.
Companion factory docs: The Mark Factory, The Operator Factory.
