Colimits and pushouts

Disclaimer. The content of this page is largely LM-generated. It was written as a stopgap to make the panproto system legible while we work through the book verifying and editing the content by hand. When a chapter has been verified or edited by a human, the parts that were verified or edited will be noted at the head of the chapter.

The coproduct of the previous chapter combined two objects without asking anything about how they might already be related. In the setting of panproto, and indeed in most settings where structured things have to be combined, that is not what one usually wants. Two schemas that both descend from a shared ancestor should combine in a way that identifies the shared piece; two protocols with a common sub-vocabulary should combine in a way that identifies the common vocabulary. The construction that handles this is the pushout, and the general theory it sits inside is that of colimits.

The chapter has three tasks. The first is to define a diagram — the mathematical gadget for naming a shape of combination — and the general notion of colimit over a diagram of any shape. The second is to specialise to the three-object diagram called a span, whose colimit is the pushout. The third is to show the pushout at work in panproto, where it governs both protocol composition and the three-way merge operation at the heart of schematic version control.

Diagrams

Before defining the general colimit we need a vocabulary for “shape of combination”. The relevant gadget is straightforward once one has functors in hand: a shape is itself a small category, and a diagram of that shape in the target is a functor from the shape into the target.

A diagram in a category $C$ is a functor $D : J \to C$ from some small category $J$ , called the shape of the diagram, into $C$ . The small category $J$ encodes the pattern of things to be combined and the relations among them; the functor $D$ picks out specific objects and morphisms of $C$ that realise the pattern.

A reader who has survived the last two chapters may feel the definition is understated — a diagram is just a functor, which is already a familiar object. That is the force of the definition. Everything we know about functors applies to diagrams, including the whole apparatus of natural transformations and morphisms between diagrams, and the theory of colimits inherits the universal-property style of argument from the previous chapter without new machinery.

Two examples of shape categories will carry most of the weight.

The discrete shape on two objects, call it $J_{disc}$ , has two objects and no non-identity morphisms. A diagram of this shape in $C$ picks out two objects of $C$ and says nothing about how they relate. Colimits over diagrams of this shape are coproducts, which we have met already; $n$ -object discrete shapes give $n$ -fold coproducts; and the empty shape gives an initial object.

The span shape, call it $J_{span}$ , has three objects $⋆_{0}, ⋆_{1}, ⋆_{2}$ and two non-identity morphisms, one from $⋆_{0}$ to $⋆_{1}$ and one from $⋆_{0}$ to $⋆_{2}$ , with no composites beyond identities. A diagram of this shape in $C$ picks out three objects $A, B, C$ and two morphisms $f : A \to B$ and $g : A \to C$ . Colimits over diagrams of this shape are pushouts, and they are the construction that does most of the book’s work.

Other shapes give other well-known colimits — a parallel-pair shape gives a coequaliser, an $ω$ -chain shape gives a sequential colimit — but we will not need those here. The span is the one to keep in hand.

Cocones and colimits

The universal property of the coproduct, read carefully, generalises to any diagram. A coproduct of $A$ and $B$ was an object equipped with two morphisms out of $A$ and $B$ , universal among such equipments. In the general case the equipment is a family of morphisms indexed by the objects of the diagram’s shape.

A cocone under a diagram $D : J \to C$ with apex $X \in Ob (C)$ is a natural transformation from $D$ to the constant functor $Δ X : J \to C$ that sends every object of $J$ to $X$ and every morphism to $id_{X}$ . Less economically: a cocone is a family of morphisms $α_{j} : D (j) \to X$ , one for each object $j$ of $J$ , subject to the requirement that for every morphism $u : j \to k$ in $J$ we have

$α_{k} \circ D (u) = α_{j} .$

The $α_{j}$ are the cocone’s leg morphisms, and the equation says the legs are compatible with the shape’s arrows: if $J$ prescribes an arrow from $j$ to $k$ , then the leg at $j$ must factor through the leg at $k$ via the image of that arrow.

A span diagram $B f A g C$ in $C$ has three legs, $α_{A} : A \to X$ , $α_{B} : B \to X$ , $α_{C} : C \to X$ , and two compatibility conditions, $α_{B} \circ f = α_{A}$ and $α_{C} \circ g = α_{A}$ . The two equations force $α_{A}$ to be determined by the other two legs, so a span cocone is effectively a pair $(α_{B}, α_{C})$ satisfying $α_{B} \circ f = α_{C} \circ g$ . Most practical specifications of a pushout skip straight to this pair-with-equation formulation and omit the explicit third leg.

A colimit of a diagram $D$ is a universal cocone: an apex $C$ with leg morphisms $ι_{j} : D (j) \to C$ such that every other cocone $(Z, α_{j})$ factors through $C$ by a unique morphism $u : C \to Z$ with $u \circ ι_{j} = α_{j}$ for every $j$ . The uniqueness-up-to-isomorphism template of the previous chapter applies verbatim: two cocones both satisfying the universal property are canonically identified, and we speak of the colimit.

Coproducts drop out of this as the case when the shape is discrete: the universal factorisation $[g_{1}, g_{2}]$ from the previous chapter is exactly the unique morphism the general definition produces. Initial objects drop out as the case when the shape is empty. The general pattern includes them as the smallest cases, and the next section specialises to the three-object case that will get most of our attention.

Pushouts

A pushout is the colimit of a span.

Given a span $B f A g C$ , a pushout is an object $P$ together with morphisms $ι_{B} : B \to P$ and $ι_{C} : C \to P$ satisfying

$ι_{B} \circ f = ι_{C} \circ g$

and universal among such: every other pair $(α_{B}, α_{C})$ into some object $Z$ that satisfies the analogous equation factors through $P$ by a unique morphism $u : P \to Z$ . The picture is

$A f ↓ ⏐ B g ι_{B} C ↓ ⏐ ι_{C} P$

Figure 4.1: the pushout square. The equation $ι_{B} \circ f = ι_{C} \circ g$ says the square commutes; universality says every other commuting square $A ⇉ B, C ⇉ Z$ factors through this one by a unique $P \to Z$ .

One sentence of intuition. The pushout $P$ glues $B$ and $C$ together along their shared image of $A$ . If $f$ and $g$ are absent — if the span is really a discrete pair — the pushout collapses to the coproduct $B ⊔ B$ , which does no gluing. What $f$ and $g$ add is the requirement that the two images of $A$ , one in $B$ and one in $C$ , be identified in the pushout. $P$ is the smallest object in which they are, which is what one usually means by “gluing two things together along a shared part”.

Pushout in $Set$

In the category of sets, the pushout has an explicit construction. Take the disjoint union $B ⊔ C$ . For each $a \in A$ , identify the element $f (a) \in B$ with the element $g (a) \in C$ . The quotient of the disjoint union by the equivalence relation generated by these identifications is the pushout, and the two injections $ι_{B}$ and $ι_{C}$ send an element to its equivalence class.

A minimal example: $A = {0}$ , $B = {b}$ , $C = {c}$ , $f (0) = b$ , $g (0) = c$ . The disjoint union has two elements, $b$ and $c$ ; the identification equates them; the pushout is a one-element set. Both injections land at the single element, and the original elements of $B$ and $C$ have been glued together along the common image from $A$ .

A slightly larger example makes the pattern visible. Let $A = {0, 1}$ , $B = {b_{0}, b_{1}, b_{*}}$ , $C = {c_{0}, c_{1}, c_{*}}$ , with $f (0) = b_{0}, f (1) = b_{1}$ and $g (0) = c_{0}, g (1) = c_{1}$ . The disjoint union has six elements. The identifications equate $b_{0}$ with $c_{0}$ and $b_{1}$ with $c_{1}$ , but the starred elements of $B$ and $C$ are unaffected, since nothing in $A$ maps to them. The pushout has four elements: ${b_{0} = c_{0}}$ , ${b_{1} = c_{1}}$ , ${b_{*}}$ , ${c_{*}}$ .

This is what “gluing along” means concretely in $Set$ : two copies of something, sharing a subset, welded at the subset. The same picture will return in Merge as pushout, where the “subset” is a common-ancestor schema and the “two copies” are the two branches’ schemas. There is no new construction there; there is only a different choice of category in which to take the same pushout.

When pushouts do not exist

Not every category has every pushout. $Set$ does, and so do the categories of groups, topological spaces, vector spaces, and panproto schemas. $Hask$ does not in general, because the sum type that a pushout would demand might carry an equational constraint Haskell’s type system does not allow the programmer to express. A category has all pushouts if every span in it admits a pushout; the word for a category with all (small) colimits is cocomplete. Most of the categories we work with are cocomplete for the same reason $Set$ is: their definitions allow the explicit constructions the abstract machinery asks for.

Pushout of theories

In the category of generalised algebraic theories — the subject of the next chapter — the pushout glues two theories along a common sub-theory.

Given a span $T_{1} f T_{0} g T_{2}$ , the pushout $T_{1} +_{T_{0}} T_{2}$ is a theory whose sorts and operations are the disjoint union of those in $T_{1}$ and $T_{2}$ , with sorts and operations in the image of $T_{0}$ identified along $f$ and $g$ . The construction parallels the set-theoretic one; the extra work is that the theories’ equations must be inherited, which functoriality of the translation takes care of.

This is the setting worked out at length in the institutions framework of Goguen & Burstall (1992), developed precisely to handle parametric combinations of logical and algebraic theories. An institution is, roughly, a category of theories together with a functor to its category of models, set up so that colimits of theories lift to operations on models in a controlled way. Panproto’s treatment of protocol composition is institutional in this sense, though we will not develop the full institutional machinery here; a reader who wants it can find it in Goguen and Burstall.

Pushouts in panproto

The two places where pushouts dominate panproto’s engineering are protocol composition and three-way merge.

Protocol composition

A protocol is a generalised algebraic theory together with a parser, an emitter, and a registration, as Protocols as theories, schemas as instances develops in detail. Two protocols that share a common sub-protocol combine by taking the pushout of the theories along the shared sub-theory.

The concrete case that makes this useful is when two protocols need to agree on some shared vocabulary. Panproto ships a protocol for ATProto lexicons and a separate one for Apache Avro schemas. Both represent records with named fields of declared types; both share a common sub-vocabulary of primitive types (strings, integers, booleans). A combined protocol accepting both formats, and translating between them at the boundaries, is the pushout of the two along the shared primitives sub-theory. The pushout is the place where the two vocabularies agree; the rest of each protocol sits above that shared substrate.

Panproto’s implementation lives in panproto_gat::colimit and panproto_schema::colimit. The pushout is verified by the usual property-based test: construct a cocone out of two arbitrary morphisms agreeing on the shared subprotocol, check that it factors through the computed pushout by a unique morphism, repeat across a sampled state space. Protocol colimits develops the machinery in full.

Three-way merge

The second and more visible application is the three-way merge operation at the heart of schematic version control.

Here the running example from Part I reappears. Two developers branch from a common-ancestor schema $S_{0}$ — our name-and-email address-record schema — and each evolves it independently. Developer A adds a phone field, producing $S_{1}$ . Developer B, working in parallel, renames email to contact_email, producing $S_{1}^{'}$ . The two branches need to merge.

Spelling out the span: $S_{1} m_{A} S_{0} m_{B} S_{1}^{'}$ , with $m_{A}$ the add-phone migration and $m_{B}$ the rename-email migration. The pushout $S_{1} +_{S_{0}} S_{1}^{'}$ is a schema containing every field of $S_{1}$ and every field of $S_{1}^{'}$ , with the fields inherited from $S_{0}$ (the original name, and email under its new name) appearing once. The injections $ι_{S_{1}}$ and $ι_{S_{1}^{'}}$ are the migrations from each branch into the merged schema; applied to each branch’s data, they yield the same records in the merged form.

$S_{0} m_{A} ↓ ⏐ S_{1} m_{B} ι_{S_{1}} S_{1}^{'} ↓ ⏐ ι_{S_{1}^{'}} S_{1} +_{S_{0}} S_{1}^{'}$

Figure 4.2: three-way merge as a pushout. The two branch migrations $m_{A}$ and $m_{B}$ fan out from the common ancestor; the pushout glues them together along $S_{0}$ ’s shared content, and a record on either branch is carried into the merged schema by the appropriate injection.

Not every merge has a pushout. If A renames email to contact_email while B renames the same field to email_addr, the pushout would have to agree with both renamings on $S_{0}$ ’s email, which it cannot. Panproto’s merge algorithm detects this at the level of the universal property: the two branches present a cocone that does not factor uniquely through any candidate pushout, which means the pushout does not exist as a schema under the protocol. The engineering reading is that the merge has a genuine conflict, and the algorithm reports the exact disagreement — the two migrations cannot both commute with a third — rather than producing conflict markers in a text file.

The same pushout construction appears in the patch-theoretic treatment of textual merges due to Mimram & Giusto (2013), which informs the Darcs and Pijul systems (Roundy 2005). Panproto’s contribution is to apply the construction at the schema level rather than the byte level, which produces merge results that survive changes to the textual representation that have nothing to do with schema content. Merge as pushout specialises the present chapter’s construction to the implementation, including the handling of data stored under each branch.

Closing

The next chapter introduces algebraic and generalised algebraic theories, the mathematical language in which panproto writes down what a protocol is.

Keyboard shortcuts

panproto