Universal properties

Disclaimer. The content of this page is largely LM-generated. It was written as a stopgap to make the panproto system legible while we work through the book verifying and editing the content by hand. When a chapter has been verified or edited by a human, the parts that were verified or edited will be noted at the head of the chapter.

Any construction in a category can be specified in two ways: by saying what it is (the Cartesian product of two sets is a set of ordered pairs; the disjoint union is a set of tagged alternatives), or by saying how morphisms into or out of it look. The first way is concrete; the second is abstract. The abstract way turns out to be more useful, because it pins the construction down up to isomorphism without committing to any particular representation, and because a single abstract specification can be instantiated in many different categories at once. The abstract way is what we mean by a universal property.

The present chapter develops the two smallest examples: products and coproducts. Generalisation to limits and colimits, which the next chapter covers, is immediate once the pattern of argument is clear in the small case; and essentially every construction in Parts II and V of this book will turn out to be universal.

Products

We start with the example every reader already knows: the Cartesian product of two sets.

Ordinarily one defines $A \times B$ by declaring its elements to be the ordered pairs $(a, b)$ with $a \in A$ and $b \in B$ , and one observes that two projection functions come along for free: $π_{1} (a, b) = a$ and $π_{2} (a, b) = b$ . That definition is perfectly adequate for computations in set theory, and one can hardly fault it. A category theorist would nonetheless ask a different question: what role does $A \times B$ play among all sets that come equipped with a pair of functions to $A$ and $B$ ?

Once the question is phrased this way, the answer writes itself. Take any set $Z$ together with a function $f_{1} : Z \to A$ and a function $f_{2} : Z \to B$ . Then there is one and only one function $⟨ f_{1}, f_{2} ⟩ : Z \to A \times B$ compatible with the projections, namely the function sending $z$ to the pair $(f_{1} (z), f_{2} (z))$ . Every other function $Z \to A \times B$ that claims to respect the projections — that is, whose composition with $π_{1}$ returns $f_{1}$ and whose composition with $π_{2}$ returns $f_{2}$ — has to agree with this one at every $z$ , because the two components $f_{1} (z)$ and $f_{2} (z)$ determine the pair they produce. The Cartesian product is therefore not just some set with projections but the universal such thing: every competing candidate factors through it, uniquely.

This is a universal property. It does not describe $A \times B$ by saying what its elements are. It describes it by saying what morphisms into it look like, and by asserting that those morphisms are uniquely determined by their projections. The shift of perspective — from “what is this thing made of” to “how does this thing relate to everything else” — is what categorical thinking does, and it turns out to generalise well beyond the setting of sets.

Stating the definition

Let $C$ be a category and let $A, B$ be two of its objects. A product of $A$ and $B$ is an object $P$ together with two morphisms

$π_{1} : P \to A π_{2} : P \to B$

(the projections) such that for every object $Z$ of $C$ and every pair of morphisms $f_{1} : Z \to A$ , $f_{2} : Z \to B$ , there exists a unique morphism $⟨ f_{1}, f_{2} ⟩ : Z \to P$ making both

$π_{1} \circ ⟨ f_{1}, f_{2} ⟩ = f_{1} and π_{2} \circ ⟨ f_{1}, f_{2} ⟩ = f_{2}$

hold. The standard picture of the situation is

$A π_{1} Z ↓ ⏐ ⟨ f_{1}, f_{2} ⟩ P π_{2} B$

Figure 3.1: the universal property of the product, drawn with the cone object $Z$ at the top, the two arbitrary morphisms $f_{1}$ and $f_{2}$ fanning downward (not drawn), and the factoring morphism into $P$ in the middle column. Any such cone factors through $P$ by exactly one morphism.

The definition is doing two things that a reader should notice separately. Existence says some factoring morphism is always available; uniqueness says at most one is. Drop existence and some cones over $A$ and $B$ will have nowhere to go inside the category. Drop uniqueness and calling $P$ “the product” becomes ambiguous, because different choices of factoring morphism give different squares, and the commutativity of the square no longer pins $⟨ f_{1}, f_{2} ⟩$ down.

Three examples

Sets. In the category $Set$ , the product of two sets is the Cartesian product $A \times B$ with the standard projections, and the factoring morphism is $⟨ f_{1}, f_{2} ⟩ (z) = (f_{1} (z), f_{2} (z))$ . The categorical definition has recovered the set-theoretic one. As it should: the reason to prefer the categorical version is not that it produces different answers in familiar cases but that it generalises to categories where “element of a set” is not the right notion.

Haskell types. In $Hask$ , the product of two types a and b is the pair type (a, b), projected out by fst and snd:

fst :: (a, b) -> a
fst (x, _) = x

snd :: (a, b) -> b
snd (_, y) = y

The factoring morphism is the two-function paired application:

pair :: (c -> a) -> (c -> b) -> c -> (a, b)
pair f g z = (f z, g z)

Uniqueness is forcible by pointwise reasoning. Any function h :: c -> (a, b) that agrees with f through fst and with g through snd has to satisfy h z = (f z, g z) for every z, and two functions equal on every input are equal; so h = pair f g. The universal property is not magic — it is a constraint provable in one line, and its work is to rule out the alternatives a less careful definition might allow.

Panproto schemas. In $Sch_{P}$ , the category of schemas under a fixed protocol, the product of two schemas $S_{1}$ and $S_{2}$ is a schema whose instances are pairs: one instance of $S_{1}$ alongside one instance of $S_{2}$ , with the two projection migrations extracting the two halves. Using the running example from the previous two chapters, the product $S_{0} \times S_{1}$ of the name-and-email schema with the name-and-email-and-phone schema has instances that are pairs — one record of each kind — and a migration into the product from some other schema is determined by a migration into each factor. The universal property holds by construction in panproto_schema::colimit, which holds both products and coproducts, limits and colimits being dual.

Products of schemas are not the most common operation panproto performs on its category of schemas: pushouts, covered in the next chapter, are more frequent. But they are available, and their universal property behaves exactly as the set-theoretic version prepared us to expect.

Uniqueness up to isomorphism

A universal property does not specify a unique object on the nose. It specifies an object up to a canonical isomorphism, which is almost as good, and the argument establishing this is worth walking through once because we will use the template of the argument again and again.

Suppose $(P, π_{1}, π_{2})$ and $(P^{'}, π_{1}^{'}, π_{2}^{'})$ are both products of $A$ and $B$ . The universal property of $P$ , applied to the cone $(P^{'}, π_{1}^{'}, π_{2}^{'})$ , produces a unique morphism $u : P^{'} \to P$ satisfying $π_{1} \circ u = π_{1}^{'}$ and $π_{2} \circ u = π_{2}^{'}$ . The universal property of $P^{'}$ , applied the other way, produces a unique morphism $v : P \to P^{'}$ . Compose the two in either order: $v \circ u : P^{'} \to P^{'}$ and $u \circ v : P \to P$ are each morphisms that factor their own target through itself in a way that respects the projections. Uniqueness forces each to be the identity, and therefore $u$ and $v$ are mutually inverse.

Two constructions claiming to be “the product”, therefore, are canonically identified by a unique isomorphism. We speak of the product of $A$ and $B$ , knowing different representatives yield isomorphic objects, and the isomorphism between them is itself forced. This is the template for every universal-property argument in the book: two candidates, each universal; apply each candidate’s property to the other; derive a unique iso.

Coproducts

The coproduct is the dual of the product. Everything we have just said about products applies with all the arrows reversed: injections replace projections, factoring out replaces factoring in, and the universal property characterises the smallest object through which pairs of morphisms out of $A$ and $B$ must factor.

A coproduct of $A$ and $B$ in a category $C$ is an object $S$ together with two morphisms

$ι_{1} : A \to S ι_{2} : B \to S$

(the injections) such that for every $Z$ and every pair of morphisms $g_{1} : A \to Z$ , $g_{2} : B \to Z$ , there exists a unique morphism $[g_{1}, g_{2}] : S \to Z$ with $[g_{1}, g_{2}] \circ ι_{1} = g_{1}$ and $[g_{1}, g_{2}] \circ ι_{2} = g_{2}$ .

Three examples again

Sets. The coproduct of two sets in $Set$ is the disjoint union $A ⊔ B$ , the set whose elements are tagged versions of the elements of $A$ and $B$ , arranged so that a shared element of $A$ and $B$ becomes two distinct elements of the union. The injections are the two tagging inclusions. The factoring morphism out of the coproduct is case analysis: apply $g_{1}$ to a tagged- $A$ element and $g_{2}$ to a tagged- $B$ one.

Haskell types. In $Hask$ , the coproduct of a and b is the sum type Either a b:

data Either a b = Left a | Right b

The two constructors Left and Right are the injections. The factoring morphism is the library function either:

either :: (a -> c) -> (b -> c) -> Either a b -> c
either f _ (Left  x) = f x
either _ g (Right y) = g y

Uniqueness here reads: any function out of Either a b that agrees with f on Left values and with g on Right values has to be either f g. The argument is again pointwise, and again the universal property is not magic but a forcible constraint on implementations.

Panproto schemas. In $Sch_{P}$ , the coproduct of two schemas is the disjoint-union schema: a schema whose instances are tagged, either an instance of the first summand or an instance of the second. A migration out of a coproduct schema is forced by a migration out of each summand, as the universal property demands. Coproducts of schemas, like products, live in panproto_schema::colimit; they will appear again in Merge as pushout as one ingredient of the more general pushout construction.

Initial and terminal objects

Two further universal constructions round out the picture: the universal cocone under an empty diagram (an initial object) and the universal cone over an empty diagram (a terminal object).

A terminal object $1$ in a category is one into which every other object has exactly one morphism. An initial object $0$ is one out of which every other object has exactly one morphism. Both are defined up to unique isomorphism by their universal properties, with the same template of argument as the one we gave for products.

Concretely: in $Set$ , any singleton ${*}$ is terminal (every set has exactly one function into it, the constant function) and the empty set is initial (every set has exactly one function out of it, the vacuous function). In $Hask$ , the unit type () is terminal and Void is initial. In $Sch_{P}$ , the schema with no fields is terminal and the schema with no instances is initial. Neither is a very exciting object on its own, but both become essential when we turn to diagrams larger than two objects, which is the next chapter.

The general pattern

Products, coproducts, terminal objects, and initial objects are four instances of a single pattern. Each fixes a shape of diagram (two objects, two objects, the empty diagram, the empty diagram) and declares the universal object that receives or produces the diagram’s data. Generalised to arbitrary shapes of diagram, the universal-cone construction is called a limit and the universal-cocone construction is called a colimit. A product is a limit (of a two-object discrete diagram); a terminal object is a limit (of the empty diagram); a coproduct and an initial object are colimits of the same two shapes. The next chapter develops the general theory, and the case that will matter most for the rest of the book — the pushout, colimit of a three-object span — gets a section of its own.

What the present chapter buys is that every construction in later parts of the book pinned down by a universal property enters with its uniqueness built in. The migration functors of Theory morphisms and instance migration are defined by universal properties ( $Σ_{f}$ as a left adjoint, $Π_{f}$ as a right adjoint, each forced up to isomorphism by what morphisms do to cones). The pushout of Merge as pushout is a colimit. The cross-protocol translations of Protocol colimits are computed by colimits in the category of protocols. Each of these is an instance of the pattern of this chapter, and each inherits its uniqueness from the universal-property machinery we have just written down.

Closing

The next chapter develops colimits and pushouts, generalising the universal-cocone construction of this chapter to arbitrary diagrams.

Keyboard shortcuts

panproto