Formal definition of the free group

https://arbital.com/p/free_group_formal_definition

by Patrick Stevens Aug 5 2016

Van der Waerden's trick lets us define the free groups in a slick but mostly incomprehensible way.


[summary: The Free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.]

The Free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.

The construction

Write Xr for the set that contains all the freely reduced words over XX1 (so, for instance, excluding the word aa1). %%note:We use the superscript r to denote "reduced".%%

We define the free group F(X), or FX, on the set X to be a certain Subgroup of the Symmetric group Sym(Xr): namely that which is generated by the following elements, one for each xXX1:

Recall that each ρx lies in Sym(Xr), so each is a Bijective function from Xr to Xr. We specify it by stating what it does to every element of Xr (that is, to every freely reduced word over X).

We first specify what it does to those words which don't end in x1: ρx simply appends an x to such words. We then specify it to the remaining words, those which do end in x1: then ρx just removes the x1.

It's easy to check that if ρx is given a freely-reduced word as input, then it produces a freely-reduced word as output, because the only change to the word is at the end and we make sure to provide a separate definition if x is to be cancelled. Therefore each ρx is a function XrXr.

Then we do it all again for all the inverses x1, creating the functions ρx1; and finally, we add in the Identity element, denoted ρε, which simply returns its input unchanged.

Notice that the ρx and ρx1 are all indeed bijective (and therefore members of Sym(Xr)), because in fact ρx and ρx1 are inverse to each other (each cancelling off what the other did), and a function with an inverse is bijective.

So, we've defined the free group as a certain subgroup of the symmetric group. Remember that the subgroup has as its group operation "function composition"; so ρxρy=ρxρy, for instance. We will write ρxρy for this, omitting the group operation.

Something key to notice is that if we apply ρanρan1ρa1 to the empty word ε, we get ρanρan1ρa1(ε)=ρanρan1ρa3(ρa2(a1))=ρanan1a3(a1a2)==a1a2an if a1a2an is a freely reduced word. (Indeed, if the word is freely reduced then none of the successive ρai,ρai+1 can have cancelled each other's effect out, so every application of a ρai must be appending a letter.) Hence we might hope to have captured the freely reduced words in our subgroup.

The formal definition is the same as the intuitive definition

We'll show that there is a bijection between the free group and the set of reduced words, by "converting" each reduced word into a corresponding member of the free group.

Take a reduced word, w=a1a2an, and produce the member of the free group (that is, the function) ρa1ρa2ρan. %%note:Recall that the group operation here is composition of functions, so this is actually the function ρa1ρa2ρan.%% This really does produce a member of the free group (i.e. of the subgroup of the symmetry group), because each ai is an element of XX1 and we have already specified how to make ρai from such an element.

Now, we claim that in fact this map is injective: that is, we can't take two words a1a2an and b1b2bm and produce the same member of the free group. (That is, we show that ρa1ρa2ρan=ρb1ρb2ρbm implies a1an=b1bm.) Indeed, if the two functions ("elements of the free group") are equal, then they must in particular do the same thing when they are applied to the empty word ε. But by the "key notice" above, when we evaluate ρa1ρa2ρan at the empty word, we get anan1a2a1; and when we evaluate ρb1ρb2ρbm at the empty word, we get bmbm1b2b1; so the two words must be equal after all. [todo: we've got this backwards]

Finally, the map is surjective: we can make any member of the free group by "converting the appropriate reduced word into a function". Indeed, the free group is generated by the ρx and ρx1 for xX, so every element is some ρx1ρxn for some selection of x1,,xnXX1. Note that x1xn need not necessarily be freely reduced as a word at the moment; but if it is indeed not freely reduced, so some xi,xi+1 cancel each other out, then removing that pair completely doesn't change the function ρx1ρxn. For example, ρx1ρx11ρx2=ρx2. Hence the process of "performing one step of a free reduction" (i.e. removing a cancelling pair) doesn't change the member of the free group as a function; and since each such removal makes the word shorter, it must eventually terminate. It remains to show that it doesn't matter in what order we remove the cancelling pairs; but that is immediate because we've already shown that our "conversion" process is injective: we started with a member of the free group, so if it corresponds to a freely reduced word then it corresponds to a unique freely reduced word. Since we've just shown that it does indeed correspond to a freely reduced word (by repeatedly removing cancelling pairs), we are done.

The above shows that the free group can be considered just to be the set of reduced words.