[summary: The Free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.]
The Free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.
The construction
Write Xr for the set that contains all the freely reduced words over X∪X−1 (so, for instance, excluding the word aa−1). %%note:We use the superscript r to denote "reduced".%%
We define the free group F(X), or FX, on the set X to be a certain Subgroup of the Symmetric group Sym(Xr): namely that which is generated by the following elements, one for each x∈X∪X−1:
- ρx:Sym(Xr)→Sym(Xr), sending a1a2…an↦a1a2…anx if an≠x−1, and a1a2…an−1x−1↦a1a2…an−1.
- ρx−1:Sym(Xr)→Sym(Xr), sending a1a2…an↦a1a2…anx−1 if an≠x, and a1a2…an−1x↦a1a2…an−1.
Recall that each ρx lies in Sym(Xr), so each is a Bijective function from Xr to Xr. We specify it by stating what it does to every element of Xr (that is, to every freely reduced word over X).
We first specify what it does to those words which don't end in x−1: ρx simply appends an x to such words. We then specify it to the remaining words, those which do end in x−1: then ρx just removes the x−1.
It's easy to check that if ρx is given a freely-reduced word as input, then it produces a freely-reduced word as output, because the only change to the word is at the end and we make sure to provide a separate definition if x is to be cancelled. Therefore each ρx is a function Xr→Xr.
Then we do it all again for all the inverses x−1, creating the functions ρx−1; and finally, we add in the Identity element, denoted ρε, which simply returns its input unchanged.
Notice that the ρx and ρx−1 are all indeed bijective (and therefore members of Sym(Xr)), because in fact ρx and ρx−1 are inverse to each other (each cancelling off what the other did), and a function with an inverse is bijective.
So, we've defined the free group as a certain subgroup of the symmetric group. Remember that the subgroup has as its group operation "function composition"; so ρx⋅ρy=ρx∘ρy, for instance. We will write ρxρy for this, omitting the group operation.
Something key to notice is that if we apply ρanρan−1…ρa1 to the empty word ε, we get ρanρan−1…ρa1(ε)=ρanρan−1…ρa3(ρa2(a1))=ρanan−1…a3(a1a2)=⋯=a1a2…an if a1a2…an is a freely reduced word. (Indeed, if the word is freely reduced then none of the successive ρai,ρai+1 can have cancelled each other's effect out, so every application of a ρai must be appending a letter.) Hence we might hope to have captured the freely reduced words in our subgroup.
The formal definition is the same as the intuitive definition
We'll show that there is a bijection between the free group and the set of reduced words, by "converting" each reduced word into a corresponding member of the free group.
Take a reduced word, w=a1a2…an, and produce the member of the free group (that is, the function) ρa1ρa2…ρan. %%note:Recall that the group operation here is composition of functions, so this is actually the function ρa1∘ρa2∘⋯∘ρan.%% This really does produce a member of the free group (i.e. of the subgroup of the symmetry group), because each ai is an element of X∪X−1 and we have already specified how to make ρai from such an element.
Now, we claim that in fact this map is injective: that is, we can't take two words a1a2…an and b1b2…bm and produce the same member of the free group. (That is, we show that ρa1ρa2…ρan=ρb1ρb2…ρbm implies a1…an=b1…bm.) Indeed, if the two functions ("elements of the free group") are equal, then they must in particular do the same thing when they are applied to the empty word ε. But by the "key notice" above, when we evaluate ρa1ρa2…ρan at the empty word, we get anan−1…a2a1; and when we evaluate ρb1ρb2…ρbm at the empty word, we get bmbm−1…b2b1; so the two words must be equal after all. [todo: we've got this backwards]
Finally, the map is surjective: we can make any member of the free group by "converting the appropriate reduced word into a function". Indeed, the free group is generated by the ρx and ρx−1 for x∈X, so every element is some ρx1…ρxn for some selection of x1,…,xn∈X∪X−1. Note that x1…xn need not necessarily be freely reduced as a word at the moment; but if it is indeed not freely reduced, so some xi,xi+1 cancel each other out, then removing that pair completely doesn't change the function ρx1…ρxn. For example, ρx1ρx−11ρx2=ρx2. Hence the process of "performing one step of a free reduction" (i.e. removing a cancelling pair) doesn't change the member of the free group as a function; and since each such removal makes the word shorter, it must eventually terminate. It remains to show that it doesn't matter in what order we remove the cancelling pairs; but that is immediate because we've already shown that our "conversion" process is injective: we started with a member of the free group, so if it corresponds to a freely reduced word then it corresponds to a unique freely reduced word. Since we've just shown that it does indeed correspond to a freely reduced word (by repeatedly removing cancelling pairs), we are done.
The above shows that the free group can be considered just to be the set of reduced words.