Formal definition of the free group

Written by Patrick Stevens last updated

The free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.

The construction

Write 

Xr
 for the set that contains all the freely reduced words over 
XX1
 (so, for instance, excluding the word 
aa1
). [1]

We define the free group 

F(X)
, or 
FX
, on the set 
X
 to be a certain subgroup of the symmetric group 
Sym(Xr)
: namely that which is generated by the following elements, one for each 
xXX1
:

  • ρx:Sym(Xr)Sym(Xr)
    , sending 
    a1a2ana1a2anx
     if 
    anx1
    , and 
    a1a2an1x1a1a2an1
    .
  • ρx1:Sym(Xr)Sym(Xr)
    , sending 
    a1a2ana1a2anx1
     if 
    anx
    , and 
    a1a2an1xa1a2an1
    .

Recall that each 

ρx
 lies in 
Sym(Xr)
, so each is a bijective function from 
Xr
 to 
Xr
. We specify it by stating what it does to every element of 
Xr
 (that is, to every freely reduced word over 
X
).

We first specify what it does to those words which don't end in 

x1
ρx
 simply appends an 
x
 to such words. We then specify it to the remaining words, those which do end in 
x1
: then 
ρx
 just removes the 
x1
.

It's easy to check that if 

ρx
 is given a freely-reduced word as input, then it produces a freely-reduced word as output, because the only change to the word is at the end and we make sure to provide a separate definition if 
x
 is to be cancelled. Therefore each 
ρx
 is a function 
XrXr
.

Then we do it all again for all the inverses 

x1
, creating the functions 
ρx1
; and finally, we add in the identity element, denoted 
ρε
, which simply returns its input unchanged.

Notice that the 

ρx
 and 
ρx1
 are all indeed bijective (and therefore members of 
Sym(Xr)
), because in fact 
ρx
 and 
ρx1
 are inverse to each other (each cancelling off what the other did), and a function with an inverse is bijective.

So, we've defined the free group as a certain subgroup of the symmetric group. Remember that the subgroup has as its group operation "function composition"; so 

ρxρy=ρxρy
, for instance. We will write 
ρxρy
 for this, omitting the group operation.

Something key to notice is that if we apply 

ρanρan1ρa1
 to the empty word 
ε
, we get 

ρanρan1ρa1(ε)=ρanρan1ρa3(ρa2(a1))=ρanan1a3(a1a2)==a1a2an

 if 

a1a2an
 is a freely reduced word. (Indeed, if the word is freely reduced then none of the successive 
ρai,ρai+1
 can have cancelled each other's effect out, so every application of a 
ρai
 must be appending a letter.) Hence we might hope to have captured the freely reduced words in our subgroup.

The formal definition is the same as the intuitive definition

We'll show that there is a bijection between the free group and the set of reduced words, by "converting" each reduced word into a corresponding member of the free group.

Take a reduced word, 

w=a1a2an
, and produce the member of the free group (that is, the function) 
ρa1ρa2ρan
. [2] This really does produce a member of the free group (i.e. of the subgroup of the symmetry group), because each 
ai
 is an element of 
XX1
 and we have already specified how to make 
ρai
 from such an element.

Now, we claim that in fact this map is injective: that is, we can't take two words 

a1a2an
 and 
b1b2bm
 and produce the same member of the free group. (That is, we show that 
ρa1ρa2ρan=ρb1ρb2ρbm
 implies 
a1an=b1bm
.) Indeed, if the two functions ("elements of the free group") are equal, then they must in particular do the same thing when they are applied to the empty word 
ε
. But by the "key notice" above, when we evaluate 
ρa1ρa2ρan
 at the empty word, we get 
anan1a2a1
; and when we evaluate 
ρb1ρb2ρbm
 at the empty word, we get 
bmbm1b2b1
; so the two words must be equal after all.

Todo

we've got this backwards


Finally, the map is surjective: we can make any member of the free group by "converting the appropriate reduced word into a function". Indeed, the free group is generated by the 

ρx
 and 
ρx1
 for 
xX
, so every element is some 
ρx1ρxn
 for some selection of 
x1,,xnXX1
. Note that 
x1xn
 need not necessarily be freely reduced as a word at the moment; but if it is indeed not freely reduced, so some 
xi,xi+1
 cancel each other out, then removing that pair completely doesn't change the function 
ρx1ρxn
. For example, 
ρx1ρx11ρx2=ρx2
. Hence the process of "performing one step of a free reduction" (i.e. removing a cancelling pair) doesn't change the member of the free group as a function; and since each such removal makes the word shorter, it must eventually terminate. It remains to show that it doesn't matter in what order we remove the cancelling pairs; but that is immediate because we've already shown that our "conversion" process is injective: we started with a member of the free group, so if it corresponds to a freely reduced word then it corresponds to a unique freely reduced word. Since we've just shown that it does indeed correspond to a freely reduced word (by repeatedly removing cancelling pairs), we are done.

The above shows that the free group can be considered just to be the set of reduced words.

  • ^

    We use the superscript 

    r
     to denote "reduced".

  • ^

    Recall that the group operation here is composition of functions, so this is actually the function 

    ρa1ρa2ρan
    .

  • Summaries

    You can edit summaries by clicking on them, reorder them by dragging, or add a new one (up to 3). By default you should avoid creating more than one summary unless the subject matter benefits substantially from multiple kinds of explanation.
  • Summary

    The free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.

    The free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.

  • The free group may be constructed formally using van der Waerden's trick, which is not intuitive at all but leads to a definition that is very easy to work with. This page will detail van der Waerden's construction, and will prove that the trick yields a group which has all the right properties to be the free group.

    The construction

    Write for the set that contains all the freely reduced words over (so, for instance, excluding the word ). [1]

    We define the free group , or , on the set to be a certain subgroup of the symmetric group : namely that which is generated by the following elements, one for each :

    • , sending if , and .
    • , sending if , and .

    Recall that each lies in , so each is a bijective function from to . We specify it by stating what it does to every element of (that is, to every freely reduced word over ).

    We first specify what it does to those words which don't end in : simply appends an to such words. We then specify it to the remaining words, those which do end in : then just removes the .

    It's easy to check that if is given a freely-reduced word as input, then it produces a freely-reduced word as output, because the only change to the word is at the end and we make sure to provide a separate definition if is to be cancelled. Therefore each is a function .

    Then we do it all again for all the inverses , creating the functions ; and finally, we add in the identity element, denoted , which simply returns its input unchanged.

    Notice that the and are all indeed bijective (and therefore members of ), because in fact and are inverse to each other (each cancelling off what the other did), and a function with an inverse is bijective.

    So, we've defined the free group as a certain subgroup of the symmetric group. Remember that the subgroup has as its group operation "function composition"; so , for instance. We will write for this, omitting the group operation.

    Something key to notice is that if we apply to the empty word , we get if is a freely reduced word. (Indeed, if the word is freely reduced then none of the successive can have cancelled each other's effect out, so every application of a must be appending a letter.) Hence we might hope to have captured the freely reduced words in our subgroup.

    The formal definition is the same as the intuitive definition

    We'll show that there is a bijection between the free group and the set of reduced words, by "converting" each reduced word into a corresponding member of the free group.

    Take a reduced word, , and produce the member of the free group (that is, the function) . [2] This really does produce a member of the free group (i.e. of the subgroup of the symmetry group), because each is an element of and we have already specified how to make from such an element.

    Now, we claim that in fact this map is injective: that is, we can't take two words and and produce the same member of the free group. (That is, we show that implies .) Indeed, if the two functions ("elements of the free group") are equal, then they must in particular do the same thing when they are applied to the empty word . But by the "key notice" above, when we evaluate at the empty word, we get ; and when we evaluate at the empty word, we get ; so the two words must be equal after all.

    Finally, the map is surjective: we can make any member of the free group by "converting the appropriate reduced word into a function". Indeed, the free group is generated by the and for , so every element is some for some selection of . Note that need not necessarily be freely reduced as a word at the moment; but if it is indeed not freely reduced, so some cancel each other out, then removing that pair completely doesn't change the function . For example, . Hence the process of "performing one step of a free reduction" (i.e. removing a cancelling pair) doesn't change the member of the free group as a function; and since each such removal makes the word shorter, it must eventually terminate. It remains to show that it doesn't matter in what order we remove the cancelling pairs; but that is immediate because we've already shown that our "conversion" process is injective: we started with a member of the free group, so if it corresponds to a freely reduced word then it corresponds to a unique freely reduced word. Since we've just shown that it does indeed correspond to a freely reduced word (by repeatedly removing cancelling pairs), we are done.

    The above shows that the free group can be considered just to be the set of reduced words.

    1. ^︎

      We use the superscript to denote "reduced".

    2. ^︎

      Recall that the group operation here is composition of functions, so this is actually the function .

    Parents:
    Parents
    1.
    ^︎

    We use the superscript to denote "reduced".

    2.
    ^︎

    Recall that the group operation here is composition of functions, so this is actually the function .