Suppose I give it pairs of strings and ask it to output 1 if the number of 1s in the string is odd and 0 if it's even.
e.g.
0 -> 0
1 -> 1
11 -> 0
101 -> 0
1101 -> 1
10101001 -> 0
111000101110 -> 1
How well does it do on this task? What if we finetune it on sample data?
If you try this, reformat the strings to work around the BPE problem, as detailed in https://www.gwern.net/GPT-3#bpes
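
For anyone who wants to try it, here is a rough sketch of how the sample data and a few-shot prompt could be generated. It follows the labels in the examples listed above (1 when the count of 1s is odd, 0 when it is even). The function names (`parity`, `format_example`, `make_prompt`) are made up for illustration, and putting a space between each digit is only an assumed way of doing the BPE reformatting; check the linked page before relying on it.

```python
import random


def parity(bits: str) -> int:
    """Return 1 if `bits` contains an odd number of '1' characters, else 0."""
    return bits.count("1") % 2


def format_example(bits: str, spaced: bool = True) -> str:
    """Render one 'input -> label' line.

    Spacing out the digits is an assumed BPE workaround, not a confirmed recipe.
    """
    shown = " ".join(bits) if spaced else bits
    return f"{shown} -> {parity(bits)}"


def make_prompt(n_examples: int, max_len: int, query: str, seed: int = 0) -> str:
    """Build a few-shot prompt: random labelled strings, then an unlabelled query."""
    rng = random.Random(seed)
    lines = []
    for _ in range(n_examples):
        length = rng.randint(1, max_len)
        bits = "".join(rng.choice("01") for _ in range(length))
        lines.append(format_example(bits))
    # The final line is left open for the model to complete.
    lines.append(" ".join(query) + " ->")
    return "\n".join(lines)


if __name__ == "__main__":
    print(make_prompt(n_examples=5, max_len=12, query="1101"))
```

The same `format_example` lines could serve as finetuning samples; scoring the model then just means comparing its completion on the last line against `parity(query)`.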