All of Ethan (EJ) Watkins's Comments + Replies

By the law of large numbers,  almost surely. This is the cross entropy of  and . Also note that if we subtract this from the entropy of , we get . So minimising the cross entropy over  is equivalent to maximising .

I think the cross entropy of  and  is actually  (note the negative sign). The entropy of  is . Since ... (read more)

I think there is a mistake in this equation.  and  are the wrong way round. It should be:

 

1CallumMcDougall
Yep that's right, thanks! Corrected.