By the law of large numbers, 1N∑Ni=1lnQθ(xi)→∑xP(x)lnQθ(x) almost surely. This is the cross entropy of P and Qθ. Also note that if we subtract this from the entropy of P, we get DKL(P||Qθ). So minimising the cross entropy over θ is equivalent to maximising DKL(P||Qθ).
I think the cross entropy of P and Qθ is actually H(P,Qθ)=−∑xP(x)lnQθ(x) (note the negative sign). The entropy of P is H(P)=−∑xP(x)lnP(x). Since DKL(P||Qθ)=∑xP(x)(ln(P(x)−lnQθ(x))=∑xP(x)lnP(x)−∑... (read more)
I think the cross entropy of P and Qθ is actually H(P,Qθ)=−∑xP(x)lnQθ(x) (note the negative sign). The entropy of P is H(P)=−∑xP(x)lnP(x). Since DKL(P||Qθ)=∑xP(x)(ln(P(x)−lnQθ(x))=∑xP(x)lnP(x)−∑... (read more)