In typical setups, we rely on a pre-trained score model, sϕ(x,t), to approximate the true score function ∇logpt(x). However, we can bypass this pre-trained model by using the following unbiased estimator:
∇logpt(xt)=−E[t2xt−xxt]
where x∼pdata and xt∼N(x;t2I). This implies that, given x and xt, we can estimate ∇logpt(xt) as −(xt−x)/t2.
This unbiased estimate serves as a sufficient replacement for the pre-trained diffusion model in consistency distillation, particularly when using the Euler method as the ODE solver in the limit of N→∞.