【Stable Diffusion】FID、CLIP、cfg-scales都是什么

在stable-diffusion 仓库中，是这样评价模型的。

Evaluations with different classifier-free guidance scales (1.5, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0) and 50 PLMS sampling steps show the relative improvements of the checkpoints，

对应了这张图

图里的FID score 、Clip score 和 cfg-scales 都是什么意思呢？

FID score

FID（Fréchet Inception Distance ） score 是一种用于评估生成图像质量的度量标准，专门用于评估模型生成图片的性能，计算公式如下所示：

F I D ( p , q ) = ∣ ∣ μ p − μ q ∣ ∣ 2 2 + T r ( C p + C q − 2 C p C q ) \mathrm{FID}(p, q) = ||\mu_p - \mu_q||_2^2 + \mathrm{Tr}(C_p + C_q - 2\sqrt{C_pC_q}) FID(p,q)=∣∣μp−μq