降維(比如PCA或者random projection) -> KDE(kernel density estimation)來估算密度 -> KL divergence