Theoretical Foundations for Clustering and Screening Heterogeneous and High dimensional Data