Scalable and Cost-Effective Data Flow Analysis for Distributed Software: Algorithms and Applications