Hierarchical Temporal Structure And Deep Learning Methods Of Speech And Music