Reducing Token Redundancy in Video-Language Models via Memory Consolidation Algorithm