Action Anticipation with Vision-Language Models