人気の記事一覧

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

1か月前

SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models

4か月前