VRSBench VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding Link 29.6k FIT-RS SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote ...
Ego4D Audio, IoT Text, IMU, Video, Audio, 3D 3, 670h data, 3.85M narrations Classification, Forecasting, etc. Ego-Exo4D Audio, IoT Text, IMU, Video, Audio, Eye Gaze ...