XS-VID: An Extra Small Object Video Detection Dataset

XS-VID is a comprehensive dataset for Extra Small Object Video Detection, including diverse day and night scenes such as rivers, forests, skyscrapers, and streets.

Update

XS-VID

XS-VID contains a diverse array of scenes featuring multiple categories and sizes of targets. Notably, XS-VID achieves unprecedented breadth and depth in covering and quantifying minuscule targets (< $32^2$ pixels). Some example images are shown below.

[Figure: example images from XS-VID]
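The size threshold quoted above can be expressed as a simple area check. The following is a minimal sketch, assuming the $32^2$ limit applies to bounding-box area (width times height) as in COCO-style size conventions; the helper name `is_minuscule` and the sample boxes are hypothetical and not part of the XS-VID toolkit.

```python
# Hypothetical helper: the name and the (width, height) input are illustrative,
# not part of the official XS-VID code.
SMALL_AREA_LIMIT = 32 ** 2  # 1024 square pixels, the threshold quoted above


def is_minuscule(width: float, height: float) -> bool:
    """Return True when a box's area is strictly below 32^2 pixels."""
    if width < 0 or height < 0:
        raise ValueError("box dimensions must be non-negative")
    return width * height < SMALL_AREA_LIMIT


# Made-up (width, height) pairs, used only to exercise the check.
boxes = [(10, 12), (32, 32), (40, 20), (64, 48)]
minuscule = [box for box in boxes if is_minuscule(*box)]
```

A 32 x 32 box sits exactly at the limit and is therefore not counted as minuscule under this strict comparison.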

Here is a statistical comparison of our dataset with other related datasets.

[Figure: statistical comparison of XS-VID with related datasets]

Results

We report the quantitative results of several representative methods on the XS-VID test set and the VisDrone2019 VID test-dev set below.

[Figure: quantitative results on the XS-VID test set and the VisDrone2019 VID test-dev set]

Download

The dataset is available for download.

Choose a download method to obtain the annotations and all images. Make sure all the split archive files (e.g., images.zip, images.z01, images.z02) are in the same directory, then extract them with the following commands:

unzip images.zip
unzip annotations.zip
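Before extracting, it can help to confirm that no part of the split archive is missing from the directory. The following is a minimal sketch, assuming the parts follow the images.zip, images.z01, images.z02 naming shown above and are numbered contiguously from 01; the function name `missing_split_parts` is hypothetical and not part of the official code.

```python
import re
from pathlib import Path


def missing_split_parts(directory: str, stem: str = "images") -> list[str]:
    """List expected-but-absent files of a split zip archive.

    Assumes parts are named <stem>.z01, <stem>.z02, ... plus a final
    <stem>.zip, and that part numbers run contiguously from 01.
    """
    folder = Path(directory)
    pattern = re.compile(rf"^{re.escape(stem)}\.z(\d+)$")

    # Collect the part numbers that are actually present.
    found = sorted(
        int(match.group(1))
        for path in folder.iterdir()
        if (match := pattern.match(path.name))
    )

    missing = []
    if not (folder / f"{stem}.zip").is_file():
        missing.append(f"{stem}.zip")

    # Report any gap between part 01 and the highest part seen.
    present = set(found)
    highest = found[-1] if found else 0
    for number in range(1, highest + 1):
        if number not in present:
            missing.append(f"{stem}.z{number:02d}")
    return missing
```

Calling `missing_split_parts(".")` in the download directory returns an empty list when no gap is found. The check cannot detect a missing final part, since the total number of parts is not known in advance.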

If you encounter an error while unpacking, you can find help in the issues.

Code

The official code for our benchmark, which mainly covers data preparation and evaluation, is released below.

Support or Contact

If you have any questions about the XS-VID benchmark, please feel free to contact us at gjh_hust@hust.edu.cn.