NTU-Outdoor-38 Person ReID Dataset

The NTU-Outdoor-38 dataset was collected by the Rapid-Rich Object Search (ROSE) Lab, Nanyang Technological University (NTU), Singapore. This initiative was the result of a collaboration between the ROSE Lab and the Defence Science and Technology Agency (DSTA) of Singapore.

NTU-Outdoor-38 is a Person Re-identification (ReID) dataset collected around the outdoor campus grounds of NTU, spanning 38 non-overlapping cameras. Due to the wide area of coverage, the distribution of foot traffic varies considerably across the multi-camera system and presents a challenging ReID task with high levels of inter-camera data imbalance. Standard baseline solutions exhibit corresponding performance imbalances, with “less-data” cameras underperforming the “more-data” cameras. 

 

Sample Images from NTU-Outdoor-38

 

NTU-Outdoor-38 consists of images from actual surveillance cameras mounted on lamp-posts, better highlighting the viewing angles and imbalances inherent in real-world networked camera systems. It consists of outdoor scenes with large changes in viewpoint, illumination, and resolution that manifest even within individual wide-angle cameras. Our dataset boasts significantly more cameras than other popular benchmarks and also comes with 40 additional categorical attributes annotated by participants. 

 

Participants in NTU-Outdoor-38 have provided informed consent to the use of their images for academic purposes. Furthermore, the faces of these subjects have been blurred. All participants have to manually aggree with our privacy policy before participating our dataset collection. Our privacy policy screenshot from mobile app is shown below:

 

NTU-Outdoor-38 captures the inter-camera performance imbalances present in real-life camera networks and serves as an excellent test-bed to further study this problem. To the best of our knowledge, it is the only publicly available Person ReID dataset collected from over 30 cameras.

 

Dataset Statistics

The following table presents a breakdown of the dataset attributes of NTU-Outdoor-38:

Property

Value

Train Images

23018

Query Images

1386

Gallery Images

23943

Total Images

48347

Train Appearance IDs

274

Query Appearance IDs

227

Gallery Appearance IDs

275

Total Appearance IDs

549

Cameras

38

Binary Attributes

40

Camera Type

Surveillance

Height

Lamp Post

Detector

YOLOV3

 

Dataset File Structure

Folder

Description

bounding_box_train

The training images. This folder contains the images from 274 unique appearances.

query

The query images. Each of them is from different identities and appearances in different cameras.

bounding_box_test

The gallery images. We retrieve a query from this image pool.


 

Image File Naming Rules

Given an example image file: "9_2018-10-24_16h57m44s_G1849_134_228_2070_0_female_black_blue_t-shirt_short_backpack_high.jpg", tokenize the file name using the underscore “_” as separator. The following table describes each token in order of appearance as applied to our example:

Index

Description

0

"9" is the camera id

1

"2018-10-24" is the date

2

"16h57m44s" is the start time for video recording during data collection

3

"G1849" is the record id in our system

4

"134" is the person id (out of 274 ids)

5

"228" is the appearance id. (out of 549 appearances)

6

"2070" is the frame number (video recorded at 30 frames per second)

7

"0" is the person index among all people detected within this frame

8

"female" is the gender of the person

9

"black" is the top clothing color

10

"blue" is the bottom clothing color

11

"t-shirt" is the top clothing type

12

"short" is the bottom clothing type (shorts)

13

An unfixed number of additional attributes (such as “backpack”) follow…

14

The last attribute is the annotation confidence, “high”