WBCAtt: White Blood Cell Attribute Dataset

This page introduces WBCAtt, a densely morphological annotation for white blood cell (WBC) images. It establishes the foundation for interpreting WBC recognition models. 

 

 

WBCAtt Annotations

The WBCAtt contains morphological annotations for 10,298 WBC images from the PBC dataset. Each WBC image is annotated with 11 attributes in terms of its cell structure: overall cell, nucleus, cytoplasm, and granules. The attributes and their possible values are tabulated in the table below. For further details, please see our paper: https://arxiv.org/pdf/2306.13531.pdf

 

Attribute

Possible Values

Cell-Size

Big, Small

Cell-Shape

Round, Irregular

Nucleus-Shape

Segmented-Bilobed, Unsegmented-Indented, Segmented-Multilobed, Unsegmented-Round, Unsegmented-Band, Irregular

Nuclear-Cytoplasmic-Ratio

Low, High

Chromatin-Density

Densely, Loosely

Cytoplasm-Vacuole

Yes, No

Cytoplasm-Texture

Clear, Frosted

Cytoplasm-Color

Light Blue, Blue, Purple Blue

Granule-Type

Small, Round, Coarse, Nil

Granule-Color

Pink, Red, Purple, Nil

Granularity

Yes, No

 

Annotation contents

pbc_attr_v1_train.csv, pbc_attr_v1_val.csv, and pbc_attr_v1_test.csv contain attribute annotations for the train/val/test splits. Please refer to https://github.com/apple2373/wbcatt/tree/main/submission for instructions to run an attribute predictor. 
•    cell_size, cell_shape, nucleus_shape, nuclear_cytoplasmic_ratio, chromatin_density, cytoplasm_vacuole, cytoplasm_texture, cytoplasm_colour, granule_type, granule_colour, granularity: The attribute columns.
•    img_name: This is the image file name. It can serve as a unique identifier.
•    label: One of the five WBC types (neutrophils, eosinophils, basophils, monocytes, and lymphocytes) provided by the PBC dataset.
•    path: Image path organized by the PBC dataset.


Release data statistics

Number of WBC images: 10,298
Number of image-attribute pairs: 113,278

WBC Class

Basophil

Eosinophil

Lymphocyte

Monocyte

Neutrophil

Total

No. of images

1,218

3,117

1,214

1,420

3,329

10,298

 

Sample images

The sample images below are from the PBC dataset, with the attribute annotations from WBCAtt. 

                                                                             

Attribute Value
Cell-Size Big
Cell-Shape Round
Nucleus-Shape Unsegmented-Band
Nuclear-Cytoplasmic-Ratio Low
Chromatin-Density Densely
Cytoplasm-Vacuole No
Cytoplasm-Texture Clear
Cytoplasm-Color Light Blue
Granule-Type Small
Granule-Color Pink
Granularity Yes
Attribute Value
Cell-Size Big
Cell-Shape Irregular
Nucleus-Shape Unsegmented-Round
Nuclear-Cytoplasmic-Ratio Low
Chromatin-Density Loosely
Cytoplasm-Vacuole Yes
Cytoplasm-Texture Clear
Cytoplasm-Color Light Blue
Granule-Type Nil
Granule-Color Nil
Granularity No

 

Citation

All publications using the WBCAtt annotations should include the following acknowledgement: “(Portions of) the research in this paper used the WBCAtt annotations made available by the ROSE Lab at Nanyang Technological University, Singapore.”
Any publications that result from using the WBCAtt annotations should cite the following paper:
•    Satoshi Tsutsui, Winnie Pang, and Bihan Wen. (2023). WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological Attributes. Advances in Neural Information Processing Systems (NeurIPS).

 

Download

Please find the following files available at https://github.com/apple2373/wbcatt/tree/main/submission.
•    pbc_attr_v1_train.csv
•    pbc_attr_v1_val.csv
•    pbc_attr_v1_test.csv