SegSperm - a dataset of sperm images for blurry and small object segmentation - Open Research Data - MOST Wiedzy

Wyszukiwarka

SegSperm - a dataset of sperm images for blurry and small object segmentation

Opis

Many deep learning applications require figure-ground segmentation. The performance of segmentation models varies across modalities and acquisition settings.

The release of SegSperm dataset intends to contribute to the development of algorithms for:

(1) segmentation of small objects,

(2) segmentation of elongated objects,

(3) segmentation of blurred objects,

(4) Computer-Assisted Sperm Analysis (CASA) systems.

As embryologists seek high-quality sperm with desired shape and motion attributes to increase chances of fertilization, sperm assessment can take advantage of the segmentation task. Despite the remarkable progress of deep neural networks in figure-ground segmentation in the last decade, blurry microscopic images of minuscule sperm, with spatially uneven contrast and light variations, continue to challenge modern deep neural networks.

Most works prioritize efforts toward detecting sperm heads to simplify the problem at the expense of accuracy. Notably, the tail has also been shown to play a key role in assessing sperm quality. On the other hand, the segmentation of the flagellum is much more challenging than the segmentation of the head due to the microscopic video characteristics. The low field depth of microscopes blurs the images, easily confusing human raters in discerning minuscule sperm from large backgrounds during image labeling. To spur further progress on training full sperm segmentation algorithms with noisy and imbalanced ground truth labels, we release SegSperm - a new dataset of microscopic images of sperm.

The SegSperm dataset contains microscopic images of full sperm. The images were selected from videos of Intracytoplasmic Sperm Injection (ICSI) procedures. The dataset consists of 551 gray images of resolution 512×512 pixels and associated binary ground truth masks of full sperm. The training set consists of 432 images from 40 videos and the test set consists of 119 images from 9 videos. The binary masks of spermatozoa were segmented manually by one rater. In addition, 23 images of sperm from the validation set were annotated by two more raters. The ground truth masks of each rater were split into head and tail part.

If you use the SegSperm dataset in your research, please cite the following paper:

Lewandowska, Emilia, Daniel Węsierski, Magdalena Mazur-Milecka, Joanna Liss, and Anna Jezierska. "Ensembling noisy segmentation masks of blurred sperm images." Computers in Biology and Medicine (2023): 107520.

Ilustracja publikacji

Plik z danymi badawczymi

SegSperm.zip
27.4 MB, S3 ETag 788aa5f628bd54a444752ca735a99919-1, pobrań: 130
Hash pliku liczony jest ze wzoru
hexmd5(md5(part1)+md5(part2)+...)-{parts_count} gdzie pojedyncza część pliku jest wielkości 512 MB

Przykładowy skrypt do wyliczenia:
https://github.com/antespi/s3md5
pobierz plik SegSperm.zip

Informacje szczegółowe o pliku

Licencja:
Creative Commons: by-nc 4.0 otwiera się w nowej karcie
CC BY-NC
Użycie niekomercyjne

Informacje szczegółowe

Rok publikacji:
2023
Data zatwierdzenia:
2023-02-28
Język danych badawczych:
angielski
Dyscypliny:
  • informatyka techniczna i telekomunikacja (Dziedzina nauk inżynieryjno-technicznych)
  • inżynieria biomedyczna (Dziedzina nauk inżynieryjno-technicznych)
DOI:
Identyfikator DOI 10.34808/6wm7-1159 otwiera się w nowej karcie
Weryfikacja:
Politechnika Gdańska

Słowa kluczowe

Cytuj jako

wyświetlono 644 razy