Opis
We introduce Vident-real, a large dataset of 100 video sequences of intra-oral scenes from real conservative dental treatments performed at the Medical University of Gdańsk, Poland. The dataset can be used for multi-task learning methods including:
- video enhancement
- video segmentation
- motion estimation
- video stabilization
The dataset allows for training and validating models on multiple vision-based tasks in challenging real conditions characterized by compromised visibility. The recordings were acquired with a tiny micro-camera firmly attached to dental handpieces with various dental burs and tools. The dental scenes were crowded due to the presence of dental tools and artifacts and featured occlusions, appearance variations, tool-teeth interactions, bleeding, motion blur, light reflections, splashing water and other fluids, and camera fouling.
Since the sequences recorded real dental treatment procedures, collecting target labels from referential, additional sensors in confined spaces is impractical. In the whole dataset, each input video frame, which is corrupted due to sensor miniaturization and other common adversarial factors, is paired with pseudo-labels of:
- enhanced frame
- segmented teeth
- teeth-based homography (4 dof / similarity) between consecutive frames
Vident-real contains 100 real intra-oral videos of 70K frames recorded during conservative treatment procedures. All sequences were recorded in RAW 10-bit format through a wide-angle lens with the sensor's resolution of 800x800 pixels and high sampling frequency ranging from 55 to 60 Hz. The RAW images were debayerized and stored in JPEG format. Sensor's gain and integration time were manually adjusted to each patient's intra-oral cavity to account for on-site low-light conditions thereby improving visibility and colors in the dynamically changing environment.
A miniaturized camera affixed to a dental handpiece could allow dentists to continuously monitor the progress of conservative dental interventions. Camera-augmented dental interventions hold the potential to facilitate dental training and education, optimize workflow ergonomics, and improve patient outcomes. For safe and effective navigation in the mouth, the necessary miniaturization of sensors and optics introduces artifacts to video streams. The inevitable camera shakes result in eye fatigue. The unique challenges posed by intra-oral conditions, such as noise, blur, texture paucity, light variations, shadows, reflections, and fluid dynamics make continuous macro-visualization of complex dental scenes on customized displays difficult. Enhancement of videos acquired in these challenging conditions appears as a natural step towards advancing the field of Video-Assisted Dentistry (VAD), enabling clearer view of the teeth, fractures, gums, blood, cavities, fillings, dentine, pulp, and dental tools.
Plik z danymi badawczymi
hexmd5(md5(part1)+md5(part2)+...)-{parts_count}
gdzie pojedyncza część pliku jest wielkości 512 MBPrzykładowy skrypt do wyliczenia:
https://github.com/antespi/s3md5
Informacje szczegółowe o pliku
- Licencja:
-
otwiera się w nowej karcieCC BY-NCUżycie niekomercyjne
- Embargo na plik:
- 2024-09-30
Informacje szczegółowe
- Rok publikacji:
- 2024
- Data zatwierdzenia:
- 2024-07-01
- Język danych badawczych:
- angielski
- Dyscypliny:
-
- informatyka techniczna i telekomunikacja (Dziedzina nauk inżynieryjno-technicznych)
- automatyka, elektronika, elektrotechnika i technologie kosmiczne (Dziedzina nauk inżynieryjno-technicznych)
- nauki medyczne (Dziedzina nauk medycznych i nauk o zdrowiu)
- inżynieria biomedyczna (Dziedzina nauk inżynieryjno-technicznych)
- DOI:
- Identyfikator DOI 10.34808/vjnh-9c35 otwiera się w nowej karcie
- Ethical papers:
- Zgoda nr KB-14/22 wystawiona przez Bioethics Committee at the Regional Medical Chamber in Gdańsk
- Weryfikacja:
- Politechnika Gdańska
Słowa kluczowe
- dental interventions
- video processing
- video enhancement
- motion estimation
- video segmentation
- video restoration
- dental imaging
- multi-task learning
- pseudo-labels
- video-assisted dentistry
Powiązane zasoby
- dane badawcze Vident-lab: a dataset for multi-task video processing of phantom dental scenes
- dane badawcze Vident-synth: a synthetic intra-oral video dataset for optical flow estimation
- publikacja Multi-task Video Enhancement for Dental Interventions
Cytuj jako
Autorzy
wyświetlono 585 razy