This GitHub repository demonstrates the utilization of UNETR for multiclass image segmentation on the Landmark Guided Face Parsing dataset (LaPa) dataset. .
The block diagram of the Original UNETR model. |
LaPa stands for Landmark guided face Parsing dataset (LaPa). It is a large-scale dataset for human face parsing. It consists of more than 22,000 facial images with abundant variations in expression, pose and occlusion, and each image of LaPa is provided with a 11-category pixel-level label map and 106-point landmarks.
Download the dataset: Landmark guided face Parsing dataset (LaPa)
For more: LaPa-Dataset for face parsing
The sequence in the images below is Input Image
, Ground Truth
and Prediction
.
- Train on more epochs.
- Increase the input image resolution.
- Apply data augmentation.
For more follow me on: