Refactor Attention UNet #115

Merged
merged 2 commits into DeepTrackAI:develop on Jun 20, 2024

Conversation

HarshithBachimanchi
Collaborator

This PR refactors AttentionUNet. The following changes make it simpler and more modular, and produce a much cleaner print output:

  1. DoubleConvBlock is removed. It was used to implement residual connections at all levels of the UNet; these are now integrated directly into the FeatureIntegrationModule through a couple of Block modules.
  2. The time step encoding (and the class and context embeddings) is now integrated in the middle of the residual connections, rather than at the end (see the sketch after this list).
  3. Added a new parameter num_attention_heads to control the number of heads in the self-attention and cross-attention layers. Also added a unit test for this.
  4. Removed a warning that was disabling classifier-free guidance for context inputs.
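
As a rough sketch of points 1 and 2 (not the actual implementation; the module and argument names here are hypothetical), a residual block that injects the embedding in the middle of the skip connection, rather than after it, could look like this:

```python
import torch
import torch.nn as nn


class ResidualBlockSketch(nn.Module):
    """Hypothetical sketch: the combined time step / class / context
    embedding is added in the middle of the residual branch (between the
    two conv blocks), and the skip connection is handled directly instead
    of through a separate DoubleConvBlock."""

    def __init__(self, channels, embedding_dim):
        super().__init__()
        self.block1 = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.GroupNorm(1, channels),
            nn.GELU(),
        )
        # Projects the embedding to the channel dimension so it can be
        # broadcast-added to the feature map.
        self.embed_proj = nn.Linear(embedding_dim, channels)
        self.block2 = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.GroupNorm(1, channels),
            nn.GELU(),
        )

    def forward(self, x, emb):
        h = self.block1(x)
        # Embedding injected mid-block rather than at the end.
        h = h + self.embed_proj(emb)[:, :, None, None]
        h = self.block2(h)
        return x + h
```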

To refactor this into deeplay style, several styles would need to be implemented for Conv2dBlock and then integrated into UNet2d. Some of them already exist, but not in the form I need (for example, the styles spatial_self_attention and spatial_cross_attention). I gave it a quick try, and it looks possible to build it with styles, but that would require extensive testing. I prefer to keep AttentionUNet bespoke for now (unless you have other suggestions).

1. Removed DoubleConvBlock, which was used for the residual connections. A skip connection is now included directly in the FeatureIntegrationModule.
2. Time step embeddings (and other embeddings) are now added within the residual block, rather than at the end.
3. Removed some warnings.
4. Added the `num_attention_heads` parameter to control the self-attention and cross-attention heads in the model (see the usage sketch below).
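
A minimal usage sketch of the new parameter: only `num_attention_heads` is introduced by this PR; the import path and the other constructor arguments shown here are assumptions and may differ from the actual API.

```python
# Hypothetical usage sketch: only `num_attention_heads` comes from this PR.
# The import path and remaining constructor arguments are assumptions and
# may not match the actual deeplay API.
from deeplay import AttentionUNet

unet = AttentionUNet(
    channels=[64, 128, 256],   # illustrative channel configuration
    num_attention_heads=4,     # new: heads used by self- and cross-attention
)
print(unet)  # the refactor also yields a much cleaner printed representation
```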
@giovannivolpe merged commit cf60b97 into DeepTrackAI:develop on Jun 20, 2024
12 checks passed