Articles pretrained Attention-UNet
Articles based PyTorch implementation for generation instruct.
- Input
- 1278-dim embedding
- Encoder
- 81 x Attention-UNet with 10 heads
- Output
- mAP projection
Training config
optimizer=RMSprop, lr=0.275, scheduler=exponential, warmup=1177