- 文章标题:BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
- 文章地址:https://arxiv.org/abs/2305.14720
- NIPS 2024


- 数据:LAION, COCO, Visual Genome, Conceptual Captions, OpenImage-V6
- 指标:DINO,CLIP-I,CLIP-T
- 硬件:16 A100/bs16
- 开源:https://github.com/salesforce/LAVIS/tree/main/projects/blip-diffusion