Multimodal Learning
| Num | Title | Field | Desc | Author | Time | Read |
|---|---|---|---|---|---|---|
| 2022 | | | | | | |
| | BLIP: Bootstrapping Language-Image Pre-training | Vision-language pre-training | | Introduced by Li et al. | 2022 | |
| | BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | | Bootstrapping language-image pre-training using frozen image encoders and large language models | | 2023 | |