There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.