Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
More generally, the last hidden states
will have a shape of seq_length + image_feature_pool_shape[0] *
config.image_feature_pool_shape[1].