Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
176 Bytes
This
method takes care of encoding the input and feeding the encoded hidden states via cross-attention layers to the decoder
and auto-regressively generates the decoder output.