Ideally, you should debug/print out | |
intermediate outputs of both implementations of the forward pass to find the exact position in the network where the 🤗 | |
Transformers implementation shows a different output than the original implementation. |
Ideally, you should debug/print out | |
intermediate outputs of both implementations of the forward pass to find the exact position in the network where the 🤗 | |
Transformers implementation shows a different output than the original implementation. |