Once you're confident that both implementations yield the same output, as verified with
torch.allclose(original_output, output, atol=1e-3), you're done with the most difficult part!
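The comparison can be sketched as follows. This is a minimal illustration, not part of the original guide: the helpers outputs_match and max_abs_diff are hypothetical names, and the tensors are random stand-ins for the outputs of the original and the ported model. Only torch.allclose and the atol=1e-3 tolerance come from the text.

```python
import torch


def outputs_match(original_output, output, atol=1e-3):
    """Return True if both tensors have the same shape and agree within atol.

    Hypothetical helper: the explicit shape check avoids silent broadcasting
    inside torch.allclose when the shapes differ.
    """
    if original_output.shape != output.shape:
        return False
    return torch.allclose(original_output, output, atol=atol)


def max_abs_diff(original_output, output):
    """Largest element-wise absolute difference, useful when the check fails."""
    return (original_output - output).abs().max().item()


# Stand-in tensors (assumption): in practice these would be the forward-pass
# outputs of the original model and of the new implementation on the same input.
torch.manual_seed(0)
original_output = torch.randn(2, 4)
output = original_output + 1e-5  # simulate small numerical drift

if outputs_match(original_output, output):
    print("Outputs match within tolerance.")
else:
    print("Mismatch, max abs diff:", max_abs_diff(original_output, output))
```

When the check fails, printing the largest absolute difference usually helps to tell a small precision gap from a real implementation error.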