The backbone extracts features from an input image, the neck combines and enhances the extracted features, and the head is used for the main task (e.g., object detection).
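
The three-stage split described above can be illustrated with a minimal pure-Python sketch. Everything in it is an assumption made for illustration: the function names (`backbone`, `neck`, `head`, `detect`) are hypothetical and not taken from any library, average pooling stands in for learned convolutional layers, the neck uses a simple top-down fusion in the spirit of a feature pyramid, and the head reports only the peak location per level rather than real bounding boxes.

```python
from typing import List, Tuple

FeatureMap = List[List[float]]


def avg_pool2x2(x: FeatureMap) -> FeatureMap:
    """Halve height and width by averaging each 2x2 block."""
    h, w = len(x) // 2, len(x[0]) // 2
    return [
        [
            (x[2 * i][2 * j] + x[2 * i][2 * j + 1]
             + x[2 * i + 1][2 * j] + x[2 * i + 1][2 * j + 1]) / 4.0
            for j in range(w)
        ]
        for i in range(h)
    ]


def upsample2x(x: FeatureMap) -> FeatureMap:
    """Double height and width by nearest-neighbour repetition."""
    out: FeatureMap = []
    for row in x:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def backbone(image: FeatureMap, levels: int = 3) -> List[FeatureMap]:
    """Extract multi-scale features, finest level first.

    Toy stand-in for a CNN; assumes the image sides are divisible by 2**levels.
    """
    feats = [avg_pool2x2(image)]
    for _ in range(levels - 1):
        feats.append(avg_pool2x2(feats[-1]))
    return feats


def neck(feats: List[FeatureMap]) -> List[FeatureMap]:
    """Combine features top-down: add each upsampled coarser map to the finer one."""
    fused = [feats[-1]]
    for finer in reversed(feats[:-1]):
        up = upsample2x(fused[0])
        merged = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(finer, up)]
        fused.insert(0, merged)
    return fused


def head(fused: List[FeatureMap]) -> List[Tuple[int, int, float]]:
    """Task-specific output: (row, col, score) of the strongest response per level."""
    preds = []
    for fmap in fused:
        score, r, c = max(
            (v, i, j) for i, row in enumerate(fmap) for j, v in enumerate(row)
        )
        preds.append((r, c, score))
    return preds


def detect(image: FeatureMap) -> List[Tuple[int, int, float]]:
    """Run the full pipeline: backbone -> neck -> head."""
    return head(neck(backbone(image)))
```

Keeping the stages as separate functions mirrors how such models are usually composed: under these assumptions, any one stage could be swapped out without touching the other two, as long as the list-of-feature-maps interface between them is preserved.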