File size: 1,063 Bytes
57bdca5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34

Backbone
A backbone is a model used for feature extraction for higher level computer vision tasks such as object detection and image classification. Transformers provides an [AutoBackbone] class for initializing a Transformers backbone from pretrained model weights, and two utility classes:

[~utils.BackboneMixin] enables initializing a backbone from Transformers or timm and includes functions for returning the output features and indices.
[~utils.BackboneConfigMixin] sets the output features and indices of the backbone configuration.

timm models are loaded with the [TimmBackbone] and [TimmBackboneConfig] classes.
Backbones are supported for the following models:

BEiT
BiT
ConvNet
ConvNextV2
DiNAT
DINOV2
FocalNet
MaskFormer
NAT
ResNet
Swin Transformer
Swin Transformer v2
ViTDet

AutoBackbone
[[autodoc]] AutoBackbone
BackboneMixin
[[autodoc]] utils.BackboneMixin
BackboneConfigMixin
[[autodoc]] utils.BackboneConfigMixin
TimmBackbone
[[autodoc]] models.timm_backbone.TimmBackbone
TimmBackboneConfig
[[autodoc]] models.timm_backbone.TimmBackboneConfig