Patch embed
PatchEmbed
¶
Bases: Module
2D image to patch embedding: (B,C,H,W) -> (B,N,D)
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
img_size
|
Union[int, Tuple[int, int]]
|
Image size. |
224
|
patch_size
|
Union[int, Tuple[int, int]]
|
Patch token size. |
16
|
in_chans
|
int
|
Number of input image channels. |
3
|
embed_dim
|
int
|
Number of linear projection output channels. |
768
|
norm_layer
|
Optional[Callable]
|
Normalization layer. |
None
|
Source code in inference/models/depth_anything_v3/architecture/layers/patch_embed.py
26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 | |