einops包中的rearrange，reduce, repeat及einops.layers.torch中的Rearrange，Reduce。对高维数据的处理方式 - 博客

[{"createTime":1735734952000,"id":1,"img":"hwy_ms_500_252.jpeg","link":"https://activity.huaweicloud.com/cps.html?fromacct=261f35b6-af54-4511-a2ca-910fa15905d1&utm_source=V1g3MDY4NTY=&utm_medium=cps&utm_campaign=201905","name":"华为云秒杀","status":9,"txt":"华为云38元秒杀","type":1,"updateTime":1735747411000,"userId":3},{"createTime":1736173885000,"id":2,"img":"txy_480_300.png","link":"https://cloud.tencent.com/act/cps/redirect?redirect=1077&cps_key=edb15096bfff75effaaa8c8bb66138bd&from=console","name":"腾讯云秒杀","status":9,"txt":"腾讯云限量秒杀","type":1,"updateTime":1736173885000,"userId":3},{"createTime":1736177492000,"id":3,"img":"aly_251_140.png","link":"https://www.aliyun.com/minisite/goods?userCode=pwp8kmv3","memo":"","name":"阿里云","status":9,"txt":"阿里云2折起","type":1,"updateTime":1736177492000,"userId":3},{"createTime":1735660800000,"id":4,"img":"vultr_560_300.png","link":"https://www.vultr.com/?ref=9603742-8H","name":"Vultr","status":9,"txt":"Vultr送$100","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":5,"img":"jdy_663_320.jpg","link":"https://3.cn/2ay1-e5t","name":"京东云","status":9,"txt":"京东云特惠专区","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":6,"img":"new_ads.png","link":"https://www.iodraw.com/ads","name":"发布广告","status":9,"txt":"发布广告","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":7,"img":"yun_910_50.png","link":"https://activity.huaweicloud.com/discount_area_v5/index.html?fromacct=261f35b6-af54-4511-a2ca-910fa15905d1&utm_source=aXhpYW95YW5nOA===&utm_medium=cps&utm_campaign=201905","name":"底部","status":9,"txt":"高性能云服务器2折起","type":2,"updateTime":1735660800000,"userId":3}]

from einops import rearrange, reduce, repeat from einops.layers.torch import
Rearrange, Reduce
一.rearrange和Rearrange，作用：从函数名称也可以看出是对张量尺度进行重排，

区别：
1.einops.layers.torch中的Rearrange，用于搭建网络结构时对张量进行“隐式”的处理

例如：
class PatchEmbedding(nn.Module): def __init__(self, in_channels: int = 3,
patch_size: int = 16, emb_size: int = 768, img_size: int = 224):
self.patch_size = patch_size super().__init__() self.projection =
nn.Sequential( # using a conv layer instead of a linear one -> performance
gains nn.Conv2d(in_channels, emb_size, kernel_size=patch_size,
stride=patch_size), Rearrange('b e (h) (w) -> b (h w) e'), )
这里的Rearrange('b e (h) (w) -> b (h w) e')
，表示将4维张量转换为3维，且原来的最后两维合并为一维：（16，512，4，16）->（16，64，512）
这样只要我们知道初始的张量维度就可以操作注释来对其进行维度重排。

2.eniops中的rearrange,用于对张量‘显示’的处理，是一个函数

例如：
rearrange(images, 'b h w c -> b (h w) c')
将4维张量转换为3维,同样的，只要我们知道初始维度，就可以操作注释对其进行重排
值得注意的是：这里的注释给定以后就代表当前维度，不能更改,例如：
image = torch.randn(1,2,3,2) # torch.Size([1,2,3,2]) out = rearrange(image, 'b
c h w -> b (c h w)', c=2,h=3,w=2) # torch.Size([1,12]) # h,w的值更改 err1 =
rearrange(image, 'b c h w -> b (c h w)', c=2,h=2,w=3) # 报错
二.repeat:即将tensor中的某一维度进行重复，以扩充该维度数量
B = 16 cls_token = torch.randn(1, 1, emb_size) cls_tokens = repeat(cls_token,
'() n e -> b n e', b=B)#维度为1的时候可用（）代替
将(1,1,emb_size)的张量处理为（B,1,emb_size）
R = 16 a = torch.randn(2,3,4) b = repeat(a, 'b n e -> (r b) n e', r = R) #(2R,
3, 4) c = repeat(a, 'b n e -> b (r n) e', r = R) #(2, 3R, 4) #错误用法: d =
repeat(a, 'b n e -> c n e', c = 2R)
#将（2,3,4）维张量处理为(2R, 3, 4)......
上面都是同纬度的扩充，我们看一个升维的扩充：
R = 5 a = torch.randn(2, 3, 4) d = repeat(a,'b n e-> b n c e ', c = R)
#将（2,3,4）维张量处理为(2, 3, 5, 4)......

这里我们同样只须操作维度注释即可完成相应的张量操作。

三.Reduce 和 reduce：
x = torch.randn(100, 32, 64) # perform max-reduction on the first axis: y0 =
reduce(x, 't b c -> b c', 'max') #(32, 64) #指定h2,w2，相当于指定池化核的大小 x =
torch.randn(10, 512, 30, 40) # 2d max-pooling with kernel size = 2 * 2 y1 =
reduce(x, 'b c (h1 h2) (w1 w2) -> b c h1 w1', 'max', h2=2, w2=2) #(10, 512, 15,
20) # go back to the original height and width y2 = rearrange(y1, 'b (c h2 w2)
h1 w1 -> b c (h1 h2) (w1 w2)', h2=2, w2=2) #(10, 128, 30, 40)
#指定h1,w1，相当于指定池化后张量的大小 # 2d max-pooling to 12 * 16 grid: y3 = reduce(x, 'b c
(h1 h2) (w1 w2) -> b c h1 w1', 'max', h1=12, w1=16) #(10, 512, 12, 16) # 2d
average-pooling to 12 * 16 grid: y4 = (reduce(x, 'b c (h1 h2) (w1 w2) -> b c h1
w1', 'mean', h1=12, w1=16) #(10, 512, 12, 16) # Global average pooling y5 =
reduce(x, 'b c h w -> b c', 'mean') #(10, 512)
Redece同理。

注意：这里我们以张量为例，einops同样可以处理numpy下的数据

技术

Java1212 篇
Python927 篇
开发语言608 篇
c语言463 篇
算法461 篇
MySQL438 篇
数据库394 篇
前端387 篇
更多...