pytorch中的卷积和池化计算方式详解

2023-08-03 14:57:04 77

TensorFlow里面的padding只有两个选项也就是valid和same

pytorch里面的padding么有这两个选项，它是数字0,1,2,3等等，默认是0

所以输出的h和w的计算方式也是稍微有一点点不同的：tf中的输出大小是和原来的大小成倍数关系，不能任意的输出大小；而nn输出大小可以通过padding进行改变

nn里面的卷积操作或者是池化操作的H和W部分都是一样的计算公式：H和W的计算

classtorch.nn.MaxPool2d(kernel_size,stride=None,padding=0,dilation=1,return_indices=False,ceil_mode=False):
"""
Parameters:
kernel_size–thesizeofthewindowtotakeamaxover
stride–thestrideofthewindow.默认值是kernel_size
padding–implicitzeropaddingtobeaddedonbothside,默认值是0
dilation–aparameterthatcontrolsthestrideofelementsinthewindow，默认值是1
return_indices–ifTrue,willreturnthemaxindicesalongwiththeoutputs.UsefulwhenUnpoolinglater
ceil_mode–whenTrue,willuseceilinsteadoffloortocomputetheoutputshape，向上取整和向下取整，默认是向下取整
"""

不一样的地方在于：第一点，步长stride默认值，上面默认和设定的kernel_size一样，下面默认是1；第二点，输出通道的不一样，上面的输出通道和输入通道是一样的也就是没有改变特征图的数目，下面改变特征图的数目为out_channels

classtorch.nn.Conv2d(in_channels,out_channels,kernel_size,stride=1,padding=0,dilation=1,groups=1,bias=True):
pass
"""
Parameters:
in_channels(int)–Numberofchannelsintheinputimage
out_channels(int)–Numberofchannelsproducedbytheconvolution
kernel_size(intortuple)–Sizeoftheconvolvingkernel
stride(intortuple,optional)–Strideoftheconvolution.Default:1,默认是1
padding(intortuple,optional)–Zero-paddingaddedtobothsidesoftheinput.Default:0
dilation(intortuple,optional)–Spacingbetweenkernelelements.Default:1
groups(int,optional)–Numberofblockedconnectionsfrominputchannelstooutputchannels.Default:1
bias(bool,optional)–IfTrue,addsalearnablebiastotheoutput.Default:True
"""

第三点不一样是卷积有一个参数groups,将特征图分开给不同的卷积进行操作然后再整合到一起，xception就是利用这一个。

"""
Atgroups=1,allinputsareconvolvedtoalloutputs.
Atgroups=2,theoperationbecomesequivalenttohavingtwoconvlayerssidebyside,eachseeinghalftheinputchannels,andproducinghalftheoutputchannels,andbothsubsequentlyconcatenated.
Atgroups=in_channels,eachinputchannelisconvolvedwithitsownsetoffilters(ofsize⌊out_channelsin_channels⌋
).
"""

pytorchAvgPool2d函数

classtorch.nn.AvgPool2d(kernel_size,stride=None,padding=0,
ceil_mode=False,count_include_pad=True):
pass
"""
kernel_size:thesizeofthewindow
stride:thestrideofthewindow.Defaultvalueis:attr:`kernel_size`
padding:implicitzeropaddingtobeaddedonbothsides
ceil_mode:whenTrue,willuse`ceil`insteadof`floor`tocomputetheoutputshape
count_include_pad:whenTrue,willincludethezero-paddingintheaveragingcalculation
"""

shape的计算公式，在（h,w)位置处的输出值的计算。

pytorch中的F.avg_pool1d（）平均池化操作作用于一维，input的维度是三维比如［２,２,７］。F.avg_pool1d（）中核ｓｉｚｅ是３，步长是２表示每三个数取平均，每隔两个数取一次．比如[1,3,3,4,5,6,7]安照3个数取均值，两步取一次，那么结果就是[2.3333,4,6]，也就是核是一维的，也只作用于一个维度。按照池化操作计算公式inputsize为[2,2,7],kernelsize为3，步长为2，则输出维度计算（7-3）/2+1=3所以输出维度是[2,2,3]，这与输出结果是一致的。

pytorch中的F.avg_pool2d（），input是维度是４维如［２，２，４，４］，表示这里批量数是２也就是两张图像，这里通道数量是２，图像是size是４＊４的．核size是（２，２），步长是（２，２）表示被核覆盖的数取平均，横向纵向的步长都是２．那么核是二维的，所以取均值时也是覆盖二维取的。输出中第一个1.5的计算是：(1+2+1+2)/4=1.5.表示第一张图像左上角的四个像素点的均值。按照池化操作计算公式inputsize为[2,2,4,4],kernelsize为2*2，步长为2，则输出维度计算（4-2）/2+1=2所以输出维度是[2,2,2,2]，这与输出结果是一致的。

Conv3d函数

classtorch.nn.Conv3d(in_channels,out_channels,kernel_size,stride=1,
padding=0,dilation=1,groups=1,bias=True):
pass
"""
in_channels(int):Numberofchannelsintheinputimage
out_channels(int):Numberofchannelsproducedbytheconvolution
kernel_size(intortuple):Sizeoftheconvolvingkernel
stride(intortuple,optional):Strideoftheconvolution.Default:1
padding(intortuple,optional):Zero-paddingaddedtoallthreesidesoftheinput.Default:0
dilation(intortuple,optional):Spacingbetweenkernelelements.Default:1
groups(int,optional):Numberofblockedconnectionsfrominputchannelstooutputchannels.Default:1
bias(bool,optional):If``True``,addsalearnablebiastotheoutput.Default:``True``
Shape:
-Input::math:`(N,C_{in},D_{in},H_{in},W_{in})`
-Output::math:`(N,C_{out},D_{out},H_{out},W_{out})`
"""
C_out=out_channels

以上这篇pytorch中的卷积和池化计算方式详解就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持毛票票。

声明：本文内容来源于网络，版权归原作者所有，内容由互联网用户自发贡献自行上传，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任。如果您发现有涉嫌版权的内容，欢迎发送邮件至：czq8825#qq.com（发邮件时，请将#更换为@）进行举报，并提供相关证据，一经查实，本站将立刻删除涉嫌侵权内容。

pytorch中的卷积和池化计算方式详解

热门推荐

随机推荐