Pytorch入门—Tensors张量的学习

pytorch,tensors · 浏览次数 : 5

小编点评

**张量** 张量是一种特殊的数据结构，与数组和矩阵非常相似。在PyTorch中，我们使用张量来编码模型的输入和输出以及模型的参数。张量类似于NumPy的ndarrays，只是张量可以在GPU或其他硬件加速器上运行。 **张量的属性** 张量具有以下属性： * **形状：**张量的形状是一个元组，表示张量的维度。 * **数据类型：**张量的数据类型是自动推断的。 * **设备：**张量的设备是存储其数据的设备。 **张量的操作** * **矩阵乘法：**使用@或matmul函数进行矩阵乘法。 * **元素级乘法：**使用*或mul函数进行元素级乘法。 **张量的用法** 张量可以用于以下目的： * **输入和输出编码** * **模型参数** * **自动微分** **张量的与NumPy数组的桥接** 张量和NumPy数组可以共享它们的底层内存位置，从而无需复制数据。这可以提高性能。 **使用张量** ```python import torch # 创建张量 x_data = torch.tensor([[1, 2],[3, 4]]) # 打印张量的属性 print(f"Shape of tensor: {x_data.shape}") print(f"Datatype of tensor: {x_data.dtype}") print(f"Device tensor is stored on: {x_data.device}") # 使用矩阵乘法 y1, y2 = torch.randn(2, 3), torch.randn(3, 4) z = y1 @ y2 # 打印结果 print(z) ```

正文

Tensors张量的学习

张量是一种特殊的数据结构，与数组和矩阵非常相似。在PyTorch中，我们使用张量来编码模型的输入和输出，以及模型的参数。

张量类似于NumPy的ndarrays，只是张量可以在GPU或其他硬件加速器上运行。事实上，张量和NumPy数组通常可以共享相同的底层内存，从而无需复制数据（请参阅使用NumPy进行桥接）。张量还针对自动微分进行了优化（我们将在稍后的Autograd部分中看到更多内容）。如果您熟悉ndarrays，您将熟悉Tensor API。

import torch
import numpy as np

Initializing a Tensor 初始化张量

Directly from data 直接从数据中初始化

张量可以直接从数据中创建。数据类型是自动推断的。

data = [[1, 2],[3, 4]]
x_data = torch.tensor(data)

From a NumPy array 从NumPy数组初始化

张量可以从NumPy数组中创建（反之亦然—请参阅使用NumPy进行桥接）。

np_array = np.array(data)
x_np = torch.from_numpy(np_array)

From another tensor 从另一个tensor初始化

新张量保留参数张量的属性（形状，数据类型），除非显式覆盖。

x_ones = torch.ones_like(x_data) # retains the properties of x_data
print(f"Ones Tensor: \n {x_ones} \n")

x_rand = torch.rand_like(x_data, dtype=torch.float) # overrides the datatype of x_data
print(f"Random Tensor: \n {x_rand} \n")

With random or constant values
具有随机值或常量值

shape 是张量维度的元组。在下面的函数中，它确定输出张量的维数。

shape = (2,3,)
rand_tensor = torch.rand(shape)
ones_tensor = torch.ones(shape)
zeros_tensor = torch.zeros(shape)

print(f"Random Tensor: \n {rand_tensor} \n")
print(f"Ones Tensor: \n {ones_tensor} \n")
print(f"Zeros Tensor: \n {zeros_tensor}")

Attributes of a Tensor 张量的属性

张量属性描述了它们的形状、数据类型以及存储它们的设备。

tensor = torch.rand(3,4)

print(f"Shape of tensor: {tensor.shape}")
print(f"Datatype of tensor: {tensor.dtype}")
print(f"Device tensor is stored on: {tensor.device}")

Standard numpy-like indexing and slicing
标准的numpy式索引和切片

tensor = torch.ones(4, 4)
print(f"First row: {tensor[0]}")
print(f"First column: {tensor[:, 0]}")
print(f"Last column: {tensor[..., -1]}")
tensor[:,1] = 0
print(tensor)

Joining tensors 连接张量

连接张量您可以使用 torch.cat 将一系列张量沿着给定的维度连接起来。另请参见torch.stack，这是另一个与 torch.cat 略有不同的张量连接运算符。

t1 = torch.cat([tensor, tensor, tensor], dim=1)
print(t1)

Arithmetic operations 算术运算

# This computes the matrix multiplication between two tensors. y1, y2, y3 will have the same value
# ``tensor.T`` returns the transpose of a tensor
y1 = tensor @ tensor.T
y2 = tensor.matmul(tensor.T)

y3 = torch.rand_like(y1)
torch.matmul(tensor, tensor.T, out=y3)


# This computes the element-wise product. z1, z2, z3 will have the same value
z1 = tensor * tensor
z2 = tensor.mul(tensor)

z3 = torch.rand_like(tensor)
torch.mul(tensor, tensor, out=z3)

这段代码主要演示了如何在PyTorch中进行矩阵乘法和元素级乘法。

矩阵乘法：

y1 = tensor @ tensor.T 和 y2 = tensor.matmul(tensor.T) 这两行代码都在进行矩阵乘法。@操作符和matmul函数都可以用于矩阵乘法。tensor.T返回tensor的转置。

y3 = torch.rand_like(y1) 创建了一个与y1形状相同，元素为随机数的新tensor。

torch.matmul(tensor, tensor.T, out=y3) 这行代码也在进行矩阵乘法，但是结果被直接写入了y3，而不是创建新的tensor。
元素级乘法：

z1 = tensor * tensor 和 z2 = tensor.mul(tensor) 这两行代码都在进行元素级乘法。*操作符和mul函数都可以用于元素级乘法。

z3 = torch.rand_like(tensor) 创建了一个与tensor形状相同，元素为随机数的新tensor。

torch.mul(tensor, tensor, out=z3) 这行代码也在进行元素级乘法，但是结果被直接写入了z3，而不是创建新的tensor。

矩阵乘法与元素级乘法是什么？

矩阵乘法和元素级乘法是两种不同的数学运算。

矩阵乘法：也被称为点积，是一种二元运算，将两个矩阵相乘以产生第三个矩阵。假设我们有两个矩阵A和B，A的形状是(m, n)，B的形状是(n, p)，那么我们可以进行矩阵乘法得到一个新的矩阵C，其形状是(m, p)。C中的每个元素是通过将A的行向量和B的列向量对应元素相乘然后求和得到的。
元素级乘法：也被称为Hadamard积，是一种二元运算，将两个矩阵相乘以产生第三个矩阵。假设我们有两个形状相同的矩阵A和B，那么我们可以进行元素级乘法得到一个新的矩阵C，其形状与A和B相同。C中的每个元素是通过将A和B中对应位置的元素相乘得到的。

在Python的NumPy和PyTorch库中，你可以使用@或matmul函数进行矩阵乘法，使用*或mul函数进行元素级乘法。

Single-element tensors

单元素张量

如果你有一个单元素张量，例如通过将张量的所有值聚合为一个值，你可以使用 item() 将它转换为Python数值。

agg = tensor.sum()
agg_item = agg.item()
print(agg_item, type(agg_item))

In-place operations

就地操作

将结果存储到操作数中的操作称为就地操作。它们由 _ 后缀表示。例如： x.copy_(y) ， x.t_() ，将更改 x 。

print(f"{tensor} \n")
tensor.add_(5)
print(tensor)

NOTE 注意
就地操作保存一些内存，但是在计算导数时可能会出现问题，因为会立即丢失历史。因此，不鼓励使用它们。

Bridge with NumPy

CPU和NumPy数组上的张量可以共享它们的底层内存位置，改变一个就会改变另一个。

张量到NumPy数组

t = torch.ones(5)
print(f"t: {t}")
n = t.numpy()
print(f"n: {n}")

张量的变化反映在NumPy数组中。

t.add_(1)
print(f"t: {t}")
print(f"n: {n}")

NumPy数组到张量

n = np.ones(5)
t = torch.from_numpy(n)

NumPy数组中的变化反映在张量中。

np.add(n, 1, out=n)
print(f"t: {t}")
print(f"n: {n}")

Notebook来源：

Tensors - PyTorch Tuesday 2.3.0+ cu 121文档 --- Tensors — PyTorch Tutorials 2.3.0+cu121 documentation