PyTorch

**PyTorch**

原作者	Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan
開發者	Meta AI（英语：Meta AI）
首次发布	2016年10月，7年前（2016-October）
当前版本	2.3.0 (2024年4月24日；穩定版本)^[1]
源代码库	github.com/pytorch/pytorch
编程语言	Python, C++, CUDA
操作系统	Linux, macOS, Windows
平台	IA-32, x86-64, ARM64
类型	机器学习和深度学习库
许可协议	BSD许可证
网站	pytorch.org

机器学习与数据挖掘

范式监督学习無監督學習線上機器學習元学习（英语：Meta-learning (computer science)）半监督学习自监督学习强化学习基于规则的机器学习（英语：Rule-based machine learning）量子機器學習
问题统计分类生成模型迴歸分析聚类分析降维密度估计（英语：density estimation）异常检测数据清洗自动机器学习关联规则学习語意分析结构预测（英语：Structured prediction）特征工程表征学习排序学习（英语：Learning to rank）语法归纳（英语：Grammar induction）本体学习（英语：Ontology learning）多模态学习（英语：Multimodal learning）
监督学习 (分类 · 回归) 学徒学习（英语：Apprenticeship learning）决策树学习集成学习 Bagging 提升方法随机森林 k-NN 線性回歸朴素贝叶斯人工神经网络邏輯斯諦迴歸感知器相关向量机（RVM）支持向量机（SVM）迁移学习微调
聚类分析 BIRCH CURE算法（英语：CURE algorithm）层次 k-平均 Fuzzy 期望最大化（EM） DBSCAN OPTICS 均值飘移（英语：Mean shift）
降维因素分析 CCA ICA LDA NMF（英语：Non-negative matrix factorization） PCA PGD（英语：Proper generalized decomposition） t-SNE（英语：t-distributed stochastic neighbor embedding） SDL
结构预测（英语：Structured prediction）圖模式貝氏網路條件隨機域隐马尔可夫模型
异常检测 RANSAC k-NN 局部异常因子（英语：Local outlier factor）孤立森林（英语：Isolation forest）
人工神经网络自编码器認知計算深度学习 DeepDream（英语：DeepDream）多层感知器 RNN LSTM GRU（英语：Gated recurrent unit） ESN（英语：Echo state network）储备池计算（英语：reservoir computing）受限玻尔兹曼机 GAN SOM CNN U-Net Transformer Vision transforme（英语：Vision transformer）脉冲神经网络（英语：Spiking neural network） Memtransistor（英语：Memtransistor）电化学RAM（英语：Electrochemical RAM）（ECRAM）
强化学习 Q学习 SARSA 时序差分（TD）多智能体（英语：Multi-agent reinforcement learning） Self-play（英语：Self-play (reinforcement learning technique)） RLHF
与人类学习主动学习（英语：Active learning (machine learning)）众包 Human-in-the-loop（英语：Human-in-the-loop）
模型诊断学习曲线（英语：Learning curve (machine learning)）
数学基础内核机器（英语：Kernel machines）偏差–方差困境（英语：Bias–variance tradeoff）计算学习理论（英语：Computational learning theory）经验风险最小化奥卡姆学习（英语：Occam learning） PAC学习（英语：Probably approximately correct learning）统计学习 VC理论
大会与出版物 NeurIPS ICML（英语：International Conference on Machine Learning） ICLR ML（英语：Machine Learning (journal)） JMLR（英语：Journal of Machine Learning Research）
相关条目人工智能术语（英语：Glossary of artificial intelligence）机器学习研究数据集列表（英语：List of datasets for machine-learning research）机器学习概要（英语：Outline of machine learning）
查论编

PyTorch是一个开源的Python 机器学习库，基于Torch（英语：Torch (machine_learning)）库^[2]^[3]^[4]，底层由C++实现，应用于人工智能领域，如计算机视觉和自然语言处理^[5]。它最初由Meta Platforms的人工智能研究团队开发，現在屬於Linux基金会的一部分^[6]^[7]^[8]。它是在修改後的BSD許可證下發布的自由及开放源代码软件。儘管Python接口更加完善並且是開發的主要重點，但 PyTorch 也有C++接口^[9]。

許多深度學習軟體都是基於 PyTorch 構建的，包括特斯拉自动驾驶^[10]、Uber的Pyro^[11]、Hugging Face的Transformers^[12]、 PyTorch Lightning^[13]^[14]、和Catalyst^[15]^[16]。

概述

PyTorch主要有两大特征：^[17]

类似于NumPy的张量计算，能在 GPU 或 MPS 等硬件加速器上加速；
基于带自动微分系统^[18]^[19]的深度神经网络^[20]。

PyTorch包括torch.autograd、torch.nn、torch.optim等子模块^[20]。

PyTorch包含多种损失函数，包括 MSE（均方误差 = L2 范数）、交叉熵损失和负熵似然损失（对分类器有用）等。

PyTorch張量

PyTorch定義了一個名為張量(torch.Tensor) 的類別來儲存和操作同構多維矩形數字陣列。 PyTorch張量與NumPy陣列類似，但也可以在支援 CUDA 的英伟达 GPU 上運作。 PyTorch 也一直在開發對其他 GPU 平台的支持，例如 AMD 的 ROCm 和 Apple 的Metal Framework^[21]。

张量是 PyTorch 中的核心数据抽象，PyTorch 支援各種張量子類型^[22]。通常地，一维张量称为向量（vector），二维张量称为矩阵（matrix）。

张量的数据类型包括：

torch.bool
torch.int8
torch.uint8
torch.int16
torch.int32
torch.int64
torch.half
torch.float
torch.double
torch.bfloat

PyTorch神經網絡

神经网络由对数据执行操作的层/模块组成。 torch.nn 命名空间提供了使用者需要的所有构建块来构建自己的神经网络。PyTorch 中的每个模块都对应nn.模块。神经网络本身是由其他模块（层）组成的模块。这种嵌套结构允许使用者轻松构建并管理复杂的架构。神经网络中的许多层都是参数化的，即具有相关的权重以及在训练期间优化的偏差。自动子类化跟踪模型对象中定义的所有字段，并生成所有参数可使用模型或方法访问。^[2]

import torch                     # for all things PyTorch
import torch.nn as nn            # for torch.nn.Module, the parent object for PyTorch models
import torch.nn.functional as F  # for the activation function

激活函数torch.nn.Module具有封装所有主要内容的对象激活功能，包括 ReLU 及其许多变体、Tanh、 Hardtanh、sigmoid 等。^[3]

PyTorch模型常见图层类型

线性层

最基本的神经网络层类型是线性或完全连接层。在这个层中，每个输入都会影响每个图层的输出到由图层权重指定的程度。如果模型有 m 个输入和 n 个输出，权重将是一个 m x n 矩阵。

卷积层

卷积层旨在处理高度空间相关性。它们在计算机视觉中非常常用，它们检测组成的特征的紧密分组更高级别的功能。它们也会在其他上下文中弹出。例如，在 NLP 应用程序中，单词的直接上下文（即序列中附近的其他单词）可以影响语句。

循环层

递归神经网络（RNN）是用于顺序数据（从科学仪器到时间序列测量）的自然语言句子。

例子

下面的程序用简单的例子展示这个程序库的低层功能。

>>> import torch
>>> dtype = torch.float
>>> device = torch.device("cpu") # 本次在CPU上执行所有的计算
>>> # device = torch.device("cuda:0") # 本次在GPU上执行所有的计算
>>> 
>>> # 建立一个张量并用随机数填充这个张量
>>> a = torch.randn(2, 3, device=device, dtype=dtype)
>>> print(a) # 输出张量a
tensor([[-0.1460, -0.3490,  0.3705],
        [-1.1141,  0.7661,  1.0823]])
>>> 
>>> # 建立一个张量并用随机数填充这个张量
>>> b = torch.randn(2, 3, device=device, dtype=dtype)
>>> print(b) # 输出张量B
tensor([[ 0.6901, -0.9663,  0.3634],
        [-0.6538, -0.3728, -1.1323]])
>>> 
>>> print(a*b) # 输出两个张量的乘积
tensor([[-0.1007,  0.3372,  0.1346],
        [ 0.7284, -0.2856, -1.2256]])
>>> print(a.sum()) # 输出在张量a中所有元素的总和
tensor(0.6097)
>>> 
>>> print(a[1,2]) # 输出第2行第3列（0起始）的元素
tensor(1.0823)
>>> 
>>> print(a.max()) # 输出在张量a中的极大值
tensor(1.0823)

下列代码块展示了nn模块提供的高层功能的例子。例子中定义了具有线性层的神经网络。

import torch
from torch import nn # 从PyTorch中导入nn子模块 

class NeuralNetwork(nn.Module): # 神经网络被定义为类
    def __init__(self): # 在__init__方法中定义诸层和变量
        super(NeuralNetwork, self).__init__() # 必须出现在所有网络中
        self.flatten = nn.Flatten() # 定义一个压平层
        self.linear_relu_stack = nn.Sequential( # 定义诸层的一个堆栈
            nn.Linear(28*28, 512), # 线性层有一个输入和输出形状
            nn.ReLU(), # ReLU是nn提供的诸多激活函数之一
            nn.Linear(512, 512),
            nn.ReLU(),
            nn.Linear(512, 10), 
        )

    def forward(self, x): # 这个函数定义前向传递。
        x = self.flatten(x)
        logits = self.linear_relu_stack(x)
        return logits

参考文献

^ ^1.0 ^1.1 Release 2.3.0. 2024年4月24日 [2024年4月25日].
^ ^2.0 ^2.1 Yegulalp, Serdar. Facebook brings GPU-powered machine learning to Python. InfoWorld. 19 January 2017 [11 December 2017]. （原始内容存档于2018-07-12）.
^ ^3.0 ^3.1 Lorica, Ben. Why AI and machine learning researchers are beginning to embrace PyTorch. O'Reilly Media. 3 August 2017 [11 December 2017]. （原始内容存档于2019-05-17）.
^ Ketkar, Nikhil. Deep Learning with Python. Apress, Berkeley, CA. 2017: 195–208 [2018-10-02]. ISBN 9781484227657. doi:10.1007/978-1-4842-2766-4_12. （原始内容存档于2018-07-12）（英语）.
^ Natural Language Processing (NLP) with PyTorch — NLP with PyTorch documentation. dl4nlp.info. [2017-12-18]. （原始内容存档于2019-06-21）（英语）.
^ Patel, Mo. When two trends fuse: PyTorch and recommender systems. O'Reilly Media. 2017-12-07 [2017-12-18]. （原始内容存档于2019-03-30）（英语）.
^ Mannes, John. Facebook and Microsoft collaborate to simplify conversions from PyTorch to Caffe2. TechCrunch. [2017-12-18]. （原始内容存档于2020-07-06）（英语）. FAIR is accustomed to working with PyTorch — a deep learning framework optimized for achieving state of the art results in research, regardless of resource constraints. Unfortunately in the real world, most of us are limited by the computational capabilities of our smartphones and computers.
^ Arakelyan, Sophia. Tech giants are using open source frameworks to dominate the AI community. VentureBeat. 2017-11-29 [2017-12-18]. （原始内容存档于2019-03-30）（美国英语）.
^ The C++ Frontend. PyTorch Master Documentation. [2019-07-29].
^ Karpathy, Andrej. PyTorch at Tesla - Andrej Karpathy, Tesla.
^ Uber AI Labs Open Sources Pyro, a Deep Probabilistic Programming Language. Uber Engineering Blog. 2017-11-03 [2017-12-18] （美国英语）.
^ PYTORCH-TRANSFORMERS: PyTorch implementations of popular NLP Transformers, PyTorch Hub, 2019-12-01 [2019-12-01]
^ PYTORCH-Lightning: The lightweight PyTorch wrapper for ML researchers. Scale your models. Write less boilerplate, Lightning-Team, 2020-06-18 [2020-06-18]
^ Ecosystem Tools. pytorch.org. [2020-06-18] （英语）.
^ GitHub - catalyst-team/catalyst: Accelerated DL & RL, Catalyst-Team, 2019-12-05 [2019-12-05]
^ Ecosystem Tools. pytorch.org. [2020-04-04] （英语）.
^ PyTorch – About. pytorch.org. [2018-06-11]. （原始内容存档于2018-06-15）.
^ R.E. Wengert. A simple automatic derivative evaluation program. Comm. ACM. 1964, 7: 463–464. doi:10.1145/355586.364791.
^ Bartholomew-Biggs, Michael; Brown, Steven; Christianson, Bruce; Dixon, Laurence. Automatic differentiation of algorithms (PDF). Journal of Computational and Applied Mathematics. 2000, 124 (1-2): 171–190. Bibcode:2000JCoAM.124..171B. doi:10.1016/S0377-0427(00)00422-2.
^ ^20.0 ^20.1 神经网络与PyTorch实战 Application of Neural Network and PyTorch. 机械工业出版社. 2018. ISBN 9787111605775.
^ Introducing Accelerated PyTorch Training on Mac. pytorch.org. [2022-06-04] （英语）.
^ An Introduction to PyTorch – A Simple yet Powerful Deep Learning Library. analyticsvidhya.com. 2018-02-22 [2018-06-11].

参见

深度学习软件比较（英语：Comparison of deep learning software）
人工神经网络
深度学习
机器学习
损失函数
激活函数

外部链接

官方网站
从 GitHub 访问 PyTorch 教程
PyTorch基金会的Youtube账号

查论编深度学习软件（英语：Comparison of deep learning software）

开源软件	Apache Singa（英语：Apache Singa） Blocks（英语：Blocks） Caffe Deeplearning4j Dlib（英语：Dlib） Microsoft Cognitive Toolkit MXNet OpenNN（英语：OpenNN） PyTorch scikit-learn LangChain Gradio RETURNN（英语：RETURNN） TensorFlow Keras Theano Torch（英语：Torch (machine learning)）

专有	Neural Designer（英语：Neural Designer） Wolfram Mathematica

分类比较

可微分计算

概论

可微分编程
自動微分
张量微积分（英语：Tensor calculus）
信息几何
统计流形
神经形态工程（英语：Neuromorphic engineering）
模式识别
运算学习理论（英语：Computational learning theory）
归纳偏置

概念

梯度下降
- SGD（英语：Stochastic gradient descent）
聚类
回归
- 过拟合
幻觉
对抗（英语：Adversarial machine learning）
注意力
卷积
損失函數
反向传播
激活函数
- softmax
- sigmoid
- ReLU
正则化
数据集
扩散（英语：Diffusion process）
自回归

应用

硬件

TPU
VPU
IPU（英语：Graphcore）
憶阻器
SpiNNaker（英语：SpiNNaker）

软件库

Theano
TensorFlow
- Keras
PyTorch
JAX
Flux.jl（英语：Flux (machine-learning framework)）

实现

视觉·语音	AlexNet WaveNet 人像合成手寫识别 OCR 语音合成语音识别人脸识别 AlphaFold DALL-E Midjourney Stable Diffusion Sora Whisper（英语：Whisper (speech recognition system)）

自然语言	Word2vec Seq2seq BERT LaMDA Bard NMT 辩手项目（英语：Project Debater）沃森 GPT GPT-1 GPT-2 GPT-3 GPT-4 GPT-J（英语：GPT-J） ChatGPT 文心一言 Chinchilla AI（英语：Chinchilla AI） PaLM（英语：PaLM） BLOOM（英语：BLOOM (language model)） LLaMA

决策	AlphaGo Q学习 SARSA OpenAI Five（英语：OpenAI Five）自动驾驶 MuZero 行动选择（英语：Action selection） Auto-GPT 机器人控制（英语：Robot control）