Transformer (machine learning model) wikipedia

约 2,150,000 个结果

时间不限

在新选项卡中打开链接

查看更多
前往 Wikipedia 查看全部内容
Wikipedia
https://en.wikipedia.org/wiki/Transformer_(deep...
Transformer (deep learning architecture) - Wikipedia
The transformer model has been implemented in standard deep learning frameworks such as TensorFlow and PyTorch. Transformers is a library produced by Hugging Face that supplies transformer-based architectures and pretrained models. 展开
Overview
A transformer is a deep learning architecture developed by researchers at Google and based on the multi-head attention mechanism, proposed in a 2017 paper "Attention Is All You Need". Text is converted to numerical … 展开
Training
Methods for stabilizing training
The plain transformer architecture had difficulty converging. In the original paper the authors recommended using learning rate warmup. That is, the learning rate should linearly scale up from 0 to maximal value for the first part of … 展开
Full transformer architecture
Sublayers
Each encoder layer contains 2 sublayers: the self-attention and the feedforward network. Each decoder layer contains 3 sublayers: the causally masked self-attention, the cross-attention, and the feedforward network. 展开
Applications
The transformer has had great success in natural language processing (NLP). Many large language models such as GPT-2, GPT-3, 展开
History
Predecessors
For many years, sequence modelling and generation was done by using plain recurrent neural networks (RNNs). A well-cited early example was the 展开
Architecture
All transformers have the same primary components:
• Tokenizers, which convert text into tokens. 展开
Subsequent work
Alternative activation functions
The original transformer uses ReLU activation function. Other activation functions were developed. … 展开
来自维基百科
内容
Overview
History
Training
Architecture
Full transformer architecture
查看所有章节
CC-BY-SA 许可证中的维基百科文本
反馈
谢谢!告诉我们更多信息
维基百科
https://zh.wikipedia.org/wiki/Transformer模型
Transformer模型 - 维基百科，自由的百科全书
概览
背景
架构
训练
应用
实现
参见
Transformer模型（直译为“变换器”）是一种采用注意力机制的深度学习模型，这一机制可以按输入数据各部分重要性的不同而分配不同的权重。该模型主要用于自然语言处理（NLP）与计算机视觉（CV）领域。
与循环神经网络（RNN）一样，Transformer模型旨在处理自然语言等顺序输入数据，可应用于翻译、文本摘要等任务。而与RNN不同的是，Transformer …
Wikipedia · CC-BY-SA 许可下的文字
Transformer (machine learning model) wikipedia 的视频

bing.com/videos
观看视频
5:50
What are Transformers (Machine Learning Model)?
已浏览 39.8万次2022年3月11日
YouTubeIBM Technology
观看视频
9:11
Transformers, explained: Understand the model behind GPT, BERT, and T5
已浏览 93.9万次2021年8月18日
YouTubeGoogle Cloud Tech
在 nvidia.com 上观看视频
What Is a Transformer Model?
10 个月之前
nvidia.com
观看视频
44:26
What are Transformer Models and how do they work?
已浏览 11.7万次10 个月之前
YouTubeSerrano.Academy
Wikipedia
https://en.wikipedia.org/wiki/BERT_(language_model)
BERT (language model) - Wikipedia
Design
Performance
Analysis
History
Recognition
Further Reading
External Links
BERT is an "encoder-only" transformerarchitecture. On a high level, BERT consists of three modules: 1. embedding. This module converts an array of one-hot encoded tokens into an array of vectors representing the tokens. 2. a stack of encoders. These encoders are the Transformer encoders. They perform transformations over the array of representation...
在en.wikipedia.org上查看更多信息
Wikipedia
https://en.wikipedia.org/wiki/Attention_I…
Attention Is All You Need - Wikipedia
网页"Attention Is All You Need" [1] is a 2017 landmark [2] [3] research paper in machine learning authored by eight scientists working at Google. The paper introduced a new deep learning architecture known as the …
标记:
Machine Learning
Deep learning
Wikipedia
https://simple.wikipedia.org/wiki/Transformer...
Transformer (machine learning model) - Simple English Wikipedia, …
网页A transformer is a computer model used for deep learning, which is a kind of machine learning where computers teach themselves. Transformers were introduced in a 2017 …
标记:
Machine Learning
Deep learning
Language model
IBM
https://www.ibm.com/topics/transform…
What is a Transformer Model? - IBM
网页A transformer model is a type of deep learning model that was introduced in 2017. These models have quickly become fundamental in natural language processing (NLP), and have been applied to a wide range of …
缺失:

wikipedia
必须包含:

wikipedia
标记:
Machine Learning
Deep learning
Language model
维基百科
https://zh.wikipedia.org/zh-hant/Transformer模型
Transformer模型 - 維基百科，自由的百科全書 - zh.wikipedia.org
网页Transformer模型（直譯為「變換器」）是一種採用注意力機制的深度學習模型，這一機制可以按輸入數據各部分重要性的不同而分配不同的權重。該模型主要用於自然語言處理 …
标记:
Machine Learning
Textstyle
Google Research
http://research.google/blog/transformer …
Transformer: A Novel Neural Network Architecture for …
网页In “Attention Is All You Need”, we introduce the Transformer, a novel neural network architecture based on a self-attention mechanism that we believe to be particularly well suited for language understanding.
标记:
Understanding
Language
arXiv.org
https://arxiv.org/abs/2304.10557
[2304.10557] An Introduction to Transformers - arXiv.org
网页2023年4月20日 · The transformer is a neural network component that can be used to learn useful representations of sequences or sets of data-points. The transformer has driven …
标记:
arXiv:2304.10557 [cs.LG
Introduction
arXiv.org
https://arxiv.org/pdf/2311.17633
[PDF]
Introduction to Transformers: an NLP Perspective - arXiv.org
网页Transformers have dominated empirical machine learning models of natural language pro-cessing. In this paper, we introduce basic concepts of Transformers and present key tech …
标记:
Machine Learning
Language model
Perspective Magazine
其他用户还搜索过
transformer based deep learning model
transformer for machine learning
transformer model explained
transformer based language models
transformer based models
transformer based deep learning
Transformer (machine learning model) wikipedia 的相关搜索
分页
- 1
- 2
- 3
- 4
- 下一页

Transformer (deep learning architecture) - Wikipedia

Transformer模型 - 维基百科，自由的百科全书

Transformer (machine learning model) wikipedia 的视频

BERT (language model) - Wikipedia

Attention Is All You Need - Wikipedia

Transformer (machine learning model) - Simple English Wikipedia, …

What is a Transformer Model? - IBM

缺失:

必须包含:

Transformer模型 - 維基百科，自由的百科全書 - zh.wikipedia.org

Transformer: A Novel Neural Network Architecture for …

[2304.10557] An Introduction to Transformers - arXiv.org

Introduction to Transformers: an NLP Perspective - arXiv.org

Transformer (machine learning model) wikipedia 的相关搜索

浏览更多