Transformers

GPT is a Transformer-based architecture and training procedure for natural language processing tasks. Training proceeds in two stages. First, a language modeling objective is applied to unlabeled data to learn the initial parameters of a neural network model. These parameters are then adapted to a target task using the corresponding supervised objective.

Source: Improving Language Understanding by Generative Pre-Training
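
The two objectives described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the paper's implementation: it assumes the language modeling objective is the usual sum of next-token log-probabilities over a context window of size k, and that the supervised objective is the sum of label log-probabilities over labeled examples. The function names, the window size k, and the uniform stand-in models are all hypothetical.

```python
import math

def lm_log_likelihood(tokens, next_token_prob, k):
    """Stage 1 (pre-training) objective on unlabeled text:
    sum over i of log P(u_i | u_{i-k}, ..., u_{i-1})."""
    total = 0.0
    for i in range(len(tokens)):
        # Context is the (up to) k tokens that precede position i.
        context = tuple(tokens[max(0, i - k):i])
        total += math.log(next_token_prob(context, tokens[i]))
    return total

def supervised_log_likelihood(examples, label_prob):
    """Stage 2 (fine-tuning) objective on labeled data:
    sum over (x, y) of log P(y | x)."""
    return sum(math.log(label_prob(x, y)) for x, y in examples)

# Hypothetical stand-in models: uniform over a 4-token vocabulary
# and over 2 labels. A real model would be a trained Transformer.
def uniform_lm(context, token):
    return 1 / 4

def uniform_clf(x, y):
    return 1 / 2

l1 = lm_log_likelihood(["a", "b", "c"], uniform_lm, k=2)
l2 = supervised_log_likelihood([(("a", "b"), 0), (("c",), 1)], uniform_clf)
print(l1, l2)
```

In an actual two-stage setup, the parameters that maximize the first quantity would serve as the starting point for maximizing the second; here the two functions only show what each stage measures.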

Tasks

Task                    Papers   Share
Language Modelling      88       11.24%
Large Language Model    51       6.51%
Question Answering      34       4.34%
Prompt Engineering      25       3.19%
Text Generation         23       2.94%
Retrieval               21       2.68%
Sentence                20       2.55%
Decision Making         20       2.55%
In-Context Learning     20       2.55%