Hi,
I would like to ask a question about generative pretraining in scGPT. As far as I know, GPT also uses a transformer decoder, and one of its main training schemes is next-token prediction. In scGPT's pretraining, what does the "next token" correspond to? I also don't understand the difference between the BERT architecture and the scGPT architecture.
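For context, here is a minimal sketch of what "next token" means in standard GPT-style pretraining on ordinary token sequences, which is the scheme the question refers to. This is a toy PyTorch model, not scGPT's actual code; the names (`TinyCausalLM`, `vocab_size`, etc.) and hyperparameters are illustrative assumptions. The key point is that the target at position i is simply the input token at position i + 1.

```python
import torch
import torch.nn as nn

# Illustrative toy sizes, not scGPT's settings.
vocab_size, d_model, seq_len = 100, 32, 8

class TinyCausalLM(nn.Module):
    """Hypothetical minimal causal LM, for illustration only."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=1)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):
        # Causal mask: position i may only attend to positions <= i.
        mask = nn.Transformer.generate_square_subsequent_mask(tokens.size(1))
        h = self.encoder(self.embed(tokens), mask=mask)
        return self.head(h)  # (batch, seq_len, vocab_size) logits

tokens = torch.randint(0, vocab_size, (2, seq_len))
logits = TinyCausalLM()(tokens)

# "Next token" objective: the prediction at position i is scored against
# the input token at position i + 1 (i.e. the inputs shifted left by one).
loss = nn.functional.cross_entropy(
    logits[:, :-1].reshape(-1, vocab_size),  # predictions for positions 0..L-2
    tokens[:, 1:].reshape(-1),               # targets: the shifted inputs
)
```

In scGPT, gene expression data is an unordered set of (gene, expression) pairs rather than a naturally ordered sequence, so the question of what plays the role of the "next token" there is exactly what this issue is asking.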