模型名称 | 发布时间 | 发布机构 | 参数规模 | Token 规模 | 相关资料 | 是否开源 |
---|---|---|---|---|---|---|
T5 | 2019-10 | 13B | https://arxiv.org/pdf/1910.10683.pdf ‣ | Y | ||
GPT-3 | 2020-05 | OpenAI | 175B | 300B | https://arxiv.org/pdf/2005.14165.pdf | |
LaMDA | 2021-05 | 137B | 2.8T | https://arxiv.org/pdf/2201.08239.pdf | ||
OPT | 2022-05 | Meta | 125M-175B | 180B | https://arxiv.org/pdf/2205.01068.pdf https://github.com/facebookresearch/metaseq/tree/main/projects/OPT | |
https://www.bilibili.com/video/BV1XT411v7c9 | Y | |||||
GLM-130B | 2022-08 | THU | 130B | 400B | https://arxiv.org/pdf/2210.02414.pdf ‣ | Y |
LLaMA | 2023-02 | Meta | 7B-65B | 1.4T | https://arxiv.org/pdf/2302.13971v1.pdf https://github.com/facebookresearch/llama | |
https://zhuanlan.zhihu.com/p/618776565 | Y | |||||
BLOOM | 202-07 | BigScience | 176B | 366B | https://arxiv.org/pdf/2211.05100.pdf https://huggingface.co/bigscience | Y |
More