聊天机器人,如何训练?

V2EX = way to explore

V2EX 是一个关于分享和探索的地方

现在注册

已注册用户请登录

这是一个创建于 485 天前的主题，其中的信息可能已经有所发展或是发生改变。

如题,喂他几本书,然后在给她一些资料, 怎么根据这些资料, 能训练出高质量的聊天机器人呢?

训练

聊天机器人

资料

7 条回复 • 2024-10-29 13:24:50 +08:00

musi

2024 年 10 月 28 日

几本书就想高质量？那也不用 Scaling Law 了

kaichen

PRO

2024 年 10 月 28 日

大力出奇迹，几本书是不够，要很多很多。

参考，推理能力超过 gpt-3.5 的 Llama3

- https://ai.meta.com/blog/meta-llama-3/
- https://ai.meta.com/blog/meta-llama-3-1/

> Meta reports on Llama 3.1's page on Huggingface, using 39.3 million hours of H100 80GB instances to train all 3.1 models (8, 70, 400 B).

大概是，两万四千张 H100 训练 74 天

> Llama 3 is pretrained on over 15T tokens that were all collected from publicly available sources.

大概等同于 60TB 数据，在它的技术报告里，提到这是更大的数据集上做清洗去重的精华

---

所以先有这么多的资源才能训练得到高质量机器人