Using some publicly available data, and assuming each of these models is trained in a way similar to Chinchilla, we can compare the performance of GPT-4 to GPT-2, GPT-3, Chinchilla, and PaLM.
Let's calculate what GPT-4's performance would be if it used 10x more parameters without retrieval, and naively assume that will be its performance with retrieval. This chart is what we get.
With the algorithmic adjustment, the qualitative improvement from GPT-3 (vanilla) to GPT-4 is comparable to the improvement from GPT-2 to GPT-3. Since that was a rather big jump, I expect many will be stunned by GPT-4, especially those who expected strong diminishing returns.
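The extrapolation above can be sketched with the Chinchilla parametric loss fit, L(N, D) = E + A/N^α + B/D^β (Hoffmann et al. 2022, Approach 3). The model sizes below and the 20-tokens-per-parameter ratio are illustrative assumptions, not official figures:

```python
# Chinchilla-style loss fit (Hoffmann et al. 2022, Approach 3 constants).
E, A, B = 1.69, 406.4, 410.7
alpha, beta = 0.34, 0.28

def loss(n_params, n_tokens):
    """Predicted pretraining loss (nats/token) for N parameters, D tokens."""
    return E + A / n_params**alpha + B / n_tokens**beta

# Illustrative sizes: GPT-3 at 175B params, a hypothetical 10x-larger model,
# both assumed compute-optimally trained at ~20 tokens per parameter.
gpt3_n = 175e9
gpt4_n = 10 * gpt3_n  # the "10x more parameters" assumption from the post

l3 = loss(gpt3_n, 20 * gpt3_n)
l4 = loss(gpt4_n, 20 * gpt4_n)
print(f"GPT-3-scale loss: {l3:.3f}, 10x-scale loss: {l4:.3f}")
```

Under this fit the loss keeps falling smoothly with scale rather than flattening out, which is the sense in which strong diminishing returns are not yet expected.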
"In short: Training runs of large Machine Learning systems are likely to last less than 14-15 months. This is because longer runs will be outcompeted by runs that start later and therefore use better hardware and better algorithms."
https://www.lesswrong.com/posts/RihYwmskuJT9Rkbjq/the-longest-training-run