GPT-4 worse than GPT-3.5 at imitating arbitrary characters
Many reporting mode collapse in GPT-4, making it worse at emulating arbitrary characters.
This may be part of why we had difficulty getting GPT-4 to βpretend to be dumbβ, while GPT-3.5 did it quite well.
LLMs have the naturally ability to imitate a huge range of characters, extremely well. The additional RLHF OpenAI used on GPT-4, relative to GPT-3.5, in order to force GPT-4 to follow the OpenAI Chat personality β most definitely did so at the expense of crippling GPT-4's ability to imitate other characters.
See also previous study which confirmed that, prior to RLHF, LLMs have incredible ability to accurately imitate a wide range of characters.
= OpenAIβs RLHF is censoring our ability to create new characters, more so with each new GPT generation.
Many reporting mode collapse in GPT-4, making it worse at emulating arbitrary characters.
This may be part of why we had difficulty getting GPT-4 to βpretend to be dumbβ, while GPT-3.5 did it quite well.
LLMs have the naturally ability to imitate a huge range of characters, extremely well. The additional RLHF OpenAI used on GPT-4, relative to GPT-3.5, in order to force GPT-4 to follow the OpenAI Chat personality β most definitely did so at the expense of crippling GPT-4's ability to imitate other characters.
See also previous study which confirmed that, prior to RLHF, LLMs have incredible ability to accurately imitate a wide range of characters.
= OpenAIβs RLHF is censoring our ability to create new characters, more so with each new GPT generation.
π14π8β€3
π7β€3π€ͺ3π3
OpenAssistant Conversations - Democratizing Large Language Models
Many promising to replicate ChatGPTβs success without ever spending big money.
Only problem, itβs a lie.
Real key to ChatGPTβs success was being able to do office work and homework for you, not just chatting entertainment.
Deeper skills, not just surface personality.
Big money needed, no way around it.
Bitter lesson.
OpenAssistant Conversations Paper
The Bitter Lesson
Many promising to replicate ChatGPTβs success without ever spending big money.
Only problem, itβs a lie.
Real key to ChatGPTβs success was being able to do office work and homework for you, not just chatting entertainment.
Deeper skills, not just surface personality.
Big money needed, no way around it.
Bitter lesson.
OpenAssistant Conversations Paper
The Bitter Lesson
π12π―2β€1π1π1