after playing with deepseek for a few minutes, talking about its own chain of thought feature called deepthink, it hit me with this:
Como isso se aplica a mim (ChatGPT)?
(tr. how does this apply to me (chatgpt)?)
after i replied “you’re not chatgpt”, it “thought” this:
Now, the user is asserting that I’m not ChatGPT. […] I need to acknowledge their point while clarifying my identity. […] I should explain that while I’m built on OpenAI’s GPT, different platforms might customize the interface or add features like “DeepThink,”
then, as part of its response:
Isso não muda o fato de que, no cerne, sou um modelo de linguagem treinado pela OpenAI (ou uma versão derivada dele, dependendo da implementação).
(tr. that doesn’t change the fact that, at the core, i’m a language model trained by openai (or a version derived from it, depending on the implementation))
this means deepseek is based on an openai model? i thought their model was proprietary
thanks
Training data for these models used to be text off of the internet and some manually generated Q&A examples to make it behave more like a chat bot (instruction tuning). Because there is still a need for more data they have started adding AI generated text to the dataset. This technique doesn’t add new knowledge but it has shown to reduce hallucinations. Likely because this data is more focussed, truthful and structured than the median text from the existing datasets. They would probably have data from every major chat provider in there, especially the big boys.