>>105984411Feed it a bunch of GGB related prompts. Measure which neurons light up the most. Feed it an unrelated prompt and manually bump up those neuron's outputs. Ez.
>>105984343Data curation is not really AI.
The two strands of AI in my book are these:
First you have the classical ML stuff where you try to find the optimal architecture to improve your validation loss as much as possible.
And then we have stuff that doesn't fit neatly into training and validation sets, but nonetheless is necessary to make AI smarter.
The basic question is whether the way forward is by some end to end mathematically elegant generic mechanism, or the way forward is to tack on ugly hacks like CoT and tool usage and hope that gets us to the point where AI is capable enough to improve by itself faster than we can improve it. And whether there is a mathematical way to explain, formalize and generalize such hacks.
Personally I think we won't figure out the theory ourselves and AI will surpass human capabilities mainly by tacking on a few more hacks and maybe one or two architecture generations beyond the transformer.
It already is smarter than the average human, and it's at the brink of being smart enough to improve itself without humans in the loop, mainly just by scaling compute and a few hacks like verifiable rewards.