May your regularizer be strong, lest you RLHF to slop. - Andrej Karpathy

" "

May your regularizer be strong, lest you RLHF to slop.

English
Collect this quote
Limited Time Offer

Premium members can get their quote collection automatically imported into their Quotewise collections.

Related quotes. More quotes will automatically load as you scroll down, or you can use the load more buttons.

Additional quotes by Andrej Karpathy

Don't think of LLMs as entities but as simulators. For example, when exploring a topic, don't ask:

"What do you think about xyz"?

There is no "you". Next time try:

"What would be a good group of people to explore xyz? What would they say?"

The LLM can channel/simulate many perspectives but it hasn't "thought about" xyz for a while and over time and formed its own opinions in the way we're used to. If you force it via the use of "you", it will give you something by adopting a personality embedding vector implied by the statistics of its finetuning data and then simulate that. It's fine to do, but there is a lot less mystique to it than I find people naively attribute to "asking an AI".

Works in ChatGPT, Claude, or Any AI

Add semantic quote search to your AI assistant via MCP. One command setup.

Loading...