NEW CriticGPT by OpenAI: RLHF + FSBS
- Published on 2 Jul 2024
- OpenAI developed an optimized RLHF pipeline plus a Force Sampling Beam Search (FSBS) algorithm to improve the quality of its LLMs.
In this deep dive I look at why OpenAI felt the need to develop this technique and what the status quo of our current LLM optimization methodologies is.
All rights with the authors:
cdn.openai.com/llm-critics-he...
LLM Critics Help Catch LLM Bugs
Finding GPT-4’s mistakes with GPT-4
openai.com/index/finding-gpt4...
#aiagents
#airesearch
#openai - Science & Technology
Grasshopper here for class and leaving first comment. Thanks for the great videos!
Thanks for watching!
They're being open again? I didn't expect this...
News agencies are the last thing I'd trust to deliver trustworthy data.
Yes. If your buddy in the bar didn’t say it, it ain’t true. A great option is also astrology 🔮 I always get the facts I want
Their AI is a bit of a copycat.
Not much of a thinker. It's not really a copy, as it's rewritten. But yeah, they have some problems and more compute isn't going to fix them. I think they need to improve their neural net architecture, and a bunch of other things.
This is basically AI amplification. You exponentially refine the AI with AI.
Now that they are doing the X^N trick, where X is intelligence, the core question will be whether X is > 1 or < 1.