I bet that there's a way to boost these significantly. Keep in mind that GPT 4 can actually pass standardized tests quite well.
@andydataguy4 ай бұрын
Found it 💜 this paper is underrated AF
@LaboriousCretin6 ай бұрын
Good video. Thank you for sharing.
@jackflash63776 ай бұрын
Well done and informative. Thank you.
@densonsmith26 ай бұрын
Since we see that using slightly worse LLM or VLM makes the overall system much worse it seems likely if the LLM or VLM is slightly better the success rate of the overall system might improve by a lot?
@xhridhar6 ай бұрын
We will have to wait a few months or more I guess for a model that will return 95% or more success rate . Only at that rate of success , it can be in a production environment for complex tasks
@TechnoMageCreator6 ай бұрын
Not really, I feel using chatGPT 4 Turbo instead of Claude 3.5 and other AI in parallel and combine and reiterate answers that would have brought a much greater results
@wwkk49646 ай бұрын
14% or 1 in 7. Maybe it can work on tasks that would be done on a sunday by humans?
@joehopfield6 ай бұрын
Physics informed neural networks + management informed neural networks. Love it. But no match for the destructive power of middle management and executive suite neural networks . 😢