AI Scientist

  Рет қаралды 3,387

hu-po

hu-po

Күн бұрын

Пікірлер: 12
@wolpumba4099
@wolpumba4099 4 ай бұрын
*Summary* * *(**0:00**)* *AI Scientist is a system designed for automated machine learning research.* It can generate research ideas, write code, run experiments, and even write a paper summarizing its findings. * *(**20:00**)* *The system relies heavily on Large Language Models (LLMs) like GPT-4 and Claude.* Different LLMs were tested and compared for their effectiveness in different stages of the process. * *(**21:30**)* *It works by generating a variety of ideas, checking them for novelty, and then running small experiments based on the most promising ideas.* These experiments are often based on modifying an existing code base. * *(**29:30**)* *An LLM-based reviewer is also used to evaluate the generated papers.* This automated reviewer has shown to be effective at mimicking human review processes. * *(**37:30**)* *Concerns about the system include its potential to generate inaccurate or misleading papers, as well as ethical considerations about fully automating scientific research.* For example, one generated paper showed improvement simply by making the model bigger, without any genuine scientific innovation. * *(**47:40**)* *There are also concerns about the limitations of humans to understand and evaluate AI-generated research as AI systems become more advanced.* * *(**11:30**)* *The system highlights a shift from traditional coding to 'flow engineering,' where researchers design workflows and prompts for LLMs to execute.* * *(**1:15:00**)* *While the system shows promise for automating some aspects of research, it also raises important questions about the future role of human scientists and the nature of scientific progress itself.* *In short, AI Scientist is a compelling example of the potential for AI to automate research, but it also raises crucial questions and concerns about its limitations, ethics, and implications for the future of science.* I used Google Gemini 1.5 Pro exp 0801 to summarize the transcript. Cost (if I didn't use the free tier): $0.2134 Time: 87.28 seconds I added a 61 second delay to prevent a rate limit of the free tier. Input tokens: 58785 Output tokens: 725
@sirishkumar-m5z
@sirishkumar-m5z 3 ай бұрын
In today's tech world, the position of an AI scientist is becoming more and more important. The way artificial intelligence (AI) is expanding the realm of scientific investigation is truly remarkable.
@bycloudAI
@bycloudAI 4 ай бұрын
had to watch your take cuz the cope around this paper all around is kinda too much anddd good take as always, your analogies are on point
@tljstewart
@tljstewart 4 ай бұрын
You’re my fav llm 😅
@aresaurelian
@aresaurelian 4 ай бұрын
Interesting. There is much to explore. Grab your packs, let's go.
@vishalrajput9856
@vishalrajput9856 4 ай бұрын
One thing, LLMs can't come up with new idea in the sense that it doesn't know which one are interesting and which ones are not. It doesn't have a good understanding of feasibility of ideas.
@Leto2ndAtreides
@Leto2ndAtreides 4 ай бұрын
That'll change with time. But also, you can prioritize a lot of things by just the projected benefit vs cost of execution. A lot of innovation is just a fair amount of iteration over multople areas... Until you achieve a qualitative change.
@xiaojinyusaudiobookswebnov4951
@xiaojinyusaudiobookswebnov4951 4 ай бұрын
"Feasibility": This can be relatively easy if you have access to some simulation environment. "Interestingness": That is a lot more tricky. You could try to do something like "If the idea is novel, then it's interesting", but what constitutes novel depends on what the model has seen before. The fundamental issue is that what constitutes a "good idea" is not really a property of the idea itself, but of its context: The idea of building a combustion engine would have been useless 5000 years ago since you didn't have metallurgy, materials science, or machining capabilities. The idea of building a car would have been useless 200 years ago since you didn't have roads or paved surfaces.The idea of a self-driving car is probably pretty bad in places that have no infrastructure to support them (no street signs, poorly maintained roads, etc...). You could try to formalize this in something like a multi-agent system, where you have multiple "experts" that argue about the usefulness of an idea... and they all have to agree that the idea (generated by an LLM) is worth pursuing.
@andydataguy
@andydataguy 3 ай бұрын
​@@xiaojinyusaudiobookswebnov4951 bro dropping some serious alpha here 🔥
@zzzzzzz8473
@zzzzzzz8473 4 ай бұрын
great overview , really appreciate the evaluation of the generated "novel" paper as being an obfuscated "scale is all you need" and the potential for delusion of itself into believing it found something worthwhile . the flaws of humanity are baked into these LLMs , so different then what retrofuturism imagined as pure logic systems . i wonder if the process needs another agent who is a curmudgeon skeptic that points out all the flaws and the other agent is tasked to respond like a thesis defense . lots of potential training data from schmidhuber , " thats just an RNN with extra steps , i did all this 30 years ago "
@wolpumba4099
@wolpumba4099 4 ай бұрын
Summary starts at 1:47:43
MeshFormer vs MeshAnything
1:31:22
hu-po
Рет қаралды 1,8 М.
Michael Levin - Non-neural intelligence: biological architecture problem-solving in diverse spaces
59:42
Institute for Pure & Applied Mathematics (IPAM)
Рет қаралды 16 М.
UFC 310 : Рахмонов VS Мачадо Гэрри
05:00
Setanta Sports UFC
Рет қаралды 1,2 МЛН
“Don’t stop the chances.”
00:44
ISSEI / いっせい
Рет қаралды 56 МЛН
Quando A Diferença De Altura É Muito Grande 😲😂
00:12
Mari Maria
Рет қаралды 41 МЛН
Video Generation
1:34:16
hu-po
Рет қаралды 1,1 М.
Strawberry
1:55:38
hu-po
Рет қаралды 7 М.
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
57:45
Generative Molecular Dynamics
1:37:15
hu-po
Рет қаралды 1,3 М.
Unreasonably Effective AI with Demis Hassabis
52:00
Google DeepMind
Рет қаралды 232 М.
Why AI needs God - Prof Rosalind Picard
1:02:33
Seen & Unseen
Рет қаралды 8 М.
LCM: The Ultimate Evolution of AI? Large Concept Models
30:13
Discover AI
Рет қаралды 18 М.
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 1,4 МЛН
Айфон сдался: когда сделают складной телефон
0:11
Короче, новости
Рет қаралды 184 М.
Introducing the "VitaWear SmartBand,"😳 a next-gen wearable gadget🎉
0:26
Shreya fun shorts
Рет қаралды 4,2 МЛН
Cách tính trở kháng loa khi đấu nối tiếp và song song!
0:20