Retrieval Augmented Generation in the Wild: Anton Troynikov

  Рет қаралды 3,213

AI Engineer

AI Engineer

Күн бұрын

Пікірлер: 2
@Pure_Science_and_Technology
@Pure_Science_and_Technology 11 ай бұрын
FYI there is a failure of direct retrieval with GPT-4 using the new OpenAI Assistant API. GPT tokenizes text and creates its own vector embeddings based on its specific training data. The new terms and sequences may not connect well to the pretrained knowledge in GPT's weight tensors. There was no semantic similarity between the new API terms and GPT's existing vector space. This is a fundamental issue with retrieval augmentation systems like Rag - external knowledge is not truly integrated into the model's learned weights. Adding more vector stores cannot solve this core problem. The solution is to have multiple learned "knowledge planes" with trained weight tensors for specific tasks that can be switched in. This is better than just retrieving separate vector representations.
@Jaybearno
@Jaybearno 11 ай бұрын
Excellent presentation. I have found vanilla embeddings insufficient to do “level2” tasks, which require multiple pieces of context that may vary from ultra specific, to rolled up across the entire document. If anyone can link research on how to embed temporal meaning within chronological text, would love to take a look!
Open Questions for AI Engineering: Simon Willison
24:33
AI Engineer
Рет қаралды 4,7 М.
Human vs Jet Engine
00:19
MrBeast
Рет қаралды 187 МЛН
Amazing remote control#devil  #lilith #funny #shorts
00:30
Devil Lilith
Рет қаралды 9 МЛН
Каха и лужа  #непосредственнокаха
00:15
СОБАКА ВЕРНУЛА ТАБАЛАПКИ😱#shorts
00:25
INNA SERG
Рет қаралды 2,3 МЛН
A Survey of Techniques for Maximizing LLM Performance
45:32
Retrieval-Augmented Generation (RAG)
24:04
Connor Shorten
Рет қаралды 32 М.
GraphRAG: The Marriage of Knowledge Graphs and RAG: Emil Eifrem
19:15
How to set up RAG - Retrieval Augmented Generation (demo)
19:52
Don Woodlock
Рет қаралды 36 М.
AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"
23:47
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, with Patrick Lewis, Facebook AI
1:22:32
Natural Language Processing presentations
Рет қаралды 13 М.
Code Generation and Maintenance at Scale: Morgante Pell
18:54
AI Engineer
Рет қаралды 4 М.
RAG But Better: Rerankers with Cohere AI
23:43
James Briggs
Рет қаралды 61 М.
Human vs Jet Engine
00:19
MrBeast
Рет қаралды 187 МЛН