Invoice Table Detection with Table Transformer

1,961 views

Andrej Baranovskij

Comments: 9
@AI_Learner-v1l 1 month ago
Hi Andrej, thank you very much for this video. I found an issue with Table Transformer on long tables. If a PDF page has a long table that occupies the whole page, the table is detected only partially: its bottom part is truncated and lost. Is there a solution to this?
@AndrejBaranovskij 1 month ago
Most likely this is a Table Transformer limitation; it can't handle all table layouts. This is why I don't stick to a single model in Sparrow, but also rely on Vision LLMs, such as Qwen2, for data extraction. See my latest videos, where I describe it.
@sudhirogale1687 1 month ago
Hi Andrej, I have a question. I am still struggling to understand how table structure recognition works. As I understand it, the AI model gives us the table location, and OCR gives the text bounding boxes inside that table. You then write custom logic to identify the table structure, which may not be an AI model but a simple Python function. Is that true?
@AndrejBaranovskij 1 month ago
That was my attempt, but it doesn't work with more complex tables. If you check my latest videos, I focus on using Vision LLMs in Sparrow, such as Qwen2, for data extraction. That works better and is more generic.
@sudhirogale1687 1 month ago
@AndrejBaranovskij Thanks, I will check.
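The idea discussed in this exchange (a detection model supplies the table location, OCR supplies text bounding boxes, and plain Python recovers the structure) can be illustrated with a minimal sketch. This is not the Sparrow Parse implementation. The `Word` class, the function names, and the `y_tolerance` parameter are all hypothetical, and the sketch treats every OCR word as its own cell, which is exactly the kind of simplification that breaks on more complex tables.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One OCR result: text plus its bounding box (hypothetical structure)."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def inside(word, table_box):
    """Return True when the word's center lies inside table_box (x0, y0, x1, y1)."""
    cx = (word.x0 + word.x1) / 2
    cy = (word.y0 + word.y1) / 2
    tx0, ty0, tx1, ty1 = table_box
    return tx0 <= cx <= tx1 and ty0 <= cy <= ty1


def group_rows(words, y_tolerance=5.0):
    """Cluster words into rows by vertical center, ordered top to bottom."""
    rows = []
    for w in sorted(words, key=lambda w: (w.y0 + w.y1) / 2):
        cy = (w.y0 + w.y1) / 2
        # Join the current row if the center is close to the row's first word.
        if rows and abs(cy - rows[-1]["cy"]) <= y_tolerance:
            rows[-1]["words"].append(w)
        else:
            rows.append({"cy": cy, "words": [w]})
    # Within each row, order the words left to right.
    return [sorted(r["words"], key=lambda w: w.x0) for r in rows]


def table_to_grid(words, table_box, y_tolerance=5.0):
    """Keep only words inside the detected table and return rows of cell text."""
    in_table = [w for w in words if inside(w, table_box)]
    return [[w.text for w in row] for row in group_rows(in_table, y_tolerance)]


words = [
    Word("Item", 10, 10, 40, 20),
    Word("Qty", 100, 11, 130, 21),
    Word("Pen", 10, 30, 40, 40),
    Word("2", 100, 31, 110, 41),
    Word("Footer", 10, 200, 60, 210),  # outside the table box, so dropped
]
grid = table_to_grid(words, table_box=(0, 0, 200, 100))
```

Here `table_box` stands in for the box a table detection model would return. Multi-word cells, merged cells, and wrapped rows would all need extra logic, which is consistent with the reply above that this kind of approach does not generalize well.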
@TrườngNguyễnQuang-b9c 3 months ago
I don't see the code file shown in the Example section of the video in your GitHub repo. Can you send me the "table_detector" code file shown in the "Example" section of the video?
@AndrejBaranovskij 3 months ago
The code is inside the Sparrow Parse library. See the table structure processor: github.com/katanaml/sparrow/tree/main/sparrow-data/parse/sparrow_parse/processors
@mayssamayel6259 5 months ago
What changes did you make when you implemented the code from the notebook?
@AndrejBaranovskij 5 months ago
Different OCR processing, cleanup of the Table Transformer results, and column merging. I am now working on fixing row data. Quite a lot of changes.
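As an illustration of what column merging during results cleanup might look like, here is a minimal sketch. It assumes the structure model returns column boxes as `(x0, y0, x1, y1)` tuples and that near-duplicate columns should be fused when they overlap horizontally. The function name `merge_columns` and the `min_overlap` threshold are hypothetical, and this is not taken from the Sparrow Parse code.

```python
def merge_columns(boxes, min_overlap=0.5):
    """Merge column boxes (x0, y0, x1, y1) that overlap horizontally.

    Two neighbouring boxes are fused when their horizontal overlap covers at
    least min_overlap of the narrower box's width.
    """
    merged = []
    for box in sorted(boxes, key=lambda b: b[0]):  # left to right
        if merged:
            px0, py0, px1, py1 = merged[-1]
            x0, y0, x1, y1 = box
            overlap = min(px1, x1) - max(px0, x0)
            narrower = min(px1 - px0, x1 - x0)
            if narrower > 0 and overlap / narrower >= min_overlap:
                # Replace the previous box with the union of the two boxes.
                merged[-1] = (min(px0, x0), min(py0, y0),
                              max(px1, x1), max(py1, y1))
                continue
        merged.append(tuple(box))
    return merged


detected = [
    (0, 0, 50, 100),
    (40, 0, 100, 100),
    (45, 0, 105, 100),   # near-duplicate of the previous column
    (120, 0, 160, 100),
]
columns = merge_columns(detected)
```

In this example the second and third boxes are fused into one column, while the first box stays separate because it overlaps its neighbour only slightly. The right threshold depends on the documents being processed.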