Table Extraction from Images and Information Retrieval using Deep Learning and a Large Language Model

6 min read · Oct 10, 2023

In this tutorial, I will guide you through detecting and extracting tables from images with Deep Learning libraries, reading their contents with OCR, and finally using a Large Language Model (LLM) to retrieve the line items within each table.

To extract the items within the table, we will perform the following steps:

1. Utilize Deep Learning for table detection and extraction.
2. Employ OCR and a Large Language Model for table line item extraction.
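As a rough sketch of how these two steps fit together, the pipeline can be written as one function that takes each stage as a callable. The names `extract_line_items`, `detect_tables`, `ocr_region`, and `ask_llm` are hypothetical placeholders chosen for illustration, not functions from any library.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def extract_line_items(
    image: object,
    detect_tables: Callable[[object], Sequence[Box]],
    ocr_region: Callable[[object, Box], str],
    ask_llm: Callable[[str], List[dict]],
) -> List[dict]:
    """Run step 1 (table detection) and step 2 (OCR + LLM) on one image."""
    line_items: List[dict] = []
    for box in detect_tables(image):            # step 1: locate each table
        table_text = ocr_region(image, box)     # step 2a: read the text inside it
        line_items.extend(ask_llm(table_text))  # step 2b: turn the text into line items
    return line_items
```

Passing the stages in as arguments keeps the flow easy to test with stubs and lets each stage be swapped independently, for example a different detector or a different LLM.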

Utilize Deep Learning for Table Detection and Extraction

A while back, I wrote an article on extracting table data using RetinaNet with Keras. That approach required annotating the tables and then training a RetinaNet model.

PDF Table Extraction with Keras-RetinaNet: Build a parser to extract the table in PDF document with RetinaNet (djajafer.medium.com)

To achieve the same goal with better performance, this time I'll use a YOLO table extraction model from Hugging Face instead.
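A minimal sketch of what table detection with such a model might look like is shown below. It assumes the `ultralyticsplus` and `Pillow` packages are installed. The checkpoint name `keremberke/yolov8m-table-extraction` is an assumption picked for illustration, since the text above does not name a specific model, and `pad_box` and `detect_tables` are hypothetical helper names.

```python
from typing import List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def pad_box(box: Sequence[float], width: int, height: int, pad: int = 10) -> Box:
    """Grow a detected box by `pad` pixels, clamped to the image bounds."""
    x1, y1, x2, y2 = box
    return (
        max(0, int(x1) - pad),
        max(0, int(y1) - pad),
        min(width, int(x2) + pad),
        min(height, int(y2) + pad),
    )


def detect_tables(image_path: str, confidence: float = 0.25) -> List[Box]:
    """Return padded table boxes found in the image at `image_path`."""
    # Imported lazily so pad_box stays usable without the heavy dependencies.
    from PIL import Image
    from ultralyticsplus import YOLO

    # Assumption: illustrative checkpoint name; weights are downloaded on first use.
    model = YOLO("keremberke/yolov8m-table-extraction")
    model.overrides["conf"] = confidence  # minimum detection confidence

    image = Image.open(image_path)
    width, height = image.size
    results = model.predict(image)
    # boxes.xyxy holds one (x1, y1, x2, y2) row per detected table.
    return [pad_box(b, width, height) for b in results[0].boxes.xyxy.tolist()]
```

Each returned box can then be passed to `Image.open(image_path).crop(box)` to cut the table out of the page before OCR. The small padding is a design choice to avoid clipping text that sits right on the table border.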

Written by Ferry Djaja