Build a Screen Parsing with GPT-4o Vision and OmniParserIn this tutorial, we’ll explore building a screen parsing agent using GPT-4o and OmniParser. Our goal is to demonstrate locking a computer…5d ago5d ago
Document Parsing with OmniParser and GPT4o VisionIn this blog, we’ll explore how to leverage Microsoft’s OmniParser as input for GPT-4’s vision capabilities, optimizing the parsing of…Nov 18Nov 18
Chat with PowerPoint Files Using LangGraph and GPT-4oIn this blog, I’ll guide you through creating a Python script that enables seamless conversation with your PowerPoint document (.PPTX)…Nov 10Nov 10
Build an Intelligent Document Processing with Confidence Scores with GPT-4oAn Intelligent Document Processing (IDP) provides actionable insights through confidence scores, allowing you to evaluate process…Oct 313Oct 313
Using Logprobs to Gauge GPT’s Confidence in OCR ResultIn my recent blog post on parsing complex PDFs using GPT, I explored the potential of incorporating confidence scores to quantify GPT’s…Oct 28Oct 28
Extract Information from Non-English PDFs Using GPT-4o and LangGraphIn this blog post, I want to show you how to get information from PDF files that have content in languages other than English. While it’s…Oct 20Oct 20
Understanding Workato Docs Copilot Web Traffic to Generate a Recipe DescriptionLearn how Workato Docs Copilot generates the recipe description and how to bypass the generation limit by understanding the underlying…Oct 13Oct 13
Augment Activity Audit Log Review with Agentic AI ApproachHere is what I would like to share how to automate the review process of Workato activity audit log on a quarterly basis using Agentic AI…Oct 11Oct 11
RAG with Multiple Complex PDF: Efficient Document Management with Vector StoresLearn how to create a Retrieval-Augmented Generation (RAG) model using multiple PDFs in this tutorial. We’ll extract and store relevant…Oct 6Oct 6
Create a RAG Agent with LangGraph to Extract the information from a PDF FileIn this blog, we will build a simple agent to extract the information from a PDF file with LangGraph. We will be using GPT-4o to extract…Sep 231Sep 231