How to verify that OCR processing is working properly when building a knowledge base

Austin_Ciou · August 22, 2025, 5:14am

Nowadays, many technical documents are delivered in PDF format.

When figures within a technical document contain text, such as user interface instructions, enabling OCR processing allows LLM to decipher the information hidden in the image, making its responses more appropriate and accurate.

But how do we confirm that LLM has correctly enabled OCR processing?
There are currently two methods.

The first one is to check with your browser’s developer tools.
After uploading the technical file, use the Preview feature and verify directly from the API that the payload parameter (ocr_enabled) is true or false.

The second is to use the Retrieval Testing feature in the Knowledge Base to directly confirm that LLM can return the correct information about the figure.

With OCR processing enabled.

OCR processing disabled.

Hope this sharing can help you.

See you next time.

Topic		Replies	Views
How to solve OEEpro dashboard show license fail Overall Equipment Effectiveness(OEE)	0	46	February 21, 2025
AE tech AI chatbot AgentBuilder	3	103	January 8, 2025
Learning Paradigms in Prompt Engineering AgentBuilder	0	52	March 28, 2025
Tips: Be careful when using the code executor node in Chatflow AgentBuilder	0	31	June 30, 2025
IOTedge can use webhook send tag value to Agent builder run AI Maintenance assistant IoT Edge	0	61	March 18, 2025

How to verify that OCR processing is working properly when building a knowledge base

Related topics