Skip to content

DocTR (OCR)

DocTR is an Optical Character Recognition model.

You can use DocTR with Inference to identify and recognize characters in images.

How to Use DocTR

To use DocTR with Inference, you will need a Roboflow API key. If you don't already have a Roboflow account, sign up for a free Roboflow account. Then, retrieve your API key from the Roboflow dashboard. Run the following command to set your API key in your coding environment:

export API_KEY=<your api key>

Create a new Python file and add the following code:

import requests
import base64
from PIL import Image
import supervision as sv
import os

API_KEY = os.environ["API_KEY"]
IMAGE = "container1.jpeg"

image = Image.open(IMAGE)

data = {
    "image": {
        "type": "base64",
        "value": base64.b64encode(image.tobytes()).decode("utf-8"),
    }
}

ocr_results = requests.post("http://localhost:9001/doctr/ocr?api_key=" + API_KEY, json=data).json()

print(ocr_results, class_name)

Above, replace container1.jpeg with the path to the image in which you want to detect objects.

Then, run the Python script you have created:

python app.py

The results of DocTR will appear in your terminal:

...

Further Reading