Analyzes the input image using Trace OCR text detection and recognition
Detection architecture to use. See the models page for available options.
Recognition architecture to use. See the models page for available options.
Return axis-aligned bounding boxes. Set to false for rotated documents to get 4-point polygons instead.
Pad the image to preserve its aspect ratio before feeding it to the model.
Detect and report the page orientation angle in the response.
Detect and report the page language in the response.
Pad symmetrically (centered) rather than bottom-right only.
Automatically rotate pages to correct detected skew before recognition.
Number of pages processed in parallel during detection. Increase for multi-page PDFs if you have enough memory.
Number of text crops processed in parallel during recognition. Decrease if you run into memory issues on large batches.
Skip page orientation classification entirely.
Skip text crop orientation classification.
Pixel-level threshold for the detection segmentation heatmap. Lower values detect more text but may introduce false positives.
Minimum confidence score to keep a detected bounding box. Lower values return more boxes but may include noise.
Group detected words into lines based on spatial proximity.
Group detected lines into blocks based on spatial proximity.
Normalized vertical distance threshold for splitting text into separate blocks. Only used when group_blocks is enabled.
One or more files to process. Accepts JPEG, PNG, and PDF formats.
Successful Response
"example.jpg"
{ "confidence": 0.99, "value": 0 }{ "confidence": 0.99, "value": "en" }{
"detection_score": 0.99,
"geometry": [0, 0, 0, 0],
"lines": [
{
"detection_score": 0.99,
"geometry": [0, 0, 0, 0],
"words": [
{
"confidence": 0.99,
"detection_score": 0.99,
"geometry": [0, 0, 0, 0],
"text_orientation": { "value": 0 },
"value": "example"
}
]
}
]
}