LDX hub API

Schemas


Model

id
​string

Unique model identifier. Use this value in job requests.

display_name
​string

Human-readable model name

description
​string

Model description and recommended use case

provider
​string

Model provider

FileInfo

file_id
​string

Unique file identifier. Use this value in job requests.

filename
​string

Original filename

size
​integer

File size in bytes

created_at
​string · date-time
updated_at
​string · date-time
expires_at
​string · date-time

Files are automatically deleted after a period of time

UploadCreateRequest

filename
​string · required

The filename of the file to upload

UploadAppendRequest

data
​string · required

Base64-encoded chunk of the file. Each chunk must not exceed 5 MB of original data, measured before Base64 encoding.
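The chunk limit applies to the raw bytes, not to the Base64 text, so a client would split the file first and encode each piece afterwards. The following is a minimal sketch of that step. The helper name `iter_append_requests` is hypothetical, and the 5,000,000-byte default is an assumption chosen to stay under the limit whether "5 MB" is meant as a decimal or binary unit.

```python
import base64
from typing import Iterator

# Assumption: 5,000,000 bytes is within the limit under either reading of "5 MB".
MAX_CHUNK_BYTES = 5_000_000


def iter_append_requests(data: bytes, chunk_size: int = MAX_CHUNK_BYTES) -> Iterator[dict]:
    """Yield one UploadAppendRequest body per chunk of raw (pre-encoding) bytes."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(data), chunk_size):
        chunk = data[start:start + chunk_size]
        # Only the 'data' field is defined by the UploadAppendRequest schema.
        yield {"data": base64.b64encode(chunk).decode("ascii")}
```

Each yielded dictionary would be sent as the body of one append call. How the upload session is addressed is not described in this schema list, so that part is left out.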

Usage

total_input_characters
​integer

Total number of input characters submitted

processed_input_characters
​integer

Number of input characters actually processed

skipped_characters
​integer

Number of input characters skipped (e.g. numeric segments)

output_characters
​integer

Total number of output characters generated

vector_bytes
​integer

Vector data bytes (reserved for future use)

input_pages
​integer

Number of pages in the input PDF. Present for jobs whose input is a PDF file.

JobCreateResponse

Response returned immediately after job submission
job_id
​string

Unique job identifier. Use this to poll for results.

model
​string
status
​string · enum
Enum values:
queued
progress
​number · float
created_at
​string · date-time
updated_at
​string · date-time
expires_at
​string · date-time
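Because the response only confirms that the job is queued, a client has to poll the job detail endpoint until the job finishes. The sketch below shows one way to do that, using the status values listed in the job detail schemas on this page (queued, processing, completed, failed). The `fetch_job` callable is a hypothetical stand-in for whatever HTTP client performs the GET request; the interval and timeout defaults are arbitrary assumptions.

```python
import time
from typing import Callable

# Statuses after which the job no longer changes, per the job detail schemas.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_job(fetch_job: Callable[[str], dict], job_id: str,
                 interval_s: float = 2.0, timeout_s: float = 600.0,
                 sleep: Callable[[float], None] = time.sleep) -> dict:
    """Call fetch_job(job_id) repeatedly until the job is completed or failed."""
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_job(job_id)
        if job.get("status") in TERMINAL_STATUSES:
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} still {job.get('status')!r} after {timeout_s}s")
        sleep(interval_s)
```

Passing the fetch function in keeps the loop independent of any particular HTTP library and makes it easy to exercise with canned responses.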

JobError

Job-level error. Present only when job status is failed.
code
​string
message
​string
request_id
​string

ApiError

Error response for 4xx/5xx status codes
code
​string
message
​string
request_id
​string
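A client may want to surface these three fields together, since `request_id` is the value most useful when reporting a problem. The sketch below wraps an ApiError body in an exception. The class and function names are hypothetical, and the error code in the usage note is made up for illustration.

```python
class LdxHubApiError(Exception):
    """Hypothetical exception wrapping an ApiError response body."""

    def __init__(self, http_status: int, body: dict):
        self.http_status = http_status
        self.code = body.get("code")
        self.message = body.get("message")
        self.request_id = body.get("request_id")
        super().__init__(
            f"HTTP {http_status} {self.code}: {self.message} (request_id={self.request_id})"
        )


def raise_for_api_error(http_status: int, body: dict) -> None:
    """Raise LdxHubApiError for 4xx/5xx responses; do nothing otherwise."""
    if http_status >= 400:
        raise LdxHubApiError(http_status, body)
```

For example, `raise_for_api_error(404, {"code": "not_found", "message": "No such job", "request_id": "req_1"})` would raise an exception whose message includes the request ID.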

StructFlowCreateJobRequest

Either 'inputs' (inline records) or 'file_id' (a pre-uploaded JSONL file) must be provided, but not both. Use 'inputs' for small batches or ad-hoc calls, and 'file_id' for large batches or pipelines where the input is produced by another job.
model
​string · required

Model ID obtained from GET /structflow/models

system_prompt
​string · required

Instruction describing what structured data to extract and how

example_output
​object · required

A JSON object showing the desired output structure with example values

inputs
object[]

List of input records to process. Mutually exclusive with 'file_id'.

file_id
​string

File ID of a pre-uploaded JSONL file (one JSON record per line), obtained from POST /files. Mutually exclusive with 'inputs'. When used, results are returned as an 'output_file_id' instead of inline 'results'.
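The mutual-exclusivity rule between 'inputs' and 'file_id' is easy to check on the client before submitting. The following is a minimal sketch of a request builder that enforces it. The function name is hypothetical, and the shape of each input record is not specified by the schema, so any record contents passed in are the caller's assumption.

```python
from typing import Optional


def build_structflow_request(model: str, system_prompt: str, example_output: dict,
                             inputs: Optional[list] = None,
                             file_id: Optional[str] = None) -> dict:
    """Build a StructFlowCreateJobRequest body with exactly one input source."""
    # Exactly one of 'inputs' or 'file_id' must be given, never both or neither.
    if (inputs is None) == (file_id is None):
        raise ValueError("provide exactly one of 'inputs' or 'file_id'")
    body = {
        "model": model,
        "system_prompt": system_prompt,
        "example_output": example_output,
    }
    if inputs is not None:
        body["inputs"] = inputs
    else:
        body["file_id"] = file_id
    return body
```

Rejecting the request locally avoids a round trip that the API would presumably answer with an error.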

StructFlowJobSummary

Job summary without results data. Returned by GET /structflow/jobs.
job_id
​string
model
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
output_file_id
​string

File ID of the output JSONL file. Present only when status is completed AND the job was submitted with 'file_id' input.

​Usage

StructFlowJobDetail

Full job details including results. Returned by GET /structflow/jobs/{job_id}.
job_id
​string
model
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
​object

Result summary. Present only when status is completed AND the job was submitted with 'inputs' (inline).

results
object[]

Extracted data per input record. Present only when status is completed AND the job was submitted with 'inputs' (inline). Each record may independently succeed or fail. For jobs submitted with 'file_id', use 'output_file_id' to download the results as a JSONL file.

​JobError

Job-level error. Present only when job status is failed.

created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
output_file_id
​string

File ID of the output JSONL file. Present only when status is completed AND the job was submitted with 'file_id' input. Each line contains either the extracted JSON for a successful record, or an object with a reserved '$error' key (e.g. {"$error":{"code":"...","message":"..."}}) for a failed record. Download via GET /files/{output_file_id}/content.

​Usage
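Since each line of the output JSONL file is either an extracted record or an object carrying the reserved '$error' key, a consumer needs to separate the two while reading. The sketch below does that for text that has already been downloaded. The function name is hypothetical, and the record contents in the usage note are invented for illustration.

```python
import json
from typing import List, Tuple


def split_structflow_output(jsonl_text: str) -> Tuple[List[tuple], List[tuple]]:
    """Split output JSONL into (line_no, record) successes and (line_no, error) failures."""
    successes, failures = [], []
    # Split on "\n" only, so unusual separators inside JSON strings stay intact.
    for line_no, line in enumerate(jsonl_text.split("\n"), start=1):
        if not line.strip():
            continue  # tolerate a trailing newline or blank lines
        record = json.loads(line)
        if isinstance(record, dict) and "$error" in record:
            failures.append((line_no, record["$error"]))
        else:
            successes.append((line_no, record))
    return successes, failures
```

Keeping the line number alongside each entry lets failed records be matched back to the corresponding line of the input file, assuming the output preserves input order, which this page does not state explicitly.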

RefineLoopCreateJobRequest

model
​string · required

Model ID obtained from GET /refineloop/models

file_id
​string · required

File ID of the XLIFF file to process, obtained from POST /files

exclude_numeric_segments
​boolean

If true, segments containing only numbers are skipped

Default: false
remove_hyphenation
​boolean

If true, hyphenation in source text is removed before processing

Default: true
max_revisions
​integer · min: 1 · max: 10

Maximum number of revision iterations per segment (1-10)

Default: 6
domain
​string

Subject domain to guide translation review (e.g. "IR", "medical", "legal")

note_language
​string

Language code for revision notes written by the AI (e.g. "ja", "en", "ko"). Defaults to the source language of the XLIFF file if not specified.

output_mode
​string · enum

Controls what is included in the output XLIFF

Enum values:
full
translations
none
Default: full
custom_instructions
​string

Additional instructions appended to the AI's base prompt
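The documented defaults and limits can be mirrored in a small client-side builder, so that an out-of-range value is caught before the request is sent. The following is a sketch under that assumption. The function name is hypothetical, and omitting unset optional fields (rather than sending null) is a design choice of this example, not something the schema prescribes.

```python
from typing import Optional

OUTPUT_MODES = ("full", "translations", "none")


def build_refineloop_request(model: str, file_id: str, *,
                             exclude_numeric_segments: bool = False,
                             remove_hyphenation: bool = True,
                             max_revisions: int = 6,
                             output_mode: str = "full",
                             domain: Optional[str] = None,
                             note_language: Optional[str] = None,
                             custom_instructions: Optional[str] = None) -> dict:
    """Build a RefineLoopCreateJobRequest body using the documented defaults."""
    if not 1 <= max_revisions <= 10:
        raise ValueError("max_revisions must be between 1 and 10")
    if output_mode not in OUTPUT_MODES:
        raise ValueError(f"output_mode must be one of {OUTPUT_MODES}")
    body = {
        "model": model,
        "file_id": file_id,
        "exclude_numeric_segments": exclude_numeric_segments,
        "remove_hyphenation": remove_hyphenation,
        "max_revisions": max_revisions,
        "output_mode": output_mode,
    }
    # Leave unset optional strings out so the server-side defaults apply.
    optional = {
        "domain": domain,
        "note_language": note_language,
        "custom_instructions": custom_instructions,
    }
    body.update({key: value for key, value in optional.items() if value is not None})
    return body
```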

RefineLoopJobSummary

Job summary without output data. Returned by GET /refineloop/jobs.
job_id
​string
file_id
​string

Input file ID

model
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage

RefineLoopJobDetail

Full job details. Returned by GET /refineloop/jobs/{job_id}.
job_id
​string
file_id
​string

Input file ID

model
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
output_file_id
​string

File ID of the refined XLIFF output. Present only when status is completed. Download via GET /files/{output_file_id}/content.

​JobError

Job-level error. Present only when job status is failed.

created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage

Engine

id
​string

Unique engine identifier. Use this value in job requests.

display_name
​string

Human-readable engine name

provider
​string

Engine provider

description
​string

Engine description and recommended use case

​object[]

List of supported input/output format combinations

​object[]

List of supported OCR languages

RenderOCRCreateJobRequest

engine
​string · required

Engine ID obtained from GET /renderocr/engines

file_id
​string · required

File ID of the input file to process, obtained from POST /files

languages
​string[] · required

List of language codes for OCR recognition. Currently only the first language is used.

output_format
​string · enum · required

Output document format

Enum values:
docx
xlsx
pptx

RenderOCRJobSummary

Job summary without output data. Returned by GET /renderocr/jobs.
job_id
​string
file_id
​string

Input file ID

engine
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage

RenderOCRJobDetail

Full job details. Returned by GET /renderocr/jobs/{job_id}.
job_id
​string
file_id
​string

Input file ID

engine
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
output_file_id
​string

File ID of the converted document output. Present only when status is completed. Download via GET /files/{output_file_id}/content.

​JobError

Job-level error. Present only when job status is failed.

created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage

CastDocCreateJobRequest

engine
​string · required

Engine ID obtained from GET /castdoc/engines

file_id
​string · required

File ID of the input PDF file to process, obtained from POST /files

output_format
​string · enum · required

Output document format

Enum values:
docx
xlsx
pptx

CastDocJobSummary

Job summary without output data. Returned by GET /castdoc/jobs.
job_id
​string
file_id
​string

Input file ID

engine
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage

CastDocJobDetail

Full job details. Returned by GET /castdoc/jobs/{job_id}.
job_id
​string
file_id
​string

Input file ID

engine
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
output_file_id
​string

File ID of the converted document output. Present only when status is completed. Download via GET /files/{output_file_id}/content.

​JobError

Job-level error. Present only when job status is failed.

created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage

ExtractDocCreateJobRequest

engine
​string · required

Engine ID obtained from GET /extractdoc/engines

file_id
​string · required

File ID of the input file to process, obtained from POST /files. Supported input formats: PDF, DOCX, XLSX, PPTX. The input format is automatically detected from the uploaded file.

output_format
​string · enum · required

Output format. Use 'text' for plain text, or 'jsonl' for a single-line JSON object compatible with StructFlow input.

Enum values:
text
jsonl
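Since 'jsonl' output is described as compatible with StructFlow input, a completed ExtractDoc job can feed a StructFlow job by passing its `output_file_id` as the StructFlow `file_id`. The sketch below builds that follow-up request. The function name is hypothetical, and it assumes the ExtractDoc job was created with output_format 'jsonl', which the job detail does not echo back and therefore cannot be checked here.

```python
def structflow_request_from_extractdoc(extract_job: dict, model: str,
                                       system_prompt: str, example_output: dict) -> dict:
    """Build a StructFlow request whose input is a completed ExtractDoc job's output file."""
    if extract_job.get("status") != "completed":
        raise ValueError(f"ExtractDoc job is {extract_job.get('status')!r}, not completed")
    output_file_id = extract_job.get("output_file_id")
    if not output_file_id:
        raise ValueError("completed ExtractDoc job has no output_file_id")
    # Assumption: the extracted file is JSONL, as required for StructFlow 'file_id' input.
    return {
        "model": model,
        "system_prompt": system_prompt,
        "example_output": example_output,
        "file_id": output_file_id,
    }
```

Because the StructFlow job is then submitted with 'file_id', its results would arrive as an output file rather than inline.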

ExtractDocJobSummary

Job summary without output data. Returned by GET /extractdoc/jobs.
job_id
​string
file_id
​string

Input file ID

engine
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage

ExtractDocJobDetail

Full job details. Returned by GET /extractdoc/jobs/{job_id}.
job_id
​string
file_id
​string

Input file ID

engine
​string
status
​string · enum
Enum values:
queued
processing
completed
failed
progress
​number · float
output_file_id
​string

File ID of the extracted output. Present only when status is completed. Download via GET /files/{output_file_id}/content.

​JobError

Job-level error. Present only when job status is failed.

created_at
​string · date-time
updated_at
​string · date-time
completed_at
​string · date-time
expires_at
​string · date-time
​Usage