Data Intelligence Pipeline

Turn messy, sensitive documents into RAG-ready knowledge graphs. One call. Structured Data Packages. Production-grade quality metrics.

latence-python SDK

v0.2

Pipeline-first SDK. Submit files, get structured Data Packages.

Tutorials & Notebooks · Code Examples · Pipeline Guides
Explore on GitHub

Quick Start

1. Get your API Key

Create an account and generate an API key from your dashboard.

Create Account →
2. Install the SDK
pip install latence
View on GitHub →
3. Submit a Pipeline
job = client.pipeline.run(files=["doc.pdf"])
View Example →
4. Get Data Package
pkg = job.wait_for_completion()
print(pkg.document.markdown)
pkg.merge(save_to="out.json")
Data Package →

The Pipeline

Core Product

Submit documents. Get back structured, high-quality data packages ready for RAG, agents, and LLM workflows.

Smart Defaults

Just provide files. The intelligent default pipeline runs Document Intelligence → Entity Extraction → Knowledge Graph automatically. No configuration required.

Structured Data Package

Not raw JSON dumps. Organized sections with document markdown, entities, knowledge graphs, quality metrics, and confidence scores.

DAG Execution

Services execute as a directed acyclic graph -- independent branches run in parallel. Track per-stage progress with real-time callbacks. Resumable on partial failure.
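The parallel-branch behavior described above can be sketched with the standard library's `graphlib`. This is an illustration of DAG scheduling, not the SDK's internals; the stage names are taken from the default pipeline, but the dependency edges are assumptions for the example.

```python
from graphlib import TopologicalSorter

# Illustrative stage graph: Entity Extraction and Redaction both depend only on
# Document Intelligence, so they become ready in the same batch and can run in
# parallel. Edges here are assumed for the example.
stages = {
    "document_intelligence": set(),
    "entity_extraction": {"document_intelligence"},
    "redaction": {"document_intelligence"},
    "knowledge_graph": {"entity_extraction"},
}

ts = TopologicalSorter(stages)
ts.prepare()
order = []
while ts.is_active():
    ready = list(ts.get_ready())  # independent stages surface together
    order.append(sorted(ready))
    ts.done(*ready)
print(order)
```

Each inner list is a batch of stages with no unmet dependencies; a runner can dispatch each batch concurrently, which is what makes partial failure resumable per branch.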

ZIP Archive Export

Download results as an organized ZIP archive with markdown documents, entity JSON, knowledge graph data, quality reports, and a human-readable README.

Data Consolidation

Merge all outputs into a single, document-centric JSON with zero redundancy. One call to pkg.merge() and you have production-ready data for downstream consumption.
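A document-centric merge like the one `pkg.merge()` performs can be sketched with plain dicts. The field names (`markdown`, `entities`, `relations`) are assumptions for illustration; the real consolidated JSON schema may differ.

```python
import json

# Hypothetical per-service payloads; real Data Package field names may differ.
document = {"id": "doc-1", "markdown": "# Contract\n..."}
entities = [{"text": "Acme Corp", "type": "ORG"}]
relations = [{"head": "Acme Corp", "rel": "party_to", "tail": "doc-1"}]

# Document-centric merge: one object per document, each section attached once,
# so nothing is repeated across files.
merged = {**document, "entities": entities, "knowledge_graph": {"relations": relations}}

with open("out.json", "w") as f:
    json.dump(merged, f, indent=2)
```

The key property is zero redundancy: each section appears exactly once under its document, rather than as separate files that repeat document metadata.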

Full Guide
Pipeline Documentation
Complete guide: smart defaults, step configuration, Data Package structure, fluent builder, async job handling, and pricing.
Document Intelligence · Redaction · Entity Extraction · Knowledge Graph
View Guide

Authentication

All API requests require authentication using a Bearer token.

pipeline.py
from latence import Latence
 
client = Latence(api_key="YOUR_API_KEY")
 
# Submit files -- smart defaults handle the rest
job = client.pipeline.run(files=["contract.pdf"])
 
# Wait for the composed Data Package
pkg = job.wait_for_completion()
 
# Structured, summarized results
print(pkg.document.markdown) # Clean extracted text
print(pkg.entities.summary) # Entity counts by type
print(pkg.knowledge_graph.relations) # Full relation list
 
pkg.download_archive("./results.zip") # ZIP export
pkg.merge(save_to="./output.json") # Consolidated JSON
1. Get your API key from the Dashboard → API Keys page
2. Install the Python SDK: pip install latence
3. The SDK handles authentication automatically
4. Never share your API key or commit it to version control
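To keep the key out of source control, read it from the environment. The variable name `LATENCE_API_KEY` is an assumption for this sketch, not a name the SDK requires.

```python
import os


def load_api_key() -> str:
    """Read the API key from the environment instead of hardcoding it."""
    key = os.environ.get("LATENCE_API_KEY")  # assumed variable name
    if not key:
        raise RuntimeError("Set LATENCE_API_KEY before creating the client")
    return key


# client = Latence(api_key=load_api_key())
os.environ["LATENCE_API_KEY"] = "sk-demo"  # demo only; export this in your shell instead
print(load_api_key())
```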

Rate Limits

API requests are rate-limited per API key, per service. Limits apply uniformly — there are no tier-based rate limits.

Service | Rate Limit
Document Intelligence | 500 req/min
Entity Extraction | 1,000 req/min
Relation Extraction / Knowledge Graph | 500 req/min
Redaction | 1,000 req/min
Compression | 1,500 req/min
Embed (unified) | 1,500 req/min
Embedding (dense) | 2,000 req/min
ColBERT | 1,000 req/min
ColPali | 1,000 req/min
Chunking | 5,000 req/min
Dataset Intelligence | 100 req/min
Enrichment (Coming Soon) | 2,500 req/min

Rate Limit Headers

x-ratelimit-limit | Maximum requests allowed in the window
x-ratelimit-remaining | Requests remaining in the current window
x-credits-used | Credits charged for this request
x-credits-remaining | Your remaining credit balance
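A client can read these headers after each response to throttle itself before hitting a 429. The literal values below are made up for illustration; only the header names come from the table above.

```python
# Assumed response headers for illustration; names are from the table above,
# values arrive as strings on the wire.
headers = {
    "x-ratelimit-limit": "500",
    "x-ratelimit-remaining": "499",
    "x-credits-used": "10",
    "x-credits-remaining": "4990",
}

# Convert once, then check before issuing the next request.
usage = {name: int(value) for name, value in headers.items()}
low_on_requests = usage["x-ratelimit-remaining"] < 10
low_on_credits = usage["x-credits-remaining"] < usage["x-credits-used"]
print(low_on_requests, low_on_credits)
```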

Error Codes

Standard HTTP error codes with additional context.

Code | Name | Description
400 | Bad Request | Invalid request parameters
401 | Unauthorized | Missing or invalid API key
402 | Insufficient Credits | Your credit balance is too low
429 | Too Many Requests | Rate limit exceeded
500 | Internal Server Error | Unexpected server error
Error Response Example (JSON)
{
  "error": "Rate limit exceeded",
  "details": "Maximum 500 requests per 60000ms",
  "retry_after": 60
}
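The `retry_after` field in the error body gives a server-suggested backoff in seconds. A minimal sketch of honoring it on a 429, with a fallback when the body is not parseable:

```python
import json


def retry_delay(body: str, default: float = 1.0) -> float:
    """Extract the server-suggested backoff (seconds) from a 429 error body."""
    try:
        return float(json.loads(body).get("retry_after", default))
    except (ValueError, TypeError):
        # Non-JSON or malformed body: fall back to a conservative default.
        return default


body = '{"error": "Rate limit exceeded", "details": "Maximum 500 requests per 60000ms", "retry_after": 60}'
delay = retry_delay(body)
# time.sleep(delay)  # then retry the request
print(delay)
```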

Experimental / Developer APIs

Self-Service

Direct access to individual services for development, testing, and custom workflows.

These endpoints are available for development and testing. For production workloads, use the Data Intelligence Pipeline above -- it provides structured Data Packages, quality metrics, and is covered by Enterprise SLAs.

Experimental · 1 cr
Embedding
Generates dense vector embeddings with Matryoshka dimension support. Choose an embedding dimension (256, 512, 768, or 1024) to balance quality against performance.
View Documentation
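Matryoshka embeddings nest smaller representations inside larger ones, so a 1024-dim vector can be cut down to its leading 256, 512, or 768 values. A sketch of that trade-off with a stand-in vector (not a real API response):

```python
# Stand-in for a 1024-dim embedding returned by the service.
full = [i / 1024 for i in range(1024)]


def truncate(vec: list[float], dim: int) -> list[float]:
    """Keep the leading `dim` values; Matryoshka training makes prefixes usable."""
    assert dim in (256, 512, 768, 1024), "supported Matryoshka dimensions"
    return vec[:dim]


small = truncate(full, 256)
print(len(small))
```

In practice truncated Matryoshka vectors are usually re-normalized before cosine similarity; that step is omitted here for brevity.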
Experimental · 5 cr
ColBERT
ColBERT provides state-of-the-art neural retrieval with token-level embeddings. Using late interaction, it delivers superior ranking precision compared to traditional dense embeddings.
View Documentation
Experimental · 10 cr
ColPali
ColPali combines vision and language models for searching documents where visual context matters. Ideal for documents with charts, diagrams, tables, and complex formatting.
View Documentation
Experimental · 3 cr
Compression
Dramatically reduces token count while preserving meaning. TOON encoding and intelligent compression achieve up to 80% token reduction.
View Documentation
Experimental · 10 cr
Document Intelligence
Document Intelligence (V2) uses advanced AI models and layout analysis for document processing. Extract text, tables, and structured content from PDFs, images, and Office documents.
View Documentation
Experimental · 5 cr
Entity Extraction
Zero-shot entity extraction using a NER-inspired approach. Extract any entity type without training: just provide labels, or let the AI generate them automatically.
View Documentation
Experimental · 5 cr
Relation Extraction
Extract relations and build structured knowledge graphs from unstructured text. Discover entity relationships and output them in RDF/Turtle, Neo4j property graph, or custom formats.
View Documentation
Experimental · 10 cr
Redaction
GDPR-compliant PII detection and redaction. Automatically find and remove sensitive information from text with configurable masking or replacement.
View Documentation
Experimental · 0 cr
Chunking
Split text into semantically meaningful chunks using four strategies; the character and token strategies are free.
View Documentation
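As an illustration of the simplest strategy, here is a character-window chunker with overlap. This is a local sketch of the concept, not the service's implementation; the `size` and `overlap` parameters are chosen for the example.

```python
def chunk_by_characters(text: str, size: int = 20, overlap: int = 5) -> list[str]:
    """Fixed-size character windows, each sharing `overlap` chars with the previous."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]


chunks = chunk_by_characters("Split text into semantically meaningful chunks.")
print(chunks)
```

Overlap keeps sentence fragments that straddle a boundary retrievable from both neighboring chunks, at the cost of some duplicated characters.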
Coming Soon
Enrichment
10-dimensional per-chunk and corpus-level feature enrichment for retrieval-optimized data.
Experimental · 51.85 cr
Dataset Intelligence
Corpus-level knowledge graph construction, ontology induction, and incremental dataset ingestion. Transforms pipeline outputs into entities, relations, graph embeddings, and ontological concepts.
View Documentation