API · Document Intelligence

Your documents,
answered in milliseconds.

ContextIQ extracts answers, summaries, and citations from any document — PDF, Markdown, URL — with model-switching and < 200ms p99 latency.

Start free — 1,000 pages/month View API docs

SOC 2 Type II · GDPR · No training on your data

Try ContextIQ

Claude Sonnet 4
 

Q4 Roadmap.pdf 24 pages

Question

What are the top 3 priorities in Q4?

Answer

 1. Ship the eval pipeline [p.4]

Complete the faithfulness + groundedness scorer before Oct 31 to unblock the Pro launch.
 2. Enterprise SSO launch [p.11]

SAML 2.0 and OIDC support required by three Fortune-500 pilots already contracted.
 3. Latency SLA reduction [p.17]

Reduce p99 from 340ms to <200ms via streaming + prefix caching across all 4 model backends.

187ms

Teams using ContextIQ

Arcadia Health
Stormfront Analytics
Brio Legal
Novela Publishing
Tangram AI
Cascade Finance

< 200ms p99 response time
99.97% uptime (12-mo avg)
4 model backends

Features

Built for AI products that ship to production.

Every feature is designed around the real constraints of shipping AI-powered applications — latency, reliability, observability, and data governance.

Multi-model switching

Choose GPT-4o, Claude Sonnet, Gemini Pro, or Mistral Large per request. No lock-in, no extra config.
Citation-aware answers

Every response traces to its source — page, paragraph, exact passage. Not hallucinated references.
Sub-200ms p99

Streaming plus caching mean your users never wait. Benchmark table available in the docs.
Eval suite built in

Score every response with faithfulness, relevance, and groundedness metrics. Catch regressions before prod.
Structured output

Get JSON, Markdown, or plain text. Schema-validated responses. Works with any downstream pipeline.
Webhooks + SDK

Node.js, Python, Go SDKs. Webhook on async jobs. OpenAPI spec for self-generated clients.

How it works

From document to answer in three steps.

Upload or link your document

PDF, URL, Markdown, or plain text. Up to 500 pages per request. We handle parsing, chunking, and embedding.

curl -X POST https://api.contextiq.dev/v1/ingest \
  -H "Authorization: Bearer $CIQ_KEY" \
  -F "file=@q4-roadmap.pdf"
# → { "doc_id": "doc_9f3k2", "pages": 24 }
Choose your model and output format

Switch models per request. Specify a JSON schema or get Markdown by default. Every call is stateless.

curl -X POST https://api.contextiq.dev/v1/query \
  -H "Authorization: Bearer $CIQ_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "doc_id": "doc_9f3k2",
    "question": "What are the top 3 priorities in Q4?",
    "model": "claude-sonnet-4",
    "format": "json"
  }'
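The same query can be assembled from Python with only the standard library. This is a minimal sketch, not an official SDK call: the helper name `build_query_request` is hypothetical, the endpoint and field names are copied from the curl example, and the `Content-Type: application/json` header is an assumption about what the API expects. The request is built but not sent.

```python
import json
import os
import urllib.request

API_BASE = "https://api.contextiq.dev/v1"  # base URL taken from the curl example


def build_query_request(doc_id, question, model="claude-sonnet-4", fmt="json", api_key=None):
    """Build (but do not send) a POST request for the /query endpoint."""
    # Fall back to the CIQ_KEY environment variable used in the curl example.
    key = api_key or os.environ.get("CIQ_KEY", "")
    payload = {"doc_id": doc_id, "question": question, "model": model, "format": fmt}
    return urllib.request.Request(
        url=f"{API_BASE}/query",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {key}",
            "Content-Type": "application/json",  # assumption: JSON request body
        },
        method="POST",
    )


# "test-key" is a placeholder, not a real credential.
req = build_query_request(
    "doc_9f3k2", "What are the top 3 priorities in Q4?", api_key="test-key"
)
print(req.get_method(), req.full_url)
print(json.loads(req.data))
```

To send it, pass the request to `urllib.request.urlopen(req, timeout=10)`. Switching models is a matter of changing the `model` argument on each call.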
Receive answers with citations

The endpoint returns a structured object: the answer, a citations array (page and source passage), the model used, and latency in milliseconds. Ready to pipe into your product.

{
  "answer": "The top 3 priorities are: ...",
  "citations": [
    { "page": 4, "text": "Priority 1: Ship eval pipeline..." },
    { "page": 11, "text": "Priority 2: Enterprise SSO launch..." },
    { "page": 17, "text": "Priority 3: Latency SLA reduction..." }
  ],
  "model": "claude-sonnet-4",
  "latency_ms": 187
}
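A response shaped like the sample above can be unpacked in a few lines. The sketch below assumes the fields shown in that sample (`answer`, `citations`, `latency_ms`); the `Citation` class and the helper names are hypothetical, chosen for illustration.

```python
import json
from dataclasses import dataclass


@dataclass
class Citation:
    page: int
    text: str


def parse_answer(raw):
    """Parse a /query response body into (answer, citations, latency_ms)."""
    body = json.loads(raw)
    citations = [Citation(page=c["page"], text=c["text"]) for c in body.get("citations", [])]
    return body["answer"], citations, body.get("latency_ms")


def page_markers(citations):
    """Render citations as inline page markers such as [p.4]."""
    return " ".join(f"[p.{c.page}]" for c in citations)


# Sample body copied from the response shown on this page.
SAMPLE = """
{
  "answer": "The top 3 priorities are: ...",
  "citations": [
    { "page": 4, "text": "Priority 1: Ship eval pipeline..." },
    { "page": 11, "text": "Priority 2: Enterprise SSO launch..." },
    { "page": 17, "text": "Priority 3: Latency SLA reduction..." }
  ],
  "model": "claude-sonnet-4",
  "latency_ms": 187
}
"""

answer, citations, latency_ms = parse_answer(SAMPLE)
print(page_markers(citations))  # [p.4] [p.11] [p.17]
print(latency_ms)  # 187
```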

Pricing

Start free. Scale when you're ready.

No credit card required to start. The free tier is generous enough for most side projects.

Free

$0 /month

For personal projects and experimentation.

Get started free

1,000 pages / month
3 model backends (GPT-4o, Gemini Pro, Mistral Large)
JSON + Markdown output
10 req/min rate limit
99.9% SLA
Community support

Pro

$49 /month

For production apps and growing teams.

Start Pro trial

50,000 pages / month
All 4 model backends
Citation API included
Eval suite access
200 req/min rate limit
99.95% SLA
Email support

Enterprise

Custom

For regulated industries and large-scale deployments.

Talk to us

Unlimited pages
Private deployment option
SSO + RBAC
Dedicated SLA + SLO
SOC 2 Type II audit reports
Custom rate limits
Slack + dedicated support

FAQ

Common questions.

Still have questions? Email us at landix.ninal@gmail.com or reach out on LinkedIn.

Is my data used to train your models?

No. ContextIQ processes documents ephemerally. We do not store your content beyond your retention setting (default: 24 hours) and we never use it to train or fine-tune any model. This is covered in our Data Processing Agreement (DPA), available on request.

Which AI models are available?

GPT-4o, Claude Sonnet 4, Gemini 1.5 Pro, and Mistral Large 2. The Free plan includes GPT-4o, Gemini Pro, and Mistral Large. The Pro and Enterprise plans include all four, plus priority access to new models as they launch.

Which cloud providers do you support?

Documents can be fetched from Amazon S3, Google Cloud Storage, Azure Blob Storage, or any public HTTPS URL. No vendor lock-in: S3-compatible storage such as Cloudflare R2 and Backblaze B2 works too.

What document formats are supported?

PDF, Markdown, plain text (.txt), HTML, DOCX (via our conversion pipeline), and raw URL crawl (we fetch and parse the page). Support for XLSX and PPTX is on the roadmap for Q3 2026.

What are the rate limits?

Free: 10 requests/min, 1,000 pages/month. Pro: 200 requests/min, 50,000 pages/month. Enterprise: custom — contact us for bespoke limits. Burst capacity is available on Pro and Enterprise.
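To stay under these limits on the client side, requests can be throttled before they are sent. The sketch below is a simple sliding-window limiter; the class name `SlidingWindowLimiter` is hypothetical, and it assumes the limit is counted over a rolling 60-second window, which may differ from how the server actually counts.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side throttle: at most max_requests per rolling window."""

    def __init__(self, max_requests, window_s=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_s = window_s
        self.clock = clock  # injectable so the limiter can be tested without sleeping
        self._sent = deque()  # timestamps of recent requests, oldest first

    def wait_time(self):
        """Seconds to wait before the next request is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_s:
            self._sent.popleft()
        if len(self._sent) < self.max_requests:
            return 0.0
        # Full window: wait until the oldest request expires.
        return self.window_s - (now - self._sent[0])

    def record(self):
        """Call once per request actually sent."""
        self._sent.append(self.clock())


# Free plan: 10 requests/min (from the limits above).
free_limiter = SlidingWindowLimiter(max_requests=10)
delay = free_limiter.wait_time()
if delay > 0:
    time.sleep(delay)
free_limiter.record()
```

For the Pro plan the same class would be constructed with `max_requests=200`.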

Do you offer a self-hosted version?

The Enterprise plan includes a containerized private-deployment option that runs in your own VPC. You retain full data sovereignty. Contact us for architecture details and pricing.

Get started

Start summarizing in 5 minutes.

Free tier. No credit card. API key delivered instantly.

Get your API key Read the docs

SOC 2 Type II · GDPR · Used by teams at Arcadia Health, Brio Legal, and more
