ContextIQ
API · Document Intelligence

Your documents,
answered in milliseconds.

ContextIQ extracts answers, summaries, and citations from PDFs, Markdown files, and URLs, with per-request model switching and p99 latency under 200ms.

SOC 2 Type II · GDPR · No training on your data
Try ContextIQ
Claude Sonnet 4
Q4 Roadmap.pdf · 24 pages

Question

What are the top 3 priorities in Q4?

Answer

1. Ship the eval pipeline [p.4]
Complete the faithfulness + groundedness scorer before Oct 31 to unblock the Pro launch.

2. Enterprise SSO launch [p.11]
SAML 2.0 and OIDC support required by three Fortune-500 pilots already contracted.

3. Latency SLA reduction [p.17]
Reduce p99 from 340ms to <200ms via streaming + prefix caching across all 4 model backends.

187ms

Teams using ContextIQ

  • Arcadia Health
  • Stormfront Analytics
  • Brio Legal
  • Novela Publishing
  • Tangram AI
  • Cascade Finance

Performance benchmarks

  • < 200ms p99 response time
  • 99.97% uptime (12-mo avg)
  • 4 model backends

Features

Built for AI products that ship to production.

Every feature is designed around the real constraints of shipping AI-powered applications — latency, reliability, observability, and data governance.

  • Multi-model switching

    Choose GPT-4o, Claude Sonnet, Gemini Pro, or Mistral Large per request. No lock-in, no extra config.

  • Citation-aware answers

    Every response traces to its source — page, paragraph, exact passage. Not hallucinated references.

  • Sub-200ms p99

    Streaming and prefix caching keep p99 response time under 200ms. Benchmark table available in the docs.

  • Eval suite built in

    Score every response with faithfulness, relevance, and groundedness metrics. Catch regressions before prod.

  • Structured output

    Get JSON, Markdown, or plain text. Schema-validated responses. Works with any downstream pipeline.

  • Webhooks + SDK

    Node.js, Python, and Go SDKs. Webhooks for async jobs. OpenAPI spec for generating your own clients.
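As a rough illustration of citation-aware, structured output, the sketch below checks a response for the fields shown in the sample response on this page and renders each citation as a page-tagged line. The helper names `check_response` and `format_citations` are hypothetical and not part of any ContextIQ SDK; the field names are taken from the sample response only, so the real schema may contain more.

```python
import json

# Sample payload copied from the example response on this page.
SAMPLE = json.loads("""
{
  "answer": "The top 3 priorities are: ...",
  "citations": [
    {"page": 4, "text": "Priority 1: Ship eval pipeline..."},
    {"page": 11, "text": "Priority 2: Enterprise SSO launch..."},
    {"page": 17, "text": "Priority 3: Latency SLA reduction..."}
  ],
  "model": "claude-sonnet-4",
  "latency_ms": 187
}
""")

# Assumption: these are the fields a client relies on (taken from the sample only).
REQUIRED_KEYS = ("answer", "citations", "model", "latency_ms")


def check_response(response: dict) -> None:
    """Raise ValueError if a field from the sample response is missing."""
    missing = [key for key in REQUIRED_KEYS if key not in response]
    if missing:
        raise ValueError(f"response is missing fields: {missing}")


def format_citations(response: dict) -> list[str]:
    """Render each citation as '[p.N] quoted passage'."""
    return [f"[p.{c['page']}] {c['text']}" for c in response["citations"]]


check_response(SAMPLE)
for line in format_citations(SAMPLE):
    print(line)
```

Keeping the check separate from the rendering lets a pipeline reject a malformed response before any citation reaches the user.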

How it works

From document to answer in three steps.

  1. Upload or link your document

    PDF, URL, Markdown, or plain text. Up to 500 pages per request. We handle parsing, chunking, and embedding.

    curl -X POST https://api.contextiq.dev/v1/ingest \
      -H "Authorization: Bearer $CIQ_KEY" \
      -F "file=@q4-roadmap.pdf"
    # → { "doc_id": "doc_9f3k2", "pages": 24 }
  2. Choose your model and output format

    Switch models per request. Specify a JSON schema or get Markdown by default. Every call is stateless.

    curl -X POST https://api.contextiq.dev/v1/query \
      -H "Authorization: Bearer $CIQ_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "doc_id": "doc_9f3k2",
        "question": "What are the top 3 priorities in Q4?",
        "model": "claude-sonnet-4",
        "format": "json"
      }'
  3. Receive answers with citations

    The endpoint returns a structured object: the answer, a citations array, the model used, and the request latency. Ready to pipe into your product.

    {
      "answer": "The top 3 priorities are: ...",
      "citations": [
        { "page": 4, "text": "Priority 1: Ship eval pipeline..." },
        { "page": 11, "text": "Priority 2: Enterprise SSO launch..." },
        { "page": 17, "text": "Priority 3: Latency SLA reduction..." }
      ],
      "model": "claude-sonnet-4",
      "latency_ms": 187
    }
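For teams not using curl, the query step can be approximated with Python's standard library. This is a minimal sketch, not the official Python SDK: the endpoint, field names, and `CIQ_KEY` variable come from the curl examples above, while the function names and the `Content-Type: application/json` header are assumptions.

```python
import json
import urllib.request

API_BASE = "https://api.contextiq.dev/v1"  # base URL from the curl examples


def build_query_request(api_key, doc_id, question,
                        model="claude-sonnet-4", fmt="json"):
    """Build (but do not send) a POST request for the /query endpoint."""
    payload = {
        "doc_id": doc_id,
        "question": question,
        "model": model,
        "format": fmt,
    }
    return urllib.request.Request(
        f"{API_BASE}/query",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            # Assumption: the API expects a JSON request body.
            "Content-Type": "application/json",
        },
        method="POST",
    )


def run_query(request, timeout=10.0):
    """Send the request and decode the JSON reply (needs network and a valid key)."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.load(resp)


# Usage (not executed here because it requires a real API key):
#   import os
#   req = build_query_request(os.environ["CIQ_KEY"], "doc_9f3k2",
#                             "What are the top 3 priorities in Q4?")
#   result = run_query(req)
#   print(result["answer"])
```

Splitting request construction from sending keeps each call stateless, as described in step 2, and makes the payload easy to inspect in tests without touching the network.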

Pricing

Start free. Scale when you're ready.

No credit card required to start. The free tier is generous enough for most side projects.

Free

$0 /month

For personal projects and experimentation.

Get started free
  • 1,000 pages / month
  • 3 model backends (GPT-4o, Gemini Pro, Mistral Large)
  • JSON + Markdown output
  • 10 req/min rate limit
  • 99.9% SLA
  • Community support

Enterprise

Custom

For regulated industries and large-scale deployments.

Talk to us
  • Unlimited pages
  • Private deployment option
  • SSO + RBAC
  • Dedicated SLA + SLO
  • SOC 2 Type II audit reports
  • Custom rate limits
  • Slack + dedicated support

FAQ

Common questions.

Still have questions? Email us at landix.ninal@gmail.com or reach out on LinkedIn.

Is my data used to train your models?
No. ContextIQ processes documents ephemerally. We do not store your content beyond your retention setting (default: 24 hours) and we never use it to train or fine-tune any model. This is covered in our Data Processing Agreement (DPA), available on request.
Which AI models are available?
GPT-4o, Claude Sonnet 4, Gemini 1.5 Pro, and Mistral Large 2. The Free plan includes GPT-4o, Gemini Pro, and Mistral Large. The Pro and Enterprise plans include all four plus priority access to new models as they launch.
Which cloud providers do you support?
Documents can be fetched from Amazon S3, Google Cloud Storage, Azure Blob Storage, or any public HTTPS URL. There is no vendor lock-in: any S3-compatible store, including Cloudflare R2 and Backblaze B2, also works.
What document formats are supported?
PDF, Markdown, plain text (.txt), HTML, DOCX (via our conversion pipeline), and raw URL crawl (we fetch and parse the page). Support for XLSX and PPTX is on the roadmap for Q3 2026.
What are the rate limits?
Free: 10 requests/min, 1,000 pages/month. Pro: 200 requests/min, 50,000 pages/month. Enterprise: custom limits, available on request. Burst capacity is available on Pro and Enterprise.
Do you offer a self-hosted version?
The Enterprise plan includes a containerized private-deployment option that runs in your own VPC. You retain full data sovereignty. Contact us for architecture details and pricing.
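To stay under the published request limits (10 requests/min on Free, 200 on Pro), a client can throttle itself before calling the API. The sketch below is one possible client-side approach using a sliding window. How the server actually counts requests (fixed or sliding window, what it returns when the limit is exceeded) is not documented here, so treat the window logic as an assumption; the class name `SlidingWindowLimiter` is hypothetical.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_requests` calls in any `window_s`-second window."""

    def __init__(self, max_requests=10, window_s=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_s = window_s
        self.clock = clock          # injectable so the limiter can be tested
        self._sent = deque()        # timestamps of recent requests, oldest first

    def wait_time(self):
        """Seconds until the next request is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_s:
            self._sent.popleft()
        if len(self._sent) < self.max_requests:
            return 0.0
        return self.window_s - (now - self._sent[0])

    def record(self):
        """Note that a request is being sent now."""
        self._sent.append(self.clock())

    def acquire(self, sleep=time.sleep):
        """Block until a request slot is free, then claim it."""
        delay = self.wait_time()
        if delay > 0:
            sleep(delay)
        self.record()


# Free-tier defaults: 10 requests per 60 seconds.
limiter = SlidingWindowLimiter(max_requests=10, window_s=60.0)
limiter.acquire()  # call this before each API request
```

For the Pro limit, construct the limiter with `max_requests=200`; the rest of the logic is unchanged.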

Get started

Start summarizing in 5 minutes.

Free tier. No credit card. API key delivered instantly.

SOC 2 Type II · GDPR · Used by teams at Arcadia Health, Brio Legal, and more