Jump to Content
Not Diamond
DocumentationAPI Reference
Log InNot Diamond
Documentation
Log In
DocumentationAPI Reference

Getting started

  • What is Not Diamond?
  • Demo quickstart
  • Chat with Not Diamond
  • API keys
  • LLM models
  • Model gateway
  • Pricing
  • Support
  • Open source

Core features

  • Training a custom router
  • Cost and latency tradeoffs
  • `model_select` vs. `create`
  • Function calling
  • Structured outputs
  • Joint prompt optimization
  • Preference IDs
  • Personalized routing with feedback
  • Reliability, fallbacks, and load-balancing

RAG Auto-Optimization

  • Introduction
  • Creating test data
  • Optimize retrieval pipelines
  • Evaluating RAG applications
  • Evaluation metrics
  • Embedding models

Tutorials and examples

  • Chat
  • Strong vs weak LLM routing
  • RAG workflows with LlamaIndex
  • Agent workflows
  • LLM evaluation tutorial
  • SAMMO tutorial
  • DSPy tutorial
  • Human-in-the-loop routing
  • Routing between RAG agents

Use cases

  • Customer support
  • Legal summarization
  • Code documentation

Clients and libraries

  • API reference
  • Node library
  • Python Modules

How Not Diamond works

  • How Not Diamond works
  • Benchmark performance
  • Privacy, security, and local deployments

Integrations

  • OpenAI integration
  • OpenRouter integration
  • LangChain integration
  • LiteLLM