Magisterium AI

Pricing

Preview

Chat Completions

Rate Limit
5 RPM
Total Context
64K
Max Output
65,536
Input Price
Free
System Price
Free
Output Price
Free

News

Rate Limit
5 RPM
Price
Free

Pay-as-you-go

Chat Completions

Rate Limit
3,000 RPM
Total Context
64K
Max Output
65,536
Input Price
$0.50 / mtok
System Price
$0.70 / mtok
Output Price
$2.00 / mtok

News

Rate Limit
5 RPM
Price
Free during beta

Custom Plan

Need custom rate limits or hands-on support? Reach out to the Magisterium AI team for:

  • Magisterium AI-supported onboarding
  • Custom rate limits
  • Billing via monthly invoices
  • Prompting support
  • Deployment support

How are input, system, and output tokens calculated?

Example Usage:
User Input
“What is the meaning of life?”
6 tokens
System
Reading Evangelium Vitae, Dignitatis Humanae
1,930 tokens
Output
The meaning of life is ...
938 tokens

There are three components that make up the pricing:

  1. User question/prompt: The initial question sent via the API request. This counts towards your input token usage.
  2. System processes: Upon receiving the prompt, Magisterium AI utilizes various mechanisms to intelligently handle the query, including classification, database research, and additional tooling to ensure an accurate answer. These operations involve system tokens, with their usage varying based on the query’s complexity and depth of answer required.
  3. AI-generated response: The final output given in the API response, which includes the AI-generated answer, counts towards your output tokens.

The API console usage page allows you to track exactly how many input and output tokens are being consumed by system processes in addition to the more straightforward input and output costs associated with user input and answer generation. The pricing for input and output tokens consumed by system processes is the same as the standard pricing.

Visit the Playground to experiment with tokens consumption.