Model Detail
Inception: Mercury
—Inception: Mercury is a large language model released by Inception. And supports text->text inputs.
Inception: Mercury is priced at $0.25/M input tokens and $0.75/M output tokens. Operationally the model offers a 128K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.
The published knowledge cutoff is 2025-01-31, so newer events will not be reflected in zero-shot answers without retrieval.
Inception: Mercury is best fit for general-purpose chat and instruction-following workloads, and high-volume batch jobs where per-call cost dominates the budget. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.