Arcee AI: Virtuoso Large

arcee-ai/virtuoso-large

Description

Virtuoso-Large is Arcee's top-tier general-purpose LLM at 72 B parameters, tuned to tackle cross-domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k context inherited from Qwen 2.5, letting it ingest books, codebases or financial filings wholesale. Training blended DeepSeek R1 distillation, multi-epoch supervised fine-tuning and a final DPO/RLHF alignment stage, yielding strong performance on BIG-Bench-Hard, GSM-8K and long-context Needle-In-Haystack tests. Enterprises use Virtuoso-Large as the "fallback" brain in Conductor pipelines when other SLMs flag low confidence. Despite its size, aggressive KV-cache optimizations keep first-token latency in the low-second range on 8x H100 nodes, making it a practical production-grade powerhouse.

How this model compares

Overall covers the full catalog. By plan covers only models available on that tier (same rules as available models in your list). Position on min–average–max. Prices use the higher of prompt or completion per token, shown per 1M tokens.

Price (per 1M tokens)

Min
Max
This model
336 models in this groupPrice (per 1M tokens)
Min
$0.04
Avg
$12.385886
Max
$750.00
This model: $1.20 / 1M tokens

Context length (tokens)

Min
Max
This model
336 models in this groupContext length (tokens)
Min
4,095 tokens
Avg
382,115.467 tokens
Max
10,000,000 tokens
This model: 131,072 tokens

Capabilities

Text → TextContext: 131,072 tokens
Input:
Text
Output:
Text
    Arcee AI: Virtuoso Large