Drise

Rate Limits & Quotas

Prompt limits per plan and the 5-hour window. All model variants are FP8-quantised.

Drise uses prompt-based quotas per 5-hour rolling window. Limits reset every 5 hours. All five GLM 5.2 model variants (see Models) share the same per-plan quota.

Per-plan limits

PlanPricePrompts per 5 hours
Free$030
Pro$15/mo1,500
Max$25/mo3,000
Turbo$50/mo5,000

Model-specific notes

  • drise-glm-5.2 and drise-glm-5.2-fast expose a 1,000,000-token context window.
  • drise-glm-5.2-short and drise-glm-5.2-short-fast use a 200,000-token context window for focused tasks.
  • drise-vision accepts image and text inputs.
  • All variants are FP8-quantised for cheaper, faster inference.

Notes

  • Higher tiers get higher routing priority. See Routing & Failover.
  • No per-token math. No surprise overages.
  • Upgrade or downgrade anytime, no lock-in.

On this page