You deploy and run Amplify GenAI yourself in your own Amazon Web Services account. You pay for the Amazon Web Services costs and the token costs for the models you use.
Case Study: Vanderbilt University currently provides unlimited usage of the platform to 1,300 pilot users. In April of 2024, the Vanderbilt cost per user was around $3/mo, including Amazon Web Services costs. The largest part of the bill was not token costs. The largest token cost was GPT-4 Turbo usage. For some organizations, per user pricing may be more cost effective. For Vanderbilt, paying for actual usage was far more cost effective. 50% of Vanderbilt users consumed less than $1/mo in token costs.