Amazon SageMaker
Run Deepgram inside your AWS account as a managed SageMaker AI Endpoint — with native AWS integrations, hourly billing, and a 14-day free trial.
Run Deepgram inside your AWS account as a managed SageMaker AI Endpoint — with native AWS integrations, hourly billing, and a 14-day free trial.
Amazon SageMaker is a managed cloud platform from Amazon Web Services (AWS) that enables deployment of Deepgram as a managed, container-based service. Once you deploy Deepgram as a SageMaker Model Endpoint, you can run inference against the service using the Amazon SageMaker AI Software Development Kit (SDK).
The Deepgram SDKs can also target a SageMaker Endpoint through the SageMaker transport, so you can keep the same client-side request and response patterns whether you call the Deepgram-hosted API or your own SageMaker deployment.
Deepgram on SageMaker is the fastest path to running Deepgram inside your own AWS account. Compared to self-hosting Deepgram on Docker or Kubernetes, SageMaker trades some flexibility for a managed endpoint that AWS operates on your behalf.
While SageMaker covers most production scenarios, AWS imposes a small number of platform-specific constraints — for example, callback URLs and external file URL ingestion are not supported. Review the full list in the Limitations section of the deployment guide before choosing SageMaker.
Most customers can stand up a ready-to-use endpoint in minutes through one of two paths:
Deepgram on SageMaker is billed hourly per instance type. Current rates are listed on the AWS Marketplace product pages for each Deepgram model.
For larger or longer-term deployments, AWS Marketplace Private Offers are available with negotiated unit economics and committed-use terms. Contact your AWS account team or Deepgram representative to start a Private Offer.
A 14-day free trial is available with unlimited product usage and zero Deepgram license charges during the trial window. Each trial is available once per AWS account per product. Contact a Deepgram representative if you need additional time for testing.
Infrastructure charges are set by AWS and billed separately from Deepgram license charges. Public pricing for SageMaker Real-Time Inference is available at aws.amazon.com/sagemaker/ai/pricing. For volume discounts or committed-use pricing on the underlying compute, contact your AWS Sales Representative.