Deploying Deepgram on AWS requires some preparation. In this section, you will learn how to expose and access your application in a secure and stable manner. You will need to perform some of these steps in the AWS Management Console and some in your local terminal.
Make sure you have completed the requirements in the On-prem Introduction.
AWS uses public-key cryptography to secure login information for your instance. A Linux instance has no password; you use a key pair to log in to your instance securely. For the following steps, be sure to use a non-root AWS account.
If you don’t already have an Amazon EC2 key pair, you will need to create one in order to access the AWS EC2 Virtual Machine. To learn how, read Create a key pair using Amazon EC2 in Amazon’s documentation.
Key pairs must be created in each AWS region in which you will deploy Deepgram on-premises.
At the end of this process, your browser should download a private-key.pem file for your key pair. Move this file to a secure and memorable location.
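SSH clients generally refuse to use a private key whose file permissions let other users read it, so it is worth restricting the file once you have moved it. The following is a minimal sketch, assuming a Unix-like shell; the secure_key helper name is hypothetical and is not part of the Deepgram or AWS tooling.

```shell
# Restrict a private key so that only its owner can read it.
# secure_key is a hypothetical helper name used for illustration.
secure_key() {
  key_file="$1"
  if [ ! -f "$key_file" ]; then
    echo "Key file not found: $key_file" >&2
    return 1
  fi
  # 400 = read-only for the owner, no access for anyone else.
  chmod 400 "$key_file"
}
```

For example, you might run secure_key /path/to/private-key.pem after moving the key to its final location.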
To begin your on-prem installation with AWS as your cloud service provider, you need to create an Amazon EC2 instance.
The steps in this section utilize the new AWS launch experience. You may need to adjust the ordering if you are using the old AWS launch experience.
Navigate to the EC2 Dashboard and confirm that the proper AWS Region is configured, then choose Launch Instance to open the wizard.
For the Name and tags wizard step, type
For the Application and OS Images (Amazon Machine Image) wizard step, choose a basic configuration to serve as a template for your instance:
- Pick an Amazon Machine Image (AMI) from our list of recommended Linux distributions.
- Search for your desired configuration, such as Ubuntu Server 22.04 LTS (HVM), SSD Volume Type
- Ensure the 64-bit (x86) option is selected.
- Choose Select.
For the Choose an Instance Type wizard step, search for and select g4dn.2xlarge, or another GPU instance type that meets our minimum required hardware specifications.
For the Key pair (login) wizard step, search for and select the key pair you created in the Create an Amazon EC2 Key Pair section. Alternatively, you can select Create new key pair; ensure you download the private key and configure it properly on your local machine in order to access your server.
For the Network settings wizard step, select Create security group, check the box Allow SSH traffic from anywhere, and ensure Auto-assign public IP is enabled. This will allow you to log in to your server remotely.
If you want to be able to receive requests from the public internet, make sure to also check the box Allow HTTPS traffic from the internet. If you are colocating your Deepgram on-prem deployment with other services and do not need to expose your server to the public internet, you can find configuration options in the Amazon networking documentation.
- For the Configure storage wizard step, change the root volume size to meet our minimum required hardware specifications.
- Verify that the summary box contains all the right details for your instance, and click
- Once the instance successfully launches, you need to find its Public IPv4 DNS entry. Either select the link to the instance and find the Public IPv4 DNS column, or click on Connect to instance and follow the instructions under the SSH client tab. This will be the AWS_HOSTNAME value that is referenced later.
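If you have the AWS CLI installed and configured, the same hostname can also be looked up from your terminal instead of the console. This is a sketch under assumptions: the public_dns function name is hypothetical, the instance ID in the usage line is a placeholder, and the CLI must already be authenticated for the region your instance runs in.

```shell
# Print the Public IPv4 DNS name of an instance using the AWS CLI.
# public_dns is a hypothetical helper; pass your own instance ID.
public_dns() {
  aws ec2 describe-instances \
    --instance-ids "$1" \
    --query 'Reservations[].Instances[].PublicDnsName' \
    --output text
}
```

You could then capture the value with AWS_HOSTNAME=$(public_dns i-0123456789abcdef0), substituting the ID of the instance you launched.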
To complete the rest of the installation, including configuring your environment and transferring files between your local computer and your AWS instance, you must connect to the AWS EC2 instance that you launched.
Open the terminal application for your computer.
Connect to your AWS instance:
ssh -i /path/to/private-key.pem ubuntu@AWS_HOSTNAME
Be sure to replace the AWS_HOSTNAME placeholder value with the hostname for your instance. Also check that the path to your private-key.pem file is correct.
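To avoid retyping the placeholders, the connection command can be wrapped in a small shell function. This is only an illustrative sketch: connect_instance is a hypothetical name, and the ubuntu default matches the command above but may not apply to other AMIs.

```shell
# Open an SSH session to the instance.
# connect_instance is a hypothetical helper. The third argument is
# optional and falls back to "ubuntu", the user in the command above.
connect_instance() {
  host="$1"
  key_file="$2"
  user="${3:-ubuntu}"
  ssh -i "$key_file" "$user@$host"
}
```

With the hostname stored in a shell variable, you would call connect_instance "$AWS_HOSTNAME" /path/to/private-key.pem.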
For AWS Ubuntu instances, the default username is ubuntu, as used in the command above. If you are using a different AMI, the default username will be different; see the Connect page on the instance for more details.
If you are on a Windows machine, alternatives to ssh include PuTTY and the AWS EC2 Instance Connect service.
- If you receive a message indicating that the authenticity of the host can’t be established, type yes, then press the Enter key on your keyboard. This message should only appear the first time you connect to the instance.
Now that we have provisioned a deployment environment, we need to start configuring it for inference.