There are a few steps you’ll need to take to prepare your AWS account and the tools and files you need before starting the AWS Glue hand-on labs:
Let’s login to AWS Console, then following below step:
In this step, we will create new S3 bucket with folder and sub-folders for storing data source, data target, script job, etc. But first of all, we will just create S3 bucket.
aws-glue-labs-bucket
Next, we will create folder for storing data source. We will:
raw_data/
for storing source/raw data.raw_data/customer_database
raw_data/customer_database/customers_csv
raw_data/customer_database/customers_csv/client_created_date=20240423
client_created_date=20240423
raw_data
Repeat the steps above, and you will have:
raw_data/customer_database
raw_data/customer_database/customers_csv
raw_data/customer_database/customers_csv/client_created_date=20240423
To upload data into folder client_created_date=20240423
:
client_created_date=20240423
, then click Upload
As you can see, data file is uploaded successfully into S3. Next, we will create IAM Role for AWS Glue
AWSGlueServiceRole
and S3FullAccess
from Attach Permissions Policiesglue-full-access
In the next section, we’ll hands-on AWS Glue on the console.