Create crawler for Chplay-gold layer

Create crawler for gold layer

For quick and focused setup, everyone should create the main crawler for the gold layer, and do the same for the rest!

  1. Access AWS Glue interface
    • Select Crawlers
    • Select Create crawler

Create VPC

  1. Enter:
    • Name: chplay-gold
    • Description: Data after processed
    • Select Next

Create VPC

  1. Select Add a data source
    • Select the correct S3 folder: chplay-gold
    • Select Next

Create VPC

  1. Select appropriate IAM role
    • Select Next

Create VPC

  1. From AWS Glue interface
    • Select Databases
    • Select Create database
    • Name: chplay-gold
    • Select Create database

Create VPC

  1. In the set Output section
    • Select Target database: chplay-gold
    • Select Next

Create VPC

Note:

You can choose folder hierarchy using the prefix option

  1. Review everything once more

Create VPC

  1. Click Run crawler

Create VPC

⇒ At this point, AWS Glue Crawler has created a catalog for 2 tables: app_details and app_reviews