DZone

AWS Glue Logo

AWS Glue is a powerful tool that is managed, relieving you of the hassle associated with maintaining the infrastructure. It is hosted by AWS and offers Glue as Serverless ETL, which converts the code into Python/Scala and executes it in a Spark environment.

AWS Glue provisions all the required resources (Spark cluster) at runtime to execute the Spark Job which takes ~7-10 mins and then starts executing your actual ETL code. To reduce this time, AWS Glues provides a development endpoint, which can be configured in Apache Zeppelin (provisioned with the spark environment) to interactively, run, debug and test ETL code before deploying as Glue job or scheduling the ETL process.

Source: DZone