As part of this blog post, I shall discuss how we went about setting up Athena to query our JSON data.Īmazon Athena is an interactive query engine that makes it easy to analyze data in Amazon S3. Amazon Athena responds anywhere from few seconds to minutes for data than runs into hundreds of GBs and has pleasantly surprised us by its ease of use. The cost will vary from region to region, but we need to consider the data return and scanned factors while running queries.Querying Hundreds of GBs of JSON data with Amazon AthenaĪt DeltaX we have been using Amazon Athena as part of our data pipeline for running ad-hoc queries and analytic workloads on logs collected through our tracking and ad-serving system. Tiered price for: 50 GB 50 GB per month x 0.0230000000 USD = 1.15 USD Total tier cost = 1.1500 USD (S3 Standard storage cost) 100,000 SELECT requests in a month x 0.0000004 USD per request = 0.04 USD (S3 Standard SELECT requests cost) 20 GB per month x 0.0007 USD = 0.014 USD (S3 Select returned cost) 50 GB per month x 0.002 USD = 0.10 USD (S3 Select scanned cost) 1.15 USD + 0.04 USD + 0.014 USD + 0.10 USD = 1.30 USD (Total S3 Standard Storage, data requests, S3 Select cost) S3 Standard cost (monthly): 1.30 USD The approximate cost in this scenario has been worked out in the section below (as of Dec 2020).
0 Comments
Leave a Reply. |