Amazon Athena Reviews

Name: Amazon Athena
Brand: Amazon Web Services (AWS)
Rating: 3.9 (9 reviews)

88% willing to recommend

What is Amazon Athena?

Amazon Athena is a serverless, interactive query service for analyzing data in Amazon S3 using SQL. It efficiently supports data lake architectures and offers features for diverse data formats without needing extensive infrastructure. Athena's integration with AWS Glue enhances schema management.

Amazon Athena is ranked #7 among top Search as a Service vendors. PeerSpot users give Amazon Athena an average rating of 7.8 out of 10. Amazon Athena is most commonly compared to Elastic Search. Amazon Athena is popular among large enterprises, which account for 59% of users researching this solution on PeerSpot. The top industry researching this solution is financial services, accounting for 17% of all views.

Featured Amazon Athena reviews

Ciro Baldim Guerra

Sr Analytics Engineer at Itau Unibanco S.A.

I think there is room for improvement in Amazon Athena, and the first thing I would mention is data output. I use Python to query Amazon Athena, and it is complex and difficult just to save Amazon Athena results as an Excel file. One option is copying the data, but you can't copy more than 100 lines, and pasting into Excel works badly. The other option is downloading a CSV file, but the CSV file is not UTF-8 Unicode. Here in Brazil, we speak Portuguese, and there are a lot of special characters in words and even names, so everything gets garbled when you put it in a CSV. You have to decode and re-encode, and there are a lot of problems. Athena could easily offer an XLSX export, since there are many engines that can produce one. Another point I would mention is the word completion. When I'm writing statements and queries, Amazon Athena tries to help me write the code, and that's very problematic. Sometimes I'm using tables that I use every day, and Amazon Athena doesn't pick up those tables and suggests very improbable ones instead. I have access to more than 30 databases and hundreds of tables. The word completion makes coding slower, and every time I have to press escape to skip the completion. Word completion works very well in other applications, such as VS Code, but in Athena it is ineffective, so I disable it.
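The garbled special characters described above are often a symptom of Excel guessing the wrong encoding for a CSV file that has no byte-order mark. The following is a minimal sketch of the decode and re-encode step, not a confirmed fix for Athena's console export: it assumes the downloaded bytes are in a known encoding (UTF-8 by default, which the review does not confirm), and the function name `to_excel_friendly_csv` is hypothetical.

```python
def to_excel_friendly_csv(raw_bytes: bytes, source_encoding: str = "utf-8") -> bytes:
    """Re-encode CSV bytes as UTF-8 with a byte-order mark (BOM).

    Assumption: the caller knows the source encoding. Excel generally
    detects UTF-8 reliably only when the file starts with a BOM.
    """
    # Decode using the assumed source encoding; drop a BOM if one is already present.
    text = raw_bytes.decode(source_encoding).lstrip("\ufeff")
    # "utf-8-sig" writes the BOM followed by ordinary UTF-8 bytes.
    return text.encode("utf-8-sig")
```

If the source file turns out to use a different encoding, passing it as `source_encoding` (for example `"latin-1"`) is the only change needed.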

Ashwin Kolgaonkar

Data Engineer at ZiMetrics

* Complex queries & large joins: Performance degrades compared to analytics-optimized warehouses like Redshift when doing heavy joins, aggregations over very large datasets, or extensive transformations. In those cases, we still need ETL / Spark / Glue jobs.
* Transaction / ACID support: Limited unless using table formats like Apache Iceberg. If you require updates, merges, deletes at scale, Athena alone may not suffice.
* Cost risk: Athena charges per terabyte scanned ($5 per TB by default), so poor query design (e.g. selecting many columns, not using partitions, scanning raw text) can drive up costs.
* Concurrency limits: By default, you can run about 20 concurrent queries per account per region. If workload spikes, you may hit throttling or queued queries. Increasing limits requires an AWS request.
* Transformation limitations: Athena is built for querying, not for heavy data transformation or streaming data ingestion. For complex transformations, Glue or Spark remain necessary.

Wojciech Doganowski

Solutions Architect & PMO at AS TV Play Baltics/TV3 Group

I don't have any specific answer on how Amazon Athena can be improved. This integration is more on the Glue side rather than on Amazon Athena, I would guess. Nothing comes to my mind here. In terms of its integration capabilities, I would say it's not straightforward. It works, but it's a little bit tricky.

Amazon Athena mindshare

As of June 2026, the mindshare of Amazon Athena in the Search as a Service category stands at 4.8%, down from 9.5% the previous year, according to calculations based on PeerSpot user engagement data.

Search as a Service Mindshare Distribution
Product	Mindshare (%)
Amazon Athena	4.8%
Elastic Search	17.2%
Xapien	12.0%
Other	66.0%

PeerResearch reports based on Amazon Athena reviews

Type	Title	Date
Category	Search as a Service	Jun 23, 2026
Product	Reviews, tips, and advice from real users	Jun 23, 2026
Comparison	Amazon Athena vs Elastic Search	Jun 23, 2026
Comparison	Amazon Athena vs Algolia	Jun 23, 2026
Comparison	Amazon Athena vs Amazon OpenSearch Service	Jun 23, 2026

Valuable Features

"Amazon Athena's ability to query structured and unstructured data has been beneficial."
"The best feature of Amazon Athena is that we can use Glue to build the schema from the data and then we can query the data directly on S3."
"Athena is serverless, so we don’t have to provision or manage compute clusters, and we can simply point Athena at our data in S3 and run SQL queries immediately."

Room for Improvement

"Transaction support is one of the biggest missing features."
"In terms of its integration capabilities, I would say it's not straightforward. It works, but it's a little bit tricky."
"I use Python to query in Amazon Athena, and it's very complex and difficult just to save Amazon Athena results as an Excel file."

Pricing

"Athena is very inexpensive for being a cloud tool."
"I am happy with what they are charging and how they charge it, especially because they charge you per query, and not per series."
"It doesn't cost much if you are already part of the AWS ecosystem."

Review data by company size

By reviewers
Company Size	Count
Small Business	4
Midsize Enterprise	3
Large Enterprise	2

By visitors reading reviews
Company Size	Count
Small Business	33
Midsize Enterprise	15
Large Enterprise	69

Top industries

By visitors reading reviews

Financial Services Firm	17%
Manufacturing Company	13%
Computer Software Company	10%

Outsourcing Company

Government

Healthcare Company

Retailer

Comms Service Provider

University

Marketing Services Firm

Construction Company

Consumer Goods Company

Educational Organization

Logistics Company

Performing Arts

Transportation Company

Real Estate/Law Firm

Religious Institution

Insurance Company

Energy/Utilities Company

Wholesaler/Distributor

Leisure / Travel Company

Media Company

Aerospace/Defense Firm

International Affairs Institute

Recreational Facilities/Services Company

Learn more about Amazon Athena

Amazon Athena leverages a serverless architecture to provide scalable, cost-effective query capabilities for large datasets stored in Amazon S3. With native support for Parquet and Avro, it efficiently manages both structured and unstructured data. Its federated query functionality allows access to varied data sources, while database partitioning optimizes performance and cost. Integration with AWS Glue simplifies schema building and streamlines data querying, although it faces challenges with ease of use, transaction support, and third-party integrations. Performance optimization is needed for complex queries and handling large datasets, while API capabilities and scheduling features could be improved. Users benefit from cost-saving efficiencies in data processing and the ability to extract quick insights through SQL queries, fostering more agile data-driven decisions.

What are the most important features of Amazon Athena?

Database Partitioning: Enhances query performance and reduces data retrieval costs.
Federated Queries: Accesses data across multiple sources seamlessly.
Supports Diverse Formats: Handles data in Parquet, Avro, and more.
Serverless Architecture: Offers automatic scaling for fluctuating workloads.
Glue Integration: Simplifies schema management and data organization.
Columnar Storage Optimization: Speeds up query processing and lowers costs.
Data Quality Identification: Detects inconsistencies and errors proactively.

What benefits and ROI should be expected from Amazon Athena?

Scalability: Automatically adjusts to processing demands, eliminating the need for infrastructure management.
Cost Efficiency: Only pay for queries run, making it budget-friendly for varying workloads.
Quick Insights: Facilitates swift SQL-based analysis, improving decision-making timelines.
Comprehensive Data Handling: Manages both structured and unstructured data seamlessly.
Enhanced Query Performance: Utilizes partitioning and columnar storage for optimized speed and cost.

In sectors such as finance, retail, and technology, Amazon Athena is utilized for data lake management where voluminous structured and unstructured data exists. Businesses create dashboards, automate workflows, and execute ad-hoc analyses efficiently. Its integration with Lake Formation and Glue supports complex industry-specific data tasks, ensuring streamlined data operations.

Amazon Athena customers

bp, Cerner, Expedia, FINRA, Hess, Intuit, Kellogg's, Philips, TIME, Workday

Product Categories

Search as a Service

Popular Comparisons

Elastic Search vs Amazon Athena

Amazon OpenSearch Service vs Amazon Athena

Glean Platform vs Amazon Athena

Azure AI Search vs Amazon Athena

Amazon Kendra vs Amazon Athena

Amazon AWS CloudSearch vs Amazon Athena

Solr vs Amazon Athena

Amazon Athena Reviews Summary
Author info	Rating	Review Summary
Sr Analytics Engineer at Itau Unibanco S.A.	5.0	I use Amazon Athena daily for data analysis, finding it very stable, scalable, and easy to use. While it's fast, I wish for better data output options like Excel and improved, less problematic word completion.
Data Engineer at ZiMetrics	4.5	We use Athena for serverless querying of our S3 data lake, leveraging Glue and Lake Formation to reduce pre-processing. While it's cost-effective for ad-hoc queries, complex transformations, large joins, and managing cost/concurrency remain challenges.
Solutions Architect & PMO at AS TV Play Baltics/TV3 Group	3.5	I use Amazon Athena to query S3 data via Glue schemas, finding it stable, scalable, and reasonably priced. Setup is straightforward, but integration can be tricky. Overall, I rate it a seven.
Founder & CTO at QuriousBit	4.0	I appreciate Amazon Athena's serverless, cost-effective data lake querying for structured/unstructured data, especially for startups. However, its lack of transaction support, difficult ETL, unified access management, and insufficient query optimization resources are significant drawbacks.
Data Architect at a real estate/law firm with 1,001-5,000 employees	3.5	I used Amazon Athena to load relational databases and transition from Hadoop, appreciating its UI and compatibility with on-premises products. However, a drawback is having to build the metadata ourselves due to its cloud-based nature.
Senior Software Engineer at Tiger Analytics	3.5	Amazon Athena, part of AWS, works well with Glue for querying data, providing easy scalability and stable performance. However, it lacks the simplicity of Palantir as it requires multiple steps to upload and query data, unlike a straightforward drag-and-drop approach.
Head of Data Practice at a tech consulting company with 201-500 employees	2.5	I use Amazon Athena for dashboarding and reporting. It's very stable, but it feels less mature compared to Power BI or Qlik. Dashboard and reporting capabilities could improve, especially in generating statement reports. Overall, development is challenging.
Software Developer at a tech services company with 51-200 employees	4.5	I have been using Amazon Athena to query data across AWS, particularly from Redshift and S3. I value its partitioning, federated queries, and metastore features, though improvements are needed for third-party integrations. Current projects are personal, not monetized yet.
Director & Lead Solutions Architect at Abylle Solutions	4.0	Our company utilizes Amazon Athena for client-specific data integration and analysis. The solution is user-friendly, cost-effective, and serverless. However, it requires a better API for direct data querying and visualization, as we currently use QuickSight for this purpose.

Ciro Baldim Guerra

Sr Analytics Engineer at Itau Unibanco S.A.

Sep 9, 2025

Have struggled with exporting complex data and have disabled code suggestions due to inefficiency

What is our primary use case?

I have experience using Amazon Athena; it is actually the service I have the most experience with.

I use Amazon Athena in my daily work; I was using it just now to get data. We use Amazon Athena with Lake Formation, mainly with Data Mesh resources. Every table is stored in S3, the AWS Glue Data Catalog stores the schemas, and we use Amazon Athena to read that data. It runs on clustered compute and is very fast at retrieving data. I write SQL queries to get information and do all the data analysis work that is necessary. I use Amazon Athena a lot. I also call Amazon Athena from Step Functions, running saved queries or specific queries. I use it from Python with Boto3, calling Amazon Athena as a client. I also use it inside Glue job Python scripts when automating jobs, using the Boto3 and AWS Wrangler libraries for Python to query data in Amazon Athena.
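Calling Athena as a Boto3 client, as described above, usually means starting a query and polling until it finishes. The sketch below illustrates that pattern under stated assumptions: it uses the Boto3 Athena client calls `start_query_execution` and `get_query_execution`, while the function name `run_athena_query`, the database name, and the S3 output location are hypothetical placeholders. The client is passed in as a parameter so the sketch itself opens no AWS connection.

```python
import time

# Athena query states after which polling can stop.
TERMINAL_STATES = {"SUCCEEDED", "FAILED", "CANCELLED"}


def run_athena_query(client, sql, database, output_location,
                     poll_seconds=1.0, max_polls=300):
    """Start a query and poll until it reaches a terminal state.

    `client` is expected to behave like boto3.client("athena").
    Returns a (query_execution_id, final_state) tuple.
    """
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    for _ in range(max_polls):
        status = client.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in TERMINAL_STATES:
            return query_id, state
        time.sleep(poll_seconds)

    raise TimeoutError(f"query {query_id} did not finish after {max_polls} polls")


# Hypothetical usage (requires boto3 and AWS credentials):
#   import boto3
#   athena = boto3.client("athena")
#   run_athena_query(athena, "SELECT 1", "my_database", "s3://my-results-bucket/athena/")
```

Passing the client in also makes the function easy to exercise with a stub in unit tests, which matters in automation such as Glue jobs where a real query costs money.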

What is most valuable?

Using Amazon Athena is easy, very easy.

With Amazon Athena, we don't think much about billing and costs when we are doing analytics; only when we are creating automations do we think about it. In our daily work, nobody pushes us to think about it. I know Amazon Athena is billed by the amount of data read, though I don't know how many cents it costs per gigabyte. I think it's not cheap, but the price that appears in our Cost Explorer is not the price the company pays, because it has discounts. We don't see the final billed price.

What needs improvement?

One point I would mention is the word completion. When I'm writing statements and queries, Amazon Athena tries to help me write the code, and that's very problematic. Sometimes I'm using tables that I use every day, and Amazon Athena doesn't pick up those tables and suggests very improbable ones instead. I have access to more than 30 databases and hundreds of tables. The word completion makes coding slower, and every time I have to press escape to skip the completion. Word completion works very well in other applications, such as VS Code, but in Athena it is ineffective, so I disable it.

For how long have I used the solution?

I have been using Amazon Athena for two years.

What do I think about the stability of the solution?

I have never faced any issues with the stability of Amazon Athena. In two years, maybe one day or two, but it is very uncommon. It's a very stable application.

What do I think about the scalability of the solution?

Amazon Athena scales well. I query tagging data on how users use our applications, which is very large, with millions or billions of rows, and it works very well. This is mainly because Amazon Athena can distribute computation: it creates clusters and computes on them separately, which is very good. There are no problems with scalability.

What other advice do I have?

I rate Amazon Athena a 10 out of 10.

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Ashwin Kolgaonkar

Data Engineer at ZiMetrics

Sep 15, 2025

Have reduced data processing time and cost by directly querying raw files with a centralized catalog

What is our primary use case?

We use Amazon Athena to query data stored in Amazon S3, forming a data lake architecture. For metadata management and schema discovery, we rely on AWS Glue Catalog and Glue Crawlers. For governance—especially row-level and column-level security—we employ AWS Lake Formation.

When query performance suffers—especially with many concurrent queries or unexpectedly slow responses—I engage AWS Support to raise tickets and help with performance tuning.

What is most valuable?

Athena is serverless, so we don’t have to provision or manage compute clusters. We can simply point Athena at our data in S3 and run SQL queries immediately.

It supports a broad set of formats: structured, semi-structured, and unstructured (CSV, JSON, Avro, Parquet, ORC, etc.). Because it follows a schema-on-read model, there's no need for heavy upfront transformation.

The Glue Data Catalog, together with Glue Crawlers, helps to infer schemas and partition structure. This saves time, since for many raw-data sets we don’t have to build schema definitions by hand. Athena uses the catalog to understand data layout, which speeds up query planning and execution.

Partitioning and format optimizations (e.g. leveraging columnar storage, compression) drastically reduce the amount of data scanned. That yields cost savings and faster execution.

It gives visibility into data quality issues: when raw data is malformed or inconsistent, crawling + Athena queries help identify schema drift, missing fields, or unexpected patterns.

What needs improvement?

Complex queries & large joins: Performance degrades compared to analytics-optimized warehouses like Redshift when doing heavy joins, aggregations over very large datasets, or extensive transformations. In those cases, we still need ETL / Spark / Glue jobs.

Transaction / ACID support is limited unless using table formats like Apache Iceberg. If you require updates, merges, deletes at scale, Athena alone may not suffice.

Cost risk: Athena charges per terabyte scanned ($5 per TB by default), so poor query design (e.g. selecting many columns, not using partitions, scanning raw text) can drive up costs.

Concurrency limits: By default, you can run about 20 concurrent queries per account per region. If workload spikes, you may hit throttling or queued queries. Increasing limits requires an AWS request.

Transformation limitations: Athena is built for querying, not for heavy data transformation or streaming data ingestion. For complex transformations, Glue or Spark remain necessary.
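The cost risk above can be made concrete with a little arithmetic. This is a rough sketch that uses the reviewer's figure of $5 per TB scanned; the function name `estimate_scan_cost` is hypothetical, treating 1 TB as 10^12 bytes is an assumption, and real bills may differ because of per-query minimums, discounts, or regional pricing.

```python
def estimate_scan_cost(bytes_scanned: int,
                       price_per_tb: float = 5.0,
                       bytes_per_tb: int = 10**12) -> float:
    """Estimate the on-demand cost of a query from the bytes it scans.

    price_per_tb is the reviewer's quoted figure; bytes_per_tb is an
    assumption about how a terabyte is counted for billing.
    """
    if bytes_scanned < 0:
        raise ValueError("bytes_scanned must be non-negative")
    return bytes_scanned / bytes_per_tb * price_per_tb


# Illustrative comparison: a full scan of a 1 TB raw table versus the
# same question answered from 50 GB after partition and column pruning.
full_scan = estimate_scan_cost(10**12)
pruned_scan = estimate_scan_cost(50 * 10**9)
```

Under these assumptions the full scan comes to $5 and the pruned scan to about $0.25, which is why query design has such a direct effect on cost.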

What do I think about the stability of the solution?

Athena has been reliable in our environment. With Lake Formation, access control and security have performed well. We have rarely had long outages, and error conditions are mostly due to misconfigurations or limits (e.g. exceeding concurrency), rather than service instability.

What do I think about the scalability of the solution?

Because it’s serverless, we automatically scale according to demand: when queries increase in volume or size, Athena manages compute behind the scenes.

With proper partitioning, efficient formats (Parquet/ORC), and good metadata (Glue Catalog), it scales to petabyte-scale datasets without manual infrastructure scaling.

Some limits are in place (e.g. concurrency, API quotas, query string length), but these are generally adjustable or manageable with planning.
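One way to plan around a concurrency quota is to cap the number of queries in flight on the client side. The sketch below shows that idea with a thread pool; it is an illustration, not an Athena feature. The default of 20 mirrors the figure quoted in this review (actual quotas vary by account and region), and `run_with_limit` and the task callables are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def run_with_limit(tasks, max_concurrent=20):
    """Run zero-argument callables with at most `max_concurrent` at once.

    Each task would typically submit one query and wait for it.
    Results are returned in the same order as `tasks`.
    """
    if max_concurrent < 1:
        raise ValueError("max_concurrent must be at least 1")
    # The pool size is the cap: extra tasks wait in the pool's queue
    # instead of being sent to the service and throttled there.
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        futures = [pool.submit(task) for task in tasks]
        return [future.result() for future in futures]
```

Keeping the cap somewhat below the account quota leaves headroom for interactive users who share the same account and region.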

Which solution did I use previously and why did I switch?

Previously, our pipeline involved cleaning, transforming, and loading data from S3 into relational or NoSQL databases (e.g. DynamoDB) before analysis. That incurred overhead in time, compute, storage, and in managing ETL pipelines. Switching to Athena allowed us to query raw data in place, shortening turnaround time and reducing resource usage.

How was the initial setup?

Created one or more S3 buckets: one for raw data, one for query results.

Organized raw data in partitioned folder structures (e.g. year=YYYY/month=MM/day=DD/) to support efficient querying.

Defined schema metadata in the Glue Data Catalog, either by manually creating tables or by running Glue Crawlers to infer schema and detect partitions.

Assigned the appropriate IAM permissions and Lake Formation access policies so that users have necessary rights to run queries, read data, etc.

Configured the result/output location for Athena queries in S3 (so that query results are stored correctly, and so queries don’t fail due to missing output path).
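The partitioned folder layout from the setup steps above can be generated consistently with a small helper. This is a minimal sketch of the `year=YYYY/month=MM/day=DD/` convention mentioned in the review; the function name `partition_prefix` and the `raw/events/` base path are hypothetical.

```python
from datetime import date


def partition_prefix(day: date, base: str = "") -> str:
    """Build a Hive-style partition prefix for one day of data.

    Zero-padding keeps prefixes sortable and consistent, so every
    writer produces the same folder names for the same date.
    """
    return f"{base}year={day.year:04d}/month={day.month:02d}/day={day.day:02d}/"


# Hypothetical usage: the object key prefix for data landed on 2025-09-15.
prefix = partition_prefix(date(2025, 9, 15), base="raw/events/")
# prefix == "raw/events/year=2025/month=09/day=15/"
```

Writers that share one helper like this avoid mixing `month=9` and `month=09` folders, which would otherwise show up as separate partitions.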

What was our ROI?

We significantly reduced time spent on pre-processing and transformations, because many reports/analyses can now pull data directly from raw files.

Cost savings came from reducing compute and storage overhead, avoiding maintaining separate databases just for analytics staging.

Query costs dropped when we applied best practices (partitioning, using columnar formats, compression).

Which other solutions did I evaluate?

DuckDB: evaluated for querying files locally or via cloud storage. While it was useful for lightweight or embedded analytics, it did not scale as smoothly for large, concurrent cloud workloads, or integrate with S3 and AWS governance/tools in the way Athena does.

Possibly evaluated some data-warehouse services (Redshift, Snowflake) for comparison, but found Athena struck the best balance for ad-hoc querying, minimal management, proper cost/performance tradeoff.

What other advice do I have?

Always use columnar formats (Parquet / ORC) + compression (Snappy or similar) to reduce scanned data volumes.

Partition your data on logical fields (e.g. date, region) to limit scope of queries.

Avoid SELECT * unless necessary; query only the columns you need.

Monitor Athena usage via CloudWatch: watch for data scanned, query runtime, concurrency, etc. Use alerts when you're nearing service quotas.

For high throughput or many users, consider using capacity reservations or separating workloads (workgroups) so that one heavy workload doesn’t degrade performance of others.

Use Athena’s built-in EXPLAIN or query plan features to profile and optimize queries.
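Several of the tips above (explicit columns, partition filters, and EXPLAIN) can be combined when queries are generated from code. The sketch below builds such a query string under stated assumptions: the table, column, and partition names are hypothetical, and the inputs are treated as trusted values, so this is not safe for user-supplied input.

```python
def build_query(table, columns, partition_filters):
    """Build a SELECT that names its columns and filters on partitions.

    partition_filters maps partition column names to string values,
    for example {"year": "2025", "month": "09"}.
    """
    if not columns:
        # Forcing an explicit column list avoids an accidental SELECT *.
        raise ValueError("list the columns you need instead of SELECT *")
    sql = f"SELECT {', '.join(columns)} FROM {table}"
    predicates = [f"{name} = '{value}'" for name, value in partition_filters.items()]
    if predicates:
        sql += " WHERE " + " AND ".join(predicates)
    return sql


def explain(sql):
    """Prefix a statement with EXPLAIN to inspect its plan before running it."""
    return "EXPLAIN " + sql


# Hypothetical usage with an "events" table partitioned by year and month.
query = build_query("events", ["user_id", "event_type"], {"year": "2025", "month": "09"})
plan_query = explain(query)
```

Running the `EXPLAIN` variant first is a cheap way to check that the partition filter is actually applied before paying for the scan.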

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Wojciech Doganowski

Solutions Architect & PMO at AS TV Play Baltics/TV3 Group

Sep 29, 2025

Have used it to query cloud storage effectively and found integration setup could be more intuitive

What is our primary use case?

The typical use case for Amazon Athena is that we have data in a data lake, and when we need to query that data before it reaches the data warehouse proper (we were using Snowflake), we use Amazon Athena. That's the main use case: verifying data on the data lake by querying it directly on S3.

What is most valuable?

The best feature of Amazon Athena is that we can use Glue to build the schema from the data and then we can query the data directly on S3. This is the main feature which is most important for us, to be able to create schema from data on S3 and then query the data on S3.

What needs improvement?

I don't have any specific answer on how Amazon Athena can be improved. This integration is more on the Glue side rather than on Amazon Athena, I would guess. Nothing comes to my mind here.

In terms of its integration capabilities, I would say it's not straightforward. It works, but it's a little bit tricky.

For how long have I used the solution?

I have worked with this solution for about two years.

What do I think about the stability of the solution?

It is a stable solution.

What do I think about the scalability of the solution?

Amazon Athena is a scalable solution.

How are customer service and support?

I would rate technical support from Amazon an eight. We mainly use documentation and the documentation is quite good.

How would you rate customer service and support?

Positive

How was the initial setup?

Once Glue is set up correctly, the initial setup of Amazon Athena is quite straightforward.

What was our ROI?

We didn't really calculate an ROI on Amazon Athena because it would be hard to determine.

What other advice do I have?

I have experience integrating Amazon Athena with AWS Glue.

I think the pricing of Amazon Athena is quite reasonable as we use it in pay-as-you-go mode.

On a scale from one to ten, I rate Amazon Athena a seven.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Shubham-Joshi

Founder & CTO at QuriousBit

Oct 14, 2025

Has significantly improved data analytics with unstructured data and seamless querying from cloud storage

What is our primary use case?

Amazon Athena is mostly used for querying data. In teams with analysts and project managers, there are many AWS services helping to understand data, such as Redshift and Amazon Athena. The main use case is to understand, visualize, analyze, and query the data on the lake.

A significant use case for Amazon Athena is its ability to query unstructured data. For example, any data kept in S3 can be read directly and queried.

We have used Amazon Athena integration with AWS Glue. We have integrated AWS Glue with Amazon Athena with some Iceberg tables. Iceberg is currently the de-facto standard that Amazon Athena supports. We have Glue Catalog on top of that, and we query those Iceberg tables with Amazon Athena.

Amazon Athena's ability to query structured and unstructured data has been beneficial. For a startup, procuring a database such as Redshift is challenging to maintain. You have to run jobs and hire engineers to maintain the database. To remove these hassles, a startup can easily put the data on cloud and query it with Amazon Athena. It reduces operational overhead significantly. If your data scale is not extensive, you can query very cost-effectively because Amazon Athena's pricing is approximately $5 per TB of scan.

What is most valuable?

Amazon Athena is very compatible with data lake concepts. You can put all your data in S3, and there are different data formats. In the last few years, many open data formats have emerged. Amazon Athena supports formats such as Parquet, CSV, and Iceberg natively. It also supports Apache Hudi, created at Uber, and Avro. It has extensive support for different types of data structures.

Amazon Athena is cost-effective and performs efficiently. Being serverless means you don't need to provision or manage any compute resources, and you don't have to plan capacity around how long queries will take, which makes it quite scalable.

What needs improvement?

Amazon Athena is based on Trino, an open-source query engine. When a company wants to run ETL on Amazon Athena, it cannot do so easily. For instance, if you want to delete something by primary key or perform CRUD operations with Step Functions to automate processes, these operations are not straightforward in Amazon Athena.

Transaction support is one of the biggest missing features. If you are running multiple statements, such as a delete followed by an insert, and something goes wrong during insertion, the deletion should be reverted, but that doesn't happen. We have to implement workarounds, whereas these capabilities are available in Redshift.

While Amazon Athena has notebook support where analysts can write their work, scheduling these notebooks is not user-friendly. If an analyst wants to schedule a notebook to trigger at a specific time, they need a developer's assistance.

There should be unified access management in Amazon Athena, which is not readily available. Though they have Lake Formation and other features, there isn't one place to manage access. For example, restricting access to specific columns for particular users requires alternative approaches.

The service is only available on-demand from AWS, not through the Marketplace.

For how long have I used the solution?

I have been working with Amazon Athena for approximately five years, including usage in my previous organization.

How are customer service and support?

I contacted Amazon support regarding Amazon Athena long ago. After using it for so long, I have discovered that I know more than some support staff. I had an issue with transactions about a year ago, and I found that their support staff sometimes lacks extensive experience. While they have knowledge, I have had the opportunity to work with the infrastructure hands-on. The support people AWS provides often don't have much context, though senior staff members might have more expertise.

How would you rate customer service and support?

Neutral

Which other solutions did I evaluate?

I have evaluated various solutions including Databricks, Snowflake, and Google BigQuery. For any startup operating in one cloud, it is better to remain in that ecosystem rather than managing multiple clouds as it complicates the infrastructure. Since we were in AWS, we chose to use their available services instead of moving outside, following the industry trend of consolidation.

What other advice do I have?

We tried Amazon QuickSight with Amazon Athena for visualization, but we are using Superset on top of that.

The ecosystem is straightforward and excellent to get started with. If organizations don't want to spend time managing databases and prefer focusing on their business terms to move quickly, Amazon Athena is a good solution.

Amazon Athena can become costly if analysts lack knowledge of partition keys and querying tables efficiently. Since the pricing is $5 per TB, large queries on big datasets can result in substantial bills. There aren't many resources available to help optimize queries in Amazon Athena, with only a few blogs available. They should provide more information about query optimization.

I rate Amazon Athena an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Denise Smallwood

Data Architect at a real estate/law firm with 1,001-5,000 employees

Jul 26, 2023

A compatible tool with a great UI, but requires you to build the metadata yourself

What is our primary use case?

I used the solution to load relational databases.

What is most valuable?

We used Athena to get off Hadoop, and Athena is very much like Hadoop in its capabilities. Athena has a really good UI and is very compatible with on-prem products.

What needs improvement?

You have to build out the metadata yourself because of the nature of the cloud.

For how long have I used the solution?

I’ve used Amazon Athena for two and a half years, but I stopped using it seven months ago. I used the version on the cloud.

What do I think about the scalability of the solution?

The solution is scalable.

How was the initial setup?

The solution was straightforward to deploy, but I’ve been doing this for a long time. I don't know if someone new to the solution would struggle or not.

What's my experience with pricing, setup cost, and licensing?

Athena is very inexpensive for being a cloud tool.

What other advice do I have?

Athena only has a relational database, so using it depends on your use case. I rate Amazon Athena a seven out of ten.

Which deployment model are you using for this solution?

Public Cloud

Manilal Kasera

Senior Software Engineer at Tiger Analytics

Nov 22, 2022

A great AWS application that is easy to set up and simple to expand

What is our primary use case?

We use both Athena and Glue. If you have some data and want to query it, you can use Glue to get the schema and Athena to query the data. You need both of them to work.

What is most valuable?

It's a great AWS application. Amazon Athena is just one of the features in a bigger cloud infrastructure, in the AWS platform. We have Athena and Glue as some of the features we really like working with. We also use EMR, EC2, and S3. There's a lot out there to take advantage of.

The storage is simple.

It's easy to set up the product.

The solution is stable.

We can scale it easily.

What needs improvement?

If you compare it with Palantir, the ability to take some data and quickly have a look at it is not available in Amazon Cloud. We'd like it better if, when you have some data, you could easily query it and read it at a glance. We'd like it to be almost a drag-and-drop situation. In Amazon Cloud, you first have to upload the data into S3, and for that you have to create a bucket. Then you have to create a Glue service, which gets you the schema. That schema then creates a database and a table. After that, you have to go to Athena to query the data. It's a three-step process in Amazon Cloud. In Palantir, you just have to drag and drop.
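The three steps described above can be sketched roughly as follows. This is a minimal illustration, not the reviewer's actual setup. It assumes the three arguments behave like boto3 clients (for example `boto3.client("s3")`, `boto3.client("glue")`, `boto3.client("athena")`), and that the bucket and a Glue crawler already exist. The function name and all parameter names are hypothetical.

```python
import time


def load_and_query(s3, glue, athena, *, local_path, bucket, key, crawler_name,
                   database, sql, output_location, poll_seconds=5.0):
    """Upload a file, crawl it with Glue, then start an Athena query.

    `s3`, `glue`, and `athena` are expected to behave like boto3 clients.
    The bucket and the crawler are assumed to exist already.
    """
    # Step 1: put the data in S3.
    s3.upload_file(local_path, bucket, key)

    # Step 2: run the Glue crawler so the schema lands in the Data Catalog,
    # then wait until the crawler reports that it is idle again.
    glue.start_crawler(Name=crawler_name)
    while glue.get_crawler(Name=crawler_name)["Crawler"]["State"] != "READY":
        time.sleep(poll_seconds)

    # Step 3: query the catalogued table from Athena.
    response = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    return response["QueryExecutionId"]
```

Passing the clients in as arguments keeps the sketch easy to try with stand-in objects before pointing it at a real AWS account.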

For how long have I used the solution?

I've been using the solution for four months.

What do I think about the stability of the solution?

It's a stable solution. It's reliable. There are no bugs or glitches. It doesn't crash or freeze.

What do I think about the scalability of the solution?

As a cloud solution, it's quite simple to scale.

Almost our entire team uses the product. That's about 30 to 40 people right now. They are mainly data engineers and data analysts.

How are customer service and support?

I've never needed to contact technical support.

Which solution did I use previously and why did I switch?

I've also used Palantir.

How was the initial setup?

The initial setup process is easy. I'd rate it an eight out of ten in terms of ease of implementation. It does take some time to deploy.

What about the implementation team?

We were able to set up the solution ourselves.

What's my experience with pricing, setup cost, and licensing?

The costs related to the solution depend on the data and how much of it there is. They may lay out costs on their website.

What other advice do I have?

We're an Amazon partner.

Before getting started, I'd advise new users that you must upload some data into an S3 bucket. You have to learn the basics of Amazon Cloud first. Then you have to know about some services like Amazon S3, AWS Glue, and Athena. You also have to have some idea about IAM roles. It's important to get a handle on the solution beforehand.

I'd rate the solution seven out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

MusaMongwe

Head of Data Practice at a tech consulting company with 201-500 employees

Jan 23, 2023

A stable serverless, interactive analytics service, but its dashboarding and reporting capabilities could be better

What is our primary use case?

We use Amazon Athena as a dashboarding and reporting tool.

What is most valuable?

Amazon Athena is very stable. I never had any issues with it. The dashboarding tool is okay.

What needs improvement?

I think it would be better if the product were more mature. It's still a young product compared to Power BI or Qlik. I find that development is a bit difficult, but it might be because I'm used to other tools. The dashboarding capabilities could be better. The reporting and statement generation could be better. I couldn't technically initiate picture-perfect reporting, for example, to send out statements every month for banking customers.

For how long have I used the solution?

I have been using Amazon Athena for about six to eight months.

What do I think about the stability of the solution?

Amazon Athena is very stable.

On a scale from one to ten, I would give its stability a ten.

What do I think about the scalability of the solution?

Amazon Athena is a scalable product.

On a scale from one to ten, I would give scalability an eight.

How was the initial setup?

The initial setup was straightforward but with some challenges. Like all other Amazon products, the permission setup is a granular, trial-and-error process. For example, you might think you have set up Athena, then try to connect to something, and it says you don't have the permission needed to complete the process. It's not an intuitive process to set up the permissions.
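As an illustration of how granular these permissions are, here is a sketch of an IAM policy document for running Athena queries, built as a Python dictionary. The action names are real IAM actions, but the selection is an assumption about a typical minimal setup and is unlikely to be complete for every environment. The function name and the bucket names are hypothetical.

```python
import json


def athena_query_policy(data_bucket: str, results_bucket: str) -> dict:
    """Build an example IAM policy for querying S3 data through Athena."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {   # Start queries and read their status and results.
                "Effect": "Allow",
                "Action": [
                    "athena:StartQueryExecution",
                    "athena:GetQueryExecution",
                    "athena:GetQueryResults",
                ],
                "Resource": "*",
            },
            {   # Read table metadata from the Glue Data Catalog.
                "Effect": "Allow",
                "Action": ["glue:GetDatabase", "glue:GetTable", "glue:GetPartitions"],
                "Resource": "*",
            },
            {   # Read the source data.
                "Effect": "Allow",
                "Action": ["s3:GetObject", "s3:ListBucket", "s3:GetBucketLocation"],
                "Resource": [
                    f"arn:aws:s3:::{data_bucket}",
                    f"arn:aws:s3:::{data_bucket}/*",
                ],
            },
            {   # Write and read query results.
                "Effect": "Allow",
                "Action": ["s3:GetObject", "s3:PutObject", "s3:ListBucket",
                           "s3:GetBucketLocation"],
                "Resource": [
                    f"arn:aws:s3:::{results_bucket}",
                    f"arn:aws:s3:::{results_bucket}/*",
                ],
            },
        ],
    }


# Hypothetical bucket names, for illustration only.
print(json.dumps(athena_query_policy("my-data-bucket", "my-results-bucket"), indent=2))
```

A missing entry in any one of the four statements tends to surface as the kind of "you don't have this permission" error described above, which is why the setup can feel like trial and error. In a real account the `"Resource": "*"` entries would usually be narrowed.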

On a scale from one to ten, I would give the initial setup a six.

What's my experience with pricing, setup cost, and licensing?

It doesn't cost much if you are already part of the AWS ecosystem.

On a scale from one to ten, I would give their pricing a seven.

What other advice do I have?

On a scale from one to ten, I would give Amazon Athena a five.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Gift Malahlela

Software Developer at a tech services company with 51-200 employees

Jan 20, 2023

Fast solution with data partitioning, federated queries, and per-query billing

What is our primary use case?

I have been using Amazon Athena to query across the AWS platform, from my Redshift warehouse and S3 storage.

How has it helped my organization?

I have found that Amazon Athena is very fast, especially when it comes to making queries. It also helps that you can connect it to other services, such as with the ability to create a federated query to query other databases.

What is most valuable?

One of the most valuable features is the ability to partition your databases. I also like the federated query functionality, for cases when you have to query outside your S3 storage, or even completely outside of the AWS platform.

There is also a useful feature called a metastore, which lets you access all the data you have shared across all your AWS accounts.

Another perk is that Athena allows you to optimize your queries.

What needs improvement?

One improvement I can suggest is that Athena needs to work better with third parties. For example, the process of querying a Microsoft SQL warehouse could be improved. When querying outside of AWS, you can use federated queries, but it's not always easy to do so.

For how long have I used the solution?

I have been using Amazon Athena for about two years.

What do I think about the stability of the solution?

I have not encountered any stability issues.

What do I think about the scalability of the solution?

In terms of scalability, Athena doesn't store your results, but the objects that you are querying get stored in S3, so for me, this makes things easier and I would recommend it over any other platform.

How are customer service and support?

I have not had contact with AWS technical support yet.

Which solution did I use previously and why did I switch?

I have only been using AWS for the current project that I'm working on. I am willing to integrate other tools as I go along, but it all depends on whether there is a suitable tool that already exists in AWS.

How was the initial setup?

The setup is easy and I would rate it a nine out of ten. It only took me a couple of minutes because I was following a manual.

What about the implementation team?

I deployed it by myself and didn't need to hire anyone to help, mainly because what I'm working on right now is a startup solution.

What was our ROI?

Currently, I can see that my projects using Athena are working well, but because they are personal projects I am not receiving any money from them, nor have I approached any clients about them yet. They are just for me at the moment, as I simply saw a problem that I needed to solve.

What's my experience with pricing, setup cost, and licensing?

I am happy with what they are charging and how they charge it, especially because they charge you per query, and not per series.

I would rate their pricing a nine out of ten.

What other advice do I have?

I think AWS Athena is a good service if you are using S3 as your file storage. And if you're not using S3, then maybe you should think about pairing Athena with S3 instead of what you are currently using.

I would rate AWS Athena a nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Rajesh Nagaral

Director & Lead Solutions Architect at Abylle Solutions

Dec 19, 2022

Very easy to use and integrations are handled smoothly

What is our primary use case?

Our company uses the solution for a client-specific requirement to conduct data integration and analysis.

What is most valuable?

The solution is very easy to use and integrations are very smooth.

The serverless model only charges you for data that is consumed.

What needs improvement?

The solution should include a better API for query services so that data can be dumped and queried directly in customers' products.

The API should include some sort of data visualization that can be plugged into applications. We had to use QuickSight to help us with the visualization.
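For context, querying Athena from an application today roughly means starting a query, polling until it finishes, and then reshaping the result rows yourself. The sketch below illustrates that flow. It assumes `athena` behaves like a boto3 Athena client (`boto3.client("athena")`). The function name `run_query` and its parameters are hypothetical, and only the first page of results is read.

```python
import time


def run_query(athena, sql, database, output_location, poll_seconds=1.0):
    """Run a SELECT through an Athena-like client and return rows as dicts."""
    started = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Poll until the query reaches a terminal state.
    while True:
        status = athena.get_query_execution(QueryExecutionId=query_id)["QueryExecution"]["Status"]
        if status["State"] in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if status["State"] != "SUCCEEDED":
        reason = status.get("StateChangeReason", "no reason given")
        raise RuntimeError(f"query {query_id} ended as {status['State']}: {reason}")

    # First page only; for SELECT queries the first row holds the column names.
    result = athena.get_query_results(QueryExecutionId=query_id)
    rows = [[cell.get("VarCharValue") for cell in row["Data"]]
            for row in result["ResultSet"]["Rows"]]
    if not rows:
        return []
    header, *body = rows
    return [dict(zip(header, values)) for values in body]
```

The amount of plumbing needed just to get a list of records back gives a sense of why a simpler query API, or a pluggable visualization layer on top of it, would be welcome.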

For how long have I used the solution?

I have been using the solution for ten months.

What do I think about the scalability of the solution?

The solution is scalable based on data consumption.

How was the initial setup?

Our technical team handled setup so I don't have specific details.

What about the implementation team?

We implemented the solution in-house.

What's my experience with pricing, setup cost, and licensing?

The solution operates on a serverless model so you only pay for data that you consume.

Which other solutions did I evaluate?

Our client required that we use the solution for their project.

What other advice do I have?

I rate the solution an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Title	Rating	Mindshare	Recommending	Interviews
Elastic Search	4.1	17.2%	98%	99
Amazon OpenSearch Service	3.8	11.5%	92%	13