

Elastic Search and AWS Glue compete in the data processing and analytics category. Elastic Search holds an advantage in log monitoring and data visualization, while AWS Glue is superior in data processing and integration with AWS services.
Features: Elastic Search stands out with its integration of Elasticsearch, Logstash, and Kibana for robust log monitoring. It provides comprehensive search functionality and support for key-value pair storage. Its flexibility is enhanced with anomaly detection and multi-language search queries. AWS Glue offers seamless integration with AWS services, simple ETL processes using Spark, and automation features like Glue Studio for minimal coding. It excels in data cataloging and real-time data streaming.
Room for Improvement: Elastic Search could improve its security in the open-source version, enhance machine learning and alerting features, and refine its licensing model. AWS Glue faces high costs, complex integration issues, and limited native services. Enhancements are needed for advanced transformations, multi-cloud capabilities, and a more user-friendly experience.
Ease of Deployment and Customer Service: Elastic Search offers flexibility in on-premises deployments but relies on third-party consulting for direct support. AWS Glue benefits from wide adoption in the AWS ecosystem but lacks substantial support for customized implementations, relying on AWS's technical team. Elastic Search provides community-driven resources, whereas AWS Glue integrates seamlessly within AWS environments.
Pricing and ROI: Elastic Search is open-source with cost-effective options, though backend management can be high for enterprise features. It serves as an alternative to more expensive solutions with variable ROI based on implementation. AWS Glue's pay-as-you-go model emphasizes scalability but can be costly with extensive usage, aligning with modern cloud practices and offering perceived value for streamlined ETL processes.
I advocate using Glue in such cases.
We have not purchased any licensed products, and our use of Elastic Search is purely open-source, contributing positively to our ROI.
It is stable, and we do not encounter critical issues like server downtime, which could result in data loss.
The main benefits observed from using Elastic Search include improvements in operational efficiency, along with cost, time, and resource savings.
Upgrades occur every four months, and new developments coincide with version updates.
For complex Glue-related problems such as job failures or permission issues, their documentation is good, but having direct access to support helps cut down troubleshooting time significantly.
For P1 tickets, they provide very immediate quick responses and join calls to support and troubleshoot the issue accordingly.
The customer support for Elastic Search is one of the best I have ever tried.
They have always been really responsible and responsive to my requests.
It is beneficial to upgrade jobs, and we conduct extensive testing in development before migrating to production.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
We can search through that document quite easily, sometimes in 7 milliseconds, sometimes one or two milliseconds.
Performance tests involving one million requests at once, we encountered issues with shards and nodes not upscaling as needed, leading to crashes and minimal data loss.
I would rate its scalability a ten.
AWS Glue is highly stable, and I would rate its stability as nine.
The data transfer sometimes exceeded the bandwidth limits without proper notification, which caused issues.
The stability of Elasticsearch was very high.
When you put one keyword, everything related to that keyword in your ecosystem will showcase all the results.
Migrating jobs from version 3.0 to 4.0 can present compatibility issues.
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
A more user-friendly and simpler process would help speed up the deployment process.
From a technical point of view, there are no significant issues recalled as Elastic Search has been absolutely awesome for this use case and covers 100% of the needs.
If I need to parse one million records saved into Elastic Search, it becomes a nightmare because I need to do the pagination, and it is very problematic in that regard.
Observability features like search latency, indexing rate, and maybe rejected requests should be added to make the platform more reliable and accessible for everyone.
Costing depends on resource usage, and cost optimization may involve redesigning jobs for flexibility.
AWS charges based on runtime, which can be quite pricey.
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
On the AWS side, it is very expensive because they charge based on query basis or how much data is transferred in and out, making it very expensive.
Having the hosted solution and not having to pay for essentially a DevOps person on staff to manage makes it affordable.
You can host it on-premises, which would incur zero cost, or take it as a SaaS-based service, where the expenses remain minimal.
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models.
AWS Glue has reduced efforts by 60%, which is the main benefit.
Elastic Search makes handling large data volumes efficient and supports complex search operations.
The most valuable feature of Elasticsearch was the quick search capability, allowing us to search by any criteria needed.
The speed with which Elastic Search is able to search through all of the documents we place into it is quite remarkable, as we search through 65 billion documents in less than a second in most cases, on a constant consistent basis.
| Product | Mindshare (%) |
|---|---|
| AWS Glue | 7.6% |
| Elastic Search | 1.7% |
| Other | 90.7% |
| Company Size | Count |
|---|---|
| Small Business | 11 |
| Midsize Enterprise | 6 |
| Large Enterprise | 32 |
| Company Size | Count |
|---|---|
| Small Business | 39 |
| Midsize Enterprise | 12 |
| Large Enterprise | 47 |
AWS Glue is a serverless data integration service offering seamless integration with AWS services like S3, Redshift, and Athena. Known for its flexibility with data formats and automation of ETL tasks, AWS Glue enhances data management and transformation.
AWS Glue facilitates seamless data extraction, transformation, and loading for businesses, integrating with key AWS services, allowing efficient data pipeline automation. It's valued for a user-friendly GUI, scalability, and cost-effectiveness, supporting PySpark for complex datasets and includes a robust data catalog, real-time backup capabilities, and code generation. Despite its strengths, improvements are needed in documentation, training, and broader programming language support. Users face challenges with its complex interface and integration with non-AWS products, driving demand for enhancements in its usability and performance.
What are AWS Glue's most important features?Businesses leverage AWS Glue in industries for ETL processes, data integration, and transformation. It is used to optimize data lakes or warehouses integration, enhancing data cataloging and real-time integration. Its serverless feature enables efficient data processing in sectors like finance and healthcare, where handling complex data-intensive tasks is crucial.
Elasticsearch is a prominent open-source search and analytics engine known for its scalability, reliability, and straightforward management. It's a favored choice among enterprises for real-time data search, analysis, and visualization. Open-source Elasticsearch is free, offering a comprehensive feature set and scalability. It allows full control over deployments but requires managing and maintaining the infrastructure. On the other hand, Elastic Cloud provides a managed service with features like automated provisioning, high availability, security, and global reach.
Elasticsearch excels in handling time-sensitive data and complex search requirements across large datasets. Its scalability allows it to handle growing data volumes efficiently, maintaining high performance and fast response times. Integrated with Kibana, Elasticsearch enables powerful data visualization, providing real-time insights crucial for data-driven decision-making.
Elastic Cloud reduces operational overhead and improves scalability and performance, though it comes with associated costs. It is available on your preferred cloud provider — AWS, Azure, or Google Cloud. Customers who want to manage the software themselves, whether on public, private, or hybrid cloud, can download the Elastic Stack.
At its core, Elasticsearch is renowned for its full-text search capabilities, capable of performing complex queries and supporting features like fuzzy matching and auto-complete.
Peer reviews from various professionals highlight its strengths and weaknesses. Pros include its detection and correlation features, flexibility, cloud-readiness, extensibility, and efficient search capabilities. However, users have noted challenges like steep learning curves, data analysis limitations, and integration complexities. The platform is generally viewed as stable and scalable, with varying degrees of satisfaction regarding its usability and feature set.
In summary, Elasticsearch stands out for its high-speed search, scalability, and versatile analytics, making it a go-to solution for organizations managing large datasets. Its adaptability to different enterprise needs, robust community support, and continuous development keep it at the forefront of enterprise search and analytics solutions. However, potential users should be aware of its learning curve and the need for skilled personnel for optimization.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.