

Elastic Search and AWS Glue compete in the data management and processing space. Elastic Search stands out for its search capabilities, while AWS Glue offers strong data integration within the AWS ecosystem.
Features: Elastic Search offers high search capabilities, efficient sharding, and indexing, making it user-friendly and highly available. AWS Glue provides robust data integration services, effective for data transformations and workflows, and offers seamless integration within the AWS ecosystem.
Room for Improvement: Elastic Search faces challenges with mapping conflicts and complex Kibana dashboard configurations, requiring enhancements for semantic searches and AI features. AWS Glue has high start-up times and struggles with integration for certain non-AWS databases, needing improvements in user-friendliness and cost scalability.
Ease of Deployment and Customer Service: Elastic Search supports on-premises, cloud, and hybrid deployments, relying on community support and forums. AWS Glue is widely used in cloud environments with strong AWS integration, offering robust technical support though at a potentially higher cost.
Pricing and ROI: Elastic Search is open-source, initially free but potentially costly with enterprise features, realizing ROI through operational efficiency. AWS Glue's pay-as-you-go model offers cost-effectiveness with serverless architecture, but costs can escalate for high-volume operations.
I advocate using Glue in such cases.
We have not purchased any licensed products, and our use of Elastic Search is purely open-source, contributing positively to our ROI.
It is stable, and we do not encounter critical issues like server downtime, which could result in data loss.
The main benefits observed from using Elastic Search include improvements in operational efficiency, along with cost, time, and resource savings.
Upgrades occur every four months, and new developments coincide with version updates.
For complex Glue-related problems such as job failures or permission issues, their documentation is good, but having direct access to support helps cut down troubleshooting time significantly.
For P1 tickets, they provide very immediate quick responses and join calls to support and troubleshoot the issue accordingly.
The customer support for Elastic Search is one of the best I have ever tried.
They have always been really responsible and responsive to my requests.
It is beneficial to upgrade jobs, and we conduct extensive testing in development before migrating to production.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
We can search through that document quite easily, sometimes in 7 milliseconds, sometimes one or two milliseconds.
Performance tests involving one million requests at once, we encountered issues with shards and nodes not upscaling as needed, leading to crashes and minimal data loss.
I would rate its scalability a ten.
AWS Glue is highly stable, and I would rate its stability as nine.
The data transfer sometimes exceeded the bandwidth limits without proper notification, which caused issues.
The stability of Elasticsearch was very high.
When you put one keyword, everything related to that keyword in your ecosystem will showcase all the results.
Migrating jobs from version 3.0 to 4.0 can present compatibility issues.
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
A more user-friendly and simpler process would help speed up the deployment process.
From a technical point of view, there are no significant issues recalled as Elastic Search has been absolutely awesome for this use case and covers 100% of the needs.
If I need to parse one million records saved into Elastic Search, it becomes a nightmare because I need to do the pagination, and it is very problematic in that regard.
Observability features like search latency, indexing rate, and maybe rejected requests should be added to make the platform more reliable and accessible for everyone.
Costing depends on resource usage, and cost optimization may involve redesigning jobs for flexibility.
AWS charges based on runtime, which can be quite pricey.
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
On the AWS side, it is very expensive because they charge based on query basis or how much data is transferred in and out, making it very expensive.
Having the hosted solution and not having to pay for essentially a DevOps person on staff to manage makes it affordable.
You can host it on-premises, which would incur zero cost, or take it as a SaaS-based service, where the expenses remain minimal.
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models.
AWS Glue has reduced efforts by 60%, which is the main benefit.
Elastic Search makes handling large data volumes efficient and supports complex search operations.
The most valuable feature of Elasticsearch was the quick search capability, allowing us to search by any criteria needed.
The speed with which Elastic Search is able to search through all of the documents we place into it is quite remarkable, as we search through 65 billion documents in less than a second in most cases, on a constant consistent basis.
| Product | Mindshare (%) |
|---|---|
| AWS Glue | 7.6% |
| Elastic Search | 1.7% |
| Other | 90.7% |
| Company Size | Count |
|---|---|
| Small Business | 11 |
| Midsize Enterprise | 6 |
| Large Enterprise | 34 |
| Company Size | Count |
|---|---|
| Small Business | 40 |
| Midsize Enterprise | 12 |
| Large Enterprise | 49 |
AWS Glue is a serverless data integration service offering seamless integration with AWS services like S3, Redshift, and Athena. Known for its flexibility with data formats and automation of ETL tasks, AWS Glue enhances data management and transformation.
AWS Glue facilitates seamless data extraction, transformation, and loading for businesses, integrating with key AWS services, allowing efficient data pipeline automation. It's valued for a user-friendly GUI, scalability, and cost-effectiveness, supporting PySpark for complex datasets and includes a robust data catalog, real-time backup capabilities, and code generation. Despite its strengths, improvements are needed in documentation, training, and broader programming language support. Users face challenges with its complex interface and integration with non-AWS products, driving demand for enhancements in its usability and performance.
What are AWS Glue's most important features?Businesses leverage AWS Glue in industries for ETL processes, data integration, and transformation. It is used to optimize data lakes or warehouses integration, enhancing data cataloging and real-time integration. Its serverless feature enables efficient data processing in sectors like finance and healthcare, where handling complex data-intensive tasks is crucial.
Elasticsearch is a prominent open-source search and analytics engine known for its scalability, reliability, and straightforward management. It's a favored choice among enterprises for real-time data search, analysis, and visualization. Open-source Elasticsearch is free, offering a comprehensive feature set and scalability. It allows full control over deployments but requires managing and maintaining the infrastructure. On the other hand, Elastic Cloud provides a managed service with features like automated provisioning, high availability, security, and global reach.
Elasticsearch excels in handling time-sensitive data and complex search requirements across large datasets. Its scalability allows it to handle growing data volumes efficiently, maintaining high performance and fast response times. Integrated with Kibana, Elasticsearch enables powerful data visualization, providing real-time insights crucial for data-driven decision-making.
Elastic Cloud reduces operational overhead and improves scalability and performance, though it comes with associated costs. It is available on your preferred cloud provider — AWS, Azure, or Google Cloud. Customers who want to manage the software themselves, whether on public, private, or hybrid cloud, can download the Elastic Stack.
At its core, Elasticsearch is renowned for its full-text search capabilities, capable of performing complex queries and supporting features like fuzzy matching and auto-complete.
Peer reviews from various professionals highlight its strengths and weaknesses. Pros include its detection and correlation features, flexibility, cloud-readiness, extensibility, and efficient search capabilities. However, users have noted challenges like steep learning curves, data analysis limitations, and integration complexities. The platform is generally viewed as stable and scalable, with varying degrees of satisfaction regarding its usability and feature set.
In summary, Elasticsearch stands out for its high-speed search, scalability, and versatile analytics, making it a go-to solution for organizations managing large datasets. Its adaptability to different enterprise needs, robust community support, and continuous development keep it at the forefront of enterprise search and analytics solutions. However, potential users should be aware of its learning curve and the need for skilled personnel for optimization.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.