

Dremio and AWS Lake Formation compete in the data management and governance space. Dremio appears to have an edge due to its robust data lineage capabilities and flexibility across deployment scenarios.
Features: Dremio offers robust data management capabilities, including integration with Apache Airflow and Amazon S3, and enables federated queries across multiple sources. It uniquely allows for the creation of visual tables without impacting original data. AWS Lake Formation is integrated within the AWS ecosystem, facilitating management of access patterns, data classification, and providing fine-grained access controls and centralized policies.
Room for Improvement: Dremio faces challenges with Delta connector support, long query execution times, and performance issues with specific databases. Its documentation requires enhancement. AWS Lake Formation could improve integration with non-AWS services and faces challenges with pricing and governance functionalities. Enhancements in real-time monitoring and cross-account sharing would be beneficial.
Ease of Deployment and Customer Service: Dremio provides flexibility with deployment options across public, hybrid, and on-premises environments. It receives positive feedback for professional services but faces challenges with staffing integration. AWS Lake Formation is primarily deployed in the AWS public cloud, offering centralized support, and is noted for its reliability but incurs higher costs for professional services.
Pricing and ROI: Dremio is perceived as cost-effective, reducing personnel needs and providing a competitive pricing model for enterprises managing data lakes. AWS Lake Formation's pricing is complex, tied to services like S3 and Athena, and varies based on consumption models, which might increase total costs.
| Product | Mindshare (%) |
|---|---|
| Dremio | 5.1% |
| AWS Lake Formation | 4.1% |
| Other | 90.8% |


| Company Size | Count |
|---|---|
| Small Business | 3 |
| Midsize Enterprise | 2 |
| Large Enterprise | 15 |
| Company Size | Count |
|---|---|
| Small Business | 1 |
| Midsize Enterprise | 5 |
| Large Enterprise | 5 |
AWS Lake Formation streamlines data lake organization by integrating seamlessly with AWS services, ensuring robust access control, and facilitating effective data management. Its extensible framework and centralized data handling make it ideal for cloud environments that require scalability.
AWS Lake Formation offers comprehensive data synchronization with AWS, robust security through fine-grained policies, and seamless team collaboration tools. It centralizes data assets, supports analytics with Glue, and enhances data mesh structures. Its IAM integration supports efficient governance, with reliable data storage and retrieval backed by extensive documentation. While highly efficient within AWS ecosystems, Lake Formation faces integration challenges with third-party libraries, and users experience clutter in its interface. Aspects such as automation, policy management, and debugging are areas for improvement. Enhancements in orchestration, cross-account sharing, and integrations with services like SageMaker are user recommendations.
What are the key features of AWS Lake Formation?Organizations utilize AWS Lake Formation across various industries to build and manage data lakes, ensuring effective data governance and storage solutions. It helps integrate and transform data, control permissions, and support analytics. From e-commerce to healthcare, AWS Lake Formation delivers centralized data repositories for structured and unstructured data, securing information while providing scalable infrastructure and efficient resource management.
Dremio offers a comprehensive platform for data warehousing and data engineering, integrating seamlessly with data storage systems like Amazon S3 and Azure. Its main features include scalability, query federation, and data reflection.
Dremio's core strength lies in its ability to function as a robust data lake query engine and data warehousing solution. It facilitates the creation of complex queries with ease, thanks to its support for Apache Airflow and query federation across endpoints. Despite challenges with Delta connector support, complex query execution, and expensive licensing, users find it valuable for managing ad-hoc queries and financial data analytics. The platform aids in SQL table management and BI traffic visualization while reducing storage costs and resolving storage conflicts typical in traditional data warehouses.
What are Dremio's most valuable features?Dremio is primarily implemented in industries requiring extensive data engineering and analytics, including finance and technology. Companies use it for constructing data frameworks, efficiently processing financial analytics, and visualizing BI traffic. It acts as a viable alternative to AWS Glue and Apache Hive, integrating seamlessly with multiple databases, including Oracle and MySQL, offering robust solutions for data-driven strategies. Despite some challenges, its ability to reduce data storage costs and manage complex queries makes it a favorable choice among enterprise users.
We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.