Try our new research platform with insights from 80,000+ expert users
Nick Grigoriev - PeerSpot reviewer
Senior Data Engineer at a marketing services firm with 1,001-5,000 employees
Real User
Nov 14, 2022
Stable self-service data tool used for service to service integration and to manage simple ad hoc queries
Pros and Cons
  • "Dremio gives you the ability to create services which do not require additional resources and sterilization."
  • "Dremio doesn't support the Delta connector. Dremio writes the IT support for Delta, but the support isn't great. There is definitely room for improvement."

What is our primary use case?

I have used this solution as an ETL tool to create data marks on data lakes for bridging. I have used it as a greater layer for ad-hoc queries and for some services which do not require sub-second latency to credit data from very big data lakes. I have also used it to manage simple ad-hoc queries similar to Athena, Presto or BigQuery

We do not have a large number of people using this solution because it's mainly setup as a service to service integration. We integrated a big workload when we started using Dremio and this was very expensive. The migration is still in progress. As soon as this migration is finished, we plan to migrate ad-hoc queries from our analytical team. 

What is most valuable?

The native error interface has been most valuable. With Apache Airflow, which Dremio supports, you don't need to digitalize data. Dremio gives you the ability to create services which do not require additional resources and serialization. 

What needs improvement?

Dremio doesn't support the Delta connector. Dremio writes the IT support for Delta, but the support isn't great. There is definitely room for improvement. 

The community version lacks information or documentation. If you want to create a more dynamic scaling policy for your cluster, it's always a problem because you need an additional custom code to be sure that Dremio shuts down gracefully and does not get interrupted.

The support of the Delta I/O connector could be improved because the current support is very basic and the performance isn't great. 

For how long have I used the solution?

I have used this solution for three years in a non-production environment and for nine months in a production environment. 

Buyer's Guide
Dremio
January 2026
Learn what your peers think about Dremio. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,082 professionals have used our research since 2012.

What do I think about the stability of the solution?

This is a stable solution.

How are customer service and support?

I have used the community forum to get most of my answers for Dremio. This is best suited for simple queries but does not work as well for advanced questions that need input from a developer. 

How was the initial setup?

The complexity of the setup depends on the use case. If you are working with a cluster that you plan to shut down after a week, the setup is simple. For more complicated scenarios where you need to connect parquet table, it is more complicated and the documentation to support this setup is not comprehensive. 

Experienced DevOps teams need to spend at least three to five days to understand the level of ability between multiple masters and how the scheduling works. 

I tried to integrate my Dremio metrics and logs to DataDog and it wasn't simple.

What's my experience with pricing, setup cost, and licensing?

Right now the cluster costs approximately $200,000 per month and is based on the volume of data we have.

Which other solutions did I evaluate?

We considered Apache Drill, Apache Presto, Flink and Spark. The main reason we chose Dremio was cost and multitenancy.

What other advice do I have?

I would advise others not try to use Dremio as an ETL framework from day one. I also recommend do not use reflections, or any other tool which make dremio cluster statefull.

The installation for a production environment is complex and I would not recommend this solution for small businesses.

I would rate this solution a seven out of ten. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer2283825 - PeerSpot reviewer
Security Data Engineer at a pharma/biotech company with 5,001-10,000 employees
Real User
Jan 17, 2024
A highly stable solution that works like a data warehouse on top of data lakes
Pros and Cons
  • "Dremio allows querying the files I have on my block storage or object storage."
  • "I cannot use the recursive common table expression (CTE) in Dremio because the support page says it's currently unsupported."

What is our primary use case?

We use Dremio for data engineering.

How has it helped my organization?

Dremio has resolved my data lineage and data governance problems. The solution has also resolved the data availability for a different range of users, which used to be a problem.

What is most valuable?

Dremio allows querying the files I have on my block storage or object storage. The solution gives me a place where I can play around with the data virtually by creating VDSs or PDSs. Dremio works just like a data warehouse on top of my data lake, which is interesting.

What needs improvement?

Dremio's interface is good, but it has a few limitations. I cannot do a lot of things with ANSI SQL or basic SQL. I cannot use the recursive common table expression (CTE) in Dremio because the support page says it's currently unsupported.

The use case I am working on requires building trees and hierarchical structures. Most of the time, it requires complex nested data structures to be made simpler for end users. It would be good if Dremio could provide a way to create trees just like Oracle does using commands like CONNECT BY and NO CYCLE.

You can use a few languages to simplify complicated JSON and XML. It would be very helpful if Dremio could provide a solution to simplify building trees and building meaningful data from complex data.

What do I think about the stability of the solution?

I rate Dremio ten out of ten for stability.

What do I think about the scalability of the solution?

I rate Dremio ten out of ten for scalability.

How was the initial setup?

I rate Dremio ten out of ten for the ease of its initial setup.

What about the implementation team?

We implemented the solution through an in-house team. Dremio's deployment can be done quickly.

What other advice do I have?

Overall, I rate Dremio ten out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer. Customer
PeerSpot user
Buyer's Guide
Dremio
January 2026
Learn what your peers think about Dremio. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,082 professionals have used our research since 2012.
reviewer2036529 - PeerSpot reviewer
Database Engineer at a tech services company with 201-500 employees
Real User
Top 20
Nov 29, 2022
Beneficial memory competition, good support, and price well
Pros and Cons
  • "The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, SGFS, or Hive. The memory competition is good. If you are running any kind of materialized view, you'd be running in memory."
  • "Dremio takes a long time to execute large queries or the executing of correlated queries or nested queries. Additionally, the solution could improve if we could read data from the streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake."

What is our primary use case?

We were using Dremio as a data lake query engine tool. We were creating our PDSs and VDSs on top of our S3 buckets, and our data lake and the data-scientist teams were using the data for further processing. We didn't use it for any ETL jobs. We were using it as a data-lake tool.

What is most valuable?

The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, SGFS, or Hive. The memory competition is good. If you are running any kind of materialized view, you'd be running in memory.

What needs improvement?

Dremio takes a long time to execute large queries or the executing of correlated queries or nested queries. Additionally, the solution could improve if we could read data from the streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake.

For how long have I used the solution?

I have been using Dremio for approximately two years.

What do I think about the stability of the solution?

Dremio is highly stable.

What do I think about the scalability of the solution?

Dremio is scalable which is a benefit of the solution. You can scale up to the number of instances you want in case you are feeling the load, and in case you feel your query is running low or you are receiving extra traffic.  

You can set the configuration while installing and while setting it up in the Dremio. In the configuration file, you can set up a lot of settings, such as what time.

We have approximately 18 people using the solution in my organization.

How are customer service and support?

We have been in touch with the support from Dremio when we had some internal issues. This happened approximately two times. The support is good.

Which solution did I use previously and why did I switch?

This is the first tool in this category that I have used.

How was the initial setup?

Dremio's initial setup took one or two days, one day is sufficient and typical.

What about the implementation team?

There was one DevOps person used for the deployment and maintenance of the solution.

What's my experience with pricing, setup cost, and licensing?

Dremio is less costly competitively to Snowflake or any other tool.

What other advice do I have?

My advice to others is if they are creating a data lake for a customer, Dremio would be useful for a data engineering team. If they're willing to create a data lake and they wanted to use it with the cloud-agnostic tool, then it is a good choice. If this solution meets their requirement they should try it out.

I rate Dremio an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free Dremio Report and get advice and tips from experienced pros sharing their opinions.
Updated: January 2026
Buyer's Guide
Download our free Dremio Report and get advice and tips from experienced pros sharing their opinions.