What is our primary use case?
As a BI developer, my main use case for Dremio involves registering data zones and also doing inquiries, views, and making it persistent to use on BI tools, like Tableau reports. In a project, we create data zones, typically one that we have in our company to receive files. After doing the ingestion inside Dremio, what we do is set up permissions, and after setting up the permissions, the refresh session allows us to go over and start building the views and all those things to be possible to connect via JDBC driver and consume on Dremio dashboard.
What is most valuable?
The first feature that stands out for me in Dremio is the federated type of query, which allows the possibility to use multiple endpoints without worrying about writing custom SQL that runs only for SQL Server or for Postgres and Redshift. We build a SQL on Dremio standard, and then Dremio goes over and rewrites the whole lineage to make it possible to use multiple sources of data.
The feature impacts our daily work by reducing complexity because we don't need to worry about where the data comes from. We are slightly migrating to Snowflake, so Dremio is currently being replaced because Snowflake is a more robust platform, but we are kind of happy with the work Dremio does.
I think the reduction of complexity is a positive impact that Dremio has had on my organization. The main thing is that it is a place where you can write down simple SQLs and see the lineage, the way it integrates together; this is the best part of it.
What needs improvement?
We also have a close relationship with the team that does the Dremio maintenance for the database, like upgrading the versions and they know about some specific problems we had in the past, such as a memory leak. We had a memory leak on some versions, which sometimes stopped the service. Since we are using Dremio installed like a server, not a SaaS solution, many times we need to stop and restart the service to clear all the cache and all that, and this is the thing I should add.
I see that many times the new versions of Dremio have not fixed old bugs, and in some new versions, old problems that were previously fixed come back again, so I think the upgrade part could use improvement.
I remember using some features in the past, like pivot tables, which proved to be really difficult, but I know this is a fault also for other vendors. Pivoting, transposing, and unpivoting are often not so good. CTEs also many times prove to be not so good, so I think these two main items could be improved significantly if they standardize them.
For how long have I used the solution?
I have been using Dremio for approximately three years, since January of 2022.
What do I think about the stability of the solution?
We had a memory leak on some versions, which sometimes stopped the service. Many times, we need to stop and restart the service to clear all the cache and all that. I rate Dremio a nine in terms of stability. I think it is stable, but we need to restart it many times, and we need to monitor it regularly.
What do I think about the scalability of the solution?
Dremio's scalability can handle growing data and user demands easily.
How are customer service and support?
We have had to reach out for customer support many times, and they respond, so they are pretty supportive about some long-term issues. Many times it's with vendors where there is a missing feature or ongoing problem that they say will be fixed in the next release, but this happens not only with Dremio but with almost every vendor we have.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
We haven't previously used a different solution before Dremio. Snowflake is the new solution we are currently using to replace Dremio.
What was our ROI?
I cannot share the exact metric itself, but Dremio surely saves time, reduces costs, and all those things because we don't have to worry so much about the infrastructure to make the different tools communicate. Dremio was a solution that made it possible for us to have different data sources with only one registration on our data lake.
What's my experience with pricing, setup cost, and licensing?
I don't have information about pricing, setup cost, and licensing for Dremio, so I am not entitled to discuss it.
Which other solutions did I evaluate?
I probably don't remember all the options we evaluated, but we talked a lot about Databricks and Redshift, and Dremio was the best choice. Also, we don't use Tableau Prep because we use Dremio, which is an important thing.
What other advice do I have?
My advice to others looking into using Dremio is that it is a great tool because it keeps all your efforts together. It's a good thing to have if you want to have a unified catalog or metadata or something like this, so this is certainly a good tool to consider. I would rate this review an eight out of ten overall.
Which deployment model are you using for this solution?
On-premises