We used Dataiku for a demand forecasting project where the objective is to forecast the demand for each product for the next three months.
Manager at a tech vendor with 10,001+ employees
Low-code projects have empowered non-technical teams and now need better integration and visuals
Pros and Cons
- "The best features Dataiku offers include the ability for users to use the node without having to code and the functionality related to low-code/no-code."
- "I have not seen a return on investment with Dataiku in terms of time saved, money saved, or fewer employees needed."
What is our primary use case?
My main use case for Dataiku is data science and AI projects.
What is most valuable?
The best features Dataiku offers include the ability for users to use the node without having to code and the functionality related to low-code/no-code.
Dataiku has positively impacted my organization by allowing non-technical users to adapt a data science project and to maintain a part of a data science project.
What needs improvement?
I think a pain point related to Dataiku is the visualization, which is not straightforward, and the integration, which is also not straightforward for non-technical users.
To improve Dataiku, the company could enhance the capabilities related to integration and visualization.
For how long have I used the solution?
I have been using Dataiku for three years.
Buyer's Guide
Dataiku
January 2026
Learn what your peers think about Dataiku. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,114 professionals have used our research since 2012.
What do I think about the stability of the solution?
Dataiku is stable.
What do I think about the scalability of the solution?
Dataiku's scalability can be better.
How are customer service and support?
I have never tried Dataiku's customer support.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
Before, we used a solution that I cannot mention, but the change is more related to using a more straightforward solution for non-technical users.
Before choosing Dataiku, I evaluated KNIME.
What was our ROI?
I have not seen any specific outcomes or metrics such as time saved, reduced costs, or improved project delivery.
I have not seen a return on investment with Dataiku in terms of time saved, money saved, or fewer employees needed.
What's my experience with pricing, setup cost, and licensing?
I am not the person involved in the process regarding pricing, setup cost, and licensing.
What other advice do I have?
My advice to others looking into using Dataiku is to use it principally to help and support non-technical users.
Dataiku is deployed in my organization on a public cloud on Amazon Web Services.
Amazon Web Services is our cloud provider.
I am not the person involved in the process of determining whether we purchased Dataiku through the AWS Marketplace.
My review rating for Dataiku is 7.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Last updated: Jan 9, 2026
Flag as inappropriateData Scientist at a energy/utilities company with 10,001+ employees
Saves a lot of time because I can quickly handle all the data preparation tasks and concentrate on building my machine learning algorithms
Pros and Cons
- "The advantage is that you can focus on machine learning while having access to what they call 'recipes.' These recipes allow me to preprocess and prepare data without writing any code."
- "One of the main challenges was collaboration. Developers typically use GitHub to push and manage code, but integrating GitHub with Dataiku was complicated."
What is our primary use case?
We use the solution for data science and machine learning.
How has it helped my organization?
We were a team of six Dataiku scientists and one data engineer. We focused on fully leveraging Dataiku for all our data science-related tasks. This included data preparation, preprocessing, benchmarking machine learning algorithms, handling everything related to production, and making our algorithms available to stakeholders.
What is most valuable?
The advantage is that you can focus on machine learning while having access to what they call 'recipes.' These recipes allow me to preprocess and prepare data without writing any code. This saves a lot of time because I can quickly handle all the data preparation tasks and concentrate on building my machine learning algorithms.
What needs improvement?
One of the main challenges was collaboration. Developers typically use GitHub to push and manage code, but integrating GitHub with Dataiku was complicated. While it was theoretically possible to use GitHub with Dataiku, in practice, it was difficult to manage our code effectively and push it from Dataiku to GitHub.
Another limitation was its ability to handle different types of data. While Dataiku is powerful for working with structured data, like regular or geospatial data, it struggled with more complex data types such as text and image. In addition to the challenges with GitHub integration, the limited support for diverse data types was another feature lacking at that time.
For how long have I used the solution?
I have been using Dataiku for over a year.
What do I think about the stability of the solution?
Since Dataiku relies on various open-source libraries and tools, updates or upgrades to these components can sometimes impact the stability of Dataiku's features. This can make it challenging to maintain consistent stability, as changes in the underlying open-source tools can affect how Dataiku functions.
I rate the stability as six out of ten.
What do I think about the scalability of the solution?
There are some scalability issues.
I rate the scalability as seven out of ten.
How are customer service and support?
Technical support was very good compared to other tools. We had access to chat and support.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup is very easy. It has many tutorials and many guidelines. After the initial deployment, it took about a week to manage all the setup and resolve various issues before we had a stable version of Dataiku that we could use consistently.
I rate it as eight out of ten, whereas ten is easy.
What's my experience with pricing, setup cost, and licensing?
It is very expensive.
What other advice do I have?
I wouldn't recommend using Dataiku if only one data scientist is on the team. However, having a larger team—let's say more than five data scientists—can be very helpful. Dataiku offers features that are especially useful when multiple people are working on the same project, and it also has tools that make it easier to move from the proof of concept stage to production.
Overall, I rate the solution as seven out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Buyer's Guide
Dataiku
January 2026
Learn what your peers think about Dataiku. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,114 professionals have used our research since 2012.
Consultant at a tech services company with 51-200 employees
Gives different aspects of modeling approaches and good for multiple teams' collaboration
Pros and Cons
- "If many teams are collaborating and sharing Jupyter notebooks, it's very useful."
- "Dataiku still needs some coding, and that could be a difference where business data scientists would go for DataRobot more than Dataiku."
What is our primary use case?
My current client has Dataiku. We do sentiment analysis and some small large language models right now. We use Dataiku as a Jupyter Notebook.
We use it a lot for marketing and analytics. The marketing and sales team uses Dataiku.
What is most valuable?
It's got good feature selection and creation of feature stores, and it also gives different aspects of modeling approaches. There are a lot of similarities with DataRobot.
So feature selection, different modeling, and financial metrics are good aspects.
What needs improvement?
The no-code/low-code aspect, where DataRobot doesn't need much coding at all.
Dataiku still needs some coding, and that could be a difference where business data scientists would go for DataRobot more than Dataiku because you still have to code and use either Python or R, or Scala. However, with DataRobot, you don't have to do that at all.
For how long have I used the solution?
I've used Dataiku for about four years.
How are customer service and support?
The company is based in France. But they're more and more in America as well.
So, the support was okay.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I used DataRobot. Dataiku has a different kind of structure to it. It's not financially heavy like DataRobot, which caters more to financial companies, like banks. Dataiku doesn't have that yet. I think they are also working on that area. But yeah, there are some key differences between the two products.
DataRobot has an additional feature with financial firms that it creates all these financial metrics when you run a time series analysis. Those things I have not seen in Dataiku.
If any financial company is choosing between DataRobot and Dataiku, they will definitely go for DataRobot because it creates all these financial metrics. It creates deltas, time series, time difference fields, and things like that. So, that is an added feature that DataRobot has.
What's my experience with pricing, setup cost, and licensing?
Pricing is pretty steep. Dataiku is also not that cheap. It depends on the client and how much they want to spend towards a tool.
What other advice do I have?
Overall, I would rate it an eight out of ten, except for some coding things that are there, which some people may not want to do, like certain business data scientists.
Dataiku is good for multiple teams' collaboration. If many teams are collaborating and sharing Jupyter notebooks, it's very useful. It has a good data processing structure and includes most of the models. I haven't checked the large language models in it yet, but it's a pretty good tool. It does well with analytics and has a sound structure on the back end.
Some coding aspects are necessary, but it generates SQL code, which is an added feature. A lot of data engineers like Dataiku because it generates SQL code on the right side.
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Senior Data Engineer at a consultancy with 1,001-5,000 employees
The model is very useful
Pros and Cons
- "Data Science Studio's data science model is very useful."
- "I think it would help if Data Science Studio added some more features and improved the data model."
What is our primary use case?
The use case is data science, and we've deployed Data Science Studio in multiple regions for four environments: dev, preset, pre-production, and production.
How has it helped my organization?
The solution responds to the requirements of the business team.
What is most valuable?
Data Science Studio's data science model is very useful.
What needs improvement?
I think it would help if Data Science Studio added some more features and improved the data model.
For how long have I used the solution?
We've been using Data Science Studio for three years.
What do I think about the stability of the solution?
Data Science Studio is stable.
What do I think about the scalability of the solution?
Data Science Studio is horizontally scalable.
How was the initial setup?
Setting up Data Science Studio was simple, and deployment took about three months. You only need around three people to deploy and maintain.
What other advice do I have?
I rate Dataiku Data Science Studio nine out of 10.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Data Scientist II at a computer software company with 51-200 employees
Flexible and intuitive with good stability
Pros and Cons
- "The solution is quite stable."
- "The interface for the web app can be a bit difficult. It needs to have better capabilities, at least for developers who like to code. This is due to the fact that everything is enabled in a single window with different tabs. For them to actually develop and do the concurrent testing that needs to be done, it takes a bit of time. That is one improvement that I would like to see - from a web app developer perspective."
What is most valuable?
The best feature is the user interface. It allows us to see the visual flows.
The intuitiveness of the software itself is very good.
It has both click or code flexibility, which is quite useful. I already have coding knowledge, so I go for coding as my go-to method of interaction. However, for a normal person that may not have a background in coding, the visual lists are really great. It makes it easy for non-IT individuals to navigate everything.
The pricing is very good.
The solution is quite stable.
You can scale the solution well.
What needs improvement?
When the flows get complex, there are too many data sets on them. Normal users will get confused by the influx of information. This was a problem up until recently, however, they have since resolved it. Now, there's a feature called Zones(introduced in version 8) that allows users to collapse multiple flows into a single flow. It allows everyone to easily carry on with their work.
Other than that issue, that has now been resolved, there isn't anything lacking by way of features.
The interface for the web app can be a bit difficult. It needs to have better capabilities, at least for developers who like to code. This is due to the fact that everything is enabled in a single window with different tabs. For them to actually develop and do the concurrent testing that needs to be done, it takes a bit of time. That is one improvement that I would like to see - from a web app developer perspective.
For how long have I used the solution?
I've been using the solution for more than a year at this point.
What do I think about the stability of the solution?
The solution is pretty stable. Since ours is a shared platform, the solution does freeze sometimes when a complex model is training or if you keep too many notebooks open the in-memory after editing. Though the usage has been optimised by admin end using resource control. Clearing notebook memory and job logs through in-built macros keep the solution stable.
What do I think about the scalability of the solution?
The solution is scalable. It has the power to integrate the cloud platform. This makes scalability both possible and easy.
We have ~50 developers/users working with the solution right now.
How are customer service and technical support?
Technical support is really, really good. It's excellent, actually. If you click on the "get help" button and you mark it as urgent, you get a reply from the CTO himself. I find that most of my conversations are direct with the CTO, and that's due to the fact that most of the time my questions are quite complex.
They're very knowledgeable and are able to assist us very well. They always reply very promptly. We're quite satisfied with their level of support.
I'd rate it overall four out of five from a solutions perspective.
Which solution did I use previously and why did I switch?
I haven't used another solution. This is my first as we are just growing this space within our organization.
How was the initial setup?
I didn't handle the initial setup of the solution. Therefore, I can't speak to how difficult of complex the setup was.
What's my experience with pricing, setup cost, and licensing?
This solution is cheaper than other competitors. It's less expensive, for example, than Alteryx or any other product.
What other advice do I have?
While I started using version 5.1 of the solution, I've currently updated it to 7.0.1.
I would recommend the solution. It's affordable and user-friendly.
Overall, I'd rate it nine out of ten. I'd rate it higher, however, I don't have enough experience with other similar solutions, therefore, it's hard to compare.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Principal Full Stack Data Scientist at a tech services company with 51-200 employees
Good data preparation tools and integrates well with BigQuery
Pros and Cons
- "The most valuable feature is the set of visual data preparation tools."
- "In the next release of this solution, I would like to see deep learning better integrated into the tool and not simply an extension or plugin."
What is our primary use case?
Our primary uses for this solution are data preparation and data modeling.
We have a testing environment, a production environment, and two API nodes.
How has it helped my organization?
Using Dataiku has meant that we spend less time on preparing and cleaning data, and we spend less time on blending models together. Ultimately, it means that we can spend more time modeling.
What is most valuable?
The most valuable feature is the set of visual data preparation tools.
The solution supports code from different languages including Python and R. Whatever code you want to use, it works well.
This solution allows us to store and retrieve data directly into BigQuery.
The documentation and tutorials are quite good.
What needs improvement?
From an administrative point of view, I would like to be able to communicate with the users who are logged into the system. For example, I would like to be able to send a broadcast message that says "I am shutting down the system."
I would like to see more organization and better cohesion within the tool.
In the next release of this solution, I would like to see deep learning better integrated into the tool and not simply an extension or plugin.
I would like to have a better way to manage images and sound.
The error messages are not self explanatory and can sometimes be difficult to understand.
For how long have I used the solution?
I have been using this solution for one year.
What do I think about the stability of the solution?
The stability is quite good.
What do I think about the scalability of the solution?
This is a scalable solution where we integrate with BigQuery for storing and retrieving our data. There are only two of us in the company who use this solution, although we would like to increase our usage.
How are customer service and technical support?
The technical support is quite good. We have had to open a few tickets and they replied in just a few minutes. They are quick and supportive.
Which solution did I use previously and why did I switch?
I still use several other solutions for data science including RapidMiner 9 and Weka. I have also been using Octave and MATLAB for modeling.
I use the Community Editions for these other products, so I have restrictions when it comes to things like the size of the dataset. When I need to be free of restrictions then I use Dataiku Data Science Studio.
How was the initial setup?
The initial setup is very, very simple.
To deploy the entire platform will take one or two days.
What about the implementation team?
We handled the deployment ourselves. We can work totally independently.
What's my experience with pricing, setup cost, and licensing?
The annual licensing fees are approximately €20 ($22 USD) per key for the basic version and €40 ($44 USD) per key for the version with everything.
What other advice do I have?
At the moment, we haven't had any need to use containers or Spark because everything is included in BigQuery.
My advice for anybody who is implementing this solution is to start with having somebody who can mentor you. Whether this is the case or not, the tutorial and documentation are quite good, so I would suggest going through the whole tutorial and academy material.
This solution does have a learning curve, although it is not steep.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner.
Business Intelligence Developer/ Data Scientist at a tech services company with 11-50 employees
User interface is colorful, beautiful, and well-designed but sometimes the solution can be slow
Pros and Cons
- "I like the interface, which is probably my favorite part of the solution. It is really user-friendly for an IT person."
- "I find that it is a little slow during use. It takes more time than I would expect for operations to complete."
What is our primary use case?
I just use the product for data migration. Once a month, I push data from one Redshift data warehouse to another Redshift data warehouse. I do that by using a simple SQL query.
What is most valuable?
I like the interface, which is probably my favorite part of the solution. It is really user-friendly. I might think that it is user-friendly because I'm in IT and it seems familiar to me. It is colorful and I think it is really beautiful and well-designed.
What needs improvement?
I think the interface is very nice, but for somebody who is not as familiar with IT as I am, it may be much more difficult for them. It is nice for me because I'm familiar with this type of software that falls in the realm of the data science platform. I can see how a client who really doesn't know anything about IT or computers might try to use it and find that it would be a little difficult to access some features. That type of user may really need training in order to work with Dataiku. So, in the next release of Dataiku DSS (Data Science Studio), they should make it more friendly for everybody to use, not just IT people.
For me, I find that it is a little slow during use. When I use Dataiku to run my script to transfer data, it takes more time than I would expect for the operation to complete.
For how long have I used the solution?
I have been using this solution for a year.
What do I think about the stability of the solution?
I have not done a lot of exploration into Dataiku and its other features as I only use it for a particular task. But from my experiences and from the review I'm seeing online, it is a stable solution.
What do I think about the scalability of the solution?
I think there is some room for improvement in scalability as I already find it performs a little slowly. In my company, there are three of us. There is an IT manager, there is the data warehouse manager, and there is me. In the company that we use it for, there are more than 800 employees.
How are customer service and technical support?
I have contacted customer service before. They respond quickly, so that is a plus.
Which solution did I use previously and why did I switch?
We did not use other solutions before switching so much as we are using a few different solutions together. We use Knime and Alteryx for data science and analytics. I just use Dataiku for data migration for now. I just started using Knime, but I'm most familiar with Alteryx because I have used it for one-and-a-half-years to practice ATL (Active Template Library) on my data. Alteryx is somewhat the same as Knime, but it is more user-friendly.
How was the initial setup?
The company I work for did the setup for me. It was already set up when I came.
What about the implementation team?
We are consultants so we do the implementations ourselves. Another consultant introduced Dataiku to us initially. I guess they really appreciated the solution.
What other advice do I have?
Dataiku is a very broad solution that offers many possibilities. If you want to use it you must be fully committed to it.
The biggest lesson I learned from using the product is that you can do many things with it. But you must commit the time to discover the tool.
On a scale from one to ten where one is the worst and ten is the best, I would rate Dataiku as a seven. It is a little bit of a conservative rating because it is a nice solution and I just use it for a particular task.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner.
Practice Manager Data Intelligence at a tech services company with 1,001-5,000 employees
An all-in-one data prediction tool that is simple to install and use
Pros and Cons
- "The most valuable feature of this solution is that it is one tool that can do everything, and you have the ability to very easily push your design to prediction."
- "The ability to have charts right from the explorer would be an improvement."
What is our primary use case?
I use this solution to make predictions from time-series data. I am a consultant and operate this solution for my clients.
What is most valuable?
The most valuable feature of this solution is that it is one tool that can do everything, and you have the ability to very easily push your design to prediction.
What needs improvement?
I would like to have better exclusion of data capability.
The ability to have charts right from the explorer would be an improvement.
I would like to see additions to the architecture for specific business use cases.
For how long have I used the solution?
I have been using this solution for two years.
What do I think about the stability of the solution?
This is a stable solution. I have not seen any bugs or glitches.
What do I think about the scalability of the solution?
This platform is easy to scale. You can take full advantage of your office architecture.
My client has more than twenty people using this solution.
How are customer service and technical support?
I have been in contact with technical support through the website. They have chatbots that dispatch tickets to specific people, depending on the problem. They respond quickly and I am satisfied with the support.
How was the initial setup?
The initial setup of this solution is straightforward.
What about the implementation team?
I performed the implementation myself. By reading the documentation, it is simple.
What other advice do I have?
My advice to anybody who is implementing this solution is to use the tutorial first. There are lots of tutorials available that help to quickly explain the solution.
This is a product that I recommend.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer. Reseller.
Buyer's Guide
Download our free Dataiku Report and get advice and tips from experienced pros
sharing their opinions.
Updated: January 2026
Product Categories
Data Science PlatformsPopular Comparisons
Databricks
KNIME Business Hub
IBM SPSS Statistics
Amazon SageMaker
Alteryx
Microsoft Azure Machine Learning Studio
Altair RapidMiner
Dremio
IBM Watson Studio
IBM SPSS Modeler
Anaconda Business
Starburst Enterprise
Domino Data Science Platform
H2O.ai
Google Cloud Datalab
Buyer's Guide
Download our free Dataiku Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which AutoML solution is the best?
- Which solution do you prefer: SAS Viya or Dataiku Data Science Studio?
- Which are the best end-to-end data science platforms?
- What enterprise data analytics platform has the most powerful data visualization capabilities?
- What Data Science Platform is best suited to a large-scale enterprise?
- When evaluating Data Science Platforms, what aspect do you think is the most important to look for?
- How can ML platforms be used to improve business processes?
- Why is Data Science Platforms important for companies?
















