Try our new research platform with insights from 80,000+ expert users
it_user837543 - PeerSpot reviewer
Senior Business Technology Analyst at a consultancy with 5,001-10,000 employees
Real User
Apr 5, 2018
GUI-based functionality is easy to use, but server up-time needs to be improved
Pros and Cons
  • "Extremely easy to use with its GUI-based functionality and large compatibility with various data sources. Also, maintenance processes are much more automated than ever, with fewer errors."
  • "Cloud-based process run helps in not keeping the systems on while processes are running."
  • "Server up-time needs to be improved. Also, query engines like Spark and Hive need to be more stable."
  • "Although known for Big Data, the processing time to process 1.8 billion records was terribly slow (five days)."
  • "There were stability issues: 1) SQL operations, such as partitioning, had bugs and showed wrong results. 2) Due to server downtime, scheduled processes used to fail. 3) Access to project folders was compromised (privacy issue) with wrong people getting access to confidential project folders."

What is our primary use case?

ETL development.

How has it helped my organization?

Compared to Informatica, this tool is extremely easy with its GUI-based functionality and large compatibility with various data sources. Also, maintenance processes are much more automated than ever, with fewer errors.

What is most valuable?

  • Process scheduler (called Scenario).
  • Cloud-based process run, which helps in not keeping the systems on while processes are running.

What needs improvement?

Server up-time needs to be improved. Also, query engines like Spark and Hive need to be more stable.

Buyer's Guide
Dataiku
January 2026
Learn what your peers think about Dataiku. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,082 professionals have used our research since 2012.

For how long have I used the solution?

One to three years.

What do I think about the stability of the solution?

There were stability issues.

  • SQL operations, such as partitioning, had bugs and showed wrong results.
  • Due to server downtime, scheduled processes used to fail.
  • Access to project folders was compromised (privacy issue) with wrong people getting access to confidential project folders.

What do I think about the scalability of the solution?

Although known for Big Data, the processing time to process 1.8 billion records was terribly slow (five days).

How are customer service and support?

I would rate tech support at six out of 10.

Which solution did I use previously and why did I switch?

I have previously worked with Alteryx. I switched for these reasons: 

  • Big Data tools like Python and Hive are already supported
  • Cloud-based processing
  • Better value for money for client.

How was the initial setup?

Initial setup was done by the IT team from the client's side. No involvement in the initial setup from our side.

What's my experience with pricing, setup cost, and licensing?

Since this was a client-advised tool, no information of costs is available to us.

Which other solutions did I evaluate?

Unfortunately, no.

What other advice do I have?

Easy-to-use tool, but slightly unstable in terms of a few modules. Proceed with some amount of caution and knowledge.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free Dataiku Report and get advice and tips from experienced pros sharing their opinions.
Updated: January 2026
Product Categories
Data Science Platforms
Buyer's Guide
Download our free Dataiku Report and get advice and tips from experienced pros sharing their opinions.