Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs MarkLogic comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in NoSQL Databases
10th
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (2nd)
MarkLogic
Ranking in NoSQL Databases
13th
Average Rating
8.8
Reviews Sentiment
5.6
Number of Reviews
4
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of February 2026, in the NoSQL Databases category, the mindshare of Cloudera Distribution for Hadoop is 3.3%, up from 1.9% compared to the previous year. The mindshare of MarkLogic is 2.2%, up from 1.2% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases Market Share Distribution
ProductMarket Share (%)
Cloudera Distribution for Hadoop3.3%
MarkLogic2.2%
Other94.5%
NoSQL Databases
 

Featured Reviews

Rok Dolinsek - PeerSpot reviewer
Manager, Bussines Development & Co Owner at Troia d.o.o.
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
DS
Staff Engineer at a tech vendor with 10,001+ employees
Search workflows have become faster for complex data while cloud-native flexibility still needs work
I think MarkLogic can be improved by providing good cloud infrastructure. Since we are in a very tech era, MarkLogic should provide good cloud infrastructure. If you look at other databases or systems like Kafka and MongoDB, they have cloud infrastructure. You do not need to worry about maintaining your own servers or provisioning your own servers. You simply log in and tell MarkLogic you want a certain number of clusters or nodes in a cluster and what cloud provider you want to use, then click okay, and they will build it for you. That is a headache with MarkLogic. Apart from that, I don't see any issues. It is a really great product.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It has the best proxy, security, and support features compared to open-source products."
"The product provides better data processing features than other tools."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions."
"The solution is stable."
"The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
"The file system is a valuable feature."
"Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform, offers power processing, supports different file systems and query engines, and provides parallel processing for handling many requests."
"MarkLogic's greatest asset is its strong engineering foundation. It was specifically designed with search capabilities in mind, and the developers placed a great emphasis on ensuring the quality of the indexing and all subsequent layers that were added."
"MarkLogic has positively impacted my organization by making everything quick and fast, and I believe that is a major change we have seen here."
"We moved to MarkLogic and created the API using JavaScript server-side language, and we saw almost 60% improvement in the speed of the search."
"The rules can show us if there are missing items, like titles, and we can add them in to ensure everything is filled and makes sense and there are no missing details."
 

Cons

"The initial setup of Cloudera is difficult."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"There are better solutions out there that have more features than this one."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there is a lot of things that need to improve."
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better."
"The spreadsheet capabilities could be improved."
"One of the most common requests is to improve the user interface of the database. While it is primarily a database, there are other databases available that offer more user-friendly interfaces. The UI is good for developers but not for regular users. More visuals would be beneficial."
"MarkLogic's scalability is very bad. In production, when you get to know that your data is increasing and you need to add one more node, that is not easy and not straightforward."
 

Pricing and Cost Advice

"I haven't bought a license for this solution. I'm only using the Apache license version."
"It is an expensive product."
"The price could be better for the product."
"Cloudera requires a license to use."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"The price is very high. The solution is expensive."
"I believe we pay for a three-year license."
"MarkLogic is a pricey option, but there are some advantages to its pricing structure. For medium-sized clients or departments within larger companies, it is possible to obtain a license for one or two nodes for less than a hundred thousand dollars. Additionally, if you only need to deploy a single node, you can do so for under fifty thousand dollars. This is in contrast to other high-quality software options that are only accessible to larger businesses, where the starting price can be upwards of two hundred thousand dollars."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
881,733 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
21%
Marketing Services Firm
9%
Computer Software Company
8%
Comms Service Provider
6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise31
No data available
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your experience regarding pricing and costs for MarkLogic?
Regarding my experience with pricing, setup cost, and licensing, I find it quite high compared to other things, but I believe it is justified.
What needs improvement with MarkLogic?
As for areas where MarkLogic can be improved, nothing else comes to mind. I have been using MarkLogic for seven years, and many improvements have already been addressed. I do not recall any additio...
What is your primary use case for MarkLogic?
My main use case for MarkLogic is data development. We have a banking type of data, so we used to convert it to PDF or use some for applications we were running. A quick, specific example of how I ...
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
ALM, American Psychological Association, American Society of Agronomy, Cond_ Nast, Centers for Medicare and Medicaid Services, Institute of Engineering and Technology, JWG Group, Lagardre Active, RSuite CMS, Wiley
Find out what your peers are saying about Cloudera Distribution for Hadoop vs. MarkLogic and other solutions. Updated: February 2026.
881,733 professionals have used our research since 2012.