Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs MarkLogic comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in NoSQL Databases
8th
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (2nd)
MarkLogic
Ranking in NoSQL Databases
19th
Average Rating
9.6
Reviews Sentiment
7.5
Number of Reviews
2
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of August 2025, in the NoSQL Databases category, the mindshare of Cloudera Distribution for Hadoop is 2.4%, down from 2.5% compared to the previous year. The mindshare of MarkLogic is 1.8%, up from 0.8% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases
 

Featured Reviews

Rok Dolinsek - PeerSpot reviewer
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
Beverly R. Jamison - PeerSpot reviewer
Frequent updates, helpful search capabilities, and high quality support
MarkLogic's greatest asset is its strong engineering foundation. It was specifically designed with search capabilities in mind, and the developers placed a great emphasis on ensuring the quality of the indexing and all subsequent layers that were added. The solution has been good at providing the updates that were what we were hoping for. They frequently update the solution.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The main advantage is the storage is less expensive."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"Cloudera provides a hybrid solution that combines compute on cloud or on-premises."
"Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform, offers power processing, supports different file systems and query engines, and provides parallel processing for handling many requests."
"The data science aspect of the solution is valuable."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"It is helpful to gather and process data."
"The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
"The rules can show us if there are missing items, like titles, and we can add them in to ensure everything is filled and makes sense and there are no missing details."
"MarkLogic's greatest asset is its strong engineering foundation. It was specifically designed with search capabilities in mind, and the developers placed a great emphasis on ensuring the quality of the indexing and all subsequent layers that were added."
 

Cons

"They should focus on upgrading their technical capabilities in the market."
"The initial setup of Cloudera is difficult."
"Cloudera's support is extremely bad and cannot be relied on."
"It could be faster and more user-friendly."
"There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
"The Cloudera training has deteriorated significantly."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS."
"The spreadsheet capabilities could be improved."
"One of the most common requests is to improve the user interface of the database. While it is primarily a database, there are other databases available that offer more user-friendly interfaces. The UI is good for developers but not for regular users. More visuals would be beneficial."
 

Pricing and Cost Advice

"I wouldn't recommend CDH to others because of its high cost."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The product’s price depends from project to project."
"The price could be better for the product."
"Cloudera requires a license to use."
"The tool is not expensive."
"The price is very high. The solution is expensive."
"MarkLogic is a pricey option, but there are some advantages to its pricing structure. For medium-sized clients or departments within larger companies, it is possible to obtain a license for one or two nodes for less than a hundred thousand dollars. Additionally, if you only need to deploy a single node, you can do so for under fifty thousand dollars. This is in contrast to other high-quality software options that are only accessible to larger businesses, where the starting price can be upwards of two hundred thousand dollars."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
865,295 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
19%
Educational Organization
17%
Computer Software Company
12%
Energy/Utilities Company
6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
It is quite complicated to configure and install. Integrating the platform into an information system is always a challenge, especially when starting with on-premise implementation. Integrating wit...
Ask a question
Earn 20 points
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
ALM, American Psychological Association, American Society of Agronomy, Cond_ Nast, Centers for Medicare and Medicaid Services, Institute of Engineering and Technology, JWG Group, Lagardre Active, RSuite CMS, Wiley
Find out what your peers are saying about Cloudera Distribution for Hadoop vs. MarkLogic and other solutions. Updated: July 2025.
865,295 professionals have used our research since 2012.