No more typing reviews! Try our Samantha, our new voice AI agent.
reviewer1247268 - PeerSpot reviewer
Technology Lead at a tech services company with 10,001+ employees
MSP
Jan 27, 2020
A cost-effective solution for high volume, multi-source data collection
Pros and Cons
  • "The most valuable feature is that it can handle high volume."
  • "Other than the problems with having no control over the queue, Apache Kafka is wonderful."
  • "Kafka does not provide control over the message queue, so we do not know whether we are experiencing lost or duplicate messages."

What is our primary use case?

Our company provides services and we use Apache Kafka as part of the solution that we provide to clients.

One of the use cases is to collect all of the data from multiple endpoints and provide it to the users. Our application integrates with Kafka as a consumer using the API, and then sends information to the users who connect. 

What is most valuable?

The most valuable feature is that it can handle high volume.

Apache Kafa is open-source and some of our clients are interested in becoming more involved in that.

What needs improvement?

Kafka does not provide control over the message queue, so we do not know whether we are experiencing lost or duplicate messages. Better control over the message queue would be an improvement. Solutions such as ActiveMQ do afford better control. Because of this, there is sometimes a gap in the results where we have either lost messages, or there are duplicates.

We have had problems when there was an imbalance because all of the messages were being sent back.

For how long have I used the solution?

I'm a beginner with Apache Kafka.

Buyer's Guide
Apache Kafka
June 2026
Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
900,838 professionals have used our research since 2012.

What do I think about the stability of the solution?

I cannot judge stability without having better control over the message queue, although I feel that it is not 100% stable. 

How are customer service and support?

We have not been in contact with technical support. For our first implementation with it, Kafka was already set up and running. When we did our PoC, I was not part of the team who was facing issues and it was they who were in contact with support.

Which solution did I use previously and why did I switch?

I also have experience with IBM MQ.

How was the initial setup?

We had problems when we were setting up Kafka ourselves to conduct our PoC internally. Kafka would not start and it was related to parameters or property settings in Java. We were able to work around it, but we had problems like adding certificates.

What about the implementation team?

In one case, we were using Kafka after it had already been set up, externally. It worked fine and we just had to configure some of the connectors that we wanted to try out.

What's my experience with pricing, setup cost, and licensing?

Apache Kafka is open-source and can be used free of charge.

What other advice do I have?

In this type of solution, you need to be able to accept a high volume of messages, but not lose any, and not have any duplicates. Because we are unable to control the queue in Kafka, I cannot say that this works 100%.

The suitability of this solution depends on the use cases. There are two or three things that we are worried about, and we will be very careful in choosing solutions. In cases where the messages are well organized, or there is no worry that there will be duplicate or dropped messages, then I recommend using Kafka. Also, I recommend this solution for those looking to get involved with open-source applications.

Other than the problems with having no control over the queue, Apache Kafka is wonderful.

I would rate this solution an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Senior Big Data Developer | Cloudera at Dilisim
Real User
Jan 21, 2020
Good scalability and excellent for storing data used for analytics but lacks a user interface
Pros and Cons
  • "Kafka provides us with a way to store the data used for analytics. That's the big selling point. There's very good log management."
  • "If the graphical user interface was easier for the Kafka administration it would be much better. Right now, you need to use the program with the command-line interface. If the graphical user interface was easier, it could be a better product."

What is our primary use case?

We are currently using this solution on our cloud-based clusters.

How has it helped my organization?

We use Kafka as part of our services. Our product (cloud clusters) has many components and Kafka is one of them.

For example, we use Kafka as a data integration tool. If you take Oracle GoldenGate as a typical use case, what happens is GoldenGate collects the data for the replication and sends this data to the Kafka servers. We collect the data on the Kafka servers, and we create some transformations, some operations, from that data. We then copy the data to the HTTP or hub site.

Previously, when I worked at Nokia, we were collected data using Kafka and then we stored the data on the Kafka servers. We did all transformations through Kafka streaming. Later, Kafka moved data over to the HP site. 

What is most valuable?

Kafka has a good storage layer on its side. I can store this data if it's streaming, and, if we do encounter any error, for example, on the network or server, we can later use the data to do some analytics on it using the Kafka server.

Kafka provides us with a way to store the data used for analytics. That's the big selling point. There's very good log management. 

Kafka provides many APIs that can be flexible and can be placed or expanded using the development life cycle. For example, using Java, I can customize the API according to our customers' demands. I can expand the functionality according to our customer demands as well. It's also possible to create some models. It allows for more flexibility than much of the competition.

What needs improvement?

If the graphical user interface was easier for the Kafka administration it would be much better. Right now, you need to use the program with a command-line interface. If the graphical user interface was easier, it could be a better product.

For how long have I used the solution?

I've been using the solution for more than three years.

What do I think about the stability of the solution?

The solution can be quite stable. We haven't encountered any issues on the Kafka side. However, Creating custom stabilizations would be good for dealing with stabilizing issues.

What do I think about the scalability of the solution?

The scalability of the solution is very good. You can analyze system events horizontally and the cluster can be brought over to the cloud side with the Kafka user's server.

We use the solution for both small and medium-sized organizations, but also larger enterprises. Some of our clients are in the banking and financial sector.

How are customer service and technical support?

Officially, I did not create any Kafka support tasks on the configuration support that is offered. I have created some questions on the stack overflow, however. Technical support is very good and I've found their response is very quick, giving you an answer within a day.

Which solution did I use previously and why did I switch?

We didn't previously use a different solution. We did some applications with Java for the consumer content but not the application function within that. We did objects instead.

How was the initial setup?

The initial setup isn't too complex. I know Kafka very well and don't find it to be overly difficult. There's also very good documentation which users can take advantage of.

Deployment, including security integration, only took about one day.

Two people handled the deployment. One person created the authentification group and after creating groups and users, another handled topic authentification and user definition for the customer.

What about the implementation team?

I handled the implementation for cloud-based clusters. I defined the broker nodes and other nodes for Kafka. We are a cloud integrator, so we handled it ourselves.

What's my experience with pricing, setup cost, and licensing?

I'm unaware of the costs surrounding licensing and setup.

What other advice do I have?

We're using the 2.1.30 version of the solution for our cloud-based clusters. We use the on-premises deployment model. Most customers use the on-premise solution for cloud-based clusters.

Kafka is a very good solution for log management. If you need anything done related to log management, Kafka can do it. Kafka can also store the data in the brokers. This prevents data loss as well as the duplication of data. It's quite comprehensive.

I'd rate the solution seven out of ten. If the solution could provide a user interface I'd rate it higher. This is important for managing Kafka's clusters on the administration side. It would also be helpful if two to three files could be minimized to one configuration file.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Apache Kafka
June 2026
Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
900,838 professionals have used our research since 2012.
Developer Infrastructure at Outbrain
Real User
Dec 31, 2019
Very easy to install, stable, and has good scaling options
Pros and Cons
  • "It's very easy to keep to install and it's pretty stable."
  • "In our current position, we use it to move a lot of data and I think it's definitely working well."
  • "The third party is not very stable and sometimes you have problems with this component. There are some developments in newer versions and we're about to try them out, but I'm not sure if it closes the gap."

How has it helped my organization?

In my previous company, we had a proprietary implementation and we changed it with Kafka. We changed it because we had many different connectors available and it also allowed us to create a window to our products for the client. It was an on-premise product and it allowed the outline to take the data out, without us developing anything.

You can connect in any language and there are a lot of connectors available, it helps a lot. And it creates visibility into the data and stability. There are several alternatives but this is one of the best options for this.

What is most valuable?

It's very easy to install and it's pretty stable.

The possibility to have connectors is very helpful. Another valuable aspect is that it's mature and open-source. 

From a scalability point of view, you just add servers and it's scalable. The whole architecture is very scalable.

What needs improvement?

There is a feature that we're currently using called MirrorMaker. We use it to combine the information from different Kafka servers into another server. It's very wide and it gives a very generic scenario. I think it would be great if the possibility would exist out of the box and not as a third party. The third party is not very stable and sometimes you have problems with this component. There are some developments in newer versions and we're about to try them out, but I'm not sure if it closes the gap.

For how long have I used the solution?

I have been using this solution for six months. I also worked with it additionally in my previous company but not so intensively. 

How are customer service and technical support?

I have never needed to use technical support. I know it's available but we haven't needed it because there's a lot of information on the internet that has helped us to solve our issues. 

What other advice do I have?

I would definitely recommend Kafka. In our current position, we use it to move a lot of data and I think it's definitely working well. I would definitely recommend it.

I would rate it an eight out of ten. 

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user928569 - PeerSpot reviewer
Technical Consultant at KPMG
Real User
Oct 17, 2018
It eases our current data flow and framework
Pros and Cons
  • "It eases our current data flow and framework."
  • "With such a large digest, I was genuinely impressed at the process being almost real-time."
  • "Kafka 2.0 has been released for over a month, and I wanted to try out the new features. However, the configuration is a little bit complicated: Kafka Broker, Kafka Manager, ZooKeeper Servers, etc."

What is our primary use case?

It's convenient and flexible for almost all kinds of data producers. We integrated it with Kafka Streams, which can perform some easy data processing, like summary, count, group, etc

How has it helped my organization?

It eases our current data flow and framework, which digests all types of sources regardless of it being structured or not.

What is most valuable?

  • High availability
  • High throughput

With such a large digest, I was genuinely impressed at the process being almost real-time.

What needs improvement?

Kafka 2.0 has been released for over a month, and I wanted to try out the new features. However, the configuration is a little bit complicated: Kafka Broker, Kafka Manager, ZooKeeper Servers, etc.

For how long have I used the solution?

Less than one year.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Senior Technical Architect at a computer software company with 51-200 employees
Real User
Nov 6, 2017
Its publisher-subscriber pattern has allowed our applications to access and consume data in real time.
Pros and Cons
  • "I like the performance and reliability of Kafka, as I needed a data streaming buffer that could handle thousands of messages per second with at least one processing point for an analytics pipeline, and Kafka fits this requirement very well, as it is a fast, distributed message broker that does exactly what it is designed to do."
  • "As an open-source project, Kafka is still fairly young and has not yet built out the stability and features that other open-source projects have acquired over the many years. If done correctly, Kafka can also take over the stream-processing space that technologies such as Apache Storm cover."

How has it helped my organization?

Through its publisher-subscriber pattern, Kafka has allowed our applications to access and consume data at a real time pace.

What is most valuable?

I like the performance and reliability of Kafka. I needed a data streaming buffer that could handle thousands of messages per second with at least one processing point for an analytics pipeline. Kafka fits this requirement very well, as it is a fast, distributed message broker. It definitely does exactly what it is designed to do.

What needs improvement?

As an open-source project, Kafka is still fairly young and has not yet built out the stability and features that other open-source projects have acquired over the many years. If done correctly, Kafka can also take over the stream-processing space that technologies such as Apache Storm cover.

Currently, as it is in the big/fast data integration world, you need to piece together many different open-source technologies. For example, to create a reliable, fault-tolerant streaming processing system that ingests data, you need:

  • a producer service
  • an event/message buffer such as Kafka or a message queue
  • a stream processing consumer such as Spark, Flink, Storm, etc.
  • something to help facilitate the ingestion into target datasources such as Flume or some customized concoction.

This is simply to ingest the data and does not necessarily account for the analytical pieces, which may consist of Spark ML, SystemML, ElasticSearch, Mahout, etc.

What I'm getting at is basically the need for a Spring framework of big data.

What do I think about the stability of the solution?

The only stability issues we had were mostly a result of the evolving APIs and existing bugs.

What do I think about the scalability of the solution?

Kafka is designed to be very easily scalable so I did not have any trouble here.

How are customer service and technical support?

We used the open-source version and did not buy support from Confluent.

Which solution did I use previously and why did I switch?

We did not have any other previous solutions. Our project was green field and a new type of project development.

How was the initial setup?

Initial setup was straightforward. We simply hosted multiple Kafka brokers and ZooKeeper servers on AWS EC2 instances.

What about the implementation team?

We implemented it in-house and then went with the Hortonworks Data Platform distribution.

Which other solutions did I evaluate?

We evaluated AWS Kinesis as well.

What other advice do I have?

Kafka is open source and requires an administrator to maintain the servers.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user660591 - PeerSpot reviewer
Senior Java Consultant at a tech services company with 501-1,000 employees
Consultant
Oct 9, 2017
The product is a distributed system for persistent messaging
Pros and Cons
  • "It's a high-performance distributed system."

    What is most valuable?

    The most valuable features are performance, persistent messaging, and reliability. It allows us to persist the message for a configurable number of days, even after it has been delivered to the consumer. The message delivery is also fast.

    How has it helped my organization?

    We wanted to track the customer activities on our application and store those details on another system(RDBMS/Apache Hadoop). We do extensive analysis with that. This helps the company to analyze the customer activities, such as search terms, and do better.

    What needs improvement?

    It’s perfect for our requirements.

    For how long have I used the solution?

    I have been using Apache Kafka for two years.

    What do I think about the stability of the solution?

    We have had no issues with stability.

    What do I think about the scalability of the solution?

    We have had no issues with scalability.

    How are customer service and technical support?

    We use the open source one, so we did not opt for any technical support.

    Which solution did I use previously and why did I switch?

    We started to use Apache Kafka with our application from scratch.

    How was the initial setup?

    The initial setup was straightforward. We faced some issues during the development in areas such as message producer and consumer. We rectified those with the tweaking the producer and consumer configurations. The documentation is very good.

    What's my experience with pricing, setup cost, and licensing?

    I don’t have any idea, as we use the open source version.

    What other advice do I have?

    It's a high-performance distributed system. If you want to track the user activities or any stream processing, then this is perfect. We have used Docker Kafka for our implementation. It's very easy for setup and testing. You could also try the same.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    it_user650004 - PeerSpot reviewer
    Team Lead at a financial services firm with 1,001-5,000 employees
    Vendor
    May 24, 2017
    Messages stay in Kafka after clients consume them. A message can be consumed by the same or a different client until topic retention kicks in and the oldest messages get deleted.
    Pros and Cons
    • "It has become dead simple to connect different application and services, saving a lot of development hours."
    • "The standard Kafka Java library, which is shipped with the product, is too complex for inexperienced users."

    What is most valuable?

    • Message Retention: Unlike regular message queues, messages stay in Kafka after clients consume them. A message can be consumed over and over again by the same or a different client until topic retention (by max data size or oldest message timestamp) kicks in and the oldest messages get deleted. This can be very handy in many scenarios: handling bugs in software, testing code, simple distribution of message processing, and routing messages to many different consumers simultaneously.
    • Horizontal Scalability: To add more capacity, both in terms of storage and performance to a Kafka cluster, you just need to add more servers. Regular message queues usually work in a master-slave configuration and do not scale very well horizontally.
    • Simplicity in operations.

    How has it helped my organization?

    It has become dead simple to connect different application and services, saving a lot of development hours.

    What needs improvement?

    The standard Kafka Java library, which is shipped with the product, is too complex for inexperienced users. At my company, engineering teams ended up writing wrapper libraries to solve complex issues. Kafka client libraries in general are complex, regardless of language. This is the price Kafka users have to pay for having simple, yet robust, server-side code.

    What could be improved is the hard dependency on ZooKeeper. The work in this direction has already been started, though. Overall, the project is moving forward at a very good pace

    For how long have I used the solution?

    I have used Kafka for three years.

    What do I think about the stability of the solution?

    Sometimes we have stability issues, but not often.

    What do I think about the scalability of the solution?

    We have not had any scalability issues.

    How are customer service and technical support?

    There is no official technical support as the product is 100% open source.

    Which solution did I use previously and why did I switch?

    We used RabbitMQ before. It does not scale well.

    How was the initial setup?

    The setup was pretty straightforward.

    What's my experience with pricing, setup cost, and licensing?

    There is no pricing and licensing.

    Which other solutions did I evaluate?

    We didn't evaluate any other options.

    What other advice do I have?

    Go ahead. It's a great product.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    it_user642168 - PeerSpot reviewer
    Big Data Lead at a marketing services firm with 51-200 employees
    Vendor
    May 23, 2017
    We use it as an MQ. From it, we have several consumers like Secor that upload raw data to S3.
    Pros and Cons
    • "We are growing and currently, we manage 1M events per second in Kafka."
    • "We used to have problems in Kafka every three weeks and our dev ops team fixed a few issues."

    What is most valuable?

    We are using Kafka consumer and producer.

    How has it helped my organization?

    We are using Kafka as MQ; our servers generate events which are being sent to Kafka. From Kafka, we have several consumers like Secor (https://github.com/pinterest/secor) that upload raw data to S3; Spark stream that is doing aggregations and saving the result in Cassandra; and Druid for OLAP.

    What needs improvement?

    • Maintenance: Sometimes brokers disconnect and there are repartitions issues.
    • Built-in monitoring application for Kafka infrastructure.
    • UI for Kafka would also be great (similar to http://www.kafkatool.com/).

    For how long have I used the solution?

    I have used this product for two years.

    What do I think about the stability of the solution?

    We used to have problems in Kafka every three weeks and our dev ops team fixed a few issues. For the last six months, there have been no production problems, but during the time Kafka was not stable, it was not easy to understand what was wrong and how to fix it.

    What do I think about the scalability of the solution?

    We have not encountered any scalability issues yet. We are growing and currently, we manage 1M events per second in Kafka.

    How are customer service and technical support?

    We need more documentation regarding maintenance issues.

    Which solution did I use previously and why did I switch?

    I used RabbitMQ and ActiveMQ. Kafka is the standard, so there is no question what to use (unless you need better performance, like in ZeroMQ).

    Which other solutions did I evaluate?

    We did not evaluate other options as Apache Kafka is the standard.

    What other advice do I have?

    Read the documentation and understand the offset issues (where to save them, read from start to end).

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    PeerSpot user
    Hadoop Technical Lead (Assistant Consultant) at a tech services company with 10,001+ employees
    Real User
    Top 20
    May 23, 2017
    This is the base streaming component of our IoT platform. It needs a separate cluster and a separate administrator.
    Pros and Cons
    • "This is the base streaming component of our IoT platform."
    • "It needs a separate cluster and a separate administrator to manage the Kafka cluster, adding an extra cost."

    What is most valuable?

    • Distributed
    • Persistence
    • Offset management by consumer

    How has it helped my organization?

    This is the base streaming component of our IoT platform.

    In case of disaster recovery, we mirror the data in the cluster by maintaining the offsets and store the data within Hadoop 2.8 HDFS.

    What needs improvement?

    • It needs a separate cluster and a separate administrator to manage the Kafka cluster, adding an extra cost.
    • It is challenging when data is moved to a mirror cluster, in the case of disaster recovery. It doesn't keep the offset.

    For how long have I used the solution?

    I have used this solution for one year.

    How are customer service and technical support?

    The open source community is very strong. Also, distributors like Cloudera and Hortonworks provide paid support.

    Which solution did I use previously and why did I switch?

    For big data, we did not have a previous solution. I have used Microsoft MQ for building traditional systems.

    How was the initial setup?

    The setup was straightforward.

    What's my experience with pricing, setup cost, and licensing?

    This is open source with the cost of a cluster administrator.

    Which other solutions did I evaluate?

    We did not look at anything else. At that time, this was already accepted by the industry for streaming data processing.

    What other advice do I have?

    If the Hadoop distribution is MapR, then consider MapR Streaming. MapR Streaming has overcome these fundamental issues. It stores data within the MapR-FS itself. So there is extra overhead, but with a licensing cost.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    FounderC32bc - PeerSpot reviewer
    Founder, CEO at a tech vendor with 1-10 employees
    Real User
    May 14, 2017
    The ability to partition data is valuable. There are far superior and cheaper alternatives in cloud-based solutions
    Pros and Cons
    • "The ability to partition data on Kafka is valuable."
    • "Kafka is good, but Kafka as a cloud service is awesome!!"
    • "The product is good, but it needs implementation and on-going support. The whole cloud engagement model has made the adoption of Kafka better due to PaaS (Amazon Kinesis, a fully managed service by AWS)."
    • "The only reason I give Kafka as product a low rating is because there are far superior and cheaper alternatives in cloud-based solutions, where we save money on manpower, electricity, servers, datacenters, networking, etc."

    How has it helped my organization?

    We have used Kafka for streaming customer web clicks from live sessions to understand customer behavioral patterns.

    What is most valuable?

    The ability to partition data on Kafka is valuable. But Kafka needs support and management. It is better to have it fully managed on the cloud.

    The only reason I give Kafka as product a low rating is because there are far superior and cheaper alternatives in cloud-based solutions, where we save money on manpower, electricity, servers, datacenters, networking, etc.

    In fact, this is the view I have for pretty much all open source software compared to cloud based services. They just make things cheaper, faster, scalable and manageable. Kafka is good, but Kafka as a cloud service is awesome!!

    This is a relative rating (compared to cloud services), not that something is wrong with Kafka. I hope that is clear.

    What needs improvement?

    The product is good, but it needs implementation and on-going support. The whole cloud engagement model has made the adoption of Kafka better due to PaaS (Amazon Kinesis, a fully managed service by AWS).

    What do I think about the stability of the solution?

    No issues here with stability.

    What do I think about the scalability of the solution?

    Ah, scalability!!! We need to set up multiple servers again for handling the load, which makes Kafka not scalable, unless you subscribe to cloud services.

    How are customer service and technical support?

    It’s an Apache-community based support, so it is not really prioritized if you have a business issue. This is why most enterprise customers pay for cloud services.

    Which solution did I use previously and why did I switch?

    We didn’t have a previous solution. We started with Kafka and then switched to Amazon Kinesis (PaaS for Kafka). I think Microsoft Azure also released a competing service.

    How was the initial setup?

    The setup was straightforward.

    What's my experience with pricing, setup cost, and licensing?

    Licensing issues are not applicable. Apache licensing makes it simple with almost zero cost for the software itself.

    Which other solutions did I evaluate?

    We unsuccessfully, and kind of foolishly, tried Apache Camel. They were not similar in services, so we moved to Kafka rightfully, and then to AWS cloud ultimately.

    What other advice do I have?

    If you have a dedicated Kafka resource to implement and manage the services, then go for Apache Kafka. Otherwise, do consider cloud-based services from AWS or Azure.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    Buyer's Guide
    Download our free Apache Kafka Report and get advice and tips from experienced pros sharing their opinions.
    Updated: June 2026
    Product Categories
    Streaming Analytics
    Buyer's Guide
    Download our free Apache Kafka Report and get advice and tips from experienced pros sharing their opinions.