Try our new research platform with insights from 80,000+ expert users
RABBAHMahmoud - PeerSpot reviewer
Senior Technical Architect at RABBAH SOFT
Real User
Top 10
Enables us to move data from static files to a legacy system
Pros and Cons
  • "Scalability is very good."
  • "In the next release, I would like for there to be some authorization and HTL security."

What is our primary use case?

I'm a software architect. The use case will depend my customers. They usually use it for data transfer from static files to a legacy system.

What needs improvement?

In the next release, I would like for there to be some authorization features and HTL security. 

We also need bigger software and better monitoring. 

For how long have I used the solution?

I have been using Apache Kafka for the last ten years. 

What do I think about the stability of the solution?

The stability is good. We've never had any issues. 

Buyer's Guide
Apache Kafka
June 2025
Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
860,592 professionals have used our research since 2012.

What do I think about the scalability of the solution?

Scalability is very good. 

How are customer service and support?

I have never needed to contact technical support. My colleagues get support from here, in Morrocco. 

How was the initial setup?

The setup is not a big deal for us. We can handle it. After the system is set up, the person who administers it has to do so with Apache Kafka.

Depending on the setup, it will usually take two weeks.

What other advice do I have?

I would rate it a nine out of ten. Not a ten because of the monitoring and admin improvement I'd like for them to make. 

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Senior Big Data Developer | Cloudera at Dilisim
Real User
Good scalability and excellent for storing data used for analytics but lacks a user interface
Pros and Cons
  • "Kafka provides us with a way to store the data used for analytics. That's the big selling point. There's very good log management."
  • "If the graphical user interface was easier for the Kafka administration it would be much better. Right now, you need to use the program with the command-line interface. If the graphical user interface was easier, it could be a better product."

What is our primary use case?

We are currently using this solution on our cloud-based clusters.

How has it helped my organization?

We use Kafka as part of our services. Our product (cloud clusters) has many components and Kafka is one of them.

For example, we use Kafka as a data integration tool. If you take Oracle GoldenGate as a typical use case, what happens is GoldenGate collects the data for the replication and sends this data to the Kafka servers. We collect the data on the Kafka servers, and we create some transformations, some operations, from that data. We then copy the data to the HTTP or hub site.

Previously, when I worked at Nokia, we were collected data using Kafka and then we stored the data on the Kafka servers. We did all transformations through Kafka streaming. Later, Kafka moved data over to the HP site. 

What is most valuable?

Kafka has a good storage layer on its side. I can store this data if it's streaming, and, if we do encounter any error, for example, on the network or server, we can later use the data to do some analytics on it using the Kafka server.

Kafka provides us with a way to store the data used for analytics. That's the big selling point. There's very good log management. 

Kafka provides many APIs that can be flexible and can be placed or expanded using the development life cycle. For example, using Java, I can customize the API according to our customers' demands. I can expand the functionality according to our customer demands as well. It's also possible to create some models. It allows for more flexibility than much of the competition.

What needs improvement?

If the graphical user interface was easier for the Kafka administration it would be much better. Right now, you need to use the program with a command-line interface. If the graphical user interface was easier, it could be a better product.

For how long have I used the solution?

I've been using the solution for more than three years.

What do I think about the stability of the solution?

The solution can be quite stable. We haven't encountered any issues on the Kafka side. However, Creating custom stabilizations would be good for dealing with stabilizing issues.

What do I think about the scalability of the solution?

The scalability of the solution is very good. You can analyze system events horizontally and the cluster can be brought over to the cloud side with the Kafka user's server.

We use the solution for both small and medium-sized organizations, but also larger enterprises. Some of our clients are in the banking and financial sector.

How are customer service and technical support?

Officially, I did not create any Kafka support tasks on the configuration support that is offered. I have created some questions on the stack overflow, however. Technical support is very good and I've found their response is very quick, giving you an answer within a day.

Which solution did I use previously and why did I switch?

We didn't previously use a different solution. We did some applications with Java for the consumer content but not the application function within that. We did objects instead.

How was the initial setup?

The initial setup isn't too complex. I know Kafka very well and don't find it to be overly difficult. There's also very good documentation which users can take advantage of.

Deployment, including security integration, only took about one day.

Two people handled the deployment. One person created the authentification group and after creating groups and users, another handled topic authentification and user definition for the customer.

What about the implementation team?

I handled the implementation for cloud-based clusters. I defined the broker nodes and other nodes for Kafka. We are a cloud integrator, so we handled it ourselves.

What's my experience with pricing, setup cost, and licensing?

I'm unaware of the costs surrounding licensing and setup.

What other advice do I have?

We're using the 2.1.30 version of the solution for our cloud-based clusters. We use the on-premises deployment model. Most customers use the on-premise solution for cloud-based clusters.

Kafka is a very good solution for log management. If you need anything done related to log management, Kafka can do it. Kafka can also store the data in the brokers. This prevents data loss as well as the duplication of data. It's quite comprehensive.

I'd rate the solution seven out of ten. If the solution could provide a user interface I'd rate it higher. This is important for managing Kafka's clusters on the administration side. It would also be helpful if two to three files could be minimized to one configuration file.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Apache Kafka
June 2025
Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
860,592 professionals have used our research since 2012.
Developer Infrastructure at Outbrain
Real User
Very easy to install, stable, and has good scaling options
Pros and Cons
  • "It's very easy to keep to install and it's pretty stable."
  • "The third party is not very stable and sometimes you have problems with this component. There are some developments in newer versions and we're about to try them out, but I'm not sure if it closes the gap."

How has it helped my organization?

In my previous company, we had a proprietary implementation and we changed it with Kafka. We changed it because we had many different connectors available and it also allowed us to create a window to our products for the client. It was an on-premise product and it allowed the outline to take the data out, without us developing anything.

You can connect in any language and there are a lot of connectors available, it helps a lot. And it creates visibility into the data and stability. There are several alternatives but this is one of the best options for this.

What is most valuable?

It's very easy to install and it's pretty stable.

The possibility to have connectors is very helpful. Another valuable aspect is that it's mature and open-source. 

From a scalability point of view, you just add servers and it's scalable. The whole architecture is very scalable.

What needs improvement?

There is a feature that we're currently using called MirrorMaker. We use it to combine the information from different Kafka servers into another server. It's very wide and it gives a very generic scenario. I think it would be great if the possibility would exist out of the box and not as a third party. The third party is not very stable and sometimes you have problems with this component. There are some developments in newer versions and we're about to try them out, but I'm not sure if it closes the gap.

For how long have I used the solution?

I have been using this solution for six months. I also worked with it additionally in my previous company but not so intensively. 

How are customer service and technical support?

I have never needed to use technical support. I know it's available but we haven't needed it because there's a lot of information on the internet that has helped us to solve our issues. 

What other advice do I have?

I would definitely recommend Kafka. In our current position, we use it to move a lot of data and I think it's definitely working well. I would definitely recommend it.

I would rate it an eight out of ten. 

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user928569 - PeerSpot reviewer
Technical Consultant at KPMG
Real User
It eases our current data flow and framework
Pros and Cons
  • "It eases our current data flow and framework."
  • "Kafka 2.0 has been released for over a month, and I wanted to try out the new features. However, the configuration is a little bit complicated: Kafka Broker, Kafka Manager, ZooKeeper Servers, etc."

What is our primary use case?

It's convenient and flexible for almost all kinds of data producers. We integrated it with Kafka Streams, which can perform some easy data processing, like summary, count, group, etc

How has it helped my organization?

It eases our current data flow and framework, which digests all types of sources regardless of it being structured or not.

What is most valuable?

  • High availability
  • High throughput

With such a large digest, I was genuinely impressed at the process being almost real-time.

What needs improvement?

Kafka 2.0 has been released for over a month, and I wanted to try out the new features. However, the configuration is a little bit complicated: Kafka Broker, Kafka Manager, ZooKeeper Servers, etc.

For how long have I used the solution?

Less than one year.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Senior Technical Architect at a computer software company with 51-200 employees
Real User
Its publisher-subscriber pattern has allowed our applications to access and consume data in real time.
Pros and Cons
  • "I like the performance and reliability of Kafka. I needed a data streaming buffer that could handle thousands of messages per second with at least one processing point for an analytics pipeline. Kafka fits this requirement very well."
  • "As an open-source project, Kafka is still fairly young and has not yet built out the stability and features that other open-source projects have acquired over the many years. If done correctly, Kafka can also take over the stream-processing space that technologies such as Apache Storm cover."

How has it helped my organization?

Through its publisher-subscriber pattern, Kafka has allowed our applications to access and consume data at a real time pace.

What is most valuable?

I like the performance and reliability of Kafka. I needed a data streaming buffer that could handle thousands of messages per second with at least one processing point for an analytics pipeline. Kafka fits this requirement very well, as it is a fast, distributed message broker. It definitely does exactly what it is designed to do.

What needs improvement?

As an open-source project, Kafka is still fairly young and has not yet built out the stability and features that other open-source projects have acquired over the many years. If done correctly, Kafka can also take over the stream-processing space that technologies such as Apache Storm cover.

Currently, as it is in the big/fast data integration world, you need to piece together many different open-source technologies. For example, to create a reliable, fault-tolerant streaming processing system that ingests data, you need:

  • a producer service
  • an event/message buffer such as Kafka or a message queue
  • a stream processing consumer such as Spark, Flink, Storm, etc.
  • something to help facilitate the ingestion into target datasources such as Flume or some customized concoction.

This is simply to ingest the data and does not necessarily account for the analytical pieces, which may consist of Spark ML, SystemML, ElasticSearch, Mahout, etc.

What I'm getting at is basically the need for a Spring framework of big data.

What do I think about the stability of the solution?

The only stability issues we had were mostly a result of the evolving APIs and existing bugs.

What do I think about the scalability of the solution?

Kafka is designed to be very easily scalable so I did not have any trouble here.

How are customer service and technical support?

We used the open-source version and did not buy support from Confluent.

Which solution did I use previously and why did I switch?

We did not have any other previous solutions. Our project was green field and a new type of project development.

How was the initial setup?

Initial setup was straightforward. We simply hosted multiple Kafka brokers and ZooKeeper servers on AWS EC2 instances.

What about the implementation team?

We implemented it in-house and then went with the Hortonworks Data Platform distribution.

Which other solutions did I evaluate?

We evaluated AWS Kinesis as well.

What other advice do I have?

Kafka is open source and requires an administrator to maintain the servers.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user660591 - PeerSpot reviewer
Senior Java Consultant at a tech services company with 501-1,000 employees
Consultant
The product is a distributed system for persistent messaging

What is most valuable?

The most valuable features are performance, persistent messaging, and reliability. It allows us to persist the message for a configurable number of days, even after it has been delivered to the consumer. The message delivery is also fast.

How has it helped my organization?

We wanted to track the customer activities on our application and store those details on another system(RDBMS/Apache Hadoop). We do extensive analysis with that. This helps the company to analyze the customer activities, such as search terms, and do better.

What needs improvement?

It’s perfect for our requirements.

For how long have I used the solution?

I have been using Apache Kafka for two years.

What do I think about the stability of the solution?

We have had no issues with stability.

What do I think about the scalability of the solution?

We have had no issues with scalability.

How are customer service and technical support?

We use the open source one, so we did not opt for any technical support.

Which solution did I use previously and why did I switch?

We started to use Apache Kafka with our application from scratch.

How was the initial setup?

The initial setup was straightforward. We faced some issues during the development in areas such as message producer and consumer. We rectified those with the tweaking the producer and consumer configurations. The documentation is very good.

What's my experience with pricing, setup cost, and licensing?

I don’t have any idea, as we use the open source version.

What other advice do I have?

It's a high-performance distributed system. If you want to track the user activities or any stream processing, then this is perfect. We have used Docker Kafka for our implementation. It's very easy for setup and testing. You could also try the same.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user650004 - PeerSpot reviewer
Team Lead at a financial services firm with 1,001-5,000 employees
Vendor
Messages stay in Kafka after clients consume them. A message can be consumed by the same or a different client until topic retention kicks in and the oldest messages get deleted.

What is most valuable?

  • Message Retention: Unlike regular message queues, messages stay in Kafka after clients consume them. A message can be consumed over and over again by the same or a different client until topic retention (by max data size or oldest message timestamp) kicks in and the oldest messages get deleted. This can be very handy in many scenarios: handling bugs in software, testing code, simple distribution of message processing, and routing messages to many different consumers simultaneously.
  • Horizontal Scalability: To add more capacity, both in terms of storage and performance to a Kafka cluster, you just need to add more servers. Regular message queues usually work in a master-slave configuration and do not scale very well horizontally.
  • Simplicity in operations.

How has it helped my organization?

It has become dead simple to connect different application and services, saving a lot of development hours.

What needs improvement?

The standard Kafka Java library, which is shipped with the product, is too complex for inexperienced users. At my company, engineering teams ended up writing wrapper libraries to solve complex issues. Kafka client libraries in general are complex, regardless of language. This is the price Kafka users have to pay for having simple, yet robust, server-side code.

What could be improved is the hard dependency on ZooKeeper. The work in this direction has already been started, though. Overall, the project is moving forward at a very good pace

For how long have I used the solution?

I have used Kafka for three years.

What do I think about the stability of the solution?

Sometimes we have stability issues, but not often.

What do I think about the scalability of the solution?

We have not had any scalability issues.

How are customer service and technical support?

There is no official technical support as the product is 100% open source.

Which solution did I use previously and why did I switch?

We used RabbitMQ before. It does not scale well.

How was the initial setup?

The setup was pretty straightforward.

What's my experience with pricing, setup cost, and licensing?

There is no pricing and licensing.

Which other solutions did I evaluate?

We didn't evaluate any other options.

What other advice do I have?

Go ahead. It's a great product.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user642168 - PeerSpot reviewer
Big Data Lead at a marketing services firm with 51-200 employees
Vendor
We use it as an MQ. From it, we have several consumers like Secor that upload raw data to S3.

What is most valuable?

We are using Kafka consumer and producer.

How has it helped my organization?

We are using Kafka as MQ; our servers generate events which are being sent to Kafka. From Kafka, we have several consumers like Secor (https://github.com/pinterest/secor) that upload raw data to S3; Spark stream that is doing aggregations and saving the result in Cassandra; and Druid for OLAP.

What needs improvement?

  • Maintenance: Sometimes brokers disconnect and there are repartitions issues.
  • Built-in monitoring application for Kafka infrastructure.
  • UI for Kafka would also be great (similar to http://www.kafkatool.com/).

For how long have I used the solution?

I have used this product for two years.

What do I think about the stability of the solution?

We used to have problems in Kafka every three weeks and our dev ops team fixed a few issues. For the last six months, there have been no production problems, but during the time Kafka was not stable, it was not easy to understand what was wrong and how to fix it.

What do I think about the scalability of the solution?

We have not encountered any scalability issues yet. We are growing and currently, we manage 1M events per second in Kafka.

How are customer service and technical support?

We need more documentation regarding maintenance issues.

Which solution did I use previously and why did I switch?

I used RabbitMQ and ActiveMQ. Kafka is the standard, so there is no question what to use (unless you need better performance, like in ZeroMQ).

Which other solutions did I evaluate?

We did not evaluate other options as Apache Kafka is the standard.

What other advice do I have?

Read the documentation and understand the offset issues (where to save them, read from start to end).

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user