Apache Flink Reviews

Name: Apache Flink
Brand: Apache
Rating: 3.9 (19 reviews)

Vendor: Apache

3.9 out of 5

19 reviews
95% willing to recommend

Leave a review

What is Apache Flink?

Apache Flink is a powerful open-source framework for stateful computations over data streams, designed for both real-time and batch processing. It efficiently handles massive volumes of data with low-latency responses, offering versatility for complex event processing scenarios.

Get the Apache Flink Buyer's Guide and find out what your peers are saying about Apache Flink, Databricks, Qlik Talend Cloud and more!

Apache Flink is the #4 ranked solution in Streaming Analytics tools. PeerSpot users give Apache Flink an average rating of 7.8 out of 10. Apache Flink is most commonly compared to Databricks: Apache Flink vs Databricks. Apache Flink is popular among the large enterprise segment, accounting for 70% of users researching this solution on PeerSpot. The top industry researching this solution are professionals from a financial services firm, accounting for 19% of all views.

Helped 900,644 peers since 2012

Featured Apache Flink reviews

Sanjay Srivastava

Software Architect at IBM

We are not using Apache Flink in its advanced window capabilities. We are using the Apache Flink job in Apache SeaTunnel, meaning we can write the code inside Apache SeaTunnel. Currently, we are moving; both solutions are there. We are doing it on-premises with the help of Kubernetes and OpenShift. The main reason why Apache Flink is better is that it has more functions, and being open source with easy code in Apache SeaTunnel helps us achieve that. Cost is a major issue. I would rate the stability of the product as an eight. For Apache Flink, the final point can be rated an eight. I can recommend Apache Flink to other users for streaming support, and I am recommending it. I would rate this review an eight overall.

Read full review

Aswini Atibudhi

Distinguished AI Leader at Walmart Global Tech at Walmart

Apache Flink is very powerful, but it can be challenging for beginners because it requires prior experience with similar tools and technologies, such as Kafka and batch processing. It's essential to have a clear foundation; hence, it can be tough for beginners. However, once they grasp the concepts and have examples or references, it becomes easier. Intermediate users who are integrating with Kafka or other sources may find it smoother. After setting up and understanding the concepts, it becomes quite stable and scalable, allowing for customization of jobs. Every software, including Apache Flink, has room for improvement as it evolves. One key area for enhancement is user-friendliness and the developer experience; improving documentation and API specifications is essential, as they can currently be verbose and complex. Debugging and local testing pose challenges for newcomers, particularly when learning about concepts such as time semantics and state handling. Although the APIs exist, they aren't intuitive enough. We also need to simplify operational procedures, such as developing tools and tuning Flink clusters, as these processes can be quite complex. Additionally, implementing one-click rollback for failures and improving state management during dynamic scaling while retaining the last states is vital, as the current large states pose scaling challenges.

Read full review

Phani Mallojala

Technical Lead at a computer software company with 10,001+ employees

Apache Flink makes it easy to write on-the-fly transformations on streaming data, which is especially beneficial for users familiar with SQL. It allows me to join different data streams and perform aggregations, offering a smooth user experience. The tool is highly effective for use cases where real-time data processing is required. The ease of usage, even for complex tasks, stands out. It is a solution that allows quick validation of architectural patterns within a few days. It is scalable and stable, enhancing its attractiveness for enterprise use.

Read full review

Apache Flink mindshare

As of June 2026, the mindshare of Apache Flink in the Streaming Analytics category stands at 8.2%, down from 13.7% compared to the previous year, according to calculations based on PeerSpot user engagement data.

Streaming Analytics Mindshare Distribution
Product	Mindshare (%)
Apache Flink	8.2%
Databricks	7.9%
Azure Stream Analytics	6.8%
Other	77.1%

Streaming Analytics

PeerResearch reports based on Apache Flink reviews

Type	Title	Date
Category	Streaming Analytics	Jun 23, 2026	Download
Product	Reviews, tips, and advice from real users	Jun 23, 2026	Download
Comparison	Apache Flink vs Databricks	Jun 23, 2026	Download
Comparison	Apache Flink vs Azure Stream Analytics	Jun 23, 2026	Download
Comparison	Apache Flink vs Apache Kafka	Jun 23, 2026	Download

Valuable Features

Apache Flink's most valuable features include stateful streaming, enabling easy aggregation and management of out-of-order messages. Its windowing mechanism, inbuilt checkpointing, and stateful transformation capabilities streamline code and reduce latency. Flink's real-time data processing supports both batch and streaming modes without distinction. It offers excellent flexibility, scalability, and resource control, with APIs that enhance user experience and facilitate complex event processing. Community support and documentation have significantly improved.

"We value this solution's intricate system because it comes with a state inside the mechanism and product, allowing us to process batch data, stream to real-time and build pipelines, and we do not need to process data from the beginning when we pause as we can continue from the same point where we stopped, helping us save time as 95% of our pipelines will now be on Amazon and we'll save money by saving time."
"The end-to-end latency was drastically reduced, and our capability of handling high throughput has increased by using Flink."
"Flink moved on to becoming a standard technology for location platform."

Room for Improvement

Apache Flink requires improved integration with Python for seamless machine learning workflows. The initial setup is complex, and the debugging system could be more user-friendly. Support for additional data connectors is limited, and integration with other ecosystems could be enhanced. Documentation is insufficient, causing learning challenges. Users find the infrastructure's scalability demanding and require better reporting capabilities. Enhancements in connectors for data integration and technical support quality are essential. PyFlink's limited capabilities need addressing.

"One way to improve Flink would be to enhance integration between different ecosystems."
"I am using the Python API and I have found the solution to be underdeveloped compared to others. There needs to be better integration with notebooks to allow for more practical development."
"Flink has become a lot more stable but the machine learning library is still not very flexible."

Pricing

Apache Flink is predominantly recognized as an open-source platform available free of charge, eliminating licensing costs. Many users emphasize its open-source nature, highlighting a supportive community. Some, however, describe the solution as expensive, rating its pricing highly in terms of cost. Overall, enterprise buyers find Apache Flink's cost structure advantageous due to its free use, dependent mainly on implementation and other associated expenses.

"It's an open source."
"It's an open-source solution."
"Apache Flink is open source so we pay no licensing for the use of the software."

Popular Use Cases

Companies primarily utilize Apache Flink for real-time data processing and aggregation in various applications, such as handling event streams from Kafka, supporting data cleaning in CRM systems, and executing analytics for e-commerce metrics. Organizations utilize it for handling high-volume events, processing cab booking events quickly, monitoring network consumption, and orchestrating retail operations. Businesses leverage Flink to perform data transformations, real-time ETL pipelines, and process millions of records efficiently for immediate insights.

Service and Support

Many users rely on forums, community support, and detailed documentation rather than Apache Flink's technical services. While some leverage external tech support, such as Amazon, others face challenges due to limited official support. The open-source nature allows customization but lacks accountability outside of paid services. Community resources like Slack, GitHub, and Stack Overflow play significant roles, although some find the need for improvement in official support quality and responsibility.

Deployment

Many users experienced easy initial setup for Apache Flink on Mac, while production deployment on Kubernetes proved complex, requiring DevOps knowledge. AWS EMR provided easier management via its managed service. Documentation supports various deployment strategies, but Flink's relatively new status makes learning necessary without extensive community help. Some found setup straightforward, others found it time-consuming, depending on use cases and requirements. Expertise plays a significant role in setup complexity and duration.

Scalability

Apache Flink supports scalability through Docker and Kubernetes, offering flexibility in deployment. Users can efficiently scale up and down, although documentation could improve. It handles large data loads with ease, making it suitable for engineering teams and big companies. Challenges exist with stateful conditions, yet it surpasses alternatives for streaming needs. It requires significant infrastructure setup. Users appreciate the ease of scaling parallel processes and memory allocation for task managers.

Stability

Apache Flink is mostly stable with users observing good uptime and resource utilization. Some face intermittent issues with stateful aggregations and checkpointing, requiring optimizations. Running on Kubernetes, stability improves with robust infrastructure and configuration. Stability challenges can arise during cluster upgrades or long-running jobs, but a dedicated team for maintenance can enhance performance. Users report it performs well within specific use cases, but complex implementations may occasionally encounter failures requiring attention.

These insights are based on the in-depth reviews provided by peers to help you make a better buying decision.

Download our Apache Flink Buyer's Guide for additional reliable information.

Review data by company size

By reviewers
Company Size	Count
Small Business	3
Midsize Enterprise	2
Large Enterprise	11

By reviewers

By visitors reading reviews
Company Size	Count
Small Business	84
Midsize Enterprise	66
Large Enterprise	345

By visitors reading reviews

Top industries

By visitors reading reviews

Financial Services Firm

19%

Retailer

13%

Computer Software Company

Manufacturing Company

Comms Service Provider

University

Healthcare Company

Energy/Utilities Company

Insurance Company

Construction Company

Wholesaler/Distributor

Outsourcing Company

Transportation Company

Government

Educational Organization

Media Company

Marketing Services Firm

Legal Firm

Hospitality Company

Real Estate/Law Firm

Logistics Company

Recreational Facilities/Services Company

Consumer Goods Company

Performing Arts

Photography Company

Non Profit

Recruiting/Hr Firm

Aerospace/Defense Firm

Compare Apache Flink with alternative products

Learn more about Apache Flink

Apache Flink excels in processing high-throughput data streams, enabling seamless state management across distributed applications. Users appreciate its robust features like stateful transformations and checkpointing, simplifying deployment in diverse environments. Though powerful, it poses challenges for beginners due to its complexity and limited documentation, requiring some prior experience to master. Its flexible integration with systems like Kafka and support for Kubernetes on AWS makes it suitable for demanding environments where quick, real-time analysis is essential.

What are the key features of Apache Flink?

Stateful Transformations: Allows complex stateful operations on data streams with precise handling.
Low Latency: Ensures real-time data processing with minimal delays.
Checkpointing: Provides efficient and reliable checkpointing for fault tolerance.
Kafka Integration: Easy integration with Kafka for seamless data ingestion and processing.
API Support: Provides robust APIs for diverse data processing needs.
Flexible Deployment: Offers options for deploying on-premise or in cloud environments.

What benefits should users look for?

Versatility: Supports both batch and stream processing in a unified model.
Community Support: Backed by an active community that continuously enhances its features.
Ease of Use: Simplifies the coding process compared to similar frameworks like Apache Storm.
Real-Time Analytics: Facilitates immediate insights and data-driven decision-making.

Organizations leverage Apache Flink primarily for real-time data processing in sectors such as retail, transportation, and telecommunications. By deploying on AWS with Kubernetes, companies can utilize it for data cleaning, generating customer insights, and providing swift real-time updates. It effectively manages millions of events per second, serving use cases like cab aggregations, map-making, and outlier detection in telecom networks, enabling seamless integration of streaming data with existing pipelines.

Apache Flink was previously known as Flink.

Apache Flink customers

LogRhythm, Inc., Inter-American Development Bank, Scientific Technologies Corporation, LotLinx, Inc., Benevity, Inc.

Product Categories

Streaming Analytics

Popular Comparisons

Databricks vs Apache Flink

Qlik Talend Cloud vs Apache Flink

Coralogix vs Apache Flink

Confluent vs Apache Flink

Azure Stream Analytics vs Apache Flink

Spring Cloud Data Flow vs Apache Flink

PubSub+ Platform vs Apache Flink

Amazon Kinesis vs Apache Flink

Google Cloud Dataflow vs Apache Flink

Amazon MSK vs Apache Flink

Starburst Enterprise vs Apache Flink

Apache Spark Streaming vs Apache Flink

Striim vs Apache Flink

Apache Pulsar vs Apache Flink

IBM Streams vs Apache Flink

See all alternatives

Apache Flink Reviews Summary
Author info	Rating	Review Summary
Software Architect at IBM	4.0	I've used Apache Flink for over a year in a data integration project with Apache SeaTunnel, finding its streaming capabilities fast and cost-effective, though its limited connectors and poor technical support are areas needing improvement.
Distinguished AI Leader at Walmart Global Tech at Walmart	4.0	I use Apache Flink for enterprise orchestration and value its open-source, distributed stream processing framework. It's powerful but challenging for beginners, requiring prior experience. Enhancements in user-friendliness, documentation, and operational procedures are needed for smoother integration.
Technical Lead at a computer software company with 10,001+ employees	4.0	I provided architectural patterns for an insurance client's streaming analytics solution using Apache Flink. Its ease of use for real-time data processing stood out. More examples would enhance its utility. AWS was my first experience with such tools.
Head of Data at a energy/utilities company with 51-200 employees	3.5	We use Apache Flink for batch processing, finding it advantageous due to its easy learning curve and flexibility to deploy on any cluster. However, the initial setup process could be improved for easier configuration and efficient project startups.
Senior Software Development Engineer at Yahoo!	4.5	I migrated from Spark to this solution for data processing pipelines, valuing its stateful processing and ability to handle batch/real-time streams. I'd like improved user-friendliness and debugging, rating it 9/10.
Partner / Head of Data & Analytics at Intelligence Software Consulting	4.0	I use Apache Flink in telecom to handle millions of events per second. It offers strong development configurations but needs more libraries and machine learning capabilities. A more user-friendly interface for pipeline configuration and monitoring would be beneficial.
Principal Engineer at InnovAccer Inc.	4.0	I use Apache Flink for real-time data processing and ETL tasks due to its ability to handle high data volumes with low latency. It excels in stateful transformations, although PyFlink's limitations could be improved. I deploy it on AWS.
Consultant at a tech vendor with 10,001+ employees	3.5	I used Apache Flink for real-time analytics via AWS Kinesis, finding its deployment manageable. However, schema management and AWS integration were challenging. I preferred it over Kafka due to flexibility, although ROI insights post-deployment were unavailable. Talend had limitations.
Sr. Software Engineer at a tech services company with 10,001+ employees	4.0	I use Apache Flink for real-time risk detection and aggregations, valuing its "exactly once" processing and checkpointing. Stability and initial setup are challenging, demanding significant infrastructure investment, particularly for stateful applications.
CTO at ReNew	4.0	We utilize Apache Flink to process data from IoT devices, creating reports and insights using machine learning. Its integration capabilities support complex data tasks tailored to client needs. However, improvements in data capability and migration are necessary.

Sanjay Srivastava

Software Architect at IBM

Dec 15, 2025

Streaming workflows have improved data integration and support real-time pipelines across platforms

What is our primary use case?

I am working with Apache Flink, which is the tool we use for data integration. Apache Flink is for data, and we are working on the data integration project, not big data, using Apache Flink and Apache Spark on both sides.

It's for data integration; in terms of big data, we are taking the big data from Kafka and Elasticsearch, and we are moving the data pipeline.

We are not using Apache Flink stateful computations. We are not doing event processing; we are basically in the tool and not in the application. We are writing different database connectors to move the data. The application team will be writing the events and application events.

I have not used Apache Flink DataStream API. We are just using the Apache Flink pipeline in Apache SeaTunnel to move the streaming data.

What is most valuable?

Streaming functionality is the most useful function in Apache Flink for me. Data streaming is what we prioritize.

Apache Flink provides faster and low-cost investment for me; I find it to have low hardware requirements, and it's faster with low code, meaning it's easy to understand for moving the streaming data. Those two use cases are there.

What needs improvement?

Apache could improve Apache Flink by providing more functionality, as they need to fully support data integration. The connectors are still very few for Apache Flink.

There is a lack of functionality concerning data connectors.

The technical support from Apache is not good; support needs to be improved. I would rate them from one to ten as not good.

The points for improvement in technical support are in quality and also responsibility; everything they have to work on it.

For how long have I used the solution?

I have been working with Apache Flink for one to one and a half years or more.

What do I think about the scalability of the solution?

Its ability to scale and expand is also rated an eight or nine. It's quite scalable for the big data solution.

How are customer service and support?

The technical support from Apache is not good; support needs to be improved. I would rate them from one to ten as not good.

The points for improvement in technical support are in quality and also responsibility; everything they have to work on it.

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

I have compared Apache Flink with other vendors; AWS is a bigger competitor, such as AWS Glue.

I think Apache Flink is better than AWS Glue because of the integration with Apache SeaTunnel.

How was the initial setup?

The initial setup is very easy.

Which other solutions did I evaluate?

I have compared Apache Flink with other vendors; AWS is a bigger competitor, such as AWS Glue.

I think Apache Flink is better than AWS Glue because of the integration with Apache SeaTunnel.

What other advice do I have?

We are not using Apache Flink in its advanced window capabilities. We are using the Apache Flink job in Apache SeaTunnel, meaning we can write the code inside Apache SeaTunnel.

Currently, we are moving; both solutions are there. We are doing it on-premises with the help of Kubernetes and OpenShift.

The main reason why Apache Flink is better is that it has more functions, and being open source with easy code in Apache SeaTunnel helps us achieve that. Cost is a major issue.

I would rate the stability of the product as an eight.

For Apache Flink, the final point can be rated an eight.

I can recommend Apache Flink to other users for streaming support, and I am recommending it. I would rate this review an eight overall.

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Other

Aswini Atibudhi

Distinguished AI Leader at Walmart Global Tech at Walmart

May 8, 2025

Enables robust real-time data processing but documentation needs refinement

What is our primary use case?

We use Apache Flink for enterprise orchestration, mostly for Walmart, which has many use cases for retail. We use it for our transportation, for the merchant use case related to suppliers, and for customer orchestrations.

What is most valuable?

What I appreciate best about Apache Flink is that it's open source and geared towards a distributed stream processing framework. We use it on the transportation side for real-time or near real-time data with low latency and high output, ensuring eventual consistency. Most of our event-driven applications utilize Apache Flink, alongside Spark and Kafka streaming. From the core components, we also use data streaming APIs, tables or SQL APIs, and state management, along with Flink connectors for integrations with Kafka, JDBC connections, file systems, and more.

We have numerous use cases for complex event processing, particularly concerning suppliers needing real-time data on store performance for predictive pattern detection, which involves complex space management and model creation for dispatching. We are also exploring Apache Flink ML focused on machine learning libraries, though this is currently in the early stages.

What needs improvement?

Apache Flink is very powerful, but it can be challenging for beginners because it requires prior experience with similar tools and technologies, such as Kafka and batch processing. It's essential to have a clear foundation; hence, it can be tough for beginners. However, once they grasp the concepts and have examples or references, it becomes easier. Intermediate users who are integrating with Kafka or other sources may find it smoother. After setting up and understanding the concepts, it becomes quite stable and scalable, allowing for customization of jobs.

Every software, including Apache Flink, has room for improvement as it evolves. One key area for enhancement is user-friendliness and the developer experience; improving documentation and API specifications is essential, as they can currently be verbose and complex. Debugging and local testing pose challenges for newcomers, particularly when learning about concepts such as time semantics and state handling. Although the APIs exist, they aren't intuitive enough. We also need to simplify operational procedures, such as developing tools and tuning Flink clusters, as these processes can be quite complex. Additionally, implementing one-click rollback for failures and improving state management during dynamic scaling while retaining the last states is vital, as the current large states pose scaling challenges.

For how long have I used the solution?

I have been using Apache Flink for the last five years, and we have multiple enterprise products that we use with it.

What do I think about the stability of the solution?

In terms of stability, every system exhibits some challenges, particularly when customized through lift and shift. In general, Apache Flink is very stable, but it can face failures during customization. Stream processing at scale shows stability, yet I sometimes encounter timeouts with long-running jobs, indicating a need for optimizations in failure recovery. Cluster upgrades and compatibility with saved points need improvement. Despite these challenges, Apache Flink remains quite stable; every product evolves and introduces new features, which can lead to optimization needs.

What do I think about the scalability of the solution?

I rate the scalability of Apache Flink an eight because it is quite scalable, but we need improvements in API documentation and state management.

How are customer service and support?

Apache Flink does not offer enterprise software support, so companies rely on community support, such as mailing lists and forums, along with Slack channels. There is also the option to create issues on GitHub or check Stack Overflow for help. While some may have customizations or use cloud services offering specific support, it's essential to understand that community support is strong for Apache Flink, making it a viable option for many users despite lacking dedicated enterprise support.

How would you rate customer service and support?

Positive

What other advice do I have?

The integration process for Apache Flink can be approached in two ways. The first method involves running it as an independent standalone service, where you create your own clusters and systems, and then submit jobs that connect to it. This approach is more reliable for stream analytics, ETL, or continuous data processing, as it handles scaling and fault tolerance. You can connect with REST APIs, SQL gateways, or JDBC, which is a standard and easier approach. The second option is embedding Apache Flink directly into your application, which is rarely used and mainly suited for simpler tasks. Nonetheless, I recommend running Apache Flink as a standalone service for distributed real-time processing.

I recommend Apache Flink to others, particularly for new teams assessing whether it is the right solution. Despite competition from Kafka streams and Apache Beam, Apache Flink stands out for its scalability, stability, and persistence capabilities. Personally, I have promoted its integration within Walmart, advocating for its use across eight to nine teams, as I am a big fan of data streaming and processing tools. Given its proven track record and stability in production, I confidently recommend it to anyone exploring unified batch processing or streaming databases with SQL or the table API.

On a scale of one to ten, I rate Apache Flink an eight overall.

Which deployment model are you using for this solution?

On-premises

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Other

Phani Mallojala

Technical Lead at a computer software company with 10,001+ employees

Apr 25, 2025

Transforms real-time data for insurance projects efficiently

What is our primary use case?

I worked on a project where I provided architectural patterns for a client in the insurance industry, similar to companies like State Farm or Travelers Insurance. They needed a solution for streaming analytics with near real-time data processing, combining streaming data with existing data in S3.

What is most valuable?

What needs improvement?

Apache should provide more examples and sample code related to streaming to help me better adapt and utilize the tool. There is a need for increased awareness and education, especially around best practices, to help people avoid unnecessary costs.

For how long have I used the solution?

I have been using Apache Flink as part of a project-focused exploration.

What do I think about the stability of the solution?

Apache Flink is stable for the tasks I have explored. It performed well within the limits of my use case, although I have not implemented it in a full-scale, end-to-end data engineering project.

What do I think about the scalability of the solution?

Apache Flink is scalable. Solutions can always be scaled beyond initial boundaries.

Which solution did I use previously and why did I switch?

This was my first experience with streaming analytics tools using AWS.

How was the initial setup?

Setting up Apache Flink is not complex. Services like Kinesis and Kinesis Firehose are easy to set up, which subsequently makes integrating Flink straightforward.

Which other solutions did I evaluate?

I did not use or evaluate other streaming analytics tools before using Apache Flink.

What other advice do I have?

I would rate this solution an eight out of ten. The ease of usage and technology capability are impressive. Increased awareness and more comprehensive documentation would benefit users. Using Apache Flink can help with more efficient streaming analytics processes.

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Madhan Potluri

Head of Data at a energy/utilities company with 51-200 employees

Dec 15, 2023

While providing powerful stream and batch processing capabilities, it needs improvement in command automation and stability

What is our primary use case?

We use it for batch processing, specifically for hiding certain data, including location information and other entity-specific attributes.

What is most valuable?

The significant advantage is the learning curve as it is easy. It's not associated with any specific cloud entity. It provides us the flexibility to deploy it on any cluster without being constrained by cloud-based limitations. It enables freedom for making optimizations, allowing for specific customizations as needed.

What needs improvement?

There is room for improvement in the initial setup process. I found myself spending a significant amount of time navigating through documentation to configure it. It would be beneficial to have streamlined commands, where the environment could be quickly initialized, including database setup, providing a more efficient and convenient starting point for projects. This would be particularly advantageous for development and experimentation, allowing more focus on feature testing rather than spending time on the setup process.

For how long have I used the solution?

I have been working with it for one year.

What do I think about the stability of the solution?

In the tech industry, there's always room for enhancements. It's a dynamic environment where products evolve over time. I would rate it eight out of ten.

What do I think about the scalability of the solution?

I would rate its scalability capabilities eight out of ten. Currently, a team of fifteen individuals is involved in everyday processes.

How are customer service and support?

Regarding tech support, the challenge lies in the open-source realm where there's no inherent accountability. Modules are developed, and users often rely on forums for assistance. However, for more critical or specific problems, especially those related to compliance or complaints, a paid service is available with a dedicated support team. I would rate it seven out of ten.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

We initially utilized Databricks for batch processing, which also supports real-time capabilities. However, due to limited computing power usage, we transitioned to a more focused approach. We now employ smaller containers for handling about five to six attributes specific to each entity in real time which allows us to experiment with alerts, actions, and responses for various purposes.

How was the initial setup?

The initial setup was complex.

What about the implementation team?

Deployment takes a total of two days: one day for learning and another day for the actual deployment process. We have a small dedicated team that manages updates and handles the networking aspects of the development website. I would rate it eight out of ten.

What's my experience with pricing, setup cost, and licensing?

It's an open source.

What other advice do I have?

I would give it a rating of seven out of ten due to the need for improvement in command automation and perhaps some stability concerns. However, overall, I would still recommend it.

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Ilya Afanasyev

Senior Software Development Engineer at Yahoo!

Aug 3, 2022

A great solution with an intricate system and allows for batch data processing

What is our primary use case?

We predominantly use this solution on-premises but intend to migrate to some management services. Our primary use case for this solution is maintaining some pipelines which process data. We are currently integrating some pipelines from Pig to Spark and Pig to Flink.

What is most valuable?

We value this solution's intricate system because it comes with a state inside the mechanism and product. The system allows us to process batch data, stream to real-time and build pipelines. Additionally, we do not need to process data from the beginning when we pause, and we can continue from the same point where we stopped. It helps us save time as 95% of our pipelines will now be on Amazon, and we'll save money by saving time.

What needs improvement?

The solution could be more user-friendly. The debugging system could be more suitable in the new release.

For how long have I used the solution?

We have been using this solution for approximately half a year and we are currently using version 1.15.1.

What do I think about the stability of the solution?

The solution is currently stable.

What do I think about the scalability of the solution?

We don't have users operating the system because it is an online system which consumes the data we process. For example, our most extensive pipeline has processed 300 billion events daily, and the configuration works on Spark. The configuration keeps about 25 terabytes of RAM.

Regarding the deployment for users, our team operates internationally with employees in Israel, Dublin and the United States, so about 10% of our employees work with this solution, including DevOps. We intend to increase the usage of this solution, but we are currently completing integrations within our organization's pipeline centres.

How are customer service and support?

We have an Amazon support team working with us, so we reach out to our technical support personnel at Amazon when we have issues.

How was the initial setup?

The initial setup was not complex and we installed it on our own. However, we have engineers available to assist us if we need some configuration or installation. It took a few days to install and configure the solution.

What's my experience with pricing, setup cost, and licensing?

I cannot comment on licensing costs because we only assessed prices at the beginning of this process.

Which other solutions did I evaluate?

We previously used Apache Spark before using Apache Flink.

What other advice do I have?

I rate this solution a nine out of ten. I would recommend Apache Flink to new users. In my opinion, it is possible to move from Spark to Apache Flink. Apache Flink's functionality overlaps Spark's functionality. The solution is good, but the debugging process could be improved.

Armando Becerril

Partner / Head of Data & Analytics at Intelligence Software Consulting

May 31, 2024

Handle smillions of events per second and offers powerful configuration

What is our primary use case?

We use the solution to handle million of events per second in telecom. We use the mobile AT&T. It is very simple.

How has it helped my organization?

We have a low latency. We can't process more data pipelines in real time. We could aggregate and make operations with less resource. It is very flexible for an optimization for reduced cost.

What is most valuable?

Apache Flink offers a range of powerful configurations and experiences for development teams. Its strength lies in its development experience and capabilities.

What needs improvement?

There are more libraries that are missing and also maybe more capabilities for machine learning. It could have a friendly user interface for pipeline configuration, deployment, and monitoring.

For how long have I used the solution?

I have been using Apache Flink for five years.

What do I think about the stability of the solution?

I rate the solution’s stability an eight out of ten.

How are customer service and support?

The solution is open-source.

How was the initial setup?

The initial setup is straightforward and fast. It take couple of hours to complete. My team has recently made some optimizations and changed the way we deploy Apache Flink solutions.

I rate the initial setup an eight out of ten, where one is difficult, and ten is easy.

What's my experience with pricing, setup cost, and licensing?

The solution is expensive. I rate the product’s pricing a nine out of ten, where one is cheap and ten is expensive.

What other advice do I have?

The solution is is difficult to manage and handle.

Apache Flink is one of the main solutions for real-time decision-making and rapid site provision on Azure. Currently, it's predominantly utilized for data engineering projects rather than AI initiatives, although it can indirectly influence analytics, machine learning, and other products in the stack.

The solution is resilient and store more capabilities and features.

Overall, I rate the solution an eight out of ten.

PrashantVaghela

Principal Engineer at InnovAccer Inc.

Nov 20, 2023

Offers stateful transformations and complete offset management between transformations

What is our primary use case?

Apache Flink is not a solution but a framework. Spark is a framework, not a tool.

So, when dealing with real-time data processing and ETL use cases that require on-the-fly transformations, Apache Flink seems like a suitable choice as a framework.

Apache Flink allows you to reduce latency and process data in real-time, making it ideal for such scenarios.

I've worked on three large-scale platform use cases involving Apache Flink. One of those use cases handled a volume of approximately two hundred to 300,000,000 records per day. It translates to approximately 900 to 1000 records per second.

What is most valuable?

If you have data that is streaming from different kinds of sources, Flink has many advantages.

One is that Flink basically gives you stateful transformations. So, if you want to transform your streaming data that is coming in, and you want to apply stateful transformations, one transformation after another, depending on the result of the first transformation, Flink allows you to do that.

There are many other positive points. The reason why the Apache organization actually came up with Flink in the first place is that it offers stateful transformations and complete offset management between transformations.

What needs improvement?

One of the ways to interact with Flink is through a tool called PipeLINK for writing Flink code, and it doesn't require you to use Python directly.

While it does offer a Python-like syntax called PyFlink. PyFlink is a subset of Python that is specifically designed for writing Flink code. It provides a simpler and more accessible way to write Flink code compared to using the Java or Scala APIs.

PyFlink is not as fully featured as Python itself, so there are some limitations to what you can do with it. So, this is an area for improvement.

However, it is a good choice for users who are not familiar with Java or Scala.

For how long have I used the solution?

I've been working with Apache Flink for about a year and a half.

What do I think about the scalability of the solution?

My previous organization, where I was employed as a CTO, had about 120 people on my team, and all of them were engineers.

It's a very engineering-specific tool. Many big organizations like Alibaba and Amazon use it. Actually, Amazon has Apache Flink as one of its offerings in the AWS ecosystem.

How was the initial setup?

Flink is a framework, not a tool. You can deploy it on-premise servers, your own machines, or in Kubernetes on the cloud. It doesn't matter.

What about the implementation team?

My company has a technical team that does deployment, maintenance, and everything for Apache Flink.

What's my experience with pricing, setup cost, and licensing?

Flink is free, it's open source. Flink is open source.

What other advice do I have?

Depending on the use case, we use the appropriate framework. For certain use cases, it's an excellent choice. However, other use cases might require a different framework, such as Lambda or Spark Streaming. So, the choice of framework depends on the specific requirements of the task.

When it comes to real-time ETL and real-time transformation, I would rate Flink very highly, an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Sunil Morya

Consultant at a tech vendor with 10,001+ employees

Nov 18, 2022

Easy to deploy and manage; lacking simple integration with Amazon products

What is our primary use case?

We used this solution for real-time analytics, and for identifying the outliers. We received live industrial data through the Kinesis data streams and Flink processed that data. We had two uses; through Lambda, Flink was putting into the S3 bucket and it was also able to put data into another S3 bucket, which was useful for our business analytics.

How has it helped my organization?

Amazon AWS Kinesis has two flavors; SQL and Flink. With SQL, you can file your statement, apply the schema and get what you need. With Flink, you can run it as a job. If your analytics are on the runtime streams, Flink is beneficial.

What is most valuable?

Apache Flink was easy to deploy and manage.

What needs improvement?

The issue we had with Flink was that when you had to refer the schema into the input data stream, it had to be done directly into code. The XLS format where the schema is stored, had to be stored in Python. If the schema changes, you have to redeploy Flink because the basic tasks and jobs are already running. That's one disadvantage. Another was a restriction with Amazon's CloudFormation templates which don't allow for direct deployment in the private subnet. You have to deploy into the public subnet and then from the Amazon console, specify a different private subnet that requires a lot of settings. In general, the integration with Amazon products was not good and was very time-consuming. I'd like to think that has changed.

For how long have I used the solution?

I've used this solution for six months.

What do I think about the stability of the solution?

Stability was not an issue.

How are customer service and support?

We did not use Apache technical support. We were able to manage with whatever documentation was available from Apache for Flink.

Which solution did I use previously and why did I switch?

I wasn't using anything else for analytics purposes specifically, but I was working with Kafka, which basically works on the same concept as Amazon's Kinesis data streams with brokers. The limitation of Kafka is that it's less flexible than Kinesis. But I have not applied analytics over this on that front. So only data input source, because I am from an IoT background. So integration with the input sources was my core responsibility. I've used Kafka for a postal service company that provided delivery services and required different conditions such as temperature control.

How was the initial setup?

I implemented this solution with two colleagues and the remainder of the team dealt with the CI/CD pipeline, automated scripts and configuring the different deployment environments like development, pre-product and production. For development and integration with the actual input source we used two engineers. If you're experienced, deployment can be completed within two to three weeks. The solution is used extensively. The company we deployed in had multiple factories in different locations.

What was our ROI?

I'm unaware of ROI because once we deploy to the customer, our job is done and we don't have any follow-up with them.

Which other solutions did I evaluate?

The company looked at Talend which is the complete analytics platform. But they have certain limitations and the customer was not using it to its full capacity. We wanted to replace it because of its limitations.

What other advice do I have?

I find this solution very handy. Prior to using Flink I had experience on audio and video data streaming. so I don't know how useful Flink is when you want to do real-time analytics for audio and video data. I think if real-time analytics could be supported by Flink, that would be good.

I rate this solution seven out of 10 based on the fact that I haven't used all of Flink's features.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

-Rahul Agarwal

Sr. Software Engineer at a tech services company with 10,001+ employees

Oct 22, 2020

Scalable framework for stateful streaming aggregations

What is our primary use case?

Initially, we created our own servers and then eBay created their infrastructure. Now it's deployed on the eBay cloud.

Our primary use case is trying to do real time aggregations/near-real time aggregations. Let's say for example that we are trying to do some count, sum,min,max distinct counts for different metrics that we care about, but we do this in real time. So let's say, you have an e-commerce company and you want to measure different metrics. If I take the example of risk, let's say you want to check if one particular seller on your site is doing something fishy or not. What is the behavior? How many listings do they have? In the past five minutes, one hour or one day or one year? You want to measure this over time.

This data is very important to you from the business metric point of view. Often this data data is delayed by 1 day via offline analytics. You do ETL for these aggregations ,it's okay for offline business metrics. But when you want to do risk detection for online businesses, it needs to be right away in real time, and that's where those systems fail and where Apache Flink helps. And if combined with Lambda architecture, you can get them real time with the help of a parallel system that captures very latest data.

How has it helped my organization?

The mighty work on risk, right? So we have to provide risk insights. When we have machine learning models or deep learning models or route based systems and when we want to evaluate some user or somebody's behavior on the site, we want to evaluate it right away. If we don't evaluate it right away, it's of no use to us. Let's say that a fraudulent buyer comes in and is trying to buy an item. In the process of buying, when the person clicks on the button and purchases the item, if you're not able to detect the fraud right then, it's of no use to us.

At the same time, we also have to be able to make sure that we are not pissing off the real people if they are good. So we have to create the very hard balance between whether it's a good person or a bad person without seeing what they're doing. Then you need to do it in real time. That's why with offline systems the aggregation of your data comes after one day, which is a typical case of ETL and is of no use to us. We want to do it right away. But at the same time, we don't want to provide too much friction either. This aggregation includes: how many transactions have you done in the past, in the last year? How many transactions did you do in the last minute? How many coupons did you use in last five years or how many coupons did you use in five minutes? All these kinds of metrics can be useful and can feed into a machine learning or deep learning model or a rule based system to do something with it.

Let's say if we feel that the seller has crossed a limit, or our seller is doing something fishy or buying something fishy, we can throw a capture to the user or we can provide some friction -"can you do any multifactor authentication?" These are just examples. So our use case is the idea that we want to have real time aggregations. One thing to note is that Flink will not help you to do real time navigation very well. It helps you in near-real time navigation. But with the lack that you get from near-real time vs. real time is that you can have it from another system. So let's say that Flink is able to give you all the aggregations accurately either daily or in five minutes. I'm just taking an example for our use here. That's how it works. Five minutes. And it depends on the complexity of the problem that you are trying to solve or how much infrastructure you have. So if it is delayed by five minutes, what you can do is have a panel system that only takes care of the latest data. And then you can take this to the system and combine it with the Lambda architecture. Where you combine a historical system with the real time system and then give the data aggregation. What we do is we apply the same concept in a near-real time integrated fashion and then we are able to give real time analytics on anything that happens on us.

What is most valuable?

You need to understand that a bunch of links work on streaming data. Among the streaming data, out of the nuances of streaming, we have called exactly once are the ones we call semantics. Exactly once being the hardest, which is being very well maintained by flex. What I mean by that is, when an event or when data is coming in, you're only processing that event. Only one stop. This is very important when you're trying to do some aggregates.

Let's say you are a seller on the site and we know that depending on when a seller makes a setting limit, you're allowed to sell only three items a day. If we miscount you in the last five minutes, from three to four, but you did three listings, because of the current listing the data does not reflect that, we do not aggregate now. We allowed you to do this. What is wrong? We can't allow a wrong decision on your side based on the setting limits and the reason for that is because the data is out-dated because the data got delayed. That is a reason the data got delayed. The other reasons could be that the data got lost or they will not aggregate exactly the way it was supposed to be aggregated. So exit aggregations are what makes it much harder to do in real time. That's why you use offline systems like Spark and Hadoop. But you can do real time with Symantec, which is very well supported in Flink. So that's one of the best features.

Another feature is how Flink handles its radiuses. It has something called the checkpointing concept. You're dealing with billions and billions of requests, so your system is going to fail in large storage systems. Flink handles this by using the concept of checkpointing and savepointing, where they write the aggregated state into some separate storage. So in case of failure, you can basically recall from that state and come back.

I'll take an example of Call of Duty. Let's say when you play a game of Call of Duty there are five levels. And in each level, there are different obstacles that you need to clear to advance. There are five levels and in each level, there are different obstacles. So let's say in each level there are 10 obstacles. If you've cleared three obstacles and you die in that process you don't want to start from scratch again in the same level, right? You want to start from the third level. So that is what the concept of checkpointing and savepointing allow you to do. I have done work until this point at the third level, and now I want to restart from the third level only. I don't want to redo that part again. That's what checkpointing and savepointing do. So how does it help in our case? What is the current date? It's Wednesday, October, 15, right? So October 15 until 12:00 PM, we open all the applications. We take that at a regular interval. We take that aggregation snapshot and store it in a different storage. If the system goes down after 12:00PM because the load is high or some other hardware failure, you can recover that data from 12:00 PM and reprocess. You're not recovering the entire thing.

Let's say the seller's count listing was two but you don't want to contrast for that particular seller from zero. You want to count from two after 12:00 PM. Right? That's what Flink helps you do.

Additionally, it helps you scale very well, but there are a lot of nuances. Because Flink allows you to do good aggregations upon segregations, maintaining the system is not that easy because you need a machine that has high RAM requirements. You need to have good memory requirements. It depends on what kind of problems you're solving. The problem that I'm describing is actually the the hardest kind of problem, when you're putting state in Flink's memory, stateful problems. There are other problems called stateless problems. Let's say you are a trucking company and you want to track the new data of your trucks. Let's say you have 15 trucks and you want to look at each of your trucks and each and every truck is spitting out latitude and longitude info when they're moving. In this case, maybe your intention is only to track them real time but you're okay with it being delayed by five, 10 minutes. But in this case, you're not aggregating something. There's nothing stateful about it. You take the data and you dump it to storage. That storage could be anything, plastic surgery or anything. And then you can create a graph and plot a trendline. Where is it going? In a map or something like that. The data comes in and it gets written to a place and then uses a straight graph. So it stays put. In this kind of application, Flink works very well and you can really do a lot of real time analytics on it. But in the case of the problem that I described where you do real time applications, that's a little bit tricky because in this case, you need to recover from the right state so that you don't mess up the relations.

What needs improvement?

In Flink, maintaining the infrastructure is not easy. You have to design the architecture well. If you want to scale for a larger number of streaming data you need good machines. You need good resilience architecture so that if it fails, you can recover from those with minimum downtime. You should have good storage systems to store and retrieve intermediate flink states(in case of stateful applications). Basically all the problems that come with a distribution system. So you have to have all that infrastructure for it to perform well. Best way is to look at the use cases you wish to support in 5-10 years ahead and design the architecture around flink accordingly.

For how long have I used the solution?

I started using Apache Flink in October 2017. My team has been using it since May 2017.

What do I think about the stability of the solution?

In terms of stability with Flink, it is something that you have to deal with every time. Stability is the number one problem that we have seen with Flink, and it really depends on the kind of problem that you're trying to solve. If you're trying to solve the problems that we are trying to solve, which is stateful aggregations, you will find a lot of stability problems. That's why you have to invest money and time into understanding what problem you are trying to solve. How much infrastructure do you need? Stable infrastructure would take time to mature. Once you do that, you also need to spend time understanding and figuring out an optimized way of making it cluster-ready. You don't want to throw money just like that. If you want to throw money, you want to throw money in the right way.

When you create clusters of machines or something like that you're going to need a lot of analysis upfront. Let's say you're selling Flink as a product to different people, how can you do this? One way is you take a bunch of use cases, common use cases, and do experiments with it and form clusters based on that. You can then call them flavors. Let's say, for example, flavor A can do this kind of thing very well, flavor B can deal with these kind of things. And flavor A has its own infrastructure in the sense that flavor A has five job managers, 16 task managers, five zookeepers and all those configurations. Then different clients can use this kind of model. I'm just giving some ideas for how you can make the things work if you are selling this as a product.

What do I think about the scalability of the solution?

In terms of scalability, there's a lot of room for improvement in the case when you're in stateful conditions. There is no system as of now which does scaling very well for stateful aggregations. There are other frameworks like Apache Beam which actually is an app but for other kinds of things. Then there are other things, like Apache Pulsar. But among these, Flink performs best and it's actually very good for streaming architecture.

When we started, we were only three people working on a bunch of things. We were the first people. I was the software engineer. Basically I was the most junior of all. They're mostly principal engineers and I was a software engineer.

When we started we were creating code and generating the chart file. It was worth creating our own clusters. That's why I have that insight. We were doing all the settings on those clusters so that it would work. Then, you wouldn't apply the job on those clusters. We give all these insights to the other team, the platform team, so that they can evangelize the product for the entire company who would want to use Flink. For us it was a side project setting up the clusters and all that, because for us we are solving other business problems, but we had to do it because there was nothing available for us. Then the platform team did all of that. Now we just write code. We understand the business problem. We write code, we generate a plan for them. Everything else is taken care of by them on the machine itself.

It's getting popular really fast now. This idea of when it's easy for people to use it, people will use it if it solves the problem. In my experience, it is going to be big just like Spark. It's just that you need a little bit more infrastructure, because it's trying to solve a complex problem, which is what people need to understand. If it is a complex problem, you have to spend time and energy to make it make it stable. So you need to go to the infrastructure team who can write and who can create a stable Flink protection, then it will help you solve a lot of the problems.

How are customer service and technical support?

I have not had any experience with customer support. We did everything ourselves, but we read a lot of Apache's documentation on that.

How was the initial setup?

The initial setup is not straightforward. It would take time. You have to know a lot of things. But one thing is that when we started, Flink was very new. The product is maturing and people are using it more. They will understand what people need and all that stuff. Maybe it would not be as difficult as it was, but it does require you to understand a lot of things.

So how do you set up a cluster? Let's say you want to do aggregation on 15 million emits for one particular Flink job. In Flink, when you deploy an application, it's called a flincher. When you do that, how do you design the cluster? What boxes do you have? How much RAM do you need? Once you have one particular box, how do you design the topology of the cluster? Let's say the way it works is it has something called job manager, which are the coordinating notes. Then it has the task editor which are the machines which basically do the real work, the aggregation. Then there are the zookeepers. The zookeepers are something which helps you maintain the health of the cluster. You have to make a balance.

How many zookeepers do you want? How many job managers do you want? How many task editors do you want? How much RAM do you want to have in each cluster? How much network do you want to open? How much traffic can the Flink cluster take in the stable manner so that it doesn't go down frequently? You have to do a lot of experiments with all of this setup, depending on what problem you're solving, it depends upon the load that you're getting from your business.

What about the implementation team?

Deployment takes around two minutes if everything is good. It's very fast.

How do we do that? Just like any application, we write code, we build a bit of the manifest on the job, the code that is able to be deployed. You take that bell JAR. It's called a JAR. Generally we have used the JAR version because Flink is a JVM theme, so we are using the JVM version, which supports Java and SQL. We write code for whatever you want to do with it. We build the code and it converts into a JAR file. We take the JAR file and upload it into a Flink server and then you just click a button and it's deployed.

You don't have to do anything. If you are starting up, the moment you install Flink in your local machine, when you're trying it out and you start the server, you will see your Flink server UI coming up. There's an option so that you can deploy a job. What you have to do is build your code, generate a JAR file and then simply go to the UI, upload the JAR and start the server. That's it.

Which other solutions did I evaluate?

Before Apache Flink my team tried Apache Storm and it did not work for them. I think Apache Storm is not being used by anyone else in the company.

What other advice do I have?

This is general advice if you're trying to do anything: Any problem that you're trying to evaluate, you have to really understand the problem that you're trying to solve, what is the nature of the problem? And by nature of the problem, the business side is one thing, but you have to understand how you're solving things. For example, do you want something to be fast enough, scalable and for any new product? Every time they advertise it is fast, scalable, highly distributed, etc... But in what context? What kind of use cases is this product built for? You have to understand the principle and only then you choose a product. If you want Apache Flink, it's about if you want something for near-real time metrics that may be useful for your business.

In that case, Apache Flink is your friend, because it's built on streaming architecture. If the nature of your application or your business is streaming, the data is coming at a very high rate and you want to do something with it, then Apache Flink is a good option. Another example I can give you: let's say you run a company, you are the CEO of Twitter, right? So in Twitter, a lot of people are writing a lot of stuff. A lot of streaming data is coming in. Because a lot of people are tweeting at the same time all around the world there's a lot of streaming of data coming in.

Let's say you're a celebrity and 5,000 people follow you. When you write a tweet, all 5,000 people have to see that tweet as quickly as possible. So when your tweet comes in, a very complex system from Twitter's backend has to take that tweet, has to know which of those people and display it on their feed timeline. Now this might sound easy when you only have five people, but if you have 315 million people tweeting, it's a very complex system and you have to make it available, etc... So when you're dealing with streaming data Apache Flink is a good option.

On a scale of one to ten, I would rate Apache Flink around seven to eight. It's pretty good if you're solving a streaming type of problem. My experience is limited. I only worked with Apache Storm a little bit and Apache Flink. Among all of this, if I would talk about streaming, Apache Flink wins hands down, but there are other products like Apache Pulsar which I have no idea. So my perspective is very limited.

Which deployment model are you using for this solution?

Public Cloud

Agustin Calderon

CTO at ReNew

Feb 5, 2024

Helps us to create both simple and complex data processing tasks

What is our primary use case?

We utilize IoT devices to gather data for our clients. This data is analyzed to produce reports and insights, and we leverage machine learning and artificial intelligence models.

What is most valuable?

The product helps us to create both simple and complex data processing tasks. Over time, it has facilitated integration and navigation across multiple data sources tailored to each client's needs. We use Apache Flink to control our clients' installations.

What needs improvement?

Apache Flink should improve its data capability and data migration.

For how long have I used the solution?

I have been using the product for five years.

What do I think about the stability of the solution?

The solution is stable.

What do I think about the scalability of the solution?

Apache Flink is scalable.

How was the initial setup?

The product's deployment can be completed in minutes, and we have a special team. It is straightforward. We initiate our requirements within our secure software and utilize Jenkins and our pipelines to carry out the deployment process, whether for expanding services on the cloud or on-premise servers.

What about the implementation team?

Apache Flink can be deployed in-house.

What other advice do I have?

I rate the product an eight out of ten.

Which deployment model are you using for this solution?

On-premises

Title	Rating	Mindshare	Recommending
Databricks	4.1	7.9%	96%	94 interviews Add to research
Qlik Talend Cloud	4.0	3.1%	89%	56 interviews Add to research

Apache Flink Reviews

What is Apache Flink?

Featured Apache Flink reviews

Apache Flink mindshare

PeerResearch reports based on Apache Flink reviews

Valuable Features

Room for Improvement

Pricing

Popular Use Cases

Service and Support

Deployment

Scalability

Stability

Review data by company size

Top industries

Compare Apache Flink with alternative products

Learn more about Apache Flink

Apache Flink customers

Related questions

Product Categories

Popular Comparisons

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

How was the initial setup?

Which other solutions did I evaluate?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the stability of the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How would you rate customer service and support?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the stability of the solution?

What do I think about the scalability of the solution?

Which solution did I use previously and why did I switch?

How was the initial setup?

Which other solutions did I evaluate?

What other advice do I have?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the stability of the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

How was the initial setup?

What about the implementation team?

What's my experience with pricing, setup cost, and licensing?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the stability of the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How was the initial setup?

What's my experience with pricing, setup cost, and licensing?