What is our primary use case?
I am implementing the ETL workflow using
Apache NiFi to prepare data and upload it to the cloud. Our use case involves importing data from on-premise and private servers to build a data hub and data mart. The data mart is then published on the cloud.
How has it helped my organization?
We primarily use
Apache NiFi for data preparation tasks.
What is most valuable?
The visual workflow aspect of Apache NiFi is an invaluable feature as it operates on a no-code platform that allows for easy drag-and-drop pipeline construction. Compared to
Airflow, which requires programming before visual representation, Apache NiFi offers clarity in pipeline activities. This feature greatly aids in understanding what the pipeline is doing.
What needs improvement?
The logging system of Apache NiFi needs improvement. It is difficult to debug compared to
Airflow, where task details and issues are clear. With Apache NiFi, I have encountered processes that die without any traceable error, which might relate to the inadequate logging system.
For how long have I used the solution?
I have been working with Apache NiFi for about six months.
What do I think about the stability of the solution?
Sometimes, when I run Apache NiFi, processes crash without any clue, which might relate to the logging system. The process can die, and the logs do not show any detail to identify the problem, impacting stability.
What do I think about the scalability of the solution?
For scalability, I would rate it an eight. We can run parallel pipelines simultaneously without issues unless memory is full. Scarcity of memory is the only constraint, but processing capabilities allow us to handle much simultaneously.
How are customer service and support?
The technical support from the official Apache team is rated a three out of ten. Issues often require self-resolution or community help, as the support isn't effectively managed.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
I have used Airflow before, which required programming first and then visual representation of the workflow.
What about the implementation team?
There is another team responsible for setting up Apache NiFi, so I'm not involved in the deployment process.
What's my experience with pricing, setup cost, and licensing?
Apache NiFi is open-source and free. Its integration with systems like Cloudera can be expensive, but Apache NiFi itself presents the best pricing as a standalone tool.
Which other solutions did I evaluate?
Prior to Apache NiFi, I used Airflow, which differed mainly in its approach to programming and workflow visualization.
What other advice do I have?
Overall, I rate Apache NiFi an eight out of ten. I am quite happy with it.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer: partner