My main use case for Data Hub is to implement a data catalog for one of the clients that the consultancy I work at is serving.
A specific example of how the data catalog was used for that client is that it was used to define business terms and to explore the terms from the data glossary by adding definitions. It was also used to capture all the tables and fields that were connected to a data lake, allowing me to explore the entire production data lake and tag the tables and fields, segmenting these tables by domains such as sales tables and marketing tables.
Data Hub offers several best features including the tagging capability, domain segmentation, data exploration, and creation of a data glossary, which was very interesting to me. Additionally, the ease of plugging in new data sources is exceptional. Data Hub can be easily integrated with a data lake, and the environment can be explored through the metadata via Data Hub. I found the connection part straightforward.
Data Hub had a positive impact on my organization by disclosing to the organization and to business users what existed in the data lake. The interface that the technical team has with the tables and fields is designed for professionals in the technical area. Having a data catalog helps provide a better interface for data discovery and data democratization within the organization since everyone should have access to what types of data the organization has, and that was the biggest impact.
I started using the quality part for consistency, but I had limited contact with it and we did not progress much.
I believe the data quality module can always be improved by examining what is available in the market and making appropriate improvements to the tool. The data quality part is very important and it is not always fully leveraged as it should be. I also think that providing consulting or support with professionals who are qualified to use Data Hub would be interesting, along with providing training and certifications for the tool so that those who are implementing it can specialize increasingly in its features.
I have been using Data Hub for around one year.
Data Hub is stable, and I did not have any stability problems when I was working with the tool.
Data Hub's scalability is very easy, as we were able to add users and new datasets very quickly and smoothly.
I was not previously using a different solution. The implementation was already directly part of a data governance initiative and it was done directly with Data Hub, meaning there was no previous solution.
I believe the consultancy has some kind of commercial relationship with Data Hub to promote and offer Data Hub as a data catalog solution.
Before choosing Data Hub, the consultancy worked with some tools such as Google's DataPlex and Purview.
My advice for others thinking about using Data Hub is to have the governance initiative well-structured and to have all the documentation for data owners and data stewardship so you know who will be the points of contact when the tool starts being configured, ensuring that you have people responsible for doing reviews and approvals in the tool. I would rate this product an eight out of ten.