What is our primary use case?
It's for our daily data processing, and there's a batch job that executes it. The process involves more than ten servers or systems. Some of them use a mobile network, some are ONTAP networks, and they're on some kind of system. Everything comes through the Microsoft service. It's not a tentative process, but when the accounting system requires it, a batch job processes the data and secures it in Datalab.
In GCP, there's an engine called Dataflow. You need to put some scripts in it. This process happens automatically every day around midnight. It gets executed, processes the data, and returns it to the database.
What is most valuable?
The dashboards are good. When you want to show data in a dashboard, like in pipelines, it can be done by the end-user as well. You can only put limited data (JSON) in Datalab.
For example, if you want to show captions in a bar search, you need to provide the information to the data engine. You need to specify that it is a relation in GCP. It will be automatically displayed in the dashboard. That is the advantage of using GCP.
I believe GCP is more costly than other cloud platforms because of this feature for end-users. The data visualization is more meaningful for the end-user compared to Azure Cloud. I have worked with Azure Cloud as well, and Google Cloud provides more meaningful information in its dashboards.
There are a lot of AI feature as well. One AI feature I found is auto-completion. GCP also uses AI for recording purposes and data management in Data Proc, maintaining logs. You just need to mention where to store the logs, either in the cloud or in a data directory. It's very easy.
However, there are limitations with GCP's AI. You need to configure it based on the limits of the nodes. If it goes beyond the limit, it should go to the next node, but that doesn't always happen properly. It could be due to the wrong configuration, but it's not immediately moved to a separate node. You need to restart the server and reconnect it to another node.
What needs improvement?
Access is always via URL, and unless your network is fast, it would be a little tough in India. In India, if we had a faster network, it would be easier. In a big data environment, like when forcing your database with over a billion records, it can be tough for the end-user to manage the data. You need to have a single entity system in each environment. It's not because of GCP, but it would be great to have options like MongoDB or other similar tools in GCP. Then, we wouldn't always need to connect to the cloud and execute SQL queries.
Even if your application is always connected to its database, the processing can be cumbersome. It shouldn't be so complicated. Once the data is collected, it should be easily sorted.
For how long have I used the solution?
I have been using it since 2017.
What do I think about the stability of the solution?
For me, it has been a stable product. We can manage it as long as it's there. Maybe some requests may not be provided in the database or reporting for Cloud, but overall, I can manage it.
Maybe if the data becomes larger, there could be changes or it might not be able to store it anymore. But as of now, there are no complaints, so I am happy.
What do I think about the scalability of the solution?
Last time, I scaled up to 94 nodes. I'm not sure how much it can be scaled for the hardware. Normally, on my system, it's connected to Linux, and I can configure up to 77 nodes. It will work properly until then, but if you go beyond 99 nodes, you will need a separate server. You need to configure the next server and install the unit. There are some steps you need to follow. Then, you need to set up your microservices so they can be executed when you call one. That also needs to be configured.
If your system is using the same microservices, you need to pass the same information you have. Generally, they can do cross-solution directly. Once they do the cross-origin request (CORS), they will connect. So, microservices and the database are separate, and they can connect to the database via microservices only. You need to pass it through cross-routing.
How are customer service and support?
We make a call and send them the ticket number, and they use the address there. We get the response immediately. We just make and raise the ticket, then make a call, and they will prioritize it based on urgency, find it separately, and then we will get the response immediately.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
The solutions are all the same, with the same features and purpose. We can achieve the same results using other tools like SQL Server or microservices. I am interested in using GCP because it's available everywhere, and I can access it wherever I go. I choose it for the performance demands of my work.
How was the initial setup?
The initial setup is not a big thing. The documentation is there. I've watched videos and referred to the documentation. You can do it yourself. If you have any issues, you can optimize it in Google Cloud. It's a bit different from other platforms, but if you follow the documentation, you can do it. Everything is documented by Google Cloud, so you can set it up yourself.
What about the implementation team?
What other advice do I have?
Overall, I would rate it a nine out of ten.
Google Cloud is very good. Once you go through the features of Google Cloud, it's a good idea to get a GCP certification so you have an idea of how it can be configured. Most things will be covered in the certification. Once you have around 80-84% knowledge of GCP, you can start using it.
Other services, like S3 in AWS or Azure Cloud, provide the same. But I am confident in GCP. Azure Cloud is also good for me, but I haven't done any bucket tracking directly on it. Both Azure Cloud and GCP are good for me.