What is our primary use case?
We use FlexPod for onsite virtualized and leveraged hardware, so we work with many VMs and generic workloads. There is no specific single-purpose workload. It would be like the VMware solution.
We have three primary production environments and a legacy environment. Right now, the entire workload of the organization is roughly split into thirds across those various generations: Gen one, Gen two, Gen three, and legacy. Legacy can't move off of old hardware, or it's in the process of being moved to modern platforms.
Gen one was FlexPod, Gen two was a different Dell solution, and Gen three was yet another Dell solution because somebody up the chain loves Dell these days. They're building a fourth generation of primarily containerized stuff based on Red Hat Kubernetes for new workloads or Red Hat OpenShift. The compute workloads in FlexPod are slowly being moved out. There's no plan to replace that or to continue using that as a FlexPod. However, the storage driving the current compute and network shares will be transitioned into the strategic platform for file-sharing platforms across the enterprise.
It highlights the flexibility that comes with the product set. They're still using VMware everywhere. They're not going to be using the compute environment. They're still using Cisco network switches everywhere and will continue to use NetApp storage everywhere. Interestingly, this new containerized OpenShift environment they have in place today gets its storage back from the NetApp through NetApp automation and integration tools in their cloud set. It highlights the flexibility of what you can do with the entire platform when you need to do something.
In terms of administration, we have five or six people on the storage side that are primarily doing storage. There's a rather large team in charge of server administration, but they're doing server administration across the entire enterprise as a team. There are around 25 or 30 Windows and Linux system administrators. The house's Windows side also manages the necessary VMware work, but they're not dedicated to FlexPod. They're across the entire enterprise, but any of them could come in and do most of the work.
In addition, there are about 10 people on the engineering side. They're the ones who are primarily responsible for designing and driving operational standards. The same people are in charge of infrastructure across the entire enterprise and aren't limited to FlexPod. There's also a good smattering of network folk.
How has it helped my organization?
The pre-validated architectures are attractive. My organization is in the Fintech industry, where standards are critical, so validations and compliance are essential. There are vendor-designed and vendor-validated design starting points. You can build up for scale or if you need more compute from those starting points. If you need more storage, you put out more storage. For performance and things like that, you can put the components you need to meet your specific needs as long as you stay within those validated designs.
Even if you start with one and diverge from it, it's still a FlexPod, and you get support from the three primary organizations: Cisco, VMware, and NetApp. Even if you don't use a Cisco-validated or NetApp-validated design, you can still qualify as a FlexPod if you're out of those standard deployments. Executives like having those models as a starting point.
FlexPod has saved us some time, but I must add a caveat. The challenge was getting the operations side of the house aligned with what FlexPod was. Even today, the operational side remains heavily siloed. There are teams for networking, storage, Windows, etc. That separation of concerns continues to be the standard model for operational support today. It took a lot to break down that model a small amount because everybody in FlexPod has to know and work together because of the integration.
You can't just do the network piece because there's so much that the network touches. You can't just do the compute piece because there's so much the compute touches, especially with VMware. You can't just do the storage piece. You have to understand how that relates to virtualization and the compute. It took them a while to get some people in place that could cross these silos to make troubleshooting more effective. Has it become more effective? Yes. But it took a long time to get there for external reasons.
It hasn't impacted our TCO because everybody who came in still runs everything else they used to run. When they decided to do the next major expansion, and somebody wanted to go with Dell, and they went with Dell hardware, all the same people had to learn that as well. So we don't have a TCO advantage on FlexPod versus anything else.
The pre-validated architectures also haven't affected our productivity if you look at application performance. The application performance is what it is. It also hasn't increased our productivity regarding our ability to operate the system because many outside factors affect our operations, and nothing about the FlexPod design helps us overcome them.
What is most valuable?
The most significant benefit is storage flexibility. FlexPod is a reference design to build on and add the appropriate resources you need for your workloads. That's what differentiates it from something like HCI, where you have to buy certain size pieces when you want to expand.
I can do just about anything storage-related with it. In a FlexPod environment, we can provide whatever you want from a data storage perspective. Replication, backup, disaster recovery, etc., are all there right out of the box. I can set up fiber channel LAN, channel-over-ethernet LAN, etc. Storage is highly flexible in this environment. I don't have to buy two or three products and don't have to do some sort of software virtualization of the storage, which takes away your performance.
For hyperscalers, you have insights on the Cisco side that you can use to look at tons of stuff. From the storage side, you could put a range of NetApp tools in the cloud. You could put VMware in the cloud and talk back to the native on-prem Cisco compute environment.
Their network environment extends into the cloud. There are no limits in terms of integration. NetApp is the most integrated hyper-scale for storage and moving data into the cloud for long-term backup storage. VMware is fully available, so you could run VMware or Kubernetes on VMware in the cloud and tie it to your local storage through NetApp integration.
It's an excellent match because your compute could easily DR to the cloud and be ready to go with all your storage without any modifications necessary because of the native integrations. It integrates well with the hyperscalers.
What needs improvement?
Support could be more integrated. For example, you might call NetApp, and they'll determine that VMware is the issue. It would be helpful if they could automatically engage VMware and bring them onto the same call to transfer ticket information and work together.
Ideally, you can call any vendor potentially causing the problem—Cisco, VMware, or NetApp. They should automatically bring in the other teams as necessary if it isn't. That doesn't happen as smoothly as it should.
For how long have I used the solution?
I have been using FlexPod for six years. I'm in my seventh year now.
What do I think about the stability of the solution?
FlexPod is highly stable at this point. However, there were some growing pains. For instance, when I came on board, they had a bad experience with an ONTAP upgrade that created some unexpected downtime. They have had some challenges with how they wanted to use the entire system.
For instance, the primary use case is virtualized workload. The secondary use case used a fiber channel over ethernet with physical servers. That presented some challenges during certain operations. You'd expect it to be automatic because it wasn't set up correctly. You don't discover the issue until you do routine patching or maintenance work and suddenly lose all your connections to a server. We had some of those growing pains early on, but it has become highly stable after six years.
What do I think about the scalability of the solution?
We've scaled FlexPod as much as we needed in any direction. We haven't even scratched the surface of what it could do. There are no practical limits on scalability as a design standard.
How are customer service and support?
I rate FlexPod support seven out of 10. It depends on which person you get when you call. Unfortunately, I think that's true of most vendors' technical support. If you don't get the right person at the right level to start with, you're in for a potentially bad experience.
How was the initial setup?
I joined the company after the initial setup. However, it seems complex from what I've seen. The team who deployed it had a reseller set it up for them, but they changed it afterward. They had a different idea of how they wanted it to operate. They tried to make it work in their model instead of adapting to what FlexPod was designed to be. I think they complicated things.
They only use one cloud provider, and FlexPod is minimally connected. Their newer environments are more connected, but backup to the cloud is the only use case they're using for primary functionality, and it's somewhat limited. There was no challenge in doing it, but the overall use case for that is still somewhat limited.
What other advice do I have?
I rate FlexPod nine out of 10. FlexPod allows you to build what you need when you need it. "Flex" is in the name. If you need more compute, you add it and don't need to worry about it. If you don't want to use it in the FlexPod at some point, the compute can be repurposed for something else. The same is true for the network switches. When you need more network capacity, you replace the switch and interfaces or maybe increase the speeds on your network interfaces. Replace the network components with what you want, the capacity you need, etc. The same goes for the storage on the NetApp side.
My advice is to start with what you think you need. As your needs change, put new hardware and performance capabilities in place. You add it as required in the same environment space, so you don't have to change your product line. Your disaster recovery or secondary sites all stay the same. The Cisco and VMware stuff inter-operates. VMware is all over the place, and NetApp ONTAP is ONTAP. It's all the same platform, no matter where you put it in the cloud.
In Nutanix's environment, they've got two or three. You can have external storage because maybe their internal storage isn't good enough. That external storage platform is something outside of the converged environment. You've got to manage it separately. You've got two hypervisor capabilities here. You could put VMware on it, or you could put Nutanix on it. They're not interchangeable. Once you pick one, you are stuck with that one forever. And if it's not good enough, you have to tear down the whole thing to start over. You're not tied into anything.
Disclosure: My company has a business relationship with this vendor other than being a customer: