Try our new research platform with insights from 80,000+ expert users
it_user17130 - PeerSpot reviewer
Manager Enterprise Data & Analytics Infrastructure/DBA at a insurance company with 10,001+ employees
Real User
It works well for many “campaign” queries that run for our clinical analytical area, but while some queries do run very fast, you cannot load up the box with several queries.

What is most valuable?

It can run heavy CPU queries really fast, and this is valuable.

How has it helped my organization?

We have a lot of “campaign” queries that run for our clinical analytical area and this product seems to work well for them.

What needs improvement?

While some queries do run very fast, you cannot load up the box with several queries.

For how long have I used the solution?

We've used TwinFin for over four years and Striper for around one-and-a-half.

Buyer's Guide
IBM Netezza Performance Server
June 2025
Learn what your peers think about IBM Netezza Performance Server. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
860,632 professionals have used our research since 2012.

What was my experience with deployment of the solution?

It does not scale like other databases and expect them to run at the same time.

What do I think about the scalability of the solution?

Netezza does not scale like other databases, so it cannot run several queries at once.

How are customer service and support?

It's good.

How was the initial setup?

It was pretty straightforward, but we had to get several areas involved in initial setup.

What about the implementation team?

We worked with the vendor to implement it.

What's my experience with pricing, setup cost, and licensing?

It is a pretty expensive solution, but it is worth it for the right environment.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
PeerSpot user
Technology Consultant (Netezza/Big Data) at a tech vendor with 1,001-5,000 employees
Vendor
The cost-based analysis of query by optimization, it makes query execution faster and provides enough scope to DBA to improve queries.

What is most valuable?

With a field-programmable gate array, it has the capabilities to do arithmetic calculations at memory level. Also, the cost-based analysis of query by optimization, it makes query execution faster and provides enough scope to DBA to improve queries.

What needs improvement?

In my opinion the product is the best for the purpose it has been built.

For how long have I used the solution?

I've used it for six years.

What was my experience with deployment of the solution?

Not much, IBM guys are quite professional and ready to help with every bit and piece.

What do I think about the stability of the solution?

Not much, IBM guys are quite professional and ready to help with every bit and piece.

What do I think about the scalability of the solution?

Not much, IBM guys are quite professional and ready to help with every bit and piece.

How are customer service and technical support?

Customer Service:

The level of customer service is good.

Technical Support:

The level of technical support is good.

Which solution did I use previously and why did I switch?

I've never used any other solutions.

How was the initial setup?

Initial setup is pretty straightforward, as IBM provides ramp-up training through in-house training and assigns a technical account manager to make faster resolution of queries.

What about the implementation team?

Both ways, as the box got setup by us and IBM provided a huge set of documents, which made life easier.

Which other solutions did I evaluate?

Before buying the physical product from IBM, there is an option to use the solution in the cloud to perform a POC and get your stats ready with actual data for better analysis of your investment.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
IBM Netezza Performance Server
June 2025
Learn what your peers think about IBM Netezza Performance Server. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
860,632 professionals have used our research since 2012.
it_user357468 - PeerSpot reviewer
Regional Support Specialist at a financial services firm with 501-1,000 employees
Vendor
It has the capacity to handle terabytes of data with query response times that are very low. Setup is complex.

What is most valuable?

Personally, I like the Compression and Partitioning features.

How has it helped my organization?

It has the capacity to handle terabytes of data with query response times that are very low. This has helped us a lot.

What needs improvement?

It’s a good product for analytical processing, but they need to conduct more seminars and hands-on training and events. They need to be showcasing the product and features to create awareness among businesses.

For how long have I used the solution?

I have been working on it for the last two-and-a-half years and now we are in process of a tech refresh. We’re moving from the old Netezza to the newer IBM Pure Data Analytics, using a quarter of the rack for Pure Data.

What was my experience with deployment of the solution?

No issues encountered with deployment.

What do I think about the stability of the solution?

No issues encountered with stability.

What do I think about the scalability of the solution?

No issues encountered with scalability.

How are customer service and technical support?

They're very good, helpful, and knowledgeable.

Which solution did I use previously and why did I switch?

It’s a complex setup.

How was the initial setup?

It was handled by the vendor as it's proprietary.

What other advice do I have?

It's fit for the purpose it's designed for. It's an analytical/hierarchical database, now in great demand, that can store plenty of data and return the results in no time for complex queries.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
solution306072 - PeerSpot reviewer
solution306072Business Unit technical Lead at a tech services company with 1,001-5,000 employees
Real User

If you are talking about when IBM comes in initially and does the setup I can see your point it is complex. If you are talking about the upgrade of the os and the dbos (database operating system) that can be fairly complex especially when you have HA (high availability) option on your server. There is a lot of things the IBM tech has to go through usually both os upgrade and then the dbos. Typically IBM will do a fail over doing the upgrade. I have watched them (through webex) during these upgrades and there are a lot of things they have to go through.

Database and table setup Netezza is about as easy as they come. There are classes that IBM offers around basic database and server care. Also a class on interpreting explain plans. There are other vendors that offer classes as well on these topics. There is plenty of material on the internet that offers some advice and help on some of these topics. There is a forum available at IBM that is available just search it up on google. The forum will occasionally offer "Enzee" (free) around Netezza and there are tons of blogs available around care and maintenance.

The biggest event that IBM Insight is available and there are many presentations and certification can be done during the conference.

Good luck

it_user351462 - PeerSpot reviewer
Data Architect/Modeler with 501-1,000 employees
Vendor
It provides outstanding performance for structured data, but bitemporal support is needed.​

What is most valuable?

  1. Performance
  2. Not complex to administer

How has it helped my organization?

It provides a key repository for risk information - suitable due to volume of data.

What needs improvement?

Bitemporal support is needed.

For how long have I used the solution?

I've used it for three years.

What do I think about the stability of the solution?

We found some bugs - boundary conditions mostly.

What do I think about the scalability of the solution?

There are no issues with scalability.

How are customer service and technical support?

9/10 - I have always be able to reach someone at IBM with the correct answer.

Which solution did I use previously and why did I switch?

Non analytic databases used to be all there was. Netezza provides outstanding performance for structured data.

How was the initial setup?

I didn't set up but our database administrators have said it is much simpler than Oracle, or DB2.In house - have a person trained on PureData - it will pay off

What about the implementation team?

In house - have a person trained on Pure Data - it will pay off

What was our ROI?

I can't quantify ROI.

What's my experience with pricing, setup cost, and licensing?

Licensing is straightforward - you end up buying the box with Netezza on it as it's an appliance.

What other advice do I have?

Every query has to be set based - no iteration over a result set. stored procedures should be used sparingly.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user347586 - PeerSpot reviewer
Solution Architect at a comms service provider with 11-50 employees
Real User
Billions of data are processed within an optimal amount of time, and it supports almost real-time analytics.

What is most valuable?

  • Scalability
  • User friendly
  • Always innovative

How has it helped my organization?

Billions of data are processed within an optimal amount of time, and it supports almost real-time analytics. This helps the executives to define their strategy for further expansion and identify potential opportunities with informed decisions.

What needs improvement?

It has already integrated with Big Data Hadoop; however, integration with XML is still not there for nzload utility.If nzlad supports XML files along with flat flies ,it will give them an edge over other MPP Architecture.

For how long have I used the solution?

I've used it for eight years.

What was my experience with deployment of the solution?

No issues encountered.

What do I think about the stability of the solution?

No issues encountered.

What do I think about the scalability of the solution?

No issues encountered.

How are customer service and technical support?

It's the best.

Which solution did I use previously and why did I switch?

We used other MPP architecture solutions such as Microsoft PDW and Teradata. However, Netezza has an edge as it is convenient to use and it doesn't need a database administrator to perform a successful implementation of a Netezza migration from any RDBMS.

How was the initial setup?

The initial setup is straightforward as it is an appliance. IBM's technical support team takes care of the entire setup process, and you don't need an expert in Netezza to perform the implementation.

What about the implementation team?

We did it in-collaboration with IBM, and in my opinion, this is the best way to embrace new technology. Once you gradually build up your capability in new technology, you should then take control of it with your in-house talented experts.

What was our ROI?

We had a significant ROI as business increased multi-fold with such a scalable solution.

What's my experience with pricing, setup cost, and licensing?

Compared to other MPP's, Netezza's pricing is quite reasonable considering long term business expansion with low maintenance cost.

What other advice do I have?

Any potential customer who has an inclination towards large scale analytics, should consider Netezza as an option. This not only gives a faster response, but you can also save on resource cost compared to other MPP's. Netezza's maintenance cost is quite low and this will give you an edge for long term revenue growth.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
solution306072 - PeerSpot reviewer
Business Unit technical Lead at a tech services company with 1,001-5,000 employees
Real User
SQL is 100% ANSI compliant, but support for interfaces has been poor.

What is most valuable?

  • The ability to load data very quickly
  • SQL is 100% ANSI compliant
  • Queries run very fast and require no hints (Oracle)
  • No space management
  • Tuning SQL is generally pretty easy

How has it helped my organization?

For every organization that I have been in that has Netezza, the ability to load and run queries is greatly simplified and A LOT faster than any other DBMS out there.

What needs improvement?

Netezza is just starting on real time integration with Big Insights (big data). Support for interfaces has been poor. Hoping this improves.

For how long have I used the solution?

I've used it for nine years.

What was my experience with deployment of the solution?

No issues encountered.

What do I think about the stability of the solution?

Netezza servers used to have disk issues in the early days, however, for the most part this has been addressed.

What do I think about the scalability of the solution?

No issues encountered.

How are customer service and technical support?

Customer Service:

9/10.

Technical Support:

9/10.

Which solution did I use previously and why did I switch?

I have used Oracle and Teradata as a DBA. I have found Netezza to be the easiest in terms of DBA management

How was the initial setup?

Very straightforward. Data movement is very easy. It works better if you do 1 large file as opposed to several small files

What about the implementation team?

Netezza servers are installed by IBM but implementations are sometimes handled in house and for others it maybe a mixture with BI vendors. We are a BI vendor and do implementations.

Which other solutions did I evaluate?

Most shops I am in have used Oracle, Sql Server, or Teradata. Depending on what the data is being used for (OLTP, OLAP). OLAP is the best application for a Netezza server.

Disclosure: My company has a business relationship with this vendor other than being a customer. We are partners with IBM.
PeerSpot user
PeerSpot user
Senior Hadoop Engineer with 1,001-5,000 employees
Vendor
The cross-database writing function allows users to run commands in the block as single execution.

Valuable Features:

Cross-database writing and more UDFs.

Netezza NPS 7.1 has added a feature of SET CATALOG

This also allows users to run GROOM, GENERATE STATISTICS etc. in the block as a single execution (auto commit).

Improvements to My Organization:

Simplified development process with less administration

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
PeerSpot user
Senior Data Architect at a pharma/biotech company with 1,001-5,000 employees
Real User
Netezza vs. Teradata

Original published at https://www.linkedin.com/pulse/should-i-choose-net...

Two leading Massively Parallel Processing (MPP) architectures for Data Warehousing (DW) are IBM PureData System for Analytics (formerly Netezza) and Teradata. I thought talking about the similarities and differences would be useful to decision makers who may need to choose or recommend one technology over the other.

A few years ago, I evaluated the viability of Netezza and Teradata (shortly before the Aster Data acquisition), to meet my company’s DW needs. Both Netezza and Teradata follow the relational database paradigm and use table joins. My recommendation was Netezza, based on the particular DW use case and Netezza’s MPP architecture, in-database analytics, low DBA maintenance and price.

Bottom line, Netezza costs less to own and operate, and is easier to manage. If your DW environment doesn’t need to support many thousands of users or ever increasing volumes of data, then Netezza is an excellent choice. Teradata scales better but is also more expensive and requires more skilled DBA labor.

I recently spoke with a VP who migrated off Teradata to Netezza because Teradata was too expensive and required a lot more DBA effort than his company’s needs required. In this post, I’ll contrast the two technologies at a Database Architect or Designer level of understanding. I’ll first point out some similarities, then highlight a few important differences.

Architectural Similarities

The idea behind parallel processing is that “many hands make light work”. In other words large tasks become small when divided among several people. In this context “people” are processing units of memory, CPU and disk. MPP is when you get 64, 128, 256, or more synchronized processing units executing a database query simultaneously.

The primary design philosophy of both Netezza and Teradata is simple. Each processor has total control and responsibility for a disk. The rows of every table are spread or distributed as evenly as possible across all the disks in the system and then data can be retrieved in parallel.

A processing unit in Teradata is called an Access Module Processor (AMP). In Netezza it’s called a Snippet Processing Unit (SPU). In Netezza, a database query is first compiled into C and divided into units of work called snippets.

Each processing unit (Teradata AMP or Netezza SPU) has its own memory, CPU and disk. This is also called a “shared nothing” architecture. Data retrieval and manipulation operations proceed in parallel and are N times faster than they otherwise would be (where N is the degree of parallelism).

All the processing units (worker bees) are guided by the query optimizer and query coordinator. In Teradata this is the parsing engine.

Architectural Differences

Concurrency

Concurrency is a by-product of performance. Concurrency is the number of simultaneous database queries running at any one given time in the database. In this context, the word “query” includes searching, adding, updating or deleting data. Netezza will never physically run more than 48 queries at a time. Netezza can support up to 2,000 active read-only queries at one time, but at most 48 will be running, and the rest will be queued. The active query is interrupted so other queries can use the CPU. This context switch prevents any one query from monopolizing the CPU, and ensures all queries get a fair share of CPU time. The limit makes sure Netezza is not wasting time switching between sessions.

While there can be up to 2,000 concurrent read-only queries, Netezza has a limit of 64 active add, update or delete queries (anything that might change data). This usually isn’t an issue in a typical analytics environment where the work of getting data in and out of Netezza is done as quickly as possible and the writers are typically ETL processes.

In contrast, Teradata can support millions of concurrent queries allowing greater flexibility. Teradata’s benchmarks show their system in a better light relative to concurrency but they are not real world workloads.

Enforcing Referential Integrity (RI)

Netezza does not enforce RI, but depending on your DW use case, that may be perfectly fine. Netezza defines primary and foreign key constraints as metadata, but doesn’t enforce them. In general even DWs like Oracle Exadata and Teradata that do enforce RI will disable the constraints when loading data. Otherwise the load process would be radically slower since the referenced keys would need to be validated one row at a time.

For a DW, RI is often performed in the Data Integration Framework and incoming data is cross-checked with the available keys. Once the ETL has a handle on the data quality and is preventing data errors from entering the DW, the constraints can be disabled forever because the ETL becomes the de facto gatekeeper of data quality.

With IBM PureData System for Analytics, data load and bulk-data comparisons are incredibly fast. And because the primary and foreign key pairs are known metadata, some repeatable, metadata-driven patterns can be built that allow the referential checks to be parameterized in the ETL.

Primary and foreign key constraints are usually enforced with indexes, which Teradata provides to improve query speed and performance. Teradata distributes the data based on Primary Index (PI). Choosing a PI is based on data distribution and join frequency of the column. Secondary indexes provide another path to access data. Both primary and secondary indexes can be unique or non-unique.

In contrast, Netezza’s approach to indexes is simple. Netezza doesn’t offer them. Instead, Netezza uses a distribution key, performs massively parallel table scans, and relies on zone maps for performance.

Minimizing I/O

Both Netezza and Teradata compress data and use cost-based optimizers to calculate the most efficient query plan. A query plan is like the “GPS” in your car. In a DW, it’s used to find and retrieve data.

Netezza minimizes I/O by applying restriction and projection conditions to data in a Field Programmable Gate Array (FPGA) as the data comes off the disk. The FPGA architectural firmware is the “secret rocket” that gives Netezza such incredible speed. Each Netezza processing unit includes memory, CPU, a disk drive and an FPGA.

Applying restriction and projection conditions to data in Netezza as it comes off the disks and before it reaches the rest of the I/O sub-system makes the most sense where only a relatively small fraction of the data in a particular table is required to support the rest of the query. It is still the case that all of the data has to first get off the drive before the unwanted data can be discarded in the FPGA. Although incredibly fast, Netezza’s use of parallelized full table scans can limit query concurrency.

Instead of an FPGA paired with each disk drive, Teradata minimizes I/O through more traditional methods such as range-based partitioning, which supports partition elimination and the use of advanced indexing which reduces the amount of data that would otherwise have to be scanned. Partitioning and indexing strategies require more DBA involvement.

Netezza has a single active host node that can become a performance bottleneck. All sessions and data must flow through this single node for final sorting and merging of results. Netezza’s use of co-located joins helps minimize sorting. Teradata automatically spreads sessions across multiple servers, providing scalable bandwidth for data flow.

Mixed Workloads

Mixed workloads are different types of queries running in the Data Warehouse and are directly related to concurrency. A DW needs to optimize a mix of ad-hoc and tactical queries, reports, data mining, data loads and visualization queries.

Using a traffic analogy, think about driving through a big city at 6 AM. The roads are quiet and uncrowded and traffic is light. But by 9 AM, the crowds, taxis, trucks and ambulances turn the streets into an all-day traffic jam. Your vehicle only goes as fast as the car in front of you. Not until 8 PM does the congestion dissipate.

Now imagine no stop signs or traffic lights. Then, eliminate the "drive on the right side of the street" rule. There would be constant chaos and traffic would gridlock.

This is similar to the daily workloads that pass through a DW. Reports (cars), tactical queries (motorcycles and bikes), executive queries (ambulances), data mining (buses) and data loading (trucks) can simultaneously clog the system by taking up space (CPU) and producing congestion (blocking others). Like a city, query elapsed time is faster and the DW performs better if the traffic flow is organized.

Fair-share, priority and pre-emptive scheduling (slow lanes, fast lanes, cutting in front) ensure that no active queries starve for lack of CPU time. Resource governors (stop signs and traffic lights) throttle out-of-control queries like billion row table-joins. Organizing and prioritizing workloads ensures the executive query (ambulance) or tactical query (motorcycle or bike) are consistently fast and can zip though, regardless of concurrent traffic.

Teradata has what is generally acknowledged to be the best mixed-workload management capability in the industry.

Distribution, Skew and Co-Located Joins

Let’s use an example of distributing 128 million rows across 128 SPUs. Once loaded, using “random” distribution, each SPU will control 1 million rows. So the table exists logically once, and physically 128 times. The SQL query will physically run on all 128 SPUs simultaneously. Each SPU will work on its portion of the data and be merged into a result set. So, the total duration of the query is the speed a SPU can scan 1 million rows. Using “random” distribution, all the SPUs will move at this speed, in parallel, and finish at the same time.

Let’s say the 128 million rows are in an Order table, and 256 million rows are in Order_Detail. Both tables are joined on Order_ID. Rather than using “random” distribution, it appears Order_ID is what we want for a distribution, but this may skew the data. When a distribution is assigned, Netezza will hash the distribution key into one of 128 hash values (the number of SPUs). Every time a particular key appears, it will always be assigned the same hash value and land on the same SPU. So now we can distribute both tables on Order_ID and be absolutely certain that for any given SPU, all of the same ID's for both the Order and Order_Detail table are physically co-located on the same disk.

If we choose Order_ID and it turns out that a large number of rows hash to one or more overloaded SPUs, then the data distribution is “skewed” and detrimental to performance. Skew makes queries run slow, because the other SPUs will finish faster and wait on the overworked SPUs with the extra data.

Each Netezza table has only one distribution key. If a table is distributed on another key, the data would have to physically leave the SPU as it finds a new SPU home to align with its distribution key. Because redistributing data is the single biggest performance hit, the columns selected as distribution keys cannot be updated. You would need to delete the row, and insert a new one.

The distribution key can consist of 1-4 table columns. If the chosen key provides good physical distribution, then there is no reason to use more columns. More granularity in a good distribution has no value. All columns in the distribution key must be used in a join in order to achieve co-location. If we use a compound (1-4 column) distribution key, we are committed to using all columns in all joins, and this is rarely the case. You would usually use additional columns only if a single column produces high physical skew. The distribution key is a hash value for SPU assignment, not an index. If all of the columns in the distribution key are not mentioned in the join, Netezza will not attempt co-location. So even if a particular distribution key does not "directly" participate in the functional answer, it must directly participate in the join to achieve co-location.

Collocated joins provide optimal performance because data movement is minimized. In a non-collocated join, the data first needs to be sorted in memory. In a collocated join, the two data sets are already in sorted order on disk and each SPU can operate independently of the others without network traffic or communication between the other SPUs.

Scaling Up

IBM now offers a “Growth on Demand” model where IBM will bring in more capacity than initially required and start out licensing half of that environment. Customers can then “turn on” capacity when needed by licensing more of the environment. This is controlled by IBM’s Workload Management capabilities. An example would be to bring in a two rack system and license it as if it were a single rack. Customers can then add in 1/8 increments from there until the two rack system is fully utilized. If you grow past the two racks, then you’d consider upgrading to a 4, 6, 8 or 10 rack configuration.

Teradata, by contrast, allows systems to expand incrementally by adding server nodes as necessary to meet growth needs.

Conclusion

In conclusion, before choosing or recommending one DW technology over another, evaluate your particular DW needs and execute a well-constructed benchmark for the platform you are considering with your workload and data.

These views are my own and may not necessarily reflect those of my current or previous employers.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user669453 - PeerSpot reviewer
it_user669453Data Warehouse Architect at a consultancy
Real User

excellent article

See all 3 comments
Buyer's Guide
Download our free IBM Netezza Performance Server Report and get advice and tips from experienced pros sharing their opinions.
Updated: June 2025
Product Categories
Data Warehouse
Buyer's Guide
Download our free IBM Netezza Performance Server Report and get advice and tips from experienced pros sharing their opinions.