Most data warehouse upstart vendors -- including Aster, Greenplum, Kognitio, ParAccel and Vertica -- have designed their database software for massively parallel processing on industry-standard hardware from the likes of Dell, EMC, HP and IBM. Adding a high-end performance option to that mix, ParAccel, a column-store database provider, today announced support for solid state drives (SSDs) from Fusion-io (a move also announced today by Aster Data). SSDs replace spinning, mechanical disks with flash memory chips for faster performance. ParAccel, another vendor that made the Gartner MQ report for the first time this year, is promising 15 times faster query performance with optional flash SSDs installed by the hardware supplier of the customer's choice.
"Flash per megabyte is quite expensive, but in terms of dollars-per-megabyte-per-second it's very close to conventional disk prices now," says Barry Zane, chief technology officer at ParAccel. "In other words, performance-per-dollar is attractive and in keeping with our column-store database performance."
Column-store databases are typically faster than conventional, row-oriented databases (such as Oracle, IBM DB2 and Microsoft SQL Server) in analytic applications because they can query specific data attributes in columns -- such as zip codes, product stock numbers or transaction totals -- while skipping all the other data, row-by-row, that might not be relevant to a query. Some relational vendors, including Oracle, have added columnar data compression, but Zane says they have not matched all the performance advantages of column-oriented databases.
"We don't need the indexes, projections and materialized views that relational databases need to perform at scale, so we don't suffer from bloat and end up penalizing performance," he explains.
The data warehousing industry leaders have already embraced flash technology. Oracle put a huge emphasis on flash memory in the Sun-Oracle V2 upgrade of Exadata announced last October. Teradata's SSD-based Extreme Performance Server 4555 was also announced last October, and it's slated for general availability in the first half of this year. IBM has demonstrated an SSD-based test appliance, but it has yet to announce or ship a production SSD-enabled data warehouse appliance.
Vertica stands out among alternative vendors in that it has more than 120 customers and is the largest column-store database vendor after Sybase, with its Sybase IQ product. Tomorrow, Vertica will announce upgrades including better workload and resource management features aimed at handling mixed query workloads and many more users.
"The biggest obstacle to tackling enterprise data warehouse workloads is having big batch jobs competing with smaller queries that could otherwise be answered instantaneously," says David Menninger, Vertica's vice president of marketing and product management. "We've created resource pools that administrators can dedicate to different users and types of activities, so you can guarantee that resources are available when crucial queries come in."
Rounding out this week's announcements, Kognitio today announced that GroupM, a division of advertising giant WPP, has asked it to build an analytical environment to match and monitor advertising placements on global scale. GroupM operates in more than 80 countries, and data from these local operations will ultimately be fed into a centralized Kognitio database that will reportedly analyze almost one-third of the world's advertising spend across multiple media outlets.
Up-and-coming new-media companies, telecos and financial firms are typical of the firms selecting alternative database providers. What they all have in common is huge data processing demands. It's clear why young firms would be willing to try upstart vendors: they often have little capital to spare, and they are less likely to be deeply invested in technologies from incumbents such as Teradata, Oracle and IBM.
But even well-established firms such as Verizon and JP Morgan (Vertica), Intuit and ComScore (Aster Data), Officemax (ParAccel) and GE Capital (Kognitio) are turning to the new wave of data warehouse vendors to meet their ever-growing data analysis needs. Whether the selections were made based on price, performance or personal attention from the highest levels at these upstart vendors, it's clear that competition is alive and well in the data warehousing market.