Upcoming Events

A Network Computing Webcast:
SSDs and New Storage Options in the Data Center

March 13, 2013
11:00 AM PT / 2:00 PM ET

Solid state is showing up at every level of the storage stack -- as a memory cache, an auxiliary storage tier for hot data that's automatically shuttled between flash and mechanical disk, even as dedicated primary storage, so-called Tier 0. But if funds are limited, where should you use solid state to get the best bang for the buck? In this Network Computing webcast, we'll discuss various deployment options.

Register Now!


Interop Las Vegas 2013
May 6-10, 2013
Mandalay Bay Conference Center
Las Vegas

Attend Interop Las Vegas 2013 and get access to 125+ workshops and conference classes, 350+ exhibiting companies and the latest tech.

Register Now!

More Events »

Subscribe to Newsletter

  • Keep up with all of the latest news and analysis on the fast-moving IT industry with Network Computing newsletters.
Sign Up

EMC Greenplum Offers Free Open Source Tool For Building Database Apps

The Greenplum division of storage vendor EMC is offering a Free Community License version of its EMC Greenplum Database software, which allows software developers to build new applications to deal with the explosion of so-called "big data" that businesses and other enterprises have to try to manage. The community license is based on code from Greenplum's massive parallel processing (MPP) database product, and includes the open-source MADlib library of analytic algorithms and Alpine Miner, a data mining modeling tool.

As companies build databases of ever-expanding amounts of data, they need more tools to analyze it and make business decisions based on those findings. Eventually, the databases hit a limit on how much they can scale, says Luke Lonergan, chief technology officer and VP of EMC Data Computing Products Division and co-founder of Greenplum, which EMC acquired in July 2010.

Lonergan gave an example of a company that introduces a new product that quickly becomes popular and all of a sudden they've got 1 million visitors to their site within a month or two. "What does an operation do when they get hit by the scale truck?" Lonergan asks.

Big data applications require "scale-out" technology, he says, which keeps up with demand as enterprises add more servers and storage hardware, and need database analytics software that keeps up with the data. The community license is to be used only for research; a commercial license is required to deploy an application in production or for commercial purposes. Greenplum's commercial- and community-licensed database software is based on the open-source PostgreSQL database software project, to which Greenplum has been a contributor.

The MADlib library offers tools that provide mathematical, statistical and machine learning methods for structured and unstructured data. MAD stands for "magnetic, agile and deep." Alpine Miner is a visual data mining tool from a company that Greenplum incubated within its own company, Lonergan says. Its chief advantage is that it can run right in the database engine as opposed to a situation where a small amount of data is copied from the database and tested in a separate workstation, saving several steps in the modeling process.


Page:  1 | 2  | Next Page »


Related Reading


More Insights


Network Computing encourages readers to engage in spirited, healthy debate, including taking us to task. However, Network Computing moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. Network Computing further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | Please read our commenting policy.
 
IaaS Providers
Cloud Computing Comparison
With 17 top vendors and features matrixes covering more than 60 decision points, this is your one-stop shop for an IaaS shortlist.
IaaS Providers

Research and Reports

The Virtual Network
February 2013

Network Computing: February 2013

Upcoming Events



TechWeb Careers