EMC and NetApp offer Hadoop solutions
Source: EMC The judges at the fourth MediaGuardian Innovation Award aren't the only ones that consider Apache Hadoop to be the "Swiss Army Knife of the 21st century"; a growing number of companies are also integrating Hadoop into their solutions in order to process giant amounts of distributed data.
At its EMC World conference, which took place earlier this week in Las Vegas, US storage company EMC announced the availability of its own Hadoop platform which it calls "EMC Greenplum HD" as part of its Greenplum product range. EMC acquired Greenplum and its big scale database expertise in mid-2010.
The company says that the Enterprise Edition of EMC Greenplum HD is "a 100 per cent interface-compatible implementation" of the Apache Hadoop stack which delivers two to five times the performance of standard packaged versions of Apache Hadoop.
The Enterprise Edition is a proprietary product but the company also offers a "Community Edition" of Greenplum HD which is an open source packaged version of Apache Hadoop with Greenplum adding fault tolerance to the Hadoop Name Node and Job Tracker. Greenplum also launched what it calls a "purpose-built, high-performance data co-processing Hadoop appliance" which runs both Hadoop and the EMC Greenplum Database.
NetApp has also announced a Hadoop storage solution. The company says that it is planning an appliance based on its E-Series Platform. With it, NetApp customers will reportedly be able to analyse large amounts of data in real-time. The company calls the devices "hadooplers" in a blog posting where it also explains that it is committed to using the open source Apache Hadoop implementation rather than creating it's own proprietary version.