LexisNexis finally releases source code to HPCC
HPCC Systems, part of LexisNexis Risk Solutions, has finally made the source code to its HPCC (High Performance Computing Cluster) platform available, after announcing in June that it would be open-sourced. The C++ source code, hosted on GitHub, is licensed under the AGPLv3 rather than the GPLv3 as originally announced.
HPCC has two major components: Thor, which extracts data from unstructured or semi-structured "big data" sources and analyses it, and Roxie, which stores the results of Thor's processing in a form amenable to quick querying by client systems. The two are tied together by ECL (Enterprise Control Language), a data-centric language for controlling the cluster.
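The split between a batch stage and a query-serving stage can be illustrated with a toy sketch. The Python below is only an analogy under stated assumptions: it is not ECL, it does not use any HPCC API, and the sample records and the function names `batch_refine`, `build_query_index` and `query` are hypothetical, invented for this illustration.

```python
from collections import defaultdict

# Hypothetical semi-structured input (illustrative data, not from HPCC).
RAW_RECORDS = [
    "2011-09-01 alice login",
    "2011-09-01 bob login",
    "2011-09-02 alice purchase",
    "malformed line",
]


def batch_refine(lines):
    """Thor-like batch stage (analogy only): parse raw lines and
    count events per user, skipping lines that do not parse."""
    counts = defaultdict(lambda: defaultdict(int))
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # not a "date user event" record
        _date, user, event = parts
        counts[user][event] += 1
    return counts


def build_query_index(counts):
    """Roxie-like delivery stage (analogy only): freeze the batch
    results into plain dicts keyed by user for direct lookup."""
    return {user: dict(events) for user, events in counts.items()}


def query(index, user):
    """Client-side lookup against the prebuilt index."""
    return index.get(user, {})


INDEX = build_query_index(batch_refine(RAW_RECORDS))
```

Here the expensive parsing and aggregation runs once, up front, and clients only ever touch the prebuilt index, which is the division of labour the article describes between Thor and Roxie.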
Also available from HPCC Systems are pre-compiled beta versions of a "community edition" of HPCC for CentOS, Red Hat, Ubuntu and SUSE Linux. Contributors to the project will need to sign a contributor agreement; although it requests copyright assignment, that can be withheld and HPCC Systems will instead accept an irrevocable licence to use contributions.