LucidWorks Big Data promises text analysis on a large scale
LucidWorks, formerly Lucid Imagination – the commercial developer behind Apache's open source Lucene and Solr search engine technology – has announced that LucidWorks Big Data has emerged from beta status. The product is based on Lucene and Solr 4.0, and is designed for processing large amounts of textual data. Along with Lucene and Solr, the stack in LucidWorks Big Data is made up of Apache Hadoop, the Apache Mahout machine learning library, Hive and Apache OpenNLP.
The combination of open source technologies provides an application development platform that is said to be usable out of the box and can be deployed "within a few hours" either on premises or on LucidWork's cloud service. The system can work on top of Hadoop, MapR, HortonWorks or Cloudera and provides a RESTful API that supports JSON formats for importing and exporting of data.
At the time of writing, the product is not yet ready for download, but interested users can ask to be notified when LucidWorks makes it available.
- Solr/Lucene certification now available from Lucid, a report from The H.