BigData Search engine and framework links

Tue 28 July 2015
  • Apache Lucene: Search engine library
  • Apache Solr: Search platform for Apache Lucene
  • ElasticSearch: Search and analytics engine based on Apache Lucene
  • Elasticsearch Hadoop: Elasticsearch real-time search and analytics natively integrated with Hadoop. Supports Map/Reduce, Cascading, Apache Hive and Apache Pig.
  • Enigma.io: Freemium robust web application for exploring, filtering, analyzing, searching and exporting massive datasets scraped from across the Web
  • Facebook Unicorn: social graph search platform
  • Google Caffeine: continuous indexing system
  • Google Percolator: continuous indexing system ">TeraGoogle : large search index
  • Haeinsa: linearly scalable multi-row, multi-table transaction library for HBase based on Percolator
  • HBase Coprocessor: implementation of Percolator, part of HBase
  • hIndex: Secondary Index for HBase
  • Lily HBase Indexer: quickly and easily search for any content stored in HBase
  • LinkedIn Bobo: is a Faceted Search implementation written purely in Java, an extension to Apache Lucene
  • LinkedIn Cleo: is a flexible software library for enabling rapid development of partial, out-of-order and real-time typeahead search
  • LinkedIn Galene: search architecture at LinkedIn
  • LinkedIn Zoie: is a realtime search/indexing system written in Java
  • Sphnix Search Server: fulltext search engine