- Adobe Spindle: Next-generation web analytics processing with Scala, Spark, and Parquet
- Apache Kiji: framework to collect and analyze data in real-time, based on HBase
- Apache Nutch: open source web crawler
- Apache OODT: capturing, processing and sharing of data for NASA's scientific archives
- Apache Tika: content analysis toolkit
- Domino: Run, scale, share, and deploy models without any infrastructure.
- Eclipse BIRT: Eclipse-based reporting system
- Eventhub: open source event analytics platform
- HIPI Library: API for performing image processing tasks on Hadoop's MapReduce
- Hunk: Splunk analytics for Hadoop
- MADlib: library for analyzing data inside an RDBMS
- PivotalR: R on Pivotal HD / HAWQ and PostgreSQL
- Qubole: auto-scaling Hadoop cluster, built-in data connectors
- Sense: Cloud Platform for Data Science and Big Data Analytics
- Snowplow: enterprise-strength web and event analytics, powered by Hadoop, Kinesis, Redshift and Postgres
- SparkR: R frontend for Spark
- Splunk: analyzer for machine-generated data
- Talend: unified open source environment for YARN, Hadoop, HBase, Hive, HCatalog, Pig
BigData Applications links
Tue 28 July 2015