Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Apache Spark

Claimed by Apache Software Foundation Analyzed 15 minutes ago

Apache Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and fast to write. To run programs faster, Spark provides primitives for in-memory cluster computing: your job can load data into memory and query it repeatedly more rapidly than with ... [More]

1.31M lines of code

374 current contributors

1 day since last commit

56 users on Open Hub

Very High Activity

0 Reviews

I Use This

Mostly written in Scala

Licenses: apache_2

Apache Fluo

Analyzed 1 day ago

Apache Fluo (incubating) is an open source implementation of Percolator (which populates Google's search index) for Apache Accumulo. Fluo makes it possible to update the results of a large-scale computation, index, or analytic as new data is discovered.

31.9K lines of code

5 current contributors

about 2 months since last commit

0 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags accumulo analytics apache-software-foundation bigdata bigtable cluster clustercomputing fault_tolerant graph_computing hadoop highthroughput incremental 6 more...