Tags : Browse Projects

ELKI

  Analyzed 1 day ago

ELKI: "Environment for Developing KDD-Applications Supported by Index-Structures" is a development framework for data mining algorithms written in Java. It includes a large variety of popular data mining algorithms, distance functions and index structures. Its focus is particularly on clustering and outlier detection methods, in contrast to many other data mining toolkits that focus on classification. Additionally, it includes support for index structures such as R*-Tree and M-Tree to improve algorithm performance. The modular architecture is meant to allow adding custom components such as distance functions or algorithms, while being able to reuse the other parts for evaluation.

214K lines of code

2 current contributors

about 2 months since last commit

1 user on Open Hub

Moderate Activity
5.0
 

Harry Tool

  Analyzed about 21 hours ago

Harry is a small tool for comparing strings. The tool supports several common distance and kernel functions for strings as well as some exotic similarity measures. The focus of Harry lies on implicit similarity measures, that is, comparison functions that do not give rise to an explicit vector space. Examples of such similarity measures are the Levenshtein distance, the Jaro-Winkler distance or the spectrum kernel. Harry is implemented using OpenMP, such that the computation time for a set of strings scales linearly with the number of available CPU cores. Moreover, efficient implementations of several similarity measures, effective caching of similarity values and low-overhead locking further speed up the computation.

8.05K lines of code

1 current contributor

about 5 years since last commit

1 user on Open Hub

Inactive
0.0
 
Licenses: No declared licenses

extractor-bundle

  No analysis available

0 lines of code

0 current contributors

0 since last commit

1 user on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: No declared licenses

keboola-juicer

  Analyzed about 22 hours ago

Framework for development of data extractors. This is a continuation of https://www.openhub.net/p/keboola-extractor-bundle

5.44K lines of code

3 current contributors

9 months since last commit

1 user on Open Hub

Very Low Activity
0.0
 
Licenses: No declared licenses

Lumeer

  Analyzed 2 days ago

Lumeer changes the way we work with our business data by leveraging state of the art technologies.

309K lines of code

4 current contributors

16 days since last commit

1 user on Open Hub

Low Activity
5.0
 

jscourses

  Analyzed 1 day ago

Courses by John Samuel

8.48K lines of code

1 current contributor

8 months since last commit

1 user on Open Hub

Very Low Activity
0.0
 
Licenses: No declared licenses

lambdo

  Analyzed about 21 hours ago

A column-oriented approach to feature engineering. Feature engineering and machine learning: together at last!

3.05K lines of code

2 current contributors

over 3 years since last commit

1 user on Open Hub

Inactive
5.0
 

Datumbox Machine Learning Framework

  Analyzed 1 day ago

The Datumbox Machine Learning Framework is an open-source framework written in Java that allows the rapid development of machine learning and statistical applications. The main focus of the framework is to include a large number of machine learning algorithms and statistical tests and to handle medium to large datasets.

22.8K lines of code

0 current contributors

almost 4 years since last commit

0 users on Open Hub

Inactive
5.0
 

food-ingredient-parser-ruby

  Analyzed 3 days ago

Extract the structure of ingredient lists on food products

1.47K lines of code

1 current contributor

4 months since last commit

0 users on Open Hub

Very Low Activity
0.0
 

digger

  Analyzed 4 days ago

Digging into some data mines (last.fm, identica, twitter, github, distributed version control systems) and generating nice graphs

1.26K lines of code

0 current contributors

over 13 years since last commit

0 users on Open Hub

Inactive
0.0
 