Machine Learning Group at the University of Waikato

Projects related to Weka

There is a number of software projects that make use of Weka, allow data in ARFF format to processed, or enable access to Weka functionality from other programming environments. In particular, there is an interface to Weka for the statistical programming language R and for Mathematica.

  • Spectral clustering by Luigi Dragone.
  • Kea - automatic keyphrase extraction.
  • Word sense disambiguation by Ted Pedersen.
  • WekaMetal - a meta-learning extension to Weka.
  • maxdView visualisation tool for microarray data.
  • LocBoost classification demo applet.
  • Tertius: a system for rule discovery.
  • Weka-Parallel - parallel processing for Weka.
  • KDDML-MQL - support for KDD process.
  • GATE - NLP workbench with Weka interface.
  • TClass - classifying multivariate time series.
  • Learning Vector Quantization - and more with Weka.
  • Bayesian Network Classifiers - with bindings for Weka.
  • RSW - sequential classification with Weka.
  • Cahit Arf - a data extraction utility for Weka.
  • Judge - software for document classification and clustering.
  • Milk - a workbench for multi-instance learning.
  • Modified version of Weka, including time series mining and visualization tools.
  • Grid Weka - grid computing with Weka.
  • Balie* - BAseLine Information Extraction.
  • FAEHIM - Data Mining Web services.
  • BioWeka - knowledge discovery and analysis for biologists.
  • FastKMeans - a faster version of k-means clustering (.zip file).
  • Semi-Supervised and Collective Classification using Weka.
  • Mathematica interface for Weka.
  • weka4WS - distributed data mining.
  • RWeka - an R interface to Weka.
  • Rarff - A Ruby library for manipulating ARFF files.
  • PROMPT - Statistical comparison and mapping of protein sets. Import/Export of WEKA arff data files.
  • Agent Academy - Java integrated development framework for creating Intelligent Agents and Multi Agent Systems
  • GeneticProgramming - Genetic Programming Classifier for Weka
  • Weka-GDPM - extended version of Weka 3.4 to support automatic geographic data preprocessing for spatial data mining.
  • Fuzzy if-then rules - Classification using fuzzy if-then rules.
  • Mulan - Multi-label classification built on top of Weka.
  • csv2arff - An online CSV to ARFF converter.
  • Debellor - Data mining platform for data streaming.
  • MOA (Massive Online Analysis) - A framework for data streams.
  • Epitopes Toolkit (EpiT) - A platform for developing epitope prediction tools.
  • OpenSubspace - An open source framework for evaluation and exploration of subspace clustering algorithms in WEKA.
  • Olex-GA - A genetic algorithm for the induction of rule-based text classifiers.
  • Graph RAT - A framework for combining graph and non-graph algorithms.
  • TunedIT - Automated tests of machine-learning algorithms. Repository of datasets, algorithms and benchmarks.
  • TUBE - Tree-based Density Estimation Algorithms.
  • ScalaLab - Provides a Scala based Matlab-like interface to Weka's algorithms.
  • GroovyLab - Provides a Groovy based Matlab-like interface to Weka's algorithms.
  • Contrast Mining in Weka.
  • Python library for processing ARFF files.
  • x2arff - A simple VB application to convert data stored in excel files into Attribute-Relation File Format.
  • Weka for Computational Genetics - Multifactor Dimensionality Reduction (MDR) added to the Weka package.
  • ADAMS - Advanced Data mining And Machine learning System offers workflow engine that includes Weka.
  • Cost-sensitive classifiers - Adaboost extensions for cost-sensitive classification.

See also the article on WekaWiki.