Advanced Data Mining with Weka

Course provided by FutureLearn

Summary overview

  • Online anytime

  • 20 hours study time

  • Cross-sector

  • Free

  • Level NA

About this course

This course follows on from Data Mining with Weka and More Data Mining with Weka, and brings you to the wizard level of skill in data mining by showing how to use popular packages that extend Weka’s functionality. You’ll learn about forecasting time series and mining data streams. You’ll connect Weka to the popular R statistical package and learn how to use R’s extensive visualisation and preprocessing functions from Weka. You’ll script Weka in Python, all from within the friendly Weka interface. And you’ll learn how to distribute data mining jobs over several computers using Apache Spark. Registration is free; a fee applies for the certificate.

Learning outcomes

  1. Calculate optimal parameter values for non-linear support vector machines.
  2. Demonstrate the use of R classifiers in Weka.
  3. Develop R commands and R scripts from Weka.
  4. Explain how distributed Weka runs Weka on a cluster of machines.
  5. Experiment with distributed implementations of Weka classifiers and clusterers.
  6. Explain how 'map' and 'reduce' tasks are used to distribute Weka.
  7. Design Python and Groovy scripts for Weka operations.
  8. Apply Python libraries to produce sophisticated visualisations of Weka output.
  9. Describe how Weka can be invoked from within a Python environment.