DATA MINING

...is nothing else than torturing the data until it confesses…and if you torture it enough, you can get it to confess to anything (Fred Menger)

Data Mining Techniques
   

Classifying data mining techiques has been allways a sensitive subject. There are dozens of classifications of data mining with classes, sub-classes and sub-sub-classes. And sometimes a particular class may have dozens of names. One can argue that particular methods are not data mining techiques but pure statistical concepts. So I'll never brag about being able to draw the complete picture. But I hope one can have at least an idea about the data mining methods by looking at the classification below. I should also add that it refers only to the "predictive" part of the data mining. The other part, the "descriptive" one, it relates more to statistics rather than data mining (correlations, ANOVA, Ztest, histograms, etc). This is not to say that I don't consider regression a statistical concept. I do, but, again, the border between statistics and data mining is sometimes blurred. The science of statistics has itself a well rounded set of predictive methods that were borrowed by data mining and never returned...

Data Mining Classification

  1. Association Analysis
    1. With Candidate Generation
      Algorithms: APRIORI
    2. Without Candidate Generation
      Algorithms: RELIM, FP-GROWTH
  2. Classification
    1. Decision Trees
      Algorithms: CHART, CAID, C4.5
    2. Artificial Neural Networks
      Algorithms: SLP, COHONEN, MNP
    3. Bayesian Classification
    4. K-Nearest Neighbors
      Algorithms: PEBLS
    5. Support Vector Machine
    6. Genetic Algorithm
  3. Clustering
    1. Hierarchical
      Agglomerative Algorithms: HACM, SLINK, COBWEB, BIRCH, CURE, ROCK, CHAMELEON
      Divisive Algorithms:
    2. Partitional
      Algorithms: K-MEANS, CLARA, CLARANS, PAM
    3. Density Based
      Algorithms: DBSCAN, OPTICS
  4. Regression
    1. Linear Regression
    2. Non-Linear Regression
    3. Logical Regression


Data Mining Articles
If Only I Knew - by Tim Graettinger

Missing Inaction - by Tim Graettinger

More Articles...


Data Mining Resources

Data Mining 101
       by Radu Lovin
Part 1 - Introduction to Data Mining

Part 2 - Data Mining Classification

Part 3 - Mining Frequent Patterns (Maximal and Closed Frequent Itemsets)

Part 4 - Association Rules

Part 5 - Algorithms for Mining Frequent Itemsets

More...


Home   Contributors   Resources

© 2007-2011, dataminingarticles.com