Course Description: Data mining is used to discover patterns and relationships in data. Emphasis is on large complex data sets such as those in very large databases or through web mining. Topics: decision trees, neural networks, association rules, clustering, case based methods, and data visualization. The following chapters from the textbook will be covered in this order:

  • Chapter 1 – Introduction
  • Chapter 2 – Data
  • Chapter 3 – Exploring Data
  • Chapter 6 – Association Analysis: Basic Concepts and Algorithms
  • Chapter 4 – Classification: Basic Concepts, Decision Trees, and Model Evaluation
  • Chapter 5 – Classification: Alternative Techniques (naive bayes models, support vector machines)
  • Chapter 8 – Cluster Analysis: Basic Concepts and Algorithms
  • Chapter 10 – Anomaly Detection

Course Schedule and Lecture Notes in PDF format from Stanford for Statistical Aspects of Data Mining.