M/W 12pm--1:20pm,
Room 3006 (Lift 17/18)
Tutorials: Thur. 6pm-6:50pm
Week of ...
Topics
Slides
Assignments and Projects
Readings
(BI: Business Intelligence by C. Vercellis; ML: Machine Learning by E. Alpaydin; TL: Tang and Liu book; TSK: Tan, Steinbach, Kumar Book.Semester Starts: Feb 1
Introduction
* First tutorial on Thursday at 6pm.
- KDDCUPs
- KDDCUP 2012
- BI: 6.1, 6.2, and 7
Feb 6
Dimensionality Reduction
- Dimensionality Reduction (pdf)
- PPT: Chapter 4 of link
Project 1: Tutorial Introduction
Feb 13
Regression
- Linear Discrimination (pdf, ppt)
- Combining Multiple Learners (pdf, ppt)
- Ensemble slides from TSK book.
Project 1;
Assignment 1
- BI: 8.2-8.5, 10.5
- ML: 10.7
- BI: 10.2; ML: 3.3;
- ML:17
- N. Chawla, Data Mining for Imbalanced Datasets: An Overview, In: O. Maimon and L. Rokach, Eds., Data Mining and Knowledge Discovery Handbook, Springer, New York, 2010, pp. 875-886: link
Feb 20
Bayesian and Graphical Methods
- Probability reviewed
- Graphical Models (pdf, ppt)
- BI: 10:4;
- ML: 16.2,16.3,16.6
Feb 27
Text and Web Mining
- Text Mining and the Vector Space Model
- The Page Rank Algorithm (Ullman's PPT)
- Query classification (PPT, Paper)
Assignment 2
- VSM: link
- Page Rank on Wiki: link
- Query and Search: link, TrustRank
- References at Stanford U.
- J. Ullman's Book Draft: Link Analysis
Mar 5
Time Series, Streaming Data
- Time Series Models (PDF slides from Faloutsos tutorial)
- Time Series Concepts
- Data Stream Mining.ppt
Mar 12
Large-scale Computation
Project 2 Mar 19
Social Networks/Media
- Social Networks and Graphs (PPT)
Assignment 3
- TL: 1-3
Mar 26
Social Networks/Media
- Community Detection and Graph-based Clustering (PPT)
** Students sign up for presentation topics
- TL: 4-5
Apr 2
Recommendation Systems
Apr 9
Social Recommendation Systems
- Social Tagging (B_rkur Sigurbj_rnsson and Roelof van Zwol,)Flickr Tag Recommendation based on Collective Knowledge, WWW 2008 (Slides)
- Collective Intelligence (Slides)
Assignment4
- ECML PKDD 2009 Challenge: Introduction, Remarks
- Steffen Rendle and Lars Schmidt-Thieme, Pairwise Interaction Tensor Factorization for Personalized Tag Recommendation, WSDM 2010 (Slides)
- ACM WSDM Tutorial on Crowdsourcing (the Turk on Youtube) (Slides)
- T. Sakaki, et al.: Earthquake shakes Twitter users: real-time event detection by social sensors. WWW 2010, 851-860. (Slides)
Apr 16
Presentations and term papers
Each student should give two class presentations (20 minute each) using PowerPoint or PDF slides, and write a 10-page term paper on the chosen topic.
Please select papers in recent 3 years from the listed conference proceedings (see right column). Please choose papers on topics relevant to social media, social networks and social recommendationsApr 23
Presentations and demos
Presentations and Term Paper Schedule and Requirement
May 14: 9:30 -15:30
Presentations (30 min/student)
Final Project Report and Term Paper are due on May 14th, 2012
For details on the term paper, please see the above link.
Links:
· ML Book: Lecture Slides
· Ullman's Data Mining Course Page
· TA's Tutorial Page
· Assignments and Projects
· Papers
· Datasets