You are on page 1of 4

1 Format No.

: FM2/Issue: 01/Revision: 00

SRINIVASAN ENGINEERING COLLEGE, PERAMBALUR
ODD SEMESTER 2013 2014

COURSE HANDOUT
DATE: 20.06.2013


SUB.CODE : MC9280
SUB. TITLE : DATA MINING AND DATA WAREHOUSING
STAFF NAME : K.NITHYA KALYANI



Scope and Objective of the Course:

To retrieve the data from heterogeneous database effectively using various algorithms and
techniques.


Text book(s) [TB]:
TB1. Jiawei Han and Micheline Kamber, Data Mining Concepts and techniques, Second Edition,
Elsevier-2007
TB2. Alex Berson and J.Stephen Smith, Data Warehousing, Data Mining & OLAP,
Tata McGraw Hill Edition, Reprint 2007



COURSE PLAN / SCHEDULE:

S.No Topics to be covered Learning objectives Ref. to
Text
Book
No. of
lecture
s
UNIT I- DATA WAREHOUSING AND BUSINESS ANALYSIS
1
Data Warehousing
Components
Intoduction of the components of
data warehousing, overall
architecture
TB2(115-
127) 1
2
Building a Data warehouse
Business consideration of warehouse TB2(129-
149)
1
3 Mapping the Data
Warehouse to
Multiprocessor
Architecture
Database technology for data
warehouse
TB2(151-
167)
2
4
DBMS Schemas for
Decision Support
Data layout for best access TB2(169-
185)
2
5
Data Extraction, Cleanup,
and Transformation Tools
Tool requirements for data
extraction, cleanup and
transformation
TB2(187-
203) 2
6
Metadata
Definition and trends TB2(205-
219)
1
7
Reporting and Query Tools
and Applications
Tool categories and need for
applications
TB2(223-
243) 2

2 Format No.: FM2/Issue: 01/Revision: 00

8
Online Analytical
Processing (OLAP)
Process of online analysis TB2(247
1
9
Multidimensional Data
Model
Deals with Multidimensional OLAP TB2
(248-256)
1
UNIT II DATA MINING
10
Data Mining Functionalities
Different kinds of patterns can be
mined. Mining frequent patterns
TB1(21-
27)
1
11
Data Preprocessing, Data
Cleaning, Data Integration
and Transformation

How the data be processed in order
to help to improve the quality of the
data.
TB1(47-
72)
2
12 Data Reduction, Data
Discretization and Concept
Hierarchy Generation
Can reduce the data size by
aggregation, Elimination and
redundant features.
TB1(72-
96) 2
13 Efficient and Scalable
Frequent Item set Mining
Methods
Data mining deals with
patterns,associations and
correlations
TB1(234-
249) 2
14 Mining Various Kinds of
Association Rules
Various Kinds of Association Rules
for multilevel mining
TB1(250-
259)
2
15
Association Mining to
Correlation Analysis
It deals with from association
analysis to correlation analysis
TB1(259-
265)
1
16 Constraint-Based
Association Mining
Mining guided by rule constraints TB1(265-
272)
1
UNIT III- CLASSIFICATION AND PREDICTION
17
Classification and
Prediction
Classification and Prediction are two
forms of data analysis that can be
used to extract model
TB1-285
1
18 Issues Regarding
Classification and
Prediction
Informations regarding
Classification and Prediction
TB1(289-
291) 1
19 Classification by Decision
Tree Introduction
Learning of decision trees TB1(291-
310)
1
20
Bayesian Classification
It is based on Bayes theorem
,studies comparing Bayesian
algorithm
TB1(310-
318) 2
21
Rule Based Classification
Look at Rule Based classifiers TB1(318-
327)
1
22
Classification by
Backpropagation
Backpropagation is neural network
algorithm & it deals with its
classification
TB1
(327-336) 1
23
Support Vector Machines
A method for the classification of
both linear and non-linear data
TB1(337-
344)
1
24
Associative Classification
Classification by Association rule
analysis
TB1
(344-347)
1
25 Lazy
Learners
Learning from your neighbors TB1(347-
350)
1
26 Other Classification
Methods
Methods of Other Classifications TB1(351-
354)
1
27
Prediction
It is the task of predicting values TB1(354-
1

3 Format No.: FM2/Issue: 01/Revision: 00

359)
28
Accuracy and error
measures
To compute classifier accuracy and
measures in techniques for accuracy
estimation
TB1(359-
363) 1
29 Evaluating the Accuracy of
a Classifier or Predictor
Estimating accuracy using different
types of methods
TB1(363-
366)
1
30
Ensemble Methods
Discuss about bagging and boosting
techniques
TB1(366-
370)
1
31
Model Section
Discuss about two models such as
estimating confident intervals and
ROC curves
TB1(370-
373) 1
UNIT IV- CLUSTER ANALYSIS
32
Cluster Analysis
Intro of cluster analysis, it deals with
grouping of processes
TB1(383-
386)
1
33 Types of Data in cluster
analysis
Different kinds of data TB1(386-
398)
2
34 Categorization of Major
Clustering Methods
Different kinds of Clustering
Methods
TB1
(398-401)
2
35
Partitioning Methods
Discuss about various kinds of
partitioning algorithms
TB1
(401-408)
2
36
Hierarchical Methods
Discuss about various kinds of
algorithms
TB1(408-
418)
2
37
Density-Based Methods
Clusters of dense regions of objects
in the data space
TB1(418-
424)
1
38
Grid Based Methods
Multiresolution grid data structure TB1(424-
429)
1
39
Model-Based Clustering
Methods
To optimize the fit between the
given data and some mathematical
model
TB1(429-
434) 1
40 Clustering High
Dimensional Data
Design of clustering for
multidimensional
TB1(434-
444)
1
41
Constraint Based Cluster
Analysis
Clusters that satisfy user-specified
preferences or constraints, different
objects & functions
TB1(444-
451) 1
42
Outlier Analysis
Data objects that do not comply with
the general behavior
TB1(451-
459)
1
UNIT V- MINING OBJECT, SPATIAL, MULTIMEDIA, TEXT AND WEB DATA
43

Multidimensional Analysis
and Descriptive Mining of
Complex Data Objects
Discuss about generalization and
aggregation
TB1(591-
596) 1
44
Spatial
Data Mining
Discuss about Spatial data cube,
spatial OLAP, Raster databases &
clustering methods
TB1(600-
607) 1
45
Multimedia Data Mining
Multimedia audio & video data
mining
TB1(607-
613)
1
46
Text Mining
Dimensionality Reduction for Text
& approaches
TB1(614-
624)
1
47 Mining the World Wide
Web
To learn many information from
World Wide Web Mining
TB1(628-
640)
1
Total number of classes planned: 60

4 Format No.: FM2/Issue: 01/Revision: 00



EVALUATION SCHEME INTERNAL ASSESMENT



I. ASSIGNMENT TOPIC:

Data preprocessing
Mining Various Kinds of Association Rules
Types of Data in cluster analysis


II. Part-A Question with answer will be issued by the instructor on completion of each unit.

Timings for chamber consultation: Students should contact the Course Instructor in her/his
chamber during lunch break.
Notices: All notices will be displayed on the Department Notice Board.




STAFF IN-CHARGE HOD
(K.NITHYA KALYANI)




EC
No.
Evaluation
Componen
ts
Duration Weighta
ge
Date & Time Venue
1 Slip Tests 40 min 20%
2 Cycle Test
1
1.30hr 30% 29.7.13
31.7.13














T
o

b
e






a
n
n
o
u
n
c
e
d

l
a
t
e
r

3 Cycle Test
2
1.30hr 21.8.13 -
23.8.13
4 Cycle Test
3
1.30hr 2.9.13 4.9.13
5 Model
Exam
3hr 20% 4.10.13
10.10.13
7 Assignment 15 days 10%
8 Attendance
Percentage
Continuous 20%

You might also like