Topics to be covered in the Course: Data Warehousing and Data Mining
Data Warehousing
- Introduction
to Data Warehousing – Batch, OLTP, DSS Applications. Different natures of
OLTP and DW databases. Commercial Importance of DW. Data Marts
- Basic
Elements of DataWarehouse – Source System, Data Staging Area, Presentation
Server
- Business
Dimensional Life Cycle
- Dimensional
Modeling. Multidimensional Data Model, Data Cubes, OLAP
- DW Bus
Architecture, Conformed Dimensions
- Star
Schema and Snowflake Schema
- Normalization
VS Dimensional Modeling
- Slicing
and Dicing, Drilling, Drill-up, Drill-down, Drill-within, Drill-across.
- Bitmap
Index
- Aggregation
- Metadata
- Design
Issues, Partitioning, Size Estimation
- Example
Applications: Retail, CRM, Telecom, E-Commerce, Insurance
- End-user
applications
Data Mining
- KDD
and Data Mining
- SQL
and Data Mining
- Association
Rules
- Bayesian
Network Approach
- Decision
Trees
- Neural
Networks, Genetic Algorithms, Rough Sets, SVM
- Temporal
& Spatial Data Mining
- Sequence
Mining
- Text
Mining
- Web
Mining
Suggested Text Books
a. R.Kimball – DataWarehouse
Lifecycle Toolkit (J.Wiley)
b. R.Kimball – DataWarehouse
Toolkit (J.Wiley)
c. Anahory and Murray – Data
Warehousing in the Real World (Pearson Education)
d. A.K.Pujari – Data mining
(University Press)
e. J. Hahn and Micheline Kamber -
Data Mining: Concepts and Techniques (Morgan Kaufmann)
All the above books have Indian
editions.