This course will enable students to learn basic data mining concepts. This will help them in understanding analytical procedures used in Business Analytics through data mining approach.
Learning outcome (at course level) | Learning and teaching strategies | Assessment Strategies |
| Approach in teaching: Interactive Lectures, Discussion, Demonstrations, Group activities, Teaching using advanced IT audio-video tools. Learning activities for the students: Self-learning assignments, Effective questions, Seminar presentation, Giving tasks. | Two Internal Tests, Quiz, Home Assignments, Presentations.
|
Architecture of Data Warehouse, Data Preprocessing – Need, Data Cleaning, Data Integration &Transformation, Data Reduction, Machine Learning, Pattern Matching. Introduction to Data Mining: Basic Data Mining Tasks, Data Mining versus Knowledge Discovery in Databases, Data Mining Metrics, Data Mining Query Language, Applications of Data Mining.
Frequent item-sets and Association rule mining: Apriori algorithm, Use of sampling for frequent item-set,FP tree algorithm, Graph Mining, Frequent sub-graph mining. Market Basket Analysis and Association Analysis, Market Basket Data, Stores, Customers, Orders, Items, Order Characteristics, Product Popularity, Tracking Marketing Interventions.
Decision tree learning: Construction, performance, attribute selection Issues: Over-fitting, tree pruning methods, missing values, Information Gain, Gain Ratio, Gini Index, continuous classes. Classification and Regression Trees (CART) and C 5.0.
Bayes Theorem, Naïve Bayes classifier, Bayesian Networks Inference, Parameter and structure learning: Linear classifiers, Least squares, logistic, perceptron and SVM classifiers, Prediction: Linear regression, Non-linear regression (Artificial Neural Networks).
Precision, recall, F-measure, confusion matrix, cross-validation, bootstrap, Clustering: k-means, Expectation Maximization (M) algorithm, Hierarchical clustering, Correlation clustering, DBSCAN.
1. W. H. Inmon, “Building the Data Warehouse”, Wiley Dreamtech India Pvt. Ltd., 4th Edition, 2005
e-Resources:
1. https://nptel.ac.in/courses/106106222
2. www.kaggle.com
Journals:
1. International Journal of Design
2. Journal of the Brazilian Computer Society, SpringerOpen
Journal of Internet Services and Applications, SpringerOpen
Links:
[1] https://www.csit.iisuniv.ac.in/courses/subjects/introduction-data-mining-5
[2] https://www.csit.iisuniv.ac.in/academic-year/2025-26