DATA WAREHOUSING, MINING AND BUSINESS INTELLIGENCE (DWMBI)
B.E. (IT) Semester- 7, (December 2010)
Con. 6110-10.
GT-8967
(3 Hours)
[Total Mark 100]
N.B. : (1) Question No 1 is compulsory.
(2) Attempt any four questions out of remaining questions.
1. One of India’s large Retail Department Chains, with annual revenues touching $2.5 billion mark and having over 3600 employees working at diverse location, was keenly interested in a business intelligence solution that can bring clear insights on operations and performance of departmental stores across the retail chain. The company needed to support a data warehouse that exceeds daily sales data from point of sales (POS) across all locations, with 80 million rows and 71 columns.
(a) List the dimensions and facts for above application. --- (5 Marks)
(b) Design Star Schema and Snow flake schema for the above application. --- (5 Marks)
(c) Design a BI application which will provide Retail Chain Company with features and performance that meet their objectives using any data mining technique. --- (10 Marks)
2. (a) What are the major issue in data mining? --- (6 Marks)
(b) Explain data Mining Task Primitives. --- (6 Marks)
(c) Explain Fact less fact table with suitable example. --- (12 Marks)
3. (a) What is web mining and explain web content mining. --- (10 Marks)
(b) Explain any one method of hierarchical clustering with an example. --- (10 marks)
4. (a) A database has four transactions. Let minimum support and confidence in 50%.
Tid
|
Item Bought
|
1
|
A,B,D
|
2
|
A,D
|
3
|
A,C
|
4
|
B,D,E,F
|
Find out the frequent item sets and strong association rules for above example. --- (5 Marks)
(a) Explain multi dimensional association rules with example. --- (5 Marks)
(b) Explain multilevel association rules. --- (5 Marks)
(c) Explain constraint based association rule mining. --- (5 Marks)
5. (a) Describe Data Discretization, Summarization with an example. --- (7 Marks)
(b) Explain Numerosity Reduction in data preprocessing. --- (6 Marks)
(c) Explain BIRCH method of clustering with an example. --- (7 Marks)
6.(a) Using a given table. Create classification model using any algorithm and hence classify following table. <income=medium, credit=good> --- (10 Marks)
Transaction ID
|
Income
|
Credit
|
Decision
|
1
|
Very High | Excellent | Authorize |
2
|
High | Good | Authorize |
3
|
Medium | Excellent | Authorize |
4
|
High | Good | Authorize |
5 | Very High | Good | Authorize |
6 | Medium | Excellent | Authorize |
7 | High | Bad | Request ID |
8 | Medium | Bad | Request ID |
9 | High | Bad | Reject |
10 | Low | Bad | Call Police |
(b) Explain K- means clustering and solve the following with K=2. ---- (10 Marks)
{ | 2, | 25, | 10, | 15, | 5, | 20, | 4, | 40, | } |
7. Write short notes on any two: --- (20 Marks)
(a) Spatial OLAP
(b) Association rule mining in data stream
(c) Approaches in Text mining
(d) Sequence mining in transactional Data base
Also see Data warehousing, Mining and business intelligence question papers for May 2010
No comments:
Post a Comment