Wednesday, July 2, 2014

DATA WAREHOUSE AND DATA MINING (DWDM) MAY 2011 COMPUTER SCIENCE SEMESTER 6

(3 Hours)                              [Total Marks: 100]

1. (a) Describe the steps in the KDD process with a suitable block diagram. [5 Marks]
    (b) Compare OLTP and OLAP. [5 Marks]
    (c) What will be the effect of performing Attribute-Oriented Induction (AOI) on the initial working
          relation Student with attributes such as name, gender, birth date, birth place, address,
          phone-no and gpa? [10 Marks]

2. (a) Using the table given below, create a classification model using the decision tree technique.
         Indicate how to utilize the model to estimate the risk category of the customer with
         (credit history - bad, debt - high, collateral - none, income - 15-35k). [10 Marks]
    (b) Define a data warehouse. Explain the architecture of data warehouse with suitable block
         diagram. [10 Marks]

3. (a) Consider the data set given. Create the adjacency matrix. Use single link
        agglomerative technique to cluster the given data. Draw the dendrogram. [10 Marks]

    (b) What are different ways of finding the distance between two clusters? [5 Marks]
    (c) Define factless Fact table with a suitable example. [5 Marks]

4. (a) What is association rule mining? Give the Apriori algorithm. Apply AR Mining to find
         all frequent itemsets from the following table:
    (b) Explain the major steps in the ETL process with a suitable diagram and an
         example. [10 Marks]

5. (a) The college wants to record the grades for the courses completed by students.
          There are four dimensions:
                (i) Course        (ii) Student
                (iii) Professor   (iv) Period
       The only fact that is to be recorded in the table is the course grade.
               (i) Design a star schema. [5 Marks]
              (ii) Write DMQL for the above star schema. [5 Marks]
      (b) Using the above example, describe the following OLAP operations: [10 Marks]
            Slice, Dice, Roll-up, Drill-down, Pivot

6. (a) What are crawlers? How do periodic crawlers differ from incremental crawlers?
          Give the architecture of focused crawlers and explain how it is used. [10 Marks]
    (b) Explain how the HITS algorithm finds hubs and authoritative pages. [10 Marks]

7. Write short notes on (any four): [20 Marks]
     (a) Outlier Mining
     (b) Applications of Web Usage Mining.
     (c) Snowflake Schema
     (d) Generalized Association Rules
     (e) Top-Down and Bottom-Up approaches in data warehousing.  
