Tuesday, November 18, 2014

DMDW Assignment III Sem I 2014_15 Class:BE IT

MGM’s College of Engineering, Nanded.
Department of IT
Semester I (2014-15)
Class: BE(IT)       Subject: DMDW         Assignment III
___________________________________________________________________
1. Discuss the essential steps of apriori algorithm.
2. Define: 1) Support 2) Confidence. From the following database, find out the items with Support ≥ 40 %.
Tid
Items
T1
M, O, N, K
T2
D,O,N
T3
M, A, K, E
T4
M, U, C, Y
T5
C,O,O,K

3.  For the above transaction database, find the itemsets having confidence ≥ 60 %.
4. Explain the requirements for data mining GUI.
5.  List the ways to improve the efficiency of apriori algorithm.
6. Give the comparison between Classification and Clustering.
7. What are Iceberg queries? Explain with an example.
8. What are the various forms of presenting and visualizing the discovered patterns?
9. What is information gain? How Information Gain is calculated?
10. Discuss the Multilevel Association Rules mining for transaction database.
11. State and explain the Bayesian algorithm.
12. Explain the decision tree induction algorithm.
13. What are the different techniques to calculate the distance between patterns?
14. Explain the different types of cluster analysis methods and discuss their features.
15. Describe k-means algorithm and discuss its strengths and weaknesses.
16. The following table shows a set of paired data where x is the number of years of work experience of a college graduate and y is corresponding salary of the graduate. There is a linear relationship between the two variables, x and y. Use Straight-line regression method with least squares and predict the salary of a college graduate with 10 years of experience.

x years experience
y salary (in $1000s)
5
32
10
59
11
66
15
74
5
38
8
45
13
61
23
92
3
22
18
85

      

Faculty Incharge: Hashmi S A