Tuesday, October 29, 2013

Semester I 2013-14 Class: BE(IT) Subject: DMDW Assignment III

MGM’s College of Engineering, Nanded.
Department of IT
Semester I (2013-14)
Class: BE(IT) Subject: DMDW Assignment III
___________________________________________________________________
1.Define: 1) Support 2) Confidence 3) Frequent itemset.
2.Explain the functional components required for data mining GUI.
3.What are the steps of apriori algorithm? Explain in detail.
4.For the following transaction database, find the frequent itemsets using apriori algorithm.
(Use support as 50 %).










5.List the ways to improve the efficiency of apriori algorithm.
6.Give the comparison between supervised learning and unsupervised learning.
7.What are Iceberg queries? Explain with an example.
8.What are the various forms of presenting and visualizing the discovered patterns?
9.What is information gain? How Information Gain is calculated?
10.Discuss the Multilevel Association Rules mining for transaction database.
11.Explain the different approaches for Multilevel Association Rules mining for transaction database.
12.Explain the decision tree algorithm.
13.What are the different types of data in cluster analysis?
14.Explain the different types of cluster analysis method and discuss their features.
15.Describe k-means algorithm and discuss its strengths and weaknesses.
16.The following table shows a set of paired data where x is the number of years of work experience of a college graduate and y is corresponding salary of the graduate. There is a linear relationship between the two variables, x and y. Use Straight-line regression method with least squares and predict the salary of a college graduate with 10 years of experience.

Faculty Incharge: Hashmi S A


Saturday, September 21, 2013


DMDW Assignment II Sept. 2013 2013-14

MGM’s College of Engineering, Nanded.
Department of IT
Semester I (2013-14)
Class: BE(IT) Subject: DMDW Assignment II
________________________________________________________
1. What is data mining? Explain its characteristics.
2. Why data preprocessing is required for DM? What are the types of data preprocessing?
3. How missing values are filled for an attribute in DM data cleaning process?
4. What is a noise in data? Explain the data smoothing techniques in DM.
5. What is data transformation? What functions are performed in data transformation?
6. Define Min-max normalization. Suppose the minimum and maximum values for the attribute salary are Rs. 50000 and Rs. 95000, respectively. Using min-max normalization, transform and map value Rs. 76100 to the range [0.0, 1.0].
7. Define z-score normalization. The mean and standard deviation of the values for the attribute total_marks are 810 and 900, respectively. Using z-score normalization transform a value of 985.
8. Suppose that the data for analysis includes the attribute age. The age values for the data tuples are (in increasing order) 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33,33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.
(a) What is the mean of the data? What is the median?
(b) What is the mode of the data? Comment on the data’s modality (i.e., bimodal, trimodal, etc.).
9. For the data set of Q.8 above
(c) What is the midrange of the data?
(d) Can you find (roughly) the first quartile (Q1) and the third quartile
(Q3) of the data?
10. For the data set of Q.8 above
(e) Give the five-number summary of the data.
(f) Show a boxplot of the data.
(g) How is a quantile-quantile plot different from a quantile plot?
11. Define the following DM functionalities: characterization, discrimination, association and correlation analysis. Give examples of each DM functionality, using a real-life database with which you are familiar.
12. Define classification, prediction, clustering, and evolution analysis. Give examples of each using a real-life database with which you are familiar.
13. List and describe the five primitives for specifying a data mining task.
14. Describe the differences between the following approaches for the
integration of a data mining system with a database or DW system: no
coupling, loose coupling, semitight coupling, and tight coupling.
15. Write data mining query in DMQL for the following case study :
Suppose, as a marketing manager of AllElectronics, you would like to classify customers based on their buying patterns. You are especially interested in those customers whose salary is no less than $40,000, and who have bought more than $1,000 worth of items, each of which is priced at no less than $100. In particular, you are interested in the customer’s age, income, the types of items purchased, the purchase location, and where the items were made. You would like to view the resulting classification in the form of rules.
16. What is KDD? Enlist and explain the stages of KDD.



Faculty Incharge: Hashmi S A

Saturday, August 17, 2013

2013-14 Sem I DMDW Assignment I

MGM’s College of Engineering, Nanded.

Department of IT

Semester I (2013-14)

Class: BE(IT) Subject: DMDW Assignment I

________________________________________________________

1. What is Data warehouse? Explain the characteristics of DW.

2. Give the comparison between OLTP and OLAP.

3. What is data cube? Explain with an appropriate example.

4. Compare star, snow-flake and fact constellation schema.

5. Draw and explain Star and snow-flake schema of a DW for sales.

6. Explain the DW applications.

7. Explain OLAP operations with examples.

8. Explain the benefits of using DW for the businesses.

9. Explain the steps of data warehouse design process.

10. What is concept hierarchy? Explain with an example.

11. What are the different categories of Measures? Explain.

12. Write a note on starnet query model.

13. Draw and explain the architecture of datawarehouse.

14. Explain the data warehouse back-end tools.

15. What are the contents of metadata repository?

16. Write down the DMQL for

a. Star schema of a DW for sales

b. Snowflake schema of a DW for sales

c. Fact constellation schema of a DW for sales and shipping.

17. Discuss the types of OLAP servers.











Faculty Incharge: Hashmi S A

Thursday, May 2, 2013

2012-13 SEM II Class: BE (IT) Subject: GCC Assignment III

MGM’s College of Engineering, Nanded.

Department of IT
Semester II (2012-13)
Class: BE (IT) Subject: GCC Assignment III
________________________________________________________

1. How cloud services can be used for Calendar applications and Scheduling applications?
2. Give the comparison between Google, Yahoo and Windows Calendar.
3. What are the expectations from cloud-based Event Management applications?
4. What business applications are available from salesforce.com?
5. Explain the modules of Zoho CRM cloud service.
6. What are the important tasks of project management? How cloud services can be used for project management?
7. What are the advantages of managing projects online?
8. What are the benefits of cloud-based word processors?
9. Who should use a cloud-based word processor? Why?
10. Explain the features of Google Apps?
11. Who shouldn’t use cloud-based Spreadsheet? Why?
12. What are advantages of cloud-based databases?
13. Write a short note on: Map-reduce.
14. Discuss the risks of storing data in the clouds.
15. Give the comparison between different web-based communication tools.
16. Enlist and explain briefly the cloud applications for databases.



Faculty Incharge: Hashmi S A

Friday, April 5, 2013

April 2013 / Sem II Class: BE (IT) Subject: GCC Assignment II

MGM’s College of Engineering, Nanded.


Department of IT

Semester II (2012-13)

Class: BE (IT) Subject: GCC Assignment II

______________________________________________________

1. Define cloud computing? Discuss its pros.

2. Why cloud computing should not be used? Elaborate.

3. Explain the difference between Client/server computing and cloud computing.

4. What are the important properties of cloud computing? Explain.

5. Draw & explain the architecture of cloud computing.

6. Who benefits from cloud computing?

7. Cloud computing is not useful for which users and Why?

8. What are the services provided by the cloud computing?

9. Explain the EC2 of Amazon.

10. What are the different types of cloud services? Explain with examples.

11. How cloud computing can be used by family?

12. Which activities of the community can be provided on cloud?

13. Explain PAAS with appropriate example.

14. What are the advantages of cloud computing for the corporation?

15. Write a note on: Salesforce.com.

16. How schedule management & project management can be done using cloud computing?



Faculty Incharge: Hashmi S A

Thursday, February 28, 2013

GCC -I Semester II (2012-13) Class: BE(IT)

MGM’s College of Engineering, Nanded.
Department of IT
Semester II (2012-13)
Class: BE(IT) Subject: GCC Assignment I
________________________________________________________

1. What is Grid Computing? Discuss its characteristics.

2. Explain the OGSA architecture.

3. Explain the services provided by grid computing.

4. Draw and explain the Architecture of Globus Tool Kit.

5. How security is implemented in Globus Tool kit? Explain with appropriate diagram.

6. What is virtualization? Discuss its importance in grid computing.

7. Explain the different types of grids.

8. Write a note on : GARUDA-The National Grid of India.

9. What are the functions of resource broker in grid computing?

10. Explain the core functional data requirements for grid computing applications.

11. What are the core functional computational requirements for grid applications?

12. What are the benefits of Grid Computing in business environments?

13. Discuss the application of grid computing in Life Sciences.

14. Discuss the application of grid computing in Collaborative Games.

15. What is Scheduler in grid computing environment and what are its functions?

16. Discuss security issues in Grid Computing.





Faculty Incharge: Hashmi S A