Lab 1

 LAB EXERCISE


INTRODUCTION INTO WEKA


Exercise 1:

i. Using weather.numeric dataset, find the min, max and average for each variable.

ii. Using weather.numeric dataset, find the frequencies for each value for each variable.


Exercise 2:

No Activity

Load the labor.arff file into WEKA. The file is in the data folder of WEKA. This dataset classifies individuals described by a set of attributes.

1. How many instances and attributes contained in the dataset?


Number of instances

Number of attributes


 

2. Identify type of attribute for vacation, the values of vacation and the number of instances for each value.


3. For the third value of attribute vacation, give the count of each class.


Vacation = 

Class=good –

Class=bad – 




Exercise 3:

i. Go to the following link: https://www.data.gov.my/data/ms_MY/dataset/senarai-lokaliti-hotspot-denggi-di-malaysia

ii. Download the dataset

iii. Try to prepare or process the data until it can be uploaded to WEKA

Note: If you still unable to do it, see this link: https://youtu.be/itixU0jIX3Q


Exercise 4:

i. Find any related data of COVID19 from the Internet. State the URL for that website.

ii. Download the dataset.

iii. Prepare the dataset until it can be uploaded to WEKA. Report the problems that you encounter and briefly explain how you solve it.

iv. Identify the knowledge that you think can be discovered from this dataset.

e.g., of Knowledge:  Trends of COVID19 outbreak (See Mehrota & Agarwal, 2021)


Mehrotra, A., & Agarwal, R. (2021). A Review of Use of Data Mining during COVID-19 Pandemic. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(6), 4547-4552.



No comments:

Post a Comment

Welcome to Data Mining Blog

The Data Mining course introduces the concepts and methods of data mining and shows its relationship with data science. All the steps involv...