Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Computer Engineering Expert

Assignment: Clustering

Your task for this assignment is to implement and evaluate the k-means clustering algorithm.

1. Implement the k-means clustering algorithm.

a. You can use any programming language that you are familiar with.

b. The program should be executable with at least 3 parameters: the name of the dataset file, k, and the name of the output file.

c. The output file should contain numerical class labels (formatted as one number per row) for all the records in the test dataset and report the sum squared error (SSE) in the last row.

d. You only need to handle numerical attributes (categorical attributes are not required).

2. Select two datasets from the UCI repository and evaluate the algorithm using SSE and another metric of your choice (e.g. BCubed precision and recall or Jaccard score if you have the class labels) with varying k. (I intend to run your implementation on the fisher iris dataset without the labels.

3. Write a brief report to:

a. Describe the datasets.

b. Describe your implementation and experiment setup, e.g. any preprocessing you performed on the dataset such as normalizing the attributes, distance metrics you used, etc.

c. Present the experiment results with varying k.

d. Discuss the insights and conclusions from your experiments.

4. This is an individual assignment.

5. Submission. You will upload two items to Canvas: your PDF report and a zip or tar file.

This zip/tar file must contain:

Your source files (include your name(s) in commented form at the top of all source files), the executable, a README file explaining how to compile/run your program, the output files for your test datasets.

Computer Engineering, Engineering

  • Category:- Computer Engineering
  • Reference No.:- M92757710

Have any Question?


Related Questions in Computer Engineering

Respond to the statement below in at least 100 words

Respond to the statement below in at least 100 words. Original answers only. If developers are making decisions on the requirements, then how do they know that the software will work properly for the end user? Developers ...

Hayley motorcycle company just paid a dividend of 14 today

Hayley Motorcycle Company just paid a dividend of $1.4 today, and is expected to pay a dividend in year 1 of $1.8, a dividend in year 2 of $2.3, a dividend in year 3 of $2.9, and a dividend in year 4 of $3.8. After year ...

Say that a processor addresses bytes with 64 bits addresses

Say that a processor addresses bytes with 64 bits addresses, has 8 byte words, and 1024 cache entries that are each 8 words long. a) How would the bits of the address be used to indicate the set, line, word and byte? b) ...

What is marketing discipline what is most peoples

What is marketing discipline? What is most people's perception of marketing discipline? Name an organization that has done a great job marketing. What did they do to make you feel this way?

It is said the best icons are more like traffic signs than

It is said "the best icons are more like traffic signs than graphic illustrations" providing a quick indication of the task they represent. Take a look at the Ribbon and tell us how well the graphic on each of the Ribbon ...

Sub classassume super class and methods defined in question

Sub Class Assume Super Class and methods defined in question VIII Assume a Sub class named RunSomeSuper Write the Begin method Construct an instance of SomeSuper with parms of your choosing Just the Begin method, not the ...

Why regulated industries should be required to follow

Why regulated industries should be required to follow security standards. Provide 2 examples of industries that would fall under this category.

Start your c development tool and view the swatthebugs16cpp

Start your C++ development tool and view the SwatTheBugs16.cpp file. The file is contained in either the Cpp7\Chap05\Swat TheBugs16 Project folder or the Cpp7\Chap05 folder. (Depending on your C++ development tool, you m ...

Question you need to research the topic and discuss the

Question: You need to research the topic and discuss the topic in at least 400-500 words with references. A post without a reference will not count as a discussion. What is text mining and what is the purpose of it? Give ...

Question suppose you have a class whose objects are very

Question : Suppose you have a class whose objects are very, very large. Briefly, describe the advantages and drawbacks of call-by-value and call-by-reference for large objects. Describe a parameter passing mechanism that ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As