
Question: consider the following information for questions 1 and 2 topco...

Question details

Consider the following information for Questions #1 and #2:

Topco, Inc., a manufacturer of self-propelled lawn mowers, wants to learn more about differences between residential lawn mower buyers. The market research department of Topco believes that customers differ in terms of the importance that they attach to the following two lawn mower features: (1) that the lawn mower allows for bagging (of grass clippings), and (2) that the lawn mower offers rear wheel drive (best for lawns with bumps and slopes). A sample of five representative Topco lawn mower customers reveals the following set of preferences for these two attributes/benefits:

ID bagging wheel

where
(1) the measurement scale is continuous, and ranges from 1 = very unimportant to 10 = very important;
(2) the first column indicates a respondent's ID number;
(3) the second column (x-axis) is the importance weight attached to bagging;
(4) the third column (y-axis) is the importance weight attached to rear wheel drive.

Note: In this assignment, refer to the second column (i.e., bagging) as the x-axis, and the third column (i.e., wheel) as the y-axis. For example, Customer ID #3's importance weights are given by the coordinate values (7.00, 4.00).

For Question #1 below, parts (a) through (j), perform a k-means cluster analysis of the Topco data. For purposes of this question, set k = 2, and use the coordinate point (2, 4) as initial centroid #1 and the coordinate point (3, 2) as initial centroid #2. Perform and report all numeric calculations to 3 decimal places of precision (e.g., 8.352).
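As an illustration of the assignment step, the sketch below computes the Euclidean distance from the one customer whose coordinates are stated in the text, Customer ID #3 at (7.00, 4.00), to each of the two starting centroids. The helper name `euclidean` and the tie-breaking rule (ties go to centroid #1) are assumptions made for this example, not part of the assignment.

```python
import math

def euclidean(p, q):
    """Straight-line distance between two (x, y) points."""
    return math.sqrt((p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2)

# Starting centroids and Customer ID #3, as given in the question text.
centroid_1 = (2.0, 4.0)
centroid_2 = (3.0, 2.0)
customer_3 = (7.0, 4.0)

# Distances reported to 3 decimal places, as the question requires.
d1 = round(euclidean(customer_3, centroid_1), 3)  # sqrt(25 + 0)
d2 = round(euclidean(customer_3, centroid_2), 3)  # sqrt(16 + 4)

# Assumed tie-break: a customer equidistant from both goes to centroid #1.
assigned = 1 if d1 <= d2 else 2

print(f"Customer 3: d1={d1:.3f}, d2={d2:.3f}, assigned to centroid #{assigned}")
```

Under these assumptions Customer ID #3 is closer to starting centroid #2 (4.472 versus 5.000), so it would be listed under part (b). The same comparison is repeated for each of the other four customers.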

(a) Which customers are assigned to starting centroid #1, and what is the Euclidean distance between each of these customers and starting centroid #1? In your answer, clearly indicate each customer's ID and the Euclidean distance (to 3 decimal places) between the customer and the starting (i.e., initial) centroid.
Buyer id:  ___  ___  ___  ___  ___
Distance:  ___  ___  ___  ___  ___
(Note: here, and below, complete for as many customers as appropriate.)

(b) Which customers are assigned to starting centroid #2, and what is the Euclidean distance between each of these customers and starting centroid #2? In your answer, clearly indicate each customer's ID and the Euclidean distance (to 3 decimal places) between the customer and starting centroid #2.
Buyer id:  ___  ___  ___  ___  ___
Distance:  ___  ___  ___  ___  ___

(c) Following your assignment (in parts a and b above) of customers to the two starting centroids, what are the revised (i.e., updated) centroid coordinate values for centroid #1 and centroid #2? (Note: let's refer to these revised values as the 1st iteration-revised centroids.)
1st iteration-revised centroid #1: ___
1st iteration-revised centroid #2: ___

(d) Next, based on your answer to part (c), and continuing the k-means clustering process, which customers should be assigned to the 1st iteration-revised centroid #1, and what is the Euclidean distance between each of these customers and the 1st iteration-revised centroid #1?
Buyer id:  ___  ___  ___  ___  ___
Distance:  ___  ___  ___  ___  ___

(e) Similarly, based on your answer to part (c), which customers should be assigned to the 1st iteration-revised centroid #2, and what is the Euclidean distance between each of these customers and the 1st iteration-revised centroid #2?
Buyer id:  ___  ___  ___  ___  ___
Distance:  ___  ___  ___  ___  ___

(f) Based on your customer assignments in parts d and e above, what are the 2nd iteration-revised centroid values for centroid #1 and centroid #2?
2nd iteration-revised centroid #1: ___
2nd iteration-revised centroid #2: ___

(g) Next, based on your answer to part (f) above, and continuing the k-means clustering process, which customers should be assigned to the 2nd iteration-revised centroid #1, and what is the Euclidean distance between each of these customers and the 2nd iteration-revised centroid #1?
Buyer id:  ___  ___  ___  ___  ___
Distance:  ___  ___  ___  ___  ___

(h) Similarly, based on your answer to part (f) above, which customers should be assigned to the 2nd iteration-revised centroid #2, and what is the Euclidean distance between each of these customers and the 2nd iteration-revised centroid #2?
Buyer id:  ___  ___  ___  ___  ___
Distance:  ___  ___  ___  ___  ___

(i) Based on your customer assignments in parts g and h above, what are the 3rd iteration-revised centroid values for centroid #1 and centroid #2?
3rd iteration-revised centroid #1: ___
3rd iteration-revised centroid #2: ___

(j) Are any additional iterations needed in this k-means clustering problem?
Yes or no? ___
Why or why not? ___
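The assign, update, and repeat cycle that parts (a) through (j) walk through by hand can be sketched as a short loop. This is a minimal illustration, not a worked answer: the five sample points in `placeholder_points` are hypothetical values chosen for the example and are not the Topco survey data; only the two starting centroids come from the question. The function names, the rule that ties go to the lower-numbered centroid, and the choice to keep an empty cluster's centroid unchanged are likewise assumptions.

```python
import math

def euclidean(p, q):
    """Straight-line distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def assign(points, centroids):
    """Map each customer id to the index of its nearest centroid.
    min() returns the first minimum, so ties go to the lower index."""
    return {cid: min(range(len(centroids)),
                     key=lambda k: euclidean(p, centroids[k]))
            for cid, p in points.items()}

def update(points, labels, centroids):
    """Recompute each centroid as the mean of its members (3 decimals)."""
    revised = []
    for k, old in enumerate(centroids):
        members = [points[cid] for cid, lab in labels.items() if lab == k]
        if not members:
            revised.append(old)  # assumption: an empty cluster keeps its centroid
        else:
            revised.append(tuple(round(sum(c) / len(members), 3)
                                 for c in zip(*members)))
    return revised

def kmeans(points, centroids, max_iter=100):
    """Iterate until the customer assignments stop changing."""
    labels = None
    for _ in range(max_iter):
        new_labels = assign(points, centroids)
        if new_labels == labels:   # no customer switched clusters: stop
            break
        labels = new_labels
        centroids = update(points, labels, centroids)
    return labels, centroids

# HYPOTHETICAL data for illustration only (id -> (bagging, wheel)).
placeholder_points = {1: (1.0, 1.0), 2: (2.0, 1.0), 3: (8.0, 9.0),
                      4: (9.0, 8.0), 5: (1.0, 2.0)}
# Starting centroids #1 and #2 as given in the question (indices 0 and 1).
start = [(2.0, 4.0), (3.0, 2.0)]

labels, centroids = kmeans(placeholder_points, start)
print(labels, centroids)
```

The stopping test mirrors the reasoning asked for in part (j): once a reassignment pass leaves every customer in the same cluster as before, the centroids cannot change either, so further iterations would reproduce the same result.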
