Loading...

Messages

Proposals

Stuck in your homework and missing deadline? Get urgent help in $10/Page with 24 hours deadline

Get Urgent Writing Help In Your Essays, Assignments, Homeworks, Dissertation, Thesis Or Coursework & Achieve A+ Grades.

Privacy Guaranteed - 100% Plagiarism Free Writing - Free Turnitin Report - Professional And Experienced Writers - 24/7 Online Support

Intro to Data mining

17/08/2020 Client: sathvik2009 Deadline: 2 Day

 


Chapter 5 exercises


20. Consider the task of building a classifier from random data, where the attribute values are generated randomly irrespective of the class labels. Assume the data set contains records from two classes, “+” and “−.” Half of the data set is used for training while the remaining half is used for testing.


(a) Suppose there are an equal number of positive and negative records in the data and the decision tree classifier predicts every test record to be positive. What is the expected error rate of the classifier on the test data?


(b) Repeat the previous analysis assuming that the classifier predicts each test record to be positive class with probability 0.8 and negative class with probability 0.2.


(c) Suppose two-thirds of the data belong to the positive class and the remaining one-third belong to the negative class. What is the expected error of a classifier that predicts every test record to be positive?


(d) Repeat the previous analysis assuming that the classifier predicts each test record to be positive class with probability 2/3 and negative class with probability 1/3.


Chapter 6 exercises


5. Prove Equation 6.3 in the book. (Hint: First, count the number of ways to create an itemset that forms the left hand side of the rule. Next, for each size k itemset selected for the left-hand side, count the number of ways to choose the remaining d − k items to form the right-hand side of the rule.)


17. Suppose we have market basket data consisting of 100 transactions and 20 items. If the support for item a is 25%, the support for item b is 90% and the support for itemset {a, b} is 20%. Let the support and confidence thresholds be 10% and 60%, respectively.


(a) Compute the confidence of the association rule {a} -> {b}. Is the rule interesting according to the confidence measure?


(b) Compute the interest measure for the association pattern {a, b}. Describe the nature of the relationship between item a and item b in terms of the interest measure.


(c) What conclusions can you draw from the results of parts (a) and (b)?


(d) NOT NEEDED FOR THE TEST


Chapter 7 exercises


5. For the data set with the attributes given below, describe how you would convert it into a binary transaction data set appropriate for association analysis. Specifically, indicate for each attribute in the original data set.

(a) How many binary attributes it would correspond to in the transaction data set,


(b) How the values of the original attribute would be mapped to values of the binary attributes, and


(c) If there is any hierarchical structure in the data values of an attribute that could be useful for grouping the data into fewer binary attributes. The following is a list of attributes for the data set along with their possible values. Assume that all attributes are collected on a per-student basis:


• Year : Freshman, Sophomore, Junior, Senior, Graduate: Masters, Graduate: PhD, Professional


• Zip code : zip code for the home address of a U.S. student, zip code for the local address of a non-U.S. student


• College : Agriculture, Architecture, Continuing Education, Education, Liberal Arts, Engineering, Natural Sciences, Business, Law, Medical, Dentistry, Pharmacy, Nursing, Veterinary Medicine


• On Campus : 1 if the student lives on campus, 0 otherwise


• Each of the following is a separate attribute that has a value of 1 if the person speaks the language and a value of 0, otherwise.


– Arabic

– Bengali

– Chinese Mandarin

– English

– Portuguese

– Russian

– Spanish

Chapter 8 exercises


1. Consider a data set consisting of 2^(20) data vectors, where each vector has 32 components and each component is a 4-byte value. Suppose that vector quantization is used for compression and that 2^(16) prototype vectors are used. How many bytes of storage does that data set take before and after compression and what is the compression ratio?


8. Consider the mean of a cluster of objects from a binary transaction data set. What are the minimum and maximum values of the components of the mean? What is the interpretation of components of the cluster mean? Which components most accurately characterize the objects in the cluster?


9. Give an example of a data set consisting of three natural clusters, for which (almost always) K-means would likely find the correct clusters, but bisecting K-means would not.


11. Total SSE is the sum of the SSE for each separate attribute. What does it mean if the SSE for one variable is low for all clusters? Low for just one cluster? High for all clusters? High for just one cluster? How could you use the per variable SSE information to improve your clustering?


13. The Voronoi diagram for a set of 1( points in the plane is a partition of all the points of the plane into K regions, such that every point (of the plane) is assigned to the closest point among the 1( specified points. (See Figure 8.38.) What is the relationship between Voronoi diagrams and K-means clusters? What do Voronoi diagrams tell us about the possible shapes of K-means clusters?

Homework is Completed By:

Writer Writer Name Amount Client Comments & Rating
Instant Homework Helper

ONLINE

Instant Homework Helper

$36

She helped me in last minute in a very reasonable price. She is a lifesaver, I got A+ grade in my homework, I will surely hire her again for my next assignments, Thumbs Up!

Order & Get This Solution Within 3 Hours in $25/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 3 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 6 Hours in $20/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 6 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 12 Hours in $15/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 12 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

6 writers have sent their proposals to do this homework:

Homework Guru
Top Grade Tutor
Top Essay Tutor
Online Assignment Help
Assignment Hub
Engineering Guru
Writer Writer Name Offer Chat
Homework Guru

ONLINE

Homework Guru

I am a Ph.D. writer with more than 9 years of working experience in Writing. I have successfully completed more than 4500 projects for my clients with their full amount of satisfaction. I will provide you super quality work according to your given requirements and deadline with ZERO plagiarism. I can manage business and professional writing against very reasonable prices.

$230 Chat With Writer
Top Grade Tutor

ONLINE

Top Grade Tutor

I can provide you with a guarantee of plagiarism free work. I am producing quality content for my clients including ARTICLE WRITING, ESSAY WRITING, RESEARCH PAPERS, BUSINESS PLAN, TECHNICAL WRITING, MATLAB, THESIS & DISSERTATIONS.

$245 Chat With Writer
Top Essay Tutor

ONLINE

Top Essay Tutor

I feel, I would be the best choice for this project, I have more than 10 years of working experience in writing essys, reports, case studies and dissertations. Give me your work and get relax

$220 Chat With Writer
Online Assignment Help

ONLINE

Online Assignment Help

I am an elite class Ph.D. writer who can deliver you a supreme level of content within your given deadline. I will give you plagiarism free content within your given timeline.

$225 Chat With Writer
Assignment Hub

ONLINE

Assignment Hub

I feel, I am the best option for you to fulfill this project with 100% perfection. I am working on this forum since 2014 and I have served more than 1200 clients with a full amount of satisfaction.

$225 Chat With Writer
Engineering Guru

ONLINE

Engineering Guru

Hello, I have more than 10 years of writing experience. I can manage essays, summaries, reports and analysis works in very short period of time. I produce plagiarism free content for my clients, will send you FREE TURNITIN Reports as well. Thank you.

$250 Chat With Writer

Let our expert academic writers to help you in achieving a+ grades in your homework, assignment, quiz or exam.

Similar Homework Questions

0.38 repeating as a fraction - What is centrelink working credit - Steps to lighting a bunsen burner - Ashley tisdale sons of anarchy imdb - Homework - Does trevor noah have a child - 35.4 expressed as a decimal becomes - Hamilton county judges try thousands of cases per year - Formal analysis film - Ida sidha karya company is a family owned - TLMT601 Week 3 Case Study 1 - Biblical financial planning ron blue - The eyeball is wrapped in adipose tissue - Final Project: Briefing Project - Family trainer wii download - What is the la galaxy product - The analytical frame of mind - Michael glabicki net worth - IT GOVERNANCE - 4 Page Paper on Big Data - 2 days - ONLINe LOVE ((USA)))*+91-9924492424 Love Problem Solution Specialist Baba Ji - Ikea invades america case study pdf - How to tame a wild tongue questions - Swain v waverley municipal council - Monohybrid practice problems 2 answers - Sin cos tan hexagon - Advance Health Assessment - 4s week 12 S assignment EH - 2 stats problems need solved in TWO HOURS - Bsbfim501 assessment task 1 - Myitlab excel chapter 2 grader project - Reading summary - Pocahontas county schools wv - Invitrogen superscript iii protocol - 2-3 pages - ASSESMENT 3 - Border crossings catherine cucinella pdf - How many edges does a hexagonal prism have - Roberts v. lanigan auto sales - The gilded six bits pdf - Role of education in democracy ppt - How to view xbrl balance sheet - Spo model - Music paper - Superdry questionnaire answers - THE IMPACT OF STANDARDIZED NURSING TERMINOLOGY - Skimming is an unethical business practice involving - Kiehl's case analysis - Essay - Initiating a project checklist - Essay on Dream Job - Oxidation number of h in nah - The Learner - Sap ecc 6.0 modules diagram - Constructivism in international relations - Commonwealth coat of arms shield - Escience lab 14 mendelian genetics answers - Louth county council planning - Entire classes work - Barack obama nobel acceptance speech - Woolworths car insurance payment - Centripetal pump in purifier - Which of the statements below best describes office layout - 12460 uncle charlies spur dunkirk md - Ernest van den haag the ultimate punishment - Program Capstone - Describe the departmentalization approach to organizational structure - Www mbgnet net desert - Using method of joints determine the force - Gyorgy kurtag kafka fragments - What is s8 medication - Need 2 papers - Assignment 4: Artifacts - Phase change of water lab report - The shape of nevada - Trends in cohabitation outcomes - The pleasure of my company - 3820 assignment 2 - Ap physics 2 electrostatics - Create a table of Sorting Algorithms for use as a personal reference or to use if you were explaining algorithms to a peer or coworker. - Nexiq device tester error 275 - SUBJECT: Request for Reconsideration - Islington council emergency repairs - Coventry university netball team - Asp net mvc 4 runtime - The Deinsitiutionalization of American Marriage - C program files google drive googledrivesync exe - Identify learning styles and generational characteristics - Howard arkley cause of death - Soap Note - Periodic table metal non metal metalloids - The concept of confidentiality can be substantiated based on the right of - Philosophy begins in wonder meaning - Eco maps in social work - Zone 1 tram melbourne - Discussion Board - Catastrophic bleed palliative care - Without resorting to computations, what is the total contribution margin at the break-even point? - Metal stud wall design example - Earth Science - Whole foods market supplier portal