July 27 2003 kelowna ufo

29/11/2021 Client: muhammad11 Deadline: 2 Day

Machine Learning

{
 "cells": [
  { "cell_type": "markdown", "metadata": {}, "source": [
    "# Assignment 2: Data Preprocessing\n",
    "\n",
    "UFO Sighting Data Exploration\n",
    "\n",
    "MCIS 6283-Machine Learning\n",
    "\n",
    "Due date: Jan 23rd, 2019 (Wednesday)\n",
    "\n",
    "Total Points: 100\n",
    "\n",
    "Instructor: Xin Yang\n"
  ] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "### Please put your name, student ID, date and time here (5 points)\n",
    "* Name: Sannidha Nallamothu\n",
    "* Student ID: 999900278\n",
    "* Date:\n",
    "* Time:"
  ] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "* In this assignment, you will investigate UFO data over the last century to gain some insight.\n",
    "* Please use all the techniques we have learned in class to preprocess/clean the dataset ufo_sightings_large.csv.\n",
    "* After the dataset is preprocessed, please split the dataset into training sets and test sets.\n",
    "* Fit KNN to the training sets.\n",
    "* Print the score of KNN on the test sets."
  ] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 1. Import dataset \"ufo_sightings_large.csv\" in pandas (5 points)"
  ] },
  { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 2. Checking column types & Converting Column types (10 points)\n",
    "Take a look at the UFO dataset's column types using the dtypes attribute. Please convert the column types to the proper types.\n",
    "For example, the date column can be transformed into the datetime type.\n",
    "That will make our feature engineering efforts easier later on."
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 3. Dropping missing data (10 points)\n",
    "Let's remove some of the rows where certain columns have missing values."
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 4. Extracting numbers from strings (10 points)\n",
    "The length_of_time column in the UFO dataset is a text field that has the number of \n",
    "minutes within the string. \n",
    "Here, you'll extract that number from that text field using regular expressions."
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 5. Identifying features for standardization (10 points)\n",
    "In this section, you'll investigate the variance of columns in the UFO dataset to \n",
    "determine which features should be standardized. You can log-normalize the high-variance column."
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 6. Encoding categorical variables (20 points)\n",
    "There are a couple of columns in the UFO dataset that need to be encoded before they can be \n",
    "modeled through scikit-learn. \n",
    "You'll do that transformation here, using both binary and one-hot encoding methods."
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 7. Text vectorization (10 points)\n",
    "Let's transform the desc column in the UFO dataset into tf/idf vectors, \n",
    "since there's likely something we can learn from this field."
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 8. Selecting the ideal dataset (10 points)\n",
    "Let's get rid of some of the unnecessary features."
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 9. Split the X and y using train_test_split, setting stratify = y (5 points)"
  ] },
  { "cell_type": "code", "execution_count": 9, "metadata": { "collapsed": true }, "outputs": [], "source": [
    "X = ufo.drop([\"type\"], axis=1)\n",
    "y = ufo[\"type\"].astype(str)\n",
    "\n",
    "\n"
  ] },
  { "cell_type": "markdown", "metadata": {}, "source": [
    "## 10. Fit knn to the training sets and print the score of knn on the test sets (5 points)"
  ] },
  { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [
    "from sklearn.neighbors import KNeighborsClassifier\n",
    "knn = KNeighborsClassifier(n_neighbors=5)\n",
    "# Fit knn to the training sets\n",
    "knn.fit(train_X, train_y)\n",
    "# Print the score of knn on the test sets\n",
    "print(knn.score(test_X, test_y))"
  ] }
 ],
 "metadata": {
  "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" },
  "language_info": {
   "codemirror_mode": { "name": "ipython", "version": 3 },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.6.7"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
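
Sections 2 through 6 of the notebook describe type conversion, dropping missing rows, regex extraction, log normalization, and encoding, but leave the cells empty. The following is a minimal sketch of how those steps might look in pandas, not the intended solution. The `date` and `length_of_time` columns come from the assignment text; the `seconds`, `country`, and `state` columns, the sample rows, and the `preprocess` helper are assumptions made up for illustration, since the real schema of ufo_sightings_large.csv is not shown.

```python
import numpy as np
import pandas as pd

# Tiny stand-in for ufo_sightings_large.csv. The values, and the
# `seconds`, `country`, and `state` columns, are hypothetical.
ufo = pd.DataFrame({
    "date": ["2003-07-27 21:00:00", "2011-11-03 19:21:00",
             "2004-05-15 22:30:00", "2012-01-01 00:05:00"],
    "length_of_time": ["about 5 minutes", "2 minutes", None, "10 minutes or so"],
    "seconds": ["300", "120", "60", "600"],
    "country": ["ca", "us", "us", "ca"],
    "state": ["bc", "tx", "ca", "on"],
})


def preprocess(df):
    """Hypothetical helper walking through steps 2-6 on a copy of df."""
    out = df.copy()

    # Step 2: convert string columns to proper types.
    out["date"] = pd.to_datetime(out["date"])
    out["seconds"] = out["seconds"].astype(float)

    # Step 3: drop rows where a column we depend on is missing.
    out = out.dropna(subset=["length_of_time"])

    # Step 4: extract the first run of digits from the free-text field.
    out["minutes"] = (
        out["length_of_time"].str.extract(r"(\d+)", expand=False).astype(float)
    )

    # Step 5: log-normalize a column whose variance is high
    # (inspect out[["seconds", "minutes"]].var() to decide which one).
    out["seconds_log"] = np.log(out["seconds"])

    # Step 6: binary-encode a two-valued column, one-hot encode another.
    out["country_enc"] = (out["country"] == "us").astype(int)
    state_dummies = pd.get_dummies(out["state"], prefix="state")
    return pd.concat([out, state_dummies], axis=1)


clean = preprocess(ufo)
print(clean[["minutes", "seconds_log", "country_enc"]])
```

Taking only the first number in `length_of_time` is a simplification: strings such as "5-10 minutes" or "half an hour" would need a more careful pattern.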

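The last cell of the notebook calls `knn.fit(train_X, train_y)`, but no cell defines `train_X`, `test_X`, `train_y`, or `test_y`. The sketch below shows one way sections 7, 9, and 10 could fit together: tf/idf vectors from `desc`, a stratified `train_test_split`, then KNN. The synthetic rows, the 25% test size, and `random_state=42` are assumptions chosen only so the example runs on its own; the real feature matrix would also include the other preprocessed columns.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in data (hypothetical): 10 sightings per type.
ufo = pd.DataFrame({
    "desc": [f"bright light hovering over the lake {i}" for i in range(10)]
          + [f"dark triangle shaped craft moving slowly {i}" for i in range(10)],
    "type": ["light"] * 10 + ["triangle"] * 10,
})

# Step 7: turn the free-text description into tf/idf vectors.
vec = TfidfVectorizer()
X = vec.fit_transform(ufo["desc"])
y = ufo["type"].astype(str)

# Step 9: stratify=y keeps the class proportions similar in both splits.
train_X, test_X, train_y, test_y = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# Step 10: fit KNN on the training split and score it on the test split.
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(train_X, train_y)
score = knn.score(test_X, test_y)
print(score)
```

The vectorizer is fitted on the whole column here because the assignment lists vectorization before the split. Fitting it on the training rows only and calling `vec.transform` on the test rows is the stricter choice, since it keeps test vocabulary out of training.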