Welcome to TutorsOnSpot.Com!

World's No. 1 Assignment Writing Market

Post Your Homework

Proposals

Post your homework and get free proposals here!

Post Your Homework

Stuck in your homework and missing deadline? Get urgent help in $10/Page with 24 hours deadline

Get Urgent Writing Help In Your Essays, Assignments, Homeworks, Dissertation, Thesis Or Coursework & Achieve A+ Grades.

Privacy Guaranteed - 100% Plagiarism Free Writing - Free Turnitin Report - Professional And Experienced Writers - 24/7 Online Support

Get Free Quotes Post Your Requirements

July 27 2003 kelowna ufo

29/11/2021 Client: muhammad11 Deadline: 2 Day

Machine Learning

{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "

\n", "

Assignment 2: Data Preprocessing

\n", "

UFO Sighting Data Exploration

\n", "

MCIS 6283-Machine Learning

\n", "\n", "

Due date: Jan 23rd, 2019 (Wednesday)

\n", "

Total Points: 100

\n", "\n", "

Instructor: Xin Yang

\n", "\n", "\n", "" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Please put your name, student ID, date and time here (5 points)\n", "* Name:Sannidha Nallamothu\n", "* Student ID:999900278\n", "* Date:\n", "* Time:" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "* In this assignment, you will investigate UFO data over the last century to gain some insight.\n", "* Please use all the techniques we have learned in the class to preprocesss/clean the dataset

ufo_sightings_large.csv

. \n", "* After the dataset is preprocessed, please split the dataset into training sets and test sets\n", "* Fit KNN to the training sets. \n", "* Print the score of KNN on the test sets" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 1. Import dataset \"ufo_sightings_large.csv\" in pandas (5 points)" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 2. Checking column types & Converting Column types (10 points)\n", "Take a look at the UFO dataset's column types using the dtypes attribute. Please convert the column types to the proper types.\n", "For example, the date column, which can be transformed into the datetime type. \n", "That will make our feature engineering efforts easier later on." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 3. Dropping missing data (10 points)\n", "Let's remove some of the rows where certain columns have missing values. " ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 4. Extracting numbers from strings (10 points)\n", "The length_of_time column in the UFO dataset is a text field that has the number of \n", "minutes within the string. \n", "Here, you'll extract that number from that text field using regular expressions." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 5. Identifying features for standardization (10 points)\n", "In this section, you'll investigate the variance of columns in the UFO dataset to \n", "determine which features should be standardized. You can log normlize the high variance column." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 6. Encoding categorical variables (20 points)\n", "There are couple of columns in the UFO dataset that need to be encoded before they can be \n", "modeled through scikit-learn. \n", "You'll do that transformation here, using both binary and one-hot encoding methods." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 7. Text vectorization (10 points)\n", "Let's transform the desc column in the UFO dataset into tf/idf vectors, \n", "since there's likely something we can learn from this field." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 8. Selecting the ideal dataset (10 points)\n", "Let's get rid of some of the unnecessary features. " ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 9. Split the X and y using train_test_split, setting stratify = y (5 points)" ] }, { "cell_type": "code", "execution_count": 9, "metadata": { "collapsed": true }, "outputs": [], "source": [ "X = ufo.drop([\"type\"],axis = 1)\n", "y = ufo[\"type\"].astype(str)\n", "\n", "\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 10. Fit knn to the training sets and print the score of knn on the test sets (5 points)" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [ "from sklearn.neighbors import KNeighborsClassifier\n", "knn = KNeighborsClassifier(n_neighbors=5)\n", "# Fit knn to the training sets\n", "knn.fit(train_X, train_y)\n", "# Print the score of knn on the test sets\n", "print(knn.score(test_X, test_y))" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.6.7" } }, "nbformat": 4, "nbformat_minor": 2 }

Homework is Completed By:

Writer	Writer Name	Amount	Client Comments & Rating
ONLINE	Instant Homework Helper 4.8 4305 Orders Completed	$36	She helped me in last minute in a very reasonable price. She is a lifesaver, I got A+ grade in my homework, I will surely hire her again for my next assignments, Thumbs Up! 5.00
Answer.docx Turnitin Report.pdf Contact Writer For Solution Contact Writer For Solution

Order & Get This Solution Within 3 Hours in $25/Page

Custom Original Solution And Get A+ Grades

100% Plagiarism Free
Proper APA/MLA/Harvard Referencing
Delivery in 3 Hours After Placing Order
Free Turnitin Report
Unlimited Revisions
Privacy Guaranteed

Order Now

Order & Get This Solution Within 6 Hours in $20/Page

Custom Original Solution And Get A+ Grades

100% Plagiarism Free
Proper APA/MLA/Harvard Referencing
Delivery in 6 Hours After Placing Order
Free Turnitin Report
Unlimited Revisions
Privacy Guaranteed

Order Now

Order & Get This Solution Within 12 Hours in $15/Page

Custom Original Solution And Get A+ Grades

100% Plagiarism Free
Proper APA/MLA/Harvard Referencing
Delivery in 12 Hours After Placing Order
Free Turnitin Report
Unlimited Revisions
Privacy Guaranteed

Order Now

6 writers have sent their proposals to do this homework:

Writer	Writer Name	Offer	Chat
ONLINE	Engineering Guru I will be delighted to work on your project. As an experienced writer, I can provide you top quality, well researched, concise and error-free work within your provided deadline at very reasonable prices. 4.8 8001 Orders Completed	$43	Chat With Writer
ONLINE	Instant Assignment Writer I have done dissertations, thesis, reports related to these topics, and I cover all the CHAPTERS accordingly and provide proper updates on the project. 4.9 1722 Orders Completed	$31	Chat With Writer
ONLINE	Phd Writer I have written research reports, assignments, thesis, research proposals, and dissertations for different level students and on different subjects. 0 Orders Completed	$18	Chat With Writer
ONLINE	Isabella K. I am a PhD writer with 10 years of experience. I will be delivering high-quality, plagiarism-free work to you in the minimum amount of time. Waiting for your message. 4.9 21 Orders Completed	$36	Chat With Writer
ONLINE	Helping Hand I am a professional and experienced writer and I have written research reports, proposals, essays, thesis and dissertations on a variety of topics. 4.7 1701 Orders Completed	$45	Chat With Writer
ONLINE	A Grade Exams I have written research reports, assignments, thesis, research proposals, and dissertations for different level students and on different subjects. 4.4 1785 Orders Completed	$42	Chat With Writer