Loading...

Messages

Proposals

Stuck in your homework and missing deadline? Get urgent help in $10/Page with 24 hours deadline

Get Urgent Writing Help In Your Essays, Assignments, Homeworks, Dissertation, Thesis Or Coursework & Achieve A+ Grades.

Privacy Guaranteed - 100% Plagiarism Free Writing - Free Turnitin Report - Professional And Experienced Writers - 24/7 Online Support

Mapr sandbox download

12/11/2020 Client: papadok01 Deadline: 24 Hours

8Lab 3: Hive

For this lab,

1. You will investigate how Hive works; create, load, query and store data in Apache Hive in

both our HU cloud platform and MapR sandbox.

2. You will compare the Hive performance between our HU cloud platform and MapR sandbox.

* Most contents are coming from https://learn.mapr.com/ by the permission of MapR

technology.

Prerequisite:

To our HU cloud platform,

For Hadoop Cluster Overview,

http://hdfs-namenode-hadoop.apps.myhu.cloud/dfshealth.html#tab-overview

For accessing Hadoop cluster nodes,

https://master1.myhu.cloud:8443/console/project/hadoop/browse/pods

User Name : hadoop, passwd: hadoop

To access the terminal of name node, please, click “hdfs-namenode-0” and then click

“Terminal”.

1. Create the folder named with your student id and work under the folder created.

2. To upload file, please use “wget” or other commands you like.

3. If you have any problem/issue with HU cloud, please report it to your submission and

use google cloud or Amazon cloud

4. If you have any problem/issue with HU cloud, google cloud or amazon cloud, please

report it to your submission and work only with MapR sandbox.

For using MapR sandbox,

Please, download the one of the MapR sandboxes listed below.

• VMware Course Sandbox: http://package.mapr.com/releases/v5.1.0/sandbox/MapR-

Sandbox-

https://learn.mapr.com/
http://hdfs-namenode-hadoop.apps.myhu.cloud/dfshealth.html#tab-overview
https://master1.myhu.cloud:8443/console/project/hadoop/browse/pods
For-Hadoop-5.1.0-vmware.ova

• VirtualBox Course Sandbox: http://package.mapr.com/releases/v5.1.0/sandbox/MapR-

Sandbox-

For-Hadoop-5.1.0.ova

For the installation, please refer to https://mapr.com/docs/52/SandboxHadoop/c_sandbox_overview.html

Logging in to the Command Line

● Before you get started, you'll want to have the IP address handy for your Sandbox VM.

See the screenshot below for an example of where to find that.

● Next, use an SSH client such as Putty (Windows) or Terminal (Mac) to login. See below

for an example:

● use userid: user01 and password: mapr.

● For VMWare use: $ ssh user01@ipaddress

● For Virtualbox use: $ ssh user01@127.0.0.1 -p 2222

For MapR sandbox,

Connect to the Hive CLI

The lab file contains data and source code you will use to complete the lab exercises.

1. Log in to your cluster as user01 (password is mapr).

2. Position yourself in the /user/user01 directory in the cluster file system:

$ cd /mapr/MyCluster/user/user01

3. Then, download and unzip the lab files

$ wget http://course-files.mapr.com/DA4400-R1/DA440-LabFiles.zip

$ unzip DA440-LabFiles.zip

Connect to the Hive Shell

1. Run the Hive shell program by typing hive in a terminal connected to the MapR Sandbox.

[user01@maprdemo ~]$ hive

hive>

Note: You are now in the Hive CLI. For the remainder of this lab, > is prompting a Hive

command in the Hive CLI, while $ is prompting a bash command in your terminal.

You may find it useful to type commands in a text editor rather than typing commands

directly into the command line. Copy and pasting commands from a text editor will allow you

to edit and save them, as well as allowing you to write longer queries.

2. Use the SHOW FUNCTIONS command to list available Hive Query Language functions.

3. Type the following SQL data definition in the Hive shell:

CREATE TABLE ebay.auction

(openingBid FLOAT,finalBid FLOAT,itemType STRING,days INT)

ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';

4. Load the eBay auction data with the following command:

LOAD DATA LOCAL INPATH

'file:///user/user01/DA440-LabFiles/auctiondata.csv'

INTO TABLE ebay.auction;

Submission!

5. Try querying this data with SQL commands you know. Do they work as you expect them to?

Create a Database

1. Use Hive data definition language (DDL) to create a database. Create the database in your home

directory, and name it the same as your user name. For example:

> CREATE DATABASE user01 LOCATION

'/user/user01/hive/user01.db';

2. Use the SHOW DATABASES command to list all the databases available in this Hive instance.

hive> SHOW DATABASES;

OK

default

user01

Time taken: 0.127 seconds, Fetched: 2 row(s)

You should see your new user01 database now.

Note: Hive Query Language (HQL) commands are shown in upper case. This is a

convention, not a requirement. HQL commands are case-insensitive, and may be

written in either upper or lowercase. All HQL statements must end with a semicolon.

3. Quit the Hive shell, and look at the database from your bash shell:

hive> quit;

$ hadoop fs -ls /user/user01/hive

You should see the user01 database.

Create a Simple Table

1. Log back in to the Hive shell:

$ hive

hive >

2. Create a location table inside the user01 database, with the characteristics listed below. Create a

location table with the following characteristics:

• A station column, of type string

• A latitude column, of type integer

• A longitude column, of type integer

• A row format of delimited

• Fields terminated by comma

• Lines terminated by the line feed character

• Stored as a text file

3. Show the tables in your database:

hive> SHOW TABLES IN user01;

4. Show the characteristics of the table:

Submission: Command and the result of command (Screen capture)

5. Drop the table:

hive: DROP TABLE user01.location;

6. Recreate the table, but this time name the second column city instead of latitude.

7. Show the table you created.

8. Rename the city column to latitude:

hive> ALTER TABLE user01.location CHANGE COLUMN city latitude INT;

Create Partitioned and External Tables

Partitioning data can speed up queries and optimize results. Create the windspeed table as a partitioned

table with the following characteristics

• A year column, of type integer

• A month column, of type string

• A knots column, of type float

• A partition using station as the column, of type string

• Delimited row format

• Fields terminated by comma

• Lines terminated by linefeed

• Stored as a text file

Create an External Table

Create an external table called temperature that uses a text file stored in your lab files folder, with the

following characteristics:

• A station column, of type string

• A year column, of type integer

• A month column, of type string

• A celsius column, of type float

• Delimited row format

• Fields terminated by comma

• Lines terminated by linefeed

• Stored as a text file

• A location pointing to the temperature folder in your lab files folder

Submission

Screen Capture for SELECT * FROM user01.temperature LIMIT 10;

Load Data into Tables

1. Use LOAD DATA to load the location table. Remember to replace user01 with your own

userID in the file path and the database notation if necessary.

2. Load data into the partitioned table, windspeed. Since this table is partitioned, we’ll have to add

the PARTITION clause to the LOAD DATA command.

3. You can also explore the warehouse directory using Hadoop FS commands to see

how the partitioned table is laid out. Exit Hive using QUIT;, then enter:

$ hadoop fs -ls /user/user01/hive/user01.db/windspeed

Submission

Screen Capture for $ hadoop fs -ls /user/user01/hive/user01.db/windspeed

Examine Databases and Tables

The location, windspeed and temperature tables should have data in them. If you are

familiar with SQL, run some basic queries on these tables. Here are some queries to try:

Submission

Screen Capture for SELECT * FROM location;

Screen Capture for SELECT count(*) FROM windspeed;

Screen Capture for SELECT * FROM windspeed LIMIT 20;

Screen Capture for SELECT * FROM temperature WHERE year = 2000;

Query Data with SELECT

1. First, explore the temperature table. This table holds the average monthly temperatures, in

degrees Celsius, from eight different weather stations in Antarctica from several decades.

2.Let’s look at all the temperatures from January, 1970, which is the time when Unix time began:

Submission

Command and Screen capture for the command

2. Let’s try the same query, but for July, when it is winter in Antarctica:

Submission

Command and Screen capture for the command

3. The name of the weather station at the South Pole is called Clean Air, because very little manmade

pollution can be found there. Let’s find the temperatures in July at the South Pole:

Submission

Command and Screen capture for the command

4. Find the average temperature in Antarctica in 1970:

Submission

Command and Screen capture for the command

5. Run #4 in HU Cloud

Submission

Compare the performance between MapR and HU Cloud.

5. Find the hottest and coldest temperatures recorded in Antarctica:

Submission

Command and Screen capture for the command

6. Run #5 in HU Cloud

Submission

Compare the performance between MapR and HU Cloud.

Homework is Completed By:

Writer Writer Name Amount Client Comments & Rating
Instant Homework Helper

ONLINE

Instant Homework Helper

$36

She helped me in last minute in a very reasonable price. She is a lifesaver, I got A+ grade in my homework, I will surely hire her again for my next assignments, Thumbs Up!

Order & Get This Solution Within 3 Hours in $25/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 3 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 6 Hours in $20/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 6 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 12 Hours in $15/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 12 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

6 writers have sent their proposals to do this homework:

Custom Coursework Service
Finance Homework Help
Writer Writer Name Offer Chat
Custom Coursework Service

ONLINE

Custom Coursework Service

Hey, Hope you are doing great :) I have read your project description. I am a high qualified writer. I will surely assist you in writing paper in which i will be explaining and analyzing the formulation and implementation of the strategy of Nestle. I will cover all the points which you have mentioned in your project details. I have a clear idea of what you are looking for. The work will be done according to your expectations. I will provide you Turnitin report as well to check the similarity. I am familiar with APA, MLA, Harvard, Chicago and Turabian referencing styles. I have more than 5 years’ experience in technical and academic writing. Please message me to discuss further details. I will be glad to assist you out.

$55 Chat With Writer
Finance Homework Help

ONLINE

Finance Homework Help

I have a Master’s degree and experience of more than 5 years in this industry, I have worked on several similar projects of Research writing, Academic writing & Business writing and can deliver A+ quality writing even to Short Deadlines. I have successfully completed more than 2100+ projects on different websites for respective clients. I can generally write 10-15 pages daily. I am interested to hear more about the project and about the subject matter of the writing. I will deliver Premium quality work without Plagiarism at less price and time. Get quality work by awarding this project to me, I look forward to getting started for you as soon as possible. Thanks!

$55 Chat With Writer

Let our expert academic writers to help you in achieving a+ grades in your homework, assignment, quiz or exam.

Similar Homework Questions

Lopez corporation incurred the following costs while manufacturing its product - Variable speed drive wiki - Water diviner crossword clue - Is the character of Iago in the play Othello a literary example of a psychopath? - Political engagement activity example - Nuruzzaman rutgers - Access Control - IS - 1600 mm to m - Why does the afc curve continually decline - Explain how big box retailers are logistical trendsetters - Sustainable project - Controllability and observability matlab - Week 10-2 please proof read and check errors and grammars - How similar are the observed phenotypes in each replicate - Professional development - The guitarist picasso 1965 - Epistemology Assignment - Business and Society 400 word essay - Book of genesis paintings - Tanglewood nursing home horncastle - A marketing mix typically involves - Case study 8 1 klm airlines - Unit 6 Assignment: Scholarly Source - Volkswagen ethical issues case study - Isbn 978 0 13 214911 2 - Borderline symptom checklist scoring - Indra nooyi a transcultural leader - Mrs. J is repeatedly asking for a nurse; other patients are complaining, and you simply cannot be available to Mrs. J for long periods. Considering the setting and the OBRA guidelines, what would you do to manage the situation? - Chargaff's dna data 4.4 answers - Swimming - Egg stores v leibovici - Mary watson is 24 years old and single - The lean startup page count - What is a soliloquy - All my own work module 5 answers - Fallacies in thank you for smoking - Quartz sandstone is changed during metamorphism into - Renaming fractions as decimals - Statistical studies statistical investigations student activity sheet 1 - High crags primary school - Case Study - Thermo scientific material safety data sheet - Camscanner history - Recruitment Project - In behavior modification a research design is used to - Australian early development cencus - A government that is formally limited by laws and rules - Change cell to 20pt in excel - Management planning worksheet mgt 521 - Biking vectors mastering physics - Capital brewing dan murphys - What organelle stores material within the cell - Coulomb's law experiment lab report conclusion - The adventures of Sherlock Holmes research paper - Critical visions in film theory pdf - Tupac and my non thug life - Lfpf - Double pole double throw relay schematic - Wanniassa high school senior campus - Business - Discussion social influence - Glow stick experiment water - Discussion Board.... Due In 2 Hours - Gagne model of learning - Who offers Best Social Psychology Writing Services? - Primitive rule of the templars - Shadow health focused exam cough - Pit and crevice corrosion - Apa 7 referencing unimelb - Http www epa gov oar oaqps gooduphigh - What are the benefits of participative budgeting - 0.192 as a fraction - 3 pages essay - How does priam change in ransom - P&g food facts for home student worksheet 4 - Calypso offers odysseus immortality quote - Daft organization theory and design pdf - Nasal polyps treatment in homeopathy - Kirks build passion led us here - How many moles are in 22 grams of argon - I heard that oxygen and magnesium got together - Monty python and the holy grail wooden rabbit - Dod dd form courier authorization - Training and development in banking sector ppt - Iq stephenson house address - PSC1515 Miami Florida is considered ground zero for climate change, in particular rising seas will not only drown coastal sections of the city but will disrupt our local supply of drinking water. - London circuit commercial court guide - Ducal pottery - Mcbride & the ride no more crying - Byte of accounting inc answers - Enterprise risk management syllabus - Dulce et decorum est technique analysis - W9 bus - Review questions - How kimberly clark keeps client costco in diapers - Anti sex league 1984 quotes - The odyssey plot diagram - Nursing informatics scope and standards of practice 2nd 15 - Millivolt drop test for busbar - Student information system rfp