Loading...

Messages

Proposals

Stuck in your homework and missing deadline? Get urgent help in $10/Page with 24 hours deadline

Get Urgent Writing Help In Your Essays, Assignments, Homeworks, Dissertation, Thesis Or Coursework & Achieve A+ Grades.

Privacy Guaranteed - 100% Plagiarism Free Writing - Free Turnitin Report - Professional And Experienced Writers - 24/7 Online Support

Information Retrieval Techniques Problem Solving Task

14/04/2020 Client: azharr Deadline: 24 Hours

Home

Business & Finance homework help

Operations Management homework help

Report Issue

Question 1: (15 marks)


Suppose you have joined a search engine development team to design a search algorithm based on both the Vector model and the Boolean model. You have collected the following documents (unstructured) and plan to apply an index technique to convert them into an inverted index.


Doc 1:Information retrieval is the activity of obtaining information resources relevant to an information need from a collection of information resources. Searches can be based on full-text or other content-based indexing.


Doc 2:Information retrieval is finding material of an unstructured nature that satisfies an information need from within large collections


Doc 3:Information systems is the study of complementary networks of hardware and software that people and organizations use to collect, filter, process, create, and distribute data. In the process of creating the inverted index, please complete the following steps:


a. Remove all stop words and punctuation, and then apply Porter’s stemming algorithm to the documents. The list of stop words for this task is provided as follows: Is, The, Of, To, An, A, From, Can, Be, On, Or, That, Within, And, Use


b. Create a merged inverted list including the within-document frequencies for each term.


c. Use the index created in part (b) to create a dictionary and the related posting file.


 d. You may like to test the inverted index by using the following keywords: information, system, index


 e. Please design three Boolean queries, (for example, web AND search) and list the relevant documents for each query.


 f. Please use the Vector model to query on the inverted index, and compare the result with the Boolean model. (Hint: you can use cosine similarity and set a similarity threshold). Question 2 (IR Evaluation) (15 marks)




Question 2 (IR Evaluation) (15 marks)


In this question, you are required to evaluate the performance of different search engines.


 First, please find two search engines you are familiar with, such as Google, Bing, Yahoo!, etc.


 Second, please choose one target from the following list, and design two queries to search in both search engines. So both query 1 and query 2 have to be tested in both search engines.


 i. Target 1: obtain the course information for S779.


 ii. Target 2: obtain the price of the new Samsung Tablet.


 iii. Target 3: obtain the manual of installing tera term.


 iv. Target 4: obtain the oracle SQL tutorial.


 v. Target 5: obtain the price of new Xbox one.


 Third, select the first 20 results in both search engines, if they return the target, then mark them as relevant documents, otherwise, they are irrelevant. Assume that you have 14 relevant documents in total (retrieved and not-retrieved).


 The following questions are based on your search results.




a) List your target, results and designed search queries (You can use any keywords you think are related to the target).




Get the precision and recall values for 20 documents for query 1 in search engine 1. Interpolate them to 11 standard recall levels. Then plot them into a chart.




Get the precision and recall values for 20 documents for query 2 in search engine 1. Interpolate them to 11 standard recall levels. Then plot them into the same chart as above.




Now find the average precision of query 1 and query 2 for search engine 1 and plot it into the same chart.


So you will have total of 3 curves in one single chart.




b) List your target, results and designed search queries


 Get the precision and recall values for 20 documents for query 1 in search engine 2. Interpolate them to 11 standard recall levels. Then plot them into a chart.


 Get the precision and recall values for 20 documents for query 2 in search engine 2. Interpolate them to 11 standard recall levels. Then plot them into the same chart as above.




Now find the average precision of query 1 and query 2 for search engine 2 and plot it into the same chart. So, you will have total of 3 curves in one single chart, separate to that of part (a). Plot the averages for Search Engine 1 and Search Engine 2 on a separate chart, and compare the algorithms in terms of precision and recall. Which search engine do you think is superior? Why?

Homework is Completed By:

Writer Writer Name Amount Client Comments & Rating
Assignment Hut

ONLINE

Assignment Hut

$45
Top quality material

Order & Get This Solution Within 3 Hours in $25/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 3 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 6 Hours in $20/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 6 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 12 Hours in $15/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 12 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

6 writers have sent their proposals to do this homework:

Assignment Hut
Writer Writer Name Offer Chat
Assignment Hut

ONLINE

Assignment Hut

Please share further details to proceed.

$45 Chat With Writer

Let our expert academic writers to help you in achieving a+ grades in your homework, assignment, quiz or exam.

Similar Homework Questions

Recrystallization of benzoic acid - Juke box love song analysis - Cyber Security - Spanish commands with pronouns worksheet - Falling head test lab report - The segment begins with an unfortunate verbal error - Amanda todd summary - Assessment from head to toe - Boc liquid nitrogen storage tanks - Cloud computing thomas erl pdf - Eu good distribution practice - Meridian 1 option 11c - Chapter 1 introduction to organizational behavior - Blc sharp essay examples - Negative influence of music on youth - Determine whether the block shown is in equilibrium - 87.5 as a fraction - The benefits of change management mgt 362 - Four seasons organizational structure - Power Point Questions - Walgreens inventory turnover - Breaking down a topic brainstorming - The kite runner chapter 8 9 summary - PSY/301 Week 2 Presentation...1 Slide and a couple paragraphs - Pelican paper inc and timberland forest - Amazing grace keyboard letters - Because you loved me movie - Orbital weld head clearance - Technical memo template word - Rubber band racer design - Theodor schwann cell theory facts - Little animals activity centre - Flvs english 4 honors answers - Persuasive speech why you should eat breakfast - Introduction to humanities arts and social science - Assignment 2 block business letter - Order 2123916: Identify and discuss in a coherent manner some of the ways that Aeneas as a warrior and family man in the Aeneid differs from Hektor in the Iliad. - Improving vocabulary skills 4th edition answer key - Aws solution architect professional dumps - North walsham junior school - Csi web adventures rookie training - Four lenses of wellness - What is the sacrament of healing - Forensic science chapter 5 pollen and spore examination review answers - According to renaissance philosophy commoners often represent - Ethical dilemmas in organization development - How to divide a whole number with a decimal - Vosburgh electronics corporation balance sheet - Interpretation project 2 bibl 110 - Which of the following statements is true of utilitarianism - Critical ethnography in educational research a theoretical and practical guide - Nissan maxima wont crank - Naap basic coordinates and motions answer key - Hum 100 worksheet cultures and artifacts - Discussion - Rapid storage technology enterprise - Monica allende sunday times - What is an acceptable safeassign score - Financial reporting problem ii part 1 - The farm life inside angola prison summary - Crown napkin folding procedure - Initiating the Project-1 - Written observation example - Northwind database csv - What is the bond's yield to maturity (expressed as an apr with semiannual compounding)? - Develop a resource schedule in the loading chart - Chase sapphire creating a millennial cult brand - Rockwell integrated architecture builder - How to write a technical description of an object - Social Studies and the Arts Unit Plan - Rear view camera installation diagram - English_Double Entry Journal - Trifles - Kirchhoff's voltage law khan academy - Gut directed hypnotherapy sydney - Ee waddell language academy - Meadow heights community centre - Escape from goblin town pdf - Measurements and calculations worksheet - Charlotte the harlot lay dying - Poetry should ride the bus - NEED A REWRITE ON WEEK ONE Quality Control - Eagle arduino mega 2560 - Squares square roots cubes and cube roots pdf - The main reason suppliers can offer quantity discounts is that - Areola serve para plantar grama - I can feel the love can you feel it to - Cu otf 2 py 4 - Properties of metals nonmetals and metalloids - Ethical choices in business situations are most often made - Timeshare exchange fair - Darkroom health and safety - Fiberglass chemical resistance guide - Tyler junior college dorms - Anime kaika discount code - Data at rest threats - Difference between paraphrasing and summarising - Research week 8 erm - Hollanders model of personality - First angle of projection - Student exploration gravitational force worksheet answers