In this assignment, you will run an experiment to study the effects of relevance feedback on the recall, precision and Mean Average Precision (MAP) values of an IR system.

Description

In this assignment, you will run an experiment to study the effects of relevance feedback on the recall, precision and Mean Average Precision (MAP) values of an IR system. The IR system will use a vector space model with cosine similarity (tf-idf weighting). You will run the study on the TIME dataset, provided along with the assignment.
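As a rough illustration of the vector space model described above, the following sketch builds tf-idf vectors and ranks documents by cosine similarity. The weighting variant (log-scaled tf times log10 idf), the function names, and the toy documents are assumptions made for illustration; the assignment does not prescribe them, and tokenization of the TIME collection is left out.

```python
import math
from collections import Counter


def tfidf_vector(tokens, idf):
    """Sparse tf-idf vector as a dict term -> weight (log-scaled tf, an assumed variant)."""
    tf = Counter(tokens)
    return {t: (1 + math.log10(c)) * idf[t] for t, c in tf.items() if t in idf}


def build_index(docs):
    """docs maps doc_id -> list of tokens. Returns (idf table, doc_id -> tf-idf vector)."""
    n = len(docs)
    df = Counter()
    for tokens in docs.values():
        df.update(set(tokens))  # count each term once per document
    idf = {t: math.log10(n / df_t) for t, df_t in df.items()}
    vectors = {d: tfidf_vector(tokens, idf) for d, tokens in docs.items()}
    return idf, vectors


def cosine(u, v):
    """Cosine similarity of two sparse vectors; 0.0 if either has zero length."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query_vec, vectors, k):
    """Return the k doc ids with the highest cosine score (ties broken by doc id)."""
    scored = sorted(((cosine(query_vec, v), d) for d, v in vectors.items()),
                    key=lambda pair: (-pair[0], pair[1]))
    return [d for _, d in scored[:k]]


# Toy data, not taken from the TIME collection.
docs = {
    "d1": ["nuclear", "treaty", "talks"],
    "d2": ["soviet", "treaty"],
    "d3": ["football", "match"],
}
idf, vectors = build_index(docs)
query_vec = tfidf_vector(["nuclear", "treaty"], idf)
top = rank(query_vec, vectors, k=2)
```

Storing the idf table and per-document vectors in the index means a query only needs to be weighted once and compared against precomputed vectors.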

Part A: Cosine Similarity and Rocchio’s algorithm [40pts]

You will implement a cosine similarity measure with tf-idf weighting. Your index should contain the information needed to calculate the cosine similarity measure, such as tf and idf values. You may reuse code from the previous assignments as needed.
Implement the Rocchio algorithm for query refinement. Your system should display results and then prompt the user to provide positive and negative feedback. Use α = 1, β = 0.75, and γ = 0.15 as parameters for Rocchio's algorithm.
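The query refinement step can be sketched as follows, assuming the standard Rocchio form: the new query is the original query scaled by the first parameter, plus the centroid of the positive-feedback documents scaled by the second, minus the centroid of the negative-feedback documents scaled by the third. The defaults use the values 1, 0.75 and 0.15 given above. The function name, the sparse-dict representation, and the choice to drop terms whose weight falls to zero or below are assumptions, not requirements of the assignment.

```python
from collections import defaultdict


def rocchio(query, positive, negative, alpha=1.0, beta=0.75, gamma=0.15):
    """Return a refined query vector (dict term -> weight).

    query    -- sparse tf-idf vector of the current query
    positive -- list of sparse vectors the user marked relevant
    negative -- list of sparse vectors the user marked non-relevant
    """
    new_query = defaultdict(float)
    for term, weight in query.items():
        new_query[term] += alpha * weight
    # Add the positive centroid and subtract the negative centroid.
    for docs, coeff in ((positive, beta), (negative, -gamma)):
        if not docs:
            continue  # no feedback of this kind in this iteration
        for vec in docs:
            for term, weight in vec.items():
                new_query[term] += coeff * weight / len(docs)
    # Assumed convention: negative (and zero) weights are dropped.
    return {t: w for t, w in new_query.items() if w > 0}


# Toy vectors for illustration only.
refined = rocchio(
    query={"a": 1.0},
    positive=[{"a": 1.0, "b": 2.0}],
    negative=[{"b": 1.0, "c": 1.0}],
)
```

The returned dictionary is the list of new query terms and weights, so it can be printed directly for each feedback iteration.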

Part B: Experimental study [35pts]

Run your system for at least 3 queries from the test bed. Pick queries that have 5 or more relevant documents (see the TIME.REL file). For each query, perform five rounds of relevance feedback and plot the change in precision, recall and MAP.
Prepare a report on the experimental study that provides at least the following details for each query:

o Query text and ID (provided in the testbed)
o Precision, recall and MAP values of the query
o IDs of the documents given as positive and negative feedback during each iteration of the Rocchio algorithm
o The terms of the new query and their weights for each iteration of the Rocchio algorithm

Your report will have 3 plots (precision vs Rocchio iteration, recall vs Rocchio iteration, and MAP measure vs Rocchio iteration) that depict the progressive change in the performance values over the iterations of the Rocchio algorithm, for the three queries.
Also discuss any query drift that you may observe in your results.
Note: The queries provided in the testbed have varying numbers of relevant documents (see the TIME.REL file). This can be a problem when calculating the performance values if k is kept constant during retrieval. For the experimental study, assume that the number of relevant documents is provided to the system along with the query. In other words, the value of k will change with the query.
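A minimal sketch of the performance values under this convention, with k set to the number of relevant documents for the query. The function names and toy rankings are illustrative assumptions, and average precision is normalized by the number of relevant documents, which is the common definition but is not spelled out in the assignment. Note that when k equals the number of relevant documents, precision and recall at k are the same number.

```python
def precision_recall_at_k(ranked, relevant, k):
    """Precision and recall over the top-k ranked doc ids."""
    hits = sum(1 for d in ranked[:k] if d in relevant)
    precision = hits / k if k else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


def average_precision(ranked, relevant, k):
    """Mean of precision values at each rank where a relevant doc appears,
    normalized by the total number of relevant docs (assumed convention)."""
    hits = 0
    total = 0.0
    for i, d in enumerate(ranked[:k], start=1):
        if d in relevant:
            hits += 1
            total += hits / i
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(runs):
    """runs is a list of (ranked, relevant) pairs, one per query; k = len(relevant)."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel, len(rel)) for r, rel in runs) / len(runs)


# Toy rankings and relevance sets, not taken from TIME.REL.
ranked = ["d1", "d5", "d2", "d9"]
relevant = {"d1", "d2", "d3"}
k = len(relevant)  # k changes with the query
p, r = precision_recall_at_k(ranked, relevant, k)
ap = average_precision(ranked, relevant, k)
map_score = mean_average_precision([(ranked, relevant), (["a", "b"], {"a", "b"})])
```

Recomputing these values after each Rocchio iteration gives the data points for the three plots.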

Part C: Pseudo Relevance Feedback [25pts]
