Beginner Python assignment
Description
In this project you are going to index a set of documents in an open-source Python search engine
called tinysearch, devise a set of test queries, and evaluate the system on those queries.
Indexing the Documents
• Download the search engine and the corpus from OWL->Resources->Assignment2_Files
to your machine and index the documents (file name: tinysearch.zip).
o Note that this corpus contains dumped Wikipedia documents and is a few years
old.
o There are instructions concerning these steps further down in this document; a rough
sketch of what TF-IDF indexing involves is shown below.
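The exact indexing procedure for tinysearch is given in the instructions mentioned above. Purely as an illustration of what TF-IDF indexing involves, here is a minimal sketch that assumes (hypothetically) the Wikipedia articles are plain-text files in a directory called corpus/ and that scikit-learn is installed; it is not the tinysearch command sequence.

# Minimal TF-IDF indexing sketch (illustration only, not the tinysearch steps).
# Assumes the dumped Wikipedia articles are plain-text files in "corpus/".
from pathlib import Path
from sklearn.feature_extraction.text import TfidfVectorizer

doc_paths = sorted(Path("corpus").glob("*.txt"))    # hypothetical corpus layout
documents = [p.read_text(encoding="utf-8") for p in doc_paths]

vectorizer = TfidfVectorizer(stop_words="english")
doc_vectors = vectorizer.fit_transform(documents)   # one TF-IDF row per document

print(f"Indexed {doc_vectors.shape[0]} documents with "
      f"{doc_vectors.shape[1]} distinct terms")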
Provided corpus
This corpus contains some of the Wikipedia documents made available for mirroring, personal use,
informal backups, off-line use or database queries. All text content is licensed under the GNU
Free Documentation License (GFDL).
Topics and Questions
In this part of the assignment, you are going to think of an application domain (i.e. a subject) that
is of interest to you. For example, you could choose health, politics, sport, geography, music (any
kind), etc. Now, create twenty queries in your chosen domain. For example:
Q1: species and dogs
Q2: Akita dogs
Q3: wolfdog
Note: These are just examples chosen by a student who was interested in dogs. You can choose
any subject provided that:
• it is covered by the documents you are using;
• you can think of some quite difficult queries on your chosen topic.
Retrieval Experiments
Test the performance of the provided search engine using TF-IDF by applying the following
steps:
1. Run the queries, as prepared above, through the system and collect the first ten files (or
so) returned for each (a sketch of what this step involves is given after this list).
2. Compute precision and recall at the following levels of n (where n is the number of
documents considered): n=5, n=10.
3. To do this, for each query you need to look at (for example) the first ten results (i.e. files)
returned and decide for each file whether it is Relevant or Not Relevant. A file is relevant if
it contains the answer to your query. It does not matter where in the file the answer
occurs as long as it is present somewhere.
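As an illustration of step 1, and continuing the indexing sketch from the previous section (it reuses vectorizer, doc_vectors and doc_paths defined there), one common way to rank documents for a single TF-IDF query is by cosine similarity. This is only a sketch of the general technique; tinysearch has its own query interface, described in the instructions mentioned earlier, and the query shown is just one of the example queries above.

# Illustration of step 1 (not the tinysearch interface). Reuses vectorizer,
# doc_vectors and doc_paths from the indexing sketch in the previous section.
from sklearn.metrics.pairwise import cosine_similarity

query = "Akita dogs"                                  # example query from above
query_vector = vectorizer.transform([query])
scores = cosine_similarity(query_vector, doc_vectors).ravel()

# Indices of the ten highest-scoring documents, best first.
top10 = scores.argsort()[::-1][:10]
for rank, idx in enumerate(top10, start=1):
    print(rank, doc_paths[idx].name, round(float(scores[idx]), 3))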
Note that this is not as easy as it sounds since there will be occasions when you are not sure. You
need to make a note of the rationale for making your final decision in cases of doubt.
Computing recall poses a problem in that we need to know for each query all the correct answers
in the collection. Strictly, we cannot know that without inspecting every document in the
collection. At TREC they use a pooling method as discussed in the lecture. To get around the
problem here, simply check the first n documents (n = 20) returned for each query. Count the
number of correct responses there and assume that these are all the correct responses in the
collection. Then use this information to compute recall at n=5 and n=10 as above.
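Under that simplification, precision at n is the number of relevant files in the first n results divided by n, and recall at n is that same count divided by the total number of relevant files found in the top 20. A small Python sketch of this calculation for one query follows; the relevance judgments shown are hypothetical.

# Precision and recall at cutoff n for one query, given binary relevance
# judgments (True = Relevant) for the top 20 results in ranked order.
# Following the simplification above, the relevant files found in the top 20
# are treated as all the relevant files in the collection.
def precision_recall_at_n(judgments, n):
    total_relevant = sum(judgments)          # pooled "all correct answers"
    relevant_in_top_n = sum(judgments[:n])
    precision = relevant_in_top_n / n
    recall = relevant_in_top_n / total_relevant if total_relevant else 0.0
    return precision, recall

# Hypothetical judgments for one query (top 20 results, best first).
judgments = [True, True, False, True, False, False, True, False, False, True,
             False, False, False, False, True, False, False, False, False, False]

for n in (5, 10):
    p, r = precision_recall_at_n(judgments, n)
    print(f"n={n}: precision={p:.2f}, recall={r:.2f}")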
Assignment Report
Write up your results in a short report USING THE TEMPLATE SUPPLIED with the following
headings exactly as shown in the template:
1. Cover page: includes your (formal) name, ID and the program you are currently enrolled in.
2. Topic and Queries
• What topic you chose; how the queries were devised.
3. Indexing the Documents
• How was this done?
• What problems were encountered (if any) and how were they solved?
4. TF-IDF Performance
• Method - short text outlining what you did.
• Results - a table summarizing the numerical results as above (review assignment
2 - appendix 1).
• Discussion - a short description of what the results show (was TF-IDF always
better, always worse or sometimes better/worse?), any interesting problem
cases, any technical problems encountered and so on.
Report Appendix 1
• Include the queries you used for your TF-IDF evaluation and the IDs of the right answers found
for each (if any).
• Example (this is just a sample):
Num   Query         IDs of Answers
1     hot chicken   5003
…
20    chicken       0