Improving results for the 2009 and 2010 INEX focused tasks.

Published Date

2011-08

Type

Thesis or Dissertation

Abstract

Information retrieval systems aim to retrieve precise and relevant information in response to a user's query. In the past, entire documents judged relevant or highly correlated with the query were returned to users. However, with the growth of the web and of large collections of XML documents, smaller elements or passages can be returned instead, giving more precise results. This thesis describes Flex, our system for dynamic element retrieval, wherein XML elements rather than entire documents are retrieved and returned to the user. It also gives an overview of the process of generating highly correlated elements from a large document collection for a set of queries. The aim of this thesis is to improve the results for the INEX 2009 and 2010 Ad Hoc Focused Tasks. The Focused Tasks require that each query return a result set of non-overlapping elements. This thesis describes the techniques involved in producing such elements and compares the results produced.
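
The non-overlap requirement mentioned in the abstract can be illustrated with a small sketch. This is not the thesis's method or the Flex implementation; it is one simple greedy strategy, assuming each retrieved element is identified by a document id and an XPath-like path, and that two elements overlap when they are the same element or one is an ancestor of the other. The names `remove_overlap`, `overlaps`, and `is_ancestor`, and the sample paths and scores, are hypothetical.

```python
from typing import Dict, List, Tuple

# (doc_id, element_path, score); the path format is an assumption for illustration.
Result = Tuple[str, str, float]


def is_ancestor(a: str, b: str) -> bool:
    """Return True if path a is a proper ancestor of path b."""
    return b.startswith(a + "/")


def overlaps(a: str, b: str) -> bool:
    """Two elements overlap if they are identical or one contains the other."""
    return a == b or is_ancestor(a, b) or is_ancestor(b, a)


def remove_overlap(ranked: List[Result]) -> List[Result]:
    """Greedy filter: visit elements by descending score and keep an element
    only if it does not overlap an element already kept from the same document."""
    kept: List[Result] = []
    kept_paths: Dict[str, List[str]] = {}
    for doc_id, path, score in sorted(ranked, key=lambda r: r[2], reverse=True):
        paths = kept_paths.setdefault(doc_id, [])
        if any(overlaps(path, p) for p in paths):
            continue  # an ancestor, descendant, or duplicate is already in the result set
        paths.append(path)
        kept.append((doc_id, path, score))
    return kept


if __name__ == "__main__":
    # Hypothetical ranked list for one query.
    sample = [
        ("d1", "/article[1]/sec[2]", 0.9),
        ("d1", "/article[1]/sec[2]/p[1]", 0.8),
        ("d1", "/article[1]", 0.7),
        ("d1", "/article[1]/sec[3]", 0.6),
        ("d2", "/article[1]", 0.5),
    ]
    for row in remove_overlap(sample):
        print(row)
```

Under this strategy the highest-scoring element on any root-to-leaf path wins; other policies (for example, preferring the smallest or the largest element) are equally possible and would change which elements survive.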

Description

University of Minnesota M.S. thesis. August 2011. Major: Computer science. Advisor: Dr. Carolyn J. Crouch. 1 computer file (PDF); iv, 52 pages.

Suggested citation

Acquilla, Natasha Deepak. (2011). Improving results for the 2009 and 2010 INEX focused tasks. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/116857.
