Annual report 2013

Users can judge the precision of an IR system by inspecting the retrieved documents. They cannot judge recall, however, which would involve inspecting the whole collection, thus obviating the IR system, and impossible in the case of WWW. How then, can we ascertain whether recall is a valid metric fo...

Full description

Saved in:  
Bibliographic Details
Authors: Hoenkamp, E. C. M. (Author) ; Vugt, H. C. van (Author)
Format: Electronic Article
Language:English
Published: 2001
In: Annual report
Year: 2001
Online Access: Volltext (kostenfrei)
Journals Online & Print:
Drawer...
Check availability: HBZ Gateway
Description
Summary:Users can judge the precision of an IR system by inspecting the retrieved documents. They cannot judge recall, however, which would involve inspecting the whole collection, thus obviating the IR system, and impossible in the case of WWW. How then, can we ascertain whether recall is a valid metric for the quality of an IR system as perceived by the end-user? In a carefully controlled experiment we presented users with a simulated web search engine. Besides the search results, the engine could give a (spurious) recall estimate, presented as a pie chart. We manipulated this recall feedback and whether the information need was fulfilled with respect to quantification type (the number of documents requested). The results reported here suggest that whereas recall may be a valid metric for designers and evaluators of IR systems it may lack validity as a metric for search quality as perceived by the end-user
ISSN:0313-4474