I’ve been preparing a talk for ESSIR and it’s been great thinking through some information retrieval problems to talk about. Two of the are linked to quality and not quantity of evidence. If you do a search and find 1000 results, most people will find that 950 too many… So, it’s important that the top results are the best. Our algorithm attempts to do it and it does it pretty well, but it could be better.
I found myself looking at the search for prostate cancer screening which generated over 12,000 results. Of the top ten, 6 are ‘evidence based’ (ie the top green layer) while 4 are journal articles (these get boosted as they’re published in the last two years). This struck me as a bit odd as we emphasise the secondary evidence side of things… So, what if we did things differently:
- A search for prostate cancer screening, restricted to the title words and restricted to just secondary evidence = 119 results. If you add in ‘Key primary research that jumps to 167 results.
- As above but restrict to the last 5 years and it’s 35 results.
- A general search for prostate cancer screening, organised by relevancy, restricted to secondary evidence and the last 5 years = 1,352 results, restricted to the last 3 years and it’s 994 results.
In the above examples the search is very specific, removes the lower quality evidence and the user loses seeing some potential journal articles – which they can navigate to if they wish.
For information specialists – who typically want all results – we could allow that. But for busy clinicians we could allow some way of only returning a fraction of the results with a focus on higher quality evidence.
Discuss!
Recent Comments