TY - JOUR
T1 - Classifying search queries using the Web as a source of knowledge
AU - Gabrilovich, Evgeniy
AU - Broder, Andrei
AU - Fontoura, Marcus
AU - Joshi, Amruta
AU - Josifovski, Vanja
AU - Riedel, Lance
AU - Zhang, Tong
PY - 2009/4/1
Y1 - 2009/4/1
AB - We propose a methodology for building a robust query classification system that can identify thousands of query classes, while dealing in real time with the query volume of a commercial Web search engine. We use a pseudo relevance feedback technique: given a query, we determine its topic by classifying the Web search results retrieved by the query. Motivated by the needs of search advertising, we primarily focus on rare queries, which are the hardest from the point of view of machine learning, yet in aggregate account for a considerable fraction of search engine traffic. Empirical evaluation confirms that our methodology yields a considerably higher classification accuracy than previously reported. We believe that the proposed methodology will lead to better matching of online ads to rare queries and overall to a better user experience.
KW - Pseudo relevance feedback
KW - Query classification
KW - Web search
UR - http://www.scopus.com/inward/record.url?scp=70349742467&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349742467&partnerID=8YFLogxK
U2 - 10.1145/1513876.1513877
DO - 10.1145/1513876.1513877
M3 - Article
AN - SCOPUS:70349742467
SN - 1559-1131
VL - 3
JO - ACM Transactions on the Web
JF - ACM Transactions on the Web
IS - 2
M1 - 5
ER -