Supervised by: Ministry of Culture of PRC

Sponsored by:National Library of China
  Library Society of China

ISSN 1001-8867    CN 11-2746/G2

Investigating Weak Supervision in Deep Ranking

Abstract: A number of deep neural networks have beenproposed to improve the performance of documentranking in information retrieval studies. However,the training processes of these models usually needa large scale of labeled data, leading to data shortagebecoming a major hindrance to the improvement ofneural ranking models’ performances. Recently, severalweakly supervised methods have been proposed toaddress this challenge with the help of heuristics or users’interaction in the Search Engine Result Pages (SERPs)to generate weak relevance labels. In this work, weadopt two kinds of weakly supervised relevance, BM25-based relevance and click model-based relevance, andmake a deep investigation into their differences in thetraining of neural ranking models. Experimental resultsshow that BM25-based relevance helps models capturemore exact matching signals, while click model-basedrelevance enhances the rankings of documents that maybe preferred by users. We further proposed a cascaderanking framework to combine the two weakly supervisedrelevance, which significantly promotes the rankingperformance of neural ranking models and outperformsthe best result in the last NTCIR-13 We Want Web (WWW)task. This work reveals the potential of constructing betterdocument retrieval systems based on multiple kinds of weak relevance signals.

Keywords: document ranking, ad hoc retrieval, neuralranking model, weak supervision.


怀安县| 穆棱市| 怀集县| 新巴尔虎右旗| 嘉义县| 思南县| 沿河| 安溪县| 和平县| 大兴区| 颍上县| 泉州市| 津市市| 商南县| 于田县| 陵川县| 扎赉特旗| 洪湖市| 昭觉县| 绿春县| 镇坪县| 临潭县| 财经| 抚宁县| 赤峰市| 客服| 泾源县| 扎囊县| 昆明市| 徐闻县| 嵊州市| 同江市| 绿春县| 古田县| 北票市| 苗栗县| 英吉沙县| 油尖旺区| 汉沽区| 融水| 日照市|