Poster
in
Workshop: ICML Workshop on Human in the Loop Learning (HILL)
Effect of Combination of HBM and Certainty Sampling onWorkload of Semi-Automated Grey Literature Screening
JINGHUI LU · Brian Mac Namee
With the rapid increase of unstructured text data, grey literature has become an important source of information to support research and innovation activities. In this paper, we propose a novel semi-automated grey literature screening approach that combines a Hierarchical BERT Model (HBM) with active learning to reduce the human workload in grey literature screening. Evaluations over three real-world grey literature datasets demonstrate that the proposed approach can save up to 64.88% of the human screening workload, while maintaining high screening accuracy. We also demonstrate how the use of the HBM model allows salient sentences within grey literature documents to be selected and highlighted to support workers in screening tasks.