Not Logged In

Community Question Retrieval in Health Forums

Full Text: BHI17-1.pdf PDF

Community Question Answering (CQA) has emerged as a popular type of service enabling users to ask and answer questions, and access the existing knowledge base. CQ archives contain a lot of useful user-generated content and have been recognized as important information resources for the web. To improve accessibility to this body of knowledge in CQA archives, effective and efficient question retrieval is required. Question retrieval in a CQA archive aims to identify and retrieve existing questions that are relevant to new user questions. The objective of this study is to develop a question retrieval system that can sift through such forums and identify existing questions which are most similar to the userprovided question. We focus on health forums, and propose a CQA system using weighted TF-IDF, relevance heuristics, and term expansion. We compare our proposed algorithm against other well-known methods, and demonstrate that our method outperforms the Latent Dirichlet allocation (LDA) topic model, Latent Semantic Indexing (LSI), language model based information retrieval, BM25, vector space, Word2Vec, and semantic similarity approaches. Our initial experiments use datasets from the IEEE Healthcare Data Analytics Challenge 2015, and we also present our efforts towards development of a Bronze Standard for question similarity evaluation using self-annotations and annotations provided by affiliates of Mayo Clinic.

Citation

H. Samuel, M. Kim, S. Prabhakar, M. Jabbar, O. Zaiane. "Community Question Retrieval in Health Forums". International Conference on Biomedical and Health Informatics, Orlando, USA, February 2017.

Keywords:  
Category: In Conference
Web Links: Webdocs

BibTeX

@incollection{Samuel+al:17,
  author = {Hamman Samuel and Mi-Young Kim and Sankalp Prabhakar and Mohomed
    Shazan Mohomed Jabbar and Osmar R. Zaiane},
  title = {Community Question Retrieval in Health Forums},
  booktitle = {International Conference on Biomedical and Health Informatics},
  year = 2017,
}

Last Updated: November 04, 2019
Submitted by Sabina P

University of Alberta Logo AICML Logo