Not Logged In

Sequence prediction exploiting similarity information

Full Text: IJCAI07-254.pdf PDF

When data is scarce or the alphabet is large, smoothing the probability estimates becomes inescapable when estimating n-gram models. In this paper we propose a method that implements a form of smoothing by exploiting similarity information of the alphabet elements. The idea is to view the log-conditional probability function as a smooth function defined over the similarity graph. The algorithm that we propose uses the eigenvectors of the similarity graph as the basis of the expansion of the log conditional probability function whose coefficients are found by solving a regularized logistic regression problem. The experimental results demonstrate the superiority of the method when the similarity graph contains relevant information, whilst the method still remains competitive with state-of-the-art smoothing methods even in the lack of such information.

Citation

I. Biro, Z. Szamonek, C. Szepesvari. "Sequence prediction exploiting similarity information". International Joint Conference on Artificial Intelligence (IJCAI), Hyderabad, India, March 2007.

Keywords: machine learning
Category: In Conference

BibTeX

@incollection{Biro+al:IJCAI07,
  author = {Istavan Biro and Zoltan Szamonek and Csaba Szepesvari},
  title = {Sequence prediction exploiting similarity information},
  booktitle = {International Joint Conference on Artificial Intelligence
    (IJCAI)},
  year = 2007,
}

Last Updated: April 24, 2007
Submitted by Nelson Loyola

University of Alberta Logo AICML Logo