Resource title

Text Clustering with String Kernels in R

Resource image

image for OpenScout resource :: Text Clustering with String Kernels in R

Resource description

We present a package which provides a general framework, including tools and algorithms, for text mining in R using the S4 class system. Using this package and the kernlab R package we explore the use of kernel methods for clustering (e.g., kernel k-means and spectral clustering) on a set of text documents, using string kernels. We compare these methods to a more traditional clustering technique like k-means on a bag of word representation of the text and evaluate the viability of kernel-based methods as a text clustering technique. (author's abstract) ; Series: Research Report Series / Department of Statistics and Mathematics

Resource author

Alexandros Karatzoglou, Ingo Feinerer

Resource publisher

Resource publish date

Resource language

en

Resource content type

application/pdf

Resource resource URL

http://epub.wu.ac.at/1002/1/document.pdf

Resource license

Adapt according to the license agreement. Always reference the original source and author.