swelcker

Pinned

  1. cmd.csp.classifier Public

    A simple text classifier implementation in Java with built-in SVM, C4.5, kNN, and naive Bayes classifiers. Supports common text preprocessors and the CSV format. You can plug in your own clas…

    Java

  2. cmd.csp.similarity Public

    A library implementing various string similarity and distance measures for ease of use. A dozen algorithms (including Levenshtein edit distance and siblings, Jaro-Winkler, Longest Common Subs…

    Java 1

  3. cmd.csp.postagger Public

    A robust and easy-to-use toolkit for part-of-speech (POS) tagging in NLP. Its approach is to automatically construct tagging rules in the form of a binary tree. Supports pre-trained UPOS, XPOS taggi…

    Java

  4. cmd.csp.stemmer Public

    A simple implementation of the Snowball stemmer (http://snowballstem.org/) in Java with stemmers for 20+ languages. Helps reduce tokens to their stems, especially when processing them in Machine Lear…

    Java