{% extends 'silpa.html'%} {% block modulescript %} {% endblock %} {% block content%}

Indic W-shingling Library

What is W-shingling?

a w-shingling is a set of unique "shingles"—contiguous subsequences of tokens in a document—that can be used to gauge the similarity of two documents. The w denotes the number of tokens in each shingle in the set.

Read more about W-shingling

Supported Languages

English, Hindi, Malayalam, Kannada, Bengali

Enter the text for getting the w-shingling below. Language of each word will be detected. You can give the text in any language and even with mixed language.

W :


Python Wshingling API

This service provides indic w-shingling libraries
{% endblock %}