100 Best GitHub: N-gram


 

See also:

100 Best GitHub: Ngram | 100 Best N-gram Videos | ConcGrams | N-gram & Tag Clouds | N-gram Dialog Systems | N-gram Grammars | N-gram Transducers (NGT)


[100x Nov 2017]

  • 0jag/wordsworth frequency analysis tool – counts words, letters, n-grams and more!
  • rockymadden/stringmetric string metrics and phonetic algorithms for scala (e.g. dice/sorensen, hamming, jaccard, jaro, jaro-winkler, levensh…
  • proycon/pynlpl pynlpl, pronounced as ‘pineapple’, is a python library for natural language processing. it contains various modules u…
  • zhezhaoa/ngram2vec four word embedding models implemented in python. supporting arbitrary context features
  • jwieting/charagram code to train and use models from “charagram: embedding words and sentences via character n-grams”.
  • grakic/textcat-sr serbian cyrillic and latin language models for libexttextcat, a free software n-gram based language guessing library
  • pomax/nrgrammar the nihongo resources grammar book: “an introduction to japanese; syntax, grammar & language”
  • sunpinyin/open-gram an open solution for collecting n-gram chinese lexicon and n-gram statistics
  • bigfav/n-grams my python n-gram language model from an nlp course. since there are so many public implementations, i feel free to post mine.
  • ayushoriginal/ngram-graphs research [nlp] analysis of n-gram graphs and their applications in the domain of text classification and extraction …
  • vsiivola/varikn a toolkit for producing n-gram language models. the highlights are the implementation of kneser-ney growing and revis…
  • proycon/colibri-core colibri core is an nlp tool as well as a c++ and python library for working with basic linguistic constructions such as…
  • pebbe/textcat a go package for n-gram based text categorization, with support for utf-8 and raw text
  • reddavis/n-gram n-gram generator in ruby – http://en.wikipedia.org/wiki/n-gram
  • wpm/tfidf a generic tf-idf utility with example code that works on n-grams extracted from a text document.
  • bburns/languagemodels comparison of n-gram vs rnn (recurrent neural network) language models (predicting next word in a sequence), using py…
  • smashedtoatoms/the_fuzz string metrics and phonetic algorithms for elixir (e.g. dice/sorensen, hamming, jaccard, jaro, jaro-winkler, levensht…
  • cidles/pressagio pressagio is a library that predicts text based on n-gram models. for example, you can send a string and the library …
  • dexyk/stringosim string similarity functions, jaccard, levenshtein, hamming, jaro-winkler, q-grams, n-grams, lcs – longest common subs…
  • sparell/phraser phraser is a phrase generator using n-grams and markov chains to generate phrases for passphrase cracking. the genera…
  • rxnlp/text-mining-and-nlp-apis rxnlp apis for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or url, c…
  • cedias/word2vec tool for computing continuous distributed representations of words. modified to learn n-grams
  • libofang/dv-ngram code for iclr workshop paper “learning document embedding by predicting n-grams for sentiment classification of long …
  • eddiesong/pos sample implementation of hmm tagger and n-gram tagger for pos tagging
  • ggianna/jinsect the jinsect toolkit is a java-based toolkit and library that supports and demonstrates the use of n-gram graphs withi…
  • clips/clinspell clinical spelling correction with word and character n-gram embeddings.
  • pteichman/quotefix insert matching punctuation for mismatched quotation marks, parentheses, etc. good postprocessing for n-gram text syn…
  • tonytonyjan/tjngram n-gram generator in ruby, supporting english, chinese, japanese and korean.
  • crodas/textcat simple and lightweight library to classify text using n-grams (useful to detect language)
  • pan-webis-de/jairescalante11 reimplementation of the authorship attribution approach described in “hugo jair escalante, thamar solorio, and manuel…
  • pan-webis-de/sidorov14 reimplementation of the authorship attribution approach described in “grigori sidorov, francisco velasquez, efstathio…
  • vspandan/queryexpansion query expansion is the project developed at iiit hyderabad, as part of course work. we have gone through existing app…
  • pan-webis-de/keselj03 reimplementation of the authorship attribution approach described in “vlado kešelj, fuchun peng, nick cercone, and ca…
  • halfvim/n-gram semantic analysis of movie reviews using character n-grams
  • dineshkarthik/n-gram_processor using n-grams, get the set of words and their frequency of occurrence in a given directory / sub-directory / text file, which …
  • atrilla/nlg natural language generator based on n-gram language models