< Back

oscar_icu

latest version: 1.0


Description : Thai unigram word frequency from OSCAR Corpus (icu word tokenize)

Long Description : Thai unigram word frequency from OSCAR Corpus (icu word tokenize)

HomePage : https://web.facebook.com/groups/colab.thailand/permalink/1524070061101680/?_rdc=1&_rdr

Authors : Korakot Chaovavanich


Download and Use

Download

from pythainlp.corpus import download
download('oscar_icu')

Use

It's get path file of corpus.
from pythainlp.corpus import get_corpus_path
get_corpus_path('oscar_icu')

if get_corpus_path('oscar_icu') is None than you not download oscar_icu.

Release history

1.0

File Name : oscar_word_freq.csv

md5 : -

PyThaiNLP version: >=2.2

Link Download : oscar_word_freq.csv