(public beta version)

List of data analysis functions


Function name: charcount
Description:

Counts how many times each character occurs in the input, including non-CJK characters,
and returns the result as a JSON dictionary.

Used to count the usage frequency of characters (not words).
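
A minimal sketch of this kind of character count, in Python. The name count_characters is hypothetical, and skipping whitespace is an assumption; the actual charcount implementation is not shown in this list and may differ.

```python
import json
from collections import Counter


def count_characters(text: str) -> str:
    """Return a JSON dictionary mapping each character to its count.

    Hypothetical helper: non-CJK characters are counted too, and
    whitespace is skipped (an assumption, not confirmed by the source).
    """
    counts = Counter(ch for ch in text if not ch.isspace())
    # ensure_ascii=False keeps CJK characters readable in the JSON output.
    return json.dumps(counts, ensure_ascii=False)


if __name__ == "__main__":
    print(count_characters("我哋 AB 我"))
```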

Data License: public domain. Credits to words.hk appreciated.

Data last updated: 2018-06-24 13:52:02

Function name: existingwordcount
Description:

Queries the database for all word representations and counts the number of occurrences of each in the article content, without attempting word segmentation.

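Used to count the usage frequency of words that already exist in the database.
Note that this list only includes words that have been seen in 粵文庫; it does not include words that are recorded in 《粵典》 (words.hk) but have never appeared in 粵文庫.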
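
Counting without segmentation can be sketched as plain substring matching. The function name count_known_words and the list-of-strings input are assumptions for illustration; the real function reads its words from the database.

```python
def count_known_words(text: str, known_words: list[str]) -> dict[str, int]:
    """Count occurrences of each known word in text by substring matching.

    Hypothetical helper: no segmentation is attempted, so a word that is
    a substring of a longer word is counted in both (e.g. "香港" inside
    "香港人"). Words that never occur are left out of the result.
    """
    return {
        word: text.count(word)  # str.count counts non-overlapping matches
        for word in known_words
        if word and word in text
    }


if __name__ == "__main__":
    print(count_known_words("香港人喺香港", ["香港", "香港人", "九龍"]))
```

Because no segmentation is done, counts for short words include their occurrences inside longer words, which is worth keeping in mind when reading the frequencies.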

Data License: public domain. Credits to words.hk appreciated.

Data last updated: 2018-07-05 23:59:21

Description:

Queries the database for the Jyutping of every word and outputs it character by character wherever the entry seems valid.

*** Usage: run the analysis on a single article, with the "only need any one article" option turned on. Any article will do, since the article contents do not matter. ***

Can be used as {character => Jyutping} data.
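
One way to build such a {character => Jyutping} mapping is sketched below. The function name, the (word, jyutping) input pairs, and the validity check (one space-separated syllable per character) are all assumptions; the source does not say what "seems valid" means.

```python
def chars_to_jyutping(entries: list[tuple[str, str]]) -> dict[str, list[str]]:
    """Map each character to the Jyutping syllables seen for it.

    Hypothetical helper. An entry is treated as valid only when the
    number of space-separated syllables equals the number of characters
    (an assumed rule); other entries are skipped.
    """
    mapping: dict[str, set[str]] = {}
    for word, jyutping in entries:
        syllables = jyutping.split()
        if len(syllables) != len(word):
            continue  # does not look valid under the assumed rule
        for char, syllable in zip(word, syllables):
            mapping.setdefault(char, set()).add(syllable)
    # Sort the syllables so the output is deterministic.
    return {char: sorted(syls) for char, syls in mapping.items()}


if __name__ == "__main__":
    sample = [("香港", "hoeng1 gong2"), ("行", "haang4"), ("銀行", "ngan4 hong4")]
    print(chars_to_jyutping(sample))
```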

Data License: public domain. Credits to words.hk appreciated.

Data last updated: 2018-06-27 16:41:38