Datasets¶
Data sources available through cihai.
UNIHAN
Unicode Han Database – readings, meanings, variants.
Planned datasets¶
For all data sets, the goal is to achieve:
Clear and permissive a licensing for public and private use
Compatibility with data Packages, for data to be language agnostic and consistent
Open source scripting used to process data into a common format
Set |
License |
Data Package |
Project |
|---|---|---|---|
UNIHAN |
OK [1] |
OK [2] |
OK [3] |
edict |
OK |
TODO |
TODO |
cedict |
OK [4] |
TODO |
TODO |
cedictgr |
OK |
TODO |
TODO |
handedict |
OK |
TODO |
TODO |
cfdict |
OK |
MISSING [5] |
UNKNOWN |