UNIHAN - cihai.data.unihan

Bootstrapping

Fetch + extract + transform + load UNIHAN dataset to Cihai.

cihai.data.unihan.bootstrap.bootstrap_unihan(engine, metadata, options=None)
function[source]
function[source]
cihai.data.unihan.bootstrap.bootstrap_unihan(engine, metadata, options=None)

UNIHAN bootstrap script (download from web, import to database).

Parameters:
Return type:

None

cihai.data.unihan.bootstrap.is_bootstrapped(metadata)
function[source]
function[source]
cihai.data.unihan.bootstrap.is_bootstrapped(metadata)

Return True if cihai is correctly bootstrapped.

Parameters:

metadata (MetaData)

Return type:

bool

cihai.data.unihan.bootstrap.create_unihan_table(columns, metadata)
function[source]
function[source]
cihai.data.unihan.bootstrap.create_unihan_table(columns, metadata)

Create table and return sqlalchemy.sql.schema.Table.

Parameters:
Returns:

Newly created table with columns and index.

Return type:

sqlalchemy.schema.Table

class cihai.data.unihan.dataset.Unihan
class cihai.data.unihan.dataset.Unihan

Bases: Dataset, SQLAlchemyMixin

UNIHAN Dataset for cihai.

Constants for UNIHAN cihai dataset.

cihai.data.unihan.constants.UNIHAN_FIELDS: list[str] = <...truncated, 529 chars>
data
data
cihai.data.unihan.constants.UNIHAN_FIELDS: list[str] = <...truncated, 529 chars>

Mapping of field names from unihan-etl (UNIHAN database)

cihai.data.unihan.constants.UNIHAN_ETL_DEFAULT_OPTIONS = <...truncated, 761 chars>
data
data
cihai.data.unihan.constants.UNIHAN_ETL_DEFAULT_OPTIONS = <...truncated, 761 chars>

Default settings passed to unihan-etl

Variants plugin

class cihai.data.unihan.dataset.UnihanVariants
class cihai.data.unihan.dataset.UnihanVariants

Bases: DatasetPlugin, SQLAlchemyMixin

Support for CJK Variant lookups through UNIHAN dataset.