home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PatentChar: POS Tags: NOUN

There are 1 NOUN lemmas (7%), 293 NOUN types (39%) and 1661 NOUN tokens (35%). Out of 15 observed tags, the rank of NOUN is: 7 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: _

The 10 most frequent NOUN types: 数据、 信息、 中、 特征、 位置、 单元、 地址、 方法、 用户、 模型

The 10 most frequent ambiguous lemmas: _ (NOUN 1661, VERB 948, PUNCT 560, ADJ 474, PART 346, ADP 259, NUM 185, CCONJ 106, ADV 68, PROPN 60, PRON 48, DET 39, X 14, SCONJ 10, AUX 6)

The 10 most frequent ambiguous types: 中 (NOUN 44, PART 2), 权利 (NOUN 21, VERB 1), 上 (NOUN 17, PART 1), 业务 (NOUN 17, VERB 2), 种 (NOUN 14, NUM 2), 镜像 (NOUN 13, ADP 1), 标识 (NOUN 12, VERB 2), 的 (PART 287, NOUN 11), 个 (NOUN 9, DET 1, NUM 1), 时 (NOUN 9, ADP 1)

Morphology

The form / lemma ratio of NOUN is 293.000000 (the average of all parts of speech is 50.400000).

The 1st highest number of forms (293) was observed with the lemma “_”: 1, 3, CPU, FPGA, SFP, 一, 上, 上方, 下, 与, 业务, 个, 中, 中板, 主, 主板, 之间, 事件, 事件表, 于, 互信息, 井, 以太口, 传感器, 位置, 体, 信号, 信号线, 信息, 信息表, 值, 元件, 元数据, 兆, 关系, 兴趣点, 其中, 内, 内存, 内容, 内核, 内部, 出, 函数, 分布, 分析, 到, 制冷剂, 前, 前插卡, 力量, 功率, 区块, 区域, 单元, 卡片点, 卫星, 历史, 压力, 厚度, 参数, 发电机, 变电站, 变量, 合法性, 后, 后端, 向量, 含水率, 器, 器件, 四则, 图层, 图形, 图标, 在, 地图, 地址, 地线, 地质, 均值, 块, 型, 基座, 基站, 壳, 壳体, 处理器, 头像, 子, 孔隙度, 存储器, 存放区, 学生, 安全, 实质, 客户端, 客车, 宿主机, 寄存器, 对象, 导体, 封闭式, 尺寸, 嵌槽, 工况, 工区, 差值, 常用, 平台, 应用, 开关, 强度, 形式, 征, 快照, 思维, 情况, 感测区, 所述, 扣卡, 拓扑, 指令, 指数, 指针, 按键, 据, 接入卡, 接口, 接地点, 接近度, 控件, 散点图, 数, 数学, 数据, 数据库, 数量, 整体, 整车, 文件, 方向, 方式, 方案, 方法, 方钢, 日志, 时, 时刻, 时长, 时间, 时间戳, 时间段, 显示区, 智能, 曲线, 有, 服务器, 服务端, 机器, 权利, 条, 条件, 来源, 架构, 标识, 校验, 根据, 桌面, 桶状部, 梁, 梯度, 模, 模块, 模型, 模态, 步骤, 水平井, 油层, 流量, 消息, 渗透率, 源, 源数据, 热值, 煤电比, 版本, 物理, 特征, 状态, 环境, 用户, 电, 电压, 电子, 电容, 电极, 电流, 电源, 电磁, 电路, 界面, 的, 目录, 目标, 目的, 相似度, 相同, 瞬态, 硅油, 种, 积分, 程序, 程度, 稳态, 空间, 符号, 符号表, 第一, 等值, 签名, 算法, 类, 类型, 系数, 系统, 累积值, 线图, 组, 终端, 经验, 结构, 结构体, 结果, 网格, 网络, 腔体, 自容式, 自由度, 自身, 节点, 芯片, 获取, 虚拟机, 行为, 表单, 表征, 装置, 要求, 要素, 规则, 规格, 规范, 触控笔, 设备, 请求, 质心, 资料, 走线, 距离, 车, 车身, 轮胎, 软硬件, 载荷, 输入端, 输出端, 边界, 过程, 运算器, 连线, 选择器, 通知, 部分, 配置表, 金属, 键帽, 键盘, 镜像, 间, 阈值, 阶段, 集电极, 非显示区, 面板, 面积, 面积比, 页, 页面, 项, 频率, 饱和度, 驱动.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 11 different relations: nmod (633; 38% instances), obj (350; 21% instances), obl (282; 17% instances), nsubj (203; 12% instances), conj (72; 4% instances), root (62; 4% instances), obl:arg (48; 3% instances), xcomp (5; 0% instances), parataxis (3; 0% instances), compound:vv (2; 0% instances), appos (1; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (773; 47% instances), NOUN (766; 46% instances), (62; 4% instances), ADJ (39; 2% instances), ADP (4; 0% instances), PART (4; 0% instances), PROPN (4; 0% instances), ADV (3; 0% instances), PRON (3; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)

447 (27%) NOUN nodes are leaves.

466 (28%) NOUN nodes have one child.

419 (25%) NOUN nodes have two children.

329 (20%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 8.

Children of NOUN nodes are attached using 17 different relations: nmod (731; 29% instances), amod (378; 15% instances), acl (356; 14% instances), case (336; 14% instances), punct (152; 6% instances), nummod (141; 6% instances), obj (79; 3% instances), conj (73; 3% instances), parataxis (64; 3% instances), cc (58; 2% instances), dep (39; 2% instances), advmod (32; 1% instances), ccomp (29; 1% instances), appos (11; 0% instances), det (5; 0% instances), nsubj (2; 0% instances), compound:vv (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: NOUN (766; 31% instances), VERB (442; 18% instances), ADJ (406; 16% instances), PART (187; 8% instances), NUM (165; 7% instances), ADP (162; 7% instances), PUNCT (152; 6% instances), CCONJ (68; 3% instances), PRON (45; 2% instances), PROPN (39; 2% instances), DET (35; 1% instances), ADV (17; 1% instances), X (2; 0% instances), AUX (1; 0% instances)