home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-KSL: POS Tags: PRON

There are 235 PRON lemmas (1%), 236 PRON types (1%) and 4066 PRON tokens (3%). Out of 14 observed tags, the rank of PRON is: 5 in number of lemmas, 6 in number of types and 7 in number of tokens.

The 10 most frequent PRON lemmas: 저+는, 우리, 나+는, 저+의, 내+가, 우리+는, 나+의, 제+가, 우리+가, 나+에게

The 10 most frequent PRON types: 저는, 우리, 나는, 내가, 제, 우리는, 내, 제가, 우리가, 나에게

The 10 most frequent ambiguous lemmas: 저+는 (PRON 642, VERB 2), 우리 (PRON 538, NOUN 11), 나+는 (PRON 464, VERB 9), 내+가 (PRON 292, NOUN 6), 자기 (NOUN 50, PRON 49), 나 (PRON 27, ADP 3, NOUN 3), 누구+나 (PRON 22, ADV 1), 나+ㄴ (PRON 21, VERB 7), 저 (PRON 14, DET 6, ADV 1), 그것+이 (PRON 10, NOUN 1)

The 10 most frequent ambiguous types: 저는 (PRON 641, VERB 2), 우리 (PRON 538, NOUN 11, ADV 1), 나는 (PRON 462, VERB 9), 내가 (PRON 292, NOUN 6), 제 (PRON 278, NOUN 3, DET 1), 내 (PRON 176, NOUN 1), 자기 (NOUN 50, PRON 49, VERB 3), 나 (PRON 27, ADP 3, NOUN 3, X 1), 난 (PRON 23, VERB 7, X 1), 누구나 (PRON 22, ADV 1)

Morphology

The form / lemma ratio of PRON is 1.004255 (the average of all parts of speech is 1.007876).

The 1st highest number of forms (3) was observed with the lemma “저+의”: 재, 저의, 제.

The 2nd highest number of forms (2) was observed with the lemma “그것+이”: 그것이, 그게.

The 3rd highest number of forms (2) was observed with the lemma “나+는”: 나는, 난.

PRON occurs with 1 features: Typo (2; 0% instances)

PRON occurs with 1 feature-value pairs: Typo=Yes

PRON occurs with 2 feature combinations. The most frequent feature combination is _ (4064 tokens). Examples: 저는, 우리, 나는, 내가, 제, 우리는, 내, 제가, 우리가, 나에게

Relations

PRON nodes are attached to their parents using 15 different relations: nsubj (2239; 55% instances), nmod:poss (636; 16% instances), nmod (410; 10% instances), obl (374; 9% instances), dislocated (258; 6% instances), obj (124; 3% instances), advcl (5; 0% instances), flat (4; 0% instances), appos (3; 0% instances), conj (3; 0% instances), vocative (3; 0% instances), acl (2; 0% instances), amod (2; 0% instances), compound (2; 0% instances), ccomp (1; 0% instances)

Parents of PRON nodes belong to 10 different parts of speech: VERB (2495; 61% instances), NOUN (797; 20% instances), ADJ (445; 11% instances), ADV (300; 7% instances), AUX (22; 1% instances), ADP (2; 0% instances), DET (2; 0% instances), INTJ (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances)

3963 (97%) PRON nodes are leaves.

96 (2%) PRON nodes have one child.

5 (0%) PRON nodes have two children.

2 (0%) PRON nodes have three or more children.

The highest child degree of a PRON node is 3.

Children of PRON nodes are attached using 17 different relations: conj (26; 23% instances), case (23; 21% instances), acl (17; 15% instances), nmod (10; 9% instances), flat (8; 7% instances), punct (7; 6% instances), amod (5; 4% instances), advcl (2; 2% instances), advmod (2; 2% instances), appos (2; 2% instances), list (2; 2% instances), nmod:poss (2; 2% instances), nsubj (2; 2% instances), det (1; 1% instances), goeswith (1; 1% instances), obj (1; 1% instances), obl (1; 1% instances)

Children of PRON nodes belong to 10 different parts of speech: NOUN (44; 39% instances), ADP (23; 21% instances), VERB (15; 13% instances), ADV (10; 9% instances), ADJ (7; 6% instances), PUNCT (7; 6% instances), DET (3; 3% instances), AUX (1; 1% instances), PRON (1; 1% instances), X (1; 1% instances)