Treebank Statistics: UD_Gwichin-TueCL: POS Tags: PROPN
There are 28 PROPN lemmas (9%), 28 PROPN types (6%) and 33 PROPN tokens (3%).
Out of 15 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.
The 10 most frequent PROPN lemmas: gwichyaa, John, K’ǫǫ, Vashrą̀įį, Alice, April, Bishop, Burke, Dodson, Dr.
The 10 most frequent PROPN types: Gwichyaa, John, K’ǫǫ, Vashrą̀įį, Alice, April, Bishop, Burke, Dodson, Dr.
The 10 most frequent ambiguous lemmas: _ (PUNCT 319, VERB 37, ADV 7, ADP 1, NOUN 1, PROPN 1, X 1), dinjii (NOUN 1, PROPN 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.693333).
The 1st highest number of forms (1) was observed with the lemma “Alice”: Alice.
The 2nd highest number of forms (1) was observed with the lemma “April”: April.
The 3rd highest number of forms (1) was observed with the lemma “Bishop”: Bishop.
PROPN does not occur with any features.
Relations
PROPN nodes are attached to their parents using 8 different relations: flat (9; 27% instances), obl (7; 21% instances), nsubj (5; 15% instances), conj (4; 12% instances), compound (3; 9% instances), obj (3; 9% instances), appos (1; 3% instances), dep (1; 3% instances)
Parents of PROPN nodes belong to 4 different parts of speech: VERB (14; 42% instances), PROPN (12; 36% instances), NOUN (5; 15% instances), PRON (2; 6% instances)
15 (45%) PROPN nodes are leaves.
13 (39%) PROPN nodes have one child.
3 (9%) PROPN nodes have two children.
2 (6%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 3.
Children of PROPN nodes are attached using 7 different relations: flat (9; 36% instances), case (6; 24% instances), cc (3; 12% instances), punct (3; 12% instances), conj (2; 8% instances), compound (1; 4% instances), nummod (1; 4% instances)
Children of PROPN nodes belong to 6 different parts of speech: PROPN (12; 48% instances), ADP (6; 24% instances), PUNCT (3; 12% instances), CCONJ (2; 8% instances), ADV (1; 4% instances), NUM (1; 4% instances)