home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-CHILDES: POS Tags: PUNCT

There are 23 PUNCT lemmas (0%), 23 PUNCT types (0%) and 48217 PUNCT tokens (16%). Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: ., ?, !, dere, :, o, that, Swww, a, dis

The 10 most frequent PUNCT types: ., ?, !, dere, :, dat, o, Swww, a, dis

The 10 most frequent ambiguous lemmas: dere (ADV 44, PROPN 33, INTJ 21, NOUN 21, PRON 12, PUNCT 8, DET 2, ADJ 1), o (NOUN 14, X 5, PUNCT 3, INTJ 2, PROPN 1, SYM 1), that (PRON 4425, DET 1080, SCONJ 361, INTJ 39, ADV 28, PROPN 13, ADP 10, NOUN 10, PART 7, PUNCT 3, SYM 1), Swww (PROPN 58, PUNCT 2), a (DET 7218, NOUN 36, X 15, SYM 10, PRON 8, INTJ 6, PROPN 4, ADP 2, PUNCT 2), dis (PRON 74, DET 34, PROPN 23, INTJ 14, NOUN 13, ADV 4, ADJ 2, PART 2, PUNCT 2, ADP 1, X 1), t (NOUN 13, X 3, ADV 2, INTJ 2, PART 2, PROPN 2, PUNCT 2, ADJ 1), Dwww (PROPN 20, NOUN 1, PUNCT 1), b (NOUN 46, PROPN 9, NUM 5, X 3, INTJ 2, ADJ 1, ADP 1, PUNCT 1, SYM 1), horsie (NOUN 84, PROPN 6, ADJ 4, ADV 2, INTJ 1, PUNCT 1, VERB 1, X 1)

The 10 most frequent ambiguous types: dere (ADV 64, PROPN 33, NOUN 21, INTJ 12, PRON 10, PUNCT 8, VERB 5, DET 2, ADJ 1), dat (PRON 108, DET 42, NOUN 8, PROPN 7, PART 6, INTJ 5, ADP 4, PUNCT 3, SCONJ 2, SYM 1), o (NOUN 14, X 5, PUNCT 3, ADP 1, INTJ 1, PROPN 1, SYM 1), Swww (PROPN 58, PUNCT 2, INTJ 1, PRON 1), a (DET 5737, NOUN 32, PART 14, X 13, ADP 12, SYM 9, PRON 6, PROPN 4, INTJ 3, PUNCT 2, ADV 1, AUX 1), dis (PRON 56, DET 34, PROPN 21, NOUN 13, ADV 4, INTJ 4, ADJ 2, PART 2, PUNCT 2, ADP 1, X 1), t (NOUN 13, PART 2, PUNCT 2, ADJ 1, ADV 1, PROPN 1, X 1), Dwww (PROPN 20, NOUN 2, PRON 2, INTJ 1, PUNCT 1), b (NOUN 45, PROPN 9, X 3, NUM 2, ADJ 1, ADP 1, PUNCT 1, SYM 1), horsie (NOUN 87, PROPN 10, ADJ 4, ADV 2, PUNCT 1, X 1)

Morphology

The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.232942).

The 1st highest number of forms (1) was observed with the lemma “!”: !.

The 2nd highest number of forms (1) was observed with the lemma “-”: -.

The 3rd highest number of forms (1) was observed with the lemma “.”: ..

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 2 different relations: punct (48186; 100% instances), root (31; 0% instances)

Parents of PUNCT nodes belong to 18 different parts of speech: VERB (27919; 58% instances), NOUN (9694; 20% instances), ADJ (2623; 5% instances), PRON (1928; 4% instances), ADV (1852; 4% instances), AUX (1317; 3% instances), PROPN (1248; 3% instances), INTJ (790; 2% instances), NUM (367; 1% instances), DET (172; 0% instances), ADP (90; 0% instances), SCONJ (81; 0% instances), PART (33; 0% instances), PUNCT (31; 0% instances), (31; 0% instances), CCONJ (19; 0% instances), X (16; 0% instances), SYM (6; 0% instances)

48186 (100%) PUNCT nodes are leaves.

0 (0%) PUNCT nodes have one child.

15 (0%) PUNCT nodes have two children.

16 (0%) PUNCT nodes have three or more children.

The highest child degree of a PUNCT node is 6.

Children of PUNCT nodes are attached using 11 different relations: punct (31; 35% instances), nsubj (21; 24% instances), cop (11; 13% instances), discourse (8; 9% instances), det (6; 7% instances), case (3; 3% instances), nummod (3; 3% instances), amod (2; 2% instances), advmod (1; 1% instances), compound (1; 1% instances), mark (1; 1% instances)

Children of PUNCT nodes belong to 11 different parts of speech: PUNCT (31; 35% instances), PRON (12; 14% instances), AUX (11; 13% instances), INTJ (10; 11% instances), NOUN (9; 10% instances), DET (6; 7% instances), NUM (4; 5% instances), ADJ (2; 2% instances), ADP (1; 1% instances), ADV (1; 1% instances), SCONJ (1; 1% instances)