This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home fi/dep issue tracker

punct: punctuation

The dependency type punct is used to mark punctuation. The dependent is the punctuation symbol, and the governor is the element which the punctuation symbol delimits. For instance, with coordination, the first coordinated element is the head of all punct dependencies in the coordination, and with subordinate clauses, the head of the subordinate clause is the governor of the punct.

Diffs

By the current release of FI_FTB (FinnTreeBank), the manual annotation of punctuation marks has not been completed. Instead the automatic annotation links the punctuation marks to the closest token available (usually the previous one).


Treebank Statistics (UD_Finnish)

This relation is universal.

26398 nodes (15%) are attached to their parents as punct.

19192 instances of punct (73%) are left-to-right (parent precedes child). Average distance between parent and child is 6.59462838093795.

The following 18 pairs of parts of speech are connected with punct: VERB-PUNCT (17244; 65% instances), NOUN-PUNCT (4451; 17% instances), ADJ-PUNCT (1651; 6% instances), PROPN-PUNCT (1030; 4% instances), NUM-PUNCT (816; 3% instances), ADV-PUNCT (496; 2% instances), PRON-PUNCT (231; 1% instances), SYM-PUNCT (152; 1% instances), X-PUNCT (119; 0% instances), INTJ-PUNCT (106; 0% instances), SCONJ-PUNCT (34; 0% instances), NOUN-SYM (20; 0% instances), CONJ-PUNCT (19; 0% instances), AUX-PUNCT (13; 0% instances), ADP-PUNCT (6; 0% instances), VERB-SYM (5; 0% instances), PUNCT-PUNCT (4; 0% instances), NUM-SYM (1; 0% instances).


Treebank Statistics (UD_Finnish-FTB)

This relation is universal.

22565 nodes (14%) are attached to their parents as punct.

22202 instances of punct (98%) are left-to-right (parent precedes child). Average distance between parent and child is 1.10325725681365.

The following 16 pairs of parts of speech are connected with punct: NOUN-PUNCT (9942; 44% instances), VERB-PUNCT (4677; 21% instances), ADV-PUNCT (2084; 9% instances), ADJ-PUNCT (1820; 8% instances), PRON-PUNCT (1102; 5% instances), PROPN-PUNCT (943; 4% instances), ADP-PUNCT (858; 4% instances), PART-PUNCT (329; 1% instances), NUM-PUNCT (275; 1% instances), PUNCT-PUNCT (207; 1% instances), INTJ-PUNCT (97; 0% instances), CONJ-PUNCT (87; 0% instances), SCONJ-PUNCT (80; 0% instances), X-PUNCT (34; 0% instances), DET-PUNCT (27; 0% instances), SYM-PUNCT (3; 0% instances).


punct in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]
BESbswyBESbswyBESbswyBESbswy