Treebank Statistics: UD_Icelandic-GC: POS Tags: PUNCT
There are 31 PUNCT
lemmas (0%), 25 PUNCT
types (0%) and 7861 PUNCT
tokens (8%).
Out of 17 observed tags, the rank of PUNCT
is: 11 in number of lemmas, 12 in number of types and 5 in number of tokens.
The 10 most frequent PUNCT
lemmas: _, ., ,, :, ?, —, –, -, “, „
The 10 most frequent PUNCT
types: ., ,, :, ?, —, –, (, ), -, ;
The 10 most frequent ambiguous lemmas: _ (PUNCT 4781, ADV 1, NOUN 1, PROPN 1), , (PUNCT 1090, ADV 1), - (PUNCT 21, NOUN 1), ( (PUNCT 10, ADV 2), … (PUNCT 5, ADV 1), flugfélag (NOUN 9, PROPN 1, PUNCT 1), frásögn (NOUN 4, PUNCT 1), kona (NOUN 97, PUNCT 1), maí (ADV 11, NOUN 8, PUNCT 1)
The 10 most frequent ambiguous types: - (PUNCT 25, NOUN 1), ( (PUNCT 14, ADV 2), … (PUNCT 5, ADV 1)
- -
- (
- PUNCT 14: 10. tíundin á tilfinnanlega meira en 9 ( 41% )
- ADV 2: Ljóðabókin Ljóð námu völd var tilnefnd til Bókmenntaverðlauna Norðurlandaráðs árið 1993 , Sigurður hlaut Íslensku bókmenntaverðlaunin fyrir Minnisbók árið 2007 og hafði þá áður verið tilnefndur fyrir ljóðabækurnar Ljóðlínuskip ( 1995 ) og Ljóðtímaleit ( 2001 ) .
- …
- PUNCT 5: Á næsta ári munu flestar stofnanir vinna ársáætlanir sínar í nýja kerf …
- ADV 1: Breytingarnar á fjármálastefnunni og fyrirhugaðar breytingar á fjármálaáætlun ríkisstjórnarinnar eru sagðar endurspegla þá afstöðu stjórnvalda að þrátt fyrir að breyttar efnahagsforsendur leiði óhjákvæmilegra til hóflegri afkomumarkmiða á næstu árum ( … ) þá sé engu að síður óásættanlegt að svo mikil slökun verði gerð á afkomu hins opinbera að hún snúist í halla við núverandi aðstæður .
Morphology
The form / lemma ratio of PUNCT
is 0.806452 (the average of all parts of speech is 1.434754).
The 1st highest number of forms (18) was observed with the lemma “_”: !, (, ), (, ), ,, -, ., :, ;, ?, [, \(, ], –, —, “, „.
The 2nd highest number of forms (2) was observed with the lemma “–”: -, –.
The 3rd highest number of forms (1) was observed with the lemma “!”: !.
PUNCT
does not occur with any features.
Relations
PUNCT
nodes are attached to their parents using 1 different relations: punct (7861; 100% instances)
Parents of PUNCT
nodes belong to 16 different parts of speech: VERB (4203; 53% instances), NOUN (1701; 22% instances), PROPN (684; 9% instances), ADJ (487; 6% instances), ADV (293; 4% instances), PRON (228; 3% instances), NUM (166; 2% instances), SCONJ (40; 1% instances), PUNCT (16; 0% instances), X (14; 0% instances), ADP (10; 0% instances), INTJ (7; 0% instances), AUX (4; 0% instances), PART (4; 0% instances), SYM (3; 0% instances), CCONJ (1; 0% instances)
7845 (100%) PUNCT
nodes are leaves.
16 (0%) PUNCT
nodes have one child.
The highest child degree of a PUNCT
node is 1.
Children of PUNCT
nodes are attached using 1 different relations: punct (16; 100% instances)
Children of PUNCT
nodes belong to 1 different parts of speech: PUNCT (16; 100% instances)