Treebank Statistics: UD_Skolt_Sami-Giellagas: POS Tags: ADV
There are 84 ADV lemmas (16%), 87 ADV types (11%) and 301 ADV tokens (10%).
Out of 16 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 5 in number of tokens.
The 10 most frequent ADV lemmas: âʹtte, âʹpet, de, kâʹl, tok, mäʹhtt, pâi, âʹte, nuʹtt, teâđast
The 10 most frequent ADV types: âʹtte, âʹpet, de, kâʹl, tok, mäʹhtt, pâi, teâđast, še, tõʹst
The 10 most frequent ambiguous lemmas: de (CCONJ 63, ADV 12), mäʹhtt (ADV 9, SCONJ 2), še (ADV 8, PART 2), mieʹldd (ADV 4, ADP 1), näʹde (INTJ 13, ADV 3), ko (SCONJ 21, ADV 2), kuʹǩǩ (ADJ 1, ADV 1), mii (PRON 13, ADV 1), ni (PART 13, ADV 1), no (INTJ 13, CCONJ 3, ADV 1)
The 10 most frequent ambiguous types: de (CCONJ 50, ADV 6), mäʹhtt (ADV 8, SCONJ 2), še (ADV 8, PART 2), tõʹst (ADV 7, PRON 1), kuuʹǩǩ (ADV 4, ADJ 1), mieʹldd (ADV 4, ADP 1), näʹde (INTJ 3, ADV 1), No (INTJ 12, CCONJ 3, ADV 1), ni (PART 13, ADV 1), ool (ADP 7, ADV 1)
- de
- mäʹhtt
- še
- tõʹst
- kuuʹǩǩ
- mieʹldd
- näʹde
- No
- ni
- ool
Morphology
The form / lemma ratio of ADV is 1.035714 (the average of all parts of speech is 1.476809).
The 1st highest number of forms (3) was observed with the lemma “nuʹtt”: Nuʹtt, nuʹt, nuʹt-i.
The 2nd highest number of forms (2) was observed with the lemma “eʹpet”: eʹpet, eʹpet-i.
The 3rd highest number of forms (2) was observed with the lemma “koozz”: koozz, koozz-a.
ADV occurs with 8 features: AdvType (76; 25% instances), Case (21; 7% instances), Clitic (8; 3% instances), ExtPos (5; 2% instances), PronType (3; 1% instances), Typo (3; 1% instances), Number (2; 1% instances), PartType (2; 1% instances)
ADV occurs with 12 feature-value pairs: AdvType=Tim, Case=Ill, Case=Loc, Clitic=AddI, Clitic=QstA, ExtPos=ADV, ExtPos=SCONJ, Number=Sing, PartType=Int, PronType=Int, PronType=Rel, Typo=Yes
ADV occurs with 16 feature combinations.
The most frequent feature combination is _ (195 tokens).
Examples: âʹpet, de, kâʹl, tok, mäʹhtt, pâi, teâđast, še, mååusat, toʹb
Relations
ADV nodes are attached to their parents using 10 different relations: advmod (270; 90% instances), discourse (8; 3% instances), orphan (8; 3% instances), conj (3; 1% instances), mark (3; 1% instances), obl (3; 1% instances), root (3; 1% instances), appos (1; 0% instances), ccomp (1; 0% instances), fixed (1; 0% instances)
Parents of ADV nodes belong to 9 different parts of speech: VERB (233; 77% instances), PRON (20; 7% instances), NOUN (18; 6% instances), AUX (10; 3% instances), ADV (9; 3% instances), ADJ (6; 2% instances), (3; 1% instances), PART (1; 0% instances), PROPN (1; 0% instances)
274 (91%) ADV nodes are leaves.
20 (7%) ADV nodes have one child.
3 (1%) ADV nodes have two children.
4 (1%) ADV nodes have three or more children.
The highest child degree of a ADV node is 6.
Children of ADV nodes are attached using 15 different relations: punct (8; 18% instances), advmod (6; 13% instances), fixed (5; 11% instances), conj (4; 9% instances), cop (4; 9% instances), discourse (3; 7% instances), nmod (3; 7% instances), nsubj:cop (3; 7% instances), appos (2; 4% instances), cc (2; 4% instances), advcl (1; 2% instances), advmod:neg (1; 2% instances), aux (1; 2% instances), ccomp (1; 2% instances), mark (1; 2% instances)
Children of ADV nodes belong to 10 different parts of speech: ADV (9; 20% instances), PUNCT (8; 18% instances), VERB (6; 13% instances), AUX (5; 11% instances), NOUN (4; 9% instances), PRON (4; 9% instances), SCONJ (3; 7% instances), CCONJ (2; 4% instances), INTJ (2; 4% instances), PART (2; 4% instances)