eval.log
Running the following version of UD tools:
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432
Author: Dan Zeman <[email protected]>
Date: Sun Nov 10 10:33:45 2024 +0100
Evaluating the following revision of UD_English-LinES:
commit 0aa14cdeddef9d1d4a6bb155d1669f2e1694c039
Merge: 61a1435 eced5e1
Author: Dan Zeman <[email protected]>
Size: counted 94217 of 94217 words (nodes).
Size: min(0, log((N/1000)**2)) = 9.09120126478277.
Size: maximum value 13.815511 is for 1000000 words or more.
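
The size figures above can be reproduced with a short sketch. It assumes the size score is the logged logarithm divided by the stated maximum, which agrees with the score{size} value reported later in this log. The function name `size_score` is hypothetical and is not part of the UD tools. Note that the logged label reads min(0, ...), but the printed value 9.09 is only consistent with max(0, ...), so the sketch uses max.

```python
import math


def size_score(n_words: int, cap_words: int = 1_000_000) -> float:
    """Log-scaled size score in [0, 1] (assumed formula, not taken from the tool)."""
    if n_words <= 0:
        return 0.0
    # Raw value as printed in the log: log((N/1000)**2), floored at 0.
    raw = max(0.0, math.log((n_words / 1000) ** 2))
    # Maximum value, reached at cap_words (13.815511 for 1,000,000 words).
    cap = math.log((cap_words / 1000) ** 2)
    return min(raw, cap) / cap


print(size_score(94217))
```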
Split: Found more than 10000 training words.
Split: Found at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.8.
Universal POS tags: 17 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 59858 out of 94217 total words have one or more features.
Features: source of annotation (from README) factor is 0.4.
Universal relations: 33 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 0.9.
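
The tag and relation scores reported later in this log are consistent with multiplying the inventory coverage by the README source factor. The sketch below illustrates that arithmetic only; `coverage_score` is a hypothetical helper, and the formula is an inference from the logged numbers rather than a confirmed description of the evaluation script.

```python
def coverage_score(found: int, known: int, source_factor: float) -> float:
    """Share of the known inventory found in the corpus, scaled by the source factor."""
    return found / known * source_factor


# Universal relations: 33 of 37 found, factor 0.9.
print(coverage_score(33, 37, 0.9))
# Universal POS tags: 17 of 17 found, factor 0.8.
print(coverage_score(17, 17, 0.8))
```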
Udapi:
TOTAL 4090
Udapi: found 4090 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 94217 words.
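
A minimal sketch of how the bug count could map to a score, assuming the score is one minus the ratio of bugs found to the worst-case threshold (one bug per 10 words), floored at zero. This assumption reproduces the score{udapi} value reported later in this log; `udapi_score` is a hypothetical name.

```python
def udapi_score(n_bugs: int, n_words: int, words_per_bug: int = 10) -> float:
    """Score in [0, 1]: 1 with no bugs, 0 at or beyond the threshold (assumed formula)."""
    if n_words <= 0:
        return 0.0
    # Worst expected case: one bug per `words_per_bug` words.
    threshold = n_words / words_per_bug
    return max(0.0, 1.0 - n_bugs / threshold)


print(udapi_score(4090, 94217))
```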
Genres: found 3 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang en --max-err=10 UD_English-LinES/en_lines-ud-dev.conllu
[Line 3587 Sent en_lines-ud-dev-doc2-3327 Node 16]: [L3 Warning leaf-det] 'det' not expected to have children (16:the:det --> 15:as:mark)
[Line 6515 Sent en_lines-ud-dev-doc3-3477 Node 11]: [L3 Warning leaf-det] 'det' not expected to have children (11:a:det --> 10:half:amod)
[Line 12630 Sent en_lines-ud-dev-doc5-3766 Node 2]: [L3 Warning leaf-det] 'det' not expected to have children (2:a:det --> 1:Such:amod)
[Line 19087 Sent en_lines-ud-dev-doc7-4045 Node 5]: [L3 Warning leaf-det] 'det' not expected to have children (5:the:det --> 3:they:nsubj)
Warnings: 4
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang en --max-err=10 UD_English-LinES/en_lines-ud-test.conllu
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang en --max-err=10 UD_English-LinES/en_lines-ud-train.conllu
[Line 9726 Sent en_lines-ud-train-doc2-460 Node 7]: [L3 Warning leaf-det] 'det' not expected to have children (7:the:det --> 6:as:mark)
[Line 9919 Sent en_lines-ud-train-doc2-466 Node 19]: [L3 Warning leaf-det] 'det' not expected to have children (19:all:det --> 18:that:mark)
[Line 22429 Sent en_lines-ud-train-doc3-1089 Node 15]: [L3 Warning leaf-det] 'det' not expected to have children (15:the:det --> 13:one:nsubj)
[Line 31501 Sent en_lines-ud-train-doc4-1464 Node 8]: [L3 Warning leaf-det] 'det' not expected to have children (8:a:det --> 10:a:appos)
[Line 53951 Sent en_lines-ud-train-doc7-2493 Node 25]: [L3 Warning leaf-det] 'det' not expected to have children (25:a:det --> 23:to:mark)
[Line 57681 Sent en_lines-ud-train-doc7-2677 Node 13]: [L3 Warning leaf-det] 'det' not expected to have children (13:a:det --> 12:such:amod)
Warnings: 6
*** PASSED ***
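
The validator messages above follow a regular shape, so they can be tallied by test ID with a small parser. This is a sketch based only on the message layout visible in this log; the regular expression and the names `PATTERN` and `parse_message` are assumptions, not part of validate.py.

```python
import re
from collections import Counter

# Layout inferred from this log:
# [Line N Sent ID Node N]: [L<level> <kind> <test-id>] <message>
PATTERN = re.compile(
    r"^\[Line (?P<line>\d+) Sent (?P<sent>\S+) Node (?P<node>\d+)\]: "
    r"\[L(?P<level>\d+) (?P<kind>\w+) (?P<test_id>[\w-]+)\] (?P<message>.*)$"
)


def parse_message(text):
    """Return a dict for a validator message line, or None for any other line."""
    match = PATTERN.match(text)
    if match is None:
        return None
    record = match.groupdict()
    for key in ("line", "node", "level"):
        record[key] = int(record[key])
    return record


sample = [
    "[Line 3587 Sent en_lines-ud-dev-doc2-3327 Node 16]: [L3 Warning leaf-det] "
    "'det' not expected to have children (16:the:det --> 15:as:mark)",
    "[Line 6515 Sent en_lines-ud-dev-doc3-3477 Node 11]: [L3 Warning leaf-det] "
    "'det' not expected to have children (11:a:det --> 10:half:amod)",
    "Warnings: 4",
    "*** PASSED ***",
]

records = [r for r in map(parse_message, sample) if r is not None]
counts = Counter(r["test_id"] for r in records)
print(counts)
```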
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.4) = 0.0307692307692308
(weight=0.0769230769230769) * (score{genres}=0.176470588235294) = 0.0135746606334842
(weight=0.0769230769230769) * (score{lemmas}=0.8) = 0.0615384615384615
(weight=0.256410256410256) * (score{size}=0.658043090527837) = 0.16872899757124
(weight=0.0512820512820513) * (score{split}=1) = 0.0512820512820513
(weight=0.0769230769230769) * (score{tags}=0.8) = 0.0615384615384615
(weight=0.307692307692308) * (score{udapi}=0.565895751297537) = 0.174121769630011
(weight=0.0769230769230769) * (score{udeprels}=0.802702702702703) = 0.0617463617463617
(TOTAL score=0.623299994709302) * (availability=1) * (validity=1) = 0.623299994709302
STARS = 3
UD_English-LinES 0.623299994709302 3
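
The total above is the weighted sum of the component scores, multiplied by availability and validity. The sketch below recomputes it from the logged weights and scores. The star rule, rounding the score to the nearest half star on a five-star scale, is an assumption that happens to match STARS = 3 here; the log itself does not state the rule.

```python
# (weight, score) pairs copied from the log lines above.
components = {
    "features": (0.0769230769230769, 0.4),
    "genres": (0.0769230769230769, 0.176470588235294),
    "lemmas": (0.0769230769230769, 0.8),
    "size": (0.256410256410256, 0.658043090527837),
    "split": (0.0512820512820513, 1.0),
    "tags": (0.0769230769230769, 0.8),
    "udapi": (0.307692307692308, 0.565895751297537),
    "udeprels": (0.0769230769230769, 0.802702702702703),
}
availability = 1
validity = 1

total = sum(weight * score for weight, score in components.values())
final = total * availability * validity

# Assumed star rule: nearest half star out of five.
stars = int(0.5 + final * 10) / 2

print(final, stars)
```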