Skip to content

Commit

Permalink
deploy: dd9efaa
Browse files Browse the repository at this point in the history
  • Loading branch information
Trondtr committed Apr 5, 2024
1 parent ce709dc commit 8cdad6e
Show file tree
Hide file tree
Showing 5 changed files with 66 additions and 2 deletions.
1 change: 0 additions & 1 deletion Links.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,6 @@
* `tools/`
* `grammarcheckers/`
* [grammarchecker.cg3](tools-grammarcheckers-grammarchecker.cg3.html) ([src](https://github.com/giellalt/lang-fkv/blob/main/tools/grammarcheckers/grammarchecker.cg3))
* `/`
* [grc-disambiguator.cg3](tools-grammarcheckers-grc-disambiguator.cg3.html) ([src](https://github.com/giellalt/lang-fkv/blob/main/tools/grammarcheckers/grc-disambiguator.cg3))
* `tokenisers/`
* [tokeniser-disamb-gt-desc.pmscript](tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html) ([src](https://github.com/giellalt/lang-fkv/blob/main/tools/tokenisers/tokeniser-disamb-gt-desc.pmscript))
Expand Down
8 changes: 8 additions & 0 deletions Makefile.in
Original file line number Diff line number Diff line change
Expand Up @@ -999,24 +999,32 @@ $(LINKS):
if test "x$$d1" != "x$$oldd1" ; then \
echo "* \`$$d1/\`" ;\
oldd1=$$d1 ;\
oldd2="";\
oldd3="";\
oldd4="";\
fi ; \
if test "x$$d2" = x ; then \
echo " * [$$docname]($$html) ([src]($(REPOURL)/$$doc))" ;\
elif test "x$$d2" != "x$$oldd2" ; then \
echo " * \`$$d2/\`" ;\
oldd2=$$d2 ;\
oldd3="";\
oldd4="";\
oldd5="";\
fi ; \
if test "x$$d3" = x -a "x$$d2" != x; then \
echo " * [$$docname]($$html) ([src]($(REPOURL)/$$doc))" ;\
elif test "x$$d3" != "x$$oldd3" ; then \
echo " * \`$$d3/\`" ;\
oldd3=$$d3 ;\
oldd4="";\
fi ; \
if test "x$$d4" = x -a "x$$d3" != x ; then \
echo " * [$$docname]($$html) ([src]($(REPOURL)/$$doc))" ;\
elif test "x$$d4" != "x$$oldd4" ; then \
echo " * \`$$d4/\`" ;\
oldd4=$$d4 ;\
oldd5="";\
fi ; \
if test "x$$d5" = x -a "x$$d4" != x ; then \
echo " * [$$docname]($$html) ([src]($(REPOURL)/$$doc))" ;\
Expand Down
1 change: 1 addition & 0 deletions index-header.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,7 @@ and in a Kven spellchecker.
* [Yamltestit maaliskuu 2019](YamltestitMaaliskuu2019.html)
* [n_11-feilit](n_11-feilit.html)
* [Kaikki generoidut paradigmat](KaikkiGeneroidutParadigmat.html)
* [Test diary](test-diary.md)

# In-source documentation

Expand Down
2 changes: 1 addition & 1 deletion index.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,7 @@ and in a Kven spellchecker.
* [Yamltestit maaliskuu 2019](YamltestitMaaliskuu2019.html)
* [n_11-feilit](n_11-feilit.html)
* [Kaikki generoidut paradigmat](KaikkiGeneroidutParadigmat.html)
* [Test diary](test-diary.md)

# In-source documentation

Expand Down Expand Up @@ -99,7 +100,6 @@ Below is an autogenerated list of documentation pages built from structured comm
* `tools/`
* `grammarcheckers/`
* [grammarchecker.cg3](tools-grammarcheckers-grammarchecker.cg3.html) ([src](https://github.com/giellalt/lang-fkv/blob/main/tools/grammarcheckers/grammarchecker.cg3))
* `/`
* [grc-disambiguator.cg3](tools-grammarcheckers-grc-disambiguator.cg3.html) ([src](https://github.com/giellalt/lang-fkv/blob/main/tools/grammarcheckers/grc-disambiguator.cg3))
* `tokenisers/`
* [tokeniser-disamb-gt-desc.pmscript](tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html) ([src](https://github.com/giellalt/lang-fkv/blob/main/tools/tokenisers/tokeniser-disamb-gt-desc.pmscript))
Expand Down
56 changes: 56 additions & 0 deletions test-diary.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,56 @@
Test diary
==========

This document writes down test statistics

The overal test command: `make check`

## yaml

The command:

`sh test/yaml-check.sh`

(data forthcoming)

## Lexical coverage
fkv
Number of words (standing in `lang-fkv`):

```
cat test/data/freecorpus.txt |\
hfst-tokenise tools/tokenisers/tokeniser-disamb-gt-desc.pmhfst |wc -l
```

Number of unknown words:

```
cat test/data/freecorpus.txt |\
hfst-tokenise tools/tokenisers/tokeniser-disamb-gt-desc.pmhfst |\
preprocess --corr=test/data/typos.txt|\
hfst-tokenise -cg tools/tokenisers/tokeniser-disamb-gt-desc.pmhfst |\
grep " ?"|cut -d'"' -f2|wc -l
```

Test with the full corpus (free + bound):



### Lexical coverage of freecorpus

The file is `test/data/freecorpus.txt`.

Coverage:

- 240405: 1-(42819/607401) = 0.9295

### Lexical coverage of free + bound

Coverage:







0 comments on commit 8cdad6e

Please sign in to comment.