Skip to content

Commit

Permalink
Merge branch 'lang-smn'
Browse files Browse the repository at this point in the history
  • Loading branch information
albbas committed May 13, 2020
1 parent 0cf6251 commit ae84c83
Show file tree
Hide file tree
Showing 552 changed files with 1,278,473 additions and 1 deletion.
6 changes: 6 additions & 0 deletions AUTHORS
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
Authors of gtlangs-smn package.

The following people have legal copyright to files and software in this
directory and its subdirectories:

__FIXME__
365 changes: 365 additions & 0 deletions INSTALL

Large diffs are not rendered by default.

4 changes: 4 additions & 0 deletions LICENCE
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
It is of extreme importance that you fill in this file with copyright
information about the morphology contained in this directory.

__FIXME__
65 changes: 65 additions & 0 deletions am-shared/00-DO-NOT-EDIT-THIS-DIRECTORY-readme.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,65 @@
README
======

This directory SHOULD NOT BE CHANGED MANUALLY.

It should ONLY BE CHANGED using an AUTOMATIC MERGE from the template dir.


HOW TO CHANGE
-------------

That is, if you find you need to change this dir, do as follows:

CHANGE THE TEMPLATE
-------------------

1) make your changes _without_ committing (or by using a branch)
2) test your changes
3) when ok, manually copy your changes to $GIELLA_TEMPLATES/langs/und/
4) edit $GIELLA_TEMPLATES/langs/und/und.timestamp to contain a change note
5) commit the changes in $GIELLA_TEMPLATES/langs/und/

PROPAGATE THE CHANGES
---------------------

Then, in each langs/ dir (langs/, startup-langs/, experiment-langs/), run the
following commands (it might be a good idea to check out these langs dirs
independently for this operation):

First time:

./autogen.sh
./configure

Each time:

`make templatemerge`

or

`make templatemerge USERNAME=<your-svn-username>`

This command will svn-merge the changes in $GIELLA_TEMPLATES/langs/und/ with
each language dir.

Review the merge results, if ok then commit (preferably all languages at once).


!!! WARNING !!!
===============

We want to avoid merge conflicts at any costs since this merge scheme is the
core of the scaleability of the infrastructure, so:

KEEPING THE am-shared DIR FREE OF LOCAL MODIFICATIONS IS IMPERATIVE FOR THE
MERGE SCHEME TO FUNCTION!


ALTERNATIVE
-----------

In the case the above procedure is too complicated, or doesn't fit the changes
you need, you can instead leave a (diff and) change request at
http://giellatekno.uit.no/bugzilla/, or just send an e-mail to
[email protected].
23 changes: 23 additions & 0 deletions am-shared/hfst-format-include.am
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
## Include this file in top-level srcdir to compile FSTs
## Copyright: Sámediggi/Divvun/UiT
## Licence: GPL v3+

if WITH_OFST_TROPICAL
HFST_FORMAT= --format=openfst-tropical
HFST_OLFORMAT= --format=optimized-lookup-weighted
endif # WITH_OFST_TROPICAL

if WITH_OFST_LOG
HFST_FORMAT= --format=openfst-log
HFST_OLFORMAT= --format=optimized-lookup-weighted
endif # WITH_OFST_LOG

if WITH_SFST
HFST_FORMAT= --format=sfst
HFST_OLFORMAT= --format=optimized-lookup-unweighted
endif # WITH_SFST

if WITH_FOMA
HFST_FORMAT= --format=foma
HFST_OLFORMAT= --format=optimized-lookup-unweighted
endif # WITH_FOMA
59 changes: 59 additions & 0 deletions am-shared/src-phonetics-dir-include.am
Original file line number Diff line number Diff line change
@@ -0,0 +1,59 @@
## Process this file with automake to produce Makefile.in

## Copyright (C) 2011 Samediggi

## This program is free software: you can redistribute it and/or modify
## it under the terms of the GNU General Public License as published by
## the Free Software Foundation, either version 3 of the License, or
## (at your option) any later version.

## This program is distributed in the hope that it will be useful,
## but WITHOUT ANY WARRANTY; without even the implied warranty of
## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
## GNU General Public License for more details.

## You should have received a copy of the GNU General Public License
## along with this program. If not, see <http://www.gnu.org/licenses/>.

####### Source file defs: ########

#! @param GT_PHONETICS_MAIN required, the source of phonetic transcription
#! @param GT_PHONETICS_SUPPLEMENTS optional, other sources to distribute and
#! compile for other things
#! Both @params are specified in $(GTLANG)/src/phonetics/Makefile.am.
GT_PHONETICS_SRCS=$(GT_PHONETICS_MAIN) $(GT_PHONETICS_SUPPLEMENTS)

# All sources need to be included in the tarball
EXTRA_DIST=$(GT_PHONETICS_SRCS)

####### Automake targets: ########

# The rule transducer will be built, not installed
GT_PHONETICS=

if WANT_PHONETIC
if CAN_HFST
GT_PHONETICS+=$(patsubst %.xfscript,%.lookup.hfst,$(GT_PHONETICS_SRCS))
GT_PHONETICS+=$(patsubst %.xfscript,%.compose.hfst,$(GT_PHONETICS_SRCS))
endif # CAN_HFST

if CAN_XFST
GT_PHONETICS+=$(patsubst %.xfscript,%.lookup.xfst,$(GT_PHONETICS_SRCS))
GT_PHONETICS+=$(patsubst %.xfscript,%.compose.xfst,$(GT_PHONETICS_SRCS))
endif # CAN_XFST
endif # WANT_PHONETIC

noinst_DATA=$(GT_PHONETICS)

####### Other targets: ###########
clean-local:
-rm -f *.hfst *.xfst

##########################################
# General build rules included from here:#
include $(top_srcdir)/am-shared/twolc-include.am
include $(top_srcdir)/am-shared/xfscript-include.am
include $(top_srcdir)/am-shared/lookup-include.am
include $(top_srcdir)/am-shared/silent_build-include.am

# vim: set ft=automake:
48 changes: 48 additions & 0 deletions am-shared/tools-mt-apertium-filters-dir-include.am
Original file line number Diff line number Diff line change
@@ -0,0 +1,48 @@
## Process this file with automake to produce Makefile.in

## Copyright (C) 2011 Samediggi

## This program is free software: you can redistribute it and/or modify
## it under the terms of the GNU General Public License as published by
## the Free Software Foundation, either version 3 of the License, or
## (at your option) any later version.

## This program is distributed in the hope that it will be useful,
## but WITHOUT ANY WARRANTY; without even the implied warranty of
## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
## GNU General Public License for more details.

## You should have received a copy of the GNU General Public License
## along with this program. If not, see <http://www.gnu.org/licenses/>.

####### Source file and target defs: ########

# These will be included in the distribution:
EXTRA_DIST=$(GT_MT_FILTER_SRCS)

MT_FILTER_ALL_SRCS=$(GT_MT_FILTER_SRCS) $(GENERATED_REGEXES)

# This is what we build:
noinst_DATA=$(GT_FILTER_TARGETS)

# Automatically detect the targets to be built:
GT_FILTER_TARGETS=

if WANT_APERTIUM
if CAN_HFST
GT_FILTER_TARGETS+=$(patsubst %.regex,%.hfst,$(MT_FILTER_ALL_SRCS))
endif # CAN_HFST
endif # WANT_APERTIUM

####### Other targets: ###########
clean-local:
-rm -f *.hfstol *.xfst *.foma *.hfst

##########################################
# General build rules included from here:#

include $(top_srcdir)/am-shared/regex-include.am
include $(top_srcdir)/am-shared/hfst-format-include.am
include $(top_srcdir)/am-shared/silent_build-include.am

# vim: set ft=automake:
63 changes: 63 additions & 0 deletions am-shared/tools-spellcheckers-fstbased-desktop-foma-dir-include.am
Original file line number Diff line number Diff line change
@@ -0,0 +1,63 @@
## Process this file with automake to produce Makefile.in

## Copyright (C) 2011 Samediggi

## This program is free software: you can redistribute it and/or modify
## it under the terms of the GNU General Public License as published by
## the Free Software Foundation, either version 3 of the License, or
## (at your option) any later version.

## This program is distributed in the hope that it will be useful,
## but WITHOUT ANY WARRANTY; without even the implied warranty of
## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
## GNU General Public License for more details.

## You should have received a copy of the GNU General Public License
## along with this program. If not, see <http://www.gnu.org/licenses/>.

GT_SPELLER_HFST=generator-fstspeller-gt-norm.hfst
GT_FOMA_ACCEPTOR=acceptor-$(GTLANG).foma

####### Automake targets: ########

if WANT_SPELLERS
if CAN_HFST
if CAN_FOMA_SPELLER
hfstdatadir=$(datadir)/giella/$(GTLANG)
hfstdata_DATA=$(GT_FOMA_ACCEPTOR)
endif #CAN_FOMA_SPELLER
endif # CAN_HFST
endif # WANT_SPELLERS

####### HFST build rules: ########

# Build the automaton used for the speller
generator-fomaspeller-gt-norm.hfst: \
$(top_builddir)/tools/spellcheckers/fstbased/$(GT_SPELLER_HFST) \
$(top_builddir)/src/filters/remove-dialect-tags.hfst \
$(top_builddir)/src/filters/remove-homonymy-tags.hfst \
$(top_builddir)/src/filters/remove-variant-tags.hfst \
$(top_builddir)/src/filters/remove-number-string-tags.hfst \
$(top_builddir)/src/filters/remove-usage-tags.hfst \
$(top_builddir)/src/filters/remove-word-boundary.hfst
$(AM_V_GEN)$(PRINTF) "\
@\"$(top_builddir)/src/filters/remove-dialect-tags.hfst\" \
.o. @\"$(top_builddir)/src/filters/remove-homonymy-tags.hfst\" \
.o. @\"$(top_builddir)/src/filters/remove-variant-tags.hfst\" \
.o. @\"$(top_builddir)/src/filters/remove-number-string-tags.hfst\" \
.o. @\"$(top_builddir)/src/filters/remove-usage-tags.hfst\" \
.o. @\"$<\" \
.o. @\"$(top_builddir)/src/filters/remove-word-boundary.hfst\" \
;" \
| $(HFST_REGEXP2FST) $(HFST_FLAGS) -S --xerox-composition=ON \
> $@

$(GT_FOMA_ACCEPTOR): generator-fomaspeller-gt-norm.hfst
$(AM_V_GEN)$(HFST_FST2FST) $(HFST_FLAGS) -f foma -b -i $< \
| $(GZIP) > $@

####### Other targets: ###########
clean-local:
-rm -f *.foma

# vim: set ft=automake:
54 changes: 54 additions & 0 deletions am-shared/tools-spellcheckers-fstbased-mobile-vfst-dir-include.am
Original file line number Diff line number Diff line change
@@ -0,0 +1,54 @@
## Process this file with automake to produce Makefile.in

## Copyright (C) 2011 Samediggi

## This program is free software: you can redistribute it and/or modify
## it under the terms of the GNU General Public License as published by
## the Free Software Foundation, either version 3 of the License, or
## (at your option) any later version.

## This program is distributed in the hope that it will be useful,
## but WITHOUT ANY WARRANTY; without even the implied warranty of
## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
## GNU General Public License for more details.

## You should have received a copy of the GNU General Public License
## along with this program. If not, see <http://www.gnu.org/licenses/>.

# Source fst dir:
SOURCE_DIR=../hfst

# Hfst source variables:
GT_ACCEPTOR_SRC_HFST=acceptor.default.hfst
GT_ERRMODEL_SRC_HFST=errmodel.default.hfst

# Vfst target variables
GT_ACCEPTOR_VFST=
GT_ERRMODEL_VFST=

####### Automake targets: ########

if WANT_SPELLERS
if CAN_HFST
if WANT_VFST_SPELLER
GT_ACCEPTOR_VFST+=spl.vfst
GT_ERRMODEL_VFST+=err.vfst
endif # WANT_VFST_SPELLER
endif # CAN_HFST
endif # WANT_SPELLERS

noinst_DATA=$(GT_ACCEPTOR_VFST) $(GT_ERRMODEL_VFST)

####### Build rules: ########

$(GT_ACCEPTOR_VFST): $(SOURCE_DIR)/$(GT_ACCEPTOR_SRC_HFST)
$(AM_V_FST2TXT)$(HFST_FST2TXT) $< | sort -n | $(VOIKKOVFSTC) -w log -o $@

$(GT_ERRMODEL_VFST): $(SOURCE_DIR)/$(GT_ERRMODEL_SRC_HFST)
$(AM_V_FST2TXT)$(HFST_FST2TXT) $< | sort -n | $(VOIKKOVFSTC) -w log -o $@

####### Other targets: ###########
clean-local:
-rm -f *.vfst

include $(top_srcdir)/am-shared/silent_build-include.am
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
## Process this file with automake to produce Makefile.in

## Copyright (C) 2011 Samediggi

## This program is free software: you can redistribute it and/or modify
## it under the terms of the GNU General Public License as published by
## the Free Software Foundation, either version 3 of the License, or
## (at your option) any later version.

## This program is distributed in the hope that it will be useful,
## but WITHOUT ANY WARRANTY; without even the implied warranty of
## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
## GNU General Public License for more details.

## You should have received a copy of the GNU General Public License
## along with this program. If not, see <http://www.gnu.org/licenses/>.

####### Automake targets: ########

if WANT_SPELLERS
noinst_DATA=$(GT_CLEAN_SPELLER_CORPUS)
endif # WANT_SPELLERS

# Clean the corpus data if only raw data is found:
%.clean.txt: %.raw.txt
$(AM_V_GEN)$(GTCORE)/scripts/clean_weight_corpus.sh $< $@


####### Other targets: ###########
clean-local:
-rm -f *.clean.txt
Loading

0 comments on commit ae84c83

Please sign in to comment.