Skip to content

Scripts for building a Docker image for the FinnPos ftb-label tagger

License

Notifications You must be signed in to change notification settings

ruokolt/ftb-label

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Building a Docker image for ftb-label

This repository contains scripts to build a local Docker image for the ftb-label tagger of FinnPos, a morphological tagging and lemmatization tool for Finnish. To build the image, run:

$ git clone https://github.com/teemu-ruokolainen/ftb-label.git
$ cd ftb-label
$ docker build --tag your_preferred_image_name .

The resulting image has been published in the Docker Hub where Docker can find it automatically.

To predict morphological tags and lemmas for running Finnish tex, run:

$ echo "Tämä on hyvä esimerkkilause!" | docker run -i teemuruokolainen/ftb-label:latest

The output looks like this:

Tämä _ tämä [POS=PRONOUN]|[SUBCAT=DEMONSTRATIVE]|[NUM=SG]|[CASE=NOM] _
on _ olla [POS=VERB]|[VOICE=ACT]|[MOOD=INDV]|[TENSE=PRESENT]|[PERS=SG3] _
esimerkkilause _ esimerkkilause [POS=NOUN]|[NUM=SG]|[CASE=NOM] _
! _ ! [POS=PUNCTUATION] _ \

To predict morphological tags and lemmas for already tokenized Finnish text, run:

$ echo "Huomasitko , että tämä teksti on jo tokenisoitu ?" | docker run -i teemuruokolainen/ftb-label:latest notokenize

To predict morphological tags and lemmas for collections of texts (e.g. sentences or documents), run:

$ cat text-file.txt | docker run -i teemuruokolainen/ftb-label:latest

where text-file.txt should contain one text per row.

About

Scripts for building a Docker image for the FinnPos ftb-label tagger

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published