Extend the Extraction Framework for your language #9

Open
mgns opened this issue Jan 23, 2018 · 4 comments
Labels
warmup-task Warmup task to practice before applying for GSoC.

Comments

@mgns
Member

mgns commented Jan 23, 2018

Effort

1-2 days

Skills

basic maven, scala

Description

The DBpedia extraction framework has a default configuration that is language-agnostic. However, language-specific configuration can boost the coverage and precision of the extracted data for that particular language. We keep all language-specific configurations here. Browse through the code and see how you can improve existing languages or provide configuration for a new one.

Impact

Improvements in the data quality & quantity for a particular language
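
To make the idea of a language-specific configuration layered over a language-agnostic default more concrete, here is a minimal Scala sketch. It is not taken from the extraction framework: `LanguageConfigSketch`, `NumberFormat`, `formatFor` and `parseNumber` are hypothetical names, and the only assumption about real-world data is that German writes numbers with a comma as the decimal separator and a period for grouping.

```scala
// Hypothetical illustration only: none of these names exist in the
// DBpedia extraction framework.
object LanguageConfigSketch {

  // Separators used when reading numbers written in a given language.
  final case class NumberFormat(decimal: Char, grouping: Char)

  // Language-agnostic default, used when no override is configured.
  val default: NumberFormat = NumberFormat(decimal = '.', grouping = ',')

  // Language-specific overrides, keyed by wiki language code.
  val overrides: Map[String, NumberFormat] = Map(
    "de" -> NumberFormat(decimal = ',', grouping = '.')
  )

  // Look up the override for a language, falling back to the default.
  def formatFor(lang: String): NumberFormat =
    overrides.getOrElse(lang, default)

  // Drop grouping separators, normalise the decimal separator, then parse.
  // Returns None when the text is not a number under that format.
  def parseNumber(text: String, lang: String): Option[Double] = {
    val fmt = formatFor(lang)
    val normalised =
      text.trim.filter(_ != fmt.grouping).replace(fmt.decimal, '.')
    scala.util.Try(normalised.toDouble).toOption
  }
}
```

With this sketch, `parseNumber("1.234,5", "de")` reads the value as 1234.5, whereas the same string under the default format would be misread, which is the kind of precision gain a language-specific entry is meant to provide.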

@AnubhavUjjawal

I created a pull request in reference to this warm-up task.

@JJ-Author
Contributor

See also, e.g., dbpedia/extraction-framework@f60edd4, which is not yet merged to master but shows some language-specific configuration for the number parser.

@AnubhavUjjawal

OK. Should I check whether the language-specific configurations made in dbpedia/extraction-framework@f60edd4 are correct?

@JJ-Author
Contributor

No, it was just an additional note, so that other people who use this as a warm-up task do not duplicate work already done ;-). Luckily, in your case that did not happen.

4 participants