This repository has been archived by the owner on Dec 21, 2023. It is now read-only.

cmdi metadata file
dirkroorda committed Mar 7, 2022
1 parent 97456a6 commit 1e9cfc8
Showing 26 changed files with 1,609 additions and 40 deletions.
2 changes: 2 additions & 0 deletions .gitignore
@@ -17,3 +17,5 @@ site/*
build/*
log/*
_local
_local/*
_docker/*
211 changes: 211 additions & 0 deletions docs/cmdi/SHEBANQ.cmdi.xml
@@ -0,0 +1,211 @@
<?xml version='1.0' encoding='UTF-8'?>
<CMD xmlns="http://www.clarin.eu/cmd/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" CMDVersion="1.1" xsi:schemaLocation="http://www.clarin.eu/cmd/ http://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/profiles/clarin.eu:cr1:p_1342181139640/xsd ">
<Header>
<MdCreator>rogierkraf</MdCreator>
<MdCreationDate>2013-11-30+02:00</MdCreationDate>
<MdProfile>clarin.eu:cr1:p_1342181139640</MdProfile>
<MdCollectionDisplayName>CLARIN Netherlands</MdCollectionDisplayName></Header>
<Resources>
<ResourceProxyList>
<ResourceProxy id="SHEBANQ001">
<ResourceType>SearchPage</ResourceType>
<ResourceRef>https://shebanq.ancient-data.org/</ResourceRef>
</ResourceProxy>
<ResourceProxy id="SHEBANQ002">
<ResourceType>Resource</ResourceType>
<ResourceRef>https://github.com/ETCBC/bhsa</ResourceRef>
</ResourceProxy>
<ResourceProxy id="SHEBANQ003">
<ResourceType>Resource</ResourceType>
<ResourceRef>https://doi.org/10.5281/zenodo.1007624</ResourceRef>
</ResourceProxy>
<ResourceProxy id="SHEBANQ004">
<ResourceType>Resource</ResourceType>
<ResourceRef>https://doi.org/10.17026/dans-2z3-arxf</ResourceRef>
</ResourceProxy>
<ResourceProxy id="SHEBANQ005">
<ResourceType>Resource</ResourceType>
<ResourceRef>https://doi.org/10.17026/dans-z6y-skyh</ResourceRef>
</ResourceProxy>
</ResourceProxyList>
<JournalFileProxyList/>
<ResourceRelationList/>
</Resources>
<Components>
<ClarinSoftwareDescription>
<GeneralInfo>
<name xml:lang="eng">SHEBANQ</name>
<title xml:lang="eng">SHEBANQ: System for HEBrew Text: ANnotations for Queries and Markup</title>
<publicationYear>2014</publicationYear>
<url>https://shebanq.ancient-data.org/</url><CLARINCentre>DANS</CLARINCentre><OriginalSource>https://github.com/ETCBC/shebanq/blob/master/docs/cmdi/SHEBANQ.cmdi.xml</OriginalSource>
<ReleaseStatus>
<LifeCycleStatus>published</LifeCycleStatus>
<lastUpdate>2022-03-07</lastUpdate>
</ReleaseStatus>
<NationalProjects><Project><name>CLARIN-NL</name><title>CLARIN in the Netherlands</title><id>184.021.003</id><funder>NWO</funder><url>http://www.clarin.nl</url><Contact><Person>Jan Odijk</Person><Role>National Coordinator</Role><Address>Utrecht, the Netherlands</Address><Email>[email protected]</Email><Department>UiL-OTS</Department><Organisation>Utrecht University</Organisation></Contact><Duration><StartYear>2009</StartYear><CompletionYear>2015</CompletionYear></Duration></Project></NationalProjects><Country><CountryName>Netherlands</CountryName><CountryCoding>NL</CountryCoding></Country><Description>
<Description>The Hebrew Text Database by the ETCBC (Eep Talstra Centre for Bible and Computer, formerly WIVU, Werkgroep Informatica Vrije Universiteit) contains the Biblia Hebraica Stuttgartensia Amstelodamensis (BHSA) dataset, which consists of the BHS version of the text of the Hebrew Bible plus linguistic annotations by the ETCBC. The annotations have been produced in the form of features of words, phrases, clauses, sentences, etc. The information in these features results from algorithmic linguistic analysis aided by human decisions. The database can be queried by means of a language that is optimized to deal with data that is modeled as objects + features. SHEBANQ builds a bridge between the linguistically annotated Hebrew text corpus and biblical scholars by (1) making this text, including its annotations, available to scholars; (2) demonstrating how queries can function to address research questions; the saved queries and the metadata added to them will be a growing repository of valuable best practices, showing which queries are used in addressing research questions and how they contribute to answering these questions; (3) giving textual scholarship a more empirical basis, by creating the opportunity that claims made in scholarly articles (e.g. “this syntactic pattern is not attested elsewhere in the Hebrew Bible”) can be accompanied by the unique identifiers that refer to the saved queries that have led to the claim. The WIVU database is a resource under long-term development: new features are added and corrections are made over time. When SHEBANQ was created, it was fed with textual/linguistic data in LAF format, produced by an ad hoc conversion. Nowadays SHEBANQ's data resides in https://github.com/ETCBC/bhsa, and there is a pipeline from the ETCBC production environment to the BHSA on GitHub and from there to the databases that power SHEBANQ. See https://github.com/ETCBC/pipeline.</Description>
</Description>
</GeneralInfo>
<SoftwareFunction>
<toolCategory>search tool</toolCategory>
<toolCategory>written language tool</toolCategory>
<ToolTasks><toolTask>corpus exploration</toolTask>
<toolTask>corpus searching</toolTask>
<toolTask>browsing</toolTask>
<toolTask>analysis</toolTask>
<toolTask>visualisation</toolTask>
</ToolTasks><ResearchPhases><ResearchPhase>Browsing and Searching</ResearchPhase><ResearchPhase>Data analysis</ResearchPhase></ResearchPhases><ResearchDomains>
<researchDomain>Linguistics</researchDomain>
<researchDomain>Religion Studies</researchDomain>
</ResearchDomains>
<LinguisticsSubject>
<linguisticsSubject>historical linguistics</linguisticsSubject>
<Description>
<Description/>
</Description>
</LinguisticsSubject>
<LinguisticsSubject>
<linguisticsSubject>syntax</linguisticsSubject>
<Description>
<Description/>
</Description>
</LinguisticsSubject>
<LinguisticsSubject>
<linguisticsSubject>morpho-syntax</linguisticsSubject>
<Description>
<Description/>
</Description>
</LinguisticsSubject>
<LanguageVariety>
<languageDependent>yes</languageDependent>
<Language><LanguageName>Hebrew</LanguageName><ISO639><iso-639-3-code>hbo</iso-639-3-code></ISO639></Language><Centuries>
<centuryDependent>yes</centuryDependent>
<CenturyInterval>
<centuryFrom>-12</centuryFrom>
<centuryThrough>02</centuryThrough>
</CenturyInterval>
</Centuries>
</LanguageVariety>
<LanguageVariety>
<languageDependent>yes</languageDependent>
<Language><LanguageName>Aramaic</LanguageName><ISO639><iso-639-3-code>arc</iso-639-3-code></ISO639></Language><Centuries>
<centuryDependent>yes</centuryDependent>
<CenturyInterval>
<centuryFrom>-12</centuryFrom>
<centuryThrough>02</centuryThrough>
</CenturyInterval>
</Centuries>
</LanguageVariety>
</SoftwareFunction>
<SoftwareImplementation>
<distributionMedium>Online available</distributionMedium>
<UserInterface>
<interfaceType>graphical user interface</interfaceType>
<applicationType>web application</applicationType>
</UserInterface>
<Input>
<inputType>text</inputType>
</Input>
<Output>
<outputType>text</outputType>
<characterEncoding>UTF8</characterEncoding>
<outputResource>query results</outputResource>
<Schema><schemaname/></Schema><MimeType>
<MimeType>text/plain</MimeType>
</MimeType>
</Output>
<Output>
<outputType>text</outputType>
<characterEncoding>UTF8</characterEncoding>
<outputResource>query results</outputResource>
<Schema><schemaname/></Schema><MimeType>
<MimeType>text/csv</MimeType>
</MimeType>
</Output>
</SoftwareImplementation>
<Access>
<ResourceLicense>
<license>Creative Commons (non-commercial)</license>
<distributionType>public</distributionType>
<Price>
<amount>0</amount>
<ISO4217>
<iso-4217-currency>EUR</iso-4217-currency>
</ISO4217>
</Price>
</ResourceLicense>
<Contact>
<Person>Dirk Roorda</Person>
<Email>[email protected]</Email>
<Organisation xml:lang="eng">Data Archiving and Networked Services (DANS)</Organisation>
</Contact>
</Access>
<ResourceDocumentation>
<Documentation>
<title>SHEBANQ - help</title>
<documentationTarget>user</documentationTarget>
<url>https://github.com/ETCBC/shebanq/wiki</url>
<ISO639>
<iso-639-3-code>eng</iso-639-3-code>
</ISO639>
</Documentation>
<Publication>
<publicationCategory>article</publicationCategory>
<publicationPurpose>background on the data resource</publicationPurpose>
<peerReviewStatus>yes</peerReviewStatus>
<Description>
<Description LanguageID="nld">Roorda, D. (2018). Coding the Hebrew Bible, Research Data Journal for the Humanities and Social Sciences, Volume 3, Issue 1, July 31, 2018, pp. 27-41. Leiden, Brill https://doi.org/10.1163/24523666-01000011
</Description>
</Description>
</Publication>
<Publication><publicationCategory>in book</publicationCategory><publicationPurpose>scientific background</publicationPurpose><peerReviewStatus>yes</peerReviewStatus><Description><Description LanguageID="eng">Roorda, D. 2017. The Hebrew Bible as Data: Laboratory - Sharing - Experiences. In: Odijk, J and van Hessen, A. (eds.) CLARIN in the Low Countries, Pp. 217–229. London: Ubiquity Press. DOI: https://doi.org/10.5334/bbi.18. License: CC-BY 4.0</Description></Description></Publication><Publication>
<publicationCategory>unpublished</publicationCategory>
<publicationPurpose>scientific background</publicationPurpose>
<peerReviewStatus>no</peerReviewStatus>
<Description>
<Description LanguageID="eng">Roorda, D. (2015). The Hebrew Bible as Data: Laboratory - Sharing - Experiences http://arxiv.org/abs/1501.01866
</Description>
</Description>
</Publication>
<Publication>
<publicationCategory>article</publicationCategory>
<publicationPurpose>scientific background</publicationPurpose>
<peerReviewStatus>yes</peerReviewStatus>
<Description>
<Description LanguageID="nld">Roorda, D. (2014). LAF-Fabric: a data analysis tool for Linguistic Annotation Framework with an application to the Hebrew Bible, Computational Linguistics in the Netherlands Journal, Volume 4, December 2014, pp. 105-109 http://www.clinjournal.org/sites/clinjournal.org/files/08-Roorda-etal-CLIN2014.pdf and http://arxiv.org/abs/1410.0286
</Description>
</Description>
</Publication>
<Pictures>
<picture type="other">
https://shebanq.ancient-data.org/static/apple-touch-icon.png
</picture>
</Pictures>
</ResourceDocumentation>
<SoftwareDevelopment>
<Project>
<name>SHEBANQ</name>
<title>SHEBANQ: System for HEBrew Text: ANnotations for Queries and Markup</title>
<funder>CLARIN-NL</funder>
<url>http://portal.clarin.nl/node/4180</url>
<Contact>
<Email>[email protected]</Email>
</Contact>
<Duration/>
</Project>
<Creator>
<Contact>
<Person>Prof. Dr. W.T. (Willem) van Peursen</Person>
<Email>[email protected]</Email>
</Contact>
</Creator>
</SoftwareDevelopment>
<TechnicalInfo>
<ImplementationLanguage>
<implementationLanguage>Python</implementationLanguage>
<version>continuous</version>
</ImplementationLanguage>
</TechnicalInfo>
</ClarinSoftwareDescription>
</Components>
</CMD>
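The `ResourceProxy` entries in the CMDI file above can be listed with a few lines of shell. This is a minimal sketch, assuming each `<ResourceRef>` element sits on its own line as it does in this file; the function name `list_resource_refs` and the inline sample are illustrative and not part of the repository. A line-oriented `sed` filter is not an XML parser, so anything beyond a quick check would be better served by a real one.

```shell
#!/bin/bash
# Hypothetical helper: print the URL inside every <ResourceRef> element.
# Assumes one <ResourceRef>...</ResourceRef> pair per line, as in SHEBANQ.cmdi.xml.
list_resource_refs() {
    sed -n 's:.*<ResourceRef>\(.*\)</ResourceRef>.*:\1:p' "$1"
}

# Small sample copied from the file above, written to a temporary file.
sample=$(mktemp)
cat > "$sample" <<'EOF'
<ResourceProxy id="SHEBANQ001">
<ResourceType>SearchPage</ResourceType>
<ResourceRef>https://shebanq.ancient-data.org/</ResourceRef>
</ResourceProxy>
<ResourceProxy id="SHEBANQ002">
<ResourceType>Resource</ResourceType>
<ResourceRef>https://github.com/ETCBC/bhsa</ResourceRef>
</ResourceProxy>
EOF

# Prints one URL per line.
list_resource_refs "$sample"
rm -f "$sample"
```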
40 changes: 0 additions & 40 deletions scripts/Dockerfile

This file was deleted.

1 change: 1 addition & 0 deletions scripts/docker/build-shebanq-db.sh
@@ -0,0 +1 @@
docker build -t shebanq-db "$DOCKER_DIR/shebanq-db"
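The one-line build script relies on `DOCKER_DIR` being set, and the `configtemplate.sh` shown in this commit does not define it, so it presumably comes from elsewhere. A guarded variant might look like the sketch below. The function name `build_shebanq_db` and the `DRY_RUN` switch are assumptions made for illustration; neither exists in the repository.

```shell
#!/bin/bash
# Hypothetical wrapper around the one-line build script above.
# DRY_RUN is an assumed switch, not something the repository defines.
build_shebanq_db() {
    # Abort with a message if DOCKER_DIR is unset or empty.
    : "${DOCKER_DIR:?DOCKER_DIR is not set; source your config first}"
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Only show the command that would be run.
        echo "docker build -t shebanq-db $DOCKER_DIR/shebanq-db"
    else
        docker build -t shebanq-db "$DOCKER_DIR/shebanq-db"
    fi
}

# Dry run in a subshell so the example values do not leak into the caller.
( DOCKER_DIR=/tmp/example; DRY_RUN=1; build_shebanq_db )
```

The `${VAR:?message}` expansion makes the script stop early instead of building from the path `/shebanq-db` when the variable is missing.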
112 changes: 112 additions & 0 deletions scripts/docker/configtemplate.sh
@@ -0,0 +1,112 @@
#!/bin/bash

# BEFORE YOU START !!
#
# before running any docker script:
#
# - read docker.md
# - tweak the parameters in the next section
# - read and understand the script that you want to run

# ALL_CAPS variables are used by the other scripts
# camelCase variables only occur in this file

# TWEAKING PART #################################################
#
# !!!!!!!!
# CAUTION: make sure you are not editing this content in its
# location scripts/docker/configtemplate.sh .
# Instead, create a copy _dockerconfig/config.sh and modify that.
# The directory _dockerconfig will not be pushed online.

# Do not tweak the original files in the scripts/docker directory
# !!!!!!!!
#
# Adapt the following settings to your situation before
# running the docker scripts.
#
#
# Version of the Emdros software that is in use.
# see also https://emdros.org
#
emdrosVersion="3.7.3"
#
#
# Version of the Web2py software that is in use.
# see also https://github.com/web2py/web2py
#
web2pyVersion="2.21.1-stable"
#
#
#
# Versions of the ETCBC data that you want to install/update
# NB: the shebanq software has hardcoded references to these versions.
#
STATIC_VERSIONS="4 4b c 2017 2021"
#
#
# Where backups of the user-generated data of shebanq can be found
# When installing shebanq on a new server with a
# new database, you should make a backup of this data from the current
# server with ./backup.sh which will copy it here.
# The provision script will copy that over to the server and
# import it in the new database.
# There is a separate setting for backups made on
# the test server, the new production server, and the new other server.
# So you will not inadvertently restore a non-production backup
# to the production server.
#
BACKUP_DIR=~/Documents/shebanq/backups # point to an existing dir
#
#
# Where your local github directory resides, under which
# shebanq has been cloned.
#
githubBase=~/github
#
#
#
# Where the Apache config files are on the server
APACHE_DIR="/etc/httpd/conf.d"
#
#
# Server specifications
#
# SERVER is the server name as internet address.
#
# SERVER_URL is the url of shebanq when served from this server.
#
# DB_HOST is the host server where the mysql database resides.
# We assume mysql is served on another server.
# There are options whether to import the static and dynamic
# data into this database.
# In that case we do create database users, administer database grants,
# and fill database tables.
# We also assume that the grants of the database server
# are not host specific, so that when we access the database from a
# new server, the same grants apply as when we used the old server.
#
# We also configure mail settings: server and sender on behalf
# of which SHEBANQ sends emails to users (for password verification)
# Make sure that your server is set up so that it is permitted
# to send mail for this user.
# If you do not want the server to send mail, put an empty value
# in the mailSender... fields.
#
# Your username on the server
#
SERVER_USER="you" #replace by your user name on the server
#
#
SERVER="machine.server.edu" # replace by your own
SERVER_URL="shebanq.mydomain.org" # replace by your own
DB_HOST="mysql.server.edu" # replace by your own
MYSQL_SHEBANQ=xxx # obtain yourself
MYSQL_SHEBANQ_ADMIN=yyy # obtain yourself
CERT_FILE=/etc/pki/tls/certs/other_server_edu.cer
CERT_KEY=/etc/pki/tls/private/other_server_edu.key
CERT_CHAIN=/etc/pki/tls/certs/other_server_edu.cer
MAIL_SERVER=localhost
[email protected]
#
# END TWEAKING PART #################################################
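Since the other scripts depend on the ALL_CAPS settings, a copied config can be checked before any of them runs. The sketch below is one way to do that, under stated assumptions: the function name `missing_settings` is hypothetical, the list of required names is simply read off the template above, and the temporary file stands in for a real `config.sh`.

```shell
#!/bin/bash
# Hypothetical check, not part of the repository: print the names of
# required settings that are unset or empty after sourcing a config file.
missing_settings() {
    (
        # Source in a subshell so the caller's environment stays untouched.
        . "$1" || exit 1
        for name in SERVER_USER SERVER SERVER_URL DB_HOST BACKUP_DIR APACHE_DIR; do
            # Look up the value of the variable whose name is stored in $name.
            eval "value=\${$name:-}"
            [ -n "$value" ] || echo "$name"
        done
    )
}

# Example: a partial config written to a temporary file.
cfg=$(mktemp)
cat > "$cfg" <<'EOF'
SERVER_USER="you"
SERVER="machine.server.edu"
SERVER_URL="shebanq.mydomain.org"
DB_HOST="mysql.server.edu"
EOF

# Lists every required name the partial config leaves unset.
missing_settings "$cfg"
rm -f "$cfg"
```

An empty result means every listed setting has a value; it does not check that the values are correct.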