Skip to content

htrc/Metadata-extract-seq-files

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scala CI codecov GitHub release (latest SemVer including pre-releases)

Metadata-extract-seq-files

This tool can be used to extract sequence files

Build

sbt clean stage - generates the unpacked, runnable application in the target/universal/stage/ folder.
sbt clean universal:packageBin - generates an application ZIP file

Usage

Note: Must use one of the supported JVMs for Apache Spark (at this time Java 8 through Java 11 are supported)

extract-seq-files <version>
HathiTrust Research Center
  -c, --cores  <N>                      (Optional) The number of CPU cores to
                                        use (if not specified, uses all
                                        available cores)
  -f, --format  <pairtree|stubbytree>   (Optional) One of pairtree or stubbytree
                                        (default = pairtree)
  -l, --log-level  <LEVEL>              (Optional) The application log level;
                                        one of INFO, DEBUG, OFF (default = INFO)
  -o, --output  <DIR>                   Write the output to DIR
      --spark-log  <FILE>               (Optional) Where to write logging output
                                        from Spark to
  -h, --help                            Show help message
  -v, --version                         Show version of this program

 trailing arguments:
  input (required)   The path to the folder containing the sequence file parts

About

Tool for extracting files out of sequence files

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages