Releases: macmillancontentscience/wordpiece

wordpiece 2.1.3

03 Mar 16:21
3eb92c7
  • Implemented various performance optimizations, improving speed and memory usage by over 1000x. (#27, @jonathanbratt)
  • Removed purrr dependency. (#30, @jonthegeek)

wordpiece 2.0.1

03 Mar 16:16
3c96823
  • Refactored wordpiece_tokenize to accept a character vector with length > 1. This makes the package easier to use within a workflow, but it is a breaking change: the output is now a list of character vectors instead of a single character vector, so scripts written for the previous version will need updating. (@jonthegeek)
  • Added a pair of default vocabularies via the {wordpiece.data} package. (@jonthegeek)

Initial CRAN release

03 Mar 16:12
3c0cfb7