audubon 0.5.2

Corrected probabilistic IDF calculation by global_idf3.
Refactored bind_tf_idf2.
- Changed behavior when norm=TRUE. Cosine nomalization is now performed on tf_idf values as in the RMeCab package.
- Added tf="itf" and idf="df" options.
Refactored pack for performance.

audubon 0.5.1

Refactored tokenize_mecab and tokenize_sudachipy.

audubon 0.5.0

Added bind_lr function which can calculate the ‘LR’ value of bigrams.
pack now always returns a tibble, not a data.frame.

audubon 0.4.0

Added some new functions.
- bind_tf_idf2 can calculate and bind the term frequency, inverse document frequency, and tf-idf of the tidy text dataset.
- collapse_tokens, mute_tokens, and lexical_density can be used for handling a tidy text dataset of tokens.
strj_tokenize now preserves the original order of text names.
prettify now can get delim argument.

audubon 0.3.0

Updated strj_fill_iter_mark function.
- strj_fill_iter_mark now replaces a sequence of iteration marks recursively.
Updated strj_tokenize function.
- strj_tokenize now can retrieve engine argument to switch tokenizers for splitting text into tokens.

audubon 0.2.0

Updated ngram_tokenizer function.
Added a wrapper function of the ‘TinySegmenter’ written by Taku Kudo.

audubon 0.1.2

Updated pack function.
- Switched arguments order of pack function. pack now accepts pull as its second argument and n as its third argument.
- pull now can accept a symbol.

audubon 0.1.1

Updated documentation.

audubon 0.1.0

Relicensed as Apache License, Version 2.0.
Added a NEWS.md file to track changes to the package.