Skip to content

textrecipes (development version)

  • Documentation for tidy methods for all steps has been improved to describe the return value more accurately. (#262)

  • Calling ?tidy.step_*() now sends you to the documentation for step_*() where the outcome is documented. (#261)

  • step_textfeatures() has been made faster and more robust. (#265)

  • Fixed bug in step_clean_levels() where it would produce NAs for character columns. (#274)

textrecipes 1.0.6

CRAN release: 2023-11-15

  • textfeatures has been removed from Suggests. (#255)

  • step_textfeatures() no longer returns a politeness feature. (#254)

textrecipes 1.0.5

CRAN release: 2023-10-20

  • step_untokenize() and step_normalization() now returns factors instead of strings. (#247) # textrecipes 1.0.4

Improvements

  • step_clean_names() now throw an informative error if needed non-standard role columns are missing during bake(). (#235)

  • The keep_original_cols argument has been added to step_tokenmerge. This change should mean that every step that produces new columns has the keep_original_cols argument. (#242)

  • Many internal changes to improve consistency and slight speed increases.

Bug Fixes

textrecipes 1.0.3

CRAN release: 2023-04-14

Improvements

  • Steps with tunable arguments now have those arguments listed in the documentation.

  • All steps that add new columns will now informatively error if name collision occurs.

Bug Fixes

  • Fixed bug where step_tf() wasn’t tunable for weight argument.

textrecipes 1.0.2

CRAN release: 2022-12-21

textrecipes 1.0.1

CRAN release: 2022-10-06

  • Examples no longer include if (require(...)) code.

textrecipes 1.0.0

CRAN release: 2022-07-02

  • Indicate which steps support case weights (none), to align documentation with other packages.

textrecipes 0.5.2

CRAN release: 2022-05-04

  • Remove use of okc_text in vignette

  • Fix bug in printing of tokenlists

textrecipes 0.5.1

CRAN release: 2022-03-29

textrecipes 0.5.0

CRAN release: 2022-03-20

New steps

Improvements and Other Changes

Breaking Changes

  • Steps with prefix argument now creates names according to the pattern prefix_variablename_name/number. (#124)

textrecipes 0.4.1

CRAN release: 2021-07-11

Bug fixes

textrecipes 0.4.0

CRAN release: 2020-11-12

Breaking Changes

  • step_lda() now takes a tokenlist instead of a character variable. See readme for more detail.

New Features

textrecipes 0.3.0

CRAN release: 2020-07-08

  • stopwords package have been moved from Imports to Suggests.
  • step_ngram() gained an argument min_num_tokens to be able to return multiple n-grams together. (#90)
  • Adds step_text_normalization() to perform unicode normalization on character vectors. (#86)

textrecipes 0.2.3

CRAN release: 2020-05-22

textrecipes 0.2.2

CRAN release: 2020-05-10

  • step_word_embeddings() got a argument aggregation_default to specify value in cases where no words matches embedding.

textrecipes 0.2.1

CRAN release: 2020-05-04

textrecipes 0.2.0

CRAN release: 2020-04-14

textrecipes 0.1.0

CRAN release: 2020-03-05

textrecipes 0.0.2

CRAN release: 2019-09-07

  • Custom stemming function can now be used in step_stem using the custom_stemmer argument.
  • step_textfeatures() have been added, allows for multiple numerical features to be pulled from text.
  • step_sequence_onehot() have been added, allows for one hot encoding of sequences of fixed width.
  • step_word2vec() have been added, calculates word2vec dimensions.
  • step_tokenmerge() have been added, combines multiple list columns into one list-columns.
  • step_texthash() now correctly accepts signed argument.
  • Documentation have been improved to showcase the importance of filtering tokens before applying step_tf() and step_tfidf().

textrecipes 0.0.1

CRAN release: 2018-12-17

First CRAN version