(String/text processing)++: stringi 0.2-3 released by Marek Gągolewski.
From the post:
stringi is a package providing (but definitely not limiting to) replacements for nearly all the character string processing functions known from base R. While developing the package we had high performance and portability of its facilities in our minds.
Here is a very general list of the most important features available in the current version of
- string searching:
- with ICU (Java-like) regular expressions,
- ICU USearch-based locale-aware string searching (quite slow, but working properly e.g. for non-Unicode normalized strings),
- very fast, locale-independent byte-wise pattern matching;
- joining and duplicating strings;
- extracting and replacing substrings;
- string trimming, padding, and text wrapping (e.g. with Knuth's dynamic word wrap algorithm);
- text transliteration;
- text collation (comparing, sorting);
- text boundary analysis (e.g. for extracting individual words);
- random string generation;
- Unicode normalization;
- character encoding conversion and detection;
and many more.
Interesting isn’t it? How CS keeps circling around back to strings?