Skip to content

CRAN v0.2.0

Compare
Choose a tag to compare
@juba juba released this 11 Feb 16:31
· 124 commits to main since this release

Important and breaking changes

  • min_uc_size, uc_size1 and uc_size2 arguments to rainette and rainette2 have been renamed to min_segment_size, min_segment_size1 and min_segment_size2.
  • The default value of min_segment_size in rainette is now 0, which means that no merging is done between segments by default. Results could then be different from previous package versions when min_uc_size was not specified.
  • Important bugfix : merging of segments based on min_segment_size was not handled correctly in the previous versions regarding the segment sources, as segments from different documents could be merged together. This should now be fixed.

New features

  • A new graphical interface to browse cluster documents has been added to rainette_explor and rainette2_explor.
  • New function clusters_by_doc_table which gives the number of segments of each cluster for each document.
  • New function docs_by_cluster_table which gives, for each cluster, the number of documents with at least one segment in this cluster.
  • split_segments should now be about 4 times faster.
  • Terms frequencies and documents proportions statistics have been added to the explor interfaces.

Other

  • When rainette is called with min_segment_size > 0, a doc_id argument must be given which is the name of a dtm docvar identifying the segments source. If the corpus has been produced by split_segments, the added segment_source docvar is used by default.
  • Color palette for clusters changed to "Tableau 10".
  • Negative keyness values are not shown by default anymore in rainette_explor, rainette2_explor, rainette_plot and rainette2_plot.
  • Wordcloud plots have been removed from explor interfaces.
  • A warning is displayed when min_split_members < 3.
  • If rainette_explor is called on a rainette2 results object, rainette2_explor is launched automatically.