Skip to content

v0.14.0 - 2022-03-21

Compare
Choose a tag to compare
@katxiao katxiao released this 21 Mar 15:38

This release updates the sampling API and splits the existing functionality into three methods - sample, sample_conditions,
and sample_remaining_columns. We also add support for sampling in batches, displaying a progress bar when sampling with more than one batch,
sampling deterministically, and writing the sampled results to an output file. Finally, we include fixes for sampling with conditions
and updates to the documentation.

Bugs Fixed

  • Fix write to file in sampling - Issue #732 by @katxiao
  • Conditional sampling doesn't work if the model has a CustomConstraint - Issue #696 by @katxiao

New Features

  • Updates to GaussianCopula conditional sampling methods - Issue #729 by @katxiao
  • Update conditional sampling errors - Issue #730 by @katxiao
  • Enable Batch Sampling + Progress Bar - Issue #693 by @katxiao
  • Create sample_remaining_columns() method - Issue #692 by @katxiao
  • Create sample_conditions() method - Issue #691 by @katxiao
  • Improve sample() method - Issue #690 by @katxiao
  • Create Condition object - Issue #689 by @katxiao
  • Is it possible to generate data with new set of primary keys? - Issue #686 by @katxiao
  • No way to fix the random seed? - Issue #157 by @katxiao
  • Can you set a random state for the sdv.tabular.ctgan.CTGAN.sample method? - Issue #515 by @katxiao
  • generating different synthetic data while training the model multiple times. - Issue #299 by @katxiao

Documentation Changes

  • Typo in the document documentation - Issue #680 by @katxiao