Skip to content

v1.5.0 - 2023-10-13

Compare
Choose a tag to compare
@amontanez24 amontanez24 released this 13 Oct 17:30
· 213 commits to main since this release

Several improvements and bug fixes were made in this release. Most notably, the metadata detection was substantially improved. Support for the 'unknown' sdtype was added, providing more flexibility in data representation. The software now attempts to intelligently detect primary keys and identify parent-child relationships in the metadata, streamlining the metadata creation process.

Additionally, issues related to conditional sampling with negative float values, the inability to update transformers for columns created by constraints, and compatibility with numpy version 1.25 and higher were addressed. The default branch was also switched from 'master' to 'main' for better development practices. Various bugs and errors, including those involving HMA and datetime format detection, were also resolved.

New Features

  • Improve metadata detection - Issue #1515 by @R-Palazzo
  • Support 'unknown' sdtype - Issue #1516 by @R-Palazzo
  • Detect primary keys in metadata - Issue #1521 by @frances-h
  • Detect relationships in MultiTableMetadata - Issue #1522 by @frances-h
  • Make function to estimate number of columns HMA produces. - Issue #1572 by @fealho
  • Add wrapper for get_cardinalty_plot - Issue #1573 by @frances-h
  • [Metadata detection] Add a cardinality cap when choosing between categorical vs. numerical - Issue #1584 by @pvk-developer
  • [Metadata Detection] Only make primary/foreign keys sdtype id (leave others as unknown) - Issue #1598 by @amontanez24
  • Check and supply a more descriptive error when trying to use 'gaussian_kde' with HMA - Issue #1604 by @frances-h

Bugs Fixed

  • Conditional sampling with negative float values doesn't work - Issue #1161 by @fealho
  • Cannot update transformers for columns that get created by constraints (KeyError) - Issue #1454 by @frances-h
  • HMA produces KeyError for a schema with 3+ levels of depth - Issue #1558 by @fealho
  • Columns consisting of only Nones are being detected as datetime - Issue #1589 by @pvk-developer
  • HMASynthesizer throws an error when sampling multi table models with three levels of depths - Issue #1600 by @amontanez24
  • ValueError: Invalid distribution specification when setting numerical_distributions on child table (HMA) - Issue #1605 by @fealho
  • Bug: updating transformers in DataProcessor resets warning filters - Issue #1618 by @rwedge

Maintenance