Releases: capitalone/DataProfiler
Releases · capitalone/DataProfiler
0.10.9
Profiler
Documentation
Full Changelog: 0.10.8...0.10.9
What's Changed
- Staging into
main
fromdev
by @taylorfturner in #1106 - Version.py update 0.10.9 by @taylorfturner in #1107
Full Changelog: 0.10.8...0.10.9
0.10.8
Profiler
- Staging/main/0.10.8 #1081
- Depedency: matplotlib version bump #1072
- Make _assimilate_histogram() not use self #1071
- Feature: added parquet sampling #1070
Documentation
- Update: Documentation 0.10.8 #1084
- Docs update to include option for sample_nrows for parquet files #1082
Miscellaneous
- Bump actions/setup-python from 4 to 5 #1078
Full Changelog: 0.10.7...0.10.8
What's Changed
- Staging/main/0.10.8 by @taylorfturner in #1081
Full Changelog: 0.10.7...0.10.8
0.10.7
Profiler
Documentation
- Update: Documentation 0.10.7 #1069
Full Changelog: 0.10.6...0.10.7
What's Changed
- Staging/main/0.10.7 by @taylorfturner in #1068
Full Changelog: 0.10.6...0.10.7
0.10.6
Profiler
- Staging/main/0.10.6 #1065
- Update: Version 0.10.6 #1064
- Feature: Plugins #1060
- Hot Fix: Contribution Doc #1059
- Rename references to degree of freedom from df to deg_of_free #1056
- add_s3_connection_remote_loading_s3uri_feature #1054
- feat: add null ratio to column stats #1052
- Delay transforming priority_order into ndarray #1045
- Fix Codeowners List #1043
Documentation
- Update: Documentation 0.10.6 #1066
- Docs: AWS S3 Data Reading #1063
- Update docs to reflect renamed output of deg_of_free #1057
Full Changelog: 0.10.5...0.10.6
What's Changed
- Fix Codeowners List by @taylorfturner in #1044
- Staging/main/0.10.6 by @taylorfturner in #1065
Full Changelog: 0.10.5...0.10.6
0.10.5
Profiler
Documentation
Full Changelog: 0.10.4...0.10.5
What's Changed
- Categorical PSI by @taylorfturner in #1040
Full Changelog: 0.10.4...0.10.5
0.10.4
Profiler
- version bump (#1032) #1036
- Staging/main/0.10.4 #1029
- added psi calculation to categorical columns #1027
- Bump actions/checkout from 3 to 4 #1024
- Minor: Profiler Path Fix in Example Notebook #1021
- modified the assignees for issue creation #1016
- Make sure random_state is a list before indexed assignment #968
Documentation
Full Changelog: 0.10.3...0.10.4
What's Changed
Full Changelog: 0.10.3...0.10.4
0.10.3
Profiler
Feature: Multiprocess
- Staging: into dev feature/multiprocess #998
- Multiprocess automation feature into staging/dev. #997
- Syncing feature/multiprocess into staging/dev/multiprocess #992
- Automate multiprocess option #984
Feature: num_quantiles
option
- Staging: into dev feature/num-quantiles #990
- Fix Scipy Mend Issue #988
- HistogramAndQuantilesOption sync with dev branch #987
Documentation
- Update docs to 0.10.3 #1012
- Update docs to 0.10.3 #1011
- fixed snappy install issue on Mac #1010
- Staging: into dev-gh-pages the docs for multiprocess. #1001
- Add docs to multiprocess option in StructuredOptions. #999
- Staging: into dev-gh-pages the docs for num_quantiles. #993
- Add docs for num_quantiles option for histogram_and_quantiles. #991
Full Changelog: 0.10.2...0.10.3
What's Changed
- Staging: main
0.10.3
by @taylorfturner in #1004
Full Changelog: 0.10.2...0.10.3
0.10.2
Profiler
- hotfix[0.10.2]: cat vs float bug #973
Documentation
Full Changelog: 0.10.1...0.10.2
What's Changed
Full Changelog: 0.10.1...0.10.2
0.10.1
Profiler
- Hot Fix: .astype("bool") #960
Documentation
Full Changelog: 0.10.0...0.10.1
What's Changed
- Hot Fix:
.astype("bool")
by @taylorfturner in #960
Full Changelog: 0.10.0...0.10.1
0.10.0
Profiler
- Forking workflow directions CONTRIBUTING.md #857
- Fixing diagram rendering in CONTRIBUTING.md #862
- Fix initial value of processor_type #863
- fix: test bug due to bad mocks #878
- added differences section to unstructured data example #877
- Reservoir sampling refactor #910
- feat: add dev to workfow for testing #897
- Cms for categorical #892
- Hotfix: fix post feature serialization merge #942
- Update version to 0.10.0 #944
- Staging/main/0.10.0 #943
Profiler: Profile Serialization
- Staging/dev/profile serialization #940
- fix: order bug #939
- fix: null_rep mat should calculate even if datetime #933
- Profiler: load_method hotfix #932
- Top level hotfix: save / load .lower() #931
- Notebook Example save/load Profile #930
- refactor: use seed for sample for consistency #927
- Profile Builder load() serialization #925
- Reuse passed labeler #924
- BaseProfiler save() for json #923
- Added testing for values for test_json_decode_after_update #915
- UnstructuredProfiler: Added NoImplementationError #907
- fix: bug and add tests for structuredcolprofiler #904
- Stuctured profiler encode decode #903
- refactor: allow options to go through all #902
- StructuredColProfiler Encode / Decode #901
- Decode options #894
- Quick Test update #893
- Deserialization of datalabeler #891
- ColumnDataLabelerCompiler: serialize / deserialize #888
- Add Serialization and Deserialization Tests for Stats Compiler, plus refactors for order Typing #887
- Adds deserialization for compilers and validates tests for Primitive; fixes numerical deserialization #886
- Adds tests validating serialization with Primitive type for compiler #885
- feat: add test and compiler serialization #884
- ready datalabeler for deserialization and improvement on serializatio… #879
- Encode Options #875
- Encode/Decode TextColumnProfiler #870
- Created encoder for the datalabelercolumn #869
- Added test to ensure order attribute for ordered column profiler functions correctly after deserialization #868
- Added decoding for encoding of ordered column profiles #864
- Json decode date time column #861
- Float column profiler encode decode #854
- hot fixes for encode and decode of numeric stats mixin and intcol pro… #852
Profiler: Options
- staging/dev/options #909
- RowStatisticsOptions: Implementing option #871
- New preset implementation and test #867
- RowStatisticsOptions: Add option #865
Documentation
- Staging update docs 0.10.0 #945
- Documentation: Fix Req #922
- Documentation: Update for Reservoir Sampling #919
- documentation update for cms specific options to category #917
- Add forking / branch workflow image #858
Documentation: Profile Serialization
- Merge staging/dev-gh-pages/profile-serialization into dev-gh-pages #937
- Docs: Profiler Serialization Clean Up #936
- Docs: Profiler Serialization #928
Documentation: Options
- Documentation: feature/options branch docs updates #921
- Row statistics option documentation #883
- updating docs for preset name #882
- Add documentation for median_abs_deviation option #881
- Preset test updated w new names and different toggles #880
- reset ignore, update .gitignore, update documentation on presets #874
- Fixed documentation for sampling_ratio option #873
Full Changelog: 0.9.0...0.10.0
What's Changed
- Sampling ratio implement by @joshuart in #845
- StructuredOptions:
hhl_row_hashing
by @micdavis in #841 - Forking workflow directions CONTRIBUTING.md by @taylorfturner in #857
- Fixing diagram rendering in
CONTRIBUTING.md
by @taylorfturner in #862 - StructuredProfiler: HLLRowHashing by @micdavis in #842
- added differences section to unstructured data example by @lizlouise1335 in #877
- fix: test bug due to bad mocks by @JGSweets in #878
- Fix initial value of processor_type by @junholee6a in #863
- Staging/main/0.10.0 by @taylorfturner in #943
Full Changelog: 0.9.0...0.10.0